Skew-Reflected-Gompertz Information Quantifiers with Application to Sea Surface Temperature Records

Javier E. Contreras-Reyes; Mohsen Maleki; Daniel Devia Cortés

doi:10.3390/math7050403

¹ Departamento de Estadística, Facultad de Ciencias, Universidad del Bío-Bío, Concepción 4081112, Chile

² Department of Statistics, College of Sciences, Shiraz University, Shiraz 71946 85115, Iran

³ Departamento de Evaluación de Pesquerías, Instituto de Fomento Pesquero, Valparaíso 2361827, Chile

* Author to whom correspondence should be addressed.

Mathematics 2019, 7(5), 403; https://doi.org/10.3390/math7050403

This article belongs to the Special Issue Uncertainty Quantification Techniques in Statistics

Abstract

The Skew-Reflected-Gompertz (SRG) distribution, introduced by Hosseinzadeh et al. (J. Comput. Appl. Math. (2019) 349, 132–141), produces two-piece asymmetric behavior of the Gompertz (GZ) distribution and extends its support from the positive half-line to the whole real line by means of an extra parameter. The SRG distribution also permits a better fit than its well-known classical competitors, namely the skew-normal and epsilon-skew-normal distributions, for data with a high presence of skewness. In this paper, we study information quantifiers such as Shannon and Rényi entropies, and Kullback–Leibler divergence in terms of exact expressions of GZ information measures. We also derive an asymptotic test that is useful for comparing two SRG-distributed samples. Finally, as a real-world data example, we apply these results to South Pacific sea surface temperature records.

Keywords: Skew-Reflected-Gompertz distribution; Gompertz distribution; entropy; Kullback–Leibler divergence; sea surface temperature

1. Introduction

The Skew-Reflected-Gompertz (SRG) distribution was recently introduced by [1] and is an extension of the Gompertz distribution [2], named after Benjamin Gompertz (1779–1865). It extends the positive domain $\mathbb{R}_{+}$ to the whole of $\mathbb{R}$ by an extra parameter, $ε$, $-1 < ε < 1$, and produces two-piece asymmetric behavior of the Gompertz (GZ) density. The SRG distribution has as particular cases the Reflected-GZ and GZ distributions, when $ε \to 1$ and $ε \to -1$, respectively. The SRG distribution family can also represent a suitable competitor against the skew-normal (SN, [3]) and epsilon-skew-normal (ESN, [4]) distributions as a way to fit asymmetrical datasets. Indeed, refs. [5,6] dealt with frequentist and Bayesian inference for the ESN distribution. The authors of [1] provided the probability density function (pdf), cumulative distribution function (cdf), quantile function, moment-generating function (MGF), stochastic representation, the Expectation-Maximization (EM) algorithm for SRG parameter estimates, and the Fisher information matrix (FIM).

Moreover, several recent investigations confirmed the usefulness of entropic quantifiers in the study of asymmetric distributions [3,7,8] and their applications to topics such as thermal wake [9], marine fish biology [3,8], sea surface temperature (SST), relative humidity measured in the Atlantic Ocean [10], and more. We build on the study of [3], which developed hypothesis testing for normality, i.e., testing whether the shape parameter is close to zero. They considered the Kullback–Leibler (KL) divergence in terms of moments and cumulants of the modified SN distribution. Subsequently, they considered a real-world data set of the anchovy condition factor for testing the shape parameter to decide if a food deficit produced by environmental conditions such as El Niño exists [11].

This work arose from a motivation to tackle the problem of determining the adequate pdf of SST [9,10]. Indeed, probabilistic modelling of SST is key for accurate predictions [9]. Therefore, we propose that the SRG model based on two-piece distributions could be more suitable for interpreting annual bimodal and asymmetric SST data. We also use the existing results on Shannon and Rényi entropies and KL divergence for GZ distributions to develop entropic quantifiers for SRG distributions. Subsequently, we consider SST along the South Pacific and Chilean coasts from 2012 to 2014 to illustrate our results. Specifically, we introduce the hypothesis testing developed by [12] for the SRG distribution, which is useful to compare two data sets with bimodal and asymmetric behavior such as SST.

2. The Skew-Reflected-Gompertz Distribution

The Gompertz (GZ, [2]) distribution is a continuous probability distribution with the following pdf

\begin{matrix} f (x | σ, η) = \frac{η}{σ} e^{\frac{x}{σ}} e^{- η (e^{\frac{x}{σ}} - 1)}, x \geq 0, \end{matrix}

(1)

where $σ > 0$ and $η > 0$ are the scale and shape parameters, respectively; we denote this by $X \sim GZ(σ, η)$. The mean and variance of $X$ are

\begin{matrix} E (X) & = & σ e^{η} E i (- η), \\ V a r (X) & = & σ^{2} e^{η} τ, \end{matrix}

(2)

respectively, where $Ei(z) = \int_{-z}^{\infty} \frac{e^{-u}}{u} du$, $τ = -2 η F(-η) + γ^{2} + \frac{π^{2}}{6} + 2 γ \log η + (\log η)^{2} - e^{η} [Ei(-η)]^{2}$, $γ = 0.5772156649$ is the Euler constant and $F(z) = \sum_{k = 0}^{+\infty} \frac{z^{k}}{k! (k + 1)^{3}}$.
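As a quick numerical sanity check of (1) and the mean in (2), the following sketch evaluates the GZ density and compares the closed-form mean with a quadrature value. The helper names (`gz_pdf`, `gz_upper`, `simpson`, `ei_neg`), the parameter values and the truncation points are illustrative assumptions and are not taken from the original work; `ei_neg(eta)` evaluates $Ei(-η)$ exactly as defined above, i.e., $\int_{η}^{\infty} e^{-u}/u \, du$.

```python
import math


def gz_pdf(x, sigma, eta):
    """Gompertz density of Equation (1); zero outside x >= 0."""
    if x < 0:
        return 0.0
    u = math.exp(x / sigma)
    return (eta / sigma) * u * math.exp(-eta * (u - 1.0))


def simpson(func, a, b, n=4000):
    """Composite Simpson rule on [a, b]; n must be even."""
    h = (b - a) / n
    total = func(a) + func(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * func(a + i * h)
    return total * h / 3.0


def gz_upper(sigma, eta, tail=1e-16):
    """Point where the GZ survival function exp(-eta (e^{x/sigma} - 1)) equals `tail`."""
    return sigma * math.log(1.0 - math.log(tail) / eta)


def ei_neg(eta, width=50.0, n=20000):
    """Ei(-eta) in the convention of the text: integral of e^{-u}/u over [eta, infinity).

    Assumption: the integral is truncated at eta + width, where the tail is negligible.
    """
    return simpson(lambda u: math.exp(-u) / u, eta, eta + width, n)


# Illustrative parameter values (not estimates from the paper).
sigma, eta = 2.0, 1.0
b = gz_upper(sigma, eta)
total = simpson(lambda x: gz_pdf(x, sigma, eta), 0.0, b)
mean_quad = simpson(lambda x: x * gz_pdf(x, sigma, eta), 0.0, b)
mean_closed = sigma * math.exp(eta) * ei_neg(eta)  # E(X) in Equation (2)

print(total, mean_quad, mean_closed)
```

The density integrates to one up to quadrature error, and the two values of the mean agree, which also confirms that $Ei(-η)$ is positive under the definition used here.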

The SRG distribution is an extension of the GZ distribution proposed by [1]. If $Y$ follows the SRG distribution, we write $Y \sim SRG(μ, σ, η, ε)$, and $Y$ has pdf

\begin{matrix} g (y | μ, σ, η, ε) = \{\begin{matrix} \frac{1}{2} f (\frac{μ - y}{1 + ε} | σ, η), & y \leq μ, \\ \frac{1}{2} f (\frac{y - μ}{1 - ε} | σ, η), & y > μ, \end{matrix} \end{matrix}

(3)

where $μ \in \mathbb{R}$ is the location parameter and $ε \in (-1, 1)$ is the slant parameter. Note that the SRG distribution reduces to the GZ distribution when $μ = 0$ and $ε \to -1$, to the GZ distribution with negative support when $ε \to 1$, and to the Reflected-GZ distribution when $ε = 0$. Also, the Reflected-GZ distribution corresponds to a particular case of a more general class of two-piece asymmetric distributions proposed by [13,14]. The mean, variance and MGF of $Y$ are

\begin{matrix} E (Y) & = & μ - 2 ε σ e^{η} E i (- η), \\ V a r (Y) & = & σ^{2} {τ e^{η} + 2 (1 - ε^{2}) e^{2 η} {[E i (- η)]}^{2}}, \\ M_{Y} (t) & = & \frac{1}{2} η e^{η + μ t} [(1 - ε) ϝ_{- σ t} (η) + (1 + ε) ϝ_{σ t} (η)], \end{matrix}

(4)

respectively, where $ϝ_{s}(z) = \int_{1}^{\infty} v^{s + 1} e^{-v z} dv$. Jafari et al. [15] provide the MGF of $X$ using expansion series. However, (4) is considered a clearer expression that depends only on the integral $ϝ_{s}(z)$. See Section 4.1 for some details of the MLE EM-based algorithm related to SRG parameters.
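The two-piece density (3) and the mean in (4) can be checked numerically with a short sketch. The function names, the parameter values and the quadrature settings below are illustrative assumptions; the code only relies on (1), (3) and the definition of $Ei$ given above.

```python
import math


def gz_pdf(x, sigma, eta):
    """Gompertz density of Equation (1); zero outside x >= 0."""
    if x < 0:
        return 0.0
    u = math.exp(x / sigma)
    return (eta / sigma) * u * math.exp(-eta * (u - 1.0))


def srg_pdf(y, mu, sigma, eta, eps):
    """SRG density of Equation (3)."""
    if y <= mu:
        return 0.5 * gz_pdf((mu - y) / (1.0 + eps), sigma, eta)
    return 0.5 * gz_pdf((y - mu) / (1.0 - eps), sigma, eta)


def simpson(func, a, b, n=4000):
    """Composite Simpson rule on [a, b]; n must be even."""
    h = (b - a) / n
    total = func(a) + func(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * func(a + i * h)
    return total * h / 3.0


def gz_upper(sigma, eta, tail=1e-16):
    """Point where the GZ survival function drops to `tail`."""
    return sigma * math.log(1.0 - math.log(tail) / eta)


def ei_neg(eta, width=50.0, n=20000):
    """Ei(-eta) as defined in the text, truncated at eta + width (assumption)."""
    return simpson(lambda u: math.exp(-u) / u, eta, eta + width, n)


def srg_summary(mu, sigma, eta, eps):
    """Return (total mass, mass on y <= mu, mean) of the SRG density by quadrature."""
    lo = mu - gz_upper(sigma * (1.0 + eps), eta)  # effective lower end of the support
    hi = mu + gz_upper(sigma * (1.0 - eps), eta)  # effective upper end of the support
    g = lambda y: srg_pdf(y, mu, sigma, eta, eps)
    left = simpson(g, lo, mu)
    right = simpson(g, mu, hi)
    mean = simpson(lambda y: y * g(y), lo, mu) + simpson(lambda y: y * g(y), mu, hi)
    return left + right, left, mean


def srg_mean(mu, sigma, eta, eps):
    """Closed-form mean E(Y) from Equation (4)."""
    return mu - 2.0 * eps * sigma * math.exp(eta) * ei_neg(eta)


# Illustrative parameter values (not estimates from the paper).
total, left, mean = srg_summary(15.0, 1.0, 0.5, -0.4)
print(total, left, mean, srg_mean(15.0, 1.0, 0.5, -0.4))
```

With these values the mass to the left of $μ$ is $(1 + ε)/2 = 0.3$, so a negative $ε$ places most of the probability to the right of $μ$.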

According to [1], the SRG distribution can be re-parametrized in terms of GZ and Reflected-GZ distributions as

\begin{matrix} g (y | μ, σ_{+}, σ_{-}, η) = p_{1} f (μ - y | σ_{+}, η) I_{(- \infty, μ]} (y) + p_{2} f (y - μ | σ_{-}, η) I_{(μ, + \infty)} (y), \end{matrix}

(5)

where $σ_{\pm} = σ (1 \pm ε)$, $p_{1} + p_{2} = 1$, and $p_{1} = σ_{+} / (σ_{+} + σ_{-}) = (1 + ε) / 2$. Let $Y = (Y_{1}, \dots, Y_{n})^{⊤}$ be an i.i.d. sample from the SRG distribution with parameters $(μ, σ_{\pm}, η)$ and latent vectors $Z = (Z_{1}, \dots, Z_{n})$. Then (5) can be equivalently represented as $(-1)^{j} (Y_{i} - μ) | Z_{i j} = 1 \sim GZ(σ_{\pm}, η)$, $i = 1, \dots, n$, $j = 1, 2$, where $Z_{i} = (Z_{i 1}, Z_{i 2})^{⊤} \sim Mult(1, p_{1}, p_{2})$ is a multinomial vector, $P(Z_{i 1} = z_{i 1}, Z_{i 2} = z_{i 2}) = p_{1}^{z_{i 1}} p_{2}^{z_{i 2}}$, $z_{i j} \in \{0, 1\}$, and $z_{i 1} + z_{i 2} = 1$. Given that $P(Z_{i j} = 1) = P(Z_{i j} = 1, Z_{i k} = 0; \forall k \neq j)$, the complete log-likelihood function is

\begin{matrix} ℓ (μ, σ_{+}, σ_{-}, η | Y, Z) & = & - n log (2 σ) + n (η + log η) \\ + \sum_{i = 1}^{n} [z_{i 1} (\frac{μ - y_{i}}{σ_{+}} - η e^{\frac{μ - y_{i}}{σ_{+}}}) + z_{i 2} (\frac{y_{i} - μ}{σ_{-}} - η e^{\frac{y_{i} - μ}{σ_{-}}})] . \end{matrix}

(6)
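The latent-variable representation behind (5) also suggests a simple way to simulate SRG data: draw the side of $μ$ with probabilities $p_{1}$ and $p_{2}$, then draw a GZ variate with scale $σ_{+}$ or $σ_{-}$. The sketch below assumes inverse-transform sampling from the GZ cdf $1 - e^{-η (e^{x/σ} - 1)}$, obtained by integrating (1); the function names, seed and parameter values are illustrative and not part of the original work.

```python
import math
import random


def gz_cdf(x, sigma, eta):
    """Gompertz cdf obtained by integrating Equation (1)."""
    if x <= 0:
        return 0.0
    return 1.0 - math.exp(-eta * (math.exp(x / sigma) - 1.0))


def gz_quantile(u, sigma, eta):
    """Inverse of gz_cdf for u in [0, 1)."""
    return sigma * math.log(1.0 - math.log(1.0 - u) / eta)


def srg_sample(n, mu, sigma, eta, eps, rng):
    """Draw n SRG variates through the mixture representation (5)."""
    sigma_plus = sigma * (1.0 + eps)
    sigma_minus = sigma * (1.0 - eps)
    p1 = sigma_plus / (sigma_plus + sigma_minus)  # equals (1 + eps) / 2
    draws = []
    for _ in range(n):
        u = rng.random()
        if rng.random() < p1:
            # Z_i1 = 1: mu - Y_i follows GZ(sigma_plus, eta)
            draws.append(mu - gz_quantile(u, sigma_plus, eta))
        else:
            # Z_i2 = 1: Y_i - mu follows GZ(sigma_minus, eta)
            draws.append(mu + gz_quantile(u, sigma_minus, eta))
    return draws


# Illustrative parameter values and seed (not taken from the paper).
rng = random.Random(2019)
sample = srg_sample(20000, 15.0, 1.0, 0.5, -0.4, rng)
frac_left = sum(y <= 15.0 for y in sample) / len(sample)
print(frac_left)  # should be close to p1 = (1 + eps) / 2 = 0.3
```

Such simulated samples are convenient for testing an implementation of the EM algorithm before applying it to real data.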

Conditional expectations of the latent variables $Z_{i}$ are given by

\begin{matrix} {\hat{z}}_{i 1} & = & E [Z_{i 1} | \hat{μ}, {\hat{σ}}_{+}, {\hat{σ}}_{-}, y_{i}] = {\hat{p}}_{1} \frac{f (\hat{μ} - y_{i} | {\hat{σ}}_{+}, \hat{η})}{g (y_{i} | \hat{μ}, {\hat{σ}}_{+}, {\hat{σ}}_{-}, \hat{η})} I_{(- \infty, \hat{μ}]} (y_{i}), \end{matrix}

(7)

\begin{matrix} {\hat{z}}_{i 2} & = & 1 - {\hat{z}}_{i 1}, i = 1, \dots, n . \end{matrix}

(8)
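Because the two components in (5) have disjoint supports, the ratio in (7) equals one whenever $y_{i} \leq \hat{μ}$, so the weights (7) and (8) reduce to indicators of the side of $\hat{μ}$ on which each observation falls. The following sketch, with hypothetical data and illustrative function names, uses this fact to check that the complete log-likelihood (6) evaluated at these weights coincides with the observed log-likelihood computed from the density (3).

```python
import math


def gz_logpdf(x, sigma, eta):
    """Logarithm of the Gompertz density (1) for x >= 0."""
    return math.log(eta / sigma) + x / sigma - eta * (math.exp(x / sigma) - 1.0)


def srg_logpdf(y, mu, sigma, eta, eps):
    """Logarithm of the SRG density (3)."""
    if y <= mu:
        return math.log(0.5) + gz_logpdf((mu - y) / (1.0 + eps), sigma, eta)
    return math.log(0.5) + gz_logpdf((y - mu) / (1.0 - eps), sigma, eta)


def e_step(y, mu):
    """Weights (7)-(8); with disjoint pieces they are indicators of y_i <= mu."""
    z1 = [1.0 if yi <= mu else 0.0 for yi in y]
    z2 = [1.0 - v for v in z1]
    return z1, z2


def complete_loglik(y, z1, z2, mu, sigma, eta, eps):
    """Complete log-likelihood of Equation (6)."""
    sigma_plus = sigma * (1.0 + eps)
    sigma_minus = sigma * (1.0 - eps)
    n = len(y)
    value = -n * math.log(2.0 * sigma) + n * (eta + math.log(eta))
    for yi, a, b in zip(y, z1, z2):
        left = (mu - yi) / sigma_plus - eta * math.exp((mu - yi) / sigma_plus)
        right = (yi - mu) / sigma_minus - eta * math.exp((yi - mu) / sigma_minus)
        value += a * left + b * right
    return value


# Hypothetical data and parameter values, for illustration only.
y = [13.2, 14.1, 14.9, 15.3, 16.0, 17.4, 18.2]
mu, sigma, eta, eps = 15.0, 1.2, 0.5, -0.3
z1, z2 = e_step(y, mu)
complete = complete_loglik(y, z1, z2, mu, sigma, eta, eps)
observed = sum(srg_logpdf(yi, mu, sigma, eta, eps) for yi in y)
print(complete, observed)
```

The agreement of the two values reflects the identity $p_{1} / σ_{+} = p_{2} / σ_{-} = 1 / (2 σ)$, which is where the term $-n \log(2 σ)$ in (6) comes from.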

The E- and M-steps on the $(k + 1)$th iteration of the EM algorithm are as follows.

E-step. From (6)–(8), we have

$\begin{matrix} Q (μ, σ_{+}, σ_{-}, η | μ^{(k)}, σ_{+}^{(k)}, σ_{-}^{(k)}, η^{(k)}) & = & E [ℓ (μ, σ_{+}, σ_{-}, η | Y, Z) | μ^{(k)}, σ_{+}^{(k)}, σ_{-}^{(k)}, η^{(k)}] \\ & = & - n log (2 σ) + n (η + log η) \\ & & + \sum_{i = 1}^{n} [{\hat{z}}_{i 1}^{(k)} (\frac{μ - y_{i}}{σ_{+}} - η e^{\frac{μ - y_{i}}{σ_{+}}}) + {\hat{z}}_{i 2}^{(k)} (\frac{y_{i} - μ}{σ_{-}} - η e^{\frac{y_{i} - μ}{σ_{-}}})] . \end{matrix}$

M-step. Update $σ_{\pm}$ by solving the following equation

$\sum_{i = 1}^{n} {\hat{z}}_{i j}^{(k)} (η^{(k)} \frac{| y_{i} - μ^{(k)} |}{σ_{\pm}^{2}} e^{\frac{| y_{i} - μ^{(k)} |}{σ_{\pm}}} - \frac{| y_{i} - μ^{(k)} |}{σ_{\pm}^{2}}) = \frac{n}{2 σ} .$

Update $μ$ by

{\hat{μ}}^{(k + 1)} = {argmax}_{μ} \sum_{i = 1}^{n} \{{\hat{z}}_{i 1}^{(k)} (\frac{μ - y_{i}}{{\hat{σ}}_{+}^{(k + 1)}} - η e^{\frac{μ - y_{i}}{{\hat{σ}}_{+}^{(k + 1)}}}) + {\hat{z}}_{i 2}^{(k)} (\frac{y_{i} - μ}{{\hat{σ}}_{-}^{(k + 1)}} - η e^{\frac{y_{i} - μ}{{\hat{σ}}_{-}^{(k + 1)}}})\} .

Update $η$ by

\hat{η} = n {(\sum_{i = 1}^{n} \{{\hat{z}}_{i 1}^{(k)} e^{\frac{μ - y_{i}}{{\hat{σ}}_{+}^{(k + 1)}}} + {\hat{z}}_{i 2}^{(k)} e^{\frac{y_{i} - μ}{{\hat{σ}}_{-}^{(k + 1)}}}\})}^{- 1} .

The EM algorithm must be iterated until the following convergence rule is satisfied:

∥ ({\hat{μ}}^{(k + 1)}, {\hat{σ}}_{+}^{(k + 1)}, {\hat{σ}}_{-}^{(k + 1)}, {\hat{η}}^{(k + 1)}) - ({\hat{μ}}^{(k)}, {\hat{σ}}_{+}^{(k)}, {\hat{σ}}_{-}^{(k)}, {\hat{η}}^{(k)}) ∥ < τ,

for a tolerance $τ$ close to zero. The FIM for standard deviations of the MLEs $(\hat{μ}, \hat{σ}, \hat{η}, \hat{ε})$ and additional details of the EM algorithm are described in [1].

3. Entropic Quantifiers

In this section, we present the main results on entropic quantifiers for the SRG distribution.

3.1. Shannon Entropy

The Shannon entropy (SE), introduced by [16] in the context of univariate continuous distributions, quantifies the information contained in a random variable $X$ with pdf $f(x)$ through the expression

\begin{matrix} H (X) = - \int_{- \infty}^{+ \infty} f (x) log f (x) d x . \end{matrix}

(9)

The SE concept is attributed to the uncertainty of the information presented in X [17]. Propositions 1 and 2 present the SE for GZ and SRG distributions, respectively.

Proposition 1 ([15]). The SE of $X \sim GZ(σ, η)$ is

H (X) = log \{\frac{B (1, 1)}{η}\} - σ η - \frac{E (X)}{σ} + σ η M_{X} (σ^{- 1}),

where $B(\cdot, \cdot)$ is the usual Beta function and $E(X)$ is given in (2).

Substituting $μ = 0$ and $ε = -1$ into (4) (i.e., reducing SRG to its special case GZ), we obtain $M_{X}(σ^{-1}) = η e^{η} ϝ_{-1}(η) = 1$. Therefore, $H(X)$ in Proposition 1 reduces to

\begin{matrix} H (X) = - log η - e^{η} E_{i} (- η), \end{matrix}

(10)

i.e., the SE of the GZ random variable only depends on the shape parameter $η$.

Proposition 2. The SE of $Y \sim SRG(μ, σ, η, ε)$ is

H (Y) = \frac{1 + ε}{2} \{H (X_{+ ε}) - log (\frac{1 + ε}{2})\} + \frac{1 - ε}{2} \{H (X_{- ε}) - log (\frac{1 - ε}{2})\},

where $X_{\pm ε} \sim GZ(σ (1 \pm ε), η)$ and $H(X_{\pm ε})$ are obtained using Proposition 1.

Proof. From (3) and (9), we obtain

\begin{matrix} H (Y) & = & - \int_{- \infty}^{+ \infty} g (y | μ, σ, η, ε) log g (y | μ, σ, η, ε) d y \\ = & - \frac{1}{2} \int_{0}^{+ \infty} f (\frac{x}{1 + ε} | σ, η) log \{\frac{1}{2} f (\frac{x}{1 + ε} | σ, η)\} d x \\ - \frac{1}{2} \int_{0}^{+ \infty} f (\frac{x}{1 - ε} | σ, η) log \{\frac{1}{2} f (\frac{x}{1 - ε} | σ, η)\} d x \\ = & - \frac{1}{2} \int_{0}^{+ \infty} (1 + ε) f (x | σ (1 + ε), η) log \{\frac{1 + ε}{2} f (x | σ (1 + ε), η)\} d x \\ - \frac{1}{2} \int_{0}^{+ \infty} (1 - ε) f (x | σ (1 - ε), η) log \{\frac{1 - ε}{2} f (x | σ (1 - ε), η)\} d x, \end{matrix}

which concludes the proof. □
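The decomposition in Proposition 2 can be checked numerically. In the sketch below, the entropies $H(X_{\pm ε})$ are obtained by quadrature of (9) rather than from Proposition 1, and the result is compared with a direct integration of $-g \log g$ over both sides of $μ$ (the location does not affect the entropy, so it is omitted). Function names, parameter values and quadrature settings are illustrative assumptions.

```python
import math


def gz_logpdf(x, sigma, eta):
    """Logarithm of the Gompertz density (1) for x >= 0."""
    return math.log(eta / sigma) + x / sigma - eta * (math.exp(x / sigma) - 1.0)


def simpson(func, a, b, n=4000):
    """Composite Simpson rule on [a, b]; n must be even."""
    h = (b - a) / n
    total = func(a) + func(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * func(a + i * h)
    return total * h / 3.0


def gz_upper(sigma, eta, tail=1e-16):
    """Point where the GZ survival function drops to `tail`."""
    return sigma * math.log(1.0 - math.log(tail) / eta)


def gz_entropy(sigma, eta):
    """Shannon entropy (9) of GZ(sigma, eta), computed by quadrature."""
    def integrand(x):
        lf = gz_logpdf(x, sigma, eta)
        return -math.exp(lf) * lf
    return simpson(integrand, 0.0, gz_upper(sigma, eta))


def srg_entropy_direct(sigma, eta, eps):
    """Shannon entropy of SRG from the density (3), one side of mu at a time."""
    total = 0.0
    for scale in (1.0 + eps, 1.0 - eps):
        def integrand(x, c=scale):
            lg = math.log(0.5) + gz_logpdf(x / c, sigma, eta)
            return -math.exp(lg) * lg
        total += simpson(integrand, 0.0, gz_upper(sigma * scale, eta))
    return total


def srg_entropy_prop2(sigma, eta, eps):
    """Right-hand side of Proposition 2 with numerically computed H(X_{+-eps})."""
    total = 0.0
    for sign in (1.0, -1.0):
        p = (1.0 + sign * eps) / 2.0
        total += p * (gz_entropy(sigma * (1.0 + sign * eps), eta) - math.log(p))
    return total


# Illustrative parameter values (not estimates from the paper).
direct = srg_entropy_direct(1.0, 0.5, -0.4)
decomposed = srg_entropy_prop2(1.0, 0.5, -0.4)
print(direct, decomposed)
```

The two numbers agree up to quadrature error, and swapping the sign of $ε$ leaves the entropy unchanged, since it only exchanges the two pieces.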

From (10), given that $H(X_{\pm ε})$ only depends on the shape parameter $η$, we obtain $H(X_{\pm ε}) = H(X)$, and $H(Y)$ only depends on the parameters $η$ and $ε$. Therefore,

\begin{matrix} H (Y) = - log η - e^{η} E_{i} (- η) - \frac{1 + ε}{2} log (\frac{1 + ε}{2}) - \frac{1 - ε}{2} log (\frac{1 - ε}{2}) . \end{matrix}

(11)

Figure 1 illustrates SE behavior for random variable $Y$. We observe that SE increases when $η$ decreases. For each $η$, SE is maximized and minimized at $ε = 0$ (Reflected-GZ) and $ε \to -1$ (Truncated-GZ and GZ), respectively. More details appear in [3,8] for the SE expressions of other asymmetric distributions.

Figure 1. Shannon entropy of Skew-Reflected-Gompertz (SRG) distributions for $ε \in (-1, 1)$ and several values of $η$.

3.2. Rényi Entropy

The $α$th-order Rényi entropy (RE), introduced by [18] in the context of univariate continuous distributions, extends the concept of SE information contained in a random variable $X$ with pdf $f(x)$ through a level $α$, $α \in N$, $α > 0$, and the expression

\begin{matrix} R_{α} (X) = \frac{1}{1 - α} log \int_{- \infty}^{+ \infty} {[f (x)]}^{α} d x . \end{matrix}

(12)

RE information can be negative and is ordered with respect to $α$, i.e., $R_{α_{1}}(X) \geq R_{α_{2}}(X)$ for any $α_{1} < α_{2}$ (see, e.g., [7] for this and other properties of RE). From (12), the SE is obtained as the limit $H(X) = \lim_{α \to 1} R_{α}(X)$ by applying l’Hôpital’s rule to $R_{α}(X)$ with respect to $α$ (see, e.g., [7]). The RE of the GZ and SRG distributions is presented in Propositions 3 and 4, respectively.

Proposition 3 ([15,19]). The RE of $X \sim GZ(σ, η)$ with $α > 1$, $α \in N$, is

R_{α} (X) = - \frac{log α}{1 - α} + log \frac{η}{σ} + \frac{1}{1 - α} log \{\sum_{j = 0}^{α - 1} (\binom{α - 1}{j}) \frac{Γ (j + 1)}{{(α η)}^{j}}\},

where $Γ(u) = \int_{0}^{\infty} t^{u - 1} e^{-t} dt$ is the gamma function.

Proposition 4. The RE of $Y \sim SRG(μ, σ, η, ε)$ with $α > 1$, $α \in N$, is

R_{α} (Y) = \frac{1}{1 - α} log \{{(\frac{1 + ε}{2})}^{α} e^{(1 - α) R_{α} (X_{+ ε})} + {(\frac{1 - ε}{2})}^{α} e^{(1 - α) R_{α} (X_{- ε})}\},

where $X_{\pm ε} \sim GZ(σ (1 \pm ε), η)$ and $R_{α}(X_{\pm ε})$ are obtained using Proposition 3.

Proof. From (3) and (12), we obtain

\begin{matrix} R_{α} (Y) & = & \frac{1}{1 - α} log \int_{- \infty}^{+ \infty} {[g (y | μ, σ, η, ε)]}^{α} d y, \\ = & \frac{1}{1 - α} log \{\int_{0}^{+ \infty} {[\frac{1}{2} f (\frac{x}{1 + ε} | σ, η)]}^{α} d x + \int_{0}^{+ \infty} {[\frac{1}{2} f (\frac{x}{1 - ε} | σ, η)]}^{α} d x\}, \\ = & \frac{1}{1 - α} log \{{(\frac{1 + ε}{2})}^{α} \int_{0}^{+ \infty} {[f (x | σ (1 + ε), η)]}^{α} d x + {(\frac{1 - ε}{2})}^{α} \int_{0}^{+ \infty} {[f (x | σ (1 - ε), η)]}^{α} d x\}, \end{matrix}

which concludes the proof. □
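Proposition 4 can be verified numerically in the same spirit. The sketch below computes $R_{α}(X_{\pm ε})$ by quadrature of (12) rather than from Proposition 3, combines them as in Proposition 4, and compares the result with a direct integration of $g^{α}$. All names, parameter values and quadrature settings are illustrative assumptions.

```python
import math


def gz_pdf(x, sigma, eta):
    """Gompertz density of Equation (1); zero outside x >= 0."""
    if x < 0:
        return 0.0
    u = math.exp(x / sigma)
    return (eta / sigma) * u * math.exp(-eta * (u - 1.0))


def simpson(func, a, b, n=4000):
    """Composite Simpson rule on [a, b]; n must be even."""
    h = (b - a) / n
    total = func(a) + func(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * func(a + i * h)
    return total * h / 3.0


def gz_upper(sigma, eta, tail=1e-16):
    """Point where the GZ survival function drops to `tail`."""
    return sigma * math.log(1.0 - math.log(tail) / eta)


def gz_renyi(alpha, sigma, eta):
    """Renyi entropy (12) of GZ(sigma, eta), computed by quadrature."""
    integral = simpson(lambda x: gz_pdf(x, sigma, eta) ** alpha,
                       0.0, gz_upper(sigma, eta))
    return math.log(integral) / (1.0 - alpha)


def srg_renyi_direct(alpha, sigma, eta, eps):
    """Renyi entropy of SRG from the density (3), one side of mu at a time."""
    integral = 0.0
    for scale in (1.0 + eps, 1.0 - eps):
        integral += simpson(
            lambda x, c=scale: (0.5 * gz_pdf(x / c, sigma, eta)) ** alpha,
            0.0, gz_upper(sigma * scale, eta))
    return math.log(integral) / (1.0 - alpha)


def srg_renyi_prop4(alpha, sigma, eta, eps):
    """Right-hand side of Proposition 4 with numerically computed R_alpha(X_{+-eps})."""
    acc = 0.0
    for sign in (1.0, -1.0):
        p = (1.0 + sign * eps) / 2.0
        r = gz_renyi(alpha, sigma * (1.0 + sign * eps), eta)
        acc += p ** alpha * math.exp((1.0 - alpha) * r)
    return math.log(acc) / (1.0 - alpha)


# Illustrative parameter values (not estimates from the paper).
for alpha in (2, 5):
    print(alpha,
          srg_renyi_direct(alpha, 1.0, 0.5, -0.4),
          srg_renyi_prop4(alpha, 1.0, 0.5, -0.4))
```

For each $α$ the two columns agree, and the value for $α = 5$ does not exceed the value for $α = 2$, in line with the ordering of RE in $α$ stated above.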

Figure 2a illustrates the behavior of RE for random variable $Y$ when $α = 2$ (quadratic RE). As in the SE case, we also observe that RE increases when $η$ decreases and reaches its maximum and minimum at $ε = 0$ (Reflected-GZ) and $ε \to -1$ (Truncated-GZ and GZ), respectively. When $α = 5$ (or $α > 2$) (see Figure 2b), RE decays faster than in the quadratic RE case as $ε \to -1$. More details appear in [7] for the RE expressions of other asymmetric distributions.

Figure 2. Rényi entropy of SRG distributions for $σ = 1$, $-1 < ε < 1$, several values of $η$, and (a) $α = 2$ and (b) $α = 5$.

3.3. Kullback–Leibler Divergence

The Kullback–Leibler (KL) divergence, introduced by [20] in the context of univariate continuous distributions, extends the concept of SE to two random variables $X_{1}$ and $X_{2}$ with pdfs $f_{1}(x_{1})$ and $f_{2}(x_{2})$, respectively, through the expression

\begin{matrix} K (X_{1}, X_{2}) = \int_{- \infty}^{+ \infty} f_{1} (x) log \{\frac{f_{1} (x)}{f_{2} (x)}\} d x . \end{matrix}

(13)

The KL divergence measures the disparity between the pdfs of $X_{1}$ and $X_{2}$, and is non-negative, non-symmetric and zero only if $X_{1} = X_{2}$ in distribution. Also, the KL divergence does not satisfy the triangle inequality (see, e.g., [8,17] for other properties of KL and other divergences). The KL divergences for two GZ and two SRG distributions are presented in Propositions 5 and 6, respectively.

Proposition 5 ([21]). The KL divergence between $X_{1} \sim GZ(σ_{1}, η_{1})$ and $X_{2} \sim GZ(σ_{2}, η_{2})$ is

K (X_{1}, X_{2}) = log \{\frac{e^{η_{1}} σ_{2} η_{1}}{e^{η_{2}} σ_{1} η_{2}}\} + e^{η_{1}} [(\frac{σ_{1}}{σ_{2}} - 1) E_{i} (- η_{1}) + \frac{η_{2}}{η_{1}^{σ_{1} / σ_{2}}} Γ (\frac{σ_{1}}{σ_{2}} + 1, η_{1})] - (η_{1} + 1),

where $Γ(u, v) = \int_{v}^{\infty} t^{u - 1} e^{-t} dt$ is the upper incomplete gamma function.

Proposition 6. The KL divergence between $Y_{1} \sim SRG(0, σ_{1}, η_{1}, ε_{1})$ and $Y_{2} \sim SRG(0, σ_{2}, η_{2}, ε_{2})$ is

\begin{matrix} K (Y_{1}, Y_{2}) & = & \frac{1 + ε_{1}}{2} [log \{\frac{1 + ε_{1}}{1 + ε_{2}}\} + K (X_{+ ε_{1}}, X_{+ ε_{2}})] + \frac{1 - ε_{1}}{2} [log \{\frac{1 - ε_{1}}{1 - ε_{2}}\} + K (X_{- ε_{1}}, X_{- ε_{2}})], \end{matrix}

where $X_{\pm ε_{i}} \sim GZ(σ_{i} (1 \pm ε_{i}), η_{i})$, $i = 1, 2$, and $K(X_{\pm ε_{1}}, X_{\pm ε_{2}})$ are obtained using Proposition 5.

Proof. From (3) and (13), we obtain

\begin{matrix} K (Y_{1}, Y_{2}) & = & \int_{- \infty}^{+ \infty} g (x | 0, σ_{1}, η_{1}, ε_{1}) log \{\frac{g (x | 0, σ_{1}, η_{1}, ε_{1})}{g (x | 0, σ_{2}, η_{2}, ε_{2})}\} d x, \\ = & \frac{1}{2} \int_{0}^{+ \infty} f (\frac{x}{1 + ε_{1}} | σ_{1}, η_{1}) log \{\frac{f (\frac{x}{1 + ε_{1}} | σ_{1}, η_{1})}{f (\frac{x}{1 + ε_{2}} | σ_{2}, η_{2})}\} d x \\ + \frac{1}{2} \int_{0}^{+ \infty} f (\frac{x}{1 - ε_{1}} | σ_{1}, η_{1}) log \{\frac{f (\frac{x}{1 - ε_{1}} | σ_{1}, η_{1})}{f (\frac{x}{1 - ε_{2}} | σ_{2}, η_{2})}\} d x, \\ = & \frac{1 + ε_{1}}{2} [log \{\frac{1 + ε_{1}}{1 + ε_{2}}\} + \int_{0}^{+ \infty} f (x | σ_{1} (1 + ε_{1}), η_{1}) log \{\frac{f (x | σ_{1} (1 + ε_{1}), η_{1})}{f (x | σ_{2} (1 + ε_{2}), η_{2})}\} d x] \\ + \frac{1 - ε_{1}}{2} [log \{\frac{1 - ε_{1}}{1 - ε_{2}}\} + \int_{0}^{+ \infty} f (x | σ_{1} (1 - ε_{1}), η_{1}) log \{\frac{f (x | σ_{1} (1 - ε_{1}), η_{1})}{f (x | σ_{2} (1 - ε_{2}), η_{2})}\} d x], \end{matrix}

which concludes the proof. □
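A numerical check of Proposition 6 follows the same pattern as for the entropies. In this sketch the GZ divergences $K(X_{\pm ε_{1}}, X_{\pm ε_{2}})$ are computed by quadrature of (13) rather than from Proposition 5, and their combination is compared with a direct integration based on the density (3) with $μ = 0$. Each parameter triple is ordered as $(σ, η, ε)$; all names and values are illustrative assumptions.

```python
import math


def gz_logpdf(x, sigma, eta):
    """Logarithm of the Gompertz density (1) for x >= 0."""
    return math.log(eta / sigma) + x / sigma - eta * (math.exp(x / sigma) - 1.0)


def simpson(func, a, b, n=4000):
    """Composite Simpson rule on [a, b]; n must be even."""
    h = (b - a) / n
    total = func(a) + func(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * func(a + i * h)
    return total * h / 3.0


def gz_upper(sigma, eta, tail=1e-16):
    """Point where the GZ survival function drops to `tail`."""
    return sigma * math.log(1.0 - math.log(tail) / eta)


def gz_kl(s1, e1, s2, e2):
    """KL divergence (13) between GZ(s1, e1) and GZ(s2, e2), by quadrature."""
    def integrand(x):
        l1 = gz_logpdf(x, s1, e1)
        return math.exp(l1) * (l1 - gz_logpdf(x, s2, e2))
    return simpson(integrand, 0.0, gz_upper(s1, e1))


def srg_kl_direct(theta1, theta2):
    """KL divergence between two SRG laws with mu = 0, from the density (3)."""
    (s1, e1, eps1), (s2, e2, eps2) = theta1, theta2
    total = 0.0
    for sign in (1.0, -1.0):  # +1: side y <= 0, -1: side y > 0
        c1, c2 = 1.0 + sign * eps1, 1.0 + sign * eps2
        def integrand(x, c1=c1, c2=c2):
            lg1 = math.log(0.5) + gz_logpdf(x / c1, s1, e1)
            lg2 = math.log(0.5) + gz_logpdf(x / c2, s2, e2)
            return math.exp(lg1) * (lg1 - lg2)
        total += simpson(integrand, 0.0, gz_upper(s1 * c1, e1))
    return total


def srg_kl_prop6(theta1, theta2):
    """Right-hand side of Proposition 6 with numerically computed GZ divergences."""
    (s1, e1, eps1), (s2, e2, eps2) = theta1, theta2
    total = 0.0
    for sign in (1.0, -1.0):
        p1 = (1.0 + sign * eps1) / 2.0
        p2 = (1.0 + sign * eps2) / 2.0
        k = gz_kl(s1 * (1.0 + sign * eps1), e1, s2 * (1.0 + sign * eps2), e2)
        total += p1 * (math.log(p1 / p2) + k)
    return total


# Illustrative parameter triples (sigma, eta, eps); not estimates from the paper.
theta_a = (1.0, 1.0, 0.2)
theta_b = (1.0, 1.5, -0.3)
kl_direct = srg_kl_direct(theta_a, theta_b)
kl_prop6 = srg_kl_prop6(theta_a, theta_b)
print(kl_direct, kl_prop6, srg_kl_direct(theta_a, theta_a))
```

The first two values agree, and the divergence of a distribution from itself is zero, as required by the properties listed above.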

More details appear in [3,8] for the KL divergence expressions of other asymmetric distributions. Using Proposition 6, the asymptotic KL divergence between $Y \sim SRG(0, σ, η, ε)$ and $X \sim GZ(σ, η)$ is

K (Y, X) \approx \frac{1 + ε}{2} [lim_{ε_{2} \to - 1} log (\frac{1 + ε}{1 + ε_{2}}) + K (X_{+ ε}, X)] + \frac{1 - ε}{2} [log (\frac{1 - ε}{2}) + K (X_{- ε}, X)],

as $ε_{2} \to -1$. However, $\log(\frac{1 + ε}{1 + ε_{2}}) \to +\infty$ as $ε_{2} \to -1$, so $K(Y, X)$ is not finite. In contrast, from Proposition 6 the asymptotic KL divergence between $Y_{1}$ and $Y_{2}$ is

K (Y_{1}, Y_{2}) \approx K (X, Y) = log (\frac{2}{1 - ε}) + K (X, X_{- ε}),

(14)

as $ε_{1} \to -1$, where $X_{-ε} \sim GZ(σ (1 - ε), η)$. Therefore, while $K(Y, X)$ is not finite, $K(X, Y)$ is finite and can be used to study the disparity of $ε$ from −1. Thus, hypothesis testing for $H_{0}: ε = -1$ can be addressed. Besides, we further study hypothesis testing for scale and shape parameters between two SRG distributions in Section 3.4. From (14), we also have that $K(Y_{1}, Y_{2}) \approx K(X, X_{1})$ as $ε \to -1$, with $X_{1} \sim GZ(2 σ, η)$.

Figure 3 illustrates the KL divergence between two SRG distributions. We observe that the KL divergence reaches its highest values at the critical points $(ε_{1}, ε_{2}) \to \{(-1, 1); (1, -1)\}$ and is close to zero elsewhere [panels (a) and (b)]. For large values of $η$ [panel (c)], the KL divergence is zero in a concentrated region of the domain where $ε_{1} = ε_{2}$.

Figure 3. Plots of the Kullback–Leibler (KL) divergence between $Y_{1} \sim SRG(0, σ_{1}, η_{1}, ε_{1})$ and $Y_{2} \sim SRG(0, σ_{2}, η_{2}, ε_{2})$ for $σ_{1} = σ_{2} = 1$ and (a) $η_{1} = η_{2} = 0.25$; (b) $η_{1} = η_{2} = 3$; and (c) $η_{1} = η_{2} = 10$.

All information quantifiers and the EM algorithm for SRG distribution were implemented in [22].

3.4. Asymptotic Test

Consider two independent samples of sizes $n_{1}$ and $n_{2}$ from $Y_{1}$ and $Y_{2}$, respectively, where $Y_{1}$ and $Y_{2}$ have pdfs $g(y; θ_{1})$ and $g(y; θ_{2})$, with $θ_{1}, θ_{2} \in Θ \subset R^{p}$ and $θ_{i} = (σ_{i}, η_{i}, ε_{i})$, $i = 1, 2$. Suppose the partition $θ_{i} = (θ_{i 1}, θ_{i 2})$, and assume $θ_{21} = θ_{11} \in Θ_{1} \subset R^{r}$, so that $θ_{i 2} \in Θ \cap Θ_{1}^{c} \subset R^{p - r}$. Let ${\hat{θ}}_{i} = ({\hat{θ}}_{11}, {\hat{θ}}_{i 2})$ be the MLE of $θ_{i} = (θ_{11}, θ_{i 2})$ for $i = 1, 2$, which corresponds to the MLE of the full model parameters $(θ_{1}, θ_{2})$ under the null hypothesis $H_{0}: θ_{21} = θ_{11}$. Thus, part (b) of Corollary 1 in [12] establishes that if the null hypothesis $H_{0}: θ_{22} = θ_{12}$ holds and $\frac{n_{1}}{n_{1} + n_{2}} \to λ$ as $n_{1}, n_{2} \to \infty$, with $0 < λ < 1$, then

\begin{matrix} K_{0} = \frac{2 n_{1} n_{2}}{n_{1} + n_{2}} K ({\hat{θ}}_{1}, {\hat{θ}}_{2}) \underset{n_{1}, n_{2} \to \infty}{\overset{d}{⟶}} χ_{p - r}^{2}, \end{matrix}

(15)

where $p = 3$ is the number of parameters of the SRG distribution (the location parameter is not considered for the KL divergence). Thus, a test of level $α$ for the above homogeneity null hypothesis consists of rejecting $H_{0}$ if $K_{0} > χ_{p - r, 1 - α}^{2}$, where $χ_{p - r, α}^{2}$ is the $α$th percentile of the $χ_{p - r}^{2}$-distribution.

As [3] stated, the proposed asymptotic test is only valid under regularity conditions on the SRG distribution, in particular for a non-singular FIM. Therefore, given that the FIM of the SRG distribution is singular as $ε \to \pm 1$ [1], the SRG model cannot be used to test the null hypothesis through (15) when $ε$ is close to −1 or 1.
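Once an estimate of the KL divergence between the two fitted SRG models is available, the statistic in (15) and its asymptotic p-value are straightforward to compute. The sketch below is a minimal illustration: the function names are hypothetical, the chi-square survival function is implemented for integer degrees of freedom only, and the input values in the usage lines are made up. The number of degrees of freedom, $p - r$, is passed explicitly; the value 3 used below assumes that all three parameters $(σ, η, ε)$ are compared.

```python
import math


def chi2_sf(x, df):
    """Survival function P(chi2_df > x) for a positive integer df."""
    if x <= 0:
        return 1.0
    z = x / 2.0
    if df % 2:  # odd df: start from df = 1, where Q(1/2, z) = erfc(sqrt(z))
        a, q = 0.5, math.erfc(math.sqrt(z))
    else:       # even df: start from df = 2, where Q(1, z) = exp(-z)
        a, q = 1.0, math.exp(-z)
    # Recurrence of the regularized upper incomplete gamma function:
    # Q(a + 1, z) = Q(a, z) + z^a e^{-z} / Gamma(a + 1)
    while 2 * a < df:
        q += math.exp(a * math.log(z) - z - math.lgamma(a + 1.0))
        a += 1.0
    return q


def kl_test(kl_hat, n1, n2, df):
    """Statistic K_0 of Equation (15) and its asymptotic chi-square p-value."""
    k0 = 2.0 * n1 * n2 / (n1 + n2) * kl_hat
    return k0, chi2_sf(k0, df)


# Made-up inputs for illustration: estimated divergence and sample sizes.
k0, p_value = kl_test(kl_hat=0.05, n1=400, n2=600, df=3)
print(k0, p_value)
```

Because $K_{0}$ scales with $n_{1} n_{2} / (n_{1} + n_{2})$, even a small estimated divergence leads to rejection when both samples are large.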

4. Application

4.1. Sea Surface Temperature Data

The spatial information and SST data analyzed in this study were recorded by a scientific observer (SO) in the Chilean longline fleet (industrial and artisanal), which was oriented to capture swordfish (Xiphias gladius, [23]) from 2012 to 2014 (obtaining a sampling of 83% in 2012, 55% in 2013, 90% in 2014, and 75% in 2012–2014). The work of the SO concerns biological sampling of fishes and incidental captures of birds, turtles and marine mammals; biological sampling was complemented with information such as time, longline and hook features, number of buoys, baits, etc. The study covered the area 21°31′–36°39′ S and 71°08′–85°52′ W (see Figure 4).

Figure 4. Spatial distribution of Sea Surface Temperature (SST) observations by year (21°31′–36°39′ S, 71°08′–85°52′ W).

SST records in swordfish captures are crucial for distributional analysis and fish abundance. Specifically, variations in SST are physical factors that control productivity, growth and migration of species [24]. In addition, SST is strongly correlated with atmospheric pressure at sea level and thus climatic time scales. Therefore, changes in SST overlap with ecosystem changes [25]. However, SST influence on ecosystems is not clear because other physical processes such as superficial warming, horizontal advection of currents, upwelling, etc. [11], modify SST. Therefore, SST anomalies could be symptomatic rather than causal.

4.1.1. SRG Parameter Estimates

Considering the smallest Akaike (AIC) and Schwarz (BIC) information criteria, we observe in Table 1 that the SRG model performs better than the ESN and SN models (see Appendix A and Appendix B, respectively). In addition, Table 1 shows the estimated parameters (based on the EM algorithm presented in Section 2) for SST datasets by year assuming the SRG distribution. In 2012, a negative $ε$ estimate corresponds to asymmetry to the right, and in 2013 and 2014 negative $ε$ and $η$ close to zero produce a two-piece distribution to fit “cold” and “warm” temperatures (Figure 5).

Table 1. Parameter estimates and their respective standard deviations (SD) for SST by year based on SRG, epsilon-skew-normal (ESN) and skew-normal (SN) models. For each model, the log-likelihood function $ℓ(θ)$, $θ = (μ, σ, η, ε)$, Akaike’s (AIC) and Bayesian (BIC) information criteria, and goodness-of-fit tests (Kolmogorov–Smirnov (K–S), Anderson–Darling (A–D), and Cramer–von Mises (C–V)) are also reported with respective p-values in parentheses.

Figure 5. MLE fit of SRG, ESN and SN models for SST data by year.

To evaluate goodness of fit, the Kolmogorov–Smirnov (K–S), Anderson–Darling (A–D), and Cramer–von Mises (C–V) tests, which are commonly used to assess the fit of a particular distribution (see, e.g., [26]), were considered for all models. Considering a 95% confidence level, SRG fits perform well for 2012 and 2013, and on a 90% confidence level, the SRG fit performs well for 2014.

4.1.2. Information Quantifiers and Asymptotic Test

Parameters estimated from the SRG model and presented in Table 1 are used to compute the quantifiers of Sections 3.1–3.3 for SST in each year and to perform the asymptotic test of Section 3.4 for comparing SST between two years. The results of these analyses are shown in Table 2. In Table 2, $K_{0} = \hat{K}(Y_{1}, Y_{2})$ represents the KL divergence between the years $Y_{1}$ (column) and $Y_{2}$ (row).

Table 2. SRG Shannon, $H(Y)$, and Rényi, $R_{α}(Y)$, $α = 2, 3, 4$, entropies for SST data. For each year, the KL divergence statistic $K_{0} = \hat{K}(Y_{1}, Y_{2})$ and its respective p-values of Equation (15) are reported. All reported $K_{0}$ estimates considered the estimated parameters and sample size $n$ in Table 1.

The first quantifiers (SE and RE) illustrate that the highest information of SST is obtained by SE, which increases over the years. For all RE, the highest information of SST is obtained in 2012; RE is negative for 2013 and 2014 and similar during that period. Differences in information between SE and RE arise because SE does not depend on the parameter $σ$, while RE depends on three parameters, as in Proposition 4.

In addition, the asymptotic test presented in Table 2 gives analogous results for all the years in both groups. In fact, the null hypothesis $H_{0}: θ_{1} = θ_{2}$ is rejected at a 95% confidence level. This rejection is reinforced by the high values of the statistic $K_{0}$, produced by the large sample sizes of both groups ($n_{1}$ and $n_{2}$).

5. Conclusions

We have presented a methodology to compute the Shannon and Rényi entropies and the Kullback–Leibler divergence for the family of Skew-Reflected-Gompertz distributions. Our methods consider the information quantifiers previously computed for the Gompertz distribution. Explicit formulas for Shannon and Rényi entropies (in terms of the Gompertz Shannon and Rényi entropies, respectively), and the Kullback–Leibler divergence (using the incomplete gamma function) facilitate easy computational implementation. Additionally, given the regularity conditions satisfied by the Skew-Reflected-Gompertz distribution, specifically the convergence of the Fisher information matrix when $ε$ is in $(-1, 1)$, an asymptotic test for comparing two groups of datasets was developed.

A statistical application to South Pacific sea surface temperature was given. We first carried out SRG goodness-of-fit tests in samples over three years, where we find strong evidence (a 95% confidence level) for 2012, and moderate evidence (a 90% confidence level) for 2013 and 2014. The results show that the proposed methodology serves to compare two sets of Skew-Reflected-Gompertz distributed samples. The proposed asymptotic test is therefore useful to detect anomalies in sea surface temperature, linked to extreme events influenced by environmental conditions [11,24,25]. We encourage researchers to consider the proposed methodology for further investigations related to environmental datasets [1].

Author Contributions

J.E.C.-R. and M.M. wrote the paper and contributed reagents/analysis/materials tools; J.E.C.-R. and D.D.C. conceived, designed and performed the experiments and analyzed the data. All authors have read and approved the final manuscript.

Funding

This research received no external funding.

Acknowledgments

We are grateful to the Instituto de Fomento Pesquero (IFOP) for providing access to the data used in this work. Special thanks to Fernando Espíndola for his helpful insights and discussion on an early version of this paper. The SST datasets and R codes used in this work are available upon request to the corresponding author. The authors thank the editor and two anonymous referees for their helpful comments and suggestions.

Conflicts of Interest

The authors declare that there is no conflict of interest in the publication of this paper.

Abbreviations

The following abbreviations are used in this manuscript:

A–D	Anderson–Darling
AIC	Akaike’s information criterion
BIC	Bayesian information criterion
C–V	Cramer–von Mises
CDF	Cumulative distribution function
EM	Expectation maximization
ESN	Epsilon-skew-normal
FIM	Fisher information matrix
GZ	Gompertz
K–S	Kolmogorov–Smirnov
KL	Kullback–Leibler
MGF	Moment-generating function
MLE	Maximum Likelihood Estimator
PDF	Probability density function
RE	Rényi entropy
SD	Standard deviation
SE	Shannon entropy
SN	Skew-normal
SRG	Skew-Reflected-Gompertz
SST	Sea surface temperature

Appendix A. The Epsilon-Skew-Normal Distribution

The epsilon-skew-normal distribution [4,27] in its location-scale version is denoted as $ESN(θ, ϖ, ϵ)$. It can be derived from a more general class of two-piece asymmetric distributions proposed by [14], by considering the standardized normal kernel $ϕ(\cdot)$ (zero mean and variance 1), denoted as $N(0, 1)$, as the density $f$ and the functions $a(ϵ) = 1 + ϵ$ and $b(ϵ) = 1 - ϵ$. If $Z \sim ESN(θ, ϖ, ϵ)$, then $Z$ has pdf given by

\begin{matrix} h (z | θ, ϖ, ϵ) = \{\begin{matrix} ϕ (\frac{θ - z}{ϖ (1 + ϵ)}), & z \leq θ, \\ ϕ (\frac{z - θ}{ϖ (1 - ϵ)}), & z > θ, \end{matrix} \end{matrix}

(A1)

where $Z = θ + ϖ X$ for location $θ \in \mathbb{R}$ and scale $ϖ > 0$ parameters. The mean and variance of $Z$ are

\begin{matrix} E (Z) & = & θ - 4 ϖ ϵ / \sqrt{2 π}, \\ V a r (Z) & = & \frac{ϖ^{2}}{π} [(3 π - 8) ϵ^{2} + π], \end{matrix}

and the MGF of X is given by

M_{X} (t) = (1 + ϵ) e^{\frac{{(1 + ϵ)}^{2} t^{2}}{2}} Φ [- (1 + ϵ) t] + (1 - ϵ) e^{\frac{{(1 - ϵ)}^{2} t^{2}}{2}} Φ [(1 - ϵ) t],

where $Φ(\cdot)$ is the cdf of the standardized Gaussian distribution.

Appendix B. The Skew-Normal Distribution

Let $X$ be a skew-normal (SN, [28]) random variable denoted as $X \sim SN(ξ, ω, λ)$. The pdf of $X$ is given by

\begin{matrix} f (x; λ) = 2 ϕ (z) Φ (λ z), \end{matrix}

(A2)

with $z = (x - ξ) / ω$. The SN model with the density (A2) is explained by its stochastic representation

X \overset{d}{=} ξ + δ | U_{0} | + \sqrt{1 - δ^{2}} U,

(A3)

where $δ = λ / \sqrt{1 + λ^{2}}$, $X$ is represented as a linear combination of a Gaussian variable $U$ and a half-Gaussian variable $| U_{0} |$, and $U_{0} \sim N(0, 1)$ and $U \sim N(0, ω^{2})$ are independent (Theorem 1 of [29]). From (A3), the mean and variance of $X$ are $E(X) = ξ + \sqrt{2 / π} δ$ and $Var(X) = ω^{2} - (2 / π) δ^{2}$, respectively.

References

1. Hoseinzadeh, A.; Maleki, M.; Khodadadi, Z.; Contreras-Reyes, J.E. The Skew-Reflected-Gompertz distribution for analyzing symmetric and asymmetric data. J. Comput. Appl. Math. 2019, 349, 132–141.
2. Gompertz, B. On the nature of the function expressive of the law of human mortality, and on a new mode of determining the value of life contingencies. Philos. Trans. R. Soc. Lond. 1825, 115, 513–583.
3. Arellano-Valle, R.B.; Contreras-Reyes, J.E.; Stehlík, M. Generalized skew-normal negentropy and its application to fish condition factor time series. Entropy 2017, 19, 528.
4. Mudholkar, G.S.; Hutson, A.D. The epsilon-skew-normal distribution for analyzing near-normal data. J. Stat. Plan. Inference 2000, 83, 291–309.
5. Maleki, M.; Mahmoudi, M.R. Two-Piece Location-Scale Distributions based on Scale Mixtures of Normal family. Commun. Stat. Theor. Meth. 2017, 46, 12356–12369.
6. Moravveji, B.; Khodadadi, Z.; Maleki, M. A Bayesian Analysis of Two-Piece distributions based on the Scale Mixtures of Normal Family. Iran. J. Sci. Technol. Trans. A 2019, 43, 991–1001.
7. Contreras-Reyes, J.E. Rényi entropy and complexity measure for skew-gaussian distributions and related families. Physica A 2015, 433, 84–91.
8. Contreras-Reyes, J.E. Analyzing fish condition factor index through skew-gaussian information theory quantifiers. Fluct. Noise Lett. 2016, 15, 1650013.
9. Wang, Y.Q.; Derksen, R.W. The confirmation of the α–β model and the maximum entropy formulation in a thermal wake. Environmetrics 1998, 9, 269–282.
10. De Queiroz, M.M.; Silva, R.W.; Loschi, R.H. Shannon entropy and Kullback–Leibler divergence in multivariate log fundamental skew-normal and related distributions. Can. J. Stat. 2016, 44, 219–237.
11. Di Lorenzo, E.; Combes, V.; Keister, J.E.; Strub, P.T.; Thomas, A.C.; Franks, P.J.; Ohman, M.D.; Furtado, J.C.; Bracco, A.; Bograd, S.J.; et al. Synthesis of Pacific Ocean climate and ecosystem dynamics. Oceanography 2013, 26, 68–81.
12. Salicrú, M.; Menéndez, M.L.; Pardo, L.; Morales, D. On the applications of divergence type measures in testing statistical hypothesis. J. Multivar. Anal. 1994, 51, 372–391.
13. Maleki, M.; Contreras-Reyes, J.E.; Mahmoudi, M.R. Robust Mixture Modeling Based on Two-Piece Scale Mixtures of Normal Family. Axioms 2019, 8, 38.
14. Arellano-Valle, R.B.; Gómez, H.W.; Quintana, F.A. Statistical inference for a general class of asymmetric distributions. J. Stat. Plan. Inference 2005, 128, 427–443.
15. Jafari, A.A.; Tahmasebi, S.; Alizadeh, M. The beta-Gompertz distribution. Rev. Colomb. Estad. 2014, 37, 141–158.
16. Shannon, C.E. A mathematical theory of communication. Bell Syst. Tech. J. 1948, 27, 379–423.
17. Cover, T.M.; Thomas, J.A. Elements of Information Theory; Wiley & Sons, Inc.: New York, NY, USA, 2006.
18. Rényi, A. Probability Theory; Dover Publications: New York, NY, USA, 2012.
19. Abu-Zinadah, H.H.; Aloufi, A.S. Some characterizations of the exponentiated Gompertz distribution. Int. Math. Forum 2014, 9, 1427–1439.
20. Kullback, S.; Leibler, R.A. On information and sufficiency. Ann. Math. Stat. 1951, 22, 79–86.
21. Bauckhage, C. Characterizations and Kullback–Leibler Divergence of Gompertz Distributions. arXiv 2014, arXiv:1402.3193.
22. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2018; ISBN 3-900051-07-0.
23. Barría, P.; González, A.; Cortés, D.D.; Mora, S.; Miranda, H.; Cerna, F.; Cid, L.; Ortega, J.C. Seguimiento Pesquerías Recursos Altamente Migratorios, 2016. Convenio de Desempeño 2016; Informe Final, Subsecretaría de Economía y EMT; Instituto de Fomento Pesquero: Valparaíso, Chile, 2017.
24. Alheit, J.; Bernal, P. Effects of physical and biological changes on the biomass yield of the Humboldt Current ecosystem. In Large Marine Ecosystems—Stress, Mitigation and Sustainability; American Association for the Advancement of Science: Washington, DC, USA, 1993; pp. 53–68.
25. Oerder, V.; Bento, J.P.; Morales, C.E.; Hormazabal, S.; Pizarro, O. Coastal Upwelling Front Detection off Central Chile (36.5–37°S) and Spatio-Temporal Variability of Frontal Characteristics. Remote Sens. 2018, 10, 690.
26. Lenart, A.; Missov, T.I. Goodness-of-fit tests for the Gompertz distribution. Commun. Stat. Theor. Meth. 2016, 45, 2920–2937.
27. Bondon, P. Estimation of autoregressive models with epsilon-skew-normal innovations. J. Multivar. Anal. 2009, 100, 1761–1776.
28. Azzalini, A. A Class of Distributions which includes the Normal Ones. Scand. J. Stat. 1985, 12, 171–178.
29. Henze, N. A probabilistic representation of the ‘skew-normal’ distribution. Scand. J. Stat. 1986, 13, 271–275.

Figure 1. Shannon entropy of Skew-Reflected-Gompertz (SRG) distributions for

ε \in (- 1, 1)

and several values of

η

.

Figure 2. Rényi entropy of SRG distributions for

σ = 1

,

- 1 < ε < 1

, several values of

η

and (a)

α = 2

and (b)

α = 5

values.

Figure 3. Plots of Kullback–Leibler (KL) divergence between

Y_{1} \sim SRG (0, σ_{1}, η_{1}, ε_{1})

and

Y_{2} \sim SRG (0, σ_{2}, η_{2}, ε_{2})

for values

σ_{1} = σ_{2} = 1

and (a)

η_{1} = η_{2} = 0.25

; (b)

η_{1} = η_{2} = 3

; and (c)

η_{1} = η_{2} = 10

.

Figure 4. Spatial distribution of Sea Surface Temperature (SST) observations by year (21°31′–36°39′ LS, 71°08′–85°52′ LW).

Figure 5. MLE fit of SRG, ESN and SN models for SST data by year.

Table 1. Parameter estimates and their respective standard deviations (SD) for SST by year based on SRG, epsilon-skew-normal (ESN) and skew-normal (SN) models. For each model, log-likelihood function

ℓ (θ)

,

θ = (μ, σ, η, ε)

, Akaike’s (AIC) and Bayesian (BIC) information criteria, and goodness-of-fit tests (Kolmogorov–Smirnov (K–S), Anderson–Darling (A–D), and Cramér–von Mises (C–V)) are also reported, with respective p-values in parentheses.

Year	Model	Parameter	Estimate	SD	$ℓ (θ)$	AIC	BIC	K–S	A–D	C–V
2012 ( $n = 774$ )	SRG	$μ$	17.992	0.103	−1401.896	2811.793	2830.399	0.044 (0.095)	2.014 (0.090)	0.214 (0.242)
		$σ$	2.590	0.067
		$η$	1.444	0.027
		$ε$	−0.207	0.075
	ESN	$θ$	18.000	0.031	−1507.534	3021.069	3035.023	0.118 (<0.01)	26.417 (<0.01)	2.059 (<0.01)
		$ϖ$	1.657	0.015
		$ϵ$	−0.418	0.069
	SN	$ξ$	16.777	0.114	−1404.581	2815.161	2829.116	0.041 (0.143)	1.752 (0.126)	0.198 (0.271)
		$ω$	5.199	0.043
		$λ$	2.527	0.311
2013 ( $n = 415$ )	SRG	$μ$	17.935	0.061	−687.420	1382.839	1398.942	0.082 (0.010)	2.632 (0.042)	0.491 (0.041)
		$σ$	1.112	0.026
		$η$	0.432	0.021
		$ε$	−0.108	0.029
	ESN	$θ$	17.600	0.046	−716.375	1438.750	1450.827	0.089 (<0.01)	7.721 (<0.01)	0.970 (0.002)
		$ϖ$	1.328	0.026
		$ϵ$	−0.376	0.092
	SN	$ξ$	16.598	0.200	−691.531	1389.063	1401.140	0.066 (0.054)	2.002 (0.092)	0.328 (0.113)
		$ω$	3.812	0.054
		$λ$	2.421	0.617
2014 ( $n = 439$ )	SRG	$μ$	17.454	0.048	−653.082	1314.164	1330.502	0.092 (<0.01)	2.848 (0.033)	0.533 (0.032)
		$σ$	0.896	0.020
		$η$	0.375	0.020
		$ε$	−0.106	0.025
	ESN	$θ$	17.200	0.053	−703.748	1413.496	1425.750	0.109 (<0.01)	11.996 (<0.01)	1.529 (<0.01)
		$ϖ$	0.956	0.035
		$ϵ$	−0.384	0.090
	SN	$ξ$	16.146	0.098	−666.984	1339.968	1352.222	0.096 (<0.01)	4.055 (<0.01)	0.711 (0.011)
		$ω$	3.245	0.045
		$λ$	3.434	0.618
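The AIC and BIC columns follow from the maximized log-likelihood, the number of parameters k and the sample size n through the standard definitions AIC = 2k − 2ℓ and BIC = k ln n − 2ℓ. The short sketch below recomputes them for the 2012 rows of Table 1. It assumes k = 4 for SRG and k = 3 for ESN and SN (the number of parameters listed per model), and the function names are illustrative.

```python
import math

def aic(loglik, k):
    """Akaike information criterion: 2k - 2*loglik."""
    return 2.0 * k - 2.0 * loglik

def bic(loglik, k, n):
    """Bayesian information criterion: k*ln(n) - 2*loglik."""
    return k * math.log(n) - 2.0 * loglik

# 2012 rows of Table 1: (model, maximized log-likelihood, number of parameters)
fits_2012 = [("SRG", -1401.896, 4), ("ESN", -1507.534, 3), ("SN", -1404.581, 3)]
n_2012 = 774

for model, loglik, k in fits_2012:
    print(f"{model}: AIC = {aic(loglik, k):.3f}, BIC = {bic(loglik, k, n_2012):.3f}")
```

The recomputed values agree with the tabulated 2012 entries up to rounding of the reported log-likelihoods.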

Table 2. SRG Shannon,

H (Y)

, and Rényi,

R_{α} (Y)

,

α = 2, 3, 4

, entropies for SST data. For each year, the KL divergence

K_{0} = \hat{K} (Y_{1}, Y_{2})

, the test statistic and its respective p-value from Equation (15) are reported. All reported

K_{0}

estimates use the estimated parameters and sample sizes n in Table 1.

Year	Quantifier	2012	2013	2014
	$H (Y)$	0.765	0.781	2.754
	$R_{2} (Y)$	0.384	−0.362	−0.365
	$R_{3} (Y)$	0.252	−0.417	−0.418
	$R_{4} (Y)$	0.163	−0.457	−0.457
2012	$K_{0}$	-	0.266	0.911
	Statistic	-	143.740	520.41
	p-value	-	<0.01	<0.01
2013	$K_{0}$	0.080	-	0.071
	Statistic	43.192	-	30.233
	p-value	<0.01	-	<0.01
2014	$K_{0}$	0.143	0.043	-
	Statistic	80.327	18.282	-
	p-value	<0.01	<0.01	-

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
