The Asymmetric Alpha-Power Skew-t Distribution

Roger Tovar-Falón; Heleno Bolfarine; Guillermo Martínez-Flórez

doi:10.3390/sym12010082

,

and

¹

Departamento de Matemáticas y Estadística, Facultad de Ciencias Básicas, Universidad de Córdoba, Montería 230027, Colombia

²

Departamento de Estatística, IME, Universidade de São Paulo, São Paulo 1010, Brazil

^*

Author to whom correspondence should be addressed.

Symmetry2020, 12(1), 82;https://doi.org/10.3390/sym12010082

This article belongs to the Special Issue Symmetry in Applied Mathematics

Version Notes

Order Reprints

Abstract

In this paper, we propose a new asymmetric and heavy-tail model that generalizes both the skew-t and power-t models. Properties of the model are studied in detail. The score functions and the elements of the observed information matrix are given. The process to estimate the parameters in model is discussed by using the maximum likelihood approach. Also, the observed information matrix is shown to be non-singular at the whole parametric space. Two applications to real data sets are reported to demonstrate the usefulness of this new model.

Keywords:

alpha-power skew-t distribution; skew-t distribution; power-t distribution; asymmetry; Fisher information matrix; maximum likelihood estimation

1. Introduction

In recent years, there has been considerable interest in the statistical literature related to flexible families of distributions able of modeling data that present high degree of asymmetry, with kurtosis index greater or smaller than the captured by normal model. In this context, two proposals that have shown a promising behavior in this type of situations are the skew-normal (SN) distribution of Azzalini [1] and the power-normal (PN) distribution of Durrans [2]. The SN distribution has been widely studied by many authors, and its main drawback is that it presents singular Fisher information matrix, implying the inference is useless from the theory of large samples using the maximum likelihood (ML) approach. Although the PN model has a shorter asymmetry range than SN distribution, it presents non-singular information matrix and can easily be extended to censored scenarios, as it has a simple distribution function, see, for example, in Martínez-Flórez et al. [3].

The PN model is part of a wide family of distributions known as alpha-power, which has been widely studied by many authors. In addition to the normal distribution, the Birnbaum–Saunders (BS) distribution [4] has also been considered, see, for example, in Martínez-Flórez et al. [5], who propose an extension of the BS distribution based on the asymmetric alpha-power family of distributions to illustrate the applicability of the new proposal with a data set is related to the lifetimes in cycles

\times 10^{- 3}

n = 101

aluminum

6061 - T 6

pieces cut in parallel angle to the rotation direction of rolling at the rate of 18 cycles per second and maximum stress of 21.000 psi. More details of the PN distribution can be found in Gupta and Gupta [6] and Pewsey et al. [7].

An alternative propose for modeling asymmetric data that unifies the two previous approaches was introduced by Martínez-Flórez et al. [8]. The proposed model, which is called alpha-power skew-normal (APSN), has non-singular Fisher information matrix, and it can fit data with much more asymmetry than PN models it can handle. In addition, symmetry can be tested by using the likelihood ratio statistic, as the properties of large samples are satisfied for the ML estimator.

Another set of distributions with non-singular information matrices, useful for modeling asymmetric and heavy-tailed data, are based on generalizations of the Student-t distribution, see, for example, in [9,10,11,12,13]. Azzalini and Capitanio [9] for example, introduced a skew-t (ST) distribution as an extension of the SN model for modeling asymmetric and heavy-tailed data as follows; The random variable X is said to have the ST distribution with parameter

λ

and degrees of freedom

ν

, if X has the probability density function (PDF) given by

f_{S T} (x; λ, ν) = 2 f_{T} (x; ν) F_{T} (λ \sqrt{\frac{ν + 1}{x^{2} + ν}} x; ν + 1), x \in R

(1)

where

λ \in R

is a parameter that controls the skewness of the distribution, and

f_{T} (\cdot; ν)

and

F_{T} (\cdot; ν)

denote the PDF and the cumulative distribution function (CDF) of a standard Student-t distribution with

ν

degree of freedom, respectively. The ST distribution, like an extension of the SN model, inherits the problem of the singularity of the information matrix and before this inconvenience Zhao and Kim [14] proposed the power Student-t (PT) distribution, whose information matrix is non-singular and for a given degree of freedom, the kurtosis range surpasses the kurtosis range of the skew-t model at all times. The PT distribution is defined as follows. The random variable X is said to have the PT distribution with parameter

α

, and degrees of freedom

ν

, if X has PDF given by

f_{P T} (x; α, ν) = α f_{T} (x; ν) {[F_{T} (x; ν)]}^{α - 1}, x \in R

(2)

where

α > 0

is a parameter that controls the form of the distribution, and, again,

f_{T} (\cdot; ν)

and

F_{T} (\cdot; ν)

denote the PDF and the CDF of a standard Student-t distribution, respectively.

Based on the properties of the ST model, to fit data with high degree of asymmetry and the characteristic of the PN model to capture kurtosis larger than the normal model, in this paper, we introduce a new distribution for modeling asymmetric and heavy-tailed data. The proposed model possess non-singular information matrix, and it is able to fit data with far more asymmetry than ST and PT models can handle and with large sample properties satisfied for the ML estimator. The model introduced in this paper is named as alpha-power skew-t (APST) model and it extends both, ST and PT models. The APSN model by Martínez-Flórez et al. [8] is also a particular case when

ν

tends to infinite. Note that symmetry can be tested using the likelihood ratio statistics with its large sample chi-square distribution.

The rest of this paper is organized as follows. Section 2 introduces the APST model and some of its properties like moments are studied. In particular, skewness and kurtosis indices are computed showing that their ranges surpass those of the ST and PT models. Section 3 deals with the ML estimation for the location-scale situation and its observed information matrix is derived. The extension to censored data is also presented. Finally, two applications are shown in Section 4, revealing that the model proposed can present much improvement over competitors.

2. The Alpha-Power Skew-t Distribution

Definition 1.

The random variable X is said to have an alpha-power skew-t (APST) distribution, if X has PDF given by

f_{A P S T} (x; λ, α, ν) = α f_{S T} (x; λ, ν) {[F_{S T} (x; λ, ν)]}^{α - 1},

(3)

for

x \in R

,

λ \in R

, and

α, ν \in R^{+}

. Functions

f_{S T} (\cdot)

and

F_{S T} (\cdot)

denote the PDF and the CDF of the standard ST distribution. A random variable having

f_{A P S T} (x; λ, α, ν)

distribution is denoted shortly by

X \sim APST (λ, α, ν)

.

Figure 1 displays the form of the APST distribution for some selected values of the parameters

λ

and

α

for

ν = 6

. Note from the figure that the asymmetry and kurtosis of the APST distribution are affected by the parameters

α

and

λ

; therefore, the APST model is more flexible to model data that can be highly skewed, as well as heavier tails than ST and PT models.

Figure 1. Probability density function of

APST (λ, α, 10)

for some values of

λ

and

α

.

The following result provides some special cases of the model (3), which occur for different values of

λ

,

α

, and

ν

.

Proposition 1.

Let

X \sim APST (λ, α, ν)

,

(i): if $λ = 0$ , then $X \sim PT (α, ν)$ ,
(ii): if $α = 1$ , then $X \sim ST (λ, ν)$ ,
(iii): if $λ = 0$ and $α = 1$ , then $X \sim T (ν)$ , where $T (ν)$ denotes the Student-t disribution with ν degree of freedom.
(iv): if $ν \to + \infty$ , then $X \sim APSN (λ, α)$ ,
(v): if $λ = 0$ and $ν \to + \infty$ , then $X \sim PN (α)$ ,
(vi): if $α = 1$ and $ν \to + \infty$ , then $X \sim SN (λ)$ ,
(vii): if $λ = 0$ , $α = 1$ and $ν \to + \infty$ , then $X \sim N (0, 1)$ ,

Proof.

The proof of (i)–(vii) is immediate from the definition of APST distribution. □

2.1. Moments

The following proposition presents an expression to compute the k-th moment of a random variable

APST (λ, α, ν)

.

Proposition 2.

Let

X \sim APST (λ, α, ν)

, then

E [X^{k}] = E [{(F_{S T}^{- 1} (Y; λ, ν))}^{k}]

(4)

where Y follows a

Beta (α, 1)

distribution and

F_{S T}^{- 1} (\cdot; λ, ν)

is the inverse of the function

F_{S T} (\cdot; λ, ν)

.

Proof.

We have by definition that

E [X^{k}] = \int_{R} x^{k} α f_{S T} (x) {(F_{S T} (x; λ, ν))}^{α - 1} d x

thus, letting

y = F_{S T} (x; λ, ν)

, then

x = F_{S T}^{- 1} (y; λ, ν)

, it follows that

E [X^{k}] = \int_{0}^{1} α {(F_{S T}^{- 1} (y; λ, ν))}^{k} y^{α - 1} d y

which is the expected value of the function

{(F_{S T}^{- 1} (Y; λ, ν))}^{k}

, where Y follows a beta distribution with parameters

α

and 1. □

The indices of skewness

(\sqrt{β_{1}})

and kurtosis

(β_{2})

of APST distribution can be calculated by using the moments (4) as follows,

\sqrt{β_{1}} = \frac{μ_{3} - 3 μ_{1} μ_{2} + 2 μ_{1}^{3}}{{(μ_{2} - μ_{1}^{2})}^{3 / 2}} and β_{2} = \frac{μ_{4} - 4 μ_{1} μ_{3} + 6 μ_{2} μ_{1}^{2} - 3 μ_{1}^{4}}{{(μ_{2} - μ_{1}^{2})}^{2}}

where

μ_{k} = E [X^{k}]

for

k = 1, \dots, 4

. Table 1 presents the ranges of possible values for the indices of asymmetry and kurtosis for

ST (λ, ν)

,

PT (α, ν)

, and

APST (λ, α, ν)

distributions, for values of

λ

between −40 and 40, values of

α

between 0.5 and 50, and for values of

ν = 2, 3, 4, 5, 6, 7

. It can seen from Table 1 that the length of the admissible intervals for the skewness and the kurtosis parameters of the APST distribution are larger than the corresponding intervals of the ST and PT distributions. This is an indicator that the APST model is more flexible in terms of asymmetry and kurtosis than the ST and PT models.

Table 1. Skewness and kurtosis for the models

ST (λ, ν)

,

PT (α, ν)

, and

APST (λ, α, ν)

, for

λ \in (- 40, 40)

,

α \in (0.5, 50)

and

ν = 2, \dots 7

.

2.2. Distribution Function

Proposition 3.

Let

X \sim APST (λ, α, ν)

, then the CDF of X, namely,

F_{A P S T} (x; λ, α, ν)

is

F_{A P S T} (x; λ, α, ν) = {[F_{S T} (x; λ, ν)]}^{α}, x \in R .

(5)

Proof.

The proof is immediate and it follows from results of Durrans [2]. □

The inversion method can be used to generate a random variable with APST distribution. Thus, taking

λ \in R

,

α, ν \in R^{+}

and a random variable with uniform distribution, namely,

U \sim U (0, 1)

, random variable X with

APST (λ, α, ν)

distribution is generated by taking

X = F_{S T}^{- 1} (U^{1 / α}; λ, ν) .

Remark 1.

We consider a truncated

APST (λ, α)

distribution to obtain a new and useful lifetime distribution. A random variable T has a truncated alpha-power skew-t distribution (at zero), denoted by

TAPST (λ, α, ν)

, if its PDF is given by

f (t) = \frac{α f_{S T} (t, λ, ν) {[F_{S T} (t, λ, ν)]}^{α - 1}}{1 - {[F_{S T} (0, λ, ν)]}^{α}}; t > 0

(6)

The survival and hazard rate functions of a random variable T following a

TAPST (λ, α, ν)

distribution are given by

S_{T} (t) = P (T > t) = \frac{1 - {[F_{S T} (0, λ, ν)]}^{α} - {[F_{S T} (t, λ, ν)]}^{α}}{1 - {[F_{S T} (0, λ, ν)]}^{α}}; t > 0

(7)

and

h_{T} (t) = \frac{α f_{S T} (t, λ, ν) {[f_{S T} (t, λ, ν)]}^{α - 1}}{1 - {[F_{S T} (0, λ, ν)]}^{α} - {[F_{S T} (t, λ, ν)]}^{α}}; t > 0

(8)

respectively.

2.3. Location and Scale Extension

We can also consider a generalization of a APST distribution by adding location and scale parameters. The following definition gives a generalization of the APST model.

Definition 2.

Let

X \sim APST (λ, α, ν)

. The APST density of location and scale is defined as the distribution of

Y = μ + σ X

, for

μ \in R

and

σ > 0

. The corresponding PDF is given by

f_{A P S T} (y; μ, σ, λ, α, ν) = \frac{α}{σ} f_{S T} (\frac{y - μ}{σ}; λ, ν) {[F_{S T} (\frac{y - μ}{σ}; λ, ν)]}^{α - 1}, x \in R,

(9)

for

λ \in R

and

α, ν \in R^{+}

. A random variable following a APST distribution of location and scale is denoted by

Y \sim APST (μ, σ, λ, α, ν)

.

The k-th moment of a random variable

Y \sim APST (μ, σ, λ, α, ν)

can be obtained from the formula

E [Y^{k}] = \sum_{i = 0}^{k} (\binom{k}{i}) μ^{i} σ^{k - i} E [X^{k - i}],

where

X \sim APST (λ, α, ν)

.

3. Statistical Inference for APST Distribution

This section concerns likelihood inference about the parameter vector

θ = {(μ, σ, λ, α, ν)}^{⊤}

of the location-scale family defined in Equation (9). Let

Y = {(Y_{1}, \dots, Y_{n})}^{⊤}

be a random sample of the distribution

APST (μ, σ, λ, α, ν)

. The log-likelihood function for

θ = {(μ, σ, λ, α, ν)}^{⊤}

can be written as follows,

\begin{array}{l} ℓ (θ; Y) \propto n & log α - n log σ - \frac{n}{2} log ν \\ + n log Γ (\frac{ν + 1}{2}) - n log Γ (\frac{ν}{2}) - \frac{ν + 1}{2} \sum_{i = 1}^{n} log (1 + \frac{z_{i}^{2}}{ν}) \\ + \sum_{i = 1}^{n} log F_{T} (λ z_{i} \sqrt{\frac{ν + 1}{z_{i}^{2} + ν}}; ν + 1) + (α - 1) \sum_{i = 1}^{n} log F_{S T} (z_{i}; λ, ν) \end{array}

(10)

where

z_{i} = (y_{i} - μ) / σ

. Thus, by differentiating the log-likelihood function, we obtain the following score equations,

\begin{matrix} \frac{\partial ℓ (θ; Y)}{\partial μ} & = \frac{ν + 1}{σ ν} \sum_{i = 1}^{n} z_{i} {(1 + \frac{z_{i}^{2}}{ν})}^{- 1} \\ - \frac{λ}{σ} \sum_{i = 1}^{n} w_{i} {(1 + \frac{z_{i}^{2}}{ν})}^{- 1} \frac{f_{T} (λ z_{i} w_{i}; ν + 1)}{F_{T} (λ z_{i} w_{i}; ν + 1)} - \frac{α - 1}{σ} \sum_{i = 1}^{n} \frac{f_{S T} (z_{i}; λ, ν)}{F_{S T} (z_{i}; λ, ν)} = 0 \end{matrix}

(11)

\begin{matrix} \frac{\partial ℓ (θ; Y)}{\partial σ} & = - \frac{n}{σ} + \frac{ν + 1}{σ ν} \sum_{i = 1}^{n} z_{i}^{2} {(1 + \frac{z_{i}^{2}}{ν})}^{- 1} - \frac{λ}{σ} \sum_{i = 1}^{n} z_{i} w_{i} {(1 + \frac{z_{i}^{2}}{ν})}^{- 1} \frac{f_{T} (λ z_{i} w_{i}; ν + 1)}{F_{T} (λ z_{i} w_{i}; ν + 1)} \\ - \frac{α - 1}{σ} \sum_{i = 1}^{n} z_{i} \frac{f_{S T} (z_{i}; λ, ν)}{F_{S T} (z_{i}; λ, ν)} = 0 \end{matrix}

(12)

\begin{array}{l} \frac{\partial ℓ (θ; Y)}{\partial λ} & = \sum_{i = 1}^{n} z_{i} w_{i} \frac{f_{T} (λ z_{i} w_{i}; ν + 1)}{F_{T} (λ z_{i} w_{i}; ν + 1)} - \frac{α - 1}{π (1 + λ^{2})} \sum_{i = 1}^{n} \frac{{(1 + (1 + λ^{2}) z_{i}^{2} / ν)}^{- \frac{ν}{2}}}{F_{S T} (z_{i}; λ, ν)} = 0, \end{array}

(13)

\begin{array}{l} \frac{\partial ℓ (θ; Y)}{\partial α} & = \frac{n}{α} + \sum_{i = 1}^{n} log F_{S T} (z_{i}; λ, ν) = 0, \end{array}

(14)

\begin{array}{l} \frac{\partial ℓ (θ; Y)}{\partial ν} & = \frac{n α}{2} (ψ (\frac{ν + 1}{2}) - ψ (\frac{ν}{2}) - \frac{1}{ν}) - \frac{1}{2} \sum_{i = 1}^{n} log (1 + \frac{z_{i}^{2}}{ν}) \\ + \frac{ν + 1}{2 ν^{2}} \sum_{i = 1}^{n} z_{i}^{2} {(1 + \frac{z_{i}^{2}}{ν})}^{- 1} \\ + \frac{λ}{2 ν (ν + 1)} \sum_{i = 1}^{n} z_{i}^{3} w_{i} {(1 + \frac{z_{i}^{2}}{ν})}^{- 1} \frac{f_{T} (λ z_{i} w_{i}; ν + 1)}{F_{T} (λ z_{i} w_{i}; ν + 1)} \\ - \frac{λ}{2 ν (ν + 1)} \sum_{i = 1}^{n} z_{i} w_{i} {(1 + \frac{z_{i}^{2}}{ν})}^{- 1} \frac{f_{T} (λ z_{i} w_{i}; ν + 1)}{F_{T} (λ z_{i} w_{i}; ν + 1)} \\ - \frac{(α - 1)}{2 π (ν + 1)} \frac{λ}{(1 + λ^{2})} \sum_{i = 1}^{n} \frac{{(1 + (1 + λ^{2}) z_{i}^{2} / ν)}^{- \frac{ν}{2}}}{F_{S T} (z_{i}; λ, ν)} + \frac{α - 1}{2} \sum_{i = 1}^{n} \frac{g (z_{i}; ν)}{F_{S T} (z_{i}; λ, ν)} = 0 \end{array}

(15)

where

ψ (\cdot)

is the digamma function,

w_{i} = \sqrt{\frac{ν + 1}{x_{i}^{2} + ν}}

for

i = 1, \dots, n

, and

g (x; ν)

is the function defined by

\begin{matrix} g (x; ν) & = \int_{- \infty}^{x} \{\frac{(ν + 1)}{ν^{2}} s^{2} {(1 + \frac{s^{2}}{ν})}^{- 1} - log (1 + \frac{s^{2}}{ν})\} f_{S T} (s; λ, ν) d s \\ - \frac{λ}{π ν} \int_{- \infty}^{x} s {(1 + \frac{s^{2}}{ν})}^{- 1} {\{1 + (1 + λ^{2}) \frac{s^{2}}{ν}\}}^{- \frac{ν + 2}{2}} d s \end{matrix}

(16)

Equations (11)–(15) include nonlinear functions; therefore, it is not possible to obtain explicit forms of the maximum likelihood estimators (MLEs), and they must be calculated by using numerical methods. In this work, we used the maxLik function of R Development Core Team [15] which uses the Newton–Raphson optimization method. The elements of the observed information matrix are easily obtained after calculating the second derivative of the log-likelihood function and multiplying by −1, that is,

j_{θ_{i} θ_{k}} = - \frac{\partial ℓ (θ; Y)}{\partial θ_{i} \partial θ_{k}}, i, k = 1, 2, \dots, 5

(17)

where

θ = {(μ, σ, λ, α, ν)}^{⊤}

. This elements are given in the Appendix A. To find the standard errors (EE) of the MLEs and calculate confidence intervals, the information matrix

I

(or Fisher information) must be calculated, which is defined as the expected value of the second derived from the log-likelihood function or less the expected value of the Hessian matrix; from this matrix, we calculate the EE as the diagonal elements of the inverse of this matrix. The elements of the

I

matrix are obtained as

I (i, k) = - E (\frac{\partial ℓ (θ; Y)}{\partial θ_{i} \partial θ_{k}}), i, k = 1, 2, \dots, 5

(18)

The role of the Fisher information in the asymptotic theory of maximum-likelihood estimation was emphasized by Ronald Fisher following some initial results by Francis Edgeworth, see Lehman and Casella [16] and Frieden [17] for more details. The Fisher-information matrix is used to calculate the covariance matrices associated with maximum-likelihood estimates, and it can also be used in the formulation of test statistics, such as the Wald test.

As the expected value under the APST distribution and the second-order derivatives are not direct, numerical methods must be used to obtain the explicit form of the information matrix I. Therefore, we use the observed information matrix to calculate the standard errors in the rest of the document.

When

ν

tends to infinite the ST distribution converges to the SN distribution and we recall that the information matrix of a random variable

X \sim SN (μ, σ, λ)

which is denoted by

I_{λ} (φ)

, where

φ = {(μ, σ, λ)}^{⊤}

, is singular for

λ = 0

. Therefore, it is convenient to use a centered parameterization of the ST distribution proposed by Arellano-Valle and Azzalini [18].

The centered parameterization of the SN distribution was proposed as an alternative to the problem of singularity of the information matrix of the SN when

λ = 0

. Arellano-Valle and Azzalini [19] proposed a second representation of the SN by defining a new random variable X as

X = μ + σ (\frac{Z - E [Z]}{\sqrt{Var [Z]}}),

where

μ \in R

and

σ > 0

are parameters of the random variable X and

Z \sim SN (λ)

. This representation is called centered parameterization, as

E [X] = μ

and

Var [X] = σ^{2}

and it is denoted by

CSN (μ, σ, γ_{1})

, where

- 0.9953 < γ_{1} < 0.9953

. Under the centered parameterization model,

μ

,

σ

, and

γ_{1} = \sqrt{β_{1}}

represent the mean, the standard deviation and the skewness index of X, respectively. If

Z \sim SN (λ)

then

E [Z] = b δ

and

Var [Z] = 1 - {(b δ)}^{2}

, where

b = \sqrt{2 / π}

and

δ = λ / \sqrt{1 + λ^{2}}

; it has that the random variable X can be written as

X = μ + σ Z

which has

SN (λ_{1}, λ_{2}, λ)

distribution, where

λ_{1} = μ - c σ γ_{1}^{1 / 3}, λ_{2} = σ \sqrt{1 + c^{2} γ_{1}^{2 / 3}}, λ = \frac{c γ_{1}^{1 / 3}}{\sqrt{b^{2} + c^{2} (b^{2} - 1) γ_{1}^{2 / 3}}}

(19)

with

c = {2 / (4 - π)}^{1 / 3}

. Under this denomination, the information matrix can be written as

I_{γ_{1}} = D^{⊤} I_{λ} D

, where

D

is a matrix that represents the derivative of the parameters of the standard representation (

λ_{1}

,

λ_{2}

and

λ

) regarding to the new parameters (

μ

,

σ

and

γ_{1}

). It also follows that the information matrix converges to a diagonal matrix

Σ_{c}^{- 1} = diag (σ^{2}, σ^{2} / 2, 6)

when

λ \to 0

. This guarantees the existence and uniqueness of the MLEs of

λ_{1}

and

λ_{2}

for each fixed value of

λ

.

Following this same line of thought, we suppose that Y follows the model (1) with location parameter

μ \in R

and scale parameter

σ > 0

, that is,

f_{S T} (y; μ, σ, λ, ν) = \frac{2}{σ} f_{T} (\frac{y - μ}{σ}; ν) F_{T} (λ \sqrt{\frac{ν + 1}{Q_{y} + ν}} (\frac{y - μ}{σ}); ν + 1), y \in R

(20)

where

λ \in R

and

Q_{y} = {((y - μ) / σ)}^{2}

. This representation relates to the direct parameterization of the ST distribution with parameter vector

ρ = {(μ, σ, λ, ν)}^{⊤}

. It follows that

Z_{T} = (Y - μ) / σ \sim ST (λ, ν)

, and by the stochastic representation of the ST distribution is given by

Z_{T} = Z / \sqrt{V}

, where

Z \sim SN (λ)

and

V \sim χ_{ν}^{2} / ν

. This entails to compute the first four cumulants of

Z_{T}

denoted by

μ_{1} (δ, ν)

,

μ_{2} (δ, ν)

,

μ_{3} (δ, ν)

and

μ_{4} (δ, ν)

, see [18]. The centered parameterization of the ST distribution of a random variable Y comes by defining

\begin{matrix} μ_{t} & = E [Y] = μ + σ μ_{1} (δ, ν) = μ + σ b_{ν} δ \\ σ_{t}^{2} & = Var [Y] = σ^{2} μ_{2} (δ, ν) = η^{2} \{\frac{ν}{ν - 2} - b_{ν}^{2} δ^{2}\}, \end{matrix}

\begin{matrix} γ_{1 t} & = \frac{μ_{3} (δ, ν)}{μ_{2} {(δ, ν)}^{3 / 2}} = \frac{b_{ν} δ}{μ_{2} {(δ, ν)}^{3 / 2}} \{\frac{ν (3 - δ^{2})}{ν - 3} - \frac{3 ν}{ν - 2} + 2 b_{ν}^{2} δ^{2}\} \\ γ_{2 t} & = \frac{μ_{4} (δ, ν)}{μ_{2} {(δ, ν)}^{2}} = \frac{1}{μ_{2} {(δ, ν)}^{2}} \{\frac{3 ν^{2}}{(ν - 2) (ν - 4)} - \frac{4 b_{ν}^{2} δ^{2} ν (3 - δ^{2}) + \frac{6 b_{ν}^{2} δ^{2} ν}{ν - 2} - 4 b_{ν}^{4} δ^{4}}{ν - 3}\} - 3 . \end{matrix}

The new representation is defined as the centered skew-t distribution with parameter vector

\tilde{ρ} = {(μ, σ^{2}, γ_{1}, γ_{2})}^{⊤}

. According to Arellano-Valle and Azzalini [18], the information matrix of this representation can be written as

I (\tilde{ρ}) = B^{⊤} I (ρ) B,

where

B

is a matrix representing the derivative of the parameter vector

ρ

with respect to the new vector

\tilde{ρ}

. It can shown that

b_{ν} \to b

when

ν \to \infty

, see [18]. Therefore, the parameters of the centered ST model converge to

μ_{t} \to μ

,

σ_{t}^{2} \to σ^{2}

, and

γ_{1 t} \to γ_{1}

when

ν \to \infty

, that is, the parameters of the CSN. As

Z_{T} \to SN (λ)

when

ν \to \infty

, it follows that the random variable Y converges to a distribution with information matrix

I (μ, σ^{2}, γ_{1}, α) = (\begin{matrix} I_{θ_{1} θ_{1}} & I_{θ_{1}, α} \\ I_{θ_{1}, α}^{⊤} & I_{α, α} \end{matrix}),

(21)

where the elements of the diagonal correspond to the information of the parameter vector

θ_{1} = (μ, σ^{2}, γ_{1})

and

α

, and

I_{θ_{1}, α}

is the joint information of

θ_{1} = {(μ, σ^{2}, γ_{1})}^{⊤}

and

α

. Now, when

λ \to 0

and

α = 1

, it can be shown that

I_{θ_{1} θ_{1}} \to diag (σ^{2}, σ^{2} / 2, 6)

, with determinant equal to

0.3333 / σ^{4}

, and

I_{θ_{1}, α} = {(0.9031 / σ, - 0.5956 / σ, 0.7206)}^{⊤}

; therefore, the determinant

| I (μ, σ^{2}, γ_{1}, α) | \neq 0

, and it concludes that the random variable Y converges to a distribution with information matrix non-singular when

ν

tends to infinite.

3.1. Extension to Censored Data

Based on the goodness of the APST distribution to fit asymmetric and heavy-tailed data, in this section we introduce the censored APST model which we will be denote by CAPST.

Definition 3.

Suppose that the random variable Y follows APST distribution, and consider a random sample

Y = (Y_{1}, Y_{2}, \dots, Y_{n})

where only the

Y_{i}

values greater than a constant k are recorded. In addition, for values

Y_{i} \leq k

only the value of k is recorded. Therefore, for

i = 1, 2, \dots, n

, the observed values

Y_{i}^{o}

can be written as

Y_{i}^{o} = \{\begin{matrix} k, & i f Y_{i} \leq k, \\ Y_{i}, & i f Y_{i} > k . \end{matrix}

The resulting sample is said to be a censored APST, and we say that Y is a censored random variable APST. We will use the notation

Y \sim CAPST (θ)

, where

θ = {(μ, σ, λ, α, ν)}^{⊤}

.

From Definition 3 it follows that

P (Y_{i}^{o} = k) = P (Y_{i} \leq k) = {\{F_{S T} ((k - μ) / σ)\}}^{α}

and for the observations

Y_{i}^{o} = Y_{i}

, the distribution of

Y_{i}^{o}

is the same of

Y_{i}

, i.e.,

Y_{i}^{o} \sim APST (θ)

. For convenience, we choose to work with the case of left-censored data; however, the followings results can be extended to other types of censorship.

3.2. Properties of the CAPST Model

Let

Y \sim CAPST (μ, σ, λ, α, ν)

,

If $α = 1$ , then $Y \sim CST (μ, σ, λ, ν)$ , where CST indicates the censored skew-t model.
If $λ = 0$ , then $Y \sim CPT (μ, σ, α, ν)$ , where CPT indicates the censored power-t model.
If $α = 1$ and $λ = 0$ , then $Y \sim CT (μ, σ, ν)$ , that is, the censored Student-t model follows.
If $ν \to + \infty$ , then $Y \sim CAPSN (μ, σ, λ, α)$ , where CAPSN indicates the censored alpha-power skew-normal model.
If $α = 1$ and $ν \to + \infty$ , then $Y \sim CSN (μ, σ, λ)$ , that is, the censored skew-normal model follows.
If $λ = 0$ and $ν \to + \infty$ , then $Y \sim CPN (μ, σ, α)$ , that is, the censored power-normal model follows.
If $α = 1$ , $λ = 0$ and $ν \to + \infty$ , then $Y \sim CN (μ, σ^{2})$ , that is, the censored normal model follows.

The estimates of the parameters of the model can be obtained via maximum likelihood method, where the log-likelihood function is given by

\begin{matrix} ℓ (θ; Y) & \propto α \sum_{0} log F_{S T} (\frac{k - μ}{σ}; λ, ν) + n_{1} log α - n_{1} log σ - \frac{n_{1}}{2} log ν \\ + n_{1} log Γ (\frac{ν + 1}{2}) - n_{1} log Γ (\frac{ν}{2}) - \frac{ν + 1}{2} \sum_{1} log (1 + \frac{x_{i}^{2}}{ν}) \\ + \sum_{1} log F_{T} (λ x_{i} \sqrt{\frac{ν + 1}{x_{i}^{2} + ν}}; ν + 1) + (α - 1) \sum_{1} log F_{S T} (x_{i}; λ, ν) \end{matrix}

where

x_{i} = (y_{i} - μ) / σ

;

\sum_{1}

and

\sum_{0}

are the sum over censored individuals and uncensored individuals, respectively; and

n_{1}

is the number of uncensored individuals.

4. Real Data Applications

In this section, we illustrate the applicability of the proposed model in Section 2 by analyzing two data sets. We use the statistical software R [15], version 3.5.3 with the package maxLike for maximizing the corresponding likelihood functions. For comparing purposes of various models, the AIC Akaike [20], BIC Schwarz [21], and corrected AIC (CAIC) Bozdogan [22] information criteria were used.

4.1. Application 1: Volcano Heights Data

Consider the data set related to heights of 1520 volcanoes in the world which is available in website dx.doi.org/10.5479/si.GVP.VOTW4-2013 [23]. Table 2 presents the summary statistics for the data set. It can be noted that the asymmetry and kurtosis indices seem to indicate that the use of an asymmetric and heavy-tailed model is appropriate to analyze this data set. We analyzed these data by fitting the Student-t, ST, PT, and APST distributions.

Table 2. Volcano heights data: Statistical summary.

Table 3 shows the parameter estimates, together with their corresponding standard errors (SE). Note that the values of the standard errors of the

μ

and

σ

estimates for the APST model are smaller than the corresponding standard errors of the respective parameters for the Student-t, ST, and PT models. Table 3 also presents some model selection criteria, together with the values of the log-likelihood. The AIC, BIC, and CAIC criteria indicate that the APST model seems to provide better fit to the volcanoes heights data than the T, ST, and PT models, supporting the asymmetry assertion of the volcano’s heights variable. Figure 2 shows the graphs QQplot of the fitted models. It can be clearly seen from the figure that the APST model fits the data better than the Student-t, ST, and PT models. In addition, we can use the likelihood ratio (LR) test statistic to conform our claim. To do this, we consider the following hypotheses,

H_{0} : (λ, α) = (0, 1) (T (μ, σ, ν)) v . s H_{1} : (λ, α) \neq (0, 1) (APST (μ, σ, λ, α, ν)),

Table 3. Parameter estimates (SE) for the fitted models to the volcano height data.

Figure 2. Volcano height data: QQplot for Student-t, ST, PT, and APST models.

The value of the LR test statistic is

- 2 log (Λ) = - 2 (ℓ_{T} (\hat{θ}) - ℓ_{A P S T} (\hat{θ})) =

134.823 and comparing this quantity with

χ_{2}^{2} =

5.9914, the null hypotheses is rejected. The APST model is also compared with the ST and PT models by considering the hypotheses

H_{01} : α = 1 (ST (μ, σ, λ, ν)) v . s H_{11} : α \neq 1 (APST (μ, σ, λ, α, ν)),

and

H_{02} : λ = 0 (PT (μ, σ, α, ν)) v . s H_{12} : λ \neq 0 (APST (μ, σ, λ, α, ν)),

respectively. The respective values of the LR test statistic are given by

- 2 log (Λ_{1}) = - 2 (ℓ_{S T} (\hat{θ}) - ℓ_{A P S T} (\hat{θ})) =

26.620 and

- 2 log (Λ_{2}) = - 2 (ℓ_{P T} (\hat{θ}) - ℓ_{A P S T} (\hat{θ})) =

45.660 and comparing these quantities with

χ_{1}^{2} =

3.8414, both null hypotheses are rejected. Finally, Figure 3left shows the histogram of the volcano heights variable, whereas Figure 3right presents the empirical CDF (solid line) together with the CDF of the fitted APST model (dotted line).

Figure 3. (Left) Graph of fitted densities to volcano height data. (Right) Empirical CDF and CDF of fitted APST model.

4.2. Application 2: Stellar Abundances Data

The second data set is related to measurements for 68 solar-type stars, which are available in the package astrodatR of the software R [24] under the name Stellar abundances. These data were previously analyzed Mattos et al. [25] by using the Scale Mixture of Skew Normal Censored Regression (SMSNCR) models. We take only the response variable: log N(Be), which represents the log of the abundance of beryllium scaled to Sun’s abundance (i.e., the Sun has

log N (B e) = 0.0

)

In astronomical research, a previously identified sample of objects (stars, galaxies, quasars, X-ray sources, etc.) is observed at some new wavebands. According to Feigelson [24], due to limited sensitivities, some objects may be undetected, leading to upper limits in their derived luminosities. For this dataset we have 12 left-censored data points, i.e., 12 undetected beryllium measurement, that represents 19.35% of observations. Table 4 presents the ML estimates for the parameters of the censored Studen-t (CT), censored skew-t (CST), censored power-t (CPT), and censored alpha-power skew-t (CAPST) models, together with their corresponding standard errors. Table 4 also compares the fit of the four models using the model selection criteria (AIC, CAIC and BIC). Note that, again, the CAPST model with heavy tails have better fit than the CT, CST, and CPT models.

Table 4. Parameter estimates (SE) for the fitted models to the stellar abundances data.

To identify atypical observations and/or model mispecification, we analyzed the transformation of the martingale residual,

r_{M T_{i}}

, proposed in Barros et al. [26]. These residuals are defined by

r_{M T i} = sign (r_{M i}) \sqrt{- 2 [r_{M i} + δ_{i} log (δ_{i} - r_{M_{i}})]}, i = 1, \dots, n

where

r_{M i} = δ_{i} + log S (y_{i}; \hat{θ})

is the martingal residual proposed by Ortega et al. [27], where

δ_{i} = 0, 1

indicates whether the i-th observation is censored or not, respectively;

sign (r_{M i})

denotes the sign of

r_{M i}

; and

S (y_{i}; \hat{θ}) = P_{\hat{θ}} (Y_{i} > y_{i})

represents the survival function evaluated at

y_{i}

, where

\hat{θ}

are the MLE for

θ

. The plots of

r_{M T_{i}}

with generated confidence envelopes are presented in Figure 4. From this figure, we can see clearly that the CST, CPT, and CAPST models fit better the data than the CT model, since, in that cases, there are not observations which lie outside the envelopes. The Figure 5 shows the graph of the densities of the different models fitted to the stellar abundances data. From the figure, the CAPST model seems to fit better the stellar abundances data than CT, CST and CPT models.

Figure 4. Stellar abundances data. Envelopes of transformed martingale residuals for CT, CST, CPT, and CAPST models.

Figure 5. Graph of fitted densities to stellar abundances data.

5. Conclusions

In this work, a new asymmetric model has been introduced. It is based on the combination of skew-t [1] and power-t [2] models. The new model presents greater ranges of asymmetry and kurtosis, which is very useful for modeling skewed and heavy-tailed data. The problem of estimating the parameters in the model is dealt by using the maximum likelihood approach which is also used for developing large sample properties for the estimators. The elements of the observed information matrix are analytically obtained. The likelihood ratio statistics can be used for testing the APST null hypothesis since the Student-t, ST, and PT models are special cases of the model entertained. Two applications to volcano heights data and stellar abundances data indicate that the proposed model can be a useful alternative to the ST and PT models.

Author Contributions

Individual contributions to this article: conceptualization, R.T.-F., H.B., and G.M.-F.; methodology, R.T.-F., H.B., and G.M.-F.; software, R.T.-F., H.B., and G.M.-F.; validation, R.T.-F., H.B., and G.M.-F.; formal analysis, R.T.-F., H.B., and G.M.-F.; investigation, R.T.-F., H.B., and G.M.-F.; resources, R.T.-F., H.B., and G.M.-F.; writing-original draft preparation, R.T.-F., H.B., and G.M.-F.; writing-review and editing, R.T.-F., H.B., and G.M.-F. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding. The research of R. Tovar-Falón was supported by the Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq/Brazil) under grant 140831/2014-2.

Acknowledgments

We thank the anonymous referees for helpful suggestions which improved the article.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

In this section, expressions for the elements of the observed information matrix of the alpha-power skew-t model are provided. Initially we suppose that

Y \sim APST (μ, σ, λ, α, ν)

, and for

i = 1, \dots, n

we define

z_{i} = (y_{i} - μ) / σ

,

w_{i} = \sqrt{(ν + 1) / (z_{i}^{2} + ν)}

,

r_{1} (z; ν) = f_{T} (z; ν) / F_{T} (z; ν)

,

r_{2} (z; λ, ν) = f_{S T} (z; λ, ν) / F_{S T} (z; λ, ν)

, and

r_{3} (z; λ, ν) = {(1 + (1 + λ^{2}) \frac{z^{2}}{ν})}^{- \frac{ν}{2}} / F_{S T} (z; λ, ν)

. Denoting the elements of the observed information matrix of the APST model by

j_{μ μ}, j_{μ σ}, \dots, j_{α α}

, and after some algebraic manipulations, we obtain

\begin{matrix} j_{μ μ} = & - \frac{1}{σ^{2}} \frac{ν + 1}{ν^{2}} \sum_{i = 1}^{n} z_{i}^{2} {(1 + \frac{z_{i}^{2}}{ν})}^{- 2} + \frac{1}{σ^{2}} \frac{ν + 1}{ν} \sum_{i = 1}^{n} {(1 + \frac{z_{i}^{2}}{ν})}^{- 2} \\ + \frac{λ}{σ^{2}} \sum_{i = 1}^{n} z_{i} w_{i} {(1 + \frac{z_{i}^{2}}{ν})}^{- 1} r_{1} (λ z_{i} w_{i}; ν + 1) \\ + \frac{2}{σ^{2}} \frac{λ}{ν} \sum_{i = 1}^{n} z_{i} w_{i} {(1 + \frac{z_{i}^{2}}{ν})}^{- 2} r_{1} (λ z_{i} w_{i}; ν + 1) \\ + \frac{λ^{3}}{σ^{2}} \frac{ν + 2}{ν} \sum_{i = 1}^{n} z_{i} w_{i} {(1 + \frac{z_{i}^{2}}{ν})}^{- 2} {(1 + (1 + λ^{2}) \frac{z_{i}^{2}}{ν})}^{- 1} r_{1} (λ z_{i} w_{i}; ν + 1) \\ + \frac{λ^{2}}{σ^{2}} \frac{ν + 1}{ν} \sum_{i = 1}^{n} {(1 + \frac{z_{i}^{2}}{ν})}^{- 3} {[r_{1} (λ z_{i} w_{i}; ν + 1)]}^{2} \end{matrix}

\begin{matrix} - \frac{λ}{σ^{2}} \frac{α - 1}{π} \sum_{i = 1}^{n} {(1 + \frac{z_{i}^{2}}{ν})}^{- 1} {(1 + (1 + λ^{2}) \frac{z_{i}^{2}}{ν})}^{- 1} r_{3} (z_{i}; λ, ν) \\ + \frac{α - 1}{σ^{2}} \frac{ν + 1}{ν} \sum_{i = 1}^{n} z_{i} {(1 + \frac{z_{i}^{2}}{ν})}^{- 1} r_{2} (z_{i}; λ, ν) + \frac{α - 1}{σ^{2}} \sum_{i = 1}^{n} {[r_{2} (z_{i}; λ, ν)]}^{2} \\ j_{μ σ} = & \frac{2}{σ^{2}} \frac{ν + 1}{ν} \sum_{i = 1}^{n} z_{i} {(1 + \frac{z_{i}^{2}}{ν})}^{- 2} \\ + \frac{λ^{3}}{σ^{2}} \frac{ν + 2}{ν} \sum_{i = 1}^{n} z_{i}^{2} w_{i} {(1 + \frac{z_{i}^{2}}{ν})}^{- 2} {(1 + (1 + λ^{2}) \frac{z_{i}^{2}}{ν})}^{- 1} r_{1} (λ z_{i} w_{i}; ν + 1) \\ + \frac{2}{σ^{2}} \frac{λ}{ν} \sum_{i = 1}^{n} z_{i}^{2} w_{i} {(1 + \frac{z_{i}^{2}}{ν})}^{- 2} r_{1} (λ z_{i} w_{i}; ν + 1) \\ - \frac{λ}{σ^{2}} \sum_{i = 1}^{n} w_{i} {(1 + \frac{z_{i}^{2}}{ν})}^{- 2} r_{1} (λ z_{i} w_{i}; ν + 1) \\ + \frac{λ^{2}}{σ^{2}} \frac{ν + 1}{ν} \sum_{i = 1}^{n} z_{i} {(1 + \frac{z_{i}^{2}}{ν})}^{- 3} {[r_{1} (λ z_{i} w_{i}; ν + 1)]}^{2} \\ - \frac{λ}{σ^{2}} \frac{α - 1}{π} \sum_{i = 1}^{n} z_{i} {(1 + \frac{z_{i}^{2}}{ν})}^{- 1} {(1 + (1 + λ^{2}) \frac{z_{i}^{2}}{ν})}^{- 1} r_{3} (z_{i}; λ, ν) \\ - \frac{α - 1}{σ^{2}} \sum_{i = 1}^{n} r_{2} (z_{i}; λ, ν) + \frac{α - 1}{σ^{2}} \frac{ν + 1}{ν} \sum_{i = 1}^{n} z_{i}^{2} {(1 + \frac{z_{i}^{2}}{ν})}^{- 1} r_{2} (z_{i}; λ, ν) \\ + \frac{α - 1}{σ^{2}} \sum_{i = 1}^{n} z_{i} {[r_{2} (z_{i}; λ, ν)]}^{2} \end{matrix}

\begin{matrix} j_{μ λ} = & - \frac{λ^{2}}{σ} \frac{ν + 2}{ν} \sum_{i = 1}^{n} z_{i}^{2} w_{i} {(1 + \frac{z_{i}^{2}}{ν})}^{- 1} {(1 + (1 + λ^{2}) \frac{z_{i}^{2}}{ν})}^{- 1} r_{1} (λ z_{i} w_{i}; ν + 1) \\ + \frac{1}{σ} \sum_{i = 1}^{n} w_{i} {(1 + \frac{z_{i}^{2}}{ν})}^{- 1} r_{1} (λ z_{i} w_{i}; ν + 1) \\ - \frac{λ}{σ} \frac{ν + 1}{ν} \sum_{i = 1}^{n} z_{i} {(1 + \frac{z_{i}^{2}}{ν})}^{- 2} {[r_{1} (λ z_{i} w_{i}; ν + 1)]}^{2} \\ + \frac{α - 1}{π σ} \sum_{i = 1}^{n} z_{i}^{2} {(1 + \frac{z_{i}^{2}}{ν})}^{- 1} r_{3} (z_{i}; λ, ν) \\ + \frac{α - 1}{π σ} \frac{1}{1 + λ^{2}} \sum_{i = 1}^{n} r_{2} (z_{i}; λ, ν) r_{3} (z_{i}; λ, ν) \\ j_{μ α} = & \frac{1}{σ} \sum_{i = 1}^{n} r_{2} (z_{i}; λ, ν) \\ j_{μ ν} = & - \frac{1}{σ ν} \sum_{i = 1}^{n} z_{i} {(1 + \frac{z_{i}^{2}}{ν})}^{- 1} + \frac{ν + 1}{σ ν^{2}} \sum_{i = 1}^{n} z_{i} {(1 + \frac{z_{i}^{2}}{ν})}^{- 2} \\ + \frac{λ}{σ ν^{2}} \sum_{i = 1}^{n} z_{i}^{2} w_{i} {(1 + \frac{z_{i}^{2}}{ν})}^{- 2} r_{1} (λ z_{i} w_{i}; ν + 1) \\ + \frac{λ}{2 σ} \frac{1}{ν (ν + 1)} \sum_{i = 1}^{n} (z_{i}^{2} - 1) w_{i} {(1 + \frac{z_{i}^{2}}{ν})}^{- 2} r_{1} (λ z_{i} w_{i}; ν + 1) \\ + \frac{λ^{3}}{2 σ} \frac{ν + 2}{ν^{2} (ν + 1)} \sum_{i = 1}^{n} z_{i}^{2} (z_{i}^{2} - 1) w_{i} {(1 + \frac{z_{i}^{2}}{ν})}^{- 2} {(1 + (1 + λ^{2}) \frac{z_{i}^{2}}{ν})}^{- 1} r_{1} (λ z_{i} w_{i}; ν + 1) \\ - \frac{λ^{2}}{2 σ} \frac{1}{ν^{2}} \sum_{i = 1}^{n} z_{i} (z_{i}^{2} - 1) {(1 + \frac{z_{i}^{2}}{ν})}^{- 3} {[r_{1} (λ z_{i} w_{i}; ν + 1)]}^{2} \\ + \frac{λ}{2 π σ} \frac{α - 1}{ν + 1} \sum_{i = 1}^{n} z_{i} {(1 + (1 + λ^{2}) \frac{z_{i}^{2}}{ν})}^{- 1} r_{3} (z_{i}; λ, ν) \\ + \frac{α - 1}{2 π σ (ν + 1)} \frac{λ}{1 + λ^{2}} \sum_{i = 1}^{n} r_{2} (z_{i}; λ, ν) r_{3} (z_{i}; λ, ν) + \frac{α - 1}{2 σ} \frac{ν + 1}{ν^{2}} \sum_{i = 1}^{n} z_{i}^{2} {(1 + \frac{z_{i}^{2}}{ν})}^{- 1} r_{2} (z_{i}; λ, ν) \\ - \frac{α - 1}{2 σ} \sum_{i = 1}^{n} log (1 + \frac{z_{i}^{2}}{ν}) r_{2} (z_{i}; λ, ν) - \frac{α - 1}{2 σ} \sum_{i = 1}^{n} \frac{g (z_{i}, ν)}{F_{S T} (z_{i}; λ, ν)} r_{2} (z_{i}; λ, ν) \\ + \frac{λ}{2 π σ} \frac{α - 1}{ν} \sum_{i = 1}^{n} z_{i} {(1 + \frac{z_{i}^{2}}{ν})}^{- 1} {(1 + (1 + λ^{2}) \frac{z_{i}^{2}}{ν})}^{- 1} r_{3} (z_{i}; λ, ν) \end{matrix}

\begin{matrix} j_{σ σ} = & - \frac{n}{σ^{2}} + \frac{1}{σ^{2}} \frac{ν + 1}{ν} \sum_{i = 1}^{n} z_{i}^{2} {(1 + \frac{z_{i}^{2}}{ν})}^{- 1} + \frac{2}{σ^{2}} \frac{ν + 1}{ν} \sum_{i = 1}^{n} z_{i}^{2} {(1 + \frac{z_{i}^{2}}{ν})}^{- 2} \\ - \frac{2 λ}{σ^{2}} \sum_{i = 1}^{n} z_{i} w_{i} {(1 + \frac{z_{i}^{2}}{ν})}^{- 2} r_{1} (λ z_{i} w_{i}; ν + 1) \\ - \frac{λ}{σ^{2} ν} \sum_{i = 1}^{n} z_{i}^{3} w_{i} {(1 + \frac{z_{i}^{2}}{ν})}^{- 2} r_{1} (λ z_{i} w_{i}; ν + 1) \\ + \frac{λ^{3}}{σ^{2}} \frac{ν + 2}{ν} \sum_{i = 1}^{n} z_{i}^{3} w_{i} {(1 + \frac{z_{i}^{2}}{ν})}^{- 2} {(1 + (1 + λ^{2}) \frac{z_{i}^{2}}{ν})}^{- 1} r_{1} (λ z_{i} w_{i}; ν + 1) \\ + \frac{λ^{2}}{σ^{2}} \frac{ν + 1}{ν} \sum_{i = 1}^{n} z_{i}^{2} {(1 + \frac{z_{i}^{2}}{ν})}^{- 3} r_{1} (λ z_{i} w_{i}; ν + 1) \\ - \frac{λ}{σ^{2}} \frac{α - 1}{π} \sum_{i = 1}^{n} z_{i}^{2} {(1 + \frac{z_{i}^{2}}{ν})}^{- 1} {(1 + (1 + λ^{2}) \frac{z_{i}^{2}}{ν})}^{- 1} r_{3} (z_{i}; λ, ν) \\ - \frac{2 (α - 1)}{σ^{2}} \sum_{i = 1}^{n} z_{i} r_{2} (z_{i}; λ, ν) - \frac{α - 1}{σ^{2}} \frac{ν + 1}{ν} \sum_{i = 1}^{n} z_{i}^{3} {(1 + \frac{z_{i}^{2}}{ν})}^{- 1} r_{2} (z_{i}; λ, ν) \\ + \frac{α - 1}{σ^{2}} \sum_{i = 1}^{n} z_{i}^{2} r_{2} (z_{i}; λ, ν) \end{matrix}

\begin{matrix} j_{σ λ} = & \frac{1}{σ} \sum_{i = 1}^{n} z_{i} w_{i} {(1 + \frac{z_{i}^{2}}{ν})}^{- 1} r_{1} (λ z_{i} w_{i}; ν + 1) \\ + \frac{λ^{2}}{σ} \frac{ν + 2}{ν} \sum_{i = 1}^{n} z_{i}^{3} w_{i} {(1 + \frac{z_{i}^{2}}{ν})}^{- 1} {(1 + (1 + λ^{2}) \frac{z_{i}^{2}}{ν})}^{- 1} r_{1} (λ z_{i} w_{i}; ν + 1) \\ + \frac{λ}{σ} \frac{ν + 1}{ν} \sum_{i = 1}^{n} z_{i} {(1 + \frac{z_{i}^{2}}{ν})}^{- 2} {[r_{1} (λ z_{i} w_{i}; ν + 1)]}^{2} \\ - \frac{α - 1}{π σ} \sum_{i = 1}^{n} z_{i}^{2} {(1 + \frac{z_{i}^{2}}{ν})}^{- 1} r_{3} (z_{i}; λ, ν) \\ + \frac{α - 1}{π σ} \frac{1}{1 + λ^{2}} \sum_{i = 1}^{n} r_{2} (z_{i}; λ, ν) r_{3} (z_{i}; λ, ν) \\ j_{σ α} = & \frac{1}{σ} \sum_{i = 1}^{n} z_{i} r_{2} (z_{i}; λ, ν) \end{matrix}

\begin{matrix} j_{σ ν} = & - \frac{1}{σ ν} \sum_{i = 1}^{n} z_{i}^{2} {(1 + \frac{z_{i}^{2}}{ν})}^{- 1} + \frac{1}{σ} \frac{ν + 1}{ν^{2}} \sum_{i = 1}^{n} z_{i}^{2} {(1 + \frac{z_{i}^{2}}{ν})}^{- 2} \\ + \frac{λ}{σ ν^{2}} \sum_{i = 1}^{n} z_{i}^{3} w_{i} {(1 + \frac{z_{i}^{2}}{ν})}^{- 2} r_{1} (λ z_{i} w_{i}; ν + 1) \\ + \frac{λ}{2 σ} \frac{1}{ν (ν + 1)} \sum_{i = 1}^{n} z_{i} (z_{i}^{2} - 1) w_{i} {(1 + \frac{z_{i}^{2}}{ν})}^{- 2} r_{1} (λ z_{i} w_{i}; ν + 1) \\ - \frac{λ^{3}}{2 σ} \frac{ν + 2}{ν^{2} (ν + 1)} \sum_{i = 1}^{n} z_{i}^{3} (z_{i}^{2} - 1) w_{i} {(1 + \frac{z_{i}^{2}}{ν})}^{- 2} {(1 + (1 + λ^{2}) \frac{z_{i}^{2}}{ν})}^{- 1} r_{1} (λ z_{i} w_{i}; ν + 1) \\ - \frac{λ^{2}}{2 σ ν^{2}} \sum_{i = 1}^{n} z_{i}^{2} (z_{i}^{2} - 1) {(1 + \frac{z_{i}^{2}}{ν})}^{- 3} {[r_{1} (λ z_{i} w_{i}; ν + 1)]}^{2} \\ + \frac{λ}{2 π σ} \frac{α - 1}{ν (ν + 1)} \sum_{i = 1}^{n} z_{i}^{2} (z_{i}^{2} - 1) {(1 + \frac{z_{i}^{2}}{ν})}^{- 1} {(1 + (1 + λ^{2}) \frac{z_{i}^{2}}{ν})}^{- 1} r_{3} (z_{i}; λ, ν) \\ + \frac{α - 1}{2 π σ (ν + 1)} \frac{λ}{1 + λ^{2}} \sum_{i = 1}^{n} z_{i} r_{2} (z_{i}; λ, ν) r_{3} (z_{i}; λ, ν) \\ + \frac{α - 1}{2 σ} \frac{ν + 1}{ν^{2}} \sum_{i = 1}^{n} z_{i}^{3} {(1 + \frac{z_{i}^{2}}{ν})}^{- 1} r_{1} (λ z_{i} w_{i}; ν + 1) \\ - \frac{α - 1}{2 σ} \sum_{i = 1}^{n} z_{i} log (1 + \frac{z_{i}^{2}}{ν}) r_{1} (λ z_{i} w_{i}; ν + 1) \\ - \frac{α - 1}{2 σ} \sum_{i = 1}^{n} z_{i} \frac{g (z_{i}, ν)}{F_{S T} (z_{i}, λ, ν)} r_{2} (z_{i}; λ, ν) \\ j_{λ λ} = & \frac{λ (ν + 2)}{ν} \sum_{i = 1}^{n} z_{i}^{3} w_{i} {(1 + (1 + λ^{2}) \frac{z_{i}^{2}}{ν})}^{- 1} r_{1} (λ z_{i} w_{i}; ν + 1) \\ + \frac{ν + 1}{ν} \sum_{i = 1}^{n} z_{i} {(1 + \frac{z_{i}^{2}}{ν})}^{- 1} {[r_{1} (λ z_{i} w_{i}; ν + 1)]}^{2} \\ - \frac{2 (α - 1)}{π} \frac{λ}{{(1 + λ^{2})}^{2}} \sum_{i = 1}^{n} {(1 + (1 + λ^{2}) \frac{z_{i}^{2}}{ν})}^{- 1} r_{3} (z_{i}; λ, ν) \\ - \frac{α - 1}{π} \frac{λ}{1 + λ^{2}} \frac{ν + 2}{ν} \sum_{i = 1}^{n} z_{i}^{2} {(1 + (1 + λ^{2}) \frac{z_{i}^{2}}{ν})}^{- 1} r_{3} (z_{i}; λ, ν) \\ + \frac{α - 1}{π^{2} {(1 + λ^{2})}^{2}} \sum_{i = 1}^{n} {[r_{3} (z_{i}; λ, ν)]}^{2} \end{matrix}

\begin{matrix} j_{λ α} = & \frac{1}{π (1 + λ^{2})} \sum_{i = 1}^{n} r_{3} (z_{i}; λ, ν) \\ j_{λ ν} = & - \frac{1}{2 ν (ν + 1)} \sum_{i = 1}^{n} z_{i} (z_{i}^{2} - 1) w_{i} {(1 + \frac{z_{i}^{2}}{ν})}^{- 1} r_{1} (λ z_{i} w_{i}; ν + 1) \\ + \frac{λ^{2}}{2 ν^{2}} \frac{ν + 2}{{(ν + 1)}^{2}} \sum_{i = 1}^{n} z_{i}^{3} (z_{i}^{2} - 1) w_{i} {(1 + \frac{z_{i}^{2}}{ν})}^{- 1} {(1 + (1 + λ^{2}) \frac{z_{i}^{2}}{ν})}^{- 1} r_{1} (λ z_{i} w_{i}; ν + 1) \\ + \frac{λ}{2 ν^{2}} \sum_{i = 1}^{n} z_{i}^{2} (z_{i}^{2} - 1) {(1 + \frac{z_{i}^{2}}{ν})}^{- 2} {[r_{1} (λ z_{i} w_{i}; ν + 1)]}^{2} \\ + \frac{α - 1}{2 π (ν + 1)} \frac{1 - λ^{2}}{{(1 + λ^{2})}^{2}} \sum_{i = 1}^{n} r_{3} (z_{i}; λ, ν) + \frac{α - 1}{2 π (ν + 1)} \frac{λ}{{(1 + λ^{2})}^{2}} \sum_{i = 1}^{n} [r_{3} (z_{i}; λ, ν)]^{2} \\ - \frac{α - 1}{2 π (ν + 1)} \frac{λ^{2}}{1 + λ^{2}} \sum_{i = 1}^{n} z_{i}^{2} {(1 + (1 + λ^{2}) \frac{z_{i}^{2}}{ν})}^{- 1} r_{3} (z_{i}; λ, ν) \\ - \frac{α - 1}{2 π (1 + λ^{2})} \sum_{i = 1}^{n} \frac{g (z_{i}, ν)}{F_{S T} (z_{i}, λ, ν)} r_{3} (z_{i}; λ, ν) \\ - \frac{α - 1}{2 π} \sum_{i = 1}^{n} \frac{g_{1} (z_{i}, ν)}{F_{S T} (z_{i}, λ, ν)} \\ j_{α α} = & \frac{n}{α^{2}} \\ j_{α ν} = & - \frac{n}{2} ψ (\frac{ν + 1}{2}) + \frac{n}{2} ψ (\frac{ν}{2}) + \frac{n}{2 ν} \\ + \frac{1}{2 π (ν + 1)} \frac{λ}{1 + λ^{2}} \sum_{i = 1}^{n} r_{3} (z_{i}; λ, ν) - \frac{1}{2} \sum_{i = 1} \frac{g (z_{i}, ν)}{F_{S T} (z_{i}, λ, ν)} \end{matrix}

\begin{matrix} j_{ν ν} = & - \frac{n α}{2 ν^{2}} - \frac{n α}{4} ψ_{1} (\frac{ν + 1}{2}) + \frac{n α}{4} ψ_{1} (\frac{ν}{2}) - \frac{ν - 1}{2 ν^{3}} \sum_{i = 1}^{n} z_{i}^{2} {(1 + \frac{z_{i}^{2}}{ν})}^{- 1} \\ + \frac{ν + 1}{2 ν^{3}} \sum_{i = 1}^{n} z_{i}^{2} {(1 + \frac{z_{i}^{2}}{ν})}^{- 2} \\ + \frac{λ}{4 {(ν + 1)}^{2}} \frac{1}{ν^{2}} \sum_{i = 1}^{n} z_{i} (z_{i}^{2} - 1) (z_{i}^{2} + 4 ν + 3) w_{i} {(1 + \frac{z_{i}^{2}}{ν})}^{- 2} r_{1} (λ z_{i} w_{i}; ν + 1) \\ - \frac{λ}{4 ν (ν + 1)} (ψ (\frac{ν + 2}{2}) - ψ (\frac{ν + 1}{2}) - \frac{1}{ν + 1}) \\ \sum_{i = 1}^{n} z_{i} (z_{i}^{2} - 1) w_{i} {(1 + \frac{z_{i}^{2}}{ν})}^{- 1} r_{1} (λ z_{i} w_{i}; ν + 1) \\ - \frac{λ^{3}}{4 (ν + 1)} \frac{ν + 2}{ν^{3}} \sum_{i = 1}^{n} z_{i}^{3} (z_{i}^{2} - 1) {(1 + \frac{z_{i}^{2}}{ν})}^{- 2} {(1 + (1 + λ^{2}) \frac{z_{i}^{2}}{ν})}^{- 1} r_{1} (λ z_{i} w_{i}; ν + 1) \\ + \frac{λ}{4 ν (ν + 1)} \sum_{i = 1}^{n} z_{i} (z_{i}^{2} - 1) log (1 + \frac{λ^{2} z_{i}^{2}}{ν + z_{i}^{2}}) r_{1} (λ z_{i} w_{i}; ν + 1) \\ + \frac{λ^{2}}{4 ν^{3} (ν + 1)} \sum_{i = 1}^{n} z_{i}^{2} {(z_{i}^{2} - 1)}^{2} {(1 + \frac{z_{i}^{2}}{ν})}^{- 3} [r_{1} (λ z_{i} w_{i}; ν + 1)]^{2} \\ - \frac{α - 1}{2 π {(ν + 1)}^{2}} \frac{λ}{1 + λ^{2}} \sum_{i = 1}^{n} r_{3} (z_{i}; λ, ν) \\ + \frac{α - 1}{4 π (ν + 1)} \frac{λ}{ν} \sum_{i = 1}^{n} z_{i}^{2} {(1 + (1 + λ^{2}) \frac{z_{i}^{2}}{ν})}^{- 1} r_{3} (z_{i}; λ, ν) \\ - \frac{α - 1}{4 π (ν + 1)} \frac{λ}{1 + λ^{2}} \sum_{i = 1}^{n} log (1 + (1 + λ^{2}) \frac{z_{i}^{2}}{ν}) r_{3} (z_{i}; λ, ν) \\ + \frac{α - 1}{4 π (ν + 1)} \frac{λ}{1 + λ^{2}} (ψ (\frac{ν + 1}{2}) - ψ (\frac{ν}{2}) - \frac{1}{ν}) \sum_{i = 1}^{n} r_{3} (z_{i}; λ, ν) \\ - \frac{α - 1}{2 π (ν + 1)} \frac{λ}{1 + λ^{2}} \sum_{i = 1}^{n} \frac{g (z_{i}, ν)}{F_{S T} (z_{i}, λ, ν)} r_{3} (z_{i}; λ, ν) \\ + \frac{α - 1}{4 π^{2} {(ν + 1)}^{2}} \frac{λ^{2}}{{(1 + λ^{2})}^{2}} \sum_{i = 1}^{n} {[r_{3} (z_{i}; λ, ν)]}^{2} \\ + \frac{α - 1}{4} \sum_{i = 1}^{n} {(\frac{g (z_{i}, ν)}{F_{S T} (z_{i}, λ, ν)})}^{2} - \frac{α - 1}{2} \sum_{i = 1}^{n} \frac{g_{2} (z_{i}, ν)}{F_{S T} (z_{i}, λ, ν)} \end{matrix}

where

g (z; ν)

is given in Equation (16), and

g_{1} (z; ν)

and

g_{2} (z; ν)

are given in Equations (A1) and (A3), respectively.

\begin{matrix} g_{1} (x; ν) = & \int_{- \infty}^{x} \{\frac{(ν + 1) s^{2}}{ν (s^{2} + ν)} - log (1 + \frac{s^{2}}{ν})\} {\{1 + \frac{(1 + λ^{2}) s^{2}}{ν}\}}^{- \frac{ν + 2}{2}} s d s \\ - \int_{- \infty}^{x} \frac{s}{s^{2} + ν} {\{1 + \frac{(1 + λ^{2}) s^{2}}{ν}\}}^{- \frac{ν + 2}{2}} d s \\ + \int_{- \infty}^{x} \frac{λ^{2} (ν + 2) s^{3}}{(s^{2} + ν) (ν + (1 + λ^{2}) s^{2})} {\{1 + \frac{(1 + λ^{2}) s^{2}}{ν}\}}^{- \frac{ν + 2}{2}} d s \end{matrix}

(A1)

\begin{matrix} g_{2} (x; ν) = & \int_{- \infty}^{x} \{\frac{s^{2} (s^{2} ν - 2 ν - s^{2})}{ν^{2} {(s^{2} + ν)}^{2}} + \frac{1}{2} {[\frac{(ν + 1) s^{2}}{ν (s^{2} + ν)} - log (1 + \frac{s^{2}}{ν})]}^{2}\} f_{S T} (s; λ, ν) d s \\ + \frac{λ}{2 π (ν + 1)} \int_{- \infty}^{x} \frac{s (s^{2} - 1)}{(s^{2} + ν)} \{\frac{(ν + 1) s^{2}}{ν (s^{2} + ν)} - log (1 + \frac{s^{2}}{ν})\} \\ \times {\{1 + \frac{(1 + λ^{2}) s^{2}}{ν}\}}^{- \frac{ν + 2}{2}} d s \\ + \frac{λ}{π} \int_{- \infty}^{x} \frac{s}{{(s + ν)}^{2}} {\{1 + \frac{(1 + λ^{2}) s^{2}}{ν}\}}^{- \frac{ν + 2}{2}} d s \end{matrix}

(A2)

\begin{matrix} + \frac{λ}{2 π} (ψ (\frac{ν + 1}{2}) - ψ (\frac{ν}{2}) - \frac{1}{ν}) \int_{- \infty}^{x} \frac{s}{s^{2} + ν} {\{1 + \frac{(1 + λ^{2}) s^{2}}{ν}\}}^{- \frac{ν + 2}{2}} d s \\ - \frac{λ}{2 π} \int_{- \infty}^{x} \frac{s}{s^{2} + ν} \{\frac{(ν + 2) (1 + λ^{2}) s^{2}}{ν (ν + (1 + λ^{2}) s^{2})} - log (1 + \frac{(1 + λ^{2}) s^{2}}{ν})\} \\ \times {\{1 + \frac{(1 + λ^{2}) s^{2}}{ν}\}}^{- \frac{ν + 2}{2}} d s \end{matrix}

(A3)

References

Azzalini, A. A class of distributions which includes the normal ones. Scand. J. Stat. 1985, 12, 171–178. [Google Scholar]
Durrans, S.R. Distributions of fractional order statistics in hydrology. Water Resour. Res. 1992, 28, 1649–1655. [Google Scholar] [CrossRef]
Martínez-Flórez, G.; Bolfarine, H.; Gómez, H.W. The alpha–power tobit model. Commun. Stat. Theory Methods 2013, 42, 633–643. [Google Scholar] [CrossRef]
Birnbaum, Z.W.; Saunders, S.C. A new family of life distributions. J. Appl. Probab. 1969, 6, 319–327. [Google Scholar] [CrossRef]
Martínez-Flórez, G.; Bolfarine, H.; Gómez, H.W. An alpha-power extension for the Birnbaum-Saunders distribution. Statistics 2014, 48, 896–912. [Google Scholar] [CrossRef]
Gupta, R.D.; Gupta, R.C. Analyzing skewed data by power-normal model. Test 2008, 17, 197–210. [Google Scholar] [CrossRef]
Pewsey, A.; Gómez, H.W.; Bolfarine, H. Likelihood–based inference for power distributions. Test 2012, 21, 775–789. [Google Scholar] [CrossRef]
Martínez-Flórez, G.; Bolfarine, H.; Gómez, H.W. Skew-normal alpha-power model. Stat. J. Theor. Appl. Stat. 2014, 48, 1414–1428. [Google Scholar] [CrossRef]
Azzalini, A.; Capitanio, A. Distributions generated by perturbation of symmetry with emphasis on a multivariate skew-t distribution. J. R. Stat. Soc. Ser. B (Stat. Methodol.) 2003, 65, 367–389. [Google Scholar] [CrossRef]
Branco, M.D.; Dey, D.K. General class of multivariate skew-elliptical distributions. J. Multivar. Anal. 2001, 79, 99–113. [Google Scholar] [CrossRef]
Durrans, S.R. Multivariate skew t-distribution. Stat. J. Theor. Appl. Stat. 2003, 37, 359–363. [Google Scholar]
Sahu, S.K.; Dey, D.K.; Branco, M.D. A new class of multivariate skew distributions with applications to Bayesian regression models. Can. J. Stat. 2003, 31, 129–150. [Google Scholar] [CrossRef]
Jones, M.C.; Faddy, M.J. A skew extension of the t-distribution, with Applications. J. R. Stat. Soc. Ser. B (Stat. Methodol.) 2003, 65, 159–174. [Google Scholar] [CrossRef]
Zhao, J.; Kim, H.M. Power t distributions. Commun. Stat. Appl. Methods 2016, 23, 321–334. [Google Scholar]
R Development Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2018; Available online: http://www.R-project.org (accessed on 10 October 2019).
Lehman, E.L.; Casella, G. Theory of Point Estimation, 2nd ed.; Springer: New York, NY, USA, 1998. [Google Scholar]
Frieden, B.R. Science from Fisher Information: A Unification; Cambridge Univerisity Press: Cambridge, UK, 2004. [Google Scholar]
Arellano-Valle, R.B.; Azzalini, A. The centered parameterization and related quantities of the skew–t distribution. J. Multivar. Anal. 2013, 113, 73–90. [Google Scholar] [CrossRef]
Arellano-Valle, R.B.; Azzalini, A. The centered parametrization for the multivariate skew-normal distribution. J. Multivar. Anal. 2008, 99, 1362–1382. [Google Scholar] [CrossRef]
Akaike, H. A new look at statistical model identification. IEEE Trans. Autom. Contr. 1974, 19, 716–722. [Google Scholar] [CrossRef]
Schwarz, G. Estimating the dimension of a model. Ann. Stat. 1978, 6, 461–464. [Google Scholar] [CrossRef]
Bozdogan, H. Model selection and akaike’s information criterion (AIC): The general theory and its analytical extensions. Psychometrika 2010, 52, 345–370. [Google Scholar] [CrossRef]
Siebert, L.; Simkin, T.; Kimberly, P. Global Volcanism Program. In Volcanoes of the World; v. 4.6.0.; Venzke, E., Ed.; Smithsonian Institution: Washington, DC, USA, 2013; Available online: https://doi.org/10.5479/si.GVP.VOTW4-2013 (accessed on 10 October 2019).
Feigelson, E.D. astrodatR: Astronomical Data. R Package v. 0.1. Available online: http://CRAN.R-project.org/package=astrodatR (accessed on 10 October 2019).
Mattos, T.; Garay, A.M.; Lachos, V.H. Likelihood-based inference for censored linear regression models with scale mixtures of skew-normal distributions. J. Appl. Stat. 2018, 45, 2019–2066. [Google Scholar] [CrossRef]
Barros, M.; Galea, M.; González, M.; Leiva, V. Influence diagnostics in the tobit censored response model. Stat. Methods Appl. 2010, 19, 379–397. [Google Scholar] [CrossRef]
Ortega, E.M.; Bolfarine, H.; Paula, G.A. Influence diagnostics in generalized log-gamma regression models. Comput. Stat. Data Anal. 2003, 42, 165–186. [Google Scholar] [CrossRef]

Figure 1. Probability density function of

APST (λ, α, 10)

for some values of

λ

and

α

.

Figure 2. Volcano height data: QQplot for Student-t, ST, PT, and APST models.

Figure 3. (Left) Graph of fitted densities to volcano height data. (Right) Empirical CDF and CDF of fitted APST model.

Figure 4. Stellar abundances data. Envelopes of transformed martingale residuals for CT, CST, CPT, and CAPST models.

Figure 5. Graph of fitted densities to stellar abundances data.

Table 1. Skewness and kurtosis for the models

ST (λ, ν)

,

PT (α, ν)

, and

APST (λ, α, ν)

, for

λ \in (- 40, 40)

,

α \in (0.5, 50)

and

ν = 2, \dots 7

.

Table 1. Skewness and kurtosis for the models

ST (λ, ν)

,

PT (α, ν)

, and

APST (λ, α, ν)

, for

λ \in (- 40, 40)

,

α \in (0.5, 50)

and

ν = 2, \dots 7

.

	Skew $— t$		Power $— t$		Alpha—Power Skew $— t$
$ν$	Skewness	Kurtosis	Skewness	Kurtosis	Skewness	Kurtosis
2	$(- 0.963, 0.963)$	$(3.170, 3.489)$	$(- 0.119, 3.040)$	$(1.552, 10.436)$	$(- 2.452, 14.314)$	$(1.395, 864.385)$
3	$(- 0.950, 0.950)$	$(3.146, 3.357)$	$(- 0.086, 1.362)$	$(1.325, 3.223)$	$(- 2.130, 4.902)$	$(1.628, 114.098)$
4	$(- 1.853, 1.853)$	$(5.099, 7.824)$	$(- 0.530, 1.178)$	$(3.461, 5.299)$	$(- 1.898, 3.215)$	$(3.153, 29.874)$
5	$(- 0.947, 0.947)$	$(3.051, 3.327)$	$(- 0.475, 0.271)$	$(1.176, 3.130)$	$(- 1.968, 3.046)$	$(3.862, 19.925)$
6	$(- 1.681, 1.681)$	$(4.554, 7.279)$	$(- 0.533, 1.118)$	$(3.974, 5.173)$	$(- 1.681, 2.145)$	$(3.892, 11.893)$
7	$(- 0.944, 0.944)$	$(3.007, 3.367)$	$(- 0.710, 0.243)$	$(1.264, 3.082)$	$(- 1.535, 2.536)$	$(3.136, 15.924)$

Table 2. Volcano heights data: Statistical summary.

n	Mean	Variance	$\sqrt{b_{1}}$	$b_{2}$
1520	16.7760	15.6682	0.6461	4.3809

Table 3. Parameter estimates (SE) for the fitted models to the volcano height data.

	Distribution
Estimates	Student-t	ST	PT	APST
$\hat{μ}$	14.7835(0.3615 )	4.7469(0.6892)	8.4027(0.7923)	11.5509(0.1337)
$\hat{σ}$	11.0045(0.3975)	14.1532(0.7237)	11.8146(0.4707)	22.6885(0.0792)
$\hat{λ}$	–	1.5673(0.1838)	–	5.2347(0.2870)
$\hat{α}$	–	–	1.7912(0.1147)	0.3205(0.0347)
$\hat{ν}$	3.4156(0.3601)	3.4075(0.3454)	2.7473(0.2566)	12.8734(2.9729)
$\hat{ℓ}$	−6273.35	−6219.25	−6228.77	−6205.94
AIC	12,552.70	12,446.49	12,465.53	12,421.87
BIC	12,568.68	12,467.79	12,486.53	12,448.50
CAIC	12,571.68	12,471.79	12,490.83	12,453.50

Table 4. Parameter estimates (SE) for the fitted models to the stellar abundances data.

	Distribution
Estimates	CT	CST	CPT	CAPST
$\hat{μ}$	1.0314(0.0010)	1.2306(0.0018)	1.2098(0.0052)	1.1761(0.0054)
$\hat{σ}$	0.1596(0.0012)	0.2712(0.0058)	0.0818(0.0008)	0.0905(0.0020)
$\hat{λ}$	–	−3.5655(3.7748)	–	0.6580(0.5031)
$\hat{α}$	–	–	0.1705(0.0208)	0.1518(0.0251)
$\hat{ν}$	0.9974(0.0884)	1.2501(0.1774)	6.0927(0.7501)	6.0999(0.7326)
$\hat{ℓ}$	−29.50743	−18.87016	−17.67113	−14.80241
AIC	65.01487	45.74033	43.34227	39.60482
BIC	71.67339	54.61836	52.22030	50.70236
CAIC	59.38987	38.37525	35.97719	30.57256

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

The Asymmetric Alpha-Power Skew-t Distribution

Abstract

1. Introduction

2. The Alpha-Power Skew-t Distribution

2.1. Moments

2.2. Distribution Function

2.3. Location and Scale Extension

3. Statistical Inference for APST Distribution

3.1. Extension to Censored Data

3.2. Properties of the CAPST Model

4. Real Data Applications

4.1. Application 1: Volcano Heights Data

4.2. Application 2: Stellar Abundances Data

5. Conclusions

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

Appendix A

References

Article Metrics

Citations

Article Access Statistics