A Note on Ordering Probability Distributions by Skewness

V. J. García; M. Martel-Escobar; F. J. Vázquez-Polo

doi:10.3390/sym10070286

¹ Department of Statistics and Operations Research, University of Cádiz, 11002 Cádiz, Spain

² Department of Quantitative Methods & TiDES Institute, University of Las Palmas de Gran Canaria, 35017 Las Palmas de Gran Canaria, Las Palmas, Spain

* Author to whom correspondence should be addressed.

† These authors contributed equally to this work.

Symmetry 2018, 10(7), 286; https://doi.org/10.3390/sym10070286

This article belongs to the Special Issue Symmetric and Asymmetric Distributions: Theoretical Developments and Applications


Abstract

This paper describes a complementary tool for fitting probabilistic distributions in data analysis. First, we examine the well known bivariate index of skewness and the aggregate skewness function, and then introduce orderings of the skewness of probability distributions. Using an example, we highlight the advantages of this approach and then present results for these orderings in common uniparametric families of continuous distributions, showing that the orderings are well suited to the intuitive conception of skewness and, moreover, that the skewness can be controlled via the parameter values.

Keywords:

positive and negative skewness; ordering; fitting distributions

MSC:

62E10; 62P99

1. Introduction

Detailed knowledge of the characteristics of probability models is desirable (if not essential) if data are to be modeled properly. In studying these properties, many authors have considered orderings within probability distribution families, according to diverse measuring criteria. The usual approach taken by researchers in this field is to evaluate or measure one or more theoretical characteristics of a given distribution and to study the effect produced by the value of its parameters on this measurement. In actuarial science, stochastic orders are widely used in order to make risk comparisons [1].

Some parametric distributions can be ordered according to the evaluation made of a given property, merely by comparing some of its parameters. Although most related orders are actually preorders, each one presents interesting applications. Many studies have been conducted in this area, and the following are particularly significant: Lehmann (1955) [2], which is of seminal importance; Arnold (1987) [3], who compared random variables according to stochastic ordering in a particular Lorenz order; Shaked and Shanthikumar (2006) [1], on stochastic orders; Nanda and Shaked (2001) [4], on reversed hazard rate orders; Ramos-Romero and Sordo-Díaz (2001) [5], on the likelihood ratio order; and Gupta and Aziz (2010) [6], on convex orders.

In this paper, we study the relationship between the skewness of some parametric distributions and the value of one of their parameters. The first question to be addressed is that of measuring the skewness. In this respect, Oja (1981) [7] introduced a set of axioms to be verified by any measurement of skewness considered. These axioms were established for indexes of skewness with one main constraint: that the skewness of a distribution should be evaluated by a single real number. This point is discussed below.

Many authors have proposed and obtained different descriptive elements to measure skewness (see, for instance, [8,9,10,11,12,13]). Ref [10] suggested a measurement of skewness corresponding to the (unique) mode, M, given by the following index:

γ_{M} (F) = 1 - 2 F (M) .

(1)

Ref. [10] applied this index to ordering the gamma, log-logistic, lognormal and Weibull families of distributions by their skewness, taking into account the feasible values of their respective parameters. Index (1), which is proven to satisfy the axioms derived from Oja (1981) [7], is also recommended in [14] as a (very) good index of skewness. However, notice that (1) only compares the probability weight on the left side of a central point (the mode) with the value $1/2$; it does not account for how the weights are distributed on each side of the centre.

García et al. (2015) [15] introduced some further elements to be incorporated into the list of skewness measurements of a probability distribution. According to these authors, given a unimodal probability distribution

F (x)

, its skewness is considered to be a local function of a given distance, z, from the mode, M. For such a distance, and given the interval

[M - z, M + z]

, the aggregate skewness function,

ν_{F} (z)

, compares the probability weight of F at either side of the interval:

ν_{F} (z) = \Pr (X > M + z) - \Pr (X < M - z),

(2)

where

z \geq 0

. Thus, the (maximum) right skewness of the distribution F and its (minimum) left skewness are respectively given by

S^{+} (F) = max_{z \geq 0} ν_{F} (z), S^{-} (F) = min_{z \geq 0} ν_{F} (z) .

(3)

The distances,

z_{p}

and

z_{n}

, where these extreme values are achieved, are termed the critical distances to the mode. As the skewness function is bounded inside the interval

[- 1, 1]

and

ν_{F} (\infty) = 0

, the bivariate index

(S^{-} (F), S^{+} (F))

belongs to

[- 1, 0] \times [0, 1]

. A given distribution function F such that

ν_{F} (z) \geq 0

for all

z \geq 0

is said to be only skewed to the right; and if

ν_{F} (z) \leq 0

for all

z \geq 0

, it is said to be only skewed to the left.

The relationship $F <_{c} G$ (F c-precedes G) means that $G^{-1}[F(x)]$ is a convex function. For a continuous distribution F, the bivariate measurement of skewness $(S^{-}(F), S^{+}(F))$ verifies the following properties, where $aF + b$ and $-F$ denote the distributions of the corresponding transformations of an F-distributed random variable:

$(S^{-}(F), S^{+}(F)) = (0, 0)$ for any symmetric distribution F.
$(S^{-}(aF + b), S^{+}(aF + b)) = (S^{-}(F), S^{+}(F))$, for all $a > 0$, $-\infty < b < \infty$.
$(S^{-}(-F), S^{+}(-F)) = (-S^{+}(F), -S^{-}(F))$.
If $F <_{c} G$, then $(S^{-}(F), S^{+}(F)) \leq (S^{-}(G), S^{+}(G))$, understood as vector dominance.

These properties can be considered as a vectorial interpretation of the axioms given by Oja (1981) [7].

As it is easily proven that

ν_{F} (0) = γ_{M} (F)

, we can establish that (2) and (3) give considerably clearer and more complete information than (1) about the skewness of any distribution function.

Most families of continuous distributions are skewed only to the right (or only to the left), while double-sign skewness is abundant within the discrete families, as shown in [15]. Nevertheless, the joint use of the function (2) and the bivariate index (3) makes it possible to improve the skewness-based ordering of distributions discussed in [10], as can be seen in the following example.

Example 1.

Assume the following random variable

X \in [- 2, \infty)

with PDF given by:

f (x) = \{\begin{matrix} \frac{1}{4} x + \frac{1}{2}, & - 2 \leq x < 0; \\ \frac{1}{2} \exp (- x), & x \geq 0 . \end{matrix}

Assume also the PDF,

g (y)

of

Y = - X

. Then,

γ_{M} (F) = 0 = γ_{M} (G) .

That is, according to coefficient

γ_{M} (\cdot)

, both distributions have the same null skewness, although they do not even have a symmetric support set.

However, using expression (2), we find that

ν_{F} (z) = \{\begin{matrix} \frac{1}{2} \exp (- z) - \frac{1}{8} {(z - 2)}^{2}, & 0 \leq z < 2; \\ \frac{1}{2} \exp (- z), & z \geq 2, \end{matrix}

and

ν_{G} (z) = - ν_{F} (z),

for all

z \geq 0

. These functions are plotted in Figure 1, where it can be seen that

ν_{F} (z) \geq ν_{G} (z)

for all

z \geq 0

,

S^{+} (F) = - S^{-} (G) \neq 0

and

S^{-} (F) = 0 = S^{+} (G)

. Clearly, the information about skewness obtained from the aggregate skewness function

ν (z)

and the indices

S^{+} (\cdot)

and

S^{-} (\cdot)

is considerably more comprehensive than that obtained from

γ_{M} (\cdot)

.

Figure 1. Skewness functions $ν_{F}(z)$ and $ν_{G}(z)$ in Example 1.
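The quantities in Example 1 can be checked numerically. The following is a minimal sketch, assuming the closed form of $ν_{F}(z)$ given in the example; the function names and the grid (range and step) are illustrative choices, not part of the original paper, and a grid search only approximates $S^{+}(F)$ and the critical distance $z_{p}$.

```python
import math

def nu_F(z):
    """Aggregate skewness function of F in Example 1 (mode M = 0)."""
    right_tail = 0.5 * math.exp(-z)                        # Pr(X > z)
    left_tail = (z - 2.0) ** 2 / 8.0 if z < 2.0 else 0.0   # Pr(X < -z)
    return right_tail - left_tail

def nu_G(z):
    """Y = -X mirrors the distribution about the mode, so the sign flips."""
    return -nu_F(z)

# Hypothetical evaluation grid on [0, 10] with step 0.001
grid = [i / 1000.0 for i in range(10001)]
values = [nu_F(z) for z in grid]

s_plus = max(values)               # approximates S^+(F)
z_p = grid[values.index(s_plus)]   # approximate critical distance
s_minus = min(values)              # approximates S^-(F)

print(f"gamma_M(F) = nu_F(0) = {nu_F(0.0):.4f}")
print(f"S+(F) ~ {s_plus:.4f} at z_p ~ {z_p:.3f}; S-(F) ~ {s_minus:.4f}")
```

Although $γ_{M}(F) = ν_{F}(0) = 0$, the grid search returns a strictly positive maximum, which is the information that the single index (1) misses.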

Outline

In applied statistical analysis, it is useful to have a large catalogue of plausible distributions with which to fit the data. According to García et al. (2015) [15], common measures of skewness can be complemented with a bivariate index of positive–negative skewness, and the authors show that the mode is the relevant central value to study both right and left skewness. In this paper, we extend the tool-box approach to fit data from probability distributions, introducing two orderings that are deduced from the skewness measures given in [15]. The first of those orderings is based on the positive part of the bivariate index of skewness, which in many instances coincides with the well known $γ_{M}(F)$. Nevertheless, the differences can be highly significant, as in the previous example. The second, more noteworthy, ordering is based on the skewness function $ν_{F}(z)$; it implies the first ordering, but the converse does not hold.

There are two reasons for ordering a family of distributions according to a given measurement of skewness. Firstly, as a property of the distribution, this ordering allows us to control its skewness by the appropriate selection of the parameter. When this is done (and the parameter is readily determined), the theoretical results have immediate applications in the data-fitting process. Secondly, when a given family of distributions is conceived as being more or less skewed according to the value of a parameter, and a measurement of skewness ratifies the ordering, it may be concluded that the functioning of this measurement provides a reasonably good fit with an intuitive conception of skewness.

The rest of this paper is organized as follows. In Section 2, we study the aggregate skewness function and the resultant skewness-based ordering of the gamma, log-logistic, lognormal, Weibull and asymmetric Laplace families of continuous probability distributions. In Section 3, we study the ordering of two of the most well-known distributions commonly used in PERT methods: the beta and the asymmetric triangular distributions. Finally, conclusions are presented in Section 4.

2. Families of Uniparametric Distributions Ordered by Skewness

Let F and G be unimodal distribution functions, with no centre or scale parameters, and modes

M_{F}

and

M_{G},

respectively. We compare their respective skewness by two different criteria.

Definition 1.

If

ν_{F} (z) - ν_{G} (z) \geq 0, \forall z \geq 0,

(4)

then we say that F has equal or more aggregate skewness to the right at any point than

G .

We denote this by

F \geq_{ν} G

.

Definition 2.

If

F

and G are both skewed only to the right, we say that F has equal or more maximum aggregate skewness to the right than G when

S^{+} (F) - S^{+} (G) \geq 0,

(5)

and we denote this by

F \geq_{+} G

.

With these definitions, it immediately follows that:

Proposition 1.

If

F \geq_{ν} G,

then

F \geq_{+} G .

The reverse implication is not true in general.

Proof.

The proof follows immediately from the definitions given in (4) and (5). ☐
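Definitions 1 and 2 can be explored numerically on a finite grid. The sketch below is only an illustration: the helper names (`s_plus`, `geq_nu`, `geq_plus`, `make_nu_ll`) are hypothetical, a grid check is numerical evidence rather than a proof, and the test functions are the log-logistic skewness functions $1/(1 + z^{θ})$ with $0 < θ \leq 1$ discussed in Section 2.2. It shows a pair for which the ordering "$\geq_{+}$" holds in both directions while "$\geq_{ν}$" holds in neither, so the converse of Proposition 1 fails.

```python
def s_plus(nu, grid):
    """Grid approximation of S^+(F) = max_z nu_F(z)."""
    return max(nu(z) for z in grid)

def geq_nu(nu_f, nu_g, grid, tol=1e-12):
    """Check condition (4), nu_F(z) - nu_G(z) >= 0, on every grid point."""
    return all(nu_f(z) - nu_g(z) >= -tol for z in grid)

def geq_plus(nu_f, nu_g, grid, tol=1e-12):
    """Check condition (5), S^+(F) - S^+(G) >= 0, on the grid."""
    return s_plus(nu_f, grid) - s_plus(nu_g, grid) >= -tol

def make_nu_ll(theta):
    """Log-logistic skewness function for 0 < theta <= 1 (mode at 0)."""
    return lambda z: 1.0 / (1.0 + z ** theta)

grid = [i / 100.0 for i in range(501)]   # hypothetical grid on [0, 5]
nu_a, nu_b = make_nu_ll(0.5), make_nu_ll(1.0)

print("a >=+ b:", geq_plus(nu_a, nu_b, grid), "| b >=+ a:", geq_plus(nu_b, nu_a, grid))
print("a >=nu b:", geq_nu(nu_a, nu_b, grid), "| b >=nu a:", geq_nu(nu_b, nu_a, grid))
```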

In the following subsections, we consider some well known uniparametric families of continuous distributions, with no centre or scale parameters but depending on a skewness parameter, and examine whether they are ordered by aggregate skewness or by maximum aggregate skewness. The gamma family is a very broad one, which includes many other well known distributions as particular cases. The log-logistic, lognormal, Weibull and asymmetric Laplace families are then studied in turn, where not already covered by the gamma family, and produce widely varying results.

2.1. Uniparametric Gamma Distributions

Let X be a uniparametric gamma distributed random variable,

G (α)

. That is, its CDF

G (x; α)

is given by

G (x; α) = \frac{1}{Γ (1 + α)} \int_{0}^{x} t^{α} e^{- t} d t,

(6)

for

x > 0

, where

- 1 < α < \infty

, and the mode is given by

M = max \{α, 0\}

. Then, for

- 1 < α \leq 0

, the density function decreases on x along the positive real line and we obtain that

ν_{G} (z; α) = 1 - G (z; α) = \frac{1}{Γ (1 + α)} \int_{z}^{\infty} t^{α} e^{- t} d t .

In these cases,

ν_{G} (z; α)

is a decreasing function on z, and

S^{+} (G (α)) = ν_{G} (0; α) = 1 .

Proposition 2.

Let

G (α_{1})

and

G (α_{2})

be gamma distributions with CDF as in (6). Then:

1.: If $- 1 < α_{1} < α_{2} < 0$ , then $G (α_{2}) \geq_{ν} G (α_{1}) .$
2.: If $0 < α_{1} < α_{2}$ , then $G (α_{1}) \geq_{+} G (α_{2}) .$

Proof.

Part 1. We can write

ν_{G} (z; α_{1}) - ν_{G} (z; α_{2}) = G (z; α_{2}) - G (z; α_{1}) .

By denoting

α_{2} = α_{1} + ε

,

ε > 0

, and then considering

u (z) = \frac{d}{d z} [ν_{G} (z; α_{1}) - ν_{G} (z; α_{2})],

we obtain

u (z) = \frac{z^{α_{2}} e^{- z}}{Γ (1 + α_{2})} - \frac{z^{α_{1}} e^{- z}}{Γ (1 + α_{1})} = z^{α_{1}} e^{- z} [\frac{z^{ε} Γ (1 + α_{1}) - Γ (1 + α_{2})}{Γ (1 + α_{2}) Γ (1 + α_{1})}] .

Therefore,

u (z) = 0

when

z = z_{0} = {[\frac{Γ (1 + α_{2})}{Γ (1 + α_{1})}]}^{1 / ε},

u (z)

is negative for

0 < z < z_{0}

, and positive for

z > z_{0}

. Also,

ν_{G} (0^{+}; α_{1}) - ν_{G} (0^{+}; α_{2}) = 0

,

ν_{G} (\infty; α_{1}) - ν_{G} (\infty; α_{2}) = 0

. Then,

\begin{matrix} ν_{G} (z_{0}; α_{1}) - ν_{G} (z_{0}; α_{2}) & = & \frac{1}{Γ (1 + α_{1}) Γ (1 + α_{2})} \int_{0}^{z_{0}} t^{α_{1}} e^{- t} [Γ (1 + α_{1}) t^{ε} - Γ (1 + α_{2})] d t, \end{matrix}

is the integral of a negative function, so it is negative, and the proof is complete.

Part 2. For

0 < α < \infty

, we have that

ν_{G} (z; α) = \{\begin{matrix} \frac{1}{Γ (1 + α)} (\int_{z + α}^{\infty} t^{α} e^{- t} d t - \int_{0}^{α - z} t^{α} e^{- t} d t), & 0 \leq z < α, \\ \frac{1}{Γ (1 + α)} \int_{z + α}^{\infty} t^{α} e^{- t} d t, & z \geq α . \end{matrix}

and,

\frac{d ν_{G}}{d z} = \{\begin{matrix} \frac{1}{Γ (1 + α)} [{(α - z)}^{α} e^{- (α - z)} - {(α + z)}^{α} e^{- (α + z)}], & 0 \leq z < α, \\ \frac{- 1}{Γ (1 + α)} {(α + z)}^{α} e^{- (α + z)}, & z \geq α . \end{matrix}

Then, clearly we have that

d ν_{G} / d z < 0

for all

z \geq α

. For

0 \leq z < α

, if we denote

w (z) = {(α - z)}^{α} e^{z} - {(α + z)}^{α} e^{- z},

then the sign of

d ν_{G} / d z

is the sign of

w (z)

. As

w (0) = 0

,

w (α) = - {(2 α)}^{α} e^{- α} < 0

, and

\frac{d w}{d z} = z {(z + α)}^{α - 1} e^{- z} - z {(α - z)}^{α - 1} e^{z} \leq 0,

we conclude that

ν_{G}

is a decreasing function on

z \geq 0

and

S^{+} (G (α)) = ν_{G} (0; α) = \frac{2 Γ (1 + α, α)}{Γ (1 + α)} - 1,

where $Γ(1 + α, α)$ is the upper incomplete gamma function, and $S^{+}(G(α))$ is a decreasing function of $α$ as $α \to \infty$. Nevertheless, a simple plot of the functions $ν_{G}(z; α_{i})$ for any $0 < α_{1} < α_{2}$ shows that they cross each other, so they are not ordered by "$\geq_{ν}$". Thus, the proof is completed. ☐
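Part 2 of Proposition 2 can be illustrated numerically. The sketch below is an assumption-laden illustration: it approximates the CDF (6) with a composite Simpson rule (not the authors' method), restricts attention to $α \geq 0$ so that the integrand is bounded at the origin, and uses the arbitrary values $α = 1, 2, 4$. It checks that $ν_{G}(0; α)$ decreases with $α$ while the skewness functions cross for larger z.

```python
import math

def gamma_cdf(x, alpha, n=2000):
    """G(x; alpha) from (6) by composite Simpson's rule (n must be even)."""
    if x <= 0.0:
        return 0.0
    h = x / n
    f = lambda t: t ** alpha * math.exp(-t)
    total = f(0.0) + f(x)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(i * h)
    return total * h / 3.0 / math.gamma(1.0 + alpha)

def nu_gamma(z, alpha):
    """Aggregate skewness function for alpha >= 0, where the mode is alpha."""
    mode = alpha
    return 1.0 - gamma_cdf(mode + z, alpha) - gamma_cdf(mode - z, alpha)

# S^+(G(alpha)) = nu_G(0; alpha) decreases with alpha ...
for alpha in (1.0, 2.0, 4.0):
    print(f"alpha = {alpha}: nu_G(0) = {nu_gamma(0.0, alpha):.4f}")

# ... but the curves cross, so there is no ordering by >=_nu
print("z = 5:", nu_gamma(5.0, 1.0), "<", nu_gamma(5.0, 2.0))
```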

2.2. Log–Logistic Distributions

The CDF of a uniparametric log–logistic distributed random variable X is given by

F_{L L} (x; θ) = {(1 + x^{- θ})}^{- 1},

(7)

for

x > 0

, with

θ > 0

. The mode of these distributions depends on

θ

. If

0 < θ \leq 1

, then

M = 0

, and

ν_{L L} (z; θ) = \frac{1}{1 + z^{θ}},

and

S^{+} (F_{L L} (θ)) = 1

. The functions $ν_{LL}(z; θ)$ for different values of $θ$ in this range cross each other at $z = 1$, so these distributions are ordered neither by the skewness function nor by the skewness indices. Nevertheless, for

θ > 1

, the mode is

0 < M = {(\frac{θ - 1}{θ + 1})}^{1 / θ} < 1 .

(8)

Notice that M is an increasing function of

θ

when

θ > 1,

because

\frac{d M}{d θ} = M \cdot [\frac{2}{(θ^{2} - 1) θ} - \frac{1}{θ^{2}} \ln \frac{θ - 1}{θ + 1}] > 0 .

(9)

When $θ > 1$, it is also known from Arnold and Groeneveld (1995) [10] that

ν_{L L} (0; θ) = \frac{1}{θ} .

As $ν_{LL}(z; θ)$ is a decreasing function of z, it follows that $1 < θ_{1} < θ_{2}$ implies $F_{LL}(θ_{1}) \geq_{+} F_{LL}(θ_{2})$. Furthermore, the skewness functions are ordered, as we prove below.

Proposition 3.

Let $F_{LL}(θ_{1})$ and $F_{LL}(θ_{2})$ be log-logistic distributions with CDF as in (7), where $1 < θ_{1} < θ_{2}$. Then,

F_{L L} (θ_{1}) \geq_{ν} F_{L L} (θ_{2}) .

(10)

Proof.

Let

θ > 1

. Then,

ν_{L L} (z; θ) = \{\begin{matrix} \frac{1 - {(M^{2} - z^{2})}^{θ}}{1 + {(M + z)}^{θ} + {(M - z)}^{θ} + {(M^{2} - z^{2})}^{θ}}, & 0 \leq z \leq M, \\ \frac{1}{1 + {(M + z)}^{θ}}, & z > M, \end{matrix}

If we consider

1 < θ_{1} < θ_{2}

, such that the respective modes verify

0 < M_{1} < M_{2} < 1

, we can then denote

a = {(M_{1} - z)}^{θ_{1}} < b = {(M_{1} + z)}^{θ_{1}},

c = {(M_{2} - z)}^{θ_{2}} < d = {(M_{2} + z)}^{θ_{2}} .

and consider the function h given by

h (θ) = {(M \pm z)}^{θ}, 0 \leq z \leq M,

with M as in (8). Then,

\begin{matrix} \frac{d h}{d θ} & = & \frac{{(M \pm z)}^{θ - 1}}{θ {(θ + 1)}^{2}} (\frac{2 θ}{M^{θ - 1}} + θ {(1 + θ)}^{2} (M \pm z) \ln (M \pm z) + {(1 + θ)}^{2} M \ln \frac{θ + 1}{θ - 1}), \end{matrix}

For

z < M,

this implies that

a < c

,

b < d

. With this notation, we can write

ν_{L L} (z; θ_{1}) - ν_{L L} (z; θ_{2})

as follows.

Firstly, for

0 \leq z \leq M_{1}

,

\begin{matrix} ν_{L L} (z; θ_{1}) - ν_{L L} (z; θ_{2}) & = & \frac{1 - a b}{(1 + a) (1 + b)} - \frac{1 - c d}{(1 + c) (1 + d)} \\ = & \frac{(c - a) + (d - b) + a c (d - b) + b d (c - a) + 2 (c d - a b)}{(a + 1) (b + 1) (c + 1) (d + 1)} > 0 . \end{matrix}

Secondly, for

M_{1} < z \leq M_{2}

, we only need to compare

d - b

, because

\begin{matrix} ν_{L L} (z; θ_{1}) - ν_{L L} (z; θ_{2}) & = & \frac{1}{1 + b} - \frac{1 - c d}{(1 + c) (1 + d)} \\ = & \frac{c + (d - b) + 2 c d + b c d}{(1 + b) (1 + c) (1 + d)} > 0 . \end{matrix}

Finally, when

z > M_{2}

,

ν_{L L} (z; θ_{1}) - ν_{L L} (z; θ_{2}) = \frac{1}{1 + b} - \frac{1}{1 + d} = \frac{d - b}{(1 + b) (1 + d)} > 0 .

Hence, the proof is completed. ☐
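Proposition 3 can be spot-checked with a short script. This is a minimal sketch, assuming the CDF (7) and the mode (8); the parameter values $θ_{1} = 2$, $θ_{2} = 3$ and the grid are arbitrary illustrative choices, and the grid check is numerical evidence only, not a substitute for the proof.

```python
def ll_cdf(x, theta):
    """Log-logistic CDF (7), written as x^theta / (1 + x^theta)."""
    if x <= 0.0:
        return 0.0
    p = x ** theta
    return p / (1.0 + p)

def ll_mode(theta):
    """Mode from (8); valid for theta > 1."""
    return ((theta - 1.0) / (theta + 1.0)) ** (1.0 / theta)

def nu_ll(z, theta):
    """nu(z) = Pr(X > M + z) - Pr(X < M - z) for theta > 1."""
    m = ll_mode(theta)
    return 1.0 - ll_cdf(m + z, theta) - ll_cdf(m - z, theta)

theta1, theta2 = 2.0, 3.0                 # hypothetical parameter values
grid = [i / 100.0 for i in range(501)]    # z in [0, 5]

ordered = all(nu_ll(z, theta1) >= nu_ll(z, theta2) for z in grid)
print("nu_LL(0; theta) vs 1/theta:", nu_ll(0.0, theta1), 1.0 / theta1)
print("F_LL(theta1) >=_nu F_LL(theta2) on the grid:", ordered)
```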

2.3. Lognormal Variance Distributions

The CDF of a uniparametric lognormal distributed random variable X is given by

L N (x; σ) = Φ (\frac{\ln x}{σ}),

(11)

for

x, σ > 0

, where

Φ (\cdot)

is the standard normal distribution function. The mode is given by

M_{σ} = \exp (- σ^{2})

and

ν_{L N} (z; σ) = 1 - Φ (\frac{\ln [z + \exp (- σ^{2})]}{σ}) - Φ (\frac{\ln [\exp (- σ^{2}) - z]}{σ}) .

Proposition 4.

Let

L N (σ_{1})

and

L N (σ_{2})

be lognormal distributions with CDF as in (11), where

0 < σ_{1} < σ_{2}

. Then,

L N (σ_{2}) \geq_{ν} L N (σ_{1}) .

Proof.

For

0 < σ_{1} < σ_{2},

the corresponding modes are

M_{1} > M_{2}

, and

Φ (\frac{\ln (z + M_{1})}{σ_{1}}) > Φ (\frac{\ln (z + M_{2})}{σ_{2}}),

Φ (\frac{\ln (- z + M_{1})}{σ_{1}}) > Φ (\frac{\ln (- z + M_{2})}{σ_{2}}),

because

Φ

is a strictly increasing function. Thus, we obtain that

ν_{L N} (z; σ_{1}) < ν_{L N} (z; σ_{2})

for all

z > 0

and the proof is completed. ☐
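The direction of the ordering in Proposition 4 can be checked numerically. The following sketch assumes the CDF (11) and the mode $M_{σ} = \exp(-σ^{2})$, builds $Φ$ from the error function, and uses the arbitrary values $σ_{1} = 0.5$ and $σ_{2} = 1$ on an illustrative grid; the names are hypothetical.

```python
import math

def phi(x):
    """Standard normal CDF via the error function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def ln_cdf(x, sigma):
    """Lognormal CDF (11); zero for x <= 0."""
    if x <= 0.0:
        return 0.0
    return phi(math.log(x) / sigma)

def nu_ln(z, sigma):
    """nu(z) = Pr(X > M + z) - Pr(X < M - z), with M = exp(-sigma^2)."""
    mode = math.exp(-sigma ** 2)
    return 1.0 - ln_cdf(mode + z, sigma) - ln_cdf(mode - z, sigma)

sigma1, sigma2 = 0.5, 1.0                 # hypothetical parameter values
grid = [i / 100.0 for i in range(501)]    # z in [0, 5]

ordered = all(nu_ln(z, sigma2) >= nu_ln(z, sigma1) for z in grid)
print("nu_LN(0; sigma):", nu_ln(0.0, sigma1), nu_ln(0.0, sigma2))
print("LN(sigma2) >=_nu LN(sigma1) on the grid:", ordered)
```

At $z = 0$ the function reduces to $1 - 2Φ(-σ)$, which increases with $σ$, in line with the larger parameter giving the more skewed distribution.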

2.4. Uniparametric Weibull Distributions

Consider the uniparametric Weibull distributions family given by the CDF

W (x; c) = 1 - \exp (- x^{c}), x > 0, c > 0 .

(12)

The mode is known to be at 0, for

c \leq 1

(as a limit, when

c < 1

) and at

0 < M_{c} = {(\frac{c - 1}{c})}^{1 / c} < 1,

for

c > 1

. The expression for

ν_{W}

is given by

ν_{W} (z; c) = \{\begin{matrix} \exp [- {(M_{c} + z)}^{c}] + \exp [- {(M_{c} - z)}^{c}] - 1, & 0 < z < M_{c} \\ \exp [- {(M_{c} + z)}^{c}], & z \geq M_{c} \end{matrix}

On the one hand, when $c < 1$, note that $ν_{W}(1; c) = e^{-1}$, so all these functions intersect at $z = 1$. Graphically, it can be seen that there is no ordering by "$\geq_{ν}$", and also that $S^{+}(W(c)) = 1$ when $c < 1$. On the other hand, for $1 \leq c_{1} < c_{2}$, the following result is obtained.

Proposition 5.

Let

W (c_{1})

and

W (c_{2})

be Weibull distributions with

1 \leq c_{1} < c_{2}

and CDF as in (12). Then,

W (c_{1}) \geq_{ν} W (c_{2}) .

Proof.

For

1 \leq c_{1} < c_{2}

, the corresponding modes are

M_{1} < M_{2}

. Then, for

0 < z < M_{1},

ν_{W} (z; c_{1}) - ν_{W} (z; c_{2}) = \{\exp [- {(M_{1} + z)}^{c_{1}}] - \exp [- {(M_{2} + z)}^{c_{2}}]\}

+ \{\exp [- {(M_{1} - z)}^{c_{1}}] - \exp [- {(M_{2} - z)}^{c_{2}}]\} > 0,

because each part of the expression inside brackets

\{\cdot\}

is positive. If we take

M_{1} \leq z < M_{2}

, then

ν_{W} (z; c_{1}) - ν_{W} (z; c_{2}) = \{\exp [- {(M_{1} + z)}^{c_{1}}] - \exp [- {(M_{2} + z)}^{c_{2}}]\}

+ \{1 - \exp [- {(M_{2} - z)}^{c_{2}}]\} > 0,

for a similar reason. Finally, if we take

z > M_{2}

, then

ν_{W} (z; c_{1}) - ν_{W} (z; c_{2}) = \exp [- {(M_{1} + z)}^{c_{1}}] - \exp [- {(M_{2} + z)}^{c_{2}}] > 0,

and the proof is completed. ☐
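Both Weibull regimes can be illustrated with a few lines of code. The sketch below assumes the CDF (12) and the mode formula above; the parameter values and the grid are arbitrary illustrative choices. It checks the common intersection at $z = 1$ for $c < 1$ and the ordering of Proposition 5 for $1 \leq c_{1} < c_{2}$ on a finite grid.

```python
import math

def weibull_cdf(x, c):
    """Weibull CDF (12); zero for x <= 0."""
    if x <= 0.0:
        return 0.0
    return 1.0 - math.exp(-(x ** c))

def weibull_mode(c):
    """Mode: ((c - 1) / c)^(1/c) for c > 1, and 0 otherwise."""
    return ((c - 1.0) / c) ** (1.0 / c) if c > 1.0 else 0.0

def nu_w(z, c):
    """nu(z) = Pr(X > M + z) - Pr(X < M - z)."""
    m = weibull_mode(c)
    return 1.0 - weibull_cdf(m + z, c) - weibull_cdf(m - z, c)

# c < 1: every curve passes through (1, exp(-1)), and the curves cross there
print(nu_w(1.0, 0.3), nu_w(1.0, 0.8), math.exp(-1.0))

# 1 <= c1 < c2: grid check of W(c1) >=_nu W(c2)
grid = [i / 100.0 for i in range(501)]
ordered = all(nu_w(z, 1.5) >= nu_w(z, 2.0) for z in grid)
print("W(1.5) >=_nu W(2.0) on the grid:", ordered)
```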

2.5. Asymmetric Laplace Distributions

The asymmetric Laplace distribution has been introduced in the literature in different ways ([16,17]). In this paper we follow the formulation of Kozubowski and Podgórski (2002) [18] (later refined in [19]). This distribution is obtained by using the scheme introduced by Fernández and Steel (1998) [20] to produce skewness in a symmetric distribution. In this way, the pdf of a skewed or asymmetric Laplace distribution can be written in the form

f (x; μ, σ, κ) = \{\begin{matrix} \frac{\sqrt{2}}{σ} \frac{κ}{1 + κ^{2}} \exp [- \frac{\sqrt{2}}{κ σ} (μ - x)], & x < μ, \\ \frac{\sqrt{2}}{σ} \frac{κ}{1 + κ^{2}} \exp [- \frac{κ \sqrt{2}}{σ} (x - μ)], & x \geq μ, \end{matrix}

where

σ, κ > 0

, and

- \infty < μ < \infty

. Then, we assign values

(0, 1)

to the centre and scale parameters (

μ

and

σ

, respectively) in order to study the aggregate skewness function, and the extreme right and left skewness indices then depend only on the skewness parameter

κ > 0

. Thus, it is easily proven that:

The aggregate skewness function of an $A L (κ)$ distribution can be written as

$ν_{A L} (z; κ) = \frac{1}{1 + κ^{2}} [\exp (- \sqrt{2} κ z) - κ^{2} \exp (- \frac{\sqrt{2}}{κ} z)] .$
$ν_{AL}(z; κ)$ is an increasing negative function of z when $κ > 1$, and a decreasing positive function of z when $0 < κ < 1$. For $κ = 1$, $ν_{AL}(z; 1) = 0$ for all $z \geq 0$. That is, any $AL$ distribution with $κ \neq 1$ is skewed only to the right or only to the left, depending on $κ$. In any case, the function verifies $\lim_{z \to \infty} ν_{AL}(z; κ) = 0$ but, when $κ \neq 1$, it never reaches that limit value. To prove these results, it is sufficient to note that

$\frac{d ν_{A L} (z; κ)}{d z} = \frac{\sqrt{2} κ}{κ^{2} + 1} [\exp (- \frac{\sqrt{2}}{κ} z) - \exp (- \sqrt{2} κ z)] .$
At $z = 0$ , the skewness function takes the following value:

$ν_{A L} (0; κ) = \frac{1 - κ^{2}}{1 + κ^{2}} .$

Then, $ν_{A L} (0; κ)$ is the value for $S^{+} (F_{A L} (κ))$ or $S^{-} (F_{A L} (κ))$ , depending on its sign.
$ν_{A L} (z; κ)$ is a strictly decreasing function on $κ$ . This is easily shown by means of

$\frac{d ν_{A L} (z; κ)}{d κ} = - \frac{\sqrt{2} z κ^{2} + 2 κ + \sqrt{2} z}{{(κ^{2} + 1)}^{2}} [\exp (- \sqrt{2} \frac{z}{κ}) + \exp (- \sqrt{2} κ z)] < 0,$

for all $z > 0$ , and all $κ > 0$ .

As a conclusion, we can enunciate the following Proposition, whose proof is straightforward and hence omitted.

Proposition 6.

Assume

0 < κ_{1} < κ_{2} < \infty

, and let

F_{A L} (κ_{1})

and

F_{A L} (κ_{2})

be the respective asymmetric Laplace distributions. Then:

1.: $F_{A L} (κ_{1}) \geq_{ν} F_{A L} (κ_{2}) .$
2.: If $0 < κ_{1} < 1$ , then $F_{A L} (κ_{1})$ is skewed only to the right.
3.: If $κ_{1} > 1$ , then $F_{A L} (κ_{1})$ is skewed only to the left.
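The closed forms behind Proposition 6 are easy to verify numerically. The sketch below assumes $μ = 0$ and $σ = 1$, as in the text, and compares the stated derivative with respect to $κ$ against a central finite difference; the function names, the step size and the test points are illustrative assumptions.

```python
import math

SQRT2 = math.sqrt(2.0)

def nu_al(z, kappa):
    """Aggregate skewness function of AL(kappa) with mu = 0, sigma = 1."""
    return (math.exp(-SQRT2 * kappa * z)
            - kappa ** 2 * math.exp(-SQRT2 * z / kappa)) / (1.0 + kappa ** 2)

def dnu_dkappa(z, kappa):
    """Derivative of nu_AL with respect to kappa, as stated in the text."""
    poly = SQRT2 * z * kappa ** 2 + 2.0 * kappa + SQRT2 * z
    expo = math.exp(-SQRT2 * z / kappa) + math.exp(-SQRT2 * kappa * z)
    return -poly * expo / (1.0 + kappa ** 2) ** 2

def finite_difference(z, kappa, h=1e-6):
    """Central difference in kappa; h is an arbitrary small step."""
    return (nu_al(z, kappa + h) - nu_al(z, kappa - h)) / (2.0 * h)

for kappa in (0.5, 1.0, 2.0):
    print(kappa, nu_al(0.0, kappa), (1.0 - kappa ** 2) / (1.0 + kappa ** 2))

print(dnu_dkappa(1.0, 0.7), finite_difference(1.0, 0.7))
```

The sign of $ν_{AL}(0; κ)$ switches at $κ = 1$, and the negative derivative in $κ$ is what yields the ordering $F_{AL}(κ_{1}) \geq_{ν} F_{AL}(κ_{2})$ for $κ_{1} < κ_{2}$.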

3. The Beta and the AST Distributions

The Program Evaluation and Review Technique (PERT) methods are well known and widely applied when the activities needed for a given project must be ordered according to precedence in time. Some of these methods require modelling the duration of each activity as a random variable, following an expert's opinion. The beta and the asymmetric triangular distributions are commonly used by engineers to describe these durations. In any case, the indications of the experts can be related to maximum and minimum values and a mode, often completed with further considerations about the shape and skewness of the PDF of the duration. A detailed study of the skewness of both families of probability distributions would therefore help to improve the model fit.

On the one hand, the asymmetric standard triangular distribution (ASTD), free of centre and scale parameters, depends on only one parameter $0 \leq θ \leq 1$, and has the pdf:

f (x | θ) = \{\begin{matrix} 2 x θ^{- 1}, & 0 \leq x \leq θ, \\ 2 (1 - x) {(1 - θ)}^{- 1}, & θ \leq x \leq 1, \\ 0, & elsewhere . \end{matrix}

There is a large body of literature on the use of the ASTD in PERT methods (see [19,21] and references therein). Note that the cases $θ = 0$ and $θ = 1$ are members of the beta family of distributions.

For

0 < θ < 1

, the

A S T D (θ)

CDF can be written as follows:

F (x | θ) = \{\begin{matrix} x^{2} θ^{- 1}, & 0 \leq x \leq θ, \\ (2 x - x^{2} - θ) {(1 - θ)}^{- 1}, & θ \leq x \leq 1 . \end{matrix}

As the mode is found to be at

x = θ

, its skewness function is found to be

ν_{A S T D} (z; θ) = (1 - 2 θ) - \frac{(1 - 2 θ)}{θ (1 - θ)} z^{2},

for

0 \leq z \leq min \{θ, 1 - θ\} .

In the case

θ = 0.5

, the skewness function is null. Then, for

0 < θ < 0.5

and

θ < z \leq 1 - θ

,

ν_{A S T D} (z; θ) = \frac{{(z - 1 + θ)}^{2}}{1 - θ} .

In the case

0.5 < θ < 1

, for

1 - θ < z \leq θ

,

ν_{A S T D} (z; θ) = - \frac{{(θ - z)}^{2}}{θ},

and it is easily found that

ν_{A S T D} (z; θ) = - ν_{A S T D} (z; 1 - θ),

(13)

for

0 \leq z < \infty

.

Some algebra shows that, for $0 < θ_{1} < θ_{2} < 1$,

$A S T D (θ_{1}) \geq_{ν} A S T D (θ_{2}) .$
If $0 < θ_{i} < 0.5$ , then $S_{A S T D}^{+} (θ_{i}) = ν_{A S T D} (0; θ_{i}) = 1 - 2 θ_{i}$ , and $S_{A S T D}^{-} (θ_{i}) = 0$ .
If $0.5 < θ_{i} < 1$ , then $S_{A S T D}^{+} (θ_{i}) = 0$ and $S_{A S T D}^{-} (θ_{i}) = ν_{A S T D} (0; θ_{i}) = 1 - 2 θ_{i}$ .

Therefore, the skewness of the ASTD distributions is completely controlled by the parameter

θ .
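The ASTD results can be reproduced directly from the CDF. The following is a minimal sketch in which the skewness function is computed as $1 - F(θ + z) - F(θ - z)$ and compared with the closed form quoted above; the parameter values and the grid are arbitrary, and the function names are hypothetical.

```python
def astd_cdf(x, theta):
    """CDF of ASTD(theta) for 0 < theta < 1, extended outside [0, 1]."""
    if x <= 0.0:
        return 0.0
    if x <= theta:
        return x * x / theta
    if x < 1.0:
        return (2.0 * x - x * x - theta) / (1.0 - theta)
    return 1.0

def nu_astd(z, theta):
    """nu(z) = Pr(X > theta + z) - Pr(X < theta - z); the mode is theta."""
    return 1.0 - astd_cdf(theta + z, theta) - astd_cdf(theta - z, theta)

def nu_closed(z, theta):
    """Closed form from the text, valid for 0 <= z <= min(theta, 1 - theta)."""
    return (1.0 - 2.0 * theta) * (1.0 - z * z / (theta * (1.0 - theta)))

grid = [i / 100.0 for i in range(101)]   # z in [0, 1]

# Symmetry (13): nu(z; theta) = -nu(z; 1 - theta)
print(max(abs(nu_astd(z, 0.3) + nu_astd(z, 0.7)) for z in grid))

# Ordering for theta_1 < theta_2 and the value S^+ = 1 - 2 * theta
ordered = all(nu_astd(z, 0.2) >= nu_astd(z, 0.3) - 1e-12 for z in grid)
print("ASTD(0.2) >=_nu ASTD(0.3) on the grid:", ordered, "| S+ =", nu_astd(0.0, 0.2))
```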

On the other hand, the pdf of a beta distribution is given by

f_{B} (x; α, β) = \frac{x^{α - 1} {(1 - x)}^{β - 1}}{B (α, β)}, 0 \leq x \leq 1,

where

α, β > 0

, and

B (\cdot, \cdot)

is the beta function. Given that its CDF $F(x; α, β)$ verifies $F(x; α, β) = 1 - F(1 - x; β, α)$ and the sign of its skewness depends only on the condition $β \geq α$ or $β \leq α$, we can study only the case $β > α$.

We are interested in the cases $α, β > 1$, where there is a unique mode M,

M = \frac{α - 1}{α + β - 2} ≐ b (α, β) .

Hence, we only consider cases where $1 < α < β$, where there exists a right skewness; the cases $1 < β < α$, with left skewness, can be immediately deduced by interchanging the parameters.

Notice that

\Pr (X > M + z) > 0

requires

0 \leq z \leq b (β, α)

, and that

\Pr (X < M - z) > 0

requires

0 \leq z \leq b (α, β)

. Then,

ν_{B} (z; α, β) = \{\begin{matrix} 1 - I_{M + z} (α, β) - I_{M - z} (α, β) > 0, & 0 \leq z \leq b (α, β) \\ 1 - I_{M + z} (α, β) > 0, & b (α, β) < z \leq b (β, α) \\ 0, & z > b (β, α), \end{matrix}

(14)

where,

I_{z} (α, β) = \int_{0}^{z} \frac{t^{α - 1} {(1 - t)}^{β - 1} d t}{B (α, β)}

is the well known regularized incomplete beta function.

Firstly, observe that

ν_{B} (0; α, β) = 1 - \frac{2}{B (α, β)} \int_{0}^{M} x^{α - 1} {(1 - x)}^{β - 1} d x,

and

\frac{1}{B (α, β)} \int_{0}^{M} x^{α - 1} {(1 - x)}^{β - 1} d x < \frac{1}{B (α, β)} \int_{0}^{m} x^{α - 1} {(1 - x)}^{β - 1} d x \approx \frac{1}{2},

where

m = \frac{α - \frac{1}{3}}{α + β - \frac{2}{3}}

is the approximate median of the distribution.

Secondly, if

0 \leq z \leq b (α, β) < b (β, α),

then

B (α, β) \cdot \frac{d ν_{B} (z; α, β)}{d z} = - {[b (α, β) + z]}^{α - 1} {[b (β, α) - z]}^{β - 1} + {[b (α, β) - z]}^{α - 1} {[b (β, α) + z]}^{β - 1},

which is negative within the range of z because, for $α < β$, the density at $M + z$ exceeds the density at $M - z$. For

b (α, β) < z \leq b (β, α)

,

B (α, β) \cdot \frac{d ν_{B} (z; α, β)}{d z} = - {[b (α, β) + z]}^{α - 1} {[b (β, α) - z]}^{β - 1} < 0 .

Hence, for

0 \leq z \leq b (β, α)

,

ν_{B} (z; α, β)

is a strictly decreasing continuous function with

ν_{B} (0; α, β) > 0

and

ν_{B} (b (β, α); α, β) = 0 .

Now we focus on the family of Beta distributions with given mode, M. That is, we consider the subfamily of Beta distributions:

B (α + 1, 1 + \frac{1 - M}{M} α),

with

α > 0

. Then, with the aid of suitable software (we have used Wolfram Mathematica 10), one can obtain the derivative

\frac{\partial}{\partial α} ν_{B} (z; α + 1, 1 + \frac{1 - M}{M} α),

and maximize this function, in two cases:

First case, the constraints are

α \geq 1,

0 < M < 1 / 2,

0 \leq z \leq b (α + 1, 1 + (1 - M) α / M) .

The maximum value of the function is 0, and it is achieved when

M = 0.5

,

α ≃ 3.54147

,

z ≃ 0.309936

.

Second case, the constraints are

α \geq 1,

0 < M < 1 / 2,

b (α + 1, 1 + (1 - M) α / M) < z \leq b (1 + (1 - M) α / M, α + 1) .

The maximum value of the function is

- 5.07056 \times 10^{- 6}

, and it is achieved when

M = 0.123564

,

α ≃ 1.62726

,

z ≃ 0.632457

.

With these results, we can conclude that

ν_{B} (z; α + 1, 1 + (1 - M) α / M)

decreases with the feasible values of

α

. That way, the subsets of Beta distributions with fixed mode are ordered on skewness (see Figure 2). As the parameter values increase, these Beta distributions become less skewed.

Figure 2. Beta distributions with common given mode $M = 0.2$ (left panel) for $α = 2, 4$ and 9 and their skewness functions $ν_{B}$ (right panel).
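The behaviour shown in Figure 2 can be approximated without symbolic software. The sketch below is an illustration under stated assumptions: the regularized incomplete beta function is computed with a composite Simpson rule (not the Mathematica computation used by the authors), and the values $α = 2, 4, 9$ from the caption are interpreted as the parameter of the fixed-mode subfamily $B(α + 1, 1 + (1 - M)α/M)$ with $M = 0.2$, which is an assumption about the caption.

```python
import math

def beta_cdf(x, a, b, n=2000):
    """Regularized incomplete beta I_x(a, b) by composite Simpson's rule."""
    if x <= 0.0:
        return 0.0
    if x >= 1.0:
        return 1.0
    h = x / n
    f = lambda t: t ** (a - 1.0) * (1.0 - t) ** (b - 1.0)
    total = f(0.0) + f(x)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(i * h)
    beta_fn = math.gamma(a) * math.gamma(b) / math.gamma(a + b)
    return total * h / 3.0 / beta_fn

def nu_beta_fixed_mode(z, alpha, mode):
    """Skewness function (14) for B(alpha + 1, 1 + (1 - mode) * alpha / mode)."""
    a = alpha + 1.0
    b = 1.0 + (1.0 - mode) * alpha / mode
    return 1.0 - beta_cdf(mode + z, a, b) - beta_cdf(mode - z, a, b)

mode = 0.2
grid = [i / 100.0 for i in range(81)]    # z in [0, 1 - mode]
curves = {alpha: [nu_beta_fixed_mode(z, alpha, mode) for z in grid]
          for alpha in (2.0, 4.0, 9.0)}

for alpha, values in curves.items():
    print(f"alpha = {alpha}: nu_B(0) = {values[0]:.4f}")
```

On this grid the three curves do not cross and the skewness function decreases as $α$ grows, in agreement with the conclusion that Beta distributions with a fixed mode become less skewed as the parameter values increase.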

4. Conclusions

In this paper two main objectives are achieved. On the one hand, the given examples show that the skewness function orders the families considered in good accordance with the intuitive conception of skewness. On the other hand, these examples show that the skewness of a distribution obtained from certain parametric families can be controlled through its parameters.

As we show, the function $ν_{F}(z)$ facilitates the description of a random variable by means of a probability distribution, by making any skewness in the model easily observable. Further research should be undertaken to examine the use of these properties in data fitting.

In practice, much can be learned from this model, but there remains the risk that it may be wrongly specified in real applications. Thus, in practice we must be willing to assume that the underlying distribution has a unique mode and belongs to a uniparametric family of distributions.

In many practical situations, the maximum skewness index coincides with the well known $γ_{M}(F)$, but this second index only takes into account the difference of probability weights at each side of the mode, while the first takes its value at the point where this difference is maximum. Moreover, the aggregate skewness function gives more accurate information about how the probability weight is distributed along both sides of the mode. Accordingly, the condition $F \geq_{ν} G$ provides highly valuable information.

Author Contributions

All authors have contributed equally to this paper.

Funding

This research received no external funding.

Acknowledgments

This research was partially funded by MINECO (Spain) grant number EC02017–85577–P. The authors are grateful for helpful suggestions made by two reviewers.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Shaked, M.; Shanthikumar, J.G. Stochastic Orders. In Springer Series in Statistics 43; Springer: New York, NY, USA, 2007.
2. Lehmann, E.L. Ordered families of distributions. Ann. Math. Statist. 1955, 26, 399–419.
3. Arnold, B.C. Majorization and the Lorenz Order: A brief introduction. In Lecture Notes in Statistics 43; Springer: New York, NY, USA, 1987.
4. Nanda, A.K.; Shaked, M. The hazard rate and reverse hazard rate orders, with applications to order statistics. Ann. Inst. Statist. Math. 2001, 53, 853–864.
5. Ramos–Romero, H.M.; Sordo–Díaz, M.A. The proportional likelihood ratio order and applications. Questiio 2001, 25, 211–223.
6. Gupta, A.K.; Aziz, M.A.S. Convex Ordering of Random Variables and its Applications in Econometrics and Actuarial Science. Eur. J. Pure Appl. Math. 2010, 3, 79–85.
7. Oja, H. On location, scale, skewness and kurtosis of univariate distributions. Scand. J. Stat. 1981, 8, 154–168.
8. Van Zwet, W.R. Mean, Median, Mode II. Stat. Neerl. 1979, 33, 1–5.
9. MacGillivray, H.L. Skewness and asymmetry: measures and orderings. Ann. Stat. 1986, 14, 994–1011.
10. Arnold, B.C.; Groeneveld, R.A. Measuring Skewness with respect to the Mode. Am. Stat. 1995, 49, 34–38.
11. Sato, M. Some remarks on the mean, median, mode and skewness. Aust. J. Stat. 1997, 39, 219–224.
12. Von Hippel, P.T. Mean, Median and Skew: Correcting a Textbook Rule. J. Stat. Educ. 2005, 13.
13. Das, S.; Mandal, P.K.; Ghosh, D. On Homogeneous Skewness of Unimodal Distributions. Indian J. Stat. 2009, 71-B, 187–205.
14. Rubio, F.J.; Steel, M. On the Marshall–Olkin transformation as a skewing mechanism. Comput. Stat. Data Anal. 2012, 56, 2251–2257.
15. García, V.J.; Martel–Escobar, M.; Vázquez–Polo, F.J. Complementary information for skewness measures. Stat. Neerl. 2015, 69, 442–459.
16. Mc Gill, W.J. Random fluctuations of response rate. Psychometrika 1962, 27, 3–17.
17. Holla, M.S.; Bhattacharya, S.K. On a compound Gaussian distribution. Ann. Instit. Stat. Math. 1968, 20, 331–336.
18. Kozubowski, T.J.; Podgórski, K. Maximum likelihood estimation of asymmetric Laplace parameters. Ann. Inst. Stat. Math. 2002, 54, 816–826.
19. Kotz, S.; van Dorp, J.R. Uneven two-sided power distribution: with applications in econometric models. Stat. Methods Appl. 2004, 13, 285–313.
20. Fernández, C.; Steel, M.F.J. On Bayesian Modeling of Fat Tails and Skewness. J. Am. Stat. Assoc. 1998, 93, 359–371.
21. Johnson, D. The triangular distribution as a proxy for the beta distribution in risk analysis. The Statistician 1997, 46, 387–398.


© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
