Unit Distributions: A General Framework, Some Special Cases, and the Regression Unit-Dagum Models

Francesca Condino; Filippo Domma

doi:10.3390/math11132888

and

Department of Economics, Statistics and Finance “Giovanni Anania”, University of Calabria, Via P. Bucci, Cubo 0C, 87036 Rende, CS, Italy

^*

Author to whom correspondence should be addressed.

Mathematics2023, 11(13), 2888;https://doi.org/10.3390/math11132888

This article belongs to the Special Issue New Advances in Distribution Theory and Its Applications

Version Notes

Order Reprints

Abstract

In this work, we propose a general framework for models with support in the unit interval, which is obtained using the technique of random variable transformations. For this class, the general expressions of distribution and density functions are given, together with the principal characteristics, such as quantiles, moments, and hazard and reverse hazard functions. It is possible to verify that different proposals already present in the literature can be seen as particular cases of this general structure by choosing a suitable transformation. Moreover, we focus on the class of unit-Dagum distributions and, by specifying two different kinds of transformations, we propose the type I and type II unit-Dagum distributions. For these two models, we first consider the possibility of expressing the distribution in terms of indicators of interest, and then, through the regression approach, relate the indicators and covariates. Finally, some applications using data on the unit interval are reported.

Keywords:

transformations; bounded support; flexible shape

MSC:

62J02

1. Introduction

In statistical literature, several authors have focused their attention on developing new and more flexible statistical distributions by using suitable transformation techniques (see, for example, [1,2,3]). Most of the obtained distributions deal with continuous random variables with unbounded support. Only in recent years has attention been devoted to filling the existing gap with respect to distributions with bounded support, in order to meet the need to describe empirical phenomena whose realizations cover limited ranges. Indeed, these kinds of data naturally arise in different contexts, such as rates, proportions, percentages, and so on, but just a few models, such as the widely used beta distribution model, the Kumaraswamy model [4], the Topp–Leone model [5], the arcsine model [6], the standard two-sided power model [7], and a few more (see [8,9]) were available in the past to describe them. Many others are very recent proposals. In particular, in the last decade, there have been many works in this field, for the most part, on models belonging to the class of so-called unit distributions. These models describe data with support in unit intervals and are often obtained by applying transformations to random variables. These include the unit-Burr III [10,11], unit-Lindley [12], unit-Gompertz [13], unit-Burr XII [14], unit inverse Gaussian [15], the arcsecant hyperbolic normal model [16], and logit slash [17], to name a few, as well as some new families of distributions [18,19].

The first aim of this work is to describe a general structure based on the random variable transformation technique, which includes most of the distributions for data on unit intervals already present in the literature. Moreover, other members of this class were obtained, considering different transformations. Particular attention is placed on building regression models, starting with unit distributions; this allows us to evaluate the impact of covariates on response variables with bounded support and consider alternative approaches to the most used regression models for unit data.

In a recent paper, [20] proposed a unified procedure to construct distribution functions in the (0,1) interval from the composition of two random variables with the same support, which turned out to be a special case of the

T - X

family introduced by [21]. Our approach differs from the one just mentioned in that it does not require the knowledge of a second distribution function or a second quantile function. Furthermore, we envisaged a reparameterization and construction of the regression models on the indicators of interest.

The rest of the paper is organized as follows. In Section 2, we define the general class of distributions and derive the expressions for distribution and probability density functions. Quantiles, moments, and general expressions for the hazard and reverse hazard rate are given. A particular case of distributions belonging to the general class is described in Section 3, starting with the Dagum random variable and considering two particular kinds of transformations. The maximum likelihood estimation is discussed in Section 4. Section 5 is devoted to showing the possibility of employing the proposed models according to a regression perspective. Finally, in Section 6, two different examples of applications are shown.

2. General Framework

Many of the recently suggested distributions, proposed for modeling data belonging to the unit interval, can be described by resorting to a single probabilistic structure based on a simple technique of a random variable transformation.

To this end, let Y be a random variable (rv) with a distribution function (pdf)

F_{Y} (y; θ)

and probability density function (df)

F_{Y} (y; θ)

, where

θ

is the parameter vector and

y \in S_{Y} \subset ℜ

,

S_{Y} = [{\underset{̲}{S}}_{Y}, {\bar{S}}_{Y}]

. Let

C : S_{Y} ⟼ J_{V}

be the application that identifies the transformation of Yrv in a new variable V, assuming values

V \in J_{V} = [{\underset{̲}{J}}_{V}, {\bar{J}}_{V}]

. In general, the distribution of V could also be characterized by a vector of parameters

a

, i.e.,

V : = C (Y; a)

.

In the present paper, in order to simplify the discussion, we assume that the boundaries of the support of V are finite, i.e.,

{lim}_{y \to {\underset{̲}{S}}_{Y}} C (y; a) = {\underset{̲}{J}}_{V} > - \infty

and

{lim}_{y \to {\bar{S}}_{Y}} C (y; a) = {\bar{J}}_{V} < \infty

, and we assume that the function

C (y; a)

is continuous, differentiable, and monotone over

S_{Y}

. Consequently,

C (y; a)

is invertible and its inverse

C^{- 1} (\cdot)

is differentiable on

J_{V}

:

{(C^{- 1} (v; a))}^{'} = \frac{1}{C^{'} (C^{- 1} (v; a))} .

(1)

Knowing the distribution function of Y and considering the transformation

C (\cdot)

, it is easy to obtain the distribution function of V and its characteristics, such as quantiles and moments. Moreover, it is typical in the literature to study the behavior of the hazard function

h_{Y} (y; θ)

(hf) and the reverse hazard function (rhf)

r h_{Y} (y; θ)

, with the aim of evaluating the flexibility of a distribution. Therefore, in the following, we obtain some general expressions of characteristics and properties for distributions belonging to this class. In doing this, we distinguish two cases, depending on whether

C (\cdot)

is an increasing or a decreasing monotonic function.

(1): $C (\cdot)$ is an increasing monotonic function:
the $d f$ of V is given by:

$F_{V} (v; θ, a) = P (V \leq v) = P (Y \leq C^{- 1} (v; a)) = F_{Y} (C^{- 1} (v; a); θ)$

(2)

and, by (1), we can obtain the $p d f$ of V as

$f_{V} (v; θ, a) = \frac{\partial F_{Y} (C^{- 1} (v; a); θ)}{\partial C^{- 1} (v; a)} \times \frac{\partial C^{- 1} (v; a)}{\partial v} = \frac{f_{Y} (C^{- 1} (v; a); θ)}{C^{'} (C^{- 1} (v; a))} .$

(3)

Moreover, let $y (p) = F_{Y}^{- 1} (y; θ)$ be the p-th quantile of Y, with $p \in (0, 1)$ . It is easy to verify that, from (2), the q-th quantile of V is as follows:

$v (q; θ, a) = C (y (q; θ); a),$

(4)

with $q \in (0, 1)$ .
The general expressions for $h r$ and $r h r$ functions are, respectively, given by:

$h_{V} (v; θ, a) = \frac{f_{V} (v; θ, a)}{1 - F_{V} (v; θ, a)} = \frac{h_{Y} (C^{- 1} (v; a); θ)}{C^{'} (C^{- 1} (v; a))}$

(5)

$r h_{V} (v; θ, a) = \frac{f_{V} (v; θ, a)}{F_{V} (v; θ, a)} = \frac{r h_{Y} (C^{- 1} (v; a); θ)}{C^{'} (C^{- 1} (v; a))} .$

(6)
(2): $C (.)$ is a decreasing monotonic function.
In this case, with little algebra, we can determine the quantities previously considered. In particular, the df and pdf of V, respectively, are as follows:

$F_{V} (v; θ, a) = 1 - F_{Y} (C^{- 1} (v; a); θ),$

$f_{V} (v; θ, a) = - \frac{f_{Y} (C^{- 1} (v; a); θ)}{C^{'} (C^{- 1} (v; a))}$

and the quantile of order q is as follows:

$v (q; θ, a) = C (y (1 - q; θ); a) .$

The hf and rhf are calculated accordingly.

We can use different methods, known in the literature, to determine the moment of order r.

We should note that most of the proposals in the literature can be thought of as particular cases of the comprehensive framework described earlier. For example, the most used transformations in the cases of positive rvs are as follows:

V = \frac{Y}{1 + Y}

and

V = e^{- Y}

. On the other hand, the most common transformation, when Y assumes a real value, is

V = \frac{1}{1 + e^{- Y}}

, as in the case of the logit slash model. Moreover,

V = \frac{2}{e^{- Y} + e^{Y}}

was used in the context of non-monotonic rv transformations to obtain the arcsecant hyperbolic normal model, which, strictly speaking, does not belong to the general framework proposed here, but it can be used in every case with small mathematical expedients. We should note that, in general, any distribution function

G (\cdot)

can be used to transform Yrv in a new variable

V = G (Y)

. Table 1 summarizes a classification of some unit distributions proposed in the literature, according to the used transformation.

Table 1. Unit distributions proposed in the literature according to the used transformation.

In many application contexts, researchers often focus on specific aspects when characterizing a distribution, such as quantiles, location measures (mode, median, mean), variability indicators, etc. For this reason, when possible, it is useful to express the distribution as a function of such characteristics. The utility derives from the fact that, with appropriate methodological tools, it is possible to construct regressive models on the characteristics of interest with the aim of inspecting the possible determinants of the phenomenon under investigation (see [28,29]). Each characteristic and/or indicator is, in general, a function of the vector of the distribution parameters, let us say

I = I (θ)

, with reference to the unit’s distribution function (2). If

θ

is a vector of dimension p and the system

\begin{matrix} I_{j} = I_{j} (θ_{1}, θ_{2}, \dots, θ_{p}) f o r j = 1, \dots, p \end{matrix}

has a unique finite solution, say,

\begin{matrix} θ_{j} = θ_{j} (I_{1}, I_{2}, \dots, I_{p}) f o r j = 1, \dots, p \end{matrix}

then the unit-distribution function

\begin{matrix} F_{V} (v; I_{1}, I_{2}, \dots, I_{p}) = F_{V} (v; θ_{1} (I_{1}, I_{2}, \dots, I_{p}), θ_{2} (I_{1}, I_{2}, \dots, I_{p}), \dots, θ_{p} (I_{1}, I_{2}, \dots, I_{p})) \end{matrix}

represents a reparameterization in terms of indicators and/or characteristics of interest of the distribution in (2).

3. Two Kinds of Unit-Dagum Distributions

In this section, two different transformations of the widely used Dagum rv [30,31] will be described. Given the ability of the Dagum model in fitting real data, the resulting new models may potentially be more flexible than unit distributions that have already appeared in the literature.

The df and pdf of Dagum rv Y are given, respectively, by:

\begin{matrix} F_{D a} (y; β, λ, δ) = {(1 + λ y^{- δ})}^{- β}, \end{matrix}

(7)

and

\begin{matrix} f_{D a} (y; β, λ, δ) = β λ δ y^{- δ - 1} {(1 + λ y^{- δ})}^{- β - 1}, \end{matrix}

(8)

with

y > 0

and

β, λ, δ > 0

. In particular, the vector of parameters of Dagum distribution (hereafter,

D a (β, δ, λ)

) is

θ = (β, λ, δ)

, where

λ

represents a scale parameter and

β

and

δ

are shape parameters.

The Dagum model is positively skewed and it can be unimodal or zero-modal, depending on

β δ > 1

or

β δ \leq 1

. In particular, the mode is given by

y_{m} = λ^{\frac{1}{δ}} {(\frac{β δ - 1}{δ + 1})}^{\frac{1}{δ}} .

(9)

It is easy to verify that the q-th quantile is

y (q) = F_{D a}^{- 1} (q; β, λ, δ) = λ^{\frac{1}{δ}} {(q^{- \frac{1}{β}} - 1)}^{- \frac{1}{δ}},

(10)

therefore, the expression of the median is explicit:

m e = λ^{\frac{1}{δ}} {(2^{\frac{1}{β}} - 1)}^{- \frac{1}{δ}} .

(11)

It is also possible to obtain the expression of the r-th moment, as follows

μ_{D a}^{r} = E (Y^{r}; β, λ, δ) = β λ^{\frac{r}{δ}} B (β + \frac{r}{δ}, 1 - \frac{r}{δ}),

(12)

which exists for

δ > r

. Here,

B (\cdot, \cdot)

indicates the complete beta function.

3.1. The First Kind of Unit-Dagum Distribution

In this section, we consider the hyperbolic secant transformation:

V : = C (Y) = \frac{2 e^{Y}}{1 + e^{2 Y}} .

In particular, it is simple to verify that, for

Y > 0

, it is a monotonic decreasing function with

{lim}_{y \to 0^{+}} C (y) = 1

,

{lim}_{y \to + \infty} C (y) = 0

and

C^{'} (y) = \frac{2 e^{y} (1 - e^{2 y})}{{(1 + e^{2 y})}^{2}} < 0

. Furthermore, it is known that the inverse hyperbolic secant is given by

y = C^{- 1} (v) = log \frac{1 + \sqrt{1 - v^{2}}}{v}

.

Taking into account the characteristics of the proposed transformation, the distribution function of the new rv V is given by

\begin{matrix} F_{I - U D a} (v; β, λ, δ) & = & 1 - F_{D a} (log \frac{1 + \sqrt{1 - v^{2}}}{v}; β, λ, δ) \\ = & 1 - {\{1 + λ {[log \frac{1 + \sqrt{1 - v^{2}}}{v}]}^{- δ}\}}^{- β} \end{matrix}

(13)

with

v \in (0, 1)

and

β, λ, δ > 0

(hereafter,

I - U D a (β, δ, λ)

). From (1), after simple algebra, we obtain the first derivative of the inverse of

C (y; a)

:

{(C^{- 1} (v; a))}^{'} = \frac{- 1}{v \sqrt{1 - v^{2}}}

and, consequently, the pdf of

I - U D a (β, δ, λ)

rv:

\begin{matrix} f_{I - U D a} (v; β, λ, δ) = \frac{β λ δ}{v \sqrt{1 - v^{2}}} {[log (v^{*})]}^{- δ - 1} {\{1 + λ {[log (v^{*})]}^{- δ}\}}^{- β - 1} \end{matrix}

(14)

where

v^{*} = \frac{1 + \sqrt{1 - v^{2}}}{v}

.

Figure 1 shows various behaviors of the pdf for the type I unit-Dagum model, according to different values of parameters.

Figure 1. Pdf of the type I unit-Dagum model for different values of parameters.

The q-th quantile of the

I - U D a (β, δ, λ)

distribution, by (10), is

\begin{matrix} v (q; β, λ, δ) = \frac{2 e^{λ^{\frac{1}{δ}} {({(1 - q)}^{- \frac{1}{β}} - 1)}^{- \frac{1}{δ}}}}{1 + e^{2 λ^{\frac{1}{δ}} {({(1 - q)}^{- \frac{1}{β}} - 1)}^{- \frac{1}{δ}}}} . \end{matrix}

(15)

In the following proposition, we show that the r-th moment of the type I unit-Dagum distribution can be expressed in terms of moments of the Dagum distribution.

Proposition 1.

The r-th moment of

V \sim I - U D a (β, δ, λ)

has the following expression:

\begin{matrix} E [V^{r}] = β 2^{r} \sum_{j = 0}^{+ \infty} (\binom{- r}{j}) \sum_{s = 0}^{+ \infty} \frac{{(- 1)}^{s}}{s!} {(2 j + r)}^{s} λ^{s} B (β + \frac{s}{δ}, 1 - \frac{s}{δ}) . \end{matrix}

(16)

Proof.

See Appendix A.1. □

The hf and rhz are given, respectively, by

\begin{matrix} h f_{I - U D a} (v; β, λ, δ) = \frac{β λ δ}{v \sqrt{1 - v^{2}} [log (v^{*})] \{λ + {[log (v^{*})]}^{δ}\}} \end{matrix}

(17)

and

\begin{matrix} r h f_{I - U D a} (v; β, λ, δ) = \frac{β λ δ {[log (v^{*})]}^{- δ - 1} {\{1 + λ {[log (v^{*})]}^{- δ}\}}^{- β - 1}}{v \sqrt{1 - v^{2}} [1 - {\{1 + λ {[log \frac{1 + \sqrt{1 - v^{2}}}{v}]}^{- δ}\}}^{- β}]} \end{matrix}

(18)

The hazard rate function of the type I unit-Dagum model for some values of parameters is shown in Figure 2.

Figure 2. Hazard rate of the type I unit-Dagum model for different values of parameters.

We propose a possible reparametrization of the type I unit-Dagum distribution in terms of the median and the

q - t h

quantile. It is possible to verify that the system

\{\begin{matrix} β_{1} = \frac{1}{β} \\ m e = \frac{2 e^{λ^{\frac{1}{δ}} {[{0.5}^{- \frac{1}{β}} - 1]}^{- \frac{1}{δ}}}}{1 + e^{2 λ^{\frac{1}{δ}} {[{0.5}^{- \frac{1}{β}} - 1]}^{- \frac{1}{δ}}}} \\ v (q) = \frac{2 e^{λ^{\frac{1}{δ}} {[{(1 - q)}^{- \frac{1}{β}} - 1]}^{- \frac{1}{δ}}}}{1 + e^{2 λ^{\frac{1}{δ}} {[{(1 - q)}^{- \frac{1}{β}} - 1]}^{- \frac{1}{δ}}}} \end{matrix}

(19)

presents the following unique solution:

\{\begin{matrix} β = \frac{1}{β_{1}} \\ λ = [{0.5}^{- β_{1}} - 1] {[log (\frac{1 + \sqrt{1 - m e^{2}}}{m e})]}^{δ^{*}} = λ^{*} \\ δ = \frac{log [{0.5}^{- β_{1}} - 1] - log [{(1 - q)}^{- β_{1}} - 1]}{log [log (\frac{1 + \sqrt{1 - v {(q)}^{2}}}{v (q)})] - log [log (\frac{1 + \sqrt{1 - m e^{2}}}{m e})]} = δ^{*} \end{matrix}

(20)

The corresponding distribution function is

\begin{matrix} F_{R - I - U D a} (v; β_{1}, m e, v (q)) = 1 - {\{1 + λ^{*} {[log \frac{1 + \sqrt{1 - v^{2}}}{v}]}^{- δ^{*}}\}}^{- \frac{1}{β_{1}}} \end{matrix}

(21)

with

β_{1} > 0

,

m e \in (0, 1)

and

v (q) \in (0, 1)

for

q \in (0, 1)

.

3.2. A Second Kind of Unit-Dagum Distribution

In this section, we consider the monotonic decreasing transformation

V : = C (Y) = e^{- Y}

, with

{lim}_{y \to 0^{+}} C (y) = 1

,

{lim}_{y \to + \infty} C (y) = 0

and

C^{'} (y) = - e^{- y} < 0

,

\forall y

. The inverse is given by

y = C^{- 1} (v) = - log (v)

.

The distribution function of V is given by

\begin{matrix} F_{I I - U D a} (v; β, λ, δ) & = & 1 - F_{D a} (- log v; β, λ, δ) \\ = & 1 - {\{1 + λ {[- log (v)]}^{- δ}\}}^{- β} \end{matrix}

(22)

with

v \in (0, 1)

and

β, λ > 0, δ > 0

(hereafter,

I I - U D a (β, δ, λ)

). From (1), after simple algebra, we obtain the first derivative of the inverse of

C (y; a)

:

{(C^{- 1} (v; a))}^{'} = - \frac{1}{v}

and, consequently, the pdf of

I I - U D a (β, λ, δ)

rv:

\begin{matrix} f_{I I - U D a} (v; β, λ, δ) = \frac{β λ δ}{v} {[- log (v)]}^{- δ - 1} {\{1 + λ {[- log (v)]}^{- δ}\}}^{- β - 1} . \end{matrix}

(23)

It is worth noting that the distribution in (23) can be viewed as an extension of the unit-Burr III obtained by [11], using the same transformation. Indeed, the Dagum model has one more parameter than Burr III, that is a scale parameter, thus, by putting

λ = 1

, the unit-Burr III is obtained. Although the unit-Burr III is already studied in the literature, for the purposes of this work, as will be seen later, the

λ

parameter is essential for carrying out the reparameterization and building the regression model; therefore, here, we consider the type II unit-Dagum distribution, also considering the scale parameter.

Figure 3 shows various behaviors of the pdf for the type II unit-Dagum model according to different parameter values.

Figure 3. Pdf of II-UDa for different values of parameters.

The q-th quantile of the

I I - U D a (β, δ, λ)

distribution, by (10), is

\begin{matrix} v (q; β, λ, δ) = e^{- λ^{1 / δ {[{(1 - q)}^{- 1 β} - 1]}^{- 1 / δ}}} . \end{matrix}

(24)

It can be readily verified that the r-th moment of the type II unit-Dagum distribution coincides with the Laplace transform of the Dagum distribution and it can be expressed in terms of moments of the Dagum distribution.

Proposition 2.

The r-th moment of

V \sim I I - U D a (β, δ, λ)

has the following expression:

\begin{matrix} E [V^{r}] = β \sum_{s = 0}^{+ \infty} \frac{{(- r)}^{s}}{s!} λ^{\frac{s}{δ}} B (β + \frac{s}{δ}, 1 - \frac{s}{δ}) . \end{matrix}

(25)

Proof.

See Appendix A.2. □

The hf and rhf are given, respectively, by

h_{I I - U D a} (v; θ, a) = \frac{β λ δ {[- log (v)]}^{- δ - 1}}{v {1 + λ {[- log (v)]}^{- δ}}}

(26)

and

\begin{matrix} r h f_{I I - U D a} (v; β, λ, δ) = \frac{β λ δ {[- log (v)]}^{- δ - 1} {1 + λ {[- log (v)]}^{- δ}}^{- β - 1}}{v [1 - {1 + λ {[- log (v)]}^{- δ}}^{- β}]} \end{matrix}

(27)

The hazard rate function of the type II unit-Dagum model for some values of parameters is shown in Figure 4.

Figure 4. Hazard rate of II-UDa for different values of parameters.

It is easy to verify that a possible reparametrization of the type II unit-Dagum distribution in terms of the median and the

q - t h

quantile can be obtained as a solution of the following system:

\{\begin{matrix} β_{1} = \frac{1}{β} \\ m e = e^{- {[\frac{{0.5}^{- 1 / β} - 1}{λ}]}^{- 1 / δ}} \\ v (q) = e^{- {[\frac{{(1 - q)}^{- 1 / β} - 1}{λ}]}^{- 1 / δ}} \end{matrix}

(28)

that presents the following unique solution

\{\begin{matrix} β = \frac{1}{β_{1}} \\ λ = [{0.5}^{- β_{1}} - 1] \cdot {[- log (m e)]}^{\bar{δ}} = \bar{λ} \\ δ = \frac{log [{(1 - q)}^{- β_{1}} - 1] - log [{0.5}^{- β_{1}} - 1]}{log [log (m e) / log (v (q))]} = \bar{δ} \end{matrix}

(29)

The corresponding distribution function is as follows:

\begin{matrix} F_{R - I I - U D a} (v; β_{1}, m e, v (q)) = 1 - {\{1 + \bar{λ} {[- log (v)]}^{- \bar{δ}}\}}^{- \frac{1}{β_{1}}} \end{matrix}

(30)

with

β_{1} > 0

,

m e \in (0, 1)

and

v (q) \in (0, 1)

for

q \in (0, 1)

.

4. Inference

In this section, we use the maximum likelihood (ML) method to estimate the parameters of type I and type II unit-Dagum distributions under the hypothesis of homogeneity of the statistical units, i.e., assuming that there are no systematic factors (covariates), which make the observations heterogeneous. To this end, we first rewrite the probability density functions (14) and (23), in a single expression as follows

\begin{matrix} f_{I - I I - U D a} (v; β, λ, δ) = \\ β λ δ {[C^{- 1} (v)]}^{- δ - 1} {\{1 + λ {[C^{- 1} (v)]}^{- δ}\}}^{- β - 1} \{- \frac{\partial C^{- 1} (v)}{\partial v}\} \end{matrix}

(31)

where

C^{- 1} (v) = log \frac{1 + \sqrt{1 - v^{2}}}{v}

in the case of the type I unit-Dagum distribution or

C^{- 1} (v) = - log v

in the case of the type II unit-Dagum distribution. Let

v = (v_{1}, \dots, v_{n})

be a random sample of size n from (31), the log-likelihood function for

θ = (β, λ, δ)

is as follows:

\begin{matrix} ℓ_{I - I I - U D a} (θ; v) & \propto & n log (β λ δ) - (δ + 1) \sum_{i = 1}^{n} log C^{- 1} (v_{i}) \\ - & (β + 1) \sum_{i = 1}^{n} log \{1 + λ {[C^{- 1} (v_{i})]}^{- δ}\} \end{matrix}

(32)

Differentiating

ℓ_{I - I I - U D a} (θ; v)

with respect to

β

,

λ

, and

δ

, respectively, we obtain the components of vector score

U (θ) = (U_{β} (θ), U_{λ} (θ), U_{δ} (θ))

, where

\begin{matrix} U_{β} (θ) = \frac{\partial ℓ_{I - I I - U D a} (θ; v)}{\partial β} = \frac{n}{β} - \sum_{i = 1}^{n} log \{1 + λ {[C^{- 1} (v_{i})]}^{- δ}\} \end{matrix}

(33)

\begin{matrix} U_{λ} (θ) = \frac{\partial ℓ_{I - I I - U D a} (θ; v)}{\partial λ} = \frac{n}{λ} - (β + 1) \sum_{i = 1}^{n} \frac{{[C^{- 1} (v_{i})]}^{- δ}}{1 + λ {[C^{- 1} (v_{i})]}^{- δ}} \end{matrix}

(34)

\begin{matrix} U_{δ} (θ) & = & \frac{\partial ℓ_{I - I I - U D a} (θ; v)}{\partial δ} = \frac{n}{δ} - \sum_{i = 1}^{n} log C^{- 1} (v_{i}) \\ + & (β + 1) \frac{λ {[C (v_{i})]}^{- δ} log [C^{- 1} (v_{i})]}{1 + λ {[C^{- 1} (v_{i})]}^{- δ}} \end{matrix}

(35)

and setting the components of the score vector equal to zero, we obtain the system of likelihood equations, whose solution gives the ML estimates

\hat{θ} = (\hat{β}, \hat{λ}, \hat{δ})

of the parameter vector

θ = (β, λ, δ)

. The system does not admit any explicit solution; therefore, the ML estimates

\hat{θ}

can only be obtained by means of numerical procedures.

Confidence intervals and hypothesis tests for

θ

can be constructed using the usual asymptotic properties of the maximum likelihood estimators. In particular, we highlight that the expected Fisher information matrix of the parameter vector

θ

coincides with the expected Fisher information matrix of

θ

of the Dagum distribution (see Appendix A.3). This means that when constructing confidence intervals and hypothesis tests for the parameters of type I and II models of the unit-Dagum distribution, we can use the asymptotic variance and covariance matrix calculated in [32,33].

5. Unit-Dagum Regression Models

An important aspect to investigate is how heterogeneity among statistical units impacts possible measures of interest, such as median and extreme quantiles, simultaneously and directly. Given the particular nature of the dependent variable, this leads us to consider a regression approach where the response variable is defined on the unit interval.

The literature on this theme is wide and often deals with two different possibilities: properly transforming data to map the (0,1) interval to the real line and then using a common regression analysis, or choosing a suitable distribution and defining the relations among distribution parameters and covariates. Regarding the first kind of approach, various transformations are possible, and the logit is the most popular, but as [34] underlines, transformations can be inappropriate since the heteroscedasticity and skewness in data are not properly handled; moreover, the interpretation of results is possible only on the transformed scale. On the other hand, the second approach is nowadays preferred and widely explored, with different existing proposals based on various distributions and response variables. For example, when the attention is focused on the mean, the most popular distribution is the beta [35], but other possibilities are represented by simplex [36], log-Bilal [27], log-Lindley [37], log-weighted exponential [38], and unit gamma [39], to cite a few. When the focus is on the median or, in general, on the distribution quantiles, regression models can be based on Kumaraswamy [40], Johnson-t [41], log-extended exponential-geometric [42], L-logistic [43], or unit-type distributions (see, for example, [14,22,44]). Our proposal fits into the latter approach.

Specifically, given a sample of n observations, for each statistical unit i (

i = 1, \dots, n

), we observe the individual dependent variable value

v_{i}

and the sets of individual covariates supposedly related to indicators and summarized in the vectors,

x_{j, i}

, for

j = 1, 2, 3

. The three sets of covariates

x_{1, i}

,

x_{2, i}

, and

x_{3, i}

are not necessarily the same, and, even if equal, their impact on the corresponding indicator may be different.

The vectors

x_{j i} = (x_{j i 1}, x_{j i 2}, \dots, x_{j i p_{j}})

for

i = 1, \dots, n

,

j = 1, 2, 3

, define the rows of three block

n \times p_{j}

matrices

X_{j}

of

X

. Each one refers to the

p_{j}

covariates affecting the

j - t h

indicator

I_{j}

.

Each indicator, analogous to generalized linear models, is then related to the covariates, through an appropriate link function

h_{j} (\cdot)

, as follows:

I_{j, i} = h_{j} (x_{j, i}, γ_{j}) .

(36)

The link functions are chosen to guarantee suitable restrictions on the parameter space, considering if

I_{j, i}

is positive or varies on

(0, 1)

. The elements of the vector

γ_{j}

= {(γ_{j, 1}, γ_{j, 2}, \dots, γ_{j, p_{j}})}^{'}

are the unknown regression coefficients related to the

p_{j}

individual characteristics to be estimated, applying the maximum likelihood method. By using the reformulation of unit-Dagum models in terms of indicators of interest, as shown in expressions (20) and (29), it is possible to relate the new parameters, such as the median and q-th quantile, to individual characteristics. In particular, observing that the solutions given in (20) and (29) are functions of the indicators of interest, i.e.,

λ^{*} = λ^{*} (β_{1}, m e, v (q))

,

δ^{*} = δ^{*} (β_{1}, m e, v (q))

for the type I unit-Dagum distribution and

\bar{λ} = \bar{λ} (β_{1}, m e, v (q))

,

\bar{δ} = \bar{δ} (β_{1}, m e, v (q))

for the type II unit-Dagum distribution, and specifying the indicators of interest as functions of the covariates

β_{1, i} = h_{1} (x_{1, i}, γ_{1})

,

m e_{i} = h_{2} (x_{2, i}, γ_{2})

and

v {(q)}_{i} = h_{3} (x_{3, i}, γ_{3})

, from (21) and (30), for the i-th observation, we can rewrite the pdfs as functions of the regression coefficients

γ_{1}

,

γ_{2}

, and

γ_{3}

. Similar to what was done previously, we use a single structure to represent type I and type II unit-Dagum distributions, simultaneously, as follows:

\begin{matrix} f_{R - I - I I - U D a} (v_{i}; γ_{1}, γ_{2}, γ_{3}) & = & \frac{{\tilde{λ}}_{i} {\tilde{δ}}_{i}}{{\tilde{β}}_{1, i}} {[C^{- 1} (v)]}^{- {\tilde{δ}}_{i} - 1} {\{1 + {\tilde{λ}}_{i} {[C^{- 1} (v)]}^{- {\tilde{δ}}_{i}}\}}^{- \frac{1}{{\tilde{β}}_{1, i}} - 1} \\ \times & \{- \frac{\partial C^{- 1} (v)}{\partial v_{i}}\} \end{matrix}

(37)

where

{\tilde{λ}}_{i} = λ_{i}^{*}

,

{\tilde{δ}}_{i} = δ_{i}^{*}

in the case of the type I unit-Dagum, and

{\tilde{λ}}_{i} = {\bar{λ}}_{i}

,

{\tilde{δ}}_{i} = {\bar{δ}}_{i}

in the type II unit-Dagum distribution. Putting

γ = {(γ_{1}^{^{'}}, γ_{2}^{^{'}}, γ_{3}^{^{'}})}^{^{'}}

, by (37), the i-th element of the log-likelihood function is

\begin{matrix} ℓ (γ; v, X) & \propto & log ({\tilde{λ}}_{i}) + log ({\tilde{δ}}_{i}) - log ({\tilde{β}}_{1, i}) - ({\tilde{δ}}_{i} + 1) log [C^{- 1} (v_{i})] \\ - & (\frac{1}{{\tilde{β}}_{1, i}} + 1) log (1 + {\tilde{λ}}_{i} {[C^{- 1} (v_{i})]}^{- {\tilde{δ}}_{i}}) . \end{matrix}

(38)

Remembering that the parameters

{\tilde{β}}_{1, i}

,

{\tilde{λ}}_{i}

, and

{\tilde{δ}}_{i}

are functions of the vector

γ

of the dimension

p = p_{1} + p_{2} + p_{3}

, the

j r_{j} - t h

equation of the likelihood system is given by

\begin{matrix} \frac{\partial ℓ (γ; v, X)}{\partial γ_{j, r_{j}}} & = & \frac{1}{{\tilde{λ}}_{i}} (\frac{\partial {\tilde{λ}}_{i}}{\partial γ_{j, r_{j}}}) + \frac{1}{{\tilde{δ}}_{i}} (\frac{\partial {\tilde{δ}}_{i}}{\partial γ_{j, r_{j}}}) - \frac{1}{{\tilde{β}}_{1, i}} (\frac{\partial {\tilde{β}}_{1, i}}{\partial γ_{j, r_{j}}}) \\ - & log [C^{- 1} (v_{i})] (\frac{\partial {\tilde{δ}}_{i}}{\partial γ_{j, r_{j}}}) + \frac{1}{{({\tilde{β}}_{1, i})}^{2}} (\frac{\partial {\tilde{β}}_{1, i}}{\partial γ_{j, r_{j}}}) log (1 + {\tilde{λ}}_{i} {[C^{- 1} (v_{i})]}^{- {\tilde{δ}}_{i}}) \\ - & (\frac{1}{{\tilde{β}}_{1, i}} + 1) \frac{(\frac{\partial {\tilde{λ}}_{i}}{\partial γ_{j, r_{j}}}) - {\tilde{λ}}_{i} (\frac{\partial {\tilde{δ}}_{i}}{\partial γ_{j, r_{j}}}) log [C^{- 1} (v_{i})]}{{[C^{- 1} (v_{i})]}^{{\tilde{δ}}_{i}} + {\tilde{λ}}_{i}} = 0 \end{matrix}

(39)

for

j = 1, 2, 3

and

r_{j} = 1, 2, \dots, p_{j}

. The partial derivatives in system (39) are given in Appendix A.4.

The system of the likelihood equations does not admit any explicit solution; therefore, the ML estimates

{\hat{γ}}_{j, r_{j}}

for

j = 1, 2, 3

and

r_{j} = 1, 2, \dots, p_{j}

can only be obtained by means of numerical procedures. Under the usual regularity conditions, the known asymptotic properties of the maximum likelihood method ensure that

\sqrt{(} n) ({\hat{γ}}_{n} - γ) \overset{d}{\to} N (0, Σ_{γ})

, where

Σ_{γ} = {[{lim}_{n \to \infty} I (γ) / n]}^{- 1}

is the

(p_{1} + p_{2} + p_{3}) \times (p_{1} + p_{2} + p_{3})

asymptotic variance–covariance matrix and

I (γ)

is the Fisher information matrix, given by

I (γ) = - E (H)

, where

H

is the Hessian matrix of the second partial derivatives of the log-likelihood function, i.e.,

\frac{\partial^{2} ℓ (γ; v, X)}{\partial γ_{j, r_{j}} γ_{h, r_{h}}}

. Elements of the

I (γ)

matrix are not reported here for space purposes, but are available upon request.

6. Applications

In order to show the potentiality of the proposed models, we consider two famous and widely used datasets, referred to data that fall into the unit interval and contained in the R package, betareg, namely household food expenditures and reading skills. In particular, the household food expenditure data regard the proportion of income spent on food for 38 households living in a large U.S. city and contain information on the perceived income and the number of persons living in the household. The reading skills dataset refers to the scores obtained in a test on reading accuracy involving 44 Australian children, including 19 dyslexic subjects and 25 non-dyslexic subjects. Moreover, the status of each child, and information regarding the nonverbal intelligent quotient (iq), are available.

These datasets were used by [34] to describe the implementation of the beta regression in the R system and to underline the advantage of this kind of regression with respect to the linear one when data belong to the unit interval. Therefore, as a further aim of this section, we will compare the performance of the unit-Dagum regression models with that of the widely used beta regression. Indeed, both methodologies give us the possibility to evaluate, among other aspects, the impact of some covariates on measures of central tendency, namely the mean in the case of the beta regression, and the median in the case of the unit-Dagum regression. It is worth noting that when data exhibit skewness, the median should be preferred as the centrality measure. Therefore, the proposed regression could be more appropriate in some cases.

6.1. Modeling Food/Income and Accuracy Data

In this section, we consider the proportion of income spent on food and the scores regarding reading accuracy. The corresponding empirical distributions are shown in Figure 5. To evaluate the adequacy of the proposed models in describing the considered data, the maximum likelihood estimates (MLEs) of the parameters for the I-UDa and II-Da densities reported in (14) and (23) are obtained, along with the corresponding standard errors and the values for the Akaike information criterion (AIC). Moreover, we compare the obtained results with the analogs for the beta and Kumuraswamy (KW) models, which are likely the most used models for data on bounded support. Table 2 presents the obtained results. Both the AIC values and the inspection of Figure 5 suggest that the proposed models better describe the considered data if compared with the beta and KW distributions. In particular, the lower value of the AIC for food expenditure data is obtained in correspondence with the type II unit-Dagum model, while, for reading skills data, the type I unit-Dagum reaches the lower result, far from the beta and KW ones. We should note that the chosen data are very different from each other in terms of the distribution shape, so these examples give us the possibility of testing the flexibilities of our models and their ability to properly reproduce different characteristics of the phenomena, such as unimodality, increasing density, presence of asymmetry, fat tails, and so on.

Figure 5. Empirical and fitted distributions for I-Da, II-Da, beta, and KW models.

Table 2. MLEs, corresponding standard errors (in brackets), and AIC values for I-Da, II-Da, beta, and KW models in food expenditure and reading skills data.

6.2. Considering the Covariates: The Regression Models

In this section, we consider both type I and type II unit-Dagum distributions according to a regressive perspective and we compare their performances with results from the well-known beta regression.

To this end, we also take into account data regarding covariates and results reported in [34], corresponding with the best beta regression model for each dataset. We should note that, as can be viewed from Figure 5, both the income/food proportions and the reading accuracy scores show an asymmetric distribution; therefore, attention is placed on the median rather than the mean of the distribution, and it could be more appropriate to analyze the central tendency.

Food expenditure data

For the first dataset, information on household income and the number of people living in the household are available. Starting with the reparameterization data reported in (19) and (28), we consider the effect of these covariates on the median and 90th quantile, according to the regression models described in Section 5. Since both the indicators assume values in the unit interval, a logit-link function is used to relate the median and 90th quantiles to the covariates. Moreover, we consider an intercept term related to the

β_{1}

indicator through a log-link function, which is suitable for positive indicators. The ML estimates of the coefficients, their standard errors, and results from the Wald test are reported in Table 3. In both models, we find that the median and 90th quantiles of the proportions spent on food decrease as income increases, while the number of persons living in a household shows a positive significant effect on the 90th quantile, ceteris paribus. Moreover, both models outperformed the beta regression in terms of AIC (−88.37 for beta regression), with the best results obtained for II-UDa regression. A comparison between empirical and fitted curves reported in Figure 6 confirms these results. In particular, here, two different curves are shown for each model. Indeed, through the regression approach and the resulting estimates, it is possible to consider the behaviors of density functions for different covariate values. The depicted curves refer to the median and

v (0.9)

indicators for the I-Da and II-Da model, and to the mean and dispersion parameters for the beta model, when income and the number of persons are equal to the average level observed for

v \leq 0.5

and

v > 0.5

, respectively (

μ_{i n c} = 60.65

;

μ_{p e r s} = 3.37

vs.

μ_{i n c} = 32.65

;

μ_{p e r s} = 6

). This allows us to evaluate the ability of the models to describe the right distribution tail, as well as the central tendency.

Table 3. MLEs, corresponding standard errors, and Wald test results for the I-Da and II-Da regression models for food expenditure data.

Figure 6. Empirical and fitted distributions for I-Da, II-Da, and beta models. The solid lines and the dotted lines refer to the average values of covariates obtained, respectively, for

v \leq 0.5

and

v > 0.5

.

Reading skills data

In the reading skills dataset, in addition to information regarding the presence of dyslexia, z scores for the nonverbal intelligent quotient (iq) test are available. Therefore, we can consider the effects of these characteristics on the median and 90th quantiles of reading accuracy scores, by specifying a logit-link function to relate indicators and covariates. In particular, as suggested by [34], we consider an interaction term between iq and dyslexia. Once again, we relate an intercept term to

β_{1}

, using a log-link function. Similar to that obtained by [34] for regression on the mean indicator, we find a significant main and interaction effect on the median for dyslexia and iq, for both I-Da and II-Da models. Specifically, results reported in Table 4 confirm the positive effect of iq and the negative effect for dyslexia and the interaction term. Moreover, we also find a significant negative effect of dyslexia on the 90th quantile.

Table 4. MLEs, corresponding standard errors, and Wald test results for the I-Da and II-Da regression models for reading skills data.

In this case, the model with the best performance in terms of AIC is the I-Da one, but both of the proposed models show lower values than the beta regression (AIC = −117.8). Figure 7 shows the comparisons among empirical and fitted distributions for dyslexic and non-dyslexic subjects, considering an average iq level that is equal to −0.653 for dyslexic subjects and 0.4966 for control subjects.

Figure 7. Empirical and fitted distributions for I-Da, II-Da, and beta models. Different curves refer to dyslexic and non-dyslexic subjects, considering the average iq level for each group.

7. Concluding Remarks

In this paper, we show that many of the existing proposals on probability distributions for data in the unit interval can be viewed as particular cases of a general class of models, obtained using the techniques of rv transformations. In the present paper, expressions on the distribution and density functions of the class are given and the principal characteristics are furnished. Through the proper transformation choice, it is possible to obtain new distribution functions on bounded support, whose characteristics are easy to derive. Indeed, two new distributions are proposed, starting with the Dagum model, and considering two different transformations. The resulting models are particularly flexible, as is evident by choosing different sets of parameter values and by looking at the behavior of their densities and hazard functions.

We also considered the possibility of reparameterizing the distributions in order to express them in terms of the indicators of interest. In particular, we obtained models that depend on the median and quantile; this gave us the opportunity to relate these quantities to covariates, according to a regressive perspective. Given the particular nature of the involved variables, this led us to consider the regression approach, where the response variable was defined on the unit interval. Therefore, the proposed methodology can be considered as an alternative to other approaches that are often employed when the response variable represents proportions, rates, or percentages. Furthermore, considering regression on the median could be more appropriate in the presence of asymmetry. The applications on two different datasets allowed us to evaluate the behaviors of the suggested models and compare their performances with the most widely used approach in this context, namely the beta regression. The obtained findings are encouraging since both models seem to be very competitive.

Author Contributions

Conceptualization, F.D.; Methodology, F.C. and F.D.; Software, F.C.; Formal analysis, F.C. and F.D.; Data curation, F.C.; Writing—review & editing, F.C. and F.D. All authors have read and agreed to the published version of the manuscript.

Funding

The authors gratefully acknowledge research grant ‘Fondo sostegno aree socio-umanistiche-Cda del 26.03.2021-quota DESF’ from the University of Calabria.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Appendix A.1. Proof of Proposition 1

First, in the following expression

\begin{matrix} E [V^{r}] & = & \int_{0}^{1} v^{r} \frac{β λ δ}{v \sqrt{1 - v^{2}}} {[log (\frac{1 + \sqrt{1 - v^{2}}}{v})]}^{- δ - 1} \\ {\{1 + λ {[log (\frac{1 + \sqrt{1 - v^{2}}}{v})]}^{- δ}\}}^{- β - 1} d v \end{matrix}

putting

z = {\{1 + λ {[log (\frac{1 + \sqrt{1 - v^{2}}}{v})]}^{- δ}\}}^{- 1}

, with

d v = - \frac{v \sqrt{1 - v^{2}}}{λ δ} {[log (\frac{1 + \sqrt{1 - v^{2}}}{v})]}^{δ + 1} {\{1 + λ {[log (\frac{1 + \sqrt{1 - v^{2}}}{v})]}^{- δ}\}}^{2} d z

, after algebra, we obtain

\begin{matrix} E [V^{r}] = β \int_{0}^{1} {(2 e^{λ^{\frac{1}{δ}} {(\frac{z}{1 - z})}^{\frac{1}{δ}}})}^{r} {(1 + e^{2 λ^{\frac{1}{δ}} {(\frac{z}{1 - z})}^{\frac{1}{δ}}})}^{- r} z^{β - 1} d z . \end{matrix}

Now, using Newton’s Binomial theorem, we can write

\begin{matrix} {(\frac{2 e^{λ^{\frac{1}{δ}} {(\frac{z}{1 - z})}^{\frac{1}{δ}}}}{1 + e^{2 λ^{\frac{1}{δ}} {(\frac{z}{1 - z})}^{\frac{1}{δ}}}})}^{r} = 2^{r} \frac{e^{- r λ^{\frac{1}{δ}} {(\frac{z}{1 - z})}^{\frac{1}{δ}}}}{{(1 + e^{- 2 λ^{\frac{1}{δ}} {(\frac{z}{1 - z})}^{\frac{1}{δ}}})}^{r}} = 2^{r} \sum_{j = 0}^{+ \infty} (\binom{- r}{j}) e^{- (2 j + r) λ^{\frac{1}{δ}} {(\frac{z}{1 - z})}^{\frac{1}{δ}}} \end{matrix}

substituting this last result in the rth moment, we obtain

\begin{matrix} E [V^{r}] = β 2^{r} \sum_{j = 0}^{+ \infty} (\binom{- r}{j}) \int_{0}^{1} e^{- (2 j + r) λ^{\frac{1}{δ}} {(\frac{z}{1 - z})}^{\frac{1}{δ}}} z^{β - 1} d z \end{matrix}

Putting

y = (2 j + r) λ^{\frac{1}{δ}} {(\frac{z}{1 - z})}^{\frac{1}{δ}}

, with

y \in (0, \infty)

and

d z = δ {(2 j + r)}^{\frac{1}{δ}} λ y^{δ - 1} {[(2 j + r)}^{\frac{1}{δ}} λ +

y^{δ}]^{- 2} d y

, after algebra, we obtain

\begin{matrix} E [V^{r}] & = & β 2^{r} \sum_{j = 0}^{+ \infty} (\binom{- r}{j}) \frac{1}{β} \int_{0}^{+ \infty} e^{- y} β δ {(2 j + r)}^{\frac{1}{δ}} λ y - δ - 1 {(1 + {(2 j + r)}^{\frac{1}{δ}} λ y^{- δ})}^{- β - 1} d y \\ = & 2^{r} \sum_{j = 0}^{+ \infty} (\binom{- r}{j}) E [e^{- Y}; β, δ, λ {(2 j + r)}^{\frac{1}{δ}}] \\ = & β 2^{r} \sum_{j = 0}^{+ \infty} (\binom{- r}{j}) \sum_{s = 0}^{+ \infty} \frac{{(- 1)}^{s}}{s!} λ^{\frac{s}{δ}} {(2 j + r)}^{s} B (β + \frac{s}{δ}, 1 - \frac{s}{δ}) . \end{matrix}

Appendix A.2. Proof of Proposition 2

Given the considered transformation

V : = C (Y) = e^{- Y}

, it is evident that the r-th moment of the type II unit-Dagum distribution coincides with the Laplace transform of the Dagum distribution, i.e.,

\begin{matrix} E [V^{r}] & = & E [e^{- r Y}] \\ = & β λ δ \int_{0}^{+ \infty} e^{- r t} t^{- δ - 1} {(1 + λ t^{- δ})}^{- β - 1} d t \end{matrix}

and putting

e^{- r t} = \sum_{s = 0}^{+ \infty} \frac{{(- r t)}^{s}}{s!}

, we obtain:

\begin{matrix} E [V^{r}] & = & \sum_{s = 0}^{+ \infty} \frac{{(- r)}^{s}}{s!} \int_{0}^{+ \infty} t^{s} β λ δ t^{- δ - 1} {(1 + λ t^{- δ})}^{- β - 1} d t \\ = & \sum_{s = 0}^{+ \infty} \frac{{(- r)}^{s}}{s!} μ_{D a}^{s} . \end{matrix}

Substituting expression (12) into the s-th moment of the Dagum distribution in the previous equation, we obtain the expression for the r-th moment of the type II unit-Dagum distribution.

Appendix A.3. Fisher Information Matrix

In order to compute the expected Fisher information matrix, we consider the following elements of the Hessian matrix:

\begin{matrix} \frac{\partial^{2} ℓ_{I - I I - U D a} (θ; v)}{\partial β^{2}} = - \frac{n}{β^{2}} \end{matrix}

(A1)

\begin{matrix} \frac{\partial^{2} ℓ_{I - I I - U D a} (θ; v)}{\partial β λ} = - \sum_{i = 1}^{n} \frac{{[C^{- 1} (v_{i})]}^{- δ}}{1 + λ {[C^{- 1} (v_{i})]}^{- δ}} \end{matrix}

(A2)

\begin{matrix} \frac{\partial^{2} ℓ_{I - I I - U D a} (θ; v)}{\partial β δ} = \sum_{i = 1}^{n} \frac{λ {[C^{- 1} (v_{i})]}^{- δ} ln [C^{- 1} (v_{i})]}{1 + λ {[C^{- 1} (v_{i})]}^{- δ}} \end{matrix}

(A3)

\begin{matrix} \frac{\partial^{2} ℓ_{I - I I - U D a} (θ; v)}{\partial λ^{2}} = - \frac{n}{λ^{2}} + (β + 1) \sum_{i = 1}^{n} \frac{{[C^{- 1} (v_{i})]}^{- 2 δ}}{{\{1 + λ {[C^{- 1} (v_{i})]}^{- δ}\}}^{2}} \end{matrix}

(A4)

\begin{matrix} \frac{\partial^{2} ℓ_{I - I I - U D a} (θ; v)}{\partial λ δ} = (β + 1) \sum_{i = 1}^{n} \frac{{[C^{- 1} (v_{i})]}^{- δ} ln [C^{- 1} (v_{i})]}{{\{1 + λ {[C^{- 1} (v_{i})]}^{- δ}\}}^{2}} \end{matrix}

(A5)

\begin{matrix} \frac{\partial^{2} ℓ_{I - I I - U D a} (θ; v)}{\partial δ^{2}} = - \frac{n}{δ^{2}} - λ (β + 1) \sum_{i = 1}^{n} \frac{{[C^{- 1} (v_{i})]}^{- δ} {\{ln [C^{- 1} (v_{i})]\}}^{2}}{{\{1 + λ {[C^{- 1} (v_{i})]}^{- δ}\}}^{2}} . \end{matrix}

(A6)

The elements of the expected Fisher information matrix are functions of the following expectation, with respect to the density function of the rv V

\begin{matrix} E_{j_{1}, j_{2}, j_{3}} = E_{V} \{\frac{{[C^{- 1} (V)]}^{- j_{1} δ} {(ln [C^{- 1} (V)])}^{j_{2}}}{{(1 + λ {[C^{- 1} (V)]}^{- δ})}^{j_{3}}}\} \end{matrix}

(A7)

We now observe that for a generic function

h (.)

of

C^{- 1} (V)

, by a simple transformation of the variable, we can write

\begin{matrix} E_{V} \{h [C^{- 1} (V)]\} & = & \int_{0}^{1} h [C^{- 1} (v)] f_{V} (v; θ) d v = \\ = & \int_{0}^{1} h [C^{- 1} (v)] β λ δ {[C^{- 1} (v)]}^{- δ - 1} {\{1 + λ {[C^{- 1} (v)]}^{- δ}\}}^{- β - 1} [- \frac{\partial C^{- 1} (v)}{\partial v}] d v = \\ = & \int_{0}^{\infty} h [y] β λ δ {[y]}^{- δ - 1} {\{1 + λ {[y]}^{- δ}\}}^{- β - 1} d y = E_{Y} [h (Y)] \end{matrix}

where

Y \sim D a (β, λ, δ)

. Using this last observation, Equation (A7) can be rewritten as

\begin{matrix} E_{j_{1}, j_{2}, j_{3}} & = & E_{Y} \{\frac{{[Y]}^{- j_{1} δ} {(ln [Y])}^{j_{2}}}{{(1 + λ {[Y]}^{- δ})}^{j_{3}}}\} \\ = & \int_{0}^{\infty} \frac{y^{- j_{1} δ} {(ln (y))}^{j_{2}}}{{(1 + λ y^{- δ})}^{j_{3}}} β λ δ y^{- δ - 1} {(1 + λ y^{- δ})}^{- β - 1} d y \\ = & \frac{β}{λ^{j_{1}} δ^{j_{2}}} \int_{0}^{1} y^{β - j_{1} + j_{3} - 1} {(1 - y)}^{\frac{2}{δ} + j_{1}} {[ln (λ) + ln (y) - ln (1 - y)]}^{j_{2}} d y \end{matrix}

(A8)

Using the successive derivatives of the beta function, the expectations for determining the elements of the Fisher information matrix are

\begin{matrix} E_{1, 0, 1} = \frac{β}{λ} B (β, 2 + \frac{2}{δ}) \end{matrix}

(A9)

\begin{matrix} E_{2, 0, 2} = \frac{β}{λ^{2}} B (β, 3 + \frac{2}{δ}) \end{matrix}

(A10)

\begin{matrix} E_{1, 1, 1} = \frac{β}{λ δ} \{ln (λ) B (β, 2 + \frac{2}{δ}) + A_{1} (β, 2 + \frac{2}{λ δ}) + A_{2} (β + 1, 2 + \frac{2}{δ})\} \end{matrix}

(A11)

\begin{matrix} E_{1, 1, 2} = \frac{β}{λ δ} \{ln (λ) B (β + 1, 2 + \frac{2}{δ}) + A_{1} (β, 3 + \frac{2}{λ δ}) + A_{2} (β + 1, 2 + \frac{2}{δ})\} \end{matrix}

(A12)

\begin{matrix} E_{1, 2, 2} & = & \frac{β}{λ δ^{2}} \{{[ln (λ)]}^{2} B (β + 1, 2 + \frac{2}{δ}) + 2 ln (λ) A_{1} (β + 1, 2 + \frac{2}{λ δ}) - 2 ln (λ) A_{2} (β + 1, 2 + \frac{2}{δ}) \\ + & A_{3} (β + 1, 2 + \frac{2}{δ}) - 2 A_{5} (β + 1, 2 + \frac{2}{δ}) + A_{4} (β + 1, 2 + \frac{2}{δ})\} \end{matrix}

(A13)

where

\begin{matrix} A_{1} (p, q) = B (p, q) \{ψ (p) - ψ (p + q)\} \end{matrix}

\begin{matrix} A_{2} (p, q) = B (p, q) \{ψ (q) - ψ (p + q)\} \end{matrix}

\begin{matrix} A_{3} (p, q) = B (p, q) \{{[ψ (p) - ψ (p + q)]}^{2} + [ψ^{^{'}} (p) - ψ^{^{'}} (p + q)]\} \end{matrix}

\begin{matrix} A_{4} (p, q) = B (p, q) \{{[ψ (q) - ψ (p + q)]}^{2} + [ψ^{^{'}} (q) - ψ^{^{'}} (p + q)]\} \end{matrix}

\begin{matrix} A_{5} (p, q) = B (p, q) \{[ψ (q) - ψ (p + q)] [ψ (p) - ψ (p + q)] + [ψ^{^{'}} (p) - ψ^{^{'}} (p + q)]\} \end{matrix}

with

ψ (.)

and

ψ^{^{'}} (.)

being digamma and trigamma functions, respectively.

In order to compute the expected Fisher information matrix, we compute the elements of the Hessian matrix, which, after some algebraic manipulation, for

j, s = 1, 2, 3

and

r_{j} = 1, 2, \dots, p_{j}

and

r_{s} = 1, 2, \dots, p_{s}

, turn out to be

\begin{matrix} \frac{\partial^{2} ℓ (γ; v, X)}{\partial γ_{j, r_{j}} \partial γ_{s, r_{s}}} & = & k_{1} + k_{2} - k_{3} + k_{4} ln (1 + {\tilde{λ}}_{i} {[C^{- 1} (v_{i})]}^{- {\tilde{δ}}_{i}}) + k_{6} \frac{{[C^{- 1} (v_{i})]}^{- {\tilde{δ}}_{i}}}{1 + {\tilde{λ}}_{i} {[C^{- 1} (v_{i})]}^{- {\tilde{δ}}_{i}}} \\ + & k_{7} \frac{{[C^{- 1} (v_{i})]}^{- {\tilde{δ}}_{i}} ln [C^{- 1} (v_{i})]}{1 + {\tilde{λ}}_{i} {[C^{- 1} (v_{i})]}^{- {\tilde{δ}}_{i}}} + (\frac{1}{{\tilde{β}}_{1, i}} + 1) \\ \times & \{(\frac{\partial {\tilde{λ}}_{i}}{\partial γ_{j, r_{j}}}) (\frac{\partial {\tilde{δ}}_{i}}{\partial γ_{s, r_{s}}}) \frac{{[C^{- 1} (v_{i})]}^{- {\tilde{δ}}_{i}} (ln [C^{- 1} (v_{i})])}{{(1 + {\tilde{λ}}_{i} {[C^{- 1} (v_{i})]}^{- {\tilde{δ}}_{i}})}^{2}} \\ + & (\frac{\partial {\tilde{λ}}_{i}}{\partial γ_{j, r_{j}}}) (\frac{\partial {\tilde{λ}}_{i}}{\partial γ_{s, r_{s}}}) \frac{{[C^{- 1} (v_{i})]}^{- 2 {\tilde{δ}}_{i}}}{{(1 + {\tilde{λ}}_{i} {[C^{- 1} (v_{i})]}^{- {\tilde{δ}}_{i}})}^{2}} \\ - & {\tilde{λ}}_{i} (\frac{\partial {\tilde{δ}}_{i}}{\partial γ_{j, r_{j}}}) (\frac{\partial {\tilde{δ}}_{i}}{\partial γ_{s, r_{s}}}) \frac{{[C^{- 1} (v_{i})]}^{- {\tilde{δ}}_{i}} {(ln [C^{- 1} (v_{i})])}^{2}}{{(1 + {\tilde{λ}}_{i} {[C^{- 1} (v_{i})]}^{- {\tilde{δ}}_{i}})}^{2}} \\ - & {\tilde{λ}}_{i} (\frac{\partial {\tilde{δ}}_{i}}{\partial γ_{j, r_{j}}}) (\frac{\partial {\tilde{λ}}_{i}}{\partial γ_{s, r_{s}}}) \frac{{[C^{- 1} (v_{i})]}^{- 2 {\tilde{δ}}_{i}} (ln [C^{- 1} (v_{i})])}{{(1 + {\tilde{λ}}_{i} {[C^{- 1} (v_{i})]}^{- {\tilde{δ}}_{i}})}^{2}}\} \end{matrix}

(A14)

where

k_{1} = \frac{1}{{\tilde{λ}}_{i}} [- \frac{1}{{\tilde{λ}}_{i}} (\frac{\partial {\tilde{λ}}_{i}}{\partial γ_{s, r_{s}}}) (\frac{\partial {\tilde{λ}}_{i}}{\partial γ_{j, r_{j}}}) + (\frac{\partial^{2} {\tilde{λ}}_{i}}{\partial γ_{j, r_{j}} \partial γ_{s, r_{s}}})]

,

k_{2} = \frac{1}{{\tilde{δ}}_{i}} [- \frac{1}{{\tilde{δ}}_{i}} (\frac{\partial {\tilde{δ}}_{i}}{\partial γ_{s, r_{s}}}) (\frac{\partial {\tilde{δ}}_{i}}{\partial γ_{j, r_{j}}}) + (\frac{\partial^{2} {\tilde{δ}}_{i}}{\partial γ_{j, r_{j}} \partial γ_{s, r_{s}}})]

,

k_{3} = \frac{1}{{\tilde{β}}_{1, i}} [- \frac{1}{{\tilde{β}}_{1, i}} (\frac{\partial {\tilde{β}}_{1, i}}{\partial γ_{s, r_{s}}}) (\frac{\partial {\tilde{β}}_{1, i}}{\partial γ_{j, r_{j}}}) + (\frac{\partial^{2} {\tilde{β}}_{1, i}}{\partial γ_{j, r_{j}} \partial γ_{s, r_{s}}})]

,

k_{4} = \frac{1}{{\tilde{β}}_{1, i}^{2}} [- \frac{2}{{\tilde{β}}_{1, i}} (\frac{\partial {\tilde{β}}_{1, i}}{\partial γ_{s, r_{s}}}) (\frac{\partial {\tilde{β}}_{1, i}}{\partial γ_{j, r_{j}}}) + (\frac{\partial^{2} {\tilde{β}}_{1, i}}{\partial γ_{j, r_{j}} \partial γ_{s, r_{s}}})]

,

k_{5} = (\frac{\partial {\tilde{λ}}_{i}}{\partial γ_{s, r_{s}}}) (\frac{\partial {\tilde{λ}}_{i}}{\partial γ_{j, r_{j}}}) + {\tilde{λ}}_{i} (\frac{\partial^{2} {\tilde{δ}}_{i}}{\partial γ_{j, r_{j}} \partial γ_{s, r_{s}}})

,

k_{6} = \frac{1}{{\tilde{β}}_{1, i}^{2}} (\frac{\partial {\tilde{β}}_{1, i}}{\partial γ_{j, r_{j}}}) (\frac{\partial {\tilde{λ}}_{i}}{\partial γ_{s, r_{s}}}) - (\frac{1}{{\tilde{β}}_{1, i}} + 1) (\frac{\partial^{2} {\tilde{λ}}_{i}}{\partial γ_{j, r_{j}} \partial γ_{s, r_{s}}})

and

k_{7} = k_{5} (\frac{1}{{\tilde{β}}_{1, i}} + 1) - \frac{{\tilde{λ}}_{i}}{{\tilde{β}}_{1, i}} (\frac{\partial {\tilde{β}}_{1, i}}{\partial γ_{j, r_{j}}}) (\frac{\partial {\tilde{δ}}_{i}}{\partial γ_{s, r_{s}}})

.

The elements of the expected Fisher information matrix are functions of the following expectations:

\begin{matrix} E \{\frac{{[C^{- 1} (V)]}^{- {\tilde{δ}}_{i}}}{(1 + {\tilde{λ}}_{i} {[C^{- 1} (V)]}^{- {\tilde{δ}}_{i}})}\} = E_{1, 0, 1}; E \{\frac{{[C^{- 1} (V)]}^{- {\tilde{δ}}_{i}} ln [C^{- 1} (V)]}{(1 + {\tilde{λ}}_{i} {[C^{- 1} (V)]}^{- {\tilde{δ}}_{i}})}\} = E_{1, 1, 1} \end{matrix}

(A15)

\begin{matrix} E \{\frac{{[C^{- 1} (V)]}^{- {\tilde{δ}}_{i}} ln [C^{- 1} (V)]}{{(1 + {\tilde{λ}}_{i} {[C^{- 1} (V)]}^{- {\tilde{δ}}_{i}})}^{2}}\} = E_{1, 1, 2}; E \{\frac{{[C^{- 1} (V)]}^{- 2 {\tilde{δ}}_{i}}}{{(1 + {\tilde{λ}}_{i} {[C^{- 1} (V)]}^{- {\tilde{δ}}_{i}})}^{2}}\} = E_{2, 0, 2} \end{matrix}

(A16)

\begin{matrix} E \{\frac{{[C^{- 1} (V)]}^{- {\tilde{δ}}_{i}} {(ln [C^{- 1} (V)])}^{2}}{{(1 + {\tilde{λ}}_{i} {[C^{- 1} (V)]}^{- {\tilde{δ}}_{i}})}^{2}}\} = E_{1, 2, 2} \end{matrix}

(A17)

\begin{matrix} E_{2, 1, 2} & = & E \{\frac{{[C^{- 1} (V)]}^{- 2 {\tilde{δ}}_{i}} ln [C^{- 1} (V)]}{{(1 + {\tilde{λ}}_{i} {[C^{- 1} (V)]}^{- {\tilde{δ}}_{i}})}^{2}}\} = E_{Y} \{\frac{Y^{- 2 {\tilde{δ}}_{i}} ln (Y)}{{(1 + {\tilde{λ}}_{i} Y^{- {\tilde{δ}}_{i}})}^{2}}\} \\ = & \frac{1}{{\tilde{β}}_{1, i} {\tilde{δ}}_{i} {\tilde{λ}}_{i}^{2}} \{ln ({\tilde{λ}}_{i}) B (\frac{1}{{\tilde{β}}_{1, i}}, 5 + \frac{2}{{\tilde{δ}}_{i}}) + A_{1} (\frac{1}{{\tilde{β}}_{1, i}}, 5 + \frac{2}{{\tilde{δ}}_{i}}) - A_{2} (\frac{1}{{\tilde{β}}_{1, i}}, 5 + \frac{2}{{\tilde{δ}}_{i}})\} \end{matrix}

(A18)

and finally

\begin{matrix} E_{V} \{ln (1 + {\tilde{λ}}_{i} {[C^{- 1} (V)]}^{- {\tilde{δ}}_{i}})\} & = & E_{Y} \{ln [(1 + {\tilde{λ}}_{i} Y^{- {\tilde{δ}}_{i}})]\} \\ = & - \frac{1}{{\tilde{β}}_{1, i}} A_{1} (\frac{1}{{\tilde{β}}_{1, i}}, 3 + \frac{2}{{\tilde{δ}}_{i}}) . \end{matrix}

(A19)

Appendix A.4. Partial Derivatives of System (39)

Evidently, in system (39), the partial derivatives are given by

\begin{matrix} \frac{\partial {\tilde{β}}_{1, i}}{\partial γ_{1, r_{1}}} = \frac{\partial h_{1} (x_{1, i}, γ_{1})}{\partial γ_{1, r_{1}}} for r_{1} = 1, \dots, p_{1} \end{matrix}

\begin{matrix} \frac{\partial {\tilde{λ}}_{i}}{\partial γ_{1, r_{1}}} = \frac{\partial \tilde{λ} (β_{1, i}, m e_{i}, v {(q)}_{i})}{\partial γ_{1, r_{1}}} = \frac{\partial \tilde{λ} (β_{1, i}, m e_{i}, v {(q)}_{i})}{\partial β_{1, i}} (\frac{\partial β_{1, i}}{\partial γ_{1, r_{1}}}) for r_{1} = 1, \dots, p_{1} \end{matrix}

\begin{matrix} \frac{\partial {\tilde{λ}}_{i}}{\partial γ_{2, r_{2}}} = \frac{\partial \tilde{λ} (β_{1, i}, m e_{i}, v {(q)}_{i})}{\partial γ_{2, r_{2}}} = \frac{\partial \tilde{λ} (β_{1, i}, m e_{i}, v {(q)}_{i})}{\partial m e_{i}} (\frac{\partial m e_{i}}{\partial γ_{2, r_{2}}}) for r_{2} = 1, \dots, p_{2} \end{matrix}

\begin{matrix} \frac{\partial {\tilde{λ}}_{i}}{\partial γ_{3, r_{3}}} = \frac{\partial \tilde{λ} (β_{1, i}, m e_{i}, v {(q)}_{i})}{\partial γ_{3, r_{3}}} = \frac{\partial \tilde{λ} (β_{1, i}, m e_{i}, v {(q)}_{i})}{\partial v {(q)}_{i}} (\frac{\partial v {(q)}_{i}}{\partial γ_{3, r_{3}}}) for r_{3} = 1, \dots, p_{3} \end{matrix}

\begin{matrix} \frac{\partial {\tilde{δ}}_{i}}{\partial γ_{1, r_{1}}} = \frac{\partial \tilde{δ} (β_{1, i}, m e_{i}, v {(q)}_{i})}{\partial γ_{1, r_{1}}} = \frac{\partial \tilde{δ} (β_{1, i}, m e_{i}, v {(q)}_{i})}{\partial β_{1, i}} (\frac{\partial β_{1, i}}{\partial γ_{1, r_{1}}}) for r_{1} = 1, \dots, p_{1} \end{matrix}

\begin{matrix} \frac{\partial {\tilde{δ}}_{i}}{\partial γ_{2, r_{2}}} = \frac{\partial \tilde{δ} (β_{1, i}, m e_{i}, v {(q)}_{i})}{\partial γ_{2, r_{2}}} = \frac{\partial \tilde{δ} (β_{1, i}, m e_{i}, v {(q)}_{i})}{\partial m e_{i}} (\frac{\partial m e_{i}}{\partial γ_{2, r_{2}}}) for r_{2} = 1, \dots, p_{2} \end{matrix}

\begin{matrix} \frac{\partial {\tilde{δ}}_{i}}{\partial γ_{3, r_{3}}} = \frac{\partial \tilde{δ} (β_{1, i}, m e_{i}, v {(q)}_{i})}{\partial γ_{3, r_{3}}} = \frac{\partial \tilde{δ} (β_{1, i}, m e_{i}, v {(q)}_{i})}{\partial v {(q)}_{i}} (\frac{\partial v {(q)}_{i}}{\partial γ_{3, r_{3}}}) for r_{3} = 1, \dots, p_{3} \end{matrix}

Moreover, by specifying the appropriate link functions of indicators of interest

{\tilde{β}}_{1, i} = e^{x_{1, i}^{^{'}} γ_{1}}, m e_{i} = \frac{e^{x_{2, i}^{^{'}} γ_{2}}}{1 + e^{x_{2, i}^{^{'}} γ_{2}}}, v {(q)}_{i} = \frac{e^{x_{3, i}^{^{'}} γ_{3}}}{1 + e^{x_{3, i}^{^{'}} γ_{3}}},

we have

\frac{\partial β_{1, i}}{\partial γ_{1, r_{1}}} = e^{x_{1, i}^{^{'}} γ_{1}} x_{1, r_{1}, i}, \frac{\partial m e_{i}}{\partial γ_{2, r_{2}}} = \frac{e^{x_{2, i}^{^{'}} γ_{2}}}{{(1 + e^{x_{2, i}^{^{'}} γ_{2}})}^{2}} x_{2, r_{2}, i}

and

\frac{\partial v {(q)}_{i}}{\partial γ_{3, r_{3}}} = \frac{e^{x_{3, i}^{^{'}} γ_{3}}}{{(1 + e^{x_{3, i}^{^{'}} γ_{3}})}^{2}} x_{3, r_{3}, i}

.

References

Alzaatreh, A.; Famoye, F.; Lee, C. A New Method for Generating Families of Continuous Distributions. Metron 2013, 71, 63–79. [Google Scholar] [CrossRef]
Eugene, N.; Lee, C.; Famoye, F. Beta-normal distribution and its application. Commun. Stat. Theory Methods 2002, 31, 497–512. [Google Scholar] [CrossRef]
Jones, M. Families of distributions arising from the distributions of order statistics. Test 2004, 13, 1–43. [Google Scholar] [CrossRef]
Kumaraswamy, P. A generalized probability density function for double-bounded random processes. J. Hydrol. 1980, 46, 79–88. [Google Scholar] [CrossRef]
Topp, C.; Leone, F. A family of J-shaped frequency functions. J. Am. Stat. Assoc. 1955, 50, 209–219. [Google Scholar] [CrossRef]
Arnold, B.; Groeneveld, R. Some properties of the arcsine distribution. J. Am. Stat. Assoc. 1980, 75, 173–175. [Google Scholar] [CrossRef]
Van Dorp, R.; Kotz, S. The standard two-sided power distribution and its properties. Am. Stat. 2002, 56, 90–99. [Google Scholar] [CrossRef]
Kotz, S.; Van Dorp, J.R. Beyond Beta: Other Continuous Families of Distributions with Bounded Support and Applications; World Scientific Publishing Co.: Singapore, 2004. [Google Scholar]
Marshall, A.W.; Olkin, I. Life Distributions; Springer: New York, NY, USA, 2007. [Google Scholar]
Modi, K.; Gill, V. Unit Burr III distribution with application. J. Stat. Manag. Syst. 2019, 23, 579–592. [Google Scholar] [CrossRef]
Singh, D.P.; Jha, M.; Tripathi, Y.; Wang, L. Reliability estimation in a multicomponent stress-strength model for unit Burr III distribution under progressive censoring. Qual. Technol. Quant. Manag. 2022, 19, 605–632. [Google Scholar] [CrossRef]
Mazucheli, J.; Menezes, A.; Chakraborty, S. On the one parameter unit-Lindley distribution and its associated regression model for proportion data. J. Appl. Stat. 2019, 46, 700–714. [Google Scholar] [CrossRef]
Mazucheli, J.; Menezes, A.; Dey, S. Unit-Gompertz distribution with applications. Statistica 2019, 79, 26–43. [Google Scholar]
Korkmaz, M.; Chesneau, C. On the unit Burr-XII distribution with the quantile regression modeling and applications. Comput. Appl. Math. 2021, 40, 29. [Google Scholar] [CrossRef]
Ghitany, M.; Mazucheli, J.; Menezes, A.; Alqallaf, F. The unit-inverse Gaussian distribution: A new alternative to two-parameter distributions on the unit interval. Commun. Stat. Theory Methods 2018, 48, 3423–3438. [Google Scholar] [CrossRef]
Korkmaz, M.; Chesneau, C.; Korkmaz, Z. On the arcsecant hyperbolic normal distribution. Properties, quantile regression modeling and applications. Symmetry 2021, 13, 117. [Google Scholar] [CrossRef]
Korkmaz, M. A new heavy-tailed distribution defined on the bounded interval: The logit slash distribution and its applications. J. Appl. Stat. 2019, 473, 2097–2119. [Google Scholar] [CrossRef]
Arslan, T. A new family of unit-distributions: Definition, properties and applications. Twms J. Appl. Eng. Math. 2023, 13, 782–791. [Google Scholar]
Ferreira, A.; Mazucheli, J. The zero-inflated, one and zero-and-one-inflated new unit-Lindley distributions. Braz. J. Biom. 2022, 40, 291–326. [Google Scholar] [CrossRef]
Rodrigues, J.; Bazán, J.; Suzuki, A.K. A flexible procedure for formulating probability distributions on the unit interval with applications. Commun. Stat. Theory Methods 2020, 49, 738–754. [Google Scholar] [CrossRef]
Aljarrah, M.; Lee, C.; Famoye, F. On generating T − X family of distributions using quantile functions. J. Stat. Distrib. Appl. 2014, 1, 2. [Google Scholar] [CrossRef]
Bakouch, H.; Nik, A.; Asgharzadeh, A.; Salinas, H. A flexible probability model for proportion data: Unit-half-normal distribution. Commun. Stat. Case Stud. Data Anal. Appl. 2021, 7, 271–288. [Google Scholar] [CrossRef]
Haq, M.; Hashmi, S.; Aidi, K.; Ramos, P.F.L. Unit Modified Burr-III Distribution: Estimation, Characterizations and Validation Test. Ann. Data Sci. 2023, 10, 415–449. [Google Scholar] [CrossRef]
Mazucheli, J.; Leiva, V.; Alves, B.; Menezes, A. A New Quantile Regression for Modeling Bounded Data under a Unit Birnbaum–Saunders Distribution with Applications in Medicine and Politics. Symmetry 2021, 13, 682. [Google Scholar] [CrossRef]
Mazucheli, J.; Menezes, A.; Dey, S. The unit Birnbaum-Saunders distribution with applications. Chil. J. Stat. 2018, 9, 47–57. [Google Scholar]
Nasiru, S.; Abubakari, A.; Angbing, I. Bounded Odd Inverse Pareto Exponential Distribution: Properties, Estimation, and Regression. Int. J. Math. Math. Sci. 2021, 2021, 9955657. [Google Scholar] [CrossRef]
Altun, E.; El-Morshedy, M.; Eliwa, M. A new regression model for bounded response variable: An alternative to the beta and unit Lindley regression models. PLoS ONE 2021, 16, e0245627. [Google Scholar] [CrossRef] [PubMed]
Domma, F.; Condino, F.; Giordano, S. A New Formulation of the Dagum Distribution in terms of Income Inequality and Poverty Measures. Physica A Stat. Mech. Its Appl. 2018, 511, 104–126. [Google Scholar] [CrossRef]
Domma, F.; Condino, F.; Franceschi, S.; De Luca, D.; Biondi, D. On the extreme hydrologic events determinants by means of Beta-Singh-Maddala reparameterization. Sci. Rep. 2022, 12, 15537. [Google Scholar] [CrossRef]
Dagum, C. A New Model of Personal Distribution: Specification and Estimation; Springer: New York, NY, USA, 1977; pp. 413–437. [Google Scholar]
Dagum, C. The Generation and Distribution of Income, the Lorenz Curve and the Gini Ratio. 1980. Available online: https://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=PASCAL8130438924 (accessed on 23 June 2023).
Latorre, G. Proprieta’ campionarie del modello di Dagum per la distribuzione dei redditi. Statistica 1988, 48, 15–27. [Google Scholar]
Kleiber, C.; Kotz, S. Statistical Size Distributions in Economics and Actuarial Science; Wiley Series in Probability and Statistics; Wiley Interscience, John Wiley and Sons Inc.: Hoboken, NJ, USA, 2003. [Google Scholar]
Cribari-Neto, F.; Zeileis, A. Beta Regression in R. J. Stat. Softw. 2020, 34, 1–24. [Google Scholar]
Ferrari, S.; Cribari Neto, F. Beta regression for modelling rates and proportions. J. Appl. Stat. 2004, 31, 799–815. [Google Scholar] [CrossRef]
Song, P.; Tan, M. Marginal models for longitudinal continuous proportional data. Biometrics 2000, 56, 496–502. [Google Scholar] [CrossRef] [PubMed]
Gómez-Déniz, E.; Sordo, M.; Calderín-Ojeda, E. The log-Lindley distribution as an alternative to the beta regression model with applications in insurance. Insur. Math. Econ. 2014, 54, 49–57. [Google Scholar] [CrossRef]
Altun, E. The log-weighted exponential regression model: Alternative to the beta regression model. Commun. Stat. Theory Methods 2021, 50, 2306–2321. [Google Scholar] [CrossRef]
Mousa, A.; El-Sheikh, A.; Abdel-Fattah, M. A gamma regression for bounded continuous variables. Adv. Appl. Stat. 2016, 49, 305–326. [Google Scholar] [CrossRef]
Mitnik, P.; Baek, S. The Kumaraswamy distribution: Median-dispersion re-parameterizations for regression modeling and simulation-based estimation. Stat. Pap. 2013, 54, 177–192. [Google Scholar] [CrossRef]
Lemonte, A.; Moreno-Arenas, G. On a heavy-tailed parametric quantile regression model for limited range response variables. Comput. Stat. 2020, 35, 379–398. [Google Scholar] [CrossRef]
Jodrá, P.; Jiménez-Gamero, M. A quantile regression model for bounded responses based on the exponential-geometric distribution. Revstat 2020, 4, 415–436. [Google Scholar]
Paz, R.; Balakrishnan, N.; Bazán, J. L-logistic regression models: Prior sensitivity analysis, robustness to outliers and applications. Braz. J. Probab. Stat. 2019, 33, 455–479. [Google Scholar]
Mazucheli, J.; Menezes, A.; Fernandes, L.; de Oliveira, R.; Ghitany, M. The unit Weibull distribution as an alternative to the Kumaraswamy distribution for the modeling of quantiles conditional on covariates. J. Appl. Stat. 2020, 47, 954–974. [Google Scholar] [CrossRef]

Figure 1. Pdf of the type I unit-Dagum model for different values of parameters.

Figure 2. Hazard rate of the type I unit-Dagum model for different values of parameters.

Figure 3. Pdf of II-UDa for different values of parameters.

Figure 4. Hazard rate of II-UDa for different values of parameters.

Figure 5. Empirical and fitted distributions for I-Da, II-Da, beta, and KW models.

Figure 6. Empirical and fitted distributions for I-Da, II-Da, and beta models. The solid lines and the dotted lines refer to the average values of covariates obtained, respectively, for

v \leq 0.5

and

v > 0.5

.

Figure 7. Empirical and fitted distributions for I-Da, II-Da, and beta models. Different curves refer to dyslexic and non-dyslexic subjects, considering the average iq level for each group.

Table 1. Unit distributions proposed in the literature according to the used transformation.

Distribution, Reference	$C (Y; a)$	$f_{V} (v; θ; a)$	Parameter Space
Unit-Burr III, [10]	$Y / (1 + Y)$	$θ_{1} θ_{2} {[1 + {(\frac{1}{v} - 1)}^{θ_{2}}]}^{- θ_{1} - 1} {(\frac{1}{v} - 1)}^{θ_{2} - 1} \frac{1}{v^{2}}$	$θ_{1}, θ_{2} > 0$
Unit-Half-Normal, [22]	$Y / (1 + Y)$	$\frac{2}{θ {(1 - v)}^{2}} ϕ (\frac{v}{θ (1 - v)})$	$θ > 0$
Unit-modified Burr III, [23]	$Y / (1 + Y)$	$θ_{1} θ_{2} v^{- 2} {(\frac{1 - v}{v})}^{θ_{2} - 1} {[1 + θ_{3} {(\frac{1 - v}{v})}^{θ_{2}}]}^{- \frac{θ_{1}}{θ_{3}} - 1}$	$θ_{1}, θ_{2}, θ_{3} > 0$
Unit-Lindley, [12]	$Y / (1 + Y)$	$\frac{θ^{2}}{1 + θ} {(1 - v)}^{- 3} exp (- \frac{θ v}{1 - v})$	$θ > 0$
Unit-Gompertz, [13]	$exp {- Y}$	$θ_{1} θ_{2} v^{- (θ_{2} + 1)} exp [- θ_{1} (v^{- θ_{2}} - 1)]$	$θ_{1}, θ_{2} > 0$
Unit-Birnbaum-Saunders, [24,25]	$exp {- Y}$	$\frac{1}{2 v θ_{1} θ_{2} \sqrt{2 π}} [{(- \frac{θ_{2}}{log v})}^{1 / 2} + {(- \frac{θ_{2}}{log v})}^{3 / 2}] exp [\frac{1}{2 θ_{1}^{2}} (\frac{log v}{θ_{2}} + \frac{θ_{2}}{log v} + 2)]$	$θ_{1}, θ_{2} > 0$
Bounded Odd inv. Pareto exp., [26]	$exp {- Y}$	$\frac{θ_{1} θ_{2} θ_{3} v^{θ_{3} - 1} {(1 - v^{θ_{3}})}^{θ_{1} - 1}}{{(1 - (1 - θ_{2}) v^{θ_{3}})}^{θ_{1} + 1}}$	$θ_{1}, θ_{2}, θ_{3} > 0$
Unit-Burr XII, [14]	$exp {- Y}$	$θ_{1} θ_{2} v^{- 1} {(- log v)}^{θ_{2} - 1} {(1 + {(- log v)}^{θ_{2}})}^{- θ_{1} - 1}$	$θ_{1}, θ_{2} > 0$
Log-Bilal, [27]	$exp {- Y}$	$\frac{6}{θ} v^{2 / θ - 1} (1 - v^{1 / θ})$	$θ > 0$
Unit-inverse Gaussian, [15]	$exp {- Y}$	$\sqrt{\frac{θ_{1}}{2 π}} \frac{1}{v {(- log v)}^{3 / 2}} exp [- \frac{θ_{1}}{2 {(θ_{2})}^{2} log v} {(log v + θ_{2})}^{2}]$	$θ_{1}, θ_{2} > 0$
Unit-Burr III, [11]	$exp {- Y}$	$θ_{1} θ_{2} \frac{{(l o g (1 / v))}^{- θ_{2} - 1}}{v {(1 + {(l o g (1 / v))}^{- θ_{2}})}^{θ_{1} + 1}}$	$θ_{1}, θ_{2} > 0$
Logit slash, [17]	$1 / [1 + exp {- Y}]$	$\frac{θ_{1}}{v (1 - v) θ_{3}} \int_{0}^{1} t^{θ_{1}} ϕ (t [\frac{l o g (\frac{v}{1 - v}) - θ_{2}}{θ 3}]) d t$	$θ_{1}, θ_{3} > 0; - \infty < θ_{2} < \infty$

Table 2. MLEs, corresponding standard errors (in brackets), and AIC values for I-Da, II-Da, beta, and KW models in food expenditure and reading skills data.

	I-UDa	II-UDa	Beta	KW
	Food Expenditures
$β$	0.484 (1.618)	0.410 (1.631)	6.070 (1.358)	2.954 (0.309)
$λ$	30,279.37 (34.380)	62.520 (6.985)	14.819 (3.398)	26.964 (8.700)
$δ$	13.457 (1.326)	10.056 (1.360)
AIC	−67.337	−67.400	−66.693	−62.978
	Reading Skills
$β$	0.044 (1.661)	0.038 (1.670)	2.514 (0.578)	2.694 (0.589)
$λ$	529.384 (29.316)	0.004 (15.152)	0.675 (0.123)	0.665 (0.121)
$δ$	24.287 (1.604)	14.917 (1.617)
AIC	−65.366	−64.134	−48.841	−49.218

Table 3. MLEs, corresponding standard errors, and Wald test results for the I-Da and II-Da regression models for food expenditure data.

	I-UDa
	Estimate	SE	z	p-Value
	$I_{1} = β_{1}$ (log-link)
Intercept	−0.333	0.506	−0.659	0.51
	$I_{2} = m e$ (logit-link)
Intercept	−0.544	0.016	−33.974	$< 0.001$
income	−0.009	0.001	−16.758	$< 0.001$
	$I_{3} = v (0.9)$ (logit-link)
Intercept	−0.727	0.022	−33.362	$< 0.001$
income	−0.008	0.000	−21.604	$< 0.001$
persons	0.167	0.022	7.425	$< 0.001$
	AIC = −92.29
	II-UDa
	Estimate	SE	z	p-Value
	$I_{1} = β_{1}$ (log-link)
Intercept	−3.672	4.586	−0.801	0.423
	$I_{2} = m e$ (logit-link)
Intercept	−0.552	0.015	−36.371	$< 0.001$
income	−0.009	0.000	−18.971	$< 0.001$
	$I_{3} = v (0.9)$ (logit-link)
Intercept	−0.724	0.014	−50.203	$< 0.001$
income	−0.008	0.000	−25.635	$< 0.001$
persons	0.160	0.015	10.351	$< 0.001$
	AIC = −97.25

Table 4. MLEs, corresponding standard errors, and Wald test results for the I-Da and II-Da regression models for reading skills data.

	I-UDa
	Estimate	SE	z	p-Value
	$I_{1} = β_{1}$ (log-link)
Intercept	−0.9626	0.9008	−1.069	0.285
	$I_{2} = m e$ (logit-link)
Intercept	1.593	0.175	9.107	$< 0.001$
dyslexia	−1.119	0.167	−6.707	$< 0.001$
iq	0.504	0.093	5.389	$< 0.001$
dyslexia × iq	−0.512	0.094	−5.451	$< 0.001$
	$I_{3} = v (0.9)$ (logit-link)
Intercept	2.69045	0.03643	73.85	$< 0.001$
Dyslexia	−1.914	0.03643	−52.544	$< 0.001$
	AIC = −139.32
	II-UDa
	Estimate	SE	z	p-Value
	$I_{1} = β_{1}$ (log-link)
Intercept	−0.7247	0.80705	−0.898	0.369
	$I_{2} = m e$ (logit-link)
Intercept	1.60156	0.18082	8.857	$< 0.001$
dyslexia	−1.1376	0.17224	−6.605	$< 0.001$
iq	0.49774	0.09663	5.151	$< 0.001$
dyslexia × iq	−0.5047	0.09701	−5.202	$< 0.001$
	$I_{3} = v (0.9)$ (logit-link)
Intercept	2.69356	0.03759	71.648	$< 0.001$
dyslexia	−1.9168	0.03768	−50.873	$< 0.001$
	AIC = −137.41

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Unit Distributions: A General Framework, Some Special Cases, and the Regression Unit-Dagum Models

Abstract

1. Introduction

2. General Framework

3. Two Kinds of Unit-Dagum Distributions

3.1. The First Kind of Unit-Dagum Distribution

3.2. A Second Kind of Unit-Dagum Distribution

4. Inference

5. Unit-Dagum Regression Models

6. Applications

6.1. Modeling Food/Income and Accuracy Data

6.2. Considering the Covariates: The Regression Models

7. Concluding Remarks

Author Contributions

Funding

Conflicts of Interest

Appendix A

Appendix A.1. Proof of Proposition 1

Appendix A.2. Proof of Proposition 2

Appendix A.3. Fisher Information Matrix

Appendix A.4. Partial Derivatives of System (39)

References

Article Metrics

Citations

Article Access Statistics