Birnbaum-Saunders Quantile Regression Models with Application to Spatial Data

Luis Sánchez; Víctor Leiva; Manuel Galea; Helton Saulo

doi:10.3390/math8061000

¹ Department of Mathematics and Statistics, Universidad de La Frontera, Temuco 4780000, Chile

² School of Industrial Engineering, Pontificia Universidad Católica de Valparaíso, Valparaíso 2362807, Chile

³ Department of Statistics, Pontificia Universidad Católica de Chile, Santiago 8320000, Chile

⁴ Department of Statistics, Universidade de Brasília, Brasília 70910-90, Brazil

Mathematics 2020, 8(6), 1000; https://doi.org/10.3390/math8061000

This article belongs to the Special Issue Statistical Simulation and Computation

Abstract

In the present paper, a novel spatial quantile regression model based on the Birnbaum–Saunders distribution is formulated. This distribution has been widely studied and applied in many fields. To formulate such a spatial model, a parameterization of the multivariate Birnbaum–Saunders distribution is established, where one of its parameters is associated with the quantile of the respective marginal distribution. The model parameters are estimated by the maximum likelihood method. Finally, the formulated model is illustrated with a real data set.

Keywords: data analytics; geostatistical models; maximum likelihood method; multivariate distributions; R software; statistical parameterizations

1. Introduction

An asymmetric distribution that has recently received considerable attention is the Birnbaum–Saunders (BS) model. It originated in the study of material fatigue and has been applied to reliability and fatigue studies [1,2,3]. Extensive work has been done on the BS distribution with regard to its mathematical and statistical properties, inference, modeling, and diagnostics. Its natural applications have mainly focused on engineering. However, today they span diverse fields, including air pollution [4,5], business [6], earth sciences [7,8], industry [9,10], and medicine [11,12], among other areas. These applications have been carried out by an international, transdisciplinary group of researchers.

Standard regression models provide an estimate of the mean response given certain values of the covariates. These models have two limitations. First, they cannot be used to estimate parameters other than the mean, whereas practitioners in engineering, environmental, and social sciences, as well as in other areas, are often interested in estimating quantiles, for example, to establish product warranties, to determine the levels of nutrients in the soil, or to measure economic inequality between poor (lower tail) and rich (upper tail) people by means of their household incomes [13]. Second, if the response variable follows a skewed distribution, then the mean is not a good measure of central tendency to summarize the data and, in this case, the median is more informative and robust. In addition, regression models can describe parameters of the whole distribution related to variability, skewness, and other higher-order moments, which can characterize a distribution [14]. To address the two limitations mentioned above, quantile regression models were proposed by [15], extending the median regression model attributed to [16] and generalizing the ordinary sample quantiles to the regression setting. We are interested in modeling the median or other quantiles of the BS distribution by regression; see [17,18].

The accuracy of an estimator of the mean (or median) might be improved if a spatial component is added to the model [19]. The idea of spatial quantile regression was initially proposed by [20]; [21] discussed a general spatial quantile regression based on the conditional quantile function, while [22] presented variants of spatial quantile regression. We provide background on quantile regression, including the spatial case, in the next section. References [23,24,25] introduced BS spatial mean regression models and their diagnostics for the conditional mean; see [26] for details on diagnostic methods. Stochastic processes are applied in the modeling of spatial data, which requires knowing the corresponding finite-dimensional multivariate distributions. BS multivariate distributions have been proposed and studied by [27,28,29]. BS quantile regression models were recently derived by [13] for the independent case, where household income data were considered. However, no BS quantile regression models for data with spatial dependence have been proposed.

The main objective of this work is to formulate a novel class of spatial quantile regression models based on the BS distribution. To accomplish this, we propose a quantile parameterization to generate a new multivariate BS model, whose parameters are estimated by the maximum likelihood method. The model is then illustrated with a data set.

The remainder of the paper is organized as follows. In Section 2, quantile regression models for the cases of independent and spatial data are described. Section 3 presents the univariate BS distribution in its original parameterization and a new parameterization of it, which allows us to model a quantile. In Section 4, the multivariate normal distribution and its connection to the new parametrization of the multivariate BS distribution are introduced. In Section 5, we formulate the spatial quantile regression model based on the BS distribution. Section 6 derives the estimation of model parameters using the maximum likelihood method, whereas tools for model checking are discussed in Section 7. In Section 8, we carry out an empirical example with spatial data to illustrate potential applications of the novel model. Conclusions and future work are discussed in Section 9. Appendix A, with derivatives for the score vector and Hessian matrix, is provided at the end of this paper.

2. Quantile Regression

Standard regression models have been widely used in different areas and are defined as

$$Y_{i} = x_{i}^{⊤} β + ε_{i}, \quad i = 1, \dots, n,$$

where $Y$ is the dependent (or response) variable; $x$ corresponds to the values of the vector of independent variables (covariates) $X$; $β$ is a vector of regression parameters; and $ε$ is a random error with $E[ε] = 0$, $Var[ε] = ς^{2}$ (constant variance), and $Cov[ε_{l}, ε_{k}] = 0$, for $l \neq k$ (uncorrelated errors). This implies that a regression model describes the conditional mean $E[Y | X = x] = x^{⊤} β$, so that it can be written through the probability density function (PDF) of $Y$ parameterized in terms of its mean. For example, if $Y$ is normally distributed, then its linear regression model can be expressed as

$$Y_{i} | X_{i} = x_{i} \sim N(μ_{i} = x_{i}^{⊤} β, ς^{2}), \quad i = 1, \dots, n, \tag{1}$$

with $Y_{1} | X_{1} = x_{1}, \dots, Y_{n} | X_{n} = x_{n}$ being independent random variables. Additionally, we can generalize the expression for $μ_{i}$ given in (1) by considering $μ_{i} = h_{1}(x_{i}^{⊤} β)$, where $h_{1}$ is an invertible function, as in generalized linear models [30]. If we now consider a $k$-parameter distribution, with $θ = (θ_{1} = μ = h_{1}(x^{⊤} β), θ_{2}, \dots, θ_{k})^{⊤}$, that is, a distribution parameterized in terms of its mean [31,32] in addition to other parameters, one may establish a more general model of the form

$$Y_{i} | X_{i} = x_{i} \sim f_{Y}(y; θ_{1} = h_{1}(x_{i}^{⊤} β), θ_{2}, \dots, θ_{k}), \quad i = 1, \dots, n, \tag{2}$$

where $Y$ now follows some distribution.

Quantile regression models for a response $Y$ offer a mechanism to estimate and predict the median response as well as other quantiles [15]. This class of regression models is based on the quantile function that is given by

$$Q_{Y}(τ; θ) = \inf \{y : F_{Y}(y; θ) \geq τ\},$$

where $θ$ is a $k \times 1$ parameter vector of the underlying distribution and $0 < τ < 1$. If one of the parameters of the distribution of $Y$ is its quantile function, we can represent a quantile regression model, similarly to (2), as

$$Y_{i} | X_{i} = x_{i} \sim f_{Y}(y; Q_{Y}(τ; β, x_{i}) = h(x_{i}^{⊤} β), θ_{2}, \dots, θ_{k}), \quad i = 1, \dots, n, \tag{3}$$

where $h$ is an invertible function, with positive support and at least twice differentiable, $τ$ is a fixed value and, as before, $Y_{1} | X_{1} = x_{1}, \dots, Y_{n} | X_{n} = x_{n}$ are independent random variables.

Let $\{Y(s), s \in D\}$ be a stochastic process that is defined over a region $D \subset \mathbb{R}^{2}$. We use the notation $Q_{Y(s)}(τ; θ) = \inf \{y : F_{Y(s)}(y; θ) \geq τ\}$ to represent the quantile function for $Y$ in the location $s \in D \subset \mathbb{R}^{2}$. If we consider spatial locations $s_{i}$, the quantile function of the process can be modeled by regression as $Q_{Y(s_{i})}(τ; β | x(s_{i})) = x(s_{i})^{⊤} β$, or more generally as $Q_{Y(s_{i})}(τ; β | x(s_{i})) = h^{-1}(x(s_{i})^{⊤} β)$, for $i = 1, \dots, n$. Here, $Q_{Y_{i}}(τ; β | x_{i})$ is the conditional quantile function of $Y$ given a set of values $x_{i}$ for the covariates, in the location $s_{i}$, where $τ$ is a fixed value, and $h$ is as given in (3). When $τ = 0.5$, the median is modeled. Often it is assumed that the covariance function of the process only depends on the distance between spatial locations, that is, the stochastic process is stationary.

3. The Univariate Birnbaum-Saunders Distribution

If $Z \sim N(0, 1)$, then the random variable $T$ given by

$$T = T(Z; α, ϱ) = ϱ \left[α Z / 2 + \sqrt{(α Z / 2)^{2} + 1}\right]^{2} \tag{4}$$

has a BS distribution with parameters of shape $α > 0$ and scale $ϱ > 0$, which is denoted by $T \sim BS(α, ϱ)$. The random variable $T$ has positive support and the transformation given in (4) is one-to-one, which allows us to establish that

$$Z = \frac{1}{α} \left(\sqrt{T / ϱ} - \sqrt{ϱ / T}\right) \sim N(0, 1).$$
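The stochastic representation in (4) gives a direct way to generate BS random numbers from standard normal draws. The following is a minimal sketch in Python (standard library only), not the authors' implementation; the function names, the seed, and the values of α and ϱ are illustrative assumptions. It also checks that the inverse map recovers the normal draws and compares the sample mean with $ϱ (1 + α^{2} / 2)$, the mean of the BS distribution.

```python
import math
import random


def bs_from_normal(z, alpha, rho):
    """Map a standard normal value z to a BS(alpha, rho) value via (4)."""
    half = alpha * z / 2.0
    return rho * (half + math.sqrt(half * half + 1.0)) ** 2


def normal_from_bs(t, alpha, rho):
    """Inverse map: recover the standard normal value from t > 0."""
    return (math.sqrt(t / rho) - math.sqrt(rho / t)) / alpha


# Illustrative (hypothetical) parameter values, not taken from the paper.
alpha, rho = 0.5, 2.0
rng = random.Random(2020)  # fixed seed so that the run is reproducible

zs = [rng.gauss(0.0, 1.0) for _ in range(200_000)]
ts = [bs_from_normal(z, alpha, rho) for z in zs]

# Round trip: the inverse transformation should return the original draws.
max_roundtrip_error = max(
    abs(normal_from_bs(t, alpha, rho) - z) for t, z in zip(ts, zs)
)

# Sample mean, to be compared with rho * (1 + alpha^2 / 2).
sample_mean = sum(ts) / len(ts)
theoretical_mean = rho * (1.0 + alpha ** 2 / 2.0)
```

Because the map is monotone in $z$, the same two functions can also be used to move between normal and BS quantiles.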

The PDF and cumulative distribution function (CDF) of $T$ are expressed, respectively, by

$$f_{T}(t) = ϕ(A(t; α, ϱ)) \, a(t; α, ϱ), \quad F_{T}(t) = Φ(A(t; α, ϱ)), \quad t > 0,$$

where $ϕ$ and $Φ$ are the PDF and CDF of the standard normal distribution, whereas

$$A(t; α, ϱ) = \frac{1}{α} \left(\sqrt{t / ϱ} - \sqrt{ϱ / t}\right), \quad a(t; α, ϱ) = \frac{d}{d t} A(t; α, ϱ) = \frac{1}{2 α ϱ} \left[\sqrt{ϱ / t} + \sqrt{(ϱ / t)^{3}}\right]. \tag{5}$$

Let $T \sim BS(α, ϱ)$. Then, the following properties hold:

(i): $E [T] = ϱ (1 + α^{2} / 2)$ .
(ii): $Var [T] = ϱ^{2} α^{2} (1 + 5 α^{2} / 4)$ .
(iii): $b T \sim BS (α, b ϱ)$ , for $b > 0$ .
(iv): $1 / T \sim BS (α, 1 / ϱ)$ .
(v): $W = Z^{2} = (1 / α^{2}) (T / ϱ + ϱ / T - 2) \sim χ^{2} (1)$ , with $E [W] = 1$ and $Var [W] = 2$ .

These properties are useful for diverse statistical purposes, such as the generation of moments and of random numbers, estimation of parameters, and modeling based on regression. Another property of the BS distribution is presented next. Given $q \in (0, 1)$, note that the $q$th quantile of the BS distribution is defined as

$$Q = t_{q} = \frac{ϱ}{4} \left(α z_{q} + \sqrt{α^{2} z_{q}^{2} + 4}\right)^{2} = \frac{ϱ}{4} γ_{α}^{2}, \tag{6}$$

where

$$γ_{α} = α z_{q} + \sqrt{α^{2} z_{q}^{2} + 4}, \tag{7}$$

with $z_{q}$ being the $q$th quantile of the standard normal distribution.
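The quantile formula (6) and its inversion $ϱ = 4 Q / γ_{α}^{2}$ are the core of the quantile parameterization used later. The sketch below, in Python with the standard library only, evaluates both directions and checks that the CDF at $Q$ returns $q$. Function names and numerical values are illustrative assumptions, not part of the paper.

```python
import math
from statistics import NormalDist

STD_NORMAL = NormalDist()  # standard normal distribution N(0, 1)


def gamma_alpha(alpha, q):
    """gamma_alpha in (7) for shape alpha and quantile level q."""
    z_q = STD_NORMAL.inv_cdf(q)
    return alpha * z_q + math.sqrt(alpha ** 2 * z_q ** 2 + 4.0)


def bs_quantile(alpha, rho, q):
    """q-th quantile of BS(alpha, rho), following (6)."""
    return rho / 4.0 * gamma_alpha(alpha, q) ** 2


def scale_from_quantile(alpha, Q, q):
    """Invert (6): recover the scale rho from the quantile Q."""
    return 4.0 * Q / gamma_alpha(alpha, q) ** 2


def bs_cdf(t, alpha, rho):
    """CDF of BS(alpha, rho) at t > 0."""
    A = (math.sqrt(t / rho) - math.sqrt(rho / t)) / alpha
    return STD_NORMAL.cdf(A)


# Illustrative (hypothetical) values.
alpha, rho, q = 0.8, 1.5, 0.9
Q = bs_quantile(alpha, rho, q)
check_level = bs_cdf(Q, alpha, rho)            # should be close to q
check_scale = scale_from_quantile(alpha, Q, q)  # should be close to rho
```

For $q = 0.5$ we have $z_{q} = 0$ and $γ_{α} = 2$, so the quantile $Q$ coincides with the scale $ϱ$, which is the median of the distribution.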

4. The Multivariate BS Distribution and a New Parametrization

Let $V = (V_{1}, \dots, V_{n})^{⊤} \in \mathbb{R}^{n}$ be a random vector with $n$-variate normal distribution, denoted by $V \sim N_{n}(μ, Σ)$, with mean vector $μ = (μ_{i}) \in \mathbb{R}^{n}$ and variance-covariance matrix $Σ = (σ_{k l}) \in \mathbb{R}^{n \times n}$, with $rank(Σ) = n$. Note that $Σ$ is symmetric, non-singular, and positive definite, so that the distribution of $V$ is non-singular [33]. When the mean vector is zero, that is, $μ = 0_{n \times 1}$, we use the notation $ϕ_{n}$ and $Φ_{n}$ for the $n$-variate normal PDF and CDF, respectively, where $0_{n \times 1}$ is an $n \times 1$ vector of zeros.

The random vector $T = (T_{1}, \dots, T_{n})^{⊤} \in \mathbb{R}_{+}^{n}$ follows an $n$-variate BS distribution with parameters $α = (α_{1}, \dots, α_{n})^{⊤} \in \mathbb{R}_{+}^{n}$, $ϱ = (ϱ_{1}, \dots, ϱ_{n})^{⊤} \in \mathbb{R}_{+}^{n}$, and $Σ \in \mathbb{R}^{n \times n}$, if $T_{i} = T(V_{i}; α_{i}, ϱ_{i})$, for $i = 1, \dots, n$, where $T$ is given in (4) and $V = (V_{1}, \dots, V_{n})^{⊤} \sim N_{n}(0_{n \times 1}, Σ)$, with $Σ \in \mathbb{R}^{n \times n}$ being the variance-covariance matrix of $V$ with diagonal elements equal to one. Therefore, $Σ$ is also the correlation matrix of $V$ in this case. Note that $Σ$ is the correlation matrix of $V$ and not of $T$, but we use the notation $T \sim BS_{n}(α, ϱ, Σ)$ due to the relationship between the BS and normal distributions. Observe that the CDF and PDF of $T \sim BS_{n}(α, ϱ, Σ)$ are defined, respectively, by

$$F_{T}(t; α, ϱ, Σ) = Φ_{n}(A; Σ), \quad f_{T}(t; α, ϱ, Σ) = ϕ_{n}(A; Σ) \, a(t; α, ϱ), \quad t = (t_{1}, \dots, t_{n}) \in \mathbb{R}_{+}^{n},$$

where $A = A(t; α, ϱ) = (A_{1}, \dots, A_{n})^{⊤}$, with $A_{i} = A(t_{i}; α_{i}, ϱ_{i})$, $a(t; α, ϱ) = \prod_{i = 1}^{n} a(t_{i}; α_{i}, ϱ_{i})$, and both $A(t_{i}; α_{i}, ϱ_{i})$ and $a(t_{i}; α_{i}, ϱ_{i})$ are as expressed in (5).

Let $q \in (0, 1)$ be a fixed number and $T \sim BS(α, ϱ)$. The transformation given by

$$(α, ϱ) \mapsto (α, Q), \tag{8}$$

where $Q$ is defined in (6), is one-to-one. Therefore, if $T = (T_{1}, \dots, T_{n}) \sim BS_{n}(α, ϱ, Σ)$, we have a new parametrization of the multivariate BS distribution, denoted by $T \sim BS_{n}(α, Q, Σ)$, obtained similarly to (8) by the transformation expressed as

$$(α, ϱ, Σ) \mapsto (α, Q, Σ), \tag{9}$$

where the elements $Q_{i}, ϱ_{i}$ of $Q, ϱ$ are related by (6) for the marginal distribution of $T_{i}$, for all $i = 1, \dots, n$. Thus, according to (9), the CDF and PDF of $T \sim BS_{n}(α, Q, Σ)$ are given, respectively, by

$$F_{T}(t; α, Q, Σ) = Φ_{n}(\bar{A}; Σ), \quad f_{T}(t; α, Q, Σ) = ϕ_{n}(\bar{A}; Σ) \, \bar{a}(t; α, Q), \quad t = (t_{1}, \dots, t_{n}) \in \mathbb{R}_{+}^{n}, \tag{10}$$

where $\bar{A} = (\bar{A}_{1}, \dots, \bar{A}_{n})^{⊤}$, with $\bar{A}_{i} = A(t_{i}; α_{i}, 4 Q_{i} / γ_{α_{i}}^{2}) = [1 / (α_{i} γ_{α_{i}} \sqrt{4 Q_{i} t_{i}})] (t_{i} γ_{α_{i}}^{2} - 4 Q_{i})$, $\bar{a}(t; α, Q) = \prod_{i = 1}^{n} a(t_{i}; α_{i}, 4 Q_{i} / γ_{α_{i}}^{2}) = \prod_{i = 1}^{n} [1 / (α_{i} γ_{α_{i}} \sqrt{4 Q_{i} t_{i}})] (γ_{α_{i}}^{2} / 2 + 2 Q_{i} / t_{i})$, and $γ_{α_{i}}$ being defined in (7). Figure 1 and Figure 2 present plots of the PDF defined in (10) with $n = 2$, when the parameters $α$ and $Q$ vary, for different rotations of these PDFs.
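To make the reparametrized density in (10) concrete for $n = 2$, the following Python sketch (standard library only) evaluates it pointwise. It is an illustration under stated assumptions rather than the code used for the figures: the function names are hypothetical, the correlation must satisfy $|σ| < 1$, and the quantile level $q = 0.5$ in the usage line is an assumption, since the level used for the plots is not stated.

```python
import math
from statistics import NormalDist

STD_NORMAL = NormalDist()


def gamma_alpha(alpha, q):
    """gamma_alpha in (7)."""
    z_q = STD_NORMAL.inv_cdf(q)
    return alpha * z_q + math.sqrt(alpha ** 2 * z_q ** 2 + 4.0)


def A_bar(t, alpha, Q, q):
    """A(t; alpha, rho) evaluated at rho = 4 Q / gamma_alpha^2."""
    rho = 4.0 * Q / gamma_alpha(alpha, q) ** 2
    return (math.sqrt(t / rho) - math.sqrt(rho / t)) / alpha


def a_bar(t, alpha, Q, q):
    """Derivative of A_bar with respect to t."""
    rho = 4.0 * Q / gamma_alpha(alpha, q) ** 2
    ratio = rho / t
    return (math.sqrt(ratio) + math.sqrt(ratio ** 3)) / (2.0 * alpha * rho)


def bs_pdf(t, alpha, Q, q):
    """Univariate BS PDF in the quantile parameterization."""
    return STD_NORMAL.pdf(A_bar(t, alpha, Q, q)) * a_bar(t, alpha, Q, q)


def bs2_pdf(t1, t2, alphas, Qs, sigma, q):
    """Bivariate PDF (10) with n = 2 and correlation sigma, |sigma| < 1."""
    A1 = A_bar(t1, alphas[0], Qs[0], q)
    A2 = A_bar(t2, alphas[1], Qs[1], q)
    det = 1.0 - sigma ** 2
    quad = (A1 ** 2 - 2.0 * sigma * A1 * A2 + A2 ** 2) / det
    phi2 = math.exp(-0.5 * quad) / (2.0 * math.pi * math.sqrt(det))
    jacobian = a_bar(t1, alphas[0], Qs[0], q) * a_bar(t2, alphas[1], Qs[1], q)
    return phi2 * jacobian


# Density at (1, 1) for alpha_i = 0.5, Q_i = 1.0 and sigma = 0.9 (q = 0.5 assumed).
density = bs2_pdf(1.0, 1.0, (0.5, 0.5), (1.0, 1.0), 0.9, 0.5)
```

With $σ = 0$ the bivariate density factorizes into the product of the two univariate densities, which gives a simple sanity check of the implementation.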

Figure 1. Plots of the reparametrized bivariate BS PDF for $α_{i} = 0.5$ (a), $α_{i} = 0.8$ (b), $α_{i} = 1.5$ (c) with $Q_{i} = 1.0$, and $Q_{i} = 0.5$ (d), $Q_{i} = 0.8$ (e), $Q_{i} = 1.5$ (f) with $α_{i} = 1.0$, for $i = 1, 2$ and $σ = 0.9$.

Figure 2. Plots of the reparametrized bivariate BS PDF for $α_{i} = 0.5$ and $Q_{i} = 1.0$, for $i = 1, 2$, with $σ = 0.9$; panels (a–d) show the PDF from different angles.

Theorem 1.

Let $T = (T_{1}, \dots, T_{n}) \sim BS_{n}(α, Q, Σ)$, with $α = (α_{1}, \dots, α_{n})$, $Q = (Q_{1}, \dots, Q_{n})$, and $Σ = (σ_{k l})$ being an $n \times n$ correlation matrix. Then,

(i): $T_{i} \sim BS(α_{i}, Q_{i})$ , for $i = 1, \dots, n$ .
(ii): $(T_{i}, T_{j}) \sim {BS}_{2} (α^{(i, j)}, Q^{(i, j)}, Σ^{(i, j)})$ , where $α^{(i, j)} = (α_{i}, α_{j})$ , $Q^{(i, j)} = (Q_{i}, Q_{j})$ and $Σ^{(i, j)}$ is a $2 \times 2$ matrix with ones on its diagonal and off-diagonal elements equal to the element $σ_{i j}$ of the matrix $Σ$.
(iii): $Cov [T_{i}, T_{j}] = \frac{4 α_{i} α_{j} Q_{i} Q_{j}}{γ_{α_{i}}^{2} γ_{α_{j}}^{2}} [2 α_{i} α_{j} σ_{i j}^{2} + 4 I (α_{i}, α_{j}, σ_{i j})], \quad i, j = 1, \dots, n,$

where $I (α_{i}, α_{j}, σ_{i j}) = E \{Z_{i} Z_{j} [(α_{i} Z_{i} / 2)^{2} + 1]^{1 / 2} [(α_{j} Z_{j} / 2)^{2} + 1]^{1 / 2}\}$ , with $(Z_{i}, Z_{j})$ following a standard bivariate normal distribution with correlation matrix $Σ^{(i, j)}$ ; see [34].
(iv): The variance-covariance matrix of $T$ is $Var [T] = 4 Ω ⊙ (2 Σ ⊙ Σ ⊙ Ξ + 4 U)$ , where $Ω = (ω_{i j})$ , $Ξ = (ξ_{i j})$ and $U = (u_{i j})$ have elements $ω_{i j} = α_{i} α_{j} Q_{i} Q_{j} / (γ_{α_{i}}^{2} γ_{α_{j}}^{2})$ , $ξ_{i j} = α_{i} α_{j}$ and $u_{i j} = I (α_{i}, α_{j}, σ_{i j})$ , respectively, for $i, j = 1, \dots, n$ , and ⊙ is the Hadamard product. If $T_{1}, \dots, T_{n}$ are independent random variables, then $Var [T] = 4 D (ϵ_{i i})$ , where $D (ϵ_{i i}) = diag (ϵ_{11}, \dots, ϵ_{n n})$ , that is, $D$ is a diagonal matrix with elements $ϵ_{i i} = α_{i}^{2} Q_{i}^{2} (2 α_{i}^{2} + 4 I (α_{i}, α_{i}, 1)) / γ_{α_{i}}^{4}$ .

Proof.

The results are deduced using Theorem 3.1 and p. 117 of [34], with our parametrization. ☐

Corollary 1.

Let $T = (T_{1}, T_{2}) \sim BS_{2}(α, Q, Σ)$, with $α = (α_{1}, α_{2})$, $Q = (Q_{1}, Q_{2})$ and $Σ = \begin{pmatrix} 1 & σ \\ σ & 1 \end{pmatrix}$. Then,

(i): $E [T_{1} T_{2}] = \frac{4 Q_{1} Q_{2}}{γ_{α_{1}}^{2} γ_{α_{2}}^{2}} [4 + 2 (α_{1}^{2} + α_{2}^{2}) + α_{1}^{2} α_{2}^{2} (1 + 2 σ^{2}) + 4 α_{1} α_{2} I (α_{1}, α_{2}, σ)],$

with $I (α_{1}, α_{2}, σ)$ being defined in Theorem 1(iii).
(ii): $Cov [T_{1}, T_{2}] = \frac{4 α_{1} α_{2} Q_{1} Q_{2}}{γ_{α_{1}}^{2} γ_{α_{2}}^{2}} [2 α_{1} α_{2} σ^{2} + 4 I (α_{1}, α_{2}, σ)] .$
(iii): $Corr (T_{1}, T_{2}) = \frac{2 α_{1} α_{2} σ^{2} + 4 I (α_{1}, α_{2}, σ)}{\sqrt{4 + 5 α_{1}^{2}} \sqrt{4 + 5 α_{2}^{2}}} .$

Proof.

The results are obtained using ([34] p.117), with our parametrization; see also [35]. ☐
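The moments in Theorem 1 and Corollary 1 involve the expectation $I$, which has no simple closed form in general, so a numerical check can be useful. The sketch below, in Python with the standard library only, computes the means, variances, covariance and correlation of a bivariate BS vector by a Riemann sum applied directly to the stochastic representation (4), without relying on the closed-form expressions. The function names, grid settings and parameter values are illustrative assumptions, and the scales are given as $ϱ_{i}$ (use $ϱ_{i} = 4 Q_{i} / γ_{α_{i}}^{2}$ to pass from the quantile parameterization).

```python
import math


def bs_from_normal(z, alpha, rho):
    """Transformation (4): standard normal value z to a BS(alpha, rho) value."""
    half = alpha * z / 2.0
    return rho * (half + math.sqrt(half * half + 1.0)) ** 2


def bivariate_bs_moments(alphas, rhos, sigma, half_width=8.0, step=0.05):
    """Moments of (T1, T2) by a Riemann sum over independent standard
    normals (z1, w), using z2 = sigma * z1 + sqrt(1 - sigma^2) * w."""
    m = int(round(half_width / step))
    grid = [k * step for k in range(-m, m + 1)]
    norm_const = step / math.sqrt(2.0 * math.pi)
    weights = [math.exp(-0.5 * g * g) * norm_const for g in grid]
    s = math.sqrt(1.0 - sigma ** 2)
    e1 = e2 = e11 = e22 = e12 = 0.0
    for z1, w1 in zip(grid, weights):
        t1 = bs_from_normal(z1, alphas[0], rhos[0])
        for w, w2 in zip(grid, weights):
            t2 = bs_from_normal(sigma * z1 + s * w, alphas[1], rhos[1])
            p = w1 * w2  # probability mass attached to this grid cell
            e1 += p * t1
            e2 += p * t2
            e11 += p * t1 * t1
            e22 += p * t2 * t2
            e12 += p * t1 * t2
    var1, var2 = e11 - e1 * e1, e22 - e2 * e2
    cov = e12 - e1 * e2
    return {
        "mean1": e1, "mean2": e2, "var1": var1, "var2": var2,
        "cov": cov, "corr": cov / math.sqrt(var1 * var2),
    }


# Illustrative (hypothetical) values: shapes, scales and normal correlation.
moments = bivariate_bs_moments((0.5, 0.8), (1.0, 2.0), 0.9)
```

The marginal means and variances returned by this routine can be compared with properties (i) and (ii) of Section 3, and the covariance with the expressions stated above once $I$ is evaluated by the same quadrature.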

5. Formulation of the Spatial Model

Let $T = \{T(s), s \in D\}$ be a stochastic process that is defined over a region $D \subset \mathbb{R}^{2}$. We assume that the stochastic process $T$ is stationary and isotropic, and that, for given spatial locations $s_{i}$, with $i = 1, \dots, n$, the quantile function of the process can be modeled by

$$Q(T(s_{i}); β | x_{i}) = Q_{i} = h^{-1}(x_{i}^{⊤} β), \quad i = 1, \dots, n, \tag{11}$$

where $h$ is an invertible function, with positive support, at least twice differentiable, and $x_{i}^{⊤} = (1, x_{i 1}, \dots, x_{i (p - 1)})$ represents the values of $p - 1$ covariates, with $x_{i j} = x_{j}(s_{i})$, for $j = 1, \dots, p - 1$, that is, $x_{i j}$ is the value of the covariate $X_{j}$ at the location $s_{i}$. Note that $p < n$ must be satisfied. In addition, $β = (β_{0}, β_{1}, \dots, β_{p - 1})^{⊤}$ is a vector of unknown parameters to be estimated and $(T(s_{1}), \dots, T(s_{n})) = (T_{1}, \dots, T_{n}) \sim BS_{n}(α 1_{n \times 1}, Q(β), Σ)$, with $α > 0$ and $1_{n \times 1}$ being an $n \times 1$ vector of ones. Observe that $Q(β)$ is related to $Q$ defined in (9), but now depending on $β$. Here, $Σ = (σ_{i j})$ is the $n \times n$ (non-singular) correlation matrix earlier defined. Thus, based on Theorem 1(iv), the variance-covariance matrix of the BS spatial quantile regression model can be written as

$$Var[T] = \frac{4 α^{2}}{γ_{α}^{4}} [Q(β) Q(β)^{⊤}] ⊙ (2 α^{2} Σ ⊙ Σ + 4 U), \tag{12}$$

where $Q(β)^{⊤} = (Q_{1}(β), \dots, Q_{n}(β))$, with $Q_{i}(β) = h^{-1}(x_{i}^{⊤} β)$, for $i = 1, \dots, n$. Notice that the variance-covariance matrix of the BS spatial process that is stated in (12) depends on its quantile function.

Note that the spatial correlation is often modeled by a function of the Matérn family [19]. By using this family and an alternative parameterization suggested by [36], the elements of the matrix $Σ$ involved in (12) are given by

$$σ_{i j} = \begin{cases} 1, & i = j; \\ \frac{1}{2^{δ - 1} Γ(δ)} (φ h_{i j})^{δ} K_{δ}(φ h_{i j}), & i \neq j; \end{cases} \tag{13}$$

where $δ > 0$ is a shape parameter; $Γ$ is the usual gamma function; $h_{i j}$ is the Euclidean distance between the locations $s_{i}$ and $s_{j}$, that is, $h_{i j} = \| s_{i} - s_{j} \|$; $φ > 0$ is a parameter known as the spatial dependence inverse radius [37] and also related to a parameter named microergodic by [36]; and $K_{δ}$ is the modified Bessel function of the third kind of order $δ$ [38]. Some particular cases of the Matérn family are presented in Table 1.

Table 1. Particular cases of the Matérn correlation function with h denoting a distance measure.
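For half-integer values of $δ$, the Bessel function in (13) reduces to an exponential times a polynomial, so the correlation can be computed without special functions. The Python sketch below (standard library only) uses these standard simplifications for $δ \in \{0.5, 1.5, 2.5\}$ and builds the matrix $Σ$ for a set of planar coordinates. Which cases the table lists is not assumed here; the function names and the example coordinates are hypothetical.

```python
import math


def matern_correlation(h, phi, delta=0.5):
    """Correlation (13) at distance h >= 0 for delta in {0.5, 1.5, 2.5},
    using the closed forms of K_delta at half-integer orders."""
    if h == 0.0:
        return 1.0
    x = phi * h
    if delta == 0.5:
        poly = 1.0                      # exponential correlation
    elif delta == 1.5:
        poly = 1.0 + x
    elif delta == 2.5:
        poly = 1.0 + x + x * x / 3.0
    else:
        raise ValueError("only delta in {0.5, 1.5, 2.5} is handled in this sketch")
    return poly * math.exp(-x)


def correlation_matrix(coords, phi, delta=0.5):
    """n x n correlation matrix for a list of (x, y) locations."""
    n = len(coords)
    return [
        [matern_correlation(math.dist(coords[i], coords[j]), phi, delta)
         for j in range(n)]
        for i in range(n)
    ]


# Hypothetical locations and inverse radius, for illustration only.
coords = [(0.0, 0.0), (3.0, 4.0), (6.0, 8.0)]
Sigma = correlation_matrix(coords, phi=0.2, delta=0.5)
```

A general implementation for arbitrary $δ > 0$ would need a routine for the modified Bessel function $K_{δ}$, which is outside the Python standard library.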

6. Estimation of Model Parameters

Let $θ = (α, β^{⊤}, φ)^{⊤}$ be a vector of unknown parameters of the spatial quantile regression model formulated in (11), which can be estimated by the maximum likelihood method, as follows. Note that $φ > 0$ is the spatial dependence inverse radius [39] of the Matérn spatial correlation function defined in (13). The corresponding log-likelihood function for $θ$ based on the vector of observations $t = (t_{1}, \dots, t_{n})$ can be written as

$$ℓ(θ) = - \frac{n}{2} \log(2 π) - \frac{1}{2} \log(| Σ |) - \frac{1}{2} \bar{A}^{⊤} Σ^{-1} \bar{A} + \log(\bar{a}), \tag{14}$$

where $\bar{A} = \bar{A}(t; α 1_{n \times 1}, Q)$, with $Q = Q(β)$, $\bar{a} = \bar{a}(t; α 1_{n \times 1}, Q)$, and $Σ$ is the correlation matrix involved in (12). Taking the derivative of (14), with respect to the corresponding parameters, leads to the $(p + 2) \times 1$ score vector that is defined as

$$\dot{ℓ}(θ) = \left[\frac{\partial ℓ(θ)}{\partial α}, \left(\frac{\partial ℓ(θ)}{\partial β}\right)^{⊤}, \frac{\partial ℓ(θ)}{\partial φ}\right]^{⊤} = (\dot{ℓ}_{α}, \dot{ℓ}_{β_{0}}, \dot{ℓ}_{β_{1}}, \dots, \dot{ℓ}_{β_{p - 1}}, \dot{ℓ}_{φ})^{⊤}. \tag{15}$$

For details of the score vector given in (15), see Appendix A. In order to find the maximum likelihood estimate $\hat{θ}$ of $θ$, the non-linear system $\dot{ℓ}(θ) = 0_{(p + 2) \times 1}$ must be solved. Because this system does not have a closed-form solution, $\hat{θ}$ must be computed using an iterative procedure for non-linear systems. Here, a quasi-Newton procedure, named Broyden–Fletcher–Goldfarb–Shanno [40,41], may be used through the functions optim and optimx implemented in the R software; see www.R-project.org and [42]. The signs of the determinants of the corresponding Hessian matrix and of its minors were also checked to ensure that a valid maximum has been attained.
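As an illustration of how the log-likelihood (14) can be evaluated before handing it to an optimizer, here is a minimal sketch in Python using only the standard library. It is not the authors' R code. It assumes one covariate, the square root link, and the exponential correlation (the Matérn case $δ = 0.5$); the function names, the small Cholesky routine, and the data in the usage lines are all hypothetical.

```python
import math
from statistics import NormalDist


def cholesky(S):
    """Lower-triangular L with L L' = S, for a positive definite matrix S."""
    n = len(S)
    L = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = S[i][j] - sum(L[i][k] * L[j][k] for k in range(j))
            L[i][j] = math.sqrt(s) if i == j else s / L[j][j]
    return L


def forward_solve(L, b):
    """Solve L y = b for lower-triangular L."""
    y = []
    for i, row in enumerate(L):
        y.append((b[i] - sum(row[k] * y[k] for k in range(i))) / row[i])
    return y


def log_likelihood(theta, t, x, coords, q=0.5):
    """Log-likelihood (14) with theta = (alpha, beta0, beta1, phi)."""
    alpha, beta0, beta1, phi = theta
    n = len(t)
    z_q = NormalDist().inv_cdf(q)
    gam = alpha * z_q + math.sqrt(alpha ** 2 * z_q ** 2 + 4.0)
    A, log_a = [], 0.0
    for ti, xi in zip(t, x):
        Q = (beta0 + beta1 * xi) ** 2        # square root link: sqrt(Q) = x' beta
        rho = 4.0 * Q / gam ** 2             # scale implied by the quantile Q
        ratio = rho / ti
        A.append((math.sqrt(1.0 / ratio) - math.sqrt(ratio)) / alpha)
        log_a += math.log((math.sqrt(ratio) + math.sqrt(ratio ** 3)) / (2.0 * alpha * rho))
    # Exponential correlation, i.e. the Matern case delta = 0.5 (assumption).
    Sigma = [[math.exp(-phi * math.dist(coords[i], coords[j])) for j in range(n)]
             for i in range(n)]
    L = cholesky(Sigma)
    y = forward_solve(L, A)                  # y'y equals A' Sigma^{-1} A
    log_det = 2.0 * sum(math.log(L[i][i]) for i in range(n))
    quad = sum(v * v for v in y)
    return -0.5 * n * math.log(2.0 * math.pi) - 0.5 * log_det - 0.5 * quad + log_a


# Hypothetical toy data: three locations, responses and covariate values.
t_obs = [1.2, 0.8, 2.0]
x_obs = [1.0, 0.5, 2.0]
coords = [(0.0, 0.0), (1.0, 0.0), (0.0, 2.0)]
value = log_likelihood((0.3, 0.4, 0.2, 0.5), t_obs, x_obs, coords)
```

In practice, the negative of this function would be passed to a quasi-Newton optimizer with the constraints $α > 0$ and $φ > 0$ handled, for example, by optimizing over $\log α$ and $\log φ$; that reparameterization is a common device and not something stated in the paper.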

Note that the Hessian matrix $\ddot{ℓ}(θ)$ for the BS spatial quantile regression model is a $(p + 2) \times (p + 2)$ partitioned (block) matrix. This Hessian matrix is obtained by taking the second derivatives of (14), with respect to the corresponding parameters, and it is given by

$$\ddot{ℓ}(θ) = \begin{pmatrix} \frac{\partial^{2} ℓ(θ)}{\partial α^{2}} & \frac{\partial^{2} ℓ(θ)}{\partial α \partial β^{⊤}} & \frac{\partial^{2} ℓ(θ)}{\partial α \partial φ} \\ \frac{\partial^{2} ℓ(θ)}{\partial β \partial α} & \frac{\partial^{2} ℓ(θ)}{\partial β \partial β^{⊤}} & \frac{\partial^{2} ℓ(θ)}{\partial β \partial φ} \\ \frac{\partial^{2} ℓ(θ)}{\partial φ \partial α} & \frac{\partial^{2} ℓ(θ)}{\partial φ \partial β^{⊤}} & \frac{\partial^{2} ℓ(θ)}{\partial φ^{2}} \end{pmatrix} = \begin{pmatrix} \ddot{ℓ}_{α α} & \ddot{ℓ}_{α β} & \ddot{ℓ}_{α φ} \\ \ddot{ℓ}_{β α} & \ddot{ℓ}_{β β} & \ddot{ℓ}_{β φ} \\ \ddot{ℓ}_{φ α} & \ddot{ℓ}_{φ β} & \ddot{ℓ}_{φ φ} \end{pmatrix}, \tag{16}$$

where the elements of the matrix $\ddot{ℓ}(θ)$ are detailed in Appendix A. Therefore, for the BS spatial quantile regression model, the $(p + 2) \times (p + 2)$ expected Fisher information matrix, as obtained from (16), is expressed as

$$K(θ) = E[- \ddot{ℓ}(θ)] = \begin{pmatrix} K_{α α} & K_{α β} & K_{α φ} \\ K_{β α} & K_{β β} & K_{β φ} \\ K_{φ α} & K_{φ β} & K_{φ φ} \end{pmatrix},$$

where the elements of the matrix $K(θ)$ are detailed in Appendix A as well.

7. Model Checking

We consider a property of the multivariate BS distribution related to the Mahalanobis distance in order to evaluate the fit of the spatial model, which might be used to validate the model in practice. Let

$$u_{i} = \bar{A}_{(i)}^{⊤} Σ^{-1} \bar{A}_{(i)}, \quad i = 1, \dots, n, \tag{17}$$

where $\bar{A}_{(i)} = (\bar{A}_{1 (i)}, \dots, \bar{A}_{n (i)})^{⊤}$, with

$$\bar{A}_{j (i)} = \frac{1}{α_{j} γ_{α_{j}}} \sqrt{\frac{4 h^{-1}(x_{j}^{⊤} \hat{β}_{(i)})}{t_{j}}} \left[\frac{t_{j} γ_{α_{j}}^{2}}{4 h^{-1}(x_{j}^{⊤} \hat{β}_{(i)})} - 1\right], \quad j = 1, \dots, n,$$

and $\hat{β}_{(i)}$ being the maximum likelihood estimate of $β$ obtained using the data set without the case $i$. A Newton–Raphson one-step approximation to $\hat{θ}_{(i)}$ can be obtained by

$$\hat{θ}_{(i)} = \hat{θ} - [\ddot{ℓ}_{(i)}(\hat{θ})]^{-1} \dot{ℓ}_{(i)}(\hat{θ}), \quad i = 1, \dots, n,$$

where $\ddot{ℓ}_{(i)}(θ)$ and $\dot{ℓ}_{(i)}(θ)$ are the Hessian matrix and score vector of the BS spatial quantile regression model with its parameters estimated by the maximum likelihood method without the case $i$. Then, under the assumption $T \sim BS_{n}(α 1_{n \times 1}, Q(β), Σ)$, $u_{i}$ defined in (17) is an observation of a random variable that follows approximately a $χ^{2}$ distribution with $n - 1$ degrees of freedom, for $i = 1, \dots, n$. Thus, by using the Wilson–Hilferty approximation [43], we have that

$$z_{i} = \frac{\left(\frac{u_{i}}{n - 1}\right)^{1 / 3} - \left[1 - \frac{2}{9 (n - 1)}\right]}{\left[\frac{2}{9 (n - 1)}\right]^{1 / 2}}, \quad i = 1, \dots, n, \tag{18}$$

is an observation of a random variable which follows approximately a standard normal distribution. Hence, a plot of theoretical versus empirical quantiles (QQ) for $z_{i}$ given in (18) can be used to evaluate the model fit. In addition to the approximation of Wilson–Hilferty, the randomized quantile residual defined by [44] may be employed to evaluate the fit of the BS spatial quantile regression model. In the case of this model, such a residual is given by

$$r_{i} = Φ^{-1}[F(u_{i})], \quad i = 1, \dots, n, \tag{19}$$

where $Φ^{-1}$ is the inverse N(0, 1) CDF and $F$ is the $χ^{2}(n - 1)$ CDF. Because the randomized quantile residual has approximately a N(0, 1) distribution, a QQ plot of $r_{i}$ defined in (19) might also be employed for evaluating the model fit.
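The Wilson–Hilferty transformation in (18) is straightforward to compute once the distances $u_{i}$ are available. The short Python sketch below (standard library only) applies it to a list of distances; the function name and the numerical distances are hypothetical and serve only to show the mechanics.

```python
import math


def wilson_hilferty(u, df):
    """Approximate standard normal score (18) for a chi-square value u
    with df degrees of freedom (df = n - 1 in the text)."""
    c = 2.0 / (9.0 * df)
    return ((u / df) ** (1.0 / 3.0) - (1.0 - c)) / math.sqrt(c)


# Hypothetical distances for a data set with n = 82 locations (df = 81).
df = 81
distances = [60.3, 75.8, 81.0, 92.4, 110.9]
z_scores = [wilson_hilferty(u, df) for u in distances]

# Sorted scores are what would be plotted against normal quantiles in a QQ plot.
z_sorted = sorted(z_scores)
```

Computing the residual in (19) additionally requires the $χ^{2}$ CDF, which is not in the Python standard library, so it is left out of this sketch.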

8. Empirical Illustrative Example

We analyze a chemical data set associated with imbalances and deficiencies of key nutrients in the soil in order to illustrate the results obtained in this paper. This data set corresponds to levels of magnesium (Mg), which affects the development of the root system, and calcium (Ca), which competes with Mg for absorption of nutrients, for $n = 82$ locations of an area in Brazil. The response variable ($T$) is the content of Mg in the soil (in cmol$_{c}$/dm$^{3}$) and the covariate ($X$) is the content of Ca in the soil (in cmol$_{c}$/dm$^{3}$).

A descriptive summary of the response variable includes the sample values (in cmol$_{c}$/dm$^{3}$) of the median = 2.0306; mean = 2.008; standard deviation = 0.7713; coefficient of variation = 0.3841; coefficient of skewness = 0.3394; coefficient of kurtosis = 2.9717; minimum = 0.5734; and maximum = 4.2538. Figure 3 shows the histogram (a), boxplot (b), and scatterplot (c) of the values of the response $T$. In the boxplot, we detect two outliers that correspond to locations #12 and #47. The directional variogram in Figure 3d shows that there is no preferred direction, that is, an omni-directional semi-variogram is appropriate. Thus, the associated stochastic process can be considered as isotropic.

Figure 3. Histogram (a), boxplot (b), scatterplot (c), and semi-variogram (d) for the response variable with chemical data.

In order to estimate the parameters of the BS spatial quantile regression model, we consider the following: (i) the spatial correlation is obtained according to the Matérn function (with $δ = 0.5$; see Table 1); (ii) the random vector $(T(s_{1}), \dots, T(s_{82})) = (T_{1}, \dots, T_{82}) \sim BS_{82}(α 1_{82 \times 1}, Q(β), Σ)$ is assumed; (iii) $q = 0.5$ (the quantile to model the median); and (iv) the identity, logarithm, and square root functions for the link $h$ of the spatial quantile regression defined in (11) are used and expressed as

$$Q_{i} = x_{i}^{⊤} β, \quad \log(Q_{i}) = x_{i}^{⊤} β, \quad \sqrt{Q_{i}} = x_{i}^{⊤} β, \quad i = 1, \dots, 82, \tag{20}$$

where $β = (β_{0}, β_{1})^{⊤}$ is the regression coefficient vector and $x_{i}^{⊤} = (1, x_{i})$, with $x_{i}$ being the value of $X$ for the location $i$.

We can compare spatial regression models using the corrected Akaike information criterion (CAIC) and the Schwarz Bayesian information criterion (BIC). The CAIC and BIC are given, respectively, by

$$CAIC = 2 d - 2 ℓ(\hat{θ}) + (2 d^{2} + 2 d) / (n - d - 1), \quad BIC = d \log(n) - 2 ℓ(\hat{θ}),$$

where $ℓ(\hat{θ})$ is the log-likelihood function for the parameter $θ$ associated with the model evaluated at $θ = \hat{θ}$, $d$ is the dimension of the parameter space, and $n$ is the size of the data set. Both criteria are based on the log-likelihood function and penalize models with more parameters. The model with the smaller value of an information criterion is preferred [45]. The log-likelihood, CAIC, and BIC values for the model with links defined in (20) are presented in Table 2. Additionally, we fit a Gaussian spatial regression to the data set, which considers the modeling of the mean = median (symmetric case), allowing us to compare it with the models that are given in (20). Note that the BS model with square root link is better than the Gaussian model. From this table, we conclude that the BS spatial quantile regression with square root link function should be selected.

Table 2. Values of log-likelihood, CAIC, and BIC for indicated models with chemical data.
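The two criteria above are simple functions of the maximized log-likelihood, the number of parameters $d$ and the sample size $n$. The following Python sketch shows the computation; the function names are hypothetical and the log-likelihood value in the usage lines is made up for illustration, not taken from Table 2.

```python
import math


def caic(loglik, d, n):
    """Corrected Akaike information criterion as defined in the text."""
    return 2.0 * d - 2.0 * loglik + (2.0 * d ** 2 + 2.0 * d) / (n - d - 1)


def bic(loglik, d, n):
    """Schwarz Bayesian information criterion as defined in the text."""
    return d * math.log(n) - 2.0 * loglik


# Made-up log-likelihood; d = 4 parameters (alpha, beta0, beta1, phi), n = 82.
example_loglik = -50.0
example_caic = caic(example_loglik, d=4, n=82)
example_bic = bic(example_loglik, d=4, n=82)
```

When comparing candidate links, the same $n$ and the same data must be used for every model so that the criteria are comparable.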

The maximum likelihood estimates of the selected model parameters and the corresponding asymptotic standard errors, estimated by using the robust covariance matrix method [46] and reported in parentheses, are:

$$\hat{α} = 0.2323 \ (0.0460), \quad \hat{β}_{0} = 0.3821 \ (0.0030), \quad \hat{β}_{1} = 0.1884 \ (0.0093), \quad \hat{φ} = 0.0045 \ (0.0021).$$

These standard errors are low, indicating that all of the parameters are estimated with good statistical precision and allowing us to infer that they must be part of the model. Based on (13), note that the parameter $φ$ is significant at 5% using the confidence interval method, which means that there is spatial dependence. Therefore, the estimated BS spatial quantile regression model is given by

$$\hat{Q}_{i} = (0.3821 + 0.1884 x_{i})^{2}, \quad i = 1, \dots, 82,$$

where the correlation matrix is determined as $\hat{Σ} = Σ(δ, \hat{φ})$, for $δ = 0.5$ and evaluated at $\hat{φ} = 0.0045$, whereas the variance-covariance matrix of the BS spatial quantile regression model defined in (12) is estimated as

$$\widehat{Var[T]} = \frac{4 \hat{α}^{2}}{\hat{γ}_{α}^{4}} \left(\widehat{Q(β)} \, \widehat{Q(β)}^{⊤}\right) ⊙ (2 \hat{α}^{2} \hat{Σ} ⊙ \hat{Σ} + 4 \hat{U}),$$

where $\hat{γ}_{α}$ corresponds to $γ_{α}$ evaluated at $\hat{α} = 0.2323$, $\widehat{Q(β)}^{⊤} = (\hat{Q}_{1}, \dots, \hat{Q}_{82})$ and $\hat{U}$ is obtained evaluating $U$ at $\hat{α}$ and $\hat{φ}$.
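The fitted model can be turned into a small prediction routine. The Python sketch below uses the estimates reported above with the square root link to predict the median Mg content for a given Ca content, and the exponential form of the Matérn correlation with $δ = 0.5$ for the fitted spatial correlation. The function names and the input values in the usage lines are hypothetical, and the distance unit is whatever unit the coordinates of the data set use, which is not stated here.

```python
import math

# Estimates reported in the text for the selected model (square root link).
ALPHA_HAT = 0.2323
BETA0_HAT = 0.3821
BETA1_HAT = 0.1884
PHI_HAT = 0.0045


def predicted_median(x_ca):
    """Estimated median of Mg for a Ca content x_ca (q = 0.5)."""
    return (BETA0_HAT + BETA1_HAT * x_ca) ** 2


def fitted_correlation(h):
    """Fitted correlation at distance h for the Matern case delta = 0.5,
    which reduces to exp(-phi * h)."""
    return math.exp(-PHI_HAT * h)


# Hypothetical inputs, for illustration only.
median_at_5 = predicted_median(5.0)
corr_at_100 = fitted_correlation(100.0)
```

Since $q = 0.5$ gives $γ_{α} = 2$, the predicted median also equals the estimated scale $\hat{ϱ}_{i}$ of the marginal BS distribution at that location.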

Figure 4 provides the QQ plot of the residuals, transformed by the Wilson–Hilferty approximation, after removing a location that was outside the bands. Note that most of the residuals are inside the bands. Additionally, Figure 5a displays a three-dimensional scatterplot that shows the estimated and observed values of $T$. These same values are presented in a two-dimensional scatterplot in Figure 5b. These plots allow us to observe a good fit of our model to the data. Therefore, we conclude that the BS spatial quantile regression model is adequate to describe these spatial data, but a better fit could be obtained if a heavy-tailed asymmetric distribution is considered, such as the BS-Student-t distribution. However, this is beyond the scope of the present study and it provides a challenge for further research.

Figure 4. QQ plots for transformed residuals with chemical data.

Figure 5. Three-dimensional (a) and two-dimensional (b) scatterplots of estimated versus observed response values with chemical data.

9. Conclusions and Future Works

In this paper, we have obtained the following findings:

(i): A new parameterization of the multivariate Birnbaum–Saunders distribution has been established.
(ii): A novel Birnbaum–Saunders spatial quantile regression model has been proposed and derived.
(iii): We have developed maximum likelihood estimation for the parameters of the proposed model.
(iv): A randomized quantile residual has been used for model checking. We have utilized the Wilson–Hilferty approximation for our spatial model residuals to evaluate model adequacy.
(v): The obtained results have been applied to a real data set, illustrating their potential uses.

Therefore, we have derived a novel class of spatial quantile regression models, which is useful for modeling data generated from a positively skewed distribution. The main feature of this spatial regression is the modeling of a quantile of a response variable that follows the Birnbaum–Saunders distribution. The numerical results have shown good performance of the spatial quantile regression model, indicating that the Birnbaum–Saunders distribution is a good modeling choice when dealing with data that have spatial dependence and positive support and that follow a distribution skewed to the right. Hence, it can be a valuable addition to the tool-kit of applied statisticians and data scientists.

The following aspects are open problems for the Birnbaum–Saunders spatial quantile regression models and they can be considered for future work:

(i): A global test for independence might be stated based on $H_{0} : σ_{i j} = 0$ for $i \neq j$ (or $Σ = I_{n}$, the $n \times n$ identity matrix). Specifically, let $L_{full}$ be the likelihood function for the full model and $L_{reduced}$ be the likelihood function for the reduced model (under $H_{0}$, indicating independence). Then, we can use the likelihood ratio statistic $Λ = L_{reduced} / L_{full}$ to test $H_{0}$. Since the asymptotic distribution of $- 2 \log (Λ)$ is unknown in this setting, a bootstrap test can be employed instead.
(ii): In addition, we can consider $H_{0} : φ = 0$ versus $H_{1} : φ > 0$. In this case, the asymptotic distribution of $- 2 \log (Λ)$ under $H_{0}$ is an equally weighted mixture of chi-square distributions with zero and one degree of freedom, whose critical value is 2.7055 at a significance level of 5% [47]. In the spatial case, such a distribution might also be unknown, so that the bootstrap technique can be employed.
(iii): It is of interest to study in detail the asymptotic behavior and performance of the maximum likelihood estimators [48]. However, applying asymptotic frameworks to spatial data is not straightforward, because there are at least two relevant frameworks, which can behave quite differently when estimating the spatial dependence parameters; see details about these asymptotic frameworks and their implications in [49].
(iv): The Birnbaum–Saunders distribution is based on the normal distribution, so parameter estimation in spatial quantile regression models can be affected by atypical cases. Thus, estimation that is robust to these cases, for example based on the Birnbaum–Saunders-t distribution, can be considered to decrease their effects; see [50].
(v): Besides fixed effects, which are added to the model by regression, random effects can also be added by mixed models, which may produce a more sophisticated Birnbaum–Saunders spatial quantile regression model that is closer to reality [51].
(vi): Local influence diagnostics can be conducted for Birnbaum–Saunders spatial quantile regression, which permit the detection of the individual or combined influence of cases. Work on local influence in Birnbaum–Saunders models has been conducted by a number of authors; see, for example, [18,23,25,52].
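The critical value quoted in item (ii) can be reproduced with a short computation. The sketch below uses only the Python standard library and relies on two standard facts: a chi-square distribution with zero degrees of freedom is a point mass at zero, and a chi-square variable with one degree of freedom is the square of a standard normal variable. The helper `bootstrap_p_value` is a generic illustration of the bootstrap alternative mentioned in items (i) and (ii). It assumes that the replicated statistics have already been obtained by simulating from the fitted null model and refitting, a step that is not shown. All function names are ours.

```python
from statistics import NormalDist

_STD_NORMAL = NormalDist()


def mixture_critical_value(level=0.05):
    """Critical value c with P(W > c) = level for W ~ 0.5*chi2_0 + 0.5*chi2_1.

    Since chi2_0 is a point mass at zero, P(W > c) = 0.5 * P(chi2_1 > c) for
    c > 0, and P(chi2_1 > c) = 2 * P(Z > sqrt(c)) for a standard normal Z.
    """
    z = _STD_NORMAL.inv_cdf(1.0 - level)
    return z * z


def mixture_p_value(lr_stat):
    """P-value of an observed -2 log(Lambda) under the 50:50 mixture."""
    if lr_stat <= 0.0:
        return 1.0
    return 1.0 - _STD_NORMAL.cdf(lr_stat ** 0.5)


def bootstrap_p_value(observed, replicates):
    """Bootstrap p-value: share of replicated statistics >= the observed one."""
    exceed = sum(1 for r in replicates if r >= observed)
    return (1 + exceed) / (1 + len(replicates))


critical = mixture_critical_value(0.05)
print(round(critical, 4))
```

The printed value agrees with the 2.7055 stated in item (ii), which is the 0.90 quantile of the chi-square distribution with one degree of freedom.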

Research on these issues is in progress, and the findings will be reported in future articles.

Author Contributions

All authors contributed with results and ideas when writing this paper. All authors have read and agreed to the published version of the manuscript.

Funding

The research was partially supported by the project grants “FONDECYT 1200525” from the National Commission for Scientific and Technological Research of the Chilean government (V. Leiva) and “Puente 001/2019” from the Research Directorate of the Vice President for Research of the Pontificia Universidad Católica de Chile, Chile (M. Galea).

Acknowledgments

The authors would also like to thank the Editor and Reviewers for their constructive comments, which led to an improved presentation of the manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Score Vector and Fisher Information Matrix

Appendix A.1. Score Vector

The elements of the $(p + 2) \times 1$ score vector given in (15) are detailed as

\begin{matrix} {\dot{ℓ}}_{α} & = & - {\bar{A}}^{⊤} Σ^{- 1} \frac{\partial \bar{A}}{\partial α} + \frac{\partial}{\partial α} [log (\bar{a})], \\ {\dot{ℓ}}_{β_{j}} & = & - {\bar{A}}^{⊤} Σ^{- 1} \frac{\partial \bar{A}}{\partial β_{j}} + \frac{\partial}{\partial β_{j}} [log (\bar{a})], \\ {\dot{ℓ}}_{φ} & = & - \frac{1}{2} tr (Σ^{- 1} \frac{\partial Σ}{\partial φ}) + \frac{1}{2} {\bar{A}}^{⊤} Σ^{- 1} \frac{\partial Σ}{\partial φ} Σ^{- 1} \bar{A}, \end{matrix}

where $\partial \bar{A} / \partial α = (\partial {\bar{A}}_{k} / \partial α)$ and $\partial \bar{A} / \partial β_{j} = (\partial {\bar{A}}_{k} / \partial β_{j})$, with

\begin{matrix} \frac{\partial {\bar{A}}_{k}}{\partial α} & = & \sqrt{\frac{4 Q_{k}}{t_{k}}} [\frac{γ_{α}^{'} t_{k}}{2 α Q_{k}} - \frac{1}{{(α γ_{α})}^{2}} (γ_{α} + α γ_{α}^{'}) (\frac{t_{k} γ_{α}^{2}}{4 Q_{k}} - 1)], \\ \frac{\partial {\bar{A}}_{k}}{\partial β_{j}} & = & - \frac{1}{α γ_{α} \sqrt{t_{k} Q_{k}}} (\frac{t_{k} γ_{α}^{2}}{4 Q_{k}} + 1) \frac{1}{h^{'} (Q_{k})} x_{k j}, \\ \frac{\partial}{\partial α} [log (\bar{a})] & = & - \frac{n}{α γ_{α}} (γ_{α} + α γ_{α}^{'}) + \sum_{k = 1}^{n} \frac{2 t_{k} γ_{α} γ_{α}^{'}}{t_{k} γ_{α}^{2} + 4 Q_{k}}, \\ \frac{\partial}{\partial β_{j}} [log (\bar{a})] & = & \sum_{k = 1}^{n} (- \frac{1}{2 Q_{k}} + \frac{4}{t_{k} γ_{α}^{2} + 4 Q_{k}}) \frac{1}{h^{'} (Q_{k})} x_{k j} . \end{matrix}
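Analytic derivatives of this kind are easy to get wrong in code, so a finite-difference check can be useful. The sketch below compares the stated expression for $\partial {\bar{A}}_{k} / \partial β_{j}$ with a central difference. It assumes that ${\bar{A}}_{k} = \sqrt{4 Q_{k} / t_{k}} \, (t_{k} γ_{α}^{2} / (4 Q_{k}) - 1) / (α γ_{α})$, which is the form implied by the derivatives above and is not restated in this appendix, and it uses the square-root link $h (Q) = \sqrt{Q}$ of the fitted model. The values of $t_{k}$, $x_{k}$ and $γ_{α}$ are hypothetical, and $γ_{α}$ is treated as a constant because it does not depend on $β$.

```python
import math


def a_bar(t, q, alpha, gamma):
    """Assumed form: A_k = sqrt(4Q/t) * (t*gamma^2/(4Q) - 1) / (alpha*gamma)."""
    return math.sqrt(4.0 * q / t) * (t * gamma**2 / (4.0 * q) - 1.0) / (alpha * gamma)


def quantile_sqrt_link(beta0, beta1, x):
    """Square-root link h(Q) = sqrt(Q), so that Q = (beta0 + beta1 * x)^2."""
    return (beta0 + beta1 * x) ** 2


def h_prime_sqrt(q):
    """Derivative of the link h(Q) = sqrt(Q)."""
    return 0.5 / math.sqrt(q)


def da_dbeta(t, q, alpha, gamma, x_kj, h_prime):
    """Analytic derivative of A_k with respect to beta_j, as stated above."""
    factor = t * gamma**2 / (4.0 * q) + 1.0
    return -factor / (alpha * gamma * math.sqrt(t * q)) * x_kj / h_prime(q)


# Hypothetical values for illustration only; gamma stands in for gamma_alpha.
t, x, alpha, gamma = 0.5, 1.5, 0.2323, 2.0
beta0, beta1 = 0.3821, 0.1884

q = quantile_sqrt_link(beta0, beta1, x)
analytic = da_dbeta(t, q, alpha, gamma, x, h_prime_sqrt)

# Central finite difference in beta1 (the coefficient of x).
eps = 1e-6
q_plus = quantile_sqrt_link(beta0, beta1 + eps, x)
q_minus = quantile_sqrt_link(beta0, beta1 - eps, x)
numeric = (a_bar(t, q_plus, alpha, gamma) - a_bar(t, q_minus, alpha, gamma)) / (2 * eps)
```

The same pattern can be applied to the remaining score components once $γ_{α}$ and its derivative are coded from their definitions.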

In addition, $\partial Σ / \partial φ = (\partial σ_{i j} / \partial φ)$, with elements defined as

\frac{\partial σ_{i j}}{\partial φ} = \{\begin{matrix} \frac{h_{i j}^{δ}}{2^{δ - 1} Γ (δ)} [δ φ^{δ - 1} K_{δ} (φ h_{i j}) + φ^{δ} K_{δ}^{'} (φ h_{i j}) h_{i j}], & i \neq j; \\ 0, & i = j; \end{matrix}

where $K_{δ}^{'} (u) = d K_{δ} (u) / d u$.
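The derivative of the correlation function can be checked numerically. The sketch below assumes the Matérn form $σ (h) = (φ h)^{δ} K_{δ} (φ h) / [2^{δ - 1} Γ (δ)]$, which is consistent with the derivative above and with the special cases in Table 1. To stay within the Python standard library, $K_{δ}$ is evaluated from the integral representation $K_{δ} (u) = \int_{0}^{\infty} exp (- u \cosh t) \cosh (δ t) \, d t$ by the trapezoidal rule, and $K_{δ}^{'}$ is obtained from the recurrence $K_{δ}^{'} (u) = - [K_{δ - 1} (u) + K_{δ + 1} (u)] / 2$. These numerical choices are ours and are intended for moderate arguments (roughly $u > 10^{- 3}$). A dedicated special-function library would normally be preferable.

```python
import math


def bessel_k(nu, x, upper=20.0, steps=4000):
    """K_nu(x), x > 0, from int_0^inf exp(-x*cosh(t)) * cosh(nu*t) dt."""
    dt = upper / steps
    total = 0.5 * math.exp(-x)  # t = 0 endpoint with half weight
    for i in range(1, steps + 1):
        t = i * dt
        weight = math.exp(-x * math.cosh(t))
        if weight == 0.0:  # underflow: remaining terms are negligible
            break
        total += weight * math.cosh(nu * t)
    return total * dt


def bessel_k_prime(nu, x):
    """Derivative of K_nu via K_nu' = -(K_{nu-1} + K_{nu+1}) / 2."""
    return -0.5 * (bessel_k(nu - 1.0, x) + bessel_k(nu + 1.0, x))


def matern(h, phi, delta):
    """Assumed Matern correlation with shape delta and parameter phi."""
    if h == 0.0:
        return 1.0
    u = phi * h
    return u**delta * bessel_k(delta, u) / (2.0 ** (delta - 1.0) * math.gamma(delta))


def dmatern_dphi(h, phi, delta):
    """Derivative of sigma_ij with respect to phi, as given in Appendix A.1."""
    if h == 0.0:
        return 0.0
    u = phi * h
    const = h**delta / (2.0 ** (delta - 1.0) * math.gamma(delta))
    return const * (delta * phi ** (delta - 1.0) * bessel_k(delta, u)
                    + phi**delta * bessel_k_prime(delta, u) * h)


# Hypothetical distance and parameter values for the check.
h, phi, eps = 2.0, 0.7, 1e-5
exponential_case = matern(h, phi, 0.5)        # should equal exp(-phi * h)
analytic_whittle = dmatern_dphi(h, phi, 1.0)  # Whittle case, delta = 1
numeric_whittle = (matern(h, phi + eps, 1.0) - matern(h, phi - eps, 1.0)) / (2 * eps)
```

For $δ = 0.5$ the function reduces to $exp (- φ h)$, so its derivative with respect to $φ$ is $- h \, exp (- φ h)$, which gives a closed-form reference in addition to the finite-difference comparison.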

Appendix A.2. Information Matrix

To obtain the Fisher information matrix, $- \ddot{ℓ} (θ)$ must be evaluated at $θ = \hat{θ}$. For the BS spatial quantile regression model presented in (11), the elements of the Hessian matrix can be expressed as

\begin{array}{lcl} {\ddot{ℓ}}_{β_{j} β_{l}} & = & - [{(\frac{\partial \bar{A}}{\partial β_{l}})}^{⊤} Σ^{- 1} \frac{\partial \bar{A}}{\partial β_{j}} + {\bar{A}}^{⊤} Σ^{- 1} \frac{\partial^{2} \bar{A}}{\partial β_{j} \partial β_{l}}] + \frac{\partial}{\partial β_{l}} (\frac{\partial log (\bar{a})}{\partial β_{j}}), \\ {\ddot{ℓ}}_{β_{j} φ} & = & {\bar{A}}^{⊤} (Σ^{- 1} \frac{\partial Σ}{\partial φ} Σ^{- 1}) \frac{\partial \bar{A}}{\partial β_{j}}, \\ {\ddot{ℓ}}_{φ φ} & = & - \frac{1}{2} \frac{\partial}{\partial φ} [tr (Σ^{- 1} \frac{\partial Σ}{\partial φ})] \\ & & + \frac{1}{2} {\bar{A}}^{⊤} [(- Σ^{- 1} \frac{\partial Σ}{\partial φ} Σ^{- 1} \frac{\partial Σ}{\partial φ} + Σ^{- 1} \frac{\partial^{2} Σ}{\partial φ^{2}}) Σ^{- 1} - Σ^{- 1} \frac{\partial Σ}{\partial φ} Σ^{- 1} \frac{\partial Σ}{\partial φ} Σ^{- 1}] \bar{A}, \end{array}

where

\begin{array}{lcl} \frac{\partial^{2} {\bar{A}}_{k}}{\partial β_{j} \partial β_{l}} & = & \frac{1}{α γ_{α} \sqrt{t_{k} Q_{k}}} \{(\frac{3 t_{k} γ_{α}^{2}}{8 Q_{k}^{2}} + \frac{1}{2 Q_{k}}) \frac{1}{h^{'} (Q_{k})} \\ & & + (\frac{t_{k} γ_{α}^{2}}{4 Q_{k}} + 1) \frac{h^{″} (Q_{k})}{{[h^{'} (Q_{k})]}^{2}}\} \frac{1}{h^{'} (Q_{k})} x_{k j} x_{k l}, \\ \frac{\partial}{\partial β_{l}} (\frac{\partial log (\bar{a})}{\partial β_{j}}) & = & \sum_{k = 1}^{n} \{[\frac{1}{2 Q_{k}^{2}} - \frac{16}{{(t_{k} γ_{α}^{2} + 4 Q_{k})}^{2}}] \frac{1}{h^{'} (Q_{k})} \\ & & + [\frac{1}{2 Q_{k}} - \frac{4}{t_{k} γ_{α}^{2} + 4 Q_{k}}] \frac{h^{″} (Q_{k})}{{[h^{'} (Q_{k})]}^{2}}\} \frac{1}{h^{'} (Q_{k})} x_{k j} x_{k l}, \end{array}

and $\partial^{2} Σ / \partial φ^{2} = (\partial^{2} σ_{i j} / \partial φ^{2})$, whose elements are given by

\frac{\partial^{2} σ_{i j}}{\partial φ^{2}} = \{\begin{matrix} \frac{h_{i j}^{δ} φ^{δ - 2}}{2^{δ - 1} Γ (δ)} [δ (δ - 1) K_{δ} (φ h_{i j}) + 2 δ φ h_{i j} K_{δ}^{'} (φ h_{i j}) + φ^{2} h_{i j}^{2} K_{δ}^{″} (φ h_{i j})], & i \neq j; \\ 0, & i = j; \end{matrix}

with $K_{δ}^{″} (u) = d^{2} K_{δ} (u) / d u^{2}$. In addition, the $p \times 1$ vector ${\ddot{ℓ}}_{β α} = {[{\ddot{ℓ}}_{α β}]}^{⊤}$ and the scalar ${\ddot{ℓ}}_{φ α} = {\ddot{ℓ}}_{α φ}$ are obtained from

\begin{matrix} {\ddot{ℓ}}_{α β_{j}} & = & - [{(\frac{\partial \bar{A}}{\partial β_{j}})}^{⊤} Σ^{- 1} \frac{\partial \bar{A}}{\partial α} + {\bar{A}}^{⊤} Σ^{- 1} \frac{\partial^{2} \bar{A}}{\partial α \partial β_{j}}] + \frac{\partial}{\partial α} (\frac{\partial log (\bar{a})}{\partial β_{j}}), \\ {\ddot{ℓ}}_{α φ} & = & {\bar{A}}^{⊤} (Σ^{- 1} \frac{\partial Σ}{\partial φ} Σ^{- 1}) \frac{\partial \bar{A}}{\partial α}, \end{matrix}

where $\partial^{2} \bar{A} / \partial α \partial β_{j} = (\partial^{2} {\bar{A}}_{k} / \partial α \partial β_{j})$, with

\begin{array}{lcl} \frac{\partial^{2} {\bar{A}}_{k}}{\partial α \partial β_{j}} & = & [\frac{1}{{(α γ_{α})}^{2}} (γ_{α} + α γ_{α}^{'}) (\frac{t_{k} γ_{α}^{2}}{4 Q_{k}} + 1) - \frac{1}{α γ_{α}} (\frac{t_{k} γ_{α} γ_{α}^{'}}{2 Q_{k}})] \\ & & \times \frac{1}{\sqrt{t_{k} Q_{k}}} \frac{1}{h^{'} (Q_{k})} x_{k j}, \\ \frac{\partial}{\partial α} (\frac{\partial log (\bar{a})}{\partial β_{j}}) & = & - \sum_{k = 1}^{n} (\frac{8 t_{k} γ_{α} γ_{α}^{'}}{{(t_{k} γ_{α}^{2} + 4 Q_{k})}^{2}}) \frac{1}{h^{'} (Q_{k})} x_{k j} . \end{array}

Furthermore, we have

{\ddot{ℓ}}_{α α} = - [{(\frac{\partial \bar{A}}{\partial α})}^{⊤} Σ^{- 1} \frac{\partial \bar{A}}{\partial α} + {\bar{A}}^{⊤} Σ^{- 1} \frac{\partial^{2} \bar{A}}{\partial α^{2}}] + \frac{\partial^{2} log (\bar{a})}{\partial α^{2}},

where $\partial^{2} \bar{A} / \partial α^{2} = (\partial^{2} {\bar{A}}_{k} / \partial α^{2})$, with

\begin{array}{lcl} \frac{\partial^{2} {\bar{A}}_{k}}{\partial α^{2}} & = & \sqrt{\frac{4 Q_{k}}{t_{k}}} \{(\frac{t_{k} γ_{α}^{2}}{4 Q_{k}} - 1) [\frac{2}{{(α γ_{α})}^{3}} {(γ_{α} + α γ_{α}^{'})}^{2} - \frac{2 γ_{α}^{'} + α γ_{α}^{″}}{{(α γ_{α})}^{2}}] \\ & & - \frac{(γ_{α} + α γ_{α}^{'}) t_{k} γ_{α} γ_{α}^{'}}{2 Q_{k} {(α γ_{α})}^{2}} + \frac{t_{k}}{2 Q_{k}} (\frac{γ_{α}^{″} α - γ_{α}^{'}}{α^{2}})\} \end{array}

and

\begin{array}{lcl} \frac{\partial^{2} log (\bar{a})}{\partial α^{2}} & = & - n \frac{(2 γ_{α}^{'} + α γ_{α}^{″}) (α γ_{α}) - {(γ_{α} + α γ_{α}^{'})}^{2}}{{(α γ_{α})}^{2}} \\ & & + \sum_{k = 1}^{n} 2 t_{k} \frac{({[γ_{α}^{'}]}^{2} + γ_{α} γ_{α}^{″}) (t_{k} γ_{α}^{2} + 4 Q_{k}) - 2 t_{k} γ_{α}^{2} {(γ_{α}^{'})}^{2}}{{(t_{k} γ_{α}^{2} + 4 Q_{k})}^{2}} . \end{array}

References

1. Arrue, J.; Arellano-Valle, R.B.; Gomez, H.W.; Leiva, V. On a new type of Birnbaum-Saunders models and its inference and application to fatigue data. J. Appl. Stat. 2020.
2. Khan, M.Z.; Khan, M.F.; Aslam, M.; Mughal, A.R. Design of fuzzy sampling plan using the Birnbaum-Saunders distribution. Mathematics 2019, 7, 9.
3. Leiva, V.; Saunders, S.C. Cumulative damage models. In Wiley StatsRef: Statistics Reference Online; Wiley: Hoboken, NJ, USA, 2015; pp. 1–10.
4. Marchant, C.; Leiva, V.; Cysneiros, F.J.A.; Liu, S. Robust multivariate control charts based on Birnbaum-Saunders distributions. J. Stat. Comput. Simul. 2018, 88, 182–202.
5. Cavieres, M.F.; Leiva, V.; Marchant, C.; Rojas, F. A methodology for data-driven decision making in the monitoring of particulate matter environmental contamination in Santiago of Chile. Rev. Environ. Contam. Toxicol. 2020.
6. Leiva, V.; Santos-Neto, M.; Cysneiros, F.J.A.; Barros, M. A methodology for stochastic inventory models based on a zero-adjusted Birnbaum-Saunders distribution. Appl. Stoch. Model. Bus. Ind. 2016, 32, 74–89.
7. Carrasco, J.M.F.; Figueroa-Zuniga, J.I.; Leiva, V.; Riquelme, M.; Aykroyd, R.G. An errors-in-variables model based on the Birnbaum-Saunders and its diagnostics with an application to earthquake data. Stoch. Environ. Res. Risk Assess. 2020, 34, 1–12.
8. Martinez, S.; Giraldo, R.; Leiva, V. Birnbaum-Saunders functional regression models for spatial data. Stoch. Environ. Res. Risk Assess. 2019, 33, 1765–1780.
9. Huerta, M.; Leiva, V.; Liu, S.; Rodriguez, M.; Villegas, D. On a partial least squares regression model for asymmetric data with a chemical application in mining. Chemom. Intell. Lab. Syst. 2019, 190, 55–68.
10. Leiva, V.; Aykroyd, R.G.; Marchant, C. Discussion of “Birnbaum-Saunders distribution: A review of models, analysis, and applications” and a novel multivariate data analytics for an economics example in the textile industry. Appl. Stoch. Model. Bus. Ind. 2019, 35, 112–117.
11. Leao, J.; Leiva, V.; Saulo, H.; Tomazella, V. Incorporation of frailties into a cure rate regression model and its diagnostics and application to melanoma data. Stat. Med. 2018, 37, 4421–4440.
12. Leao, J.; Leiva, V.; Saulo, H.; Tomazella, V. A survival model with Birnbaum-Saunders frailty for uncensored and censored cancer data. Braz. J. Probab. Stat. 2018, 32, 707–729.
13. Sánchez, L.; Leiva, V.; Galea, M.; Saulo, H. Birnbaum-Saunders quantile regression and its diagnostics with application to economic data. Appl. Stoch. Models Bus. Ind. 2020.
14. Ventura, M.; Saulo, H.; Leiva, V.; Monsueto, S. Log-symmetric regression models: Information criteria, application to movie business and industry data with economic implications. Appl. Stoch. Model. Bus. Ind. 2019, 34, 963–977.
15. Koenker, R.; Bassett, G. Regression quantiles. Econometrica 1978, 46, 33–50.
16. Laplace, P. Théorie Analytique des Probabilités; Editions Jacques Gabayr: Paris, France, 1818.
17. Dasilva, A.; Dias, R.; Leiva, V.; Marchant, C.; Saulo, H. Birnbaum-Saunders regression models: A comparative evaluation of three approaches. J. Stat. Comput. Simul. 2020, in press.
18. Saulo, H.; Leao, J.; Leiva, V.; Aykroyd, R.G. Birnbaum-Saunders autoregressive conditional duration models applied to high-frequency financial data. Stat. Pap. 2019, 60, 1605–1629.
19. Diggle, P.; Ribeiro, P. Model-Based Geostatistics; Springer: New York, NY, USA, 2007.
20. Kostov, P. A spatial quantile regression hedonic model of agricultural land prices. Spat. Econ. Anal. 2009, 4, 53–72.
21. Trzpiot, G. Spatial quantile regression. Comp. Econ. Res. 2013, 15, 265–279.
22. McMillen, D. Quantile Regression for Spatial Data; Springer: New York, NY, USA, 2013.
23. Garcia-Papani, F.; Uribe-Opazo, M.A.; Leiva, V.; Aykroyd, R.G. Birnbaum-Saunders spatial modelling and diagnostics applied to agricultural engineering data. Stoch. Environ. Res. Risk Assess. 2017, 31, 105–124.
24. Garcia-Papani, F.; Leiva, V.; Ruggeri, F.; Uribe-Opazo, M.A. Kriging with external drift in a Birnbaum-Saunders geostatistical model. Stoch. Environ. Res. Risk Assess. 2018, 32, 1517–1530.
25. Garcia-Papani, F.; Leiva, V.; Uribe-Opazo, M.A.; Aykroyd, R.G. Birnbaum-Saunders spatial regression models: Diagnostics and application to chemical data. Chemom. Intell. Lab. Syst. 2018, 177, 114–128.
26. Liu, Y.; Mao, G.; Leiva, V.; Liu, S.; Tapia, A. Diagnostic analytics for an autoregressive model under the skew-normal distribution. Mathematics 2020, 8, 693.
27. Sánchez, L.; Leiva, V.; Caro-Lopera, F.J.; Cysneiros, F.J.A. On matrix-variate Birnbaum-Saunders distributions and their estimation and application. Braz. J. Probab. Stat. 2015, 29, 790–812.
28. Kundu, D. Bivariate sinh-normal distribution and a related model. Braz. J. Probab. Stat. 2015, 20, 590–607.
29. Kundu, D.; Balakrishnan, N.; Jamalizadeh, A. Generalized multivariate Birnbaum-Saunders distributions and related inferential issues. J. Multivar. Anal. 2013, 116, 230–244.
30. Dobson, A. An Introduction to Statistical Modelling; Chapman and Hall: New York, NY, USA, 2002.
31. Leiva, V.; Santos-Neto, M.; Cysneiros, F.J.A.; Barros, M. Birnbaum-Saunders statistical modelling: A new approach. Stat. Model. 2014, 14, 21–48.
32. Santos-Neto, M.; Cysneiros, F.J.A.; Leiva, V.; Barros, M. Reparameterized Birnbaum-Saunders regression models with varying precision. Electron. J. Stat. 2016, 10, 2825–2855.
33. Diaz-Garcia, J.A.; Leiva, V.; Galea, M. Singular elliptic distribution: Density and applications. Commun. Stat. Theory Methods 2002, 31, 665–681.
34. Kundu, D.; Balakrishnan, N.; Jamalizadeh, A. Bivariate Birnbaum-Saunders distribution and associated inference. J. Multivar. Anal. 2010, 101, 113–125.
35. Saulo, H.; Leao, J.; Vila, R.; Leiva, V.; Tomazella, V. On mean-based bivariate Birnbaum-Saunders distributions: Properties, inference and application. Commun. Stat. Theory Methods 2020.
36. Stein, M.L. Interpolation of Spatial Data: Some Theory for Kriging; Springer: New York, NY, USA, 1999.
37. Mardia, K.; Marshall, R. Maximum likelihood estimation of models for residual covariance in spatial regression. Biometrika 1984, 71, 135–146.
38. Gradshteyn, I.; Ryzhik, I. Tables of Integrals, Series and Products; Academic Press: New York, NY, USA, 2000.
39. Zhang, H.; Wang, Y. Kriging and cross-validation for massive spatial data. Environmetrics 2010, 21, 290–304.
40. Nocedal, J.; Wright, S. Numerical Optimization; Springer: New York, NY, USA, 1999.
41. Lange, K. Numerical Analysis for Statisticians; Springer: New York, NY, USA, 2001.
42. R-Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2018.
43. Marchant, C.; Leiva, V.; Cysneiros, F.J.A.; Vivanco, J.F. Diagnostics in multivariate generalized Birnbaum-Saunders regression models. J. Appl. Stat. 2016, 43, 2829–2849.
44. Dunn, P.; Smyth, G. Randomized quantile residuals. J. Comput. Graph. Stat. 1996, 5, 236–244.
45. Ferreira, M.; Gomes, M.I.; Leiva, V. On an extreme value version of the Birnbaum-Saunders distribution. REVSTAT 2012, 10, 181–210.
46. Bhatti, C. The Birnbaum–Saunders autoregressive conditional duration model. Math. Comput. Simul. 2010, 80, 2063–2078.
47. Song, P.X.K.; Zhang, P.; Qu, A. Maximum likelihood inference in robust linear mixed-effects models using the multivariate t distributions. Stat. Sin. 2007, 17, 929–943.
48. Genton, M.G.; Zhang, H. Identifiability problems in some non-Gaussian spatial random fields. Chil. J. Stat. 2012, 3, 171–179.
49. Zhang, H.; Zimmerman, D.L. Towards reconciling two asymptotic frameworks in spatial statistics. Biometrika 2005, 92, 921–936.
50. Athayde, E.; Azevedo, A.; Barros, M.; Leiva, V. Failure rate of Birnbaum-Saunders distributions: Shape, change-point, estimation and robustness. Braz. J. Probab. Stat. 2019, 33, 301–328.
51. Villegas, C.; Paula, G.A.; Leiva, V. Birnbaum-Saunders mixed models for censored reliability data analysis. IEEE Trans. Reliab. 2011, 60, 748–758.
52. Santana, L.; Vilca, F.; Leiva, V. Influence analysis in skew-Birnbaum-Saunders regression models and applications. J. Appl. Stat. 2011, 38, 1633–1649.

Figure 1. Plots of the reparametrized bivariate BS PDF for $α_{i} = 0.5$ (a), $α_{i} = 0.8$ (b), $α_{i} = 1.5$ (c) with $Q_{i} = 1.0$, and $Q_{i} = 0.5$ (d), $Q_{i} = 0.8$ (e), $Q_{i} = 1.5$ (f) with $α_{i} = 1.0$, for $i = 1, 2$ and $σ = 0.9$.

Figure 2. Plots of the reparametrized bivariate BS PDF for $α_{i} = 0.5$ and $Q_{i} = 1.0$, for $i = 1, 2$, with $σ = 0.9$; panels (a–d) show the surface from different angles.

Figure 3. Histogram (a), boxplot (b), scatterplot (c), and semi-variogram (d) for the response variable with chemical data.

Table 1. Particular cases of the Matérn correlation function with h denoting a distance measure.

Model	Shape Parameter	Correlation Function
Exponential	$δ = 0.5$	$σ (h) = exp (- φ h)$
Whittle	$δ = 1.0$	$σ (h) = φ h K_{1} (φ h)$
Gaussian	$δ \to \infty$	$σ (h) = exp (- {(φ h)}^{2})$

Table 2. Values of log-likelihood, CAIC, and BIC for indicated models with chemical data.

Model	$ℓ (\hat{θ})$	CAIC	BIC
Gaussian	−32.1411	70.5900	77.5024
BS–identity link	−36.3659	81.2513	90.3587
BS–logarithm link	−36.3659	81.2513	90.3587
BS–square root link	−24.9112	58.3419	67.4493
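
The criteria in Table 2 can be recomputed from the log-likelihood values as a consistency check. The sketch below assumes that CAIC denotes the small-sample corrected criterion $- 2 ℓ (\hat{θ}) + 2 k n / (n - k - 1)$, that BIC is $- 2 ℓ (\hat{θ}) + k \log (n)$, that $n = 82$ locations are used, and that $k = 3$ for the Gaussian model and $k = 4$ for the BS models. These definitions and parameter counts are our assumptions, and they reproduce the tabulated values up to rounding.

```python
import math

N_OBS = 82  # number of locations, i = 1, ..., 82


def caic(loglik, k, n=N_OBS):
    """Corrected AIC (assumed definition): -2*loglik + 2*k*n / (n - k - 1)."""
    return -2.0 * loglik + 2.0 * k * n / (n - k - 1)


def bic(loglik, k, n=N_OBS):
    """Bayesian information criterion: -2*loglik + k*log(n)."""
    return -2.0 * loglik + k * math.log(n)


# (log-likelihood from Table 2, assumed number of parameters k) per model
models = {
    "Gaussian": (-32.1411, 3),
    "BS-identity link": (-36.3659, 4),
    "BS-logarithm link": (-36.3659, 4),
    "BS-square root link": (-24.9112, 4),
}

criteria = {name: (caic(ll, k), bic(ll, k)) for name, (ll, k) in models.items()}
best = min(criteria, key=lambda name: criteria[name][1])  # smallest BIC
```

Under both criteria the square-root link gives the smallest value, which matches the link used in the fitted model.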

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
