Multivariate Gamma Regression: Parameter Estimation, Hypothesis Testing, and Its Application

Rahayu, Anita; Purhadi,; Sutikno,; Prastyo, Dedy Dwi

doi:10.3390/sym12050813

Open AccessArticle

Multivariate Gamma Regression: Parameter Estimation, Hypothesis Testing, and Its Application

¹

Department of Statistics, Institut Teknologi Sepuluh Nopember, Surabaya 60111, Indonesia

²

Department of Statistics, Bina Nusantara University, Jakarta 11480, Indonesia

^*

Author to whom correspondence should be addressed.

Symmetry 2020, 12(5), 813; https://doi.org/10.3390/sym12050813

Submission received: 17 April 2020 / Revised: 3 May 2020 / Accepted: 7 May 2020 / Published: 14 May 2020

Download

Browse Figures

Versions Notes

Abstract

Gamma distribution is a general type of statistical distribution that can be applied in various fields, mainly when the distribution of data is not symmetrical. When predictor variables also affect positive outcome, then gamma regression plays a role. In many cases, the predictor variables give effect to several responses simultaneously. In this article, we develop a multivariate gamma regression (MGR), which is one type of non-linear regression with response variables that follow a multivariate gamma (MG) distribution. This work also provides the parameter estimation procedure, test statistics, and hypothesis testing for the significance of the parameter, partially and simultaneously. The parameter estimators are obtained using the maximum likelihood estimation (MLE) that is optimized by numerical iteration using the Berndt–Hall–Hall–Hausman (BHHH) algorithm. The simultaneous test for the model’s significance is derived using the maximum likelihood ratio test (MLRT), whereas the partial test uses the Wald test. The proposed MGR model is applied to model the three dimensions of the human development index (HDI) with five predictor variables. The unit of observation is regency/municipality in Java, Indonesia, in 2018. The empirical results show that modeling using multiple predictors makes more sense compared to the model when it only employs a single predictor.

Keywords:

human development dimensions; maximum likelihood estimation; maximum likelihood ratio test; multivariate gamma regression; Wald test

1. Introduction

Gamma distribution is one family of continuous probability distributions and generalizations of exponential distributions [1]. Nagar, Correa, and Gupta [2] mentioned that the gamma distribution function was first introduced by Swiss mathematician Leonhard Euler (1729). Because this function is considered important, many researchers have studied and developed it. Bhattacharya [3], among others, conducted a study on testing the homogeneity of the parameters (shape and scale) of the gamma distribution. Chen and Kotz [4] conducted a study on the probability density function (pdf) of gamma distribution with three parameters (shape, scale, and location). Many researchers also study and develop bivariate gamma distribution; among others are Schickedanz and Krause [5], who conducted a study on testing scale parameters from two gamma-distributed data using the generalized likelihood ratio (GLR). Nadarajah [6] studied the types of bivariate gamma distribution. Next, Nadarajah and Gupta [7] developed two new bivariate gamma distributions based on gamma and beta random variables. In addition, Mathai and Moschopoulos [8] discussed joint densities, product moments, conditional densities, and conditional moments that were developed from two bivariate gamma distributions.

One statistical method that can be applied to analyze the data that follow gamma distribution and its predictor variables is gamma regression. Gamma regression is a type of non-linear regression. A non-linear regression contains at least one parameter with a non-linear form [9,10]. The gamma regression with multiple responses is the so-called multivariate gamma regression (MGR).

The MGR model proposed in this article is the extension of the trivariate gamma regression (TGR) proposed by Rahayu, Purhadi, Sutikno, and Prastyo [11], which describes the theory of parameter estimation and its hypothesis testing. The MGR is developed based on multivariate gamma distribution with three parameters (shape, scale, and location). The supporting references about multivariate gamma distribution were written by Mathai and Moschopoulos [12], and Vaidyanathan and Lakshmi [13]. The parameter estimation method for MGR in this study uses maximum likelihood estimation (MLE). However, the solution cannot be obtained in the closed form. Therefore, a numerical method is needed to achieve the parameter estimator value. The numerical optimization used in this study is the Berndt–Hall–Hall–Hausman (BHHH).

Based on the previously mentioned background, the aims of this study are: (i) how to construct the MGR model, (ii) how to estimate the parameters, and (iii) how to test the significance of the model as well as the significance of the individual parameter. The last objective of this work is how to apply the proposed MGR model to real data. The case study used in this study includes the factors that affect the life expectancy index (first response), education index (second response), and expenditure index (third response), the three indexes that compose the human development dimensions. The unit of observation is the regency/municipality in Java, Indonesia, in 2018. The predictor variables include the percentage of households that have a private toilet, net enrollment rate of schooling, population density, the percentage of poor people, and the unemployment rate.

The rest of the article is organized as follows. Section 2 introduces the detail of the proposed MGR model. Section 3 and Section 4 explore the data and application, respectively. The last section contains conclusions and further research.

2. Multivariate Gamma Regression Model

Suppose

y_{l}

is the response variables data

(y_{l 1}, y_{l 2}, \dots, y_{l k})

that follows multivariate gamma distribution and

x_{l}

is the corresponding predictor variables

(x_{l 1}, x_{l 2}, \dots, x_{l s})

, with sample size as n observations

(l = 1, 2, \dots, n) .

In this section, we discuss the construction of the MGR model, its parameter estimation, and hypothesis testing. A short explanation about univariate gamma regression is introduced to make a smooth transition into the MGR model.

According to Balakrishnan and Wang [14], a random variable

Y

follows univariate gamma distribution with three parameters

(α, γ, λ),

denoted by

Y ~ Gamma (α, γ, λ),

with pdf formulated in Equation (1).

f (y) = {\begin{matrix} \frac{1}{γ^{α} Γ (α)} {(y - λ)}^{α - 1} e^{- \frac{y - λ}{γ}}; & α, γ, λ > 0, λ < y < \infty, \\ ; & otherwise . \end{matrix}

(1)

If

Y ~ Gamma (α, γ, λ)

, then the statistics are as follows [15,16].

μ = E (y) = γ α + λ

,

V a r (y) = α γ^{2}

,

S t d e v (y) = \sqrt{α} γ,

and the skewness is

γ_{1} = \frac{2}{\sqrt{α}} .

Mathai and Moschopoulos (1992) defined the pdf as in Equation (2) for a pair of random variables

(Y_{1}, Y_{2})

that follows bivariate gamma distribution as:

f (y_{1}, y_{2}) = \frac{{(y_{1} - λ_{1})}^{α_{1} - 1} {(y_{2} - y_{1} - λ_{2})}^{α_{2} - 1} e^{- \frac{y_{2} - \sum_{i = 1}^{2} λ_{i}}{γ}}}{γ^{α_{2^{*}}} \prod_{i = 1}^{2} Γ (α_{i})},

(2)

with

α_{i} > 0, γ > 0, λ_{i} \in R, λ_{1} < y_{1} < \infty, λ_{2} < y_{2} < \infty, α_{2^{*}} = α_{1} + α_{2}, i = 1, 2,

f (y_{1}, y_{2}) = 0

for otherwise.

The mean for

Y_{1}

and

Y_{2}

are

E (Y_{1}) = γ α_{1} + λ_{1}

and

E (Y_{2}) = γ (α_{1} + α_{2}) + λ_{1} + λ_{2}

, while the variances are

V a r (Y_{1}) = γ^{2} α_{1} and V a r (Y_{2}) = γ^{2} (α_{1} + α_{2}) .

Suppose there are k response variables; the pdf for random variables

(Y_{1}, Y_{2}, \dots, Y_{k})

that follow multivariate gamma distribution (Mathai and Moschopoulos, 1992) is:

f (y_{1}, y_{2}, \dots, y_{k}) = \frac{{(y_{1} - λ_{1})}^{α_{1} - 1} {(y_{2} - y_{1} - λ_{2})}^{α_{2} - 1} \dots {(y_{k} - y_{k - 1} - λ_{k})}^{α_{k} - 1} e^{- \frac{y_{k} - \sum_{i = 1}^{k} λ_{i}}{γ}}}{γ^{α_{k^{*}}} \prod_{i = 1}^{k} Γ (α_{i})},

(3)

with

α_{i} > 0, γ > 0, λ_{i} \in R,

λ_{1} < y_{1} < \infty, λ_{2} < y_{2} < \infty, λ_{k} < y_{k} < \infty, α_{k^{*}} = α_{1} + α_{2} + \dots + α_{k}, i = 1, 2, \dots k,

otherwise

f (y_{1}, y_{2}, \dots, y_{k}) = 0

.

The mean and variance for

Y_{i}

are

E (Y_{i}) = γ α_{i^{*}} + λ_{i^{*}}

and

V a r (Y_{i}) = γ^{2} α_{i^{*}}

with

α_{i^{*}} = α_{1} + α_{2} + \dots + α_{i}

and

λ_{i^{*}} = λ_{1} + λ_{2} + \dots + λ_{i} .

The MGR model can be stated in Equation (4).

E (Y_{i}) = γ α_{i^{*}} + λ_{i^{*}} = e^{x^{T} β_{i}}, i = 1, 2, \dots, k,

(4)

with

α_{i^{*}} = α_{1} + α_{2} + \dots + α_{i}, λ_{i^{*}} = λ_{1} + λ_{2} + \dots + λ_{i} .

The pdf for the lth observation is formulated in Equation (5) which will be used to compose the likelihood function in Equation (6).

f (y_{l 1}, y_{l 2}, \dots, y_{l k}) = \frac{{(y_{l 1} - λ_{1})}^{α_{1} - 1} {(y_{l 2} - y_{l 1} - λ_{2})}^{α_{2} - 1} \dots {(y_{l k} - y_{l (k - 1)} - λ_{k})}^{α_{k} - 1} e^{- \frac{y_{l k} - \sum_{i = 1}^{k} λ_{i}}{γ}}}{γ^{α_{k}^{*}} Γ (α_{1}) Γ (α_{2}) \dots Γ (α_{k})},

(5)

with

α_{i} > 0, γ > 0, λ_{i} \in R,

λ_{1} < y_{l 1} < \infty, λ_{2} < y_{l 2} < \infty, λ_{k} < y_{l k} < \infty,

$α_{1} = \frac{e^{x_{l}^{T} β_{1}} - λ_{1}}{γ}, α_{2} = \frac{e^{x_{l}^{T} β_{2}} - e^{x_{l}^{T} β_{1}} - λ_{2}}{γ}, \dots, α_{k} = \frac{e^{x_{l}^{T} β_{k}} - e^{x_{l}^{T} β_{k - 1}} - λ_{k}}{γ},$
$α_{k^{*}} = α_{1} + α_{2} + \dots + α_{k} = \frac{e^{x_{l}^{T} β_{k}} - λ_{1} - λ_{2} - \dots - λ_{k}}{γ},$ otherwise $f (y_{l 1}, y_{l 2}, \dots, y_{l k}) = 0$ .

Later, we discuss parameter estimation on MGR using MLE. The likelihood function constructed from Equation (5) is:

\begin{array}{l} L (γ, λ_{1}, λ_{2}, \dots, λ_{k}, β_{1}, β_{2}, \dots, β_{k}) = \prod_{l = 1}^{n} f (y_{l 1}, y_{l 2}, \dots, y_{l k}) \\ = \prod_{l = 1}^{n} (\frac{{(y_{l 1} - λ_{1})}^{α_{1} - 1} {(y_{l 2} - y_{l 1} - λ_{2})}^{α_{2} - 1} \dots {(y_{l k} - y_{l (k - 1)} - λ_{k})}^{α_{k} - 1} e^{- \frac{y_{l k} - \sum_{i = 1}^{k} λ_{i}}{γ}}}{γ^{α_{k}^{*}} Γ (α_{1}) Γ (α_{2}) \dots Γ (α_{k})}), \end{array}

(6)

with values

α_{1}, α_{2}, α_{k}, and α_{k^{*}}

based on Equation (5).

The log-likelihood function from Equation (6) is:

\begin{array}{l} \log L (γ, λ_{1}, λ_{2}, \dots, λ_{k}, β_{1}, β_{2}, \dots, β_{k}) \\ = \sum_{l = 1}^{n} \log [\frac{{(y_{l 1} - λ_{1})}^{α_{1} - 1} {(y_{l 2} - y_{l 1} - λ_{2})}^{α_{2} - 1} \dots {(y_{l k} - y_{l (k - 1)} - λ_{k})}^{α_{k} - 1} e^{- \frac{y_{l k} - \sum_{i = 1}^{k} λ_{i}}{γ}}}{γ^{α_{k^{*}}} Γ (α_{1}) Γ (α_{2}) \dots Γ (α_{k})}] . \end{array}

By substituting the values of

α_{1}, α_{2}, α_{k}, and α_{k^{*}}

according to Equation (5), the log-likelihood function is:

\begin{array}{l} \log L (γ, λ_{1}, λ_{2}, \dots, λ_{k}, β_{1}, β_{2}, \dots, β_{k}) \\ = \sum_{l = 1}^{n} \frac{e^{x_{l}^{T} β_{1}} - λ_{1} - γ}{γ} \log (y_{l 1} - λ_{1}) + \sum_{l = 1}^{n} \frac{e^{x_{l}^{T} β_{2}} - e^{x_{l}^{T} β_{1}} - λ_{2} - γ}{γ} \log (y_{l 2} - y_{l 1} - λ_{2}) + \dots + \\ \sum_{l = 1}^{n} \frac{e^{x_{l}^{T} β_{k}} - e^{x_{l}^{T} β_{k - 1}} - λ_{k} - γ}{γ} \log (y_{l k} - y_{l (k - 1)} - λ_{k}) - \sum_{l = 1}^{n} \frac{y_{l k} - λ_{1} - λ_{2} - \dots - λ_{k}}{γ} - \\ \sum_{l = 1}^{n} \frac{e^{x_{l}^{T} β_{k}} - λ_{1} - λ_{2} - \dots - λ_{k}}{γ} \log γ - \sum_{l = 1}^{n} \log Γ (\frac{e^{x_{l}^{T} β_{1}} - λ_{1}}{γ}) - \sum_{l = 1}^{n} \log Γ (\frac{e^{x_{l}^{T} β_{2}} - e^{x_{l}^{T} β_{1}} - λ_{2}}{γ}) - \dots - \\ \sum_{l = 1}^{n} \log Γ (\frac{e^{x_{l}^{T} β_{k}} - e^{x_{l}^{T} β_{k - 1}} - λ_{k}}{γ}) . \end{array}

(7)

In this article, the log value is based on e or natural logarithm. The first derivatives of the log-likelihood function for each parameter are as follows.

\begin{matrix} \begin{array}{l} \frac{\partial \log L (γ, λ_{1}, λ_{2}, \dots, λ_{k}, β_{1}, β_{2}, \dots, β_{k})}{\partial γ} = \sum_{l = 1}^{n} (\frac{λ_{1} - e^{x_{l}^{T} β_{1}}}{γ^{2}} \log (y_{l 1} - λ_{1})) + \\ \sum_{l = 1}^{n} (\frac{e^{x_{l}^{T} β_{1}} - e^{x_{l}^{T} β_{2}} + λ_{2}}{γ^{2}} \log (y_{l 2} - y_{l 1} - λ_{2})) + \dots + \sum_{l = 1}^{n} (\frac{e^{x_{l}^{T} β_{k - 1}} - e^{x_{l}^{T} β_{k}} + λ_{k}}{γ^{2}} \log (y_{l k} - y_{l (k - 1)} - λ_{k})) + \\ \sum_{l = 1}^{n} \frac{y_{l k}}{γ^{2}} - \frac{n λ_{1}}{γ^{2}} - \frac{n λ_{2}}{γ^{2}} - \frac{n λ_{3}}{γ^{2}} - (\frac{n (\log γ) λ_{1}}{γ^{2}} - \frac{n λ_{1}}{γ^{2}} + \frac{n (\log γ) λ_{2}}{γ^{2}} - \frac{n λ_{2}}{γ^{2}} + \frac{n (\log γ) λ_{3}}{γ^{2}} - \frac{n λ_{3}}{γ^{2}} + \end{array} \\ \sum_{l = 1}^{n} (- \frac{(\log γ) e^{x_{l}^{T} β_{k}}}{γ^{2}} + \frac{e^{x_{l}^{T} β_{k}}}{γ^{2}})) - \sum_{l = 1}^{n} (- \frac{1}{γ^{2}} (Ψ (\frac{e^{x_{l}^{T} β_{1}} - λ_{1}}{γ})) (e^{x_{l}^{T} β_{1}} - λ_{1})) - \\ \sum_{l = 1}^{n} (- \frac{1}{γ^{2}} (Ψ (\frac{e^{x_{l}^{T} β_{2}} - e^{x_{l}^{T} β_{1}} - λ_{2}}{γ})) (e^{x_{l}^{T} β_{2}} - e^{x_{l}^{T} β_{1}} - λ_{2})) - \dots - \\ \sum_{l = 1}^{n} (- \frac{1}{γ^{2}} (Ψ (\frac{e^{x_{l}^{T} β_{k}} - e^{x_{l}^{T} β_{k - 1}} - λ_{k}}{γ})) (e^{x_{l}^{T} β_{k}} - e^{x_{l}^{T} β_{k - 1}} - λ_{k})), \end{matrix}

(8)

\begin{array}{l} \frac{\partial \log L (γ, λ_{1}, λ_{2}, \dots, λ_{k}, β_{1}, β_{2}, \dots, β_{k})}{\partial λ_{1}} = \sum_{l = 1}^{n} (- \frac{\log (y_{l 1} - λ_{1})}{γ} - \frac{e^{x_{l}^{T} β_{1}} - λ_{1} - γ}{γ (y_{l 1} - λ_{1})}) + \frac{n}{γ} + \frac{n (\log γ)}{γ} - \\ \sum_{l = 1}^{n} (- \frac{1}{γ} Ψ (\frac{e^{x_{l}^{T} β_{1}} - λ_{1}}{γ})), \end{array}

(9)

\begin{array}{l} \frac{\partial \log L (γ, λ_{1}, λ_{2}, \dots, λ_{k}, β_{1}, β_{2}, \dots, β_{k})}{\partial λ_{2}} = \sum_{l = 1}^{n} (- \frac{\log (y_{l 2} - y_{l 1} - λ_{2})}{γ} - \frac{e^{x_{l}^{T} β_{2}} - e^{x_{l}^{T} β_{1}} - λ_{2} - γ}{γ (y_{l 2} - y_{l 1} - λ_{2})}) + \frac{n}{γ} + \\ \frac{n (\log γ)}{γ} - \sum_{l = 1}^{n} (- \frac{1}{γ} Ψ (\frac{e^{x_{l}^{T} β_{2}} - e^{x_{l}^{T} β_{1}} - λ_{2}}{γ})), \end{array}

(10)

\begin{array}{l} \frac{\partial \log L (γ, λ_{1}, λ_{2}, \dots, λ_{k}, β_{1}, β_{2}, \dots, β_{k})}{\partial λ_{k}} = \sum_{l = 1}^{n} (- \frac{\log (y_{l k} - y_{l (k - 1)} - λ_{k})}{γ} - \frac{e^{x_{l}^{T} β_{k}} - e^{x_{l}^{T} β_{k - 1}} - λ_{k} - γ}{γ (y_{l k} - y_{l (k - 1)} - λ_{k})}) + \\ \frac{n}{γ} + \frac{n (\log γ)}{γ} - \sum_{l = 1}^{n} (- \frac{1}{γ} Ψ (\frac{e^{x_{l}^{T} β_{k}} - e^{x_{l}^{T} β_{k - 1}} - λ_{k}}{γ})), \end{array}

(11)

\begin{array}{l} \frac{\partial \log L (γ, λ_{1}, λ_{2}, \dots, λ_{k}, β_{1}, β_{2}, \dots, β_{k})}{\partial β_{1}} = \sum_{l = 1}^{n} ((\log (y_{l 1} - λ_{1})) \frac{x_{l}^{T} e^{x_{l}^{T} β_{1}}}{γ}) - \\ \sum_{l = 1}^{n} ((\log (y_{l 2} - y_{l 1} - λ_{2})) \frac{x_{l}^{T} e^{x_{l}^{T} β_{1}}}{γ}) - \sum_{l = 1}^{n} ((Ψ (\frac{e^{x_{l}^{T} β_{1}} - λ_{1}}{γ})) \frac{x_{l}^{T} e^{x_{l}^{T} β_{1}}}{γ}) - \\ \sum_{l = 1}^{n} (- (Ψ (\frac{e^{x_{l}^{T} β_{2}} - e^{x_{l}^{T} β_{1}} - λ_{2}}{γ})) \frac{x_{l}^{T} e^{x_{l}^{T} β_{1}}}{γ}), \end{array}

(12)

\begin{array}{l} \frac{\partial \log L (γ, λ_{1}, λ_{2}, \dots, λ_{k}, β_{1}, β_{2}, \dots, β_{k})}{\partial β_{2}} = \sum_{l = 1}^{n} ((\log (y_{l 2} - y_{l 1} - λ_{2})) \frac{x_{l}^{T} e^{x_{l}^{T} β_{2}}}{γ}) - \\ \sum_{l = 1}^{n} ((Ψ (\frac{e^{x_{l}^{T} β_{2}} - e^{x_{l}^{T} β_{1}} - λ_{2}}{γ})) \frac{x_{l}^{T} e^{x_{l}^{T} β_{2}}}{γ}) - \sum_{l = 1}^{n} (- (Ψ (\frac{e^{x_{l}^{T} β_{3}} - e^{x_{l}^{T} β_{2}} - λ_{3}}{γ})) \frac{x_{l}^{T} e^{x_{l}^{T} β_{2}}}{γ}), \end{array}

(13)

\begin{array}{l} \frac{\partial \log L (γ, λ_{1}, λ_{2}, \dots, λ_{k}, β_{1}, β_{2}, \dots, β_{k})}{\partial β_{k}} = \sum_{l = 1}^{n} ((\log (y_{l k} - y_{l (k - 1)} - λ_{k})) \frac{x_{l}^{T} e^{x_{l}^{T} β_{k}}}{γ}) - \\ \sum_{l = 1}^{n} ((\log γ) \frac{x_{l}^{T} e^{x_{l}^{T} β_{k}}}{γ}) - \sum_{l = 1}^{n} ((Ψ (\frac{e^{x_{l}^{T} β_{k}} - e^{x_{l}^{T} β_{k - 1}} - λ_{k}}{γ})) \frac{x_{l}^{T} e^{x_{l}^{T} β_{k}}}{γ}), \end{array}

(14)

with

Ψ (z)

= digamma function, which is the first derivative of gamma function, formulated with

Ψ (z) = \frac{\partial [\log Γ (z)]}{d z} = \frac{Γ^{'} (z)}{Γ (z)} .

A maximum likelihood (ML) can be found by setting all the derivatives above to zero and solving the system. No closed-form solution to that system can be found. A numerical method is needed to obtain the solution, i.e., parameter estimate

\hat{γ}, {\hat{λ}}_{1}, {\hat{λ}}_{2}, \dots, {\hat{λ}}_{k}, {\hat{β}}_{1}, {\hat{β}}_{2}, \dots, {\hat{β}}_{k} .

One of the numerical techniques that can be employed is the BHHH algorithm as follows.

Step 1. Determine the initial value for ${\hat{θ}}^{(0)} = {[{\hat{γ}}^{(0)} {\hat{λ}}_{1}^{(0)} {\hat{λ}}_{2}^{(0)} \dots {\hat{λ}}_{k}^{(0)} {\hat{β}}_{1}^{T (0)} {\hat{β}}_{2}^{T (0)} \dots {\hat{β}}_{k}^{T (0)}]}^{T},$ where ${\hat{γ}}^{(0)} > 0, {\hat{λ}}_{1}^{(0)}, {\hat{λ}}_{2}^{(0)}, \dots, {\hat{λ}}_{k}^{(0)} \in R$ satisfies the constraints in Equation (5), and ${\hat{β}}_{1}^{T (0)}, {\hat{β}}_{2}^{T (0)}, \dots, {\hat{β}}_{k}^{T (0)}$ are obtained from the estimate of univariate gamma regression. The Hessian $H (\hat{θ})$ in BHHH is approximated as the negative of the sum of the outer products of the gradients of individual observations. The gradient vector $g (\hat{θ})$ is a vector with each element consisting of the first derivative of the log-likelihood function for each of the estimated parameters.
Step 2. Determine the tolerance limit so that the BHHH iteration process stops. In this study, the tolerance value used is $ε = 10^{- 8} .$
Step 3. Start the BHHH iteration using the following formula.

${\hat{θ}}^{(p + 1)} = {\hat{θ}}^{(p)} - H^{- 1} ({\hat{θ}}^{(p)}) g ({\hat{θ}}^{(p)}),$

(15)

with $p = 0, 1, 2, \dots, p^{*} .$
Step 4. The iteration stops at the $p^{*} - t h$ iteration if it satisfies $‖ {\hat{θ}}^{(p^{*} + 1)} - {\hat{θ}}^{(p *)} ‖ \leq ε$ . When converging, the last iteration produces an estimator value for each parameter.

The null hypothesis on the MGR model is

H_{0} : β_{11} = β_{21} = \dots = β_{s 1} = β_{12} = β_{22} = \dots = β_{s 2} = \dots =

β_{1 k} = β_{2 k} = \dots = β_{s k} = 0

and alternative hypothesis H₁. At least one

β_{q i} \neq 0,

with

q = 1, 2, \dots, s, i = 1,

2, \dots, k .

Ω = {γ, λ_{1}, \dots, λ_{k}, β_{1}, β_{2}, \dots, β_{k}}

is the set of parameters under the population. The

ω = {γ, λ_{1},

\dots, λ_{k}, β_{01}, β_{02}, \dots, β_{0 k}}

is the set of parameters under the null hypothesis. The first derivatives of the log-likelihood function for each parameter under the null hypothesis are provided in Appendix A.

Proposition 1.

If

Ω

is a set of parameters under the population,

ω

is a set of parameters under the null hypothesis, and the hypothesis being used is the simultaneous test of MGR model, then the test statistic is

G^{2} = - 2 \log Λ = 2 \log L (\hat{Ω}) - 2 \log L (\hat{ω}) .

A Corollary of Proposition 1:

The hypothesis being used in the simultaneous test of the MGR model in Section 2 can be stated in the following form: the null hypothesis is

β^{*} = 0_{(s k \times 1)}

and the alternative hypothesis is

β^{*} \neq 0_{(s k \times 1)},

with

β^{*} = {[\begin{matrix} β_{1}^{* T} & β_{2}^{* T} & \dots & β_{i}^{* T} & \dots & β_{k}^{* T} \end{matrix}]}^{T}

and

β_{i}^{*} = [\begin{matrix} β_{1 i} & β_{2 i} & \dots & β_{s i} \end{matrix}]

for

i = 1, 2, \dots, k

.

It is noted that

{\hat{θ}}_{Ω}

and

{\hat{θ}}_{ω}

are estimators that maximize the likelihood and the log-likelihood functions under the population and under the null hypothesis. The principle of the MLE method is to maximize the likelihood functions [17]. The following are test statistics for the hypothesis being used in the simultaneous test of the MGR model in Section 2.

Λ = \frac{L (\hat{ω})}{L (\hat{Ω})} < Λ_{0},

(16)

where

Λ_{0}

is a constant value between

0 < Λ_{0} \leq 1

.

L (\hat{ω})

and

L (\hat{Ω})

in Equation (16) are:

L (\hat{ω}) = \prod_{l = 1}^{n} (\frac{{(y_{l 1} - {\hat{λ}}_{1})}^{{\hat{α}}_{11} - 1} {(y_{l 2} - y_{l 1} - {\hat{λ}}_{2})}^{{\hat{α}}_{22} - 1} \dots {(y_{l k} - y_{l (k - 1)} - {\hat{λ}}_{k})}^{{\hat{α}}_{k k} - 1} e^{- \frac{y_{l k} - \sum_{i = 1}^{k} {\hat{λ}}_{i}}{\hat{γ}}}}{{\hat{γ}}^{{\hat{α}}_{k k^{*}}} Γ ({\hat{α}}_{11}) Γ ({\hat{α}}_{22}) \dots Γ ({\hat{α}}_{k k})}),

and

L (\hat{Ω}) = \prod_{l = 1}^{n} (\frac{{(y_{l 1} - {\hat{λ}}_{1})}^{{\hat{α}}_{1} - 1} {(y_{l 2} - y_{l 1} - {\hat{λ}}_{2})}^{{\hat{α}}_{2} - 1} \dots {(y_{l k} - y_{l (k - 1)} - {\hat{λ}}_{k})}^{{\hat{α}}_{k} - 1} e^{- \frac{y_{l k} - \sum_{i = 1}^{k} {\hat{λ}}_{i}}{\hat{γ}}}}{{\hat{γ}}^{{\hat{α}}_{k^{*}}} Γ ({\hat{α}}_{1}) Γ ({\hat{α}}_{2}) \dots Γ ({\hat{α}}_{k})}),

(17)

with

{\hat{α}}_{11} = \frac{e^{{\hat{β}}_{01}} - {\hat{λ}}_{1}}{\hat{γ}}, {\hat{α}}_{22} = \frac{e^{{\hat{β}}_{02}} - e^{{\hat{β}}_{01}} - {\hat{λ}}_{2}}{\hat{γ}}, \dots, {\hat{α}}_{k k} = \frac{e^{{\hat{β}}_{0 k}} - e^{{\hat{β}}_{0 (k - 1)}} - {\hat{λ}}_{k}}{\hat{γ}},

${\hat{α}}_{k k^{*}} = {\hat{α}}_{11} + {\hat{α}}_{22} + \dots + {\hat{α}}_{k k} = \frac{e^{{\hat{β}}_{0 k}} - {\hat{λ}}_{1} - {\hat{λ}}_{2} - \dots - {\hat{λ}}_{k}}{\hat{γ}},$
$\begin{array}{l} {\hat{α}}_{1} = \frac{e^{x_{l}^{T} {\hat{β}}_{1}} - {\hat{λ}}_{1}}{\hat{γ}}, {\hat{α}}_{2} = \frac{e^{x_{l}^{T} {\hat{β}}_{2}} - e^{x_{l}^{T} {\hat{β}}_{1}} - {\hat{λ}}_{2}}{\hat{γ}}, \dots, {\hat{α}}_{k} = \frac{e^{x_{l}^{T} {\hat{β}}_{k}} - e^{x_{l}^{T} {\hat{β}}_{k - 1}} - {\hat{λ}}_{k}}{\hat{γ}}, and \\ {\hat{α}}_{k^{*}} = {\hat{α}}_{1} + {\hat{α}}_{2} + \dots + {\hat{α}}_{k} = \frac{e^{x_{l}^{T} {\hat{β}}_{k}} - {\hat{λ}}_{1} - {\hat{λ}}_{2} - \dots - {\hat{λ}}_{k}}{\hat{γ}} . \end{array}$

Based on Equation (17),

\frac{L (\hat{ω})}{L (\hat{Ω})}

is difficult to simplify. To simplify the calculation, the test statistics in Equation (16) are expressed in a form equivalent to:

{(Λ)}^{- 2} = {(\frac{L (\hat{ω})}{L (\hat{Ω})})}^{- 2} = {(\frac{L (\hat{Ω})}{L (\hat{ω})})}^{2} .

(18)

The application of natural logarithms in Equation (18) obtains the following test statistics.

G^{2} = - 2 \log Λ = - 2 \log (\frac{L (\hat{ω})}{L (\hat{Ω})}) = 2 \log (\frac{L (\hat{Ω})}{L (\hat{ω})}) = 2 \log L (\hat{Ω}) - 2 \log L (\hat{ω}),

(19)

with

\log L ({\hat{Ω}}_{M G R}) = \sum_{l = 1}^{n} \log (f (y_{l 1}, y_{l 2}, \dots, y_{l k} | {\hat{Ω}}_{M G R}))

\begin{array}{l} \log L ({\hat{Ω}}_{M G R}) = \sum_{l = 1}^{n} \frac{e^{x_{l}^{T} {\hat{β}}_{1}} - {\hat{λ}}_{1} - \hat{γ}}{\hat{γ}} \log (y_{l 1} - {\hat{λ}}_{1}) + \sum_{l = 1}^{n} \frac{e^{x_{l}^{T} {\hat{β}}_{2}} - e^{x_{l}^{T} {\hat{β}}_{1}} - {\hat{λ}}_{2} - \hat{γ}}{\hat{γ}} \log (y_{l 2} - y_{l 1} - {\hat{λ}}_{2}) + \dots + \\ \begin{array}{l} \sum_{l = 1}^{n} \frac{e^{x_{l}^{T} {\hat{β}}_{k}} - e^{x_{l}^{T} {\hat{β}}_{k - 1}} - {\hat{λ}}_{k} - \hat{γ}}{\hat{γ}} \log (y_{l k} - y_{l (k - 1)} - {\hat{λ}}_{k}) - \sum_{l = 1}^{n} \frac{y_{l k} - {\hat{λ}}_{1} - {\hat{λ}}_{2} - \dots - {\hat{λ}}_{k}}{\hat{γ}} - \\ \sum_{l = 1}^{n} \frac{e^{x_{l}^{T} {\hat{β}}_{k}} - {\hat{λ}}_{1} - {\hat{λ}}_{2} - \dots - {\hat{λ}}_{k}}{\hat{γ}} \log \hat{γ} - \sum_{l = 1}^{n} \log Γ (\frac{e^{x_{l}^{T} {\hat{β}}_{1}} - {\hat{λ}}_{1}}{\hat{γ}}) - \sum_{l = 1}^{n} \log Γ (\frac{e^{x_{l}^{T} {\hat{β}}_{2}} - e^{x_{l}^{T} {\hat{β}}_{1}} - {\hat{λ}}_{2}}{\hat{γ}}) - \dots - \end{array} \\ \sum_{l = 1}^{n} \log Γ (\frac{e^{x_{l}^{T} {\hat{β}}_{k}} - e^{x_{l}^{T} {\hat{β}}_{k - 1}} - {\hat{λ}}_{k}}{\hat{γ}}), \end{array}

\log L ({\hat{ω}}_{M G R}) = \sum_{l = 1}^{n} \log (f (y_{l 1}, y_{l 2}, \dots, y_{l k} | {\hat{ω}}_{M G R}))

\begin{matrix} \log L ({\hat{ω}}_{M G R}) = & \sum_{l = 1}^{n} \frac{e^{{\hat{β}}_{01}} - {\hat{λ}}_{1} - \hat{γ}}{\hat{γ}} \log (y_{l 1} - {\hat{λ}}_{1}) + \sum_{l = 1}^{n} \frac{e^{{\hat{β}}_{02}} - e^{{\hat{β}}_{01}} - {\hat{λ}}_{2} - \hat{γ}}{\hat{γ}} \log (y_{l 2} - y_{l 1} - {\hat{λ}}_{2}) + \dots + \\ \sum_{l = 1}^{n} \frac{e^{{\hat{β}}_{0 k}} - e^{{\hat{β}}_{0 (k - 1)}} - {\hat{λ}}_{k} - \hat{γ}}{\hat{γ}} \log (y_{l k} - y_{l (k - 1)} - {\hat{λ}}_{k}) - \sum_{l = 1}^{n} \frac{y_{l k} - {\hat{λ}}_{1} - {\hat{λ}}_{2} - \dots - {\hat{λ}}_{k}}{\hat{γ}} - \\ \frac{e^{{\hat{β}}_{0 k}} - {\hat{λ}}_{1} - {\hat{λ}}_{2} - \dots - {\hat{λ}}_{k}}{\hat{γ}} \log \hat{γ} - \log Γ (\frac{e^{{\hat{β}}_{01}} - {\hat{λ}}_{1}}{\hat{γ}}) - \log Γ (\frac{e^{{\hat{β}}_{02}} - e^{{\hat{β}}_{01}} - {\hat{λ}}_{2}}{\hat{γ}}) - \dots - \\ \log Γ (\frac{e^{{\hat{β}}_{0 k}} - e^{{\hat{β}}_{0 (k - 1)}} - {\hat{λ}}_{k}}{\hat{γ}}) . \end{matrix}

Proposition 2.

Based on Proposition 1, the distribution of test statistics

G^{2}

is Chi-square with sk degrees of freedom, which can be written as follows.

G^{2} = 2 \log L (\hat{Ω}) - 2 \log L (\hat{ω}) \overset{d}{\to} χ_{s k}^{2}, n \to \infty .

A Corollary of Proposition 2:

If

{\hat{θ}}_{Ω}

is an estimator that maximizes the likelihood and the log-likelihood functions under the population,

{\hat{θ}}_{ω}

is an estimator that maximizes the likelihood and the log-likelihood functions under the null hypothesis, based on Equation (19), so:

\begin{matrix} G^{2} & = 2 \log L ({\hat{θ}}_{Ω}) - 2 \log L ({\hat{θ}}_{ω}) \\ = 2 (\log L ({\hat{θ}}_{Ω}) - \log L (θ_{ω})) - 2 (\log L ({\hat{θ}}_{ω}) - \log L (θ_{ω})) . \end{matrix}

(20)

Log L (θ_{ω})

function can be approached by Taylor’s second-degree expansion around

{\hat{θ}}_{Ω}

as follows.

\log L (θ_{ω}) \approx \log L ({\hat{θ}}_{Ω}) + g ({\hat{θ}}_{Ω}) (θ_{ω} - {\hat{θ}}_{Ω}) - \frac{1}{2} {(θ_{ω} - {\hat{θ}}_{Ω})}^{T} [I ({\hat{θ}}_{Ω})] (θ_{ω} - {\hat{θ}}_{Ω}),

(21)

with

g ({\hat{θ}}_{Ω}) = {\frac{\partial \log L (θ_{Ω})}{\partial θ_{Ω}} |}_{θ_{Ω} = {\hat{θ}}_{Ω}} = 0

and

I ({\hat{θ}}_{Ω}) = - {\frac{\partial^{2} \log L (θ_{Ω})}{\partial θ_{Ω} \partial {(θ_{Ω})}^{T}} |}_{θ_{Ω} = {\hat{θ}}_{Ω}} .

Because

g ({\hat{θ}}_{Ω}) = 0,

then Equation (21) becomes:

\begin{array}{l} \log L (θ_{ω}) \approx \log L ({\hat{θ}}_{Ω}) - \frac{1}{2} {(θ_{ω} - {\hat{θ}}_{Ω})}^{T} [I ({\hat{θ}}_{Ω})] (θ_{ω} - {\hat{θ}}_{Ω}) \\ 2 (\log L ({\hat{θ}}_{Ω}) - \log L (θ_{ω})) \approx {({\hat{θ}}_{Ω} - θ_{ω})}^{T} [I ({\hat{θ}}_{Ω})] ({\hat{θ}}_{Ω} - θ_{ω}) . \end{array}

(22)

Log L (θ_{ω})

function can be approached by Taylor’s second-degree expansion around

{\hat{θ}}_{ω}

as follows.

\log L (θ_{ω}) \approx \log L ({\hat{θ}}_{ω}) + g ({\hat{θ}}_{Ω}) (θ_{ω} - {\hat{θ}}_{ω}) - \frac{1}{2} {(θ_{ω} - {\hat{θ}}_{ω})}^{T} [I ({\hat{θ}}_{Ω})] (θ_{ω} - {\hat{θ}}_{ω}) .

(23)

Because

g ({\hat{θ}}_{Ω}) = 0,

then Equation (23) becomes:

\begin{array}{l} \log L (θ_{ω}) \approx \log L ({\hat{θ}}_{ω}) - \frac{1}{2} {(θ_{ω} - {\hat{θ}}_{ω})}^{T} [I ({\hat{θ}}_{Ω})] (θ_{ω} - {\hat{θ}}_{ω}) \\ 2 (\log L ({\hat{θ}}_{ω}) - \log L (θ_{ω})) \approx {({\hat{θ}}_{ω} - θ_{ω})}^{T} [I ({\hat{θ}}_{Ω})] ({\hat{θ}}_{ω} - θ_{ω}) . \end{array}

(24)

Based on Equations (22) and (24), the test statistics on Equation (20) can be stated as follows.

G^{2} = 2 (\log L ({\hat{θ}}_{Ω}) - \log L (θ_{ω})) - 2 (\log L ({\hat{θ}}_{ω}) - \log L (θ_{ω}))

(25)

G^{2} \approx {({\hat{θ}}_{Ω} - θ_{ω})}^{T} [I ({\hat{θ}}_{Ω})] ({\hat{θ}}_{Ω} - θ_{ω}) - {({\hat{θ}}_{ω} - θ_{ω})}^{T} [I ({\hat{θ}}_{Ω})] ({\hat{θ}}_{ω} - θ_{ω}) .

Equation (25) can be simplified by outlining the quadratic form of

{({\hat{θ}}_{Ω} - θ_{ω})}^{T} [I ({\hat{θ}}_{Ω})] ({\hat{θ}}_{Ω} - θ_{ω})

, so we obtained:

2 (\log L ({\hat{θ}}_{Ω}) - \log L ({\hat{θ}}_{ω})) \approx {\hat{β}}^{* T} ([I_{11}] - [I_{12}] {[I_{22}]}^{- 1} [I_{21}]) {\hat{β}}^{*} \approx {\hat{β}}^{* T} {[I^{11}]}^{- 1} {\hat{β}}^{*} .

(26)

From Equation (26), this can be obtained:

{\hat{β}}^{*} \overset{d}{\to} N (0, {[I^{11}]}_{(s k \times s k)}), n \to \infty,

(27)

{[I^{11}]}^{- \frac{1}{2}} {\hat{β}}^{*} \overset{d}{\to} N (0, I_{s k}) .

(28)

Based on Equation (28), the quadratic form given by Equation (26) distributed Chi-square with sk degrees of freedom is:

\begin{matrix} 2 (\log L ({\hat{θ}}_{Ω}) - \log L ({\hat{θ}}_{ω})) \approx & {[{[I^{11}]}^{- \frac{1}{2}} {\hat{β}}^{*}]}^{T} [{[I^{11}]}^{- \frac{1}{2}} {\hat{β}}^{*}] \\ = z^{T} z \overset{d}{\to} χ_{s k}^{2}, n \to \infty, \end{matrix}

(29)

with

z = {[I^{11}]}^{- \frac{1}{2}} {\hat{β}}^{*} \overset{d}{\to} N (0, I_{s k}), n \to \infty .

sk is a vector dimension

β^{*}

or the difference between the number of parameter sets under the population with the number of parameter sets under the null hypothesis, symbolized by

n (Ω) - n (ω) .

Proposition 3.

The critical area for testing the hypothesis of the MGR model regression parameters simultaneously with regard to Equation (16) is:

\begin{matrix} α & = P (Λ < Λ_{0}) \\ = P (- 2 \log Λ > - 2 \log Λ_{0}) \\ = P (G^{2} > c_{1}), with c_{1} = - 2 \log Λ_{0} \\ = P (G^{2} > χ_{α, s k}^{2}) \\ = P (G^{2} > χ_{α, n (Ω) - n (ω)}^{2}) . \end{matrix}

(30)

Based on Proposition 2 and Proposition 3, the decision to reject the null hypothesis is made if

G^{2} > χ_{α; d f}^{2}

, with

d f = n (Ω) - n (ω),

n (Ω)

is the number of parameters under the population, and

n (ω)

is the number of parameters under the null hypothesis.

The null hypothesis for the partial test is

H_{0} : β_{q i} = 0

, whereas the alternative is

H_{1} : β_{q i} \neq 0

, with

q = 1, 2, \dots, s, i = 1, 2, \dots k

. According to Pawitan [18], the test statistic is stated in Equation (31).

Z = \frac{{\hat{β}}_{q i}}{S E ({\hat{β}}_{q i})},

(31)

with

S E ({\hat{β}}_{q i}) = \sqrt{\hat{var} ({\hat{β}}_{q i})}

. The

\hat{var} ({\hat{β}}_{q i})

is diagonal elements that correspond to the

- H^{- 1} (\hat{θ})

matrix. The null hypothesis is rejected if

| Z | > Z_{α / 2} .

3. Data and Method

The parameter estimation and hypothesis testing on MGR were done based on the following steps. The MGR model was specified based on the pdf in Equation (5) for n observations, l = 1, 2, …, n, to construct the likelihood and the log-likelihood functions. The first derivative of the log-likelihood function for each parameter was computed, then equalized to zero. If the solutions were closed-form, then the parameter estimators were obtained. Otherwise, numerical optimization was needed. As shown in the previous section, the solution for parameter optimization was not closed-form, such that the BHHH algorithm was employed in this work.

The overall test for MGR’s significance was done using the maximum likelihood ratio test (MLRT). The test statistic was formulated in Equation (19). Meanwhile, the partial test for individual parameter significance in MGR was done using the Wald test [18]. Its test statistics are provided in Equation (31). The proposed MGR model, along with its parameter estimation and hypothesis testing, was applied on real data as an application of this study.

This study used secondary data obtained from Statistics Indonesia. The data used were three response variables, i.e., the life expectancy index, education index, and expenditure index, with six predictor variables: percentage of households that have a private toilet, net enrollment rate of schooling, population density, percentage of poor people, and unemployment rate. The data were observed for 119 regencies/municipalities in Java, Indonesia, in the year 2018.

4. Application on Human Development Dimensions Data

First, testing the gamma distribution was done using the Kolmogorov–Smirnov (KS) test. The null hypothesis is the data that follows the gamma distribution against the alternative hypothesis that data does not follow the gamma distribution. The test statistic value of the KS test for each response variable is presented in Table 1. In this paper, the goodness of fit is done univariately as the test for multivariate gamma distribution is not available yet. The test for that is another extensive work that is not covered in this paper. Once each response follows gamma distribution, we assume the multiresponses data follow a multivariate gamma distribution. This assumption is the limitation of this work, such that the proposed model can be applied to real data without delay.

Each response variable has

D_{n} < D_{(0.05)}

and p-value > α. The test concludes not to reject the null hypothesis, meaning that the data of life expectancy index (Y₁), the education index (Y₂), and the expenditure index (Y₃) follow the gamma distribution. Therefore, as our research limitation, as mentioned previously, the three response variables are assumed to follow MG distribution.

To support our assumption, we calculated the correlation between the pair of the response variables to show there are dependencies among responses. The correlation coefficients for each pair are as follows: (i) Y₁ and Y₂ is 0.398 with p-value close to zero, (ii) Y₁ and Y₃ is 0.324 with p-value close to zero, (iii) Y₂ and Y₃ is 0.818 with p-value close to zero. The correlation coefficient between education index (Y₂) and expenditure index (Y₃) is stronger than the other pairs. To find out whether there is dependency among the response variables, one can use Bartlett’s test of sphericity so that the data are feasible for multivariate analysis. This test has statistic value

χ^{2} = 148.735

and p-value = 2.22 × 10⁻¹⁶. The

χ^{2} > {χ^{2}}_{3; 0, 05}

(or 7.815) and p-value < α, and alpha is 0.05. The decision is to reject the null hypothesis (Pearson correlation matrix not equal to an identity matrix), which means the correlation between the response variables is significant in the multivariate sense. Therefore, the data analysis needs to be done in a multivariate way using the MGR model.

We also tested the multicollinearity among the predictor variables. The variance inflation factor (VIF) value for each predictor variable is 1.358 (for X₁), 1.350 (X₂), 1.560 (X₃), 1.849 (X₄), and 1.211 (X₅). The VIF value for each of the predictor variables is less than ten which shows there is no multicollinearity among the predictor variables.

In Table 2, the mean values for response variables Y₁, Y₂, and Y₃ are 0.806, 0.632, and 0.735. Although Y₁, Y₂, and Y₃ have mean values that do not differ greatly, they are not necessarily of the same quality; it depends on the size of the spread of the data. One measure of data distribution that can be used is the coefficient of variation (CoV). The CoV for Y₁, Y₂, and Y₃ are 5.200, 12.210, and 9.140, respectively. The CoV for education index (Y₂) is the highest among others, which means that the variable is more heterogeneous. The CoV for predictor variables X₁, X₂, X₃, X₄, and X₅ are, respectively, 12.100, 16.750, 136.250, 43.750, and 45.530. The CoV for population density (X₃) is the highest among other predictor variables as its range is also the biggest one.

The dependency between response and predictor variables can be shown visually by the matrix plot, as exhibited in Figure 1. The correlation between X₃ and X₄ (−0.585) is stronger than the correlation between X₄ and the other predictor variables, even stronger than other pairs. The correlation between X₁ and X₅ (−0.017) is weakest compared to the correlation of other couples. There are indications that the relationship is non-linear between X₃ with the response variable and the other predictor variables. For the correlation between response and predictor variables, log(Y₁) has the strongest correlation with X₁ (0.434) compared to other predictors. The log(Y₂) and X₃ have the strongest correlation (0.705) compared with other predictors, while the correlation of log(Y₃) and X₃ is the strongest one (0.744). This value shows that log expenditure index and population density has the strongest relationship among other pairs.

To find out which predictor variables significantly predicted response variables, we employed the MGR model. Table 3 presents the ML estimates of the MGR model with a single predictor and their corresponding standard errors, z score, and p-value. Every single predictor does not affect any response variables. Only the intercepts when the MGR model employs X₃ as a single predictor are significant.

The MGR model with a single predictor (for example, the X₅) for the life expectancy index, education index, and expenditure index is obtained as follows.

\begin{array}{l} {\hat{μ}}_{l 1} = \exp (- 0.168892 - 0.006666 X_{l 5}), \\ {\hat{μ}}_{l 2} = \exp (- 0.460389 + 0.006020 X_{l 5}), \\ {\hat{μ}}_{l 3} = \exp (- 0.279763 + 0.003974 X_{l 5}) . \end{array}

As summarized in Table 3, it is shown that all predictor variables are not significant. For comparison, we also did MGR modeling with multiple predictors. Table 4 presents the ML estimates of the MGR model with multiple predictors along with their corresponding standard errors, z score, and p-value.

The estimate of the scale parameter is 0.649423, with its standard error 0.000028. The estimate of

λ_{1}

, the location parameter for Y₁, is 0.670845 (standard error 0.006884); meanwhile, the estimate for

λ_{2}

is −0.309362, with standard error 0.006507, and for

λ_{3}

is 0.000468 (standard error 0.006530). The significant parameters are the scale parameter

γ

, the location parameter for Y₁ and Y₂, respectively, and

λ_{1}

and

λ_{2}

, as their p-values are less than

α = 10 % .

The estimate of each parameter corresponding to each predictor is summarized in Table 4. Therefore, the MGR model for the life expectancy index, education index, and expenditure index is obtained as follows.

\begin{array}{l} {\hat{μ}}_{l 1} = \exp (- 0.353421 + 0.002005 X_{l 1} + 0.000653 X_{l 2} + 0.000004 X_{l 3} - 0.000469 X_{l 4} - 0.009502 X_{l 5}) \\ {\hat{μ}}_{l 2} = \exp (- 0.606408 + 0.000207 X_{l 1} + 0.004706 X_{l 2} + 0.000012 X_{l 3} - {0.011729}_{l 4} - 0.011194 X_{l 5}) \\ {\hat{μ}}_{l 3} = \exp (- 0.274026 + 0.000779 X_{l 1} + 0.000298 X_{l 2} + 0.000013 X_{l 3} - 0.006542 X_{l 4} - 0.007994 X_{l 5}) \end{array}

The Akaike information criterion (AIC) value is −63.903, and the corrected Akaike information criterion (AICc) value is −53.361. To know the average squared difference between the estimated and the actual values, one can use the mean square error (MSE). The MSEs for the life expectancy index, education index, and expenditure index are 0.001, 0.002, and 0.003, respectively. As the MSE is an unbiased estimator of variance, the MSE value is expected to be not much different from the variance of each response variable, i.e., 0.002 (expectancy index), 0.006 (education index), and 0.004 (expenditure index).

We can perform the simultaneous test for the model’s significance using Wilk’s likelihood ratio statistics derived based on the MLRT. The test statistic value is 46.682, and the value of the Chi-square table with 15 degrees of freedom and

α = 10 %

is 22.307. The test statistical value is larger than the value of the Chi-square table; therefore, the decision is to reject the null hypothesis. It means that the five predictor variables have a significant effect on the response variables simultaneously. To find out the predictor variables that partially affect the response variable, one can use test statistics in Equation (31). From Table 4, it can be seen that the significant predictor variable that influences the life expectancy index is the unemployment rate (X₅); meanwhile the education index and expenditure index are significantly affected by the percentage of poor people (X₄) and unemployment rate (X₅).

Based on the results of MGR modeling with a single predictor (Table 3) and multiple predictors (Table 4), it can be determined the differences in the coefficient signs only happen for X₅ in response to Y₂ and Y₃, as shown in Table 5. We can also find the supports of this evidence from the matrix plot in Figure 1, that individually, X₅ has a negative relationship with Y₁, while it has positive dependencies with Y₂ and Y₃. On the other side, the X₄ has a stronger negative individual relationship with all responses. Therefore, when X₄ and X₅ are used as predictors together in the MGR model, the sign of X₅ changes as there is a significant correlation (−0.391 with p-value < 0.05) between X₄ and X₅, where X₄ affects the response of Y₂ and Y₃ is stronger than X₅.

.

Recall the VIF value for X₅ is 1.211, which is small. This value means that there is a weak relationship between X₅ and (X₁, X₂, X₃, and X₄). However, there is a significant correlation between X₄ alone and X₅. In the MGR with multiple predictors, the positive sign for X₅ will not change if X₄ also has a positive sign for responses X₂ and X₃. Unfortunately, that is not the case. The correlation between X₄ and X₂ has a different sign compared with the correlation between X₅ and X₂. The sign of X₄ and X₅ in MGR with multiple predictors can change depending on its correlation with the response variable. The same explanation pertains to response X₃.

Life expectancy index (Y₁) has a negative association with the percentage of poor people (X₄), even though it is not significant for regency/municipality in Java. This finding means that an increase in life expectancy index is not affected by the percentage of poor people in Java. The education index (Y₂) and expenditure index (Y₃) have a significant negative dependency on the percentage of poor people in Java.

The predictions resulting from the MGR model are expected to be close to the actual values. The closer those two values, the narrower the spread, as displayed in Figure 2. It can be seen that fitting values for Y₂ and Y₃ are better than those of Y₁. This result is also supported by significant predictors, as reported in Table 4. The life expectancy index has one significant predictor, while the other two responses have two significant predictors that increase their coefficients of determination.

5. Conclusions

The proposed MGR model has been developed along with its parameter estimation and hypothesis testing. The solution of parameter estimation using MLE is not closed-form such that it is optimized numerically using the BHHH algorithm. The MLRT and Wald tests are employed for testing the model’s significance and the individual parameter, respectively. The proposed MGR model is applied to model the three dimensions of the human development index (HDI) with five predictor variables. The empirical results show that modeling using multiple predictors makes more sense compared to the model when it only employs a single predictor. When multiple predictors are used in the MGR model, there is a possibility that the sign of a particular parameter changes compared to when it is employed alone. This is a common problem that arises in modeling caused by collinearity among predictors. This issue can be overcome in future work.

Author Contributions

Conceptualization, A.R., P., S., and D.D.P.; methodology, P. and S.; software, A.R. and D.D.P.; validation, P. and D.D.P.; formal analysis, A.R. and D.D.P.; investigation, A.R.; data curation, A.R.; writing—original draft preparation, A.R.; writing—review and editing, D.D.P.; visualization, S.; supervision, P. and S.; project administration, P. and S.; All authors have read and agreed to the published version of the manuscript.

Funding

The first author thanks the Kemendikbud, the Republic of Indonesia, which has given the BPPDN scholarship, and Bina Nusantara University. All authors thank LPPM (Research center) of the Institut Teknologi Sepuluh Nopember that funded this study via the Postgraduate Research Scheme in 2019 with grant number: 1153/PKS/ITS/2019.

Acknowledgments

The authors thank the editor and the reviewers for their constructive and helpful comments.

Conflicts of Interest

The authors declare no conflict of interest.

Nomenclature

AIC	Akaike information criterion
AIC_c	Corrected Akaike information criterion
BHHH	Berndt–Hall–Hall–Hausman
CoV	Coefficient of variation
GLR	Generalized likelihood ratio
HDI	Human development index
KS	Kolmogorov–Smirnov
MG	Multivariate gamma
MGR	Multivariate gamma regression
ML	Maximum likelihood
MLE	Maximum likelihood estimation
MLRT	Maximum likelihood ratio test
MSE	Mean square error
Pdf	Probability density function
TGR	Trivariate gamma regression
VIF	Variance inflation factor

Appendix A

The first derivatives of the log-likelihood function for each parameter under the null hypothesis.

\frac{\partial \log L (γ, λ_{1}, λ_{2}, \dots, λ_{k}, β_{01}, β_{02}, \dots, β_{0 k})}{\partial γ} = \sum_{l = 1}^{n} (\frac{λ_{1} - e^{β_{01}}}{γ^{2}} \log (y_{l 1} - λ_{1})) +

\sum_{l = 1}^{n} (\frac{e^{β_{01}} - e^{β_{02}} + λ_{2}}{γ^{2}} \log (y_{l 2} - y_{l 1} - λ_{2})) + \dots + \sum_{l = 1}^{n} (\frac{e^{β_{0 (k - 1)}} - e^{β_{0 k}} + λ_{k}}{γ^{2}} \log (y_{l k} - y_{l (k - 1)} - λ_{k})) +

\begin{array}{l} \sum_{l = 1}^{n} \frac{y_{l k}}{γ^{2}} - \frac{n λ_{1}}{γ^{2}} - \frac{n λ_{2}}{γ^{2}} - \frac{n λ_{3}}{γ^{2}} - (\frac{n (\log γ) λ_{1}}{γ^{2}} - \frac{n λ_{1}}{γ^{2}} + \frac{n (\log γ) λ_{2}}{γ^{2}} - \frac{n λ_{2}}{γ^{2}} + \frac{n (\log γ) λ_{3}}{γ^{2}} - \frac{n λ_{3}}{γ^{2}} + \\ (- \frac{(\log γ) e^{β_{0 k}}}{γ^{2}} + \frac{e^{β_{0 k}}}{γ^{2}})) - (- \frac{1}{γ^{2}} (Ψ (\frac{e^{β_{01}} - λ_{1}}{γ})) (e^{β_{01}} - λ_{1})) - (- \frac{1}{γ^{2}} (Ψ (\frac{e^{β_{02}} - e^{β_{01}} - λ_{2}}{γ})) (e^{β_{02}} - e^{β_{01}} - λ_{2})) - \end{array}

\dots - (- \frac{1}{γ^{2}} (Ψ (\frac{e^{β_{0 k}} - e^{β_{0 (k - 1)}} - λ_{k}}{γ})) (e^{β_{0 k}} - e^{β_{0 (k - 1)}} - λ_{k})) .

\begin{array}{l} \frac{\partial \log L (γ, λ_{1}, λ_{2}, \dots, λ_{k}, β_{01}, β_{02}, \dots, β_{0 k})}{\partial λ_{1}} = \sum_{l = 1}^{n} (- \frac{\log (y_{l 1} - λ_{1})}{γ} - \frac{e^{β_{01}} - λ_{1} - γ}{γ (y_{l 1} - λ_{1})}) + \frac{n}{γ} + \frac{n (\log γ)}{γ} - \\ (- \frac{1}{γ} Ψ (\frac{e^{β_{01}} - λ_{1}}{γ})) . \end{array}

\begin{array}{l} \frac{\partial \log L (γ, λ_{1}, λ_{2}, \dots, λ_{k}, β_{01}, β_{02}, \dots, β_{0 k})}{\partial λ_{2}} = \sum_{l = 1}^{n} (- \frac{\log (y_{l 2} - y_{l 1} - λ_{2})}{γ} - \frac{e^{β_{02}} - e^{β_{01}} - λ_{2} - γ}{γ (y_{l 2} - y_{l 1} - λ_{2})}) + \frac{n}{γ} + \frac{n (\log γ)}{γ} - \\ (- \frac{1}{γ} Ψ (\frac{e^{β_{02}} - e^{β_{01}} - λ_{2}}{γ})) . \end{array}

\frac{\partial \log L (γ, λ_{1}, λ_{2}, \dots, λ_{k}, β_{01}, β_{02}, \dots, β_{0 k})}{\partial λ_{k}} = \sum_{l = 1}^{n} (- \frac{\log (y_{l k} - y_{l (k - 1)} - λ_{k})}{γ} - \frac{e^{β_{0 k}} - e^{β_{0 (k - 1)}} - λ_{k} - γ}{γ (y_{l k} - y_{l (k - 1)} - λ_{k})}) + \frac{n}{γ} +

\frac{n (\log γ)}{γ} - (- \frac{1}{γ} Ψ (\frac{e^{β_{0 k}} - e^{β_{0 (k - 1)}} - λ_{k}}{γ})) .

\frac{\partial \log L (γ, λ_{1}, λ_{2}, \dots, λ_{k}, β_{01}, β_{02}, \dots, β_{0 k})}{\partial β_{01}} = \sum_{l = 1}^{n} ((\log (y_{l 1} - λ_{1})) \frac{e^{β_{01}}}{γ}) - \sum_{l = 1}^{n} ((\log (y_{l 2} - y_{l 1} - λ_{2})) \frac{e^{β_{01}}}{γ}) -

(Ψ (\frac{e^{β_{01}} - λ_{1}}{γ})) \frac{e^{β_{01}}}{γ} - (- (Ψ (\frac{e^{β_{02}} - e^{β_{01}} - λ_{2}}{γ})) \frac{e^{β_{01}}}{γ}) .

\frac{\partial \log L (γ, λ_{1}, λ_{2}, \dots, λ_{k}, β_{01}, β_{02}, \dots, β_{0 k})}{\partial β_{02}} = \sum_{l = 1}^{n} ((\log (y_{l 2} - y_{l 1} - λ_{2})) \frac{e^{β_{02}}}{γ}) -

\sum_{l = 1}^{n} ((\log (y_{l 3} - y_{l 2} - λ_{3})) \frac{e^{β_{02}}}{γ}) - (Ψ (\frac{e^{β_{02}} - e^{β_{01}} - λ_{2}}{γ})) \frac{e^{β_{02}}}{γ} - (- (Ψ (\frac{e^{β_{03}} - e^{β_{02}} - λ_{3}}{γ})) \frac{e^{β_{02}}}{γ}) .

\frac{\partial \log L (γ, λ_{1}, λ_{2}, \dots, λ_{k}, β_{01}, β_{02}, \dots, β_{0 k})}{\partial β_{0 k}} = \sum_{l = 1}^{n} ((\log (y_{l k} - y_{l (k - 1)} - λ_{k})) \frac{e^{β_{0 k}}}{γ}) - (\log γ) \frac{e^{β_{0 k}}}{γ} -

(Ψ (\frac{e^{β_{0 k}} - e^{β_{0 (k - 1)}} - λ_{k}}{γ})) \frac{e^{β_{0 k}}}{γ} .

References

Tripathi, R.C.; Gupta, C.R.; Pair, K.P. Statistical test involving several independent gamma distribution. J. Ann. Inst. Stat. Math 1993, 773–786. [Google Scholar] [CrossRef]
Nagar, D.K.; Correa, A.R.; Gupta, A.K. Extended matrix variate gamma and beta functions. J. Multivar. Anal. 2013, 122, 53–69. [Google Scholar] [CrossRef]
Bhattacharya, B. Tests of parameters of several gamma distributions with inequality restrictions. J. Ann. Inst. Stat. Math 2002, 54, 565–576. [Google Scholar] [CrossRef]
Chen, W.W.S.; Kotz, S. The riemannian structure of the three parameter gamma distribution. J. Appl. Math. 2013, 4, 514–522. [Google Scholar] [CrossRef]
Schickedanz, P.T.; Krause, G.F.A. Test for the scale parameters of two gamma distributions using the generalized likelihood ratio. J. Appl. Meteorol. 1970, 9, 13–16. [Google Scholar] [CrossRef]
Nadarajah, S. Reliability for some bivariate gamma distributions. Math. Probl. Eng. 2005, 2, 151–163. [Google Scholar] [CrossRef]
Nadarajah, S.; Gupta, A.K. Some bivariate gamma distributions. Appl. Math. Lett. 2006, 19, 767–774. [Google Scholar] [CrossRef]
Mathai, A.M.; Moschopoulos, P.G. A Form of multivariate gamma distribution. J. Ann. Inst. Stat. Math 1992, 44, 97–106. [Google Scholar] [CrossRef]
Bates, D.M.; Watts, D.G. Nonlinear Regression Analysis and Its Applications, 2nd ed.; John Wiley & Sons, Inc.: New York, NY, USA, 1988; ISBN: 9780470316757 (online), ISBN: 9780471816430 (print). [Google Scholar] [CrossRef]
Pan, J.; Mahmoudi, M.R.; Baleanu, D.; Maleki, M. On comparing and classifying several independent linear and non-linear regression models with symmetric errors. Symmetry 2019, 11, 820. [Google Scholar] [CrossRef]
Rahayu, A.; Purhadi; Sutikno; Prastyo, D.D. Trivariate gamma regression. IOP Conf. Ser. Mater. Sci. Eng. 2019, 546, 052062. [Google Scholar] [CrossRef]
Mathai, A.M.; Moschopoulos, P.G. On a multivariate gamma. J. Multivar. Anal. 1991, 39, 135–153. [Google Scholar] [CrossRef]
Vaidyanathan, V.S.; Lakshmi, R.V. Parameter estimation in multivariate gamma distribution. Stat. Optim. Inf. Comput. 2015, 3. [Google Scholar] [CrossRef]
Balakrishnan, N.; Wang, J. Simple efficient estimation for the three-parameter gamma distribution. J. Stat. Plan. Inference 2000, 85, 115–126. [Google Scholar] [CrossRef]
Ewemoje, T.A.; Ewemooje, O.S. Best distribution and plotting positions of daily maximum flood estimation at ona river in Ogun-Oshun river Basin, Nigeria. Agric. Eng. Int. 2011, 13, 1–13, EID: 2-s2.0-84877825735. [Google Scholar]
Bono, R.; Arnau, J.; Alarcon, R.; Blanca, M.J. Bias, precision, and accuracy of skewness and kurtosis estimators for frequently used continuous distributions. Symmetry 2020, 12, 19. [Google Scholar] [CrossRef]
Usman, M.; Zubair, M.; Shiblee, M.; Rodrigues, P.; Jaffar, S. Probabilistic modeling of speech in spectral domain using maximum likelihood estimation. Symmetry 2018, 10, 750. [Google Scholar] [CrossRef]
Pawitan, Y. All Likelihood: Statistical Modelling and Inference Using Likelihood, 1st ed.; Clarendon Press: Oxford, UK, 2001; pp. 41–42. ISBN 9780199671229. [Google Scholar]

Figure 1. The matrix plot of the response and predictor variables.

Figure 2. The actual values and the estimated values.

Table 1. Gamma distribution test with Kolmogorov–Smirnov (KS) for

α = 0.05

.

Table 1. Gamma distribution test with Kolmogorov–Smirnov (KS) for

α = 0.05

.

Response	$D_{n}$	$D_{(0, 05)}$	p-Value
Y₁	0.118	0.124	0.066
Y₂	0.107	0.124	0.123
Y₃	0.065	0.124	0.667

Table 2. Description of data.

Variables	Mean	SD	Coefficient of Variation	Min	Max
Life expectancy index (Y₁)	0.806	0.042	5.200	0.680	0.890
Education index (Y₂)	0.632	0.077	12.210	0.470	0.850
Expenditure index (Y₃)	0.735	0.067	9.140	0.620	0.960
Percentage of households that have a private toilet (X₁)	80.215	9.710	12.100	37.820	98.010
Net enrollment rate of schooling (X₂)	63.579	10.650	16.750	34.220	89.460
Population density (X₃)	3298	4493	136.250	278	19757
Percentage of poor people (X₄)	9.623	4.211	43.750	1.680	21.210
Unemployment rate (X₅)	5.337	2.430	45.530	1.430	12.770

Table 3. Parameter estimation of multivariate gamma regression (MGR) model with a single predictor.

Parameter	Estimate	Standard Error	z	p-Value
$(Y_{1}, Y_{2}, Y_{3}) ~ X_{1}$
$β_{01}$	−0.422284	0.646166	−0.654	0.513
$β_{11}$	0.002758	0.007970	0.346	0.729
$β_{02}$	−0.762038	1.120002	−0.680	0.496
$β_{12}$	0.004297	0.016611	0.259	0.796
$β_{03}$	−0.437765	0.760203	−0.576	0.565
$β_{13}$	0.002339	0.011250	0.208	0.835
$(Y_{1}, Y_{2}, Y_{3}) ~ X_{2}$
$β_{01}$	−0.305102	0.518863	−0.588	0.557
$β_{21}$	0.001896	0.008076	0.235	0.814
$β_{02}$	−0.881184	0.575235	−1.532	0.126
$β_{22}$	0.007363	0.008563	0.860	0.390
$β_{03}$	−0.430093	0.857503	−0.502	0.616
$β_{23}$	0.002789	0.014685	0.190	0.849
$(Y_{1}, Y_{2}, Y_{3}) ~ X_{3}$
$β_{01}$	−0.205148	0.000421	−486.716	0.000 *
$β_{31}$	0.000002	0.000054	0.029	0.977
$β_{02}$	−0.483898	0.000262	−1846.353	0.000 *
$β_{32}$	0.000020	0.000021	0.930	0.352
$β_{03}$	−0.306820	0.000209	−1465.698	0.000 *
$β_{33}$	0.000016	0.000014	1.132	0.258
$(Y_{1}, Y_{2}, Y_{3}) ~ X_{4}$
$β_{01}$	−0.166290	0.581543	−0.286	0.775
$β_{41}$	−0.002307	0.019731	−0.117	0.907
$β_{02}$	−0.238049	1.200557	−0.198	0.843
$β_{42}$	−0.019448	0.046895	−0.415	0.678
$β_{03}$	−0.136018	1.044931	−0.130	0.896
$β_{43}$	−0.012480	0.035650	−0.350	0.726
$(Y_{1}, Y_{2}, Y_{3}) ~ X_{5}$
$β_{01}$	−0.168892	0.555574	−0.304	0.761
$β_{51}$	−0.006666	0.057555	−0.116	0.908
$β_{02}$	−0.460389	0.862713	−0.534	0.594
$β_{52}$	0.006020	0.092371	0.065	0.948
$β_{03}$	−0.279763	0.831607	−0.336	0.737
$β_{53}$	0.003974	0.040046	0.099	0.921

* Significant at

α = 10 %

.

Table 4. Parameter estimation of MGR model with multiple predictors.

Parameter	Estimate	Standard Error	z	p-Value
Life expectancy index (Y₁)
$β_{01}$	−0.353421	0.000119	−2969.383	0.000 **
$β_{11}$	0.002005	0.002987	0.671	0.502
$β_{21}$	0.000653	0.003494	0.187	0.852
$β_{31}$	0.000004	0.000029	0.124	0.902
$β_{41}$	−0.000469	0.002831	−0.166	0.868
$β_{51}$	−0.009502	0.005252	−1.809	0.070 **
Education index (Y₂)
$β_{02}$	−0.606408	0.000120	−5069.156	0.000 **
$β_{12}$	0.000207	0.003800	0.055	0.956
$β_{22}$	0.004706	0.004952	0.950	0.342
$β_{32}$	0.000012	0.000018	0.677	0.498
$β_{42}$	−0.011729	0.004125	−2.843	0.004 **
$β_{52}$	−0.011194	0.006636	−1.687	0.092 **
Expenditure index (Y₃)
$β_{03}$	−0.274026	0.000101	−2723.533	0.000 **
$β_{13}$	0.000779	0.002337	0.333	0.739
$β_{23}$	0.000298	0.002621	0.114	0.910
$β_{33}$	0.000013	0.000014	0.943	0.346
$β_{43}$	−0.006542	0.003125	−2.093	0.036 **
$β_{53}$	−0.007994	0.003900	−2.050	0.040 **

** Significant at

α = 10 %

.

Table 5. The difference in coefficient signs and significance of the parameters in the MGR model.

Response Variables	Predictor Variables	MGR Modeling
		Multiple Predictors	Single Predictor
		Multiple Predictors	X₁	X₂	X₃	X₄	X₅
Y₁	X₁	+	+
	X₂	+		+
	X₃	+			+
	X₄	−				−
	X₅	− ***					−

Y₂	X₁	+	+
	X₂	+		+
	X₃	+			+
	X₄	− ***				−
	X₅	− ***					+

Y₃	X₁	+	+
	X₂	+		+
	X₃	+			+
	X₄	− ***				−
	X₅	− ***					+

*** Significant at

α = 10 %

.

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Rahayu, A.; Purhadi; Sutikno; Prastyo, D.D. Multivariate Gamma Regression: Parameter Estimation, Hypothesis Testing, and Its Application. Symmetry 2020, 12, 813. https://doi.org/10.3390/sym12050813

AMA Style

Rahayu A, Purhadi, Sutikno, Prastyo DD. Multivariate Gamma Regression: Parameter Estimation, Hypothesis Testing, and Its Application. Symmetry. 2020; 12(5):813. https://doi.org/10.3390/sym12050813

Chicago/Turabian Style

Rahayu, Anita, Purhadi, Sutikno, and Dedy Dwi Prastyo. 2020. "Multivariate Gamma Regression: Parameter Estimation, Hypothesis Testing, and Its Application" Symmetry 12, no. 5: 813. https://doi.org/10.3390/sym12050813

APA Style

Rahayu, A., Purhadi, Sutikno, & Prastyo, D. D. (2020). Multivariate Gamma Regression: Parameter Estimation, Hypothesis Testing, and Its Application. Symmetry, 12(5), 813. https://doi.org/10.3390/sym12050813

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Multivariate Gamma Regression: Parameter Estimation, Hypothesis Testing, and Its Application

Abstract

1. Introduction

2. Multivariate Gamma Regression Model

3. Data and Method

4. Application on Human Development Dimensions Data

5. Conclusions

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

Nomenclature

Appendix A

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI