Next Article in Journal
Relations between Generalized Bi-Periodic Fibonacci and Lucas Sequences
Next Article in Special Issue
A New Extended Two-Parameter Distribution: Properties, Estimation Methods, and Applications in Medicine and Geology
Previous Article in Journal
Non-Parametric Analysis of Efficiency: An Application to the Pharmaceutical Industry
Previous Article in Special Issue
On Designing Non-Parametric EWMA Sign Chart under Ranked Set Sampling Scheme with Application to Industrial Process
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Parameter Estimation and Hypothesis Testing of Geographically Weighted Multivariate Generalized Poisson Regression

1
Department of Statistics, Faculty of Science and Data Analytics, Institut Teknologi Sepuluh Nopember, Surabaya 60111, Indonesia
2
Department of Statistics, Politeknik Statistika STIS, Jakarta 13330, Indonesia
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Mathematics 2020, 8(9), 1523; https://doi.org/10.3390/math8091523
Submission received: 6 August 2020 / Revised: 28 August 2020 / Accepted: 3 September 2020 / Published: 7 September 2020
(This article belongs to the Special Issue Probability, Statistics and Their Applications)

Abstract

:
We introduce a new multivariate regression model based on the generalized Poisson distribution, which we called geographically-weighted multivariate generalized Poisson regression (GWMGPR) model, and we present a maximum likelihood step-by-step procedure to obtain parameters for it. We use the maximum likelihood ratio test to examine the significance of the regression parameters and to define their critical region.
MSC:
62H10; 62H11; 62H12; 62H15; 62E20

1. Introduction

Poisson distribution is one of the most widely utilized univariate and multivariate regression models for count data analysis. The univariate regression model is comprised of one response variable, while the multivariate regression model is made up of two or more response variables that are correlated. There are several multivariate count data models based on Poisson distribution, namely multivariate Poisson regression model, multivariate Poisson-gamma mixture model, multivariate Poisson log-normal model, and multivariate generalized Poisson regression model [1,2].
Poisson distribution is characterized by its restrictive assumption on equidispersion, a condition where the sample variance is equal to the mean. However, when the sample mean is less (greater) than the variance, it is called overdispersion (underdispersion). The multivariate Poisson regression model assumes equidispersion and allows for nonnegative correlations, while the multivariate Poisson-gamma mixture model handles overdispersion and nonnegative correlations. Furthermore, the multivariate Poisson-log-normal model allows overdispersion and a more flexible correlation structure. The multivariate generalized Poisson regression (MGPR) model is the most flexible amongst all because it allows any type of dispersion and correlation.
The MGPR model developed by Famoye [2] is a regression model based on multivariate generalized Poisson distribution, which was obtained by the same procedure in constructing bivariate generalized Poisson distribution, which is a product of generalized Poisson marginal with a multiplicative factor [3,4]. To date, bivariate and multivariate generalized Poisson regression models have been applied to real data with units of analysis such as elderly, women, patients, soccer teams, or households [2,5,6]. These models are global regression models that assume the relationships between variables are spatially constant.
When the units of analysis used are locations, spatial effects need to be taken into account in modeling the relationships between the explanatory variables and the response variables. Two possible issues arise when the sample data contain spatial elements, namely spatial dependency and spatial heterogeneity [7]. Out of these two, spatial dependency or spatial autocorrelation is best known and most acknowledged. Spatial dependency occurs when the intensity of an event at one location influences the intensity at surrounding locations. Meanwhile, spatial heterogeneity describes the distribution of an event and the distribution of a relationship across a landscape. This means that what matters in a socio-economic relationship may not be homogenously distributed across space, such as infant mortality rates or unemployment rates at different locations. Both of these issues tend to arise when data are collected by location units such as provinces, districts/cities, sub-districts, villages, etc.
One statistical technique commonly used to address spatial effects is geographically weighted regression (GWR) model [8]. The GWR model allows the relationships between response variables and explanatory variables vary over space so the regression coefficients systematically vary across space. With this method, separate regression models are estimated for each location unit. All observations, i.e., location units, are included in each regression but with different weights. The farther a spatial unit from the location for which the regression is fitted, the smaller its data are weighted. This is in line with Tobler’s First Law, which stated that everything is related to everything else, but near things are more related than distant things [9].
The GWR models for univariate response applied in previous studies were based on normal distribution [10,11,12,13]. Another study used Poisson distribution to develop a geographically weighted Poisson regression model and semi-parametric geographically weighted Poisson regression [14]. Meanwhile, Rodrigues et al. [15] and Chen et al. [16] used the GWR model based on logistic distribution and negative binomial distribution, respectively. The development of the GWR model for multivariate responses based on several different distributions has been carried out, such as based on normal distribution [17], Poisson distribution [18], Weibull distribution [19], Student-t distribution [20], negative binomial distribution [21], and logistic distribution [22].
Furthermore, two GWR models for count data which use multivariate responses have been developed, based on the Poisson distribution and negative binomial distribution. The first model is called geographically weighted multivariate Poisson regression (GWMPR) model [18], while the second model is called geographically weighted negative binomial regression (GWMNBR) model [21]. The GWMPR model assumes equidispersion, whereas the GWMNBR model only handles cases of overdispersion. In this paper, we propose an alternative multivariate regression model that can handle any type of dispersion by taking into account spatial effects in measuring the relationships between the response variables and the explanatory variables. The proposed model is called a geographically weighted multivariate generalized Poisson regression (GWMGPR) model.
The following discussion in this paper is divided into several main topics. In Section 2, we define the GWMGPR model followed by discussions on the parameter estimation using the maximum likelihood estimation (MLE) method in Section 3. We present simultaneous hypothesis testing in Section 4 to test the significance of all parameters together. In this section, we define the distribution of the test statistic and the critical region for the corresponding hypothesis by applying the maximum likelihood ratio test. We give conclusions in Section 5.

2. GWMGPR Model

The GWMGPR model is an extension of existing model of MGPR developed by Famoye [2] as follows:
P y 1 , , y g = l = 1 g μ l 1 + φ l μ l y l 1 + φ l y l y l 1 y l ! exp μ l 1 + φ l μ l 1 + φ l y l × 1 + l < m g γ l m e y l z l e y m z m ,
where y 1 , , y g = 0 , 1 , 2 , with z l = exp μ l r l 1 1 + φ l μ l ; l = 1 , 2 , , g , and ln r l φ l μ l r l 1 1 + φ l μ l + 1 = 0 , l = 1 , 2 , , g , where y l is the response variable l = 1 , 2 , , g , φ l is the dispersion parameter of y l , and γ l m shows the correlation between y l and y m l , m = 1 , 2 , , g ; l < m . When φ l = 0 , the MGPR model is reduced to multivariate Poisson regression model. Meanwhile, if φ l < 0 or φ l > 0 , then the MGPR model experiences underdispersion or overdispersion, respectively. The variable y l and y m are negatively or positively correlated when γ l m < 0 or γ l m > 0 , respectively.
We use the usual log-linear relationship between the marginal Y i l and the covariates x i as follows:
μ i l = q i l exp x i T β l ,
where q i l is an exposure measure for observation i of the response variable l, x i is a vector of the explanatory variables, and β l is a parameter vector of the MGPR model for the response variable l,
x i = 1 x i 1 x i 2 x i p p + 1 × 1 T , i = 1 , 2 , , n , β l = β l 0 β l 1 β l 2 β l p p + 1 × 1 T , l = 1 , 2 , , g .
Equation (2) produces one regression model with the same regression coefficients for all n observations (locations), which we called the global model. Meanwhile, the local regression model produces n regression models; therefore, each location has its own regression coefficients. This is the underlying idea of the GWMGPR model, which is to build a model that takes into account spatial variations in relationships that allows the regression coefficients to vary across geographical locations. In this case, the location-specific regression coefficients are functions of longitude and latitude coordinates denoted by u i = u 1 i , u 2 i , a vector of two-dimensional coordinates of location i. This means that the response variables Y 1 i , Y 2 i , , Y i l are predicted by explanatory variables x i , each of which has a regression coefficient depending on the location where the data were observed.
Let n random samples x i , y i , i = 1 , 2 , , n from the random variables y i M G P μ u i , φ u i , γ u i be given, where u i is a two-dimensional coordinate vector of location i; then, the GWMGPR model based on MGPR in (2) is written as
μ i l u i = q i l exp x i T β l u i ,
where q i l is the exposure for location i of the response variable l with
x i = 1 x i 1 x i 2 x i p p + 1 × 1 T , i = 1 , 2 , , n ,
β l u i = β l 0 u i β l 1 u i β l 2 u i β l p u i p + 1 × 1 T , l = 1 , 2 , , g ; i = 1 , 2 , , n .

3. Parameter Estimation

As stated earlier, we use the MLE method to obtain the estimators of the GWMGPR model. The MLE method is widely used to find estimators from observed data. This method requires that the estimators, say θ ^ , maximizes the log-likelihood function by solving the gradient function to zero. For a more detailed discussion about MLE, including its principles and properties, see Pawitan [23].
Supposing n random samples x i , y i , i = 1 , 2 , , n are taken independently from random variables y i M G P μ u i , φ u i , γ u i , then the likelihood function of location i that corresponds to (1) is
L Ω = L μ l u i , φ l u i , γ l m u i , i = 1 , 2 , , n ; l , m = 1 , 2 , g ; l < m , = i = 1 n l = 1 g μ i l u i 1 + φ l u i μ i l u i y i l 1 + φ l u i y i l y i l 1 y i l ! exp μ i l u i 1 + φ l u i μ i l u i 1 + φ l u i y i l × 1 + l < m g γ l m u i e y i l z i l e y i m z i m ,
where
z i l = exp μ i l u i r l 1 1 + φ l u i μ i l u i ,
ln r l φ l u i μ i l u i r l 1 1 + φ l u i μ i l u i + 1 = 0 ,
and μ i l u i is stated in (3). Hence, the parameter vector of location i of the corresponding GWMGPR model is written as
θ u i = β 1 T u i β g T u i φ 1 u i φ g u i γ 12 u i γ g 1 g u i T .
Let Q 1 be the log-likelihood function of (4) such that
Q 1 = ln L μ l u i , φ l u i , γ l m u i , i = 1 , 2 , , n ; l , m = 1 , 2 , g ; l < m , = i = 1 n l = 1 g y i l ln μ i l u i 1 + φ l u i μ i l u i + y i l 1 ln 1 + φ l u i y i l ln y i l ! μ i l u i 1 + φ l u i y i l 1 + φ l u i μ i l u i + ln 1 + Q 2 ,
where
Q 2 = l < m g γ l m u i e y i l z i l e y i m z i m ,
and z i l is defined in (5).
To estimate the parameters for location i, we can approximate Equation (2) by Equation (3). The regression model is built on using a subset or the entire data points by giving a weight based on the distance of the data points to location i. We give more weight to data points that are closer to location i than other data points that are farther away. Thus, the weights vary according to the proximity of the data points to location i. This means that we need to compute a weighting matrix for each location where the weights represent the proximity of each data point to location i with closer points having more weight in the estimation of the parameters for location i.
The above explanation means that the local parameters of θ ^ are estimated by a weighting function, w i j . The weights w i j , i , j = 1 , 2 , , n at each location u i are defined by a function of distance d i j between the center of location i and other locations. Several weighting functions (kernels) are presented in Fotheringham et al. [8]. The kernels generally define weights such that locations close to the regression point will make more contributions in estimating the parameters than locations farther away. The most common kernels are the Gaussian function and bi-square function accompany with fixed or adaptive bandwidth.
Fixed Gaussian weighting function is defined as
w i j = exp 1 2 d i j b 2 ,
and fixed bi-square weighting function is
w i j = 1 d i j b 2 2 ; if d i j < b 0 ; otherwise ,
where d i j is Euclidean distance between location i and location j
d i j = u 1 i u 1 j 2 + u 2 i u 2 j 2 ,
and b is the bandwidth that controls the degree of distance-decay. The bandwidth is a measure of the distance-decay in the weighting function and indicates the extent to which the local model should be smoothed [8]. When b is large (small), it gives relative higher (lower) weight for more remote locations. This means that a fixed weighting functions may lead to a problem of undersmoothing in areas with only small observations and oversmoothing in areas with a high density of observations.
An alternative to the fixed weighting functions is adaptive kernels or spatially varying kernels, which give smaller bandwidth in areas with more data points and greater bandwidth in areas with less data points. Two types of adaptive kernel are adaptive Gaussian and adaptive bi-square. Adaptive Gaussian weighting function is defined as
w i j = exp 1 2 d i j b i 2 ,
and adaptive bi-square weighting function is stated as
w i j = 1 d i j b i 2 2 ; if d i j b i 0 ; otherwise ,
where b i is the bandwidth of location i.
Therefore, the implementation of GWMGPR requires a choice of specific function: a bisquare or a Gaussian function and the choice of bandwidth. The results of GWMGPR are more sensitive to the bandwidth of the particular weighting function chosen rather than to the selection of the weighting function. This means that we need to determine optimum bandwidth so there are sufficient data points within the kernel to get a reliable estimation in terms of bias and standard error but not too large of bandwidth so that there are too many data points such that the heterogeneity is washed away or not too high of bandwidth that the kernel only includes one data point for each location, then the system breaks down. The greater the bandwidth, the lower standard error of the coefficient regression but the higher bias, and vice versa.
There are several techniques used to obtain optimal bandwidth in terms of optimum trade-off between bias and standard error, such as the generalized cross-validation (GCV), which is used to determine the best combination of the bias and standard error. The optimum bandwidth is indicated by the minimum value of GCV, which is defined as follows [8]:
G C V = n i = 1 n y i y ^ i b T y i y ^ i b n ν 1 2 ,
where y ^ i b is the prediction of y i for bandwidth b, and v 1 is the number of effective parameters in the GWMGPR model.
Once the weights w i j are calculated, the parameter estimation is determined. To obtain the estimator of location i, we need information of location j by applying a weighting function w i j to the log-likelihood function. When the optimal weighting function is obtained, the log-likelihood function of location i in (8) is written as a weighted log-likelihood function as follows:
Q 1 = ln L μ l u i , φ l u i , γ l m u i w i j , l , m = 1 , 2 , g ; l < m , Q 1 = j = 1 n l = 1 g y j l ln μ j l u i 1 + φ l u i μ j l u i + y j l 1 ln 1 + φ l u i y j l ln y j l ! μ j l u i 1 + φ l u i y j l 1 + φ l u i μ j l u i + ln 1 + Q 2 w i j ,
where
Q 2 = l < m g γ l m u i e y j l z j l e y j m z j m ,
z j l = exp μ j l u i r l 1 1 + φ l u i μ j l u i ,
ln r l φ l u i μ j l u i r l 1 1 + φ l u i μ j l u i + 1 = 0 ,
and
μ j l u i = q j l exp x j T β l u i .
The MLE method is applied to the parameter estimation; therefore, the weighted log-likelihood function in (10) is maximized over the vector parameter θ u i in (7). The first partial derivatives of Q 1 with respect to θ u i are shown as follows:
Q 1 β l T u i = j = 1 n y j l μ j l u i x j 1 + φ l u i μ j l u i 2 + Q 2 / β l T u i 1 + Q 2 w i j , Q 1 φ l u i = j = 1 n μ j l u i y j l 1 + φ l u i μ j l u i + y j l 1 y j l 1 + φ l u i y j l u i
μ j l u i y j l μ j l u i 1 + φ l u i μ j l 2 + Q 2 / φ l u i 1 + Q 2 w i j ,
Q 1 γ l m u i = j = 1 n 1 1 + Q 2 Q 2 γ l m u i w i j
with Q 2 is stated in (11), where
Q 2 β l T u i = l < m g γ l m u i e y j m z j m z j l β l T u i , Q 2 φ l u i = l < m g γ l m e y j m z j m z j l φ l u i , Q 2 γ l m u i = l < m g γ l m u i γ j l m e y j l z j l e y j m z j m = e y j l z j l e y j m z j m ,
and
z j l β l T u i = z j l x j μ j l r l 1 1 + φ l u i μ j l u i 1 + φ l u i μ j l u i φ l u i μ j l u i r l , z j l φ l u i = z j l μ j l 2 u i r l 1 2 1 + φ l u i μ j l u i 1 + φ l u i μ j l u i φ l u i μ j l u i r l ,
with z j l and r l are stated in (12) and (13), respectively.
From Equations (15) to (17), the total number of parameters of GWMGPR model, notated by a 1 , is g + 2 p + 3 g n / 2 . The first partial derivatives of Q 1 with respect to each parameter in (15) through (17) are not closed-form and the MLE does not give analytical solutions. A numerical optimization, such as the Newton–Raphson algorithm, as the simplest and the most common optimization method, is needed when there is no analytical solution available to obtain the maximum likelihood estimators. The Newton–Raphson algorithm is
θ ^ h + 1 u i = θ ^ h u i H 1 θ ^ h u i g θ ^ h u i ,
where θ ^ h u i is the vector of parameter of location i at the h t h iteration, H 1 θ ^ h u i is the inverse of Hessian matrix of location i at the h t h iteration, and g θ ^ h u i is the gradient vector of location i at the h t h iteration. For a more detailed discussion about numerical optimization methods, see Cameron and Trivedi [24].

4. Simultaneous Hypothesis Testing

Simultaneous hypothesis testing aims to test more than one parameter, for example to test all the parameters β l k u i , l = 1 , 2 , g ; i = 1 , 2 , , n together. This is used to determine the significance of the regression parameters in the model jointly. The hypothesis for simultaneous testing of parameter β l k u i is stated as follows:
H 0 : β l 1 u i = β l 2 u i = = β l p u i = 0 ; i = 1 , 2 , , n ; l , m = 1 , 2 , , g , H 1 : at least one β l k u i 0 .
If we cannot reject H 0 , then it means that none of the regression parameter values are significant. In case of rejection, it means that one or more of β l k u i ’s have a significant effect in the model.
The critical region of hypothesis testing in (18) is obtained using the likelihood ratio test technique by calculating the ratio between the maximum value of the likelihood function under H 0 , L ω , with the maximum value of the likelihood function under the population, L Ω . The maximum value of L Ω is the maximum value in (4). Meanwhile, the calculation of L ω is described below.
The first step to obtain the maximum L ω in determining the critical region for the hypothesis in (18) is to define the parameter space under H 0 , ω = β ω l 0 u i , φ ω l u i , γ ω l m u i , l , m = 1 , 2 , , g ; l < m . Let θ ω 2 u i be the vector of parameters under H 0
θ ω 2 u i = β ω 10 u i β ω g 0 u i φ ω 1 u i φ ω g u i γ ω 12 u i γ ω g 1 g u i T ,
and the GWMGPR model of location i under H 0 be
μ ω i l u i = q i l exp β ω l 0 u i .
The likelihood function in space ω is
L ω = i = 1 n P y i β ω l 0 u i , φ ω l u i , γ ω l m u i , l , m = 1 , 2 , , g ; l < m .
Let Q 1 ω = ln L ω , then the log-likelihood function under H 0 is written as
Q 1 ω = i = 1 n P y i β ω l 0 u i , φ ω l u i , γ ω l m u i , l , m = 1 , 2 , , g ; l < m , = i = 1 n l = 1 g y i l ln μ ω i l u i 1 + φ ω l u i μ ω i l u i + y i l 1 ln 1 + φ ω l u i y i l ln y i l ! μ ω i l u i 1 + φ ω l u i y i l 1 + φ ω l u i μ ω i l u i + ln 1 + Q 2 ω ,
where
Q 2 ω = l < m g γ ω l m u i e y i l z ω i l e y i m z ω i m , z ω i l = exp μ ω i l u i r ω l 1 1 + φ ω l u i μ ω i l u i ,
and
ln r ω l φ ω l u i μ ω i l u i r ω l 1 1 + φ ω l u i μ ω i l u i + 1 = 0 .
By carrying out the same procedure as described previously, we need the information of location j to obtain the estimator of location i. The contribution of each location depends on the weighting function w i j , such that the weighted log-likelihood function corresponding to (21) is written as
Q 1 ω = w i j ln L ω Q 1 ω = j = 1 n l = 1 g y j l ln μ ω j l u i 1 + φ ω l u i μ ω j l u i + y j l 1 ln 1 + φ ω l u i y j l ln y j l ! μ ω j l u i 1 + φ ω l u i y j l 1 + φ ω l u i μ ω j l u i + ln 1 + Q 2 ω w i j ,
where
Q 2 ω = l < m g γ ω l m u i e y j l z ω j l e y j m z ω j m , z ω j l = exp μ ω j l u i r l 1 1 + φ ω l u i μ ω j l u i , ln r l φ ω l u i μ ω j l u i r l 1 1 + φ ω l u i μ ω j l u i + 1 = 0 ,
and μ ω j l u i is defined as
μ ω j l u i = q j l exp β ω l 0 u i .
The first partial derivatives of Q 1 ω with respect to each parameter in the parameter vector θ ω 2 u i are
Q 1 ω β ω l 0 u i = j = 1 n y j l μ ω j l u i 1 + φ ω l u i μ ω j l u i 2 + Q 2 ω / β ω l 0 u i 1 + Q 2 ω 2 w i j , Q 1 ω φ ω l u i = j = 1 n μ ω j l u i y j l 1 + φ ω l u i μ ω j l u i + y j l 1 y j l 1 + φ ω l u i y j l
μ ω j l u i y i l μ ω j l u i 1 + φ ω l u i μ ω j l u i 2 + Q 2 ω / φ ω l u i 1 + Q 2 ω w i j ,
Q 1 ω γ ω l m u i = j = 1 n 1 1 + Q 2 ω Q 2 ω γ ω l m u i w i j ,
where Q 2 ω is defined in (23) with
Q 2 ω β ω l 0 u i = l < m g γ ω l m u i e y j m z ω j m z ω j l β ω l 0 u i , Q 2 ω φ ω l u i = l < m g γ ω l m u i e y j m z ω j m z ω j l φ ω l u i , Q 2 ω γ ω l m u i = l < m g γ ω l m γ ω l m u i e y j l z ω j l e y j m z ω j m = e y j l z ω j l e y i m z ω j m ,
and
z ω j l β ω l 0 u i = z ω j l μ ω j l u i 1 + φ ω l u i μ ω j l u i × r ω l 1 1 + φ ω l u i μ ω j l u i φ ω l u i μ ω j l u i r ω l , z ω j l φ ω l u i = z ω j l μ ω j l 2 u i 1 + φ ω l u i μ ω j l u i × r ω l 1 2 1 + φ ω l u i μ ω j l u i φ ω l u i μ ω j l u i r ω l .
Equations (24) through (26) are not closed-form, so a numerical iteration is needed to solve the equations. The total number of parameters in the parameter space ω , notated by a 2 , is g + 3 g n / 2 .
Using a significance level α 0 , 1 , the critical region of the likelihood ratio test for the GWMGPR model with regard to the hypothesis in (18) is
Λ = max L ω max L Ω < k α ,
where k α is a constant that depends on α with 0 k α 1 , L Ω and L ω are stated in (4) and (20), respectively. Equation (27) can be written as
G 2 = ln Λ 2 = 2 ln Λ = 2 ln L Ω ^ ln L ω ^ .
The distribution of the test statistic G 2 in (28) is written in the following Theorem 1.
Theorem 1.
Supposing the maximum likelihood estimator under H 0 and the population regarding to hypothesis testing stated in (18) are
θ ^ ω 2 g + 3 g n / 2 × 1 = β ^ ω 10 u i β ^ ω g 0 u i φ ^ ω 1 u i φ ^ ω g u i γ ^ ω 12 u i γ ^ ω g 1 g u i T ,
θ ^ g + 2 p + 3 g n / 2 × 1 = β ^ 1 T u i β ^ g T u i φ ^ 1 u i φ ^ g u i γ ^ 12 u i γ ^ g 1 g u i T ,
respectively, then the test statistic G 2 in (28) follows a chi-square distribution with g p n degrees of freedom.
Proof of Theorem 1.
Suppose the maximum likelihood estimator under the population related to the hypothesis stated in (18) is partitioned as follows:
θ ^ g + 2 p + 3 g n / 2 × 1 = θ ^ 1 θ ^ 2 T ,
where
θ ^ 1 = β ^ l 1 u i β ^ l 2 u i β ^ l p u i T , l = 1 , 2 , , g ,
and
θ ^ 2 = β ^ 10 u i β ^ g 0 u i φ ^ 1 u i φ ^ g u i γ ^ 12 u i γ ^ g 1 g u i T .
The maximum likelihood estimators and the known parameters in H 0 are partitioned as
θ ^ ω = θ ω 1 θ ^ ω 2 T ,
where
θ ω 1 g p n × 1 = 0 0 0 T ,
and
θ ^ ω 2 g + 3 g n / 2 × 1 = β ^ ω 10 u i β ^ ω g 0 u i φ ^ ω 1 u i φ ^ ω g u i γ ^ ω 12 u i γ ^ ω g 1 g u i T ,
with the true parameter is partitioned as
θ ω = θ ω 1 θ 2 T .
Therefore, the hypothesis in (18) can be written as
H 0 : θ 1 = θ ω 1 H 1 : θ 1 θ ω 1 .
The test statistic G 2 stated in (28) can be described as follows:
G 2 = 2 ln L θ ^ ln L θ ^ ω = 2 ln L θ ^ ln L θ ω 2 ln L θ ^ ω ln L θ ω .
Let θ ω = ln L θ ω , using Taylor’s second order expansion around θ ^ , we get
θ ω θ ^ + g θ ^ θ ω θ ^ 1 / 2 θ ω θ ^ T I θ ^ θ ω θ ^ ,
where
g θ ^ = θ θ θ = θ ^ ,
and
I θ ^ = 2 θ θ θ T θ = θ ^ .
Since g θ ^ = 0 , then
2 θ ^ θ ω θ ^ θ ω T I θ ^ θ ^ θ ω .
Meanwhile, when the function θ ω is represented via second order approximation around θ ^ ω , the following is obtained:
θ ω θ ^ ω + g θ ^ θ ω θ ^ ω 1 / 2 θ ω θ ^ ω I θ ^ θ ω θ ^ ω ,
or it can be written as
2 θ ^ ω θ ω θ ^ ω θ ω T I θ ^ θ ^ ω θ ω .
Following (30) and (31), the test statistic in (29) can be written as
G 2 θ ^ θ ω T I θ ^ θ ^ θ ω θ ^ ω θ ω T I θ ^ θ ^ ω θ ω .
Let the partition of Fisher information matrix I θ ^ and its inverse I 1 θ ^ be
I θ ^ = I 11 g p n × g p n I 12 g p n × g + 3 g n / 2 I 21 g + 3 g n / 2 × g p n I 22 g + 3 g n / 2 × g + 3 g n / 2 ,
and
I 1 θ ^ = I g p n × g p n 11 I g p n × g + 3 g n / 2 12 I g + 3 g n / 2 × g p n 21 I g + 3 g n / 2 × g + 3 g n / 2 22 .
Based on the nature of the conditional distribution, given as θ 1 = θ ω 1 , θ ^ 1 and θ ^ 2 , we get
θ ^ ω 2 = θ ^ 2 I 21 I 11 1 θ ^ 1 θ ω 1 .
A simpler form of (33) using a simple manipulation of the partitioned matrix of the Fisher information is
θ ^ ω 2 = θ ^ 2 + I 22 1 I 21 θ ^ 1 θ ω 1 .
Since θ ^ ω θ ω = 0 , θ ^ ω 2 θ 2 and θ ω 2 θ 2 = θ ^ 2 θ 2 + I 22 1 I 21 θ ^ 1 θ ω 1 , then
θ ^ ω θ ω T I θ ^ θ ^ ω θ ω = θ ^ ω 2 θ 2 T I 22 θ ^ ω 2 θ 2 = θ ^ 1 θ ω 1 θ ^ 2 θ 2 T I 12 I 22 1 I 21 I 12 I 21 I 22 θ ^ 1 θ ω 1 θ ^ 2 θ 2 .
The statistic G 2 in (32) can then be written as
G 2 θ ^ θ ω 2 T I θ ^ θ ^ θ ω 2 θ ^ ω 2 θ ω 2 T I θ ^ θ ^ ω 2 θ ω 2 = θ ^ 1 θ ω 1 T I 11 I 12 I 22 1 I 21 θ ^ 1 θ ω 1 = θ ^ 1 θ ω 1 T I 11 1 θ ^ 1 θ ω 1 .
With regard to the normality for MLE under the regularity conditions (see Cameron and Trivedi [25]), the distribution of the partitioned matrix is
θ ^ 1 θ 1 θ ^ 2 θ 2 a N 0 , I 1 θ I 11 I 12 I 21 I 22 ,
such that
θ ^ 1 θ 1 a N 0 , I 11 , I 11 1 / 2 θ ^ 1 θ 1 a N 0 , I g p n , G 2 = θ ^ 1 θ ω 1 T I 11 1 θ ^ 1 θ ω 1 a χ g p n 2 .
Regarding Equation (27), the critical region of the test statistic G 2 is
α = P Λ < k α ; ω = P 2 ln Λ > 2 ln k α ; ω = P G 2 > c ; ω , where c = 2 ln k α = P G 2 > χ α , g p n 2 .
Therefore, H 0 in (18) is rejected when G 2 > χ α , g p n 2 .

5. Conclusions

We developed a local regression model, called a geographically weighted multivariate generalized Poisson regression (GWMGPR) model. We have shown the step-by-step procedure to obtain the estimation of the parameters and the test statistic for hypothesis testing of the GWMGPR model. It is easy to see that the same procedure of developing the GWMGPR model presented in this article can be applied in developing other GWR models based on different multivariate distributions.
The distribution of the test statistic developed in this paper is based on an asymptotic approach. Therefore, further study is needed for development of exact distributions of the test statistics. Despite the advantages of presenting regression coefficients for each location, GWMGPR is a complex model that needs a challenging coding program. Furthermore, simulation and application of the GWMGPR model using real data to validate the theoretical results in this paper that are being prepared for our next manuscript.

Author Contributions

Conceptualization, S.M.B., P.P., and S.S.; methodology, S.M.B., P.P., S.S. and S.P.R.; writing—original draft preparation, S.M.B.; writing—review and editing, S.M.B. and P.P. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
GWMGPRGeographically weighted multivariate generalized Poisson regression
GWMNBRGeographically weighted multivariate negative binomial regression
GWMPRGeographically weighted multivariate Poisson regression
GWRGeographically weighted regression
MGPRMultivariate generalized Poisson regression
MLEMaximum likelihood estimation

References

  1. Winkelmann, R. Econometric Analysis of Count Data, 5th ed.; Springer: Berlin, Germany, 2008. [Google Scholar]
  2. Famoye, F. A Multivariate Generalized Poisson Regression Model. Commun. Stat. Theory Methods 2015, 44, 497–511. [Google Scholar] [CrossRef]
  3. Famoye, F.; Okafor, R.; Adamu, M.O. A multivariate Generalized Poisson Distribution. J. Stat. Theory Appl. 2011, 10, 519–532. [Google Scholar]
  4. Famoye, F. A New Bivariate Generalized Poisson Distribution. Stat. Neerl. 2010, 64, 112–124. [Google Scholar] [CrossRef]
  5. Famoye, F. Comparisons of Some Bivariate Regression Models. J. Stat. Comput. Simul. 2012, 82, 937–949. [Google Scholar] [CrossRef]
  6. Zamani, H.; Faroughi, P.; Ismail, N. Bivariate Generalized Poisson Regression Model: Applications on Health Care Data. Empir. Econ. 2016, 51, 1607–1621. [Google Scholar] [CrossRef]
  7. Anselin, L. Spatial Econometrics: Methods and Models; Springer: Dordrecht, The Netherlands, 1988. [Google Scholar]
  8. Fotheringham, A.S.; Brundson, C.; Charlton, M. Geographically Weighted Regression: The Analysis of Spatially Varying Relationship; John Wiley & Son Ltd.: Chichester, UK, 2002. [Google Scholar]
  9. Tobler, W.R. Cellular Geography. In Theory and Decision Library (An International Series in the Philosophy and Methodology of the Social and Behavioral Sciences); Gale, S., Olsson, G., Eds.; Springer: Dordrecht, The Netherlands, 1979; pp. 379–386. [Google Scholar]
  10. Bascuñán, M.M.; Quezada, C.R. Geographically Weighted Regression for Modelling the Accessibility to the Public Hospital Network in Concepción Metropolitan Area, Chile. Geospat. Health 2016, 11, 263–273. [Google Scholar]
  11. Fotheringham, A.S.; Kelly, M.H.; Charlton, M. The Demographic Impacts of the Irish Famine: Towards a Greater Geographical Understanding. Trans. Inst. Br. Geogr. 2012, 38, 221–237. [Google Scholar] [CrossRef]
  12. Lin, C.H.; Wen, T.H. Using Geographically Weighted Regression (GWR) to Explore Spatial Varying Relationships of Immature Mosquitoes and Human Densities with the Incidence of Dengue. Int. J. Environ. Res. Public Health 2011, 8, 2798–2815. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  13. Zhao, F.; Park, N. Using Geographically Weighted Regression Models to Estimate Annual Average Daily Traffic. Transp. Res. Rec. J. Transp. Res. Board 2004, 1879, 99–107. [Google Scholar] [CrossRef]
  14. Nakaya, T.; Fotheringham, A.S.; Brunsdon, C.; Charlton, M. Geographhically Weighted Poisson Regression for Disease Association Mapping. Stat. Med. 2005, 24, 2695–2717. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  15. Rodrigues, M.; de la Riva, J.; Fotheringham, S. Modeling the Spatial Variation of the Explanatory Factors of Human-caused Wildfires in Spain Using Geographically Weighted Logistic Regression. Appl. Geogr. 2014, 48, 52–63. [Google Scholar] [CrossRef]
  16. Chen, J.; Liu, L.; Xiao, L.; Xu, C.; Long, D. Integrative Analysis of Spatial Heterogeneity and Overdispersion of Crime with a Geographically Weighted Negative Binomial Model. Int. J. Geo-Inf. 2020, 9, 60. [Google Scholar] [CrossRef] [Green Version]
  17. Harini, S.; Purhadi; Mashuri, M.; Sunaryo, S. Statistical Test for Multivariate Geographically Weighted Regression Model Using the Method of Maximum Likelihood Ratio Test. Int. J. Appl. Math. Stat. 2012, 29, 110–115. [Google Scholar]
  18. Purhadi, B.W.O.; Purnami, S.W. Parameter Estimation of Geographically Weigthed Multivariate Poisson Regression. Appl. Math. Sci. 2015, 9, 4081–4093. [Google Scholar]
  19. Suyitno; Purhadi; Sutikno; Irhamah. Parameter Estimation of Geographically Weighted Trivariate Weibull Regression Model. Appl. Math. Sci. 2016, 10, 861–878. [Google Scholar]
  20. Sugiarti, H.; Purhadi; Sutikno; Purnami, S.W. Parameter Estimation of Geographically Weighted Multivariate t Regression Model. J. Theor. Appl. Inf. Technol. 2016, 92, 45–51. [Google Scholar]
  21. Dewi, Y.S.; Purhadi; Sutikno; Purnami, S.W. Comparison of Nelder Mead and BFGS Algorithms on Geographically Weighted Multivariate Negative Binomial. Int. J. Adv. Sci. Eng. Inf. Technol. 2019, 9, 979–987. [Google Scholar] [CrossRef] [Green Version]
  22. Fathurahman, M.; Purhadi; Sutikno; Ratnasari, V. Geographically Weighted Multivariate Logistic Regression Model and Its Application. Abstr. Appl. Anal. 2020, 2020, 8353481. [Google Scholar] [CrossRef]
  23. Pawitan, Y. In All Likelihood; Clarendon Press: Oxford, UK, 2001. [Google Scholar]
  24. Cameron, A.C.; Trivedi, P.K. Microeconometrics Methods and Applications; Cambridge University Press: New York, NY, USA, 2005. [Google Scholar]
  25. Cameron, A.C.; Trivedi, P.K. Regression Analysis of Count Data, 2nd ed.; Cambridge University Press: New York, NY, USA, 2013. [Google Scholar]

Share and Cite

MDPI and ACS Style

Berliana, S.M.; Purhadi; Sutikno; Rahayu, S.P. Parameter Estimation and Hypothesis Testing of Geographically Weighted Multivariate Generalized Poisson Regression. Mathematics 2020, 8, 1523. https://doi.org/10.3390/math8091523

AMA Style

Berliana SM, Purhadi, Sutikno, Rahayu SP. Parameter Estimation and Hypothesis Testing of Geographically Weighted Multivariate Generalized Poisson Regression. Mathematics. 2020; 8(9):1523. https://doi.org/10.3390/math8091523

Chicago/Turabian Style

Berliana, Sarni Maniar, Purhadi, Sutikno, and Santi Puteri Rahayu. 2020. "Parameter Estimation and Hypothesis Testing of Geographically Weighted Multivariate Generalized Poisson Regression" Mathematics 8, no. 9: 1523. https://doi.org/10.3390/math8091523

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop