Next Article in Journal
New Multi-Parametrized Estimates Having pth-Order Differentiability in Fractional Calculus for Predominating -Convex Functions in Hilbert Space
Previous Article in Journal
Well Posedness and Finite Element Approximability of Three-Dimensional Time-Harmonic Electromagnetic Problems Involving Rotating Axisymmetric Objects
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Binomial Regression Models with a Flexible Generalized Logit Link Function

by
Rindang Bangun Prasetyo
1,2,
Heri Kuswanto
1,*,
Nur Iriawan
1 and
Brodjol Sutijo Suprih Ulama
3
1
Department of Statistics, Faculty of Science and Data Analytics, Institut Teknologi Sepuluh Nopember, Jl. Arif Rahman Hakim Surabaya 60111, Indonesia
2
BPS-Statistics Indonesia, Jl. Dr. Sutomo 6-8, Jakarta 10710, Indonesia
3
Department of Business Statistics, Faculty of Vocational, Institut Teknologi Sepuluh Nopember, Jl. Arif Rahman Hakim Surabaya 60111, Indonesia
*
Author to whom correspondence should be addressed.
Symmetry 2020, 12(2), 221; https://doi.org/10.3390/sym12020221
Submission received: 21 December 2019 / Revised: 19 January 2020 / Accepted: 27 January 2020 / Published: 2 February 2020

Abstract

:
In binomial regression, a link function is used to join the linear predictor variables and the expectation of the response variable. This paper proposes a flexible link function from a new class of generalized logistic distribution, namely a flexible generalized logit (glogit) link. This approach considers both symmetric and asymmetric models, including the cases of lighter and heavier tails, as compared to standard logistic. The glogit is created from the inverse cumulative distribution function of the exponentiated-exponential logistic (EEL) distribution. Using a Bayesian framework, we conduct a simulation study to investigate the model performance compared to the most commonly used link functions, e.g., logit, probit, and complementary log–log. Furthermore, we compared the proposed model with several other asymmetric models using two previously published datasets. The results show that the proposed model outperforms the existing ones and provides flexibility fitting the experimental dataset. Another attractive aspect of the model are analytically tractable and can be easily implemented under a Bayesian approach.

1. Introduction

Binomial regression is a general term of a regression model in which the dependent variable has binomial distribution. The binomial distribution is formed from a number of successes in some repeated trials of a binomial experiment, where each trial has a probability of success. Binomial regression belongs to the generalized linear models (GLM) class, which uses a link function to connect linear predictor variables to the expectation of response variables. In binomial regression, predictor variables relate the probability of success from a series of independent Bernoulli trials. The link function for the binomial regression is formed from inverse cumulative distribution functions (c.d.f.) of a random distribution, for instance, the logit link function that corresponds to the inverse c.d.f. of the standard logistic distribution, the probit link function uses the inverse c.d.f. of standard normal distribution, and the complementary log–log (cloglog) link function is formed from the inverse c.d.f. of the Gumbel distribution. For more information about GLM and binomial regression, see McCullagh and Nelder [1] or Agresti [2].
The logit, probit, and cloglog links are the three commonly used link functions in a binomial regression. The first link is preferable due to its easy interpretation of the regression coefficients and the odds ratio. The logit model is a linear model for the canonical or natural parameter of the underlying exponential family that was obtained, and it has a closed form [3]. However, the logit model cannot always guarantee a good fit for all binomial regressions, especially for asymmetric models [4]. Czado and Santner [5], Tiku and Vaughan [6], Dobson and Barnett [7] indicated that misspecification of the link function in binomial regression could produce significant bias and increase mean squared error of parameter estimates and the predicted probabilities.
Both logit and probit links are symmetric links. Symmetric links assume that the probability of a given binomial response approaches 0 at the same rate as it approaches 1. This implies that the probability curve has a symmetric form of about 0.5. It is supported by the symmetrical probability density function (p.d.f.), which corresponds to the inverse c.d.f. of the link function. This assumption may not be reasonable in various cases. Many studies found that these symmetric link functions have some limitations. Nagler [8] indicated how sensitive the inferences are if a symmetric link was incorrectly used in the direction of an asymmetric model. Chen et al. [9] concluded that when the probability of a given binomial response approaches 0 at a different rate than it approaches 1 (depending on the covariate) or vice versa, the symmetric link function is inappropriate.
Collet [10] showed that an asymmetric link function could be more fitting than symmetric for binomial regression in some specific situations. One of the asymmetric links that is frequently used in binomial regression is the cloglog link function. However, the cloglog link has a fixed left or negative skewness, which means the probability of a given binomial response rises slowly from 0 but then tapers off more quickly as it approaches 1 and not the other way around [11]. As a result, the cloglog may produce an inaccurate inference for a symmetric model and a positive skewness model, and hence, it is not a flexible link function for binomial regression, although it is an asymmetric link. Binomial regression will be better modeled with flexible link functions that allow for symmetric and asymmetric (both positive and negative skewness) models and that allows the model to determine the required skewness level.
There have been many types of studies proposed that include skewness into the link function. Stukel [12] presented a two-parameter class of generalized logistic models that can approximate many link functions, including probit, logit, and complementary log–log. However, in a Bayesian analysis, it may not be straightforward to be implemented, because of a problem of multiple covariates and noninformative improper priors. Chen et al. [9] proposed skew-probit links using a latent variable approach of Albert and Chib [13], which can lead to proper posterior distributions using standard improper priors. Kim et al. [14] introduced a new link function based on the class of generalized skewed t-link models with the constraint on the shape parameter. Bazan et al. [15] presented the skewed probit link function and some variants with different parameterizations. Naranjo et al. [16] presented asymmetric link function based on the asymmetric exponential power (AEP) distribution. More recently, Caron et al. [17] proposed a skewed Weibull link function for categorical response data arising from the binomial model as well as the multinomial model and showed that logit, probit, and complementary log–log can be obtained as limiting cases.
In this paper, we propose a flexible generalized logit (glogit) link function for the binomial regression model. The link function is created from inverse cumulative distribution function (c.d.f.) of a new class of generalized logistic based on the exponentiated-exponential logistic (EEL) distribution that was introduced by Ghosh and Alzaatreh [18]. The EEL distribution has similar characteristics to Type IV generalized logistic distribution (GLD). The distributions consider both symmetric and asymmetric models, including the case of lighter and heavier tails, simultaneously as compared to standard logistic. The logistic distribution, Type I GLD, and Type II GLD can be obtained as special cases of both distributions. However, the EEL distribution differs from the Type IV GLD in the c.d.f. which is algebraically more tractable, so that it is more easily to be implemented. Some of the researchers use the previous GLD as the link function in binary regression, e.g., Tiku and Vaughan [6] that use Type I GLD and Oral [19] extended the work of Tiku and Vaughan [6] with a stochastic covariate, Valle et al. [20] use the Type III GLD and show that the model has ability to capture heavy and light tails, and Prentice [21] applies Type IV GLD model for Dose-Response Curves.
A Bayesian estimation technique using Markov chain Monte Carlo (MCMC) is provided for implementations of the proposed model. We present a simple syntax for the computation of the model using the JAGS (Just Another Gibbs Sampler) program shown as an attractive aspect. A simulation study to investigate the model performance of the proposed model in comparison with the most commonly used link functions, i.e., logit, probit, and complementary log–log is conducted. We also compare the proposed model with several other asymmetric models using two previously published datasets, i.e., the beetle mortality dataset [22] that was analyzed by Naranjo et al. [16] and potency of three different poisons dataset [23] analyzed by Caron et al. [17].
The rest of the paper is organized as follows: Section 2 describes the binomial regression model and the role of the link function in the model. In Section 3, we introduce the flexible generalized logit (glogit) link based on the c.d.f. corresponding to the exponentiated-exponential logistic (EEL) distribution, and we show the graphs of the distribution in various schemes. Section 4 demonstrates the flexibility of the glogit model compared with logit, probit, and complementary log–log by simulated datasets. Section 5 contains the application and comparison of the glogit model using the beetle mortality and potency of three different poisons datasets. The conclusion and discussion are given in Section 6.

2. Binomial Regression Model

The general notation of the binomial regression model is given as follows. Suppose y = ( y 1 , y 2 , , y n ) is an n × 1 vector of independent binary random variables that took value 1 for success and 0 for failure. Let X denote n × ( k + 1 ) design matrix with rows x i , where x i = ( 1 , x i 1 , x i 2 , , x i k ) is a ( k + 1 ) × 1 vector of the predictor variable, where value 1 corresponds to an intercept. Furthermore, we consider β = ( β 0 , β 1 , , β k ) as a ( k + 1 ) × 1 corresponding the intercept and regression coefficient.
In binomial regression models, the probability p i = P ( y i = 1 ) is predicted by applying link functions to a linear combination of predictor variables that leads to the following model:
F 1 ( p i ) = x i β ,
where F 1 ( · ) is called as a link function. Link function F 1 can be formed from inverse c.d.f. of a random distribution F . When F is a c.d.f. of a symmetric distribution, the resulting link is also symmetric. The most common symmetric links include logit and probit, whereas an asymmetric link is obtained when F is an asymmetric c.d.f., for example, the cloglog link.
The likelihood function ( β | y , X ) is derived from the likelihood of binomial distribution as follows:
( β | y , X ) = i = 1 n F ( x i β ) y i ( 1 F ( x i β ) ) 1 y i ,
thus, the log-likelihood function ( β | y , X ) is given by:
( β | y , X ) = i = 1 n y i log ( F ( x i β ) ) + i = 1 n ( 1 y i ) log ( 1 F ( x i β ) ) .
The β parameters in the model (3) can be estimated by using a maximum likelihood method or using the Bayesian method. The numerical method, such as the Newton–Raphson algorithm or the Nelder–Mead algorithm, can be applied for estimation with maximum likelihood estimation (MLE). While in the Bayesian framework, inferences about model parameters are typically summarized by random draws from a posterior density, which is proportional to the likelihood function for the data multiplied by a prior for the parameters.

3. Flexible Generalized Logit Link

The flexible generalized logit link function for the binomial regression model proposed in this paper is developed from inverse c.d.f. of a new class of generalized logistic. We start this section with a brief review of the new class of generalized logistic based on the exponentiated-exponential logistic (EEL) distribution given by Ghosh and Alzaatreh [18]. The EEL distribution is obtained using the technique of mixing two distributions of exponentiated-exponential and logistic distribution. For a more detailed discussion about this method, see Alzaatreh et al. [24] as well as Gupta and Kundu [25].

3.1. Exponentiated-Exponential Logistic Distribution

A random variable X is an EEL distribution, with parameter θ = ( μ , s , α , λ ) , where μ and s > 0 are the locations and the scale parameter of logistic distribution, α > 0 and λ > 0 are the shapes and the scale parameter of exponentiated-exponential distribution. The probability density function (p.d.f.) of X is given by:
f g l ( x ) = α λ e ( x μ ) / s s ( 1 + e ( x μ ) / s ) λ + 1 ( 1 ( 1 + e ( x μ ) / s ) λ ) α 1 .
We denote hereafter as X ~ EEL ( μ , s , α , λ ) . If α = λ = 1 , the p.d.f. of X in (4) is reduced to the p.d.f. of the logistic distribution. In the special case of λ = 1 , the p.d.f. in (4) is the Type I GLD. Moreover, when α = 1 , the p.d.f. reduces to the Type II GLD. The illustrations of some typical standard EEL p.d.f.s for different values of α and λ in μ = 0 and s = 1 are provided in Figure 1. Figure 1 shows that the density function of the EEL distribution can take different shapes. It has negative skewness for α < λ , positive skewness for α > λ , and symmetric for α = λ . The greater skewness can be achieved when the difference between α and λ is higher. Furthermore, the standard EEL p.d.f. has lighter tails compared to standard logistic p.d.f. when α > 1 and λ > 1 , otherwise if α < 1 and λ < 1 then heavier tails are achieved.
Based on the p.d.f. in (4), the c.d.f. for the EEL distribution is given by:
F g l ( x ) = ( 1 ( 1 + e ( x μ ) / s ) λ ) α .
Suppose F g l 1 ( p ) denotes the quantile function for the EEL distribution, where 0 < p < 1 . Then, the quantile function, which is an inverse of the EEL c.d.f. can be written as
F g l 1 ( p ) = μ + s   log ( 1 ( 1 p 1 / α ) 1 / λ ) .
The result is obtained by using F g l ( F g l 1 ( p ) ) = p in (4) and then solving it for F g l 1 ( p ) .

3.2. Binomial Regression Model with Glogit Link Function

A binomial regression model, applying the inverse of the EEL c.d.f. as a link function, namely glogit, is presented in this section. The glogit link function is formed from the quantile function of the standard EEL distribution at μ = 0 and s = 1 . The glogit link function F g l 1 can be defined as:
F g l 1 ( p )   =   glogit ( p ) = log ( 1 ( 1 p 1 / α ) 1 / λ ) .
When α = λ = 1 , the glogit link function in (7) will be reduced to the logit link function. Figure 2 depicts different EEL c.d.f. curves for the different values of α and λ .
The glogit class for the binomial regression model is obtained by considering F g l 1 in the specification of the binomial regression model given in (1), which is expressed as:
glogit ( p i ) = log ( 1 ( 1 p i 1 / α ) 1 / λ ) = x i β   ,         i = 1 , 2 , , n .
We can rewrite the formula in (8) as:
p i = F g t ( x i β )   ,         i = 1 , 2 , , n . = ( 1 ( 1 + e ( x i β ) ) λ ) α ,
where F g t ( · ) is the c.d.f. of the EEL distribution given in (5) in case μ = 0 and s = 1 . The likelihood function for the glogit model can be formed by substituting (9) in (2) as:
( β , α , λ | y , X ) = i = 1 n F g t ( x i β ) y i ( 1 F g t ( x i β ) ) 1 y i = i = 1 n ( ( 1 ( 1 + e ( x i β ) ) λ ) α ) y i ( 1 ( 1 ( 1 + e ( x i β ) ) λ ) α ) 1 y i .
Thus, the log-likelihood function ( β , α , λ | y , X ) is given as:
( β , α , λ | y , X ) = i = 1 n y i α log ( 1 ( 1 + e ( x i β ) ) λ ) + i = 1 n ( 1 y i ) log ( 1 ( 1 ( 1 + e ( x i β ) ) λ ) α ) ,
where β , α , and λ in the model (11) are the parameters to be estimated by using a maximum likelihood estimation (MLE) method, or a Bayesian method as previously mentioned in Section 2.
In this paper, we conduct a Bayesian method for the simulation and applications. The Bayesian analysis requires the joint posterior density π ( β , α , λ | y , X ) that is defined by:
π ( β , α , λ | y , X ) ( β , α , λ | y , X ) π ( β ) π ( α ) π ( λ ) ,
where π ( β ) is the join prior of regression coefficients, π ( α ) is prior for the α parameter, and π ( λ ) is prior for the λ parameter. For the priors of the intercept and regression coefficients, we can adopt the typical proper priors for β , a normal prior, and thus β j ~ N ( μ j * , τ j * ) , where j = 0 , 1 , , k , μ j * = 0 , and τ j * denotes the precision ( 1 / σ j 2 * ) with a small value representing the common situation that is a lack of information available for the priors. Alternatively, informative priors are obtained when μ j * and τ j * are elicited based on historical information, expert opinion or validation from datasets. For α and λ , we can use a proper lognormal prior with the mean 0 and variance 1 ( α ~ LN ( 0 , 1 ) ;   λ ~ LN ( 0 , 1 ) ) , for the reasons that their value is positive and around to 1 (get near to the logit model).
With the model setup of prior distributions as shown above, the joint posterior distribution of the model is given by:
π ( β , α , λ | y , X ) i = 1 n ( ( 1 ( 1 + e ( x i β ) ) λ ) α ) y i ( 1 ( 1 ( 1 + e ( x i β ) ) λ ) α ) 1 y i × j = 0 k ( τ ( β ) j 2 π e τ ( β ) j 2 ( β j μ ( β ) j ) 2 ) × τ ( α ) α 2 π e τ ( α ) 2 ( ln ( α ) μ ( α ) ) 2 × τ ( λ ) λ 2 π e τ ( λ ) 2 ( ln ( λ ) μ ( λ ) ) 2
Using the proportionality follows from Bayes’ theorem, the joint posterior distribution (13) can be rewritten as:
π ( β , α , λ | y , X ) i = 1 n ( ( 1 ( 1 + e ( x i β ) ) λ ) α ) y i ( 1 ( 1 ( 1 + e ( x i β ) ) λ ) α ) 1 y i × j = 1 k ( 1 α λ e 1 2 ( τ ( β ) j ( β j μ ( β ) j ) 2 + τ ( α ) ( ln ( α ) μ ( α ) ) 2 + τ ( λ ) ( ln ( λ ) μ ( λ ) ) 2 ) )
The analysis of Bayesian models is using MCMC, which sampling from the posterior is done using the Gibbs sampling algorithm. The full conditional distributions for all parameters must be derived to implement the algorithm for the joint posterior distribution (14). The full conditional distributions for β are easy to obtain:
π ( β j | y , X , α , λ , β ( j ) ) i = 1 n ( ( 1 ( 1 + e ( x i β ) ) λ ) α ) y i ( 1 ( 1 ( 1 + e ( x i β ) ) λ ) α ) 1 y i × e 1 2 ( τ ( β ) j ( β j μ ( β ) j ) 2 ) ,
where β ( j ) = ( β 0 , β 1 , , β j 1 , β j + 1 , , β k ) and the subscript ( j ) denotes the j-th element has been removed. The full conditional distributions for α and λ are given by:
π ( α | y , X , β , λ ) i = 1 n ( ( 1 ( 1 + e ( x i β ) ) λ ) α ) y i ( 1 ( 1 ( 1 + e ( x i β ) ) λ ) α ) 1 y i × 1 α e 1 2 α ( τ ( α ) ( ln ( α ) μ ( α ) ) 2 )
π ( λ | y , X , β , α ) i = 1 n ( ( 1 ( 1 + e ( x i β ) ) λ ) α ) y i ( 1 ( 1 ( 1 + e ( x i β ) ) λ ) α ) 1 y i × 1 λ e 1 2 λ ( τ ( λ ) ( ln ( λ ) μ ( λ ) ) 2 )
Each draw can be done using the adaptive rejection metropolis algorithm implemented in JAGS. JAGS is a program for Bayesian graphical modeling, which has compatibility with classic BUGS (Bayesian inference using Gibbs sampling), developed by Martyn Plummer [26]. We can express the models for k = 2 in JAGS using the following simple syntax:
model {
   for (i in 1:n) {
   y[i]~dbin(p[i], 1)
   omega[i] < -beta0 + beta1*x1[i] + beta2*x2[i]
   p[i] < -pow((1-pow((1+exp(omega[i])),-lambda)),alpha)
       }
   #Prior
   beta0~dnorm(0, 0.001)
   beta1~dnorm(0, 0.001)
   beta2~dnorm(0, 0.001)
   alpha~dlnorm(0, 1)
   lambda~dlnorm(0, 1)
   }
We implemented the models using the function autojags in the jagsUI package developed by Kellner [27] in the R software. The autojags applies the convergence diagnostic based on Gelman and Rubin [28] and Brooks and Gelman [29] called Gelman–Rubin diagnostic tests. To compute the Gelman–Rubin statistic, we must run two chains in JAGS using two different sets of initial values. If the MCMC chain has reached convergence, the Gelman–Rubin statistics, namely Rhat, are lower than 1.05, and it concludes that burn-in is adequate, whereas the value in excess of 1.05 indicates a lack of convergence. Another convergence diagnostic method of Raftery and Lewis [30] was considered in autojags. The method estimates and applies the number of iterations needed for a given level of precision in posterior samples when quantiles are the posterior summaries of interest.

4. Simulation

A simulation study was carried out to evaluate the performance of the glogit link function for the binomial regression models compared to logit, probit, and cloglog. For this purpose, several datasets were generated by the following steps:
  • Step 1: Specify a binomial regression model with one predictor variable and set a vector of the regression coefficients β = ( β 0 , β 1 ) = ( 0.1 ,   1 ) ;
  • Step 2: Generate x i 1 ~ Normal ( 0 , 1 ) , then we create x i = ( 1 , x i 1 ) ;
  • Step 3: Compute p i from a given link, so that p i = F ( x i β );
  • Step 4: Generate n i ~ Poisson ( δ ) with δ = 100, and Y i ~ Binomial ( n i , p i ) , which is the binomial distribution with p i as the probability of success and n i is the sample size for the repeated Bernoulli trials.
We consider seven simulations to accommodate into seven scenarios for symmetric and asymmetric models and lighter and heavier tails (compared to the logit model). The scenarios were considered by applying the different inverse of link functions F ( · ) . The scenarios are as follows:
  • Scenario 1: F 1 ( p i ) = glogit ( p i | α = 1 ,   λ = 1 ) , same with F 1 ( p i ) = logit ( p i ) ;
  • Scenario 2: F 1 ( p i ) = glogit ( p i | α = 4 ,   λ = 4 ) , represent the lighter tails compared to the logit;
  • Scenario 3: F 1 ( p i ) = glogit ( p i | α = 0.5 ,   λ = 0.5 ) , represent the heavier tails compared to the logit;
  • Scenario 4: F 1 ( p i ) = glogit ( p i | α = 3 ,   λ = 0.3 ) , represent the positive skewness;
  • Scenario 5: F 1 ( p i ) = glogit ( p i | α = 0.3 ,   λ = 3 ) , represent the negative skewness;
  • Scenario 6: F 1 ( p i ) = cloglog ( p i ) ;
  • Scenario 7: F 1 ( p i ) = probit ( p i ) .
The comparison of graph visualizations for scenarios 1 to 5 represented specifically for the glogit can be seen in Figure 1 and Figure 2. For each scenario, we generated 100 datasets, and for each dataset, we generated m = 100 binomial response variables, thus, y i ~ Binomial ( n i , p i ) for i = 1 , , 100 . Furthermore, we fit all of the models for each simulated dataset, respectively, to find the best performance in terms of model comparisons. We conducted a Bayesian analysis on the above seven simulated datasets and set with the uninformative prior for each model given in Section 3.2.
To compare the performance of the models under different link functions, we use two model comparisons. The first is the deviance information criterion (DIC) proposed by Spiegelhalter et al. [31], which represents a goodness-of-fit criterion. It is an estimate of expected predictive error. Therefore, the lower value of DIC is better. DIC is given by DIC = D ( η ) ¯ + p D ^ , where D ( η ) = 2 log L ( η ) is deviance of model, L ( η ) is the likelihood, and D ( η ) ¯ = E ( D ( η ) | data ) is the deviance posterior mean. The number of parameters is given as p , and p D ^ = D ( η ) ¯ D ( η ¯ ) is defined as the effective number of parameters, while D ( η ¯ ) is the deviance posterior means of the parameters of interest, with η ¯ = E ( η | data ) . Another criterion for the model comparison is absolute errors (AE). Absolute error is defined as the absolute value of the difference between the prediction value and the true value of a measurement and is commonly given as the maximum possible error given a model's degree of accuracy. AE is defined by A E = i = 1 n | p i p i ^ | . The smaller values of AE indicate better models.
Table 1 shows the result of the simulation for the seven scenarios. Scenarios 1 to 3 are symmetrical models in which the first one is generated from glogit ( α = 1 ; λ = 1 ) , or same with the logit. The second one represents the lighter tails compared to the logit, and the third one represents the heavier tails compared to the logit. The fourth and fifth scenarios are used to represent the asymmetric models in which the fourth one has a positive direction of skewness, while the fifth one has a negative direction. In addition, the sixth and seventh scenarios belong to the cloglog and probit link function. We evaluate each scenario by DIC and AE values to compare the models, i.e., glogit, logit, cloglog, and probit.
First, we focus on the results of the first scenario. When the datasets generated from the logit link that equaled the glogit link for α =   λ = 1 , then the logit model is the best, followed by the glogit. However, the DIC and AE values of both models are very close compared to the other. For the second to fifth scenario that used datasets generated from the glogit model for the several values of α and λ , the glogit model performs best. In the second scenario, which has lighter tails compared to the standard logistic, the second-best fit model is the probit model. For the third scenario, the logit model gives almost identical values of DIC and AE, with the result that it is the second-best fit model. In the fourth scenario, the second-best fit is the probit model, whereas, in the fifth one, the second-best fit is the cloglog model. This result is consistent with the fact that the cloglog model also has negative skewness. When the datasets are generated from cloglog and probit models (Scenario 6 and 7), the second-best fit is the glogit model after them. However, the differences in the model comparison values are quite similar. In summary, the simulation results show that the glogit model is quite flexible and performs better than the standard link function.

5. Application

This section discusses the result of applying the proposed method to the empirical cases. It aims to study the performance of the flexible generalized logit link function, compared with other recent asymmetric link functions. Two datasets are considered, i.e., beetle mortality dataset of Bliss [22] and potency of three different poison datasets of Finney [23].

5.1. Beetle Mortality Dataset

The beetle mortality dataset in Table 2 is taken from Bliss [22]. The data consist of the toxicological experiment results based on the number of flour beetles dead after several hours of carbon disulfide ( C S 2 ) exposure at various concentrations (transformed to ten base logarithm of dose). More information about the data can be found in Bliss [22].
The dataset in Table 2 was previously analyzed by Naranjo et al. [16], who proposed the AEP-based link model to analyze a binomial regression by using the inverse of the Asymmetric Exponential Power (AEP) c.d.f. as the link function. Furthermore, they compared the model with some symmetric and asymmetric link models, such as the probit and t-link (T(8)) models that are proposed by Albert and Chib [13], the skewed probit (SP) model that are proposed by Chen et al. [9], and the skewed generalized t-link (SGT) model proposed by Kim et al. [14]. The model is given by
η i = F 1 ( p i ) = β 0 + β 1 log ( dose i ) ,   i = 1 , 2 , , 8 ,
where the response variable is binary with ( y i = 1 ) representing the dead and ( y i = 0 ) representing the beetles that survived, p i is the proportion of dead beetles ( ndead i ) to the number of beetles exposed ( n i ) under the ith dose. A comparison of the five models was considered by using different link functions F 1 .
Five models were fitted to the data, and four different criteria of model comparison were used, e.g., the deviance information criterion (DIC), the Bayesian information criterion (BIC), the Akaike information criterion (AIC), and the absolute errors (AE). However, in this paper, only DIC and AE are used to compare with the flexible generalized logit (glogit) model because the DIC, BIC, and AIC give the same conclusions. The formulas of DIC and AE are given in Section 4. All of the estimation and comparison criteria values for the models, except the glogit model, are adopted from Naranjo et al. [16] that applied Bayesian analysis to estimate the parameters. For the glogit model, we also used a Bayesian framework, whose join posterior is given in (12), including the settings for the prior.
The posterior estimates (mean and standard deviation), the DIC value, and the AE value for the fitted models are reported in Table 3. The glogit model is fitted to the dataset, and the result is compared with those for the probit, T(8), SP, SGT, and AEP-based link models processed by Naranjo et al. [16]. These results presented in Table 3 show that the glogit model provides the best fit, followed by the AEP-based link model indicated by the smallest value of DIC. For criterion AE, we note that the AEP-based link model has the smallest value so that the AEP-based link model is superior, followed by the glogit model. Furthermore, the glogit and AEP-based link models have similar values of model comparison compared with the other in this dataset.

5.2. The Potency of Three Different Poisons Dataset

In the second application, we take the potency of three different poisons dataset from Finney [23] and the analysis results from Caron et al. [17]. The data consist of the test results of the different doses and the different poisons to know the potency of the poisons to kill insects. The poisons are rotenone, deguelin, and a mixture of rotenone and deguelin. The data are provided in Table 4, where n is the number of observations. More information about this data can be found in Finney [23].
Caron et al. [17] proposed a skewed Weibull (SW) link function and analyzed the dataset in Table 4 in MLE and Bayesian framework. In the Bayesian method, they compared the proposed model with another skewed link function, such as the AEP-based link (AEP) model presented by Naranjo et al. [16] and the skewed probit (SP) model proposed by Chen et al. [9]. The model that was used is:
η i = F 1 ( p i ) = β 0 + β 1 log ( dose i ) + β 2 Rotenone + β 3 Deguelin ,   i = 1 , 2 , , 8 ,
where the p i is the proportion of dead insects ( dead i ) to the number of observation ( n i ) under the ith log(dose), rotenone dummy variable, and deguelin dummy variable. The mixture variable is considered as the reference poison (rotenone = 0 and deguelin = 0). A comparison to the models was considered by applying different F 1 (link functions). They are the SP, AEP, SW, and glogit models.
In this section, we applied the DIC and the mean absolute error (MAE) for model comparison. The MAE is defined as MAE = 1 n i = 1 n | p i p i ^ | . The estimation and the model comparison values for the models, except the glogit model, were taken from Caron et al. [17]. More detail about their methods can be found in Caron et al.’s study [17]. For the glogit model, we used the model presented in (12), including the settings for the prior.
The model comparison results of DIC and MAE for the fitted models are presented in Table 5. The glogit model is fitted to the dataset, and the result is compared with SP, AEP, and SW models analyzed by Caron et al. [17]. The results reported in Table 5 show that the glogit model provides the best fit, followed by the AEP-based link model, indicated by the smallest value of DIC and MAE. Overall, we see the superiority of the glogit against other link functions for the datasets. The mean and standard deviation of posterior estimates values for all of the models are also given in Table 5.

6. Conclusions

A flexible generalized logit (glogit) link function for analyzing binomial regression is proposed in this paper. The link function is based on a new class of generalized logistic distribution, exponentiated-exponential logistic (EEL) distribution. This distribution considers both symmetric and asymmetric models to accommodate positive or negative skewness. Moreover, this allows for the case of a lighter or heavier tails behavior compared to the standard logistic.
The paper provides a simulation of binomial regression using seven scenarios to indicate the flexibility and superiority of the glogit compared with common link functions, i.e., logit, probit, and cloglog in the Bayesian framework. The simulation result show that the glogit model always gave the best fit for the datasets generated from the glogit link function and the second-best fit for the datasets generated from another correspond link function. It also shows the second-best fit model when the datasets generated from the glogit was the model that related to the shape of EEL p.d.f. curve determined from α and λ value.
In addition, applying the model for the datasets from previous studies indicates that the glogit fits the data well and outperforms the other asymmetric link functions, such as the probit and t-link (T(8)) models proposed by Albert and Chib [13], the skewed probit (SP) model proposed by Chen et al. [9], the skewed generalized t-link (SGT) model proposed by Kim et al. [14], AEP-based link (AEP) model presented by Naranjo et al. [16], and finally, a skewed Weibull (SW) model presented by Caron et al. [17]. The proposed model is evidently flexible and capable of handling the datasets. The performance of the glogit is very good, indicated by the smallest value of DIC, followed by the AEP-based link model for both of the datasets. We are convinced that the glogit model is a good option and could be used in practice due to its flexibility. Other attractive aspects of the model are that it is analytically tractable and can be easily implemented under a Bayesian approach using the simple syntax.
This study adopted the typical priors for the Bayesian analysis with non-informative priors. Priors with other distributions can also be considered. The prior selection is an important activity that was not analyzed in this paper and hence, can be considered in future research.

Author Contributions

R.B.P., H.K., N.I., and B.S.S.U. understood and designed the research; R.B.P. collected and analyzed the data, and drafted the paper. All authors critically read and revised the draft and approved the final paper. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by BPS-Statistics Indonesia and the Indonesia Endowment Fund for Education (LPDP).

Acknowledgments

The authors thank the referees for their helpful comments.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. McCullagh, P.; Nelder, J.A. Generalized Linear Models, 2nd ed.; Chapman and Hall: London, UK, 1989. [Google Scholar]
  2. Agresti, A. Categorical Data Analysis, 3rd ed.; John Wiley and Sons Inc.: New York, NY, USA, 2013. [Google Scholar]
  3. Pérez–Sánchez, J.M.; Gómez–Déniz, E.; Dávila–Cárdenes, N. A comparative study of logistic models using an asymmetric link: Modelling the away victories in football. Symmetry 2018, 10, 224. [Google Scholar] [CrossRef] [Green Version]
  4. Li, J. Choosing the Proper Link Function for Binary Data. Ph.D. Thesis, The University of Texas, Austin, TX, USA, 2014. [Google Scholar]
  5. Czado, C.; Santner, T.J. The effect of link misspecification on binary regression inference. J. Stat. Plan. Inference 1992, 33, 213–231. [Google Scholar] [CrossRef]
  6. Tiku, M.L.; Vaughan, D.C. Logistic and Nonlogistic Density Funcitons in Binary Regression with Nonstochastic Covariates. Biom. J. 1997, 39, 883–898. [Google Scholar] [CrossRef]
  7. Dobson, A.J.; Barnett, A.G. An Introduction to Generalized Linear Models, 3rd ed.; CRC Press: Boca Raton, FL, USA, 2008. [Google Scholar]
  8. Nagler, J. Scobit: An alternative estimator to logit and probit. Am. J. Political Sci. 1994, 38, 230–255. [Google Scholar] [CrossRef]
  9. Chen, M.H.; Dey, D.K.; Shao, Q.M. A new skewed link model for dichotomous quantal response data. J. Am. Stat. Assoc. 1999, 94, 1172–1186. [Google Scholar] [CrossRef]
  10. Collet, D. Modelling Binary Data, 2nd ed.; Chapman & Hall/CRC: Boca Raton, FL, USA, 2003. [Google Scholar]
  11. Jiang, X.; Dey, D.K.; Prunier, R.; Wilson, A.M.; Holsinger, K.E. A new class of flexible link functions with application to species co-occurrence in cape floristic region. Ann. Appl. Stat. 2013, 7, 2180–2204. [Google Scholar] [CrossRef] [Green Version]
  12. Stukel, T. Generalized logistic models. J. Am. Stat. Assoc. 1988, 83, 426–431. [Google Scholar] [CrossRef]
  13. Albert, J.; Chib, S. Bayesian analysis of binary and polychotomous response data. J. Am. Stat. Assoc. 1993, 88, 669–679. [Google Scholar] [CrossRef]
  14. Kim, S.; Chen, M.H.; Dey, D.K. Flexible generalized t-link models for binary response data. Biometrika 2008, 95, 93–106. [Google Scholar] [CrossRef]
  15. Bazán, J.L.; Bolfarine, H.; Branco, M.D. A framework for skew-probit links in binary regression. Commun. Stat. Theory Methods 2010, 39, 678–697. [Google Scholar] [CrossRef]
  16. Naranjo, L.; Pérez, C.J.; Martín, J. Bayesian analysis of some models that use the asymmetric exponential power distribution. Stat. Comput. 2015, 25, 497–514. [Google Scholar] [CrossRef]
  17. Caron, R.; Sinha, D.; Dey, D.K.; Polpo, A. Categorical data analysis using a skewed weibull regression model. Entropy 2018, 20, 176. [Google Scholar] [CrossRef] [Green Version]
  18. Ghosh, I.; Alzaatreh, A. A new class of generalized logistic distribution. Commun. Stat. 2018, 47, 2043–2055. [Google Scholar] [CrossRef]
  19. Oral, E. Binary Regression with Stochastic Covariates. Commun. Stat. Theory Methods 2006, 35, 1429–1447. [Google Scholar] [CrossRef]
  20. Valle, L.D.; Leisen, F.; Rossini, L.; Zhu, W. Bayesian analysis of immigration in Europe with generalized logistic regression. J. Appl. Stat. 2019, 1–15. [Google Scholar] [CrossRef]
  21. Prentice, R.L. A Generalization of the Probit and Logit Methods for Dose Response Curves. Biometrics 1976, 32, 761–768. [Google Scholar] [CrossRef]
  22. Bliss, C. The calculation of the dosage-mortality curve. Ann. Appl. Biol. 1935, 22, 134–167. [Google Scholar] [CrossRef]
  23. Finney, D.J. Probit Analysis; University Press: Cambridge, UK, 1947. [Google Scholar]
  24. Alzaatreh, A.; Lee, C.; Famoye, F. A new method for generating families of continuous distribution. Metron 2013, 71, 63–79. [Google Scholar] [CrossRef] [Green Version]
  25. Gupta, R.D.; Kundu, D. Exponentiated exponential family: An alternative to gamma and weibull distributions. Biom. J. 2001, 43, 117–130. [Google Scholar] [CrossRef]
  26. Plummer, M. JAGS: A program for analysis of Bayesian graphical models using Gibbs sampling. In Proceedings of the 3rd International Workshop on Distributed Statistical Computing (DSC 2003), Vienna, Austria, 20–22 March 2003. [Google Scholar]
  27. Kellner, K. jagsUI: A Wrapper around rjags to Streamline JAGS Analyses. Available online: https://cran.r-project.org/web/packages/jagsUI (accessed on 2 December 2019).
  28. Gelman, A.; Rubin, D.B. Inference from iterative simulation using multiple sequences. Stat. Sci. 1992, 7, 457–472. [Google Scholar] [CrossRef]
  29. Brooks, S.; Gelman, A. General methods for monitoring convergence of iterative simulations. J. Comput. Graph. Stat. 1998, 7, 434–455. [Google Scholar]
  30. Raftery, A.E.; Lewis, S.M. How many iterations in the Gibbs sampler. In Bayesian Statistics 4; Bernardo, J.M., Smith, A.F.M., Dawid, A.P., Berger, J.O., Eds.; Oxford University Press: New York, NY, USA, 1992; pp. 763–774. [Google Scholar]
  31. Spiegelhalter, D.J.; Best, N.G.; Carlin, B.P.; Van der Linde, A. Bayesian measures of model complexity and fit (with discussion). J. R. Stat. Soc. Ser. B: Stat. Methodol. 2002, 64, 583–640. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Probability density function (p.d.f.) of the exponentiated-exponential logistic (EEL) distribution when μ = 0 and s = 1 at various choices of α and λ . The black solid line shows the p.d.f. of EEL when α = 1 and λ = 1 that are equal to standard logistic distribution. The red dashed line shows the p.d.f. of EEL when α = 4 and λ = 4 that has lighter tails compared to the standard logistic. The blue dotted line shows the p.d.f. of EEL when α = 0.5 and λ = 0.5 that has heavier tails compared to the standard logistic. The yellow dotted dashed line shows the p.d.f. of EEL when α = 3 and λ = 0.3 that has positive skewness. The green long dashed line shows the p.d.f. of EEL when   α = 0.3 and λ = 3 that has negative skewness.
Figure 1. Probability density function (p.d.f.) of the exponentiated-exponential logistic (EEL) distribution when μ = 0 and s = 1 at various choices of α and λ . The black solid line shows the p.d.f. of EEL when α = 1 and λ = 1 that are equal to standard logistic distribution. The red dashed line shows the p.d.f. of EEL when α = 4 and λ = 4 that has lighter tails compared to the standard logistic. The blue dotted line shows the p.d.f. of EEL when α = 0.5 and λ = 0.5 that has heavier tails compared to the standard logistic. The yellow dotted dashed line shows the p.d.f. of EEL when α = 3 and λ = 0.3 that has positive skewness. The green long dashed line shows the p.d.f. of EEL when   α = 0.3 and λ = 3 that has negative skewness.
Symmetry 12 00221 g001
Figure 2. Cumulative distribution functions (c.d.f.) of the EEL distribution when μ = 0 and s = 1 at various choices of α and λ , which correspond to Figure 1.
Figure 2. Cumulative distribution functions (c.d.f.) of the EEL distribution when μ = 0 and s = 1 at various choices of α and λ , which correspond to Figure 1.
Symmetry 12 00221 g002
Table 1. Model comparison using average of deviance information criterion (DIC) and absolute errors (AE) corresponding to the link functions at several scenarios.
Table 1. Model comparison using average of deviance information criterion (DIC) and absolute errors (AE) corresponding to the link functions at several scenarios.
ScenariosGlogit LinkLogit LinkCloglog LinkProbit Link
DICAEDICAEDICAEDICAE
1815.21 **0.1903 **813.74 *0.1589 *1116.611.7661824.270.3413
2697.04 *0.1409 *746.970.50931903.642.5617712.61 **0.2909 **
3826.91 *0.1980 *827.74 **0.2021 **916.400.9986833.120.2904
4462.63 *0.0545 *539.940.2971557.110.3286477.06 **0.1396 **
5597.04 *0.0895 *822.000.8047599.08 **0.1038 **655.230.4173
6745.41 **0.1934 **1613.282.4169742.98 *0.1369 *1378.132.0713
7776.19 **0.2139 **825.320.60171611.832.4673771.02 *0.1249 *
Scenario 1 = glogit ( α = 1 ; λ = 1 ) ; Scenario 2 = glogit ( α = 4 ; λ = 4 ) ; Scenario 3 = glogit ( α = 0.5 ; λ = 0.5 ) ; Scenario 4 = glogit ( α = 3 ; λ = 0.3 ) ; Scenario 5 = glogit ( α = 0.3 ; λ = 3 ) ; Scenario 6 = cloglog; Scenario 7 = probit. * indicates the smallest value; ** indicates the next smallest value.
Table 2. Beetles toxicological experiment of C S 2 exposure at various concentrations.
Table 2. Beetles toxicological experiment of C S 2 exposure at various concentrations.
log(dose)nndeadp
49.15960.102
53.060130.217
56.962180.290
60.856280.500
64.863520.825
68.759530.898
72.662610.984
76.560601
Table 3. Summary of posterior estimations and model comparison for the several models fitted to the beetle mortality dataset.
Table 3. Summary of posterior estimations and model comparison for the several models fitted to the beetle mortality dataset.
ParametersModel [Posterior Mean (Standard Deviation)]
ProbitT(8)SPSGTAEPGlogit
β 0 −33.88 (1.91)−35.17 (2.03)−34.45 (2.06)−8.3 (1.73)−35.16 (0.36)−16.56 (5.08)
β 1 19.14 (1.04)19.86 (1.14)19.43 (1.14)4.69 (0.98)19.69 (0.21)0.25 (0.08)
- δ   =   0.08   ( 0.45 ) v 1   =   9.4   ( 14.8 ) θ 1   =   1.04   ( 0.15 ) α   =   0.86   ( 0.59 )
- δ   =   0.22   ( 0.1 ) θ 1   =   0.47   ( 0.07 ) λ   =   2.61   ( 2.37 )
DIC39.2641.2240.2236.7236.43 **35.09 *
AE0.350.370.350.250.15 *0.18 **
* indicates the smallest value; ** indicates the next smallest value.
Table 4. Relative potency of three different poisons (rotenone, deguelin, and a mixture of the two) to kill insects.
Table 4. Relative potency of three different poisons (rotenone, deguelin, and a mixture of the two) to kill insects.
RotenoneDeguelinMixture
log(dose)deadnlog(dose)deadnlog(dose)deadn
1.0144501.748481.44850
0.8942491.6147501.314346
0.7124461.4847491.183848
0.5816481.31344812746
0.41650118480.712246
0.7116490.4747
Table 5. Summary of posterior estimations and model comparison for the several models fitted to the potency of three different poisons dataset.
Table 5. Summary of posterior estimations and model comparison for the several models fitted to the potency of three different poisons dataset.
ParametersModel [Posterior Mean (Standard Deviation)]
SPAEPSWglogit
β 0 −2.19 (0.40)−4.98 (1.51)0.27 (0.20)−10.78 (0.81)
β 1 2.65 (0.22)5.27 (1.48)0.78 (0.23)9.08 (0.74)
β 2 0.38 (0.13)1.23 (0.46)0.12 (0.04)1.89 (0.37)
β 3 −0.50 (0.13)−1.01 (0.38)−0.15 (0.06)−1.74 (0.40)
- δ   =   0.04   ( 0.55 ) θ 1   =   0.45   ( 0.12 ) γ   =   4.03   ( 1.20 ) α   =   0.26   ( 0.03 )
- θ 1   = 0.91   ( 0.19 ) λ   =   1.17   ( 0.72 )
DIC751.92749.01 **751.38747.75 *
MAE0.06620.0564 **0.06440.0497 *
* indicates the smallest value; ** indicates the next smallest value.

Share and Cite

MDPI and ACS Style

Prasetyo, R.B.; Kuswanto, H.; Iriawan, N.; Ulama, B.S.S. Binomial Regression Models with a Flexible Generalized Logit Link Function. Symmetry 2020, 12, 221. https://doi.org/10.3390/sym12020221

AMA Style

Prasetyo RB, Kuswanto H, Iriawan N, Ulama BSS. Binomial Regression Models with a Flexible Generalized Logit Link Function. Symmetry. 2020; 12(2):221. https://doi.org/10.3390/sym12020221

Chicago/Turabian Style

Prasetyo, Rindang Bangun, Heri Kuswanto, Nur Iriawan, and Brodjol Sutijo Suprih Ulama. 2020. "Binomial Regression Models with a Flexible Generalized Logit Link Function" Symmetry 12, no. 2: 221. https://doi.org/10.3390/sym12020221

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop