Abstract
In this paper, we propose the classical and Bayesian regression models for use in conjunction with the inverted Weibull (IW) distribution; there are the inverted Weibull Regression model (IW-Reg) and inverted Weibull Bayesian regression model (IW-BReg). In the proposed models, we suggest the logarithm and identity link functions, while in the Bayesian approach, we use a gamma prior and two loss functions, namely zero-one and modified general entropy (MGE) loss functions. To deal with the outliers in the proposed models, we apply Huber and Tukey’s bisquare (biweight) functions. In addition, we use the iteratively reweighted least squares (IRLS) algorithm to estimate Bayesian regression coefficients. Further, we compare IW-Reg and IW-BReg using some performance criteria, such as Akaike’s information criterion (AIC), deviance (D), and mean squared error (MSE). Finally, we apply the some real datasets collected from Saudi Arabia with the corresponding explanatory variables to the theoretical findings. The Bayesian approach shows better performance compare to the classical approach in terms of the considered performance criteria.
1. Introduction
McCullagh and Nelder [] published a book on the generalized linear models (GLMs) that led to their widespread use and appreciation. They extended the scoring method to maximum likelihood estimation (MLE) in exponential families. Nelder and Pregibon [] described methods of jointly estimating the parameters of both the link and variance functions. The iteratively reweighted least squares (IRLS) algorithm is amenable to some statistics and measures that are common to all the GLMs. Nelder and Wedderburn [] used the Newton-Raphson process for regression coefficients estimates. They reported that the Newton-Raphson process with expected second derivatives is equivalent to the Fisher’s scoring technique. Additionally, De Jong and Heller [] reported that the Newton-Raphson iteration equation leads to a sequence that often rapidly converges. These include the D statistic, along with some specific residuals and influence measures. Yuan and Bentler [] reported that the convergence properties of the Fisher-scoring algorithm are affected by many factors. One of them is multicollinearity among the observed variables. If the sample or model implied covariance matrix is close to being singular, the Fisher-scoring algorithm may have difficulty reaching a set of converged solutions. Liao [] introduced a systematic way of interpreting commonly used probability models: logit, probit, and the other GLMs.
The inverse Weibull (IW) model that was derived by Keller and Kamath [] based on physical considerations on some mechanical components’ failures subject to degradation phenomena, assuming that the strength of a component decreases with time with a power law. Calabria and Pulcini [] proposed the IW distribution as a suitable model to describe mechanical degradation phenomena. They investigated a statistical property of the maximum likelihood estimator of the IW reliability. Jiang et al. [] derived the Weibull (W) and IW mixture models with a common shape parameter for a system’s components. They also used an example to illustrate that the proposed mixture model can be used to approximate the reliability behaviors of the consecutive-k-out-of-n systems. Mahmoud et al. [] considered the order statistics arising from IW distribution. They also derived an exact expression for the single moments of order statistics. Pasari and Dikshit [] investigated the suitability of W distribution in the probabilistic assessment of earthquake hazards. The performance is also compared with two other popular models from the same W family, namely the two-parameter W model and the IW model. Jazi et al. [] proposed a discrete IW distribution, a discrete version of the continuous IW variable of fitting discrete-time reliability and survival data sets. Kundu and Howlader [] considered the Bayesian inference and prediction problems of the IW distribution based on Type-II censored data. Qingtian et al. [] discussed the definition of environmental factors and restricting conditions of constant failure mechanism Based on generally accepted basic hypotheses. They estimated the IW distribution environment factor using the MLE and Bayes estimation methods. Musleh and Helu [] considered two types of inference procedures: the classical (MLE, Approximate MLE and the least square method (LSE)) and the Bayesians (the squared error loss function (SQR), Linex loss function (LIN), General entropy loss function (GE), the Precautionary loss function (PRE)) to estimate the unknown parameters of the IW distribution when data under consideration are progressively type-II censoring. Akgul et al. [] used IW distribution for modeling the seasonal wind speed using the modified maximum likelihood (MML) estimators of the parameters. The MML estimators’ efficiencies are compared with the well-known maximum likelihood (ML) and the least-squares (LS) estimators via the Monte-Carlo simulation study. Nassar and Abo-Kasem [] discussed the estimation problem of the unknown parameters of the IW distribution based on adaptive type-II progressively hybrid censored data. They used classical and Bayesian estimation methods to estimate the unknown parameters.
The Bayesian approach to modelling provides an alternative to the standard GLMs. The posterior mode estimation is an alternative to full posterior analysis or posterior mean estimation, which avoids numerical integrations or simulation methods. It has been proposed by many authors; see References [,]. Dey et al. [] described how to conceptualize, perform, and critique the traditional GLMs from a Bayesian perspective and how to use modern computational methods to summarize inferences using simulation. Olsson [] given an overview of the GLMs and has presented practical examples. The exponential family of distributions are discussed along with the maximum likelihood estimation and ways of assessing the fit of the model. Dobson and Barnett [] presented a theoretical background of the GLMs. For the Bayesian estimation in this context, a useful asymmetric loss function, known as the LINEX loss function, was introduced by Varian (1975) and has been widely used by several authors. A suitable alternative to the modified LINEX loss is the general entropy (GE) loss function proposed by []. This loss function is a generalization of the entropy loss function used by several authors [,,]. One highly used one is the zero-one loss function (for more details, see Reference []).
In order to reduce the influence of outliers on the estimate, some robust measures were proposed in the literature. The common robust estimation method can be divided into several categories: M, MM, median, L1, Msplit, R, S, least-trimmed squares, and sign-constraint robust least squares estimation. Among these, Huber’s M estimation has become one of the main robust estimation methods by virtue of its simple calculation and convenience to implement []. The key aspect is the involvement of a loss function that is applied to data errors that was selected to less rapidly increase than the square loss function that is used in least-squares or maximum-likelihood procedures. There exist several well-known families of loss functions, such as Huber, Hampel, and Tukey’s biweight (or bisquare) that can be used for the computation of M estimators []. The IW distribution is flexible distribution can be used as a competing to gamma and Weibull distributions to describe more widely real life data, failure characteristics, such as infant mortality, useful life and wear-out periods, applications in medicine and ecology, determining the cost-effectiveness, maintenance periods of reliability centered maintenance activities, and biological research (for more details, see References [,,,,,,,,,]).
This paper is structured as follows: In Section 2, we present an overview of the GLMs and propose the IW-Reg model under various link functions for estimating the model parameters. In Section 3, we estimate the IW-Bayesian regression (IW-BReg) model under a gamma prior, various link and two loss functions. In Section 4, we apply the theoretical results of both of IW-Reg and IW-BReg models to some real datasets collected from Saudi Arabia. Next, we investigate the performance of the proposed models in terms of some criteria, such as Akaike’s information criterion (AIC), mean squared error (MSE), and deviance (D). In addition, we propose Huber and Tukey’s bisquare (biweight) functions to improve IW-BReg models. In addition, we adopt the iteratively reweighted least squares (IRLS) algorithm to estimate the Bayesian regression coefficients. Finally, Section 5 draws a succinct conclusion to the findings.
2. Classical Approach
Nelder and Wedderburn [] introduced the class of the GLMs, defined according to the assumption that are observations of the response variable, with the density function of as follows:
where , are known functions, with being the canonical parameter. A link function, , relating to the regression coefficients, is given by
where , is a vector of p unknown regression parameters, is a vector of explanatory variables, and is a linear predictor of the vectors and . Here, the is a link function, which is a monotonic differentiable invertible function. The model given by (1) and (2) is called the GLM. The GLM class includes, as special cases, linear regression and analysis of variance models, logit and probit models for quantal responses, log-linear models, and multinomial response models for counts; for more details, see Reference [].
Consider that the probability density function of the IW distribution as follows []
here, and are the shape and scale parameters, respectively. The mean value of the response variable is given by
The cumulative function of IW distribution is given by
Let be a random sample from IW, and , the log-likelihood function based on , is given by
The regression coefficients are estimated using the Fisher’s scoring technique [,]. In order to develop the GLMs for our models, the IW-Reg are similar to GLMs, except that the distribution of the response variable is not a member of the exponential family []. We also suggest some convenient link functions of , in view of (2), as in the following lemmas.
Lemma 1 (The IW-Reg model with a log link function).
Let the response variable Y have an IW distribution, , and let the link function of the form be
Thus, the estimated coefficients using Fisher’s scoring technique at the iteration is given by
where X is a covariates matrix, is an initial vector, ,
and , and
The procedure in (7) can be repeated until . The IW-Reg model in this case is given by
Proof.
Suppose that, in , as in (3), the parameter is assumed to be known. The log-likelihood function based on is given as in (5). The link function connecting the with the linear model , in this case, is given as in (6). for the log-likelihood is written from one observation as
From (5) and (6), we have
which can be written in the matrix notation as
Taking the second derivatives of , we have
hence,
Since and , then and
where is the diagonal matrix of weights, and is as it is in (8). Then,
From (13) and (14), we have
Finally, the estimated coefficients is given by
and
as given in (7).
To derive the MLS of , the IRLS is used. Under certain regularity conditions on the likelihood function, the MLE are asymptotically normal, unbiased, and efficient, with covariance matrix equal to the inverse of Fisher’s information matrix []. Thus, has asymptotically normal distribution,
where is the inverse of Fisher’s information matrix. □
Lemma 2 (The IW-Reg model with identity link function).
Let the response variable Y have an IW distribution, , and let the link function of the form be
Thus, the estimated coefficients using Fisher’s scoring technique at the iteration is given by
where X is a covariates matrix, is an initial vector, ,
and , and
The procedure in (17) can be repeated until . The IW-Reg model in this case is given by
Proof.
Suppose that, in , as in (3), the parameter is assumed to be known. The log-likelihood function based on is given as in (5). The link function connecting the with the linear model , in this case, is given in (16). From (5), (11), and (16), we have
which can be written in the matrix notation as in (13). Taking the second derivatives of , we have
hence,
since and , then and is given as in (14), where is the diagonal matrix of weights, and is as it is in (18). Using (15), we have as in (17). □
Lemma 3 (Convergence estimates in the Fisher’s scoring process).
Let the response variable Y have an IW distribution, and let the link function of the form be i = 1,2,...,n, the estimated coefficients using Fisher’s scoring technique at the iteration is given by
Then, , where X is a covariate matrix, W is the diagonal matrix of weights, and Z is a vector of the response variable.
Proof.
Suppose that, in , as in (3), the parameter is assumed to be known. The log-likelihood function based on is given as in (5). Furthermore, suppose that the link function connecting the with the linear model is given as in (2). The Fisher’s scoring process to obtain the MLEs estimates is given by computing the iterations:
where is the score vector for the log-likelihood (5), and is the Fisher’s information matrix. Taking the expectation of the Equation (21), we have
since the estimates are the MLEs, and . Hence, we get for all where is a vector of expectations. From (21), we have
Using the Chebyshev inequality, for every , we find
Now, by the Jensen inequality,
Let and, using (25), we then obtain
since . On the other hand, by choosing into the Equation (26), this becomes as . □
3. Bayesian Approach
Diaconis and Ylvisker [] introduced a conjugate prior distribution for the exponential family, which, as in (1), can be shown as
where is a normalization constant, and are natural parameters. The values are connected to the regression coefficients by the link function as
The posterior distribution of is given by
Das and Dey [] suggested a Jacobian transformation and rewrote (29) with the term , as
where is a normalization constant, and . They used a zero-one loss function to attain the posterior mode of (30) as ; hence, the estimated coefficients are given by
where is the least square estimates, and [,]. Under regularity conditions, the estimator has a asymptotically normal distribution , where is the inverse of Bayesian Fisher’s information (BIF). Note that the BFI is given by
where is the posterior pdf of [,,].
In order to develop the Bayesian approach, we propose a modified loss function of the general entropy (MGE) loss function to be appropriate for the Bayesian estimates. The MGE loss function is introduced in the following lemma.
Lemma 4 (The MGE loss function).
Consider that the posterior distribution of the is , is an estimate of , and , are independent observations. A suitable alternative loss function to the GE loss is the MGE loss function, given as
Thus, the posterior Bayes estimates of is given by solving the equation
Proof.
In order to develop a Bayesian approach, we suggest inverted Weibull Bayesian generalized linear models (IW-BReg) that are similar to the approach in Section 3, except that the distribution of the response variable is not a member of the exponential family. We use the general form of the posterior in (30), and since is a monotonic differentiable function, then we attain the posterior Bayes estimates. Moreover, we use the a log and identity link functions with different loss functions. The IW-BReg estimates correspond to the link functions using different loss functions, as in the following lemmas.
Lemma 5 (The IW-BReg model based on zero-one loss function).
Let the response variable Y have an IW distribution and let the link function of the form be as in (2). Consider that α has a gamma prior with the following density function:
Thus, the posterior mode of by using a zero-one loss function can be derived by solving the following equation:
where , is defined as in (6) and (16). The estimated coefficients are given as in (31). The IW-BReg model in this case is given by
Proof.
Suppose that is as it is in (3), the parameter is assumed to be known, and the density function of is given by
Consider a gamma prior for , which can be written as in (34). The posterior distribution of is given by
Using Jacobian transformation from to , we have
Taking the derivative of the log posterior, we have
hence, we get the equation as in (35), and the posterior mode of is given by solving it. □
Lemma 6 (The IW-BReg model based on MGE loss function).
Let the response variable Y have an IW distribution, and let the link function of the form be as given in (2). Consider that has a gamma prior with a density function as given in (34). Thus, the posterior Bayes estimates of , by using an MGE loss function, can be derived by solving the equation
where , is defined as in (6) and (16). The estimated coefficients are given as in (31). The IW-BReg model in this case is given by
Proof.
Suppose that is as it is in (3). The parameter is assumed to be known, and the density function of is given as in (37). Consider the gamma prior for , which can be written as given in (34). Using the posterior distribution of that is given in (38), we have
hence,
Using Lemma (4), we have
Thus, the posterior Bayes estimates of by using the MGE loss function can be derived by solving the Equation (39). □
4. Data Analysis
In this section, we show the usefulness and performance of the IW-Reg and IW-BReg models by applying the theoretical findings in Section 2 and Section 3 to some real datasets. For simplicity, we use the following notations for the proposed models used throughout the applications.
Model | Description |
Model I | IW-Reg model based on identity link function |
Model II | IW-Reg model based on log link function |
Model III | IW-BReg model based on identity link and zero-one loss function |
Model IV | IW-BReg model based on log link and zero-one loss |
Model V | IW-BReg model based on identity link and MGE loss |
Model VI | IW-BReg model based on log link and MGE loss |
4.1. Application 1: (The Minimum Temperatures)
Dataset in this application was collected from the meteorology station at King Khalid International Airport, Saudi Arabia during (2014–2018). This data contains 54 observations (monthly data), in which the response variable Y be the minimum of dry bulb temperatures in Celsius. The explanatory variables are; ; mean of relative humidity, ; mean of vapor pressure (mm), ; mean of sky cover oktes, ; maximum of station-level pressure (mm).
In order to aid in distributional assessment of the response variable Y, the empirical cumulative distribution function (ECDF) plot was proposed. Kolmogorov–Smirnov goodness of fit test (K–S) was calculated based on IW, Gaussian, and gamma distributions. The IW-Reg and IW-BReg models based on log and identity link, and loss functions were fitted using the proved Lemmas in Section 2 and Section 3. Bayes coefficients were obtained using a gamma prior with some known values of the hyperparameters and . In addition, Huber’s function was suggested to avoid such distortions due to an outlier in ; see Appendix A. In this case, under regularity conditions, estimator has asymptotically normal distribution [,]. The performance of all these models were compared. Modeling performance is measured in terms of some criteria, such as AIC, D, D/df, and MSE []. We also used Thiel’s inequality coefficient to compare the prediction accuracy of the selected models [,]. The backward-selection method was used in the IW-BReg model to select the best fit in view of the covariates.
To check the adequacy for the selected models, we consider Pearson residuals [,]. R software was used to carry out calculations. In order to compare with known distributions, the glm() function in “stats” was used to fit the GLMs []. Functions qqPlot(), ecdf(), boxplot, and ks.test() in R package “stats” were used for the assessment distributions []. To solves n roots of n nonlinear equations in Section 3, the function multiroot() in R package “rootSolve” was used []. The fitting results and the relative errors (RE) of the selected model, and other numerical results are shown in Table 1, Table 2 and Table 3.

Table 1.
Efficiency Gamma, Gaussian, inverted Weibull Regression (IW-Reg), IW-Bayesian regression (IW-BReg) models.

Table 2.
Akaike’s information criterion (AIC), deviance (D), and mean squared error (MSE) of the Model VI (backward selection method).

Table 3.
Fitting results of the Model VI based on Huber’s function (during fitting interval, 2014).
Based on the results obtained from K–S test, the p-value = 0.315 for the test indicates that the IW distribution fits the response variable in the given data quite well. Figure 1 provides the ECDF plot, and it is clear that the IW distribution fits these data well.

Figure 1.
The empirical cumulative distribution function (ECDF) plot of the minimum temperatures based on IW distribution and some other distributions.
To compare between the Bayesian fitting results, we observe that the results based on MGE loss function are better than zero-one loss function. Table 1 shows that the IW-BReg models based on MGE loss function (Model V and VI) are good in terms of MSE, AIC, and D statistics. Table 1 also shows that the of IW-Reg and IW-BReg models (I, II, III, IV, V, and VI) are less than 1, indicating that the fitting degree is very good. If the model is correct, the Pearson residuals and Pearson statistics have an approximately normal distribution with mean 0 and chi-square distribution , respectively. For the IW-BReg model based on identity link and MGE loss function (Model V), the Pearson statistics is , the p-value for Anderson-Darling is 0.0001, and the Cox Stuart test is 1, so the Pearson residuals are not normal but randomly scattered around zero at the level of significant . For the IW-BReg model based on log link and MGE loss function (Model VI), the Pearson statistics is , the p-value for Anderson-Darling is 0.06379, and the Cox Stuart test is 1, so the Pearson residuals are normal and randomly scattered around zero.
Based on this analysis, we conclude that the Model VI is more appropriate for fitting these data, leading to the following equation
For the backward selection method results, Table 2, we can conclude that the predictive model is given as follows:
We also can see that, this model has AIC = 364.3539 and a low MSE = 2.3463, and there was also a significant relationship among variables when using level of significance . For the residuals, the Pearson statistics is , p-value for Anderson-Darling is 0.0443 and for the Cox Stuart test is 1.
Because of the presence of an outlier, we can conclude that the Model VI based on Huber’s function is the best for our data, and it is given as follows:
From Table 2, we can see, this model has AIC = 363.2006 and a low MSE = 2.3451, and there was also a significant relationship among variables when using the level of significance . For the residuals, the Pearson statistics is , the p-value for Anderson-Darling is 0.052, and the Cox Stuart test is 1. Hence, the Pearson residuals are normal randomly scattered around zero; see Figure 2. The fitting results for this model during the year 2014 are shown in Table 3. We can also see that the fitting accuracy is good because the TIC value is closer to 0 than 1.

Figure 2.
(a) Pearson residuals plot and (b) Normal Q-Q plot of the residuals using Model VI.
4.2. Application 2: (Wind Speed Data)
The dataset in this application was taken again from the meteorology station at King Khalid International Airport, Saudi Arabia, in 2016. This data contains 91 observations, during 7 June and 5 September, (summer season), in which the response variable Y be the mean wind speed (km/h). The explanatory variables are; ; maximum’s wind direction, ; maximum of station-level pressure (mm), ; mean of sea-level pressure (mm), ; mean of dry bulb temperatures of air (Celsius), ; mean of wet bulb temperatures (Celsius), ; mean of relative humidity, ; mean of vapor pressure (mm), ; mean of sky cover oktes, ; maximum of station-level pressure (mm), ; maximum of sea-level pressure (mm), ; maximum of dry bulb temperatures (Celsius), ; maximum of the wet bulb temperatures (Celsius), ; maximum of relative humidity, ; minimum of station-level pressure (mm), ; minimum of sea-level pressure (mm), ; minimum of dry bulb temperatures, ; minimum of the wet bulb temperatures, ; minimum of relative humidity, ; time of maximum daily wind (HH:MM).
Proceeding similarly, as in Application 1 to aid in the distributional assessment. In this dataset, we identify the outliers, different plots as the quantile-quantile (Q-Q) plot, ECDF, and box plot were proposed. Again, Lemmas in Section 2 and Section 3 were applied to these data to fit the IW-Reg based on log and identity link functions were used. Besides being an alternative analysis, the IW-BReg models were obtained using a log, identity link, and a gamma prior with known hyperparameters and parameters. We also compare the performance of all these models. In addition, biweight function was suggested to avoid such distortions due to outliers; see Appendix A. In this case, under regularity conditions, estimator has asymptotically normal distribution [,]. The modeling performance was measured in terms of some criteria, such as AIC, D, D/df, and MSE []. We also used Theil’s Inequality coefficient (TIC) to measure the prediction accuracy of the selected models [,]. To compare the residual for all models, we consider Pearson residuals to check the adequacy of the regression model fitted to the data [,].
Furthermore, to detect the influential cases, we use the Cook’s distance measure using the formula and in the case of Bayesian analysis [,]. The backward selection method was used in the IW-Reg model to remove the input variable; see Table 4. R software was used to carry out the calculations. In order to compare with known distributions, the function glm in “stats” is used to fit the GLMs. The functions qqPlot, ecdf, boxplot, and ks.test in the R package “stats” are used for the assessment distributions []. To solves n roots of n nonlinear equations in Section 3, the function multiroot() in R package “rootSolve” was used []. The fitting, predictive results of these models and the other numerical results are shown on the Table 4, Table 5, Table 6, Table 7 and Table 8.

Table 4.
The Model I (backward selection method).

Table 5.
Efficiency Gaussian, inverted Wiebull (IW-Reg), IW-Bayesian regression (IW-BReg) models.

Table 6.
AIC, BIC, D, and MSE of the Model VI based biweight function.

Table 7.
Anderson-Darling and Cox Stuart test for Pearson residuals of the Model VI.

Table 8.
Fitting and Predicted results for the Model VI based on biweight function.
Based on the results obtained from K–S test, the p-value = 0.139 for the test indicates that the IW distribution fits the response variable in the given data quite well. Figure 3 provides the Q-Q plot and ECDF, and it is clear that the IW distribution fits these data well. Figure 4 provides box plot corresponding to the mean wind speed variable Y, and this chart mapped one outlier (leverage point) that exceeds the values of .

Figure 3.
(a) Q-Q plots of the wind speed based on IW distribution and (b) The ECDF plot of Y based on IW and some other distributions.

Figure 4.
Box plot of the wind speed variable .
From Table 4, we can observe that the variables , , , and are significant for the model, so there is a significant relationship among variables. In these models, is stabilizes when the Fisher’s scoring procedure is converged at and , respectively, because of . To compare the Bayesian fitting results we observe that the results based on MGE loss function (Model V and VI) better than zero-one loss function (Model III and IV); see Table 5. Table 5 also shows that the of the models I, II, III, IV, V, and VI are less than 1, indicating that the fitting degree is very good.
Based on this analysis, we also conclude that the Model VI is more appropriate for fitting these data, leading to the following equation
For the residuals, the Pearson statistics is , p-value for Anderson-Darling is 0.0496, and for the Cox Stuart test is 1; see Table 5. This residuals have a large positive residual at the observation 91. However, for the model, this case is non-influential according to where corresponding to upper -percentile from the F distribution [].
Because of the presence of an outlier, we can conclude that the Model VI based on biweight function is the best for our data, and it is given as follows:
From Table 6, we can see that this model has AIC = 446.515 and a low MSE = 3.046, and there was also a significant relationship among variables when using the level of significance . For the residuals, the Pearson statistics is , the p-value for Anderson-Darling is 0.0612, and the Cox Stuart test is 1. Hence, the Pearson residuals are normal randomly scattered around zero at the level of significant ; see Table 7 and Figure 5. This Figure shows no large positive residual. The fitting and predicted results for this model during 2016 and 2017 are shown in Table 8. We can also see that the prediction accuracy is good because the TIC value is closer to 0 than 1.

Figure 5.
(a) Pearson residuals plot and (b) Normal Q-Q plot of the Pearson residuals for the Model VI based on biweight function as mentioned in Table 6.
5. Conclusions
In this paper, the regression models IW-Reg and IW-BReg for modeling Saudi datasets are considered. Zero-one and MGE loss functions were used to attain the Bayesian estimates based on a log and identity functions. In the classical approach, parameter estimation is done by the Fisher’s scoring technique, and closed-form expressions are provided for the score function, and for Fisher’s information matrix and its inverse. In the Bayesian approach, parameter estimation is performed using a gamma prior distribution, Jacobian transformation, and least-squares estimates. The IW-Reg and IW-BReg models were compared to find which model predicted better. To deal with outlier problems, IW-BReg based on Huber’s and biweight functions, and the adopted algorithm based on IRLS to find the estimates, were proposed. For distributional assessment, Q-Q, ECDF, box plots, and the K–S test were applied. Some criteria, namely AIC, D, D/df, and MSE, were also computed for all regression models.
According to the results of the Application (1), the IW-BReg model based on Huber’s and MGE loss with a log link function, performed the best in terms of the AIC, D, D/df, and MSE statistics, so it is recommended for these data. In contrast, the IW-Reg model showed poor results compared with those of the other models. Results indicated that the IW-BReg model based on Huber’s and MGE loss is highly capable of improving regression models’ performance to a greater extent in predicting the minimum of dry bulb temperatures (Celsius) in Saudi Arabia. It is found the following regressors are significant for the model: Explanatory variables are: , the mean of relative humidity, , the mean of vapor pressure (mm); and , the maximum of station-level pressure (mm). Application (2), the IW-BReg model based on biweight function and MGE loss with a log link function, performed the best in terms of the AIC, D, D/df, and MSE statistics, so it is recommended for these data. In contrast, IW-Reg and IW-BReg based on zero-one loss function showed poor results than those of the other models. Finally, the results in this application indicated that the IW-BReg model based on biweight function and MGE loss with a log link function is highly capable of improving regression models’ performance to a greater extent in predicting the mean wind speed (km/h) in Saudi Arabia. It is found the following regressors are significant for the model: Explanatory variables are: , the mean of station-level pressure (mm); , the mean of wet–bulb temperatures (Celsius); , the mean of sky cover oktes; and , the minimum of dry bulb temperatures. From these discussions, we conclude that IW-BReg model based on log link and MGE loss has good performance for the response variables in the considered applications.
Author Contributions
Conceptualization, (S.R.A.-D., K.S.S.); methodology, S.R.A.-D.; software, S.R.A.-D.; validation and formal analysis, S.R.A.-D.; resources, (S.R.A.-D., K.S.S.); supervision, K.S.S.; writing—original draft preparation, S.R.A.-D.; writing—review and editing, (S.R.A.-D., K.S.S.). All authors have read and agreed to the published version of the manuscript.
Funding
This article was funded by the Deanship of Scientific Research at King Saud University (RG-1435-056).
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
The data presented in this article was taken from the National Center for Meteorology in Saudi Arabia on the link https://ncm.gov.sa/Ar/About/Branches/Pages/default.aspx, accessed on 7 December 2020.
Acknowledgments
The authors would like to thank the editor and referees for their helpful comments, which improved the presentation of the paper. In addition, the authors would like to extend their sincere appreciation to the Deanship of Scientific Research at King Saud University for its funding this Research Group (RG-1435-056).
Conflicts of Interest
The authors declare no conflict of interest.
Appendix A. Robust IW-BReg Models
M-estimation is considered to be the most common method of robust regression. It was proposed by [,] in the presence of outliers, and it is more efficient than ordinary least squares (OLS) [,]. The Huber’s function takes the following form:
where k is the tuning constant, r is the residual corresponding to the observation in OLS, and is the objective function that satisfies certain properties. Often, can be formed by using a linear combination of the residuals. Defining function and the corresponding weight function in this case is as follows:
Another M-estimation function is the Tukey bisquare (biweight) function. This is based on Tukey’s function, taking the form of that in Reference []
where k is the tuning constant, and r is the residual corresponding to the observation in OLS. Defining function and the corresponding weight function in this case is given as follows:
To make the IW-BReg models are robust, we suggest Huber’s and biweight functions for these models. There are also many other versions of the M-estimation function that could be used here.
Let the response variable Y have an IW distribution, and let the link function of the form be as given in (2). Consider that has a gamma prior with density function is given as in (34). Using the Jacobian transformation from to and using the link function, we have the posterior distribution of is given as in (38). Thus, the estimated coefficients are given as
where and are the posterior Bayes estimates of using the zero-one or MGE loss functions, and , are the selected weights depending on M-estimation functions. In this case, coefficients are estimated using an adopted IRLS algorithm [,,,] as follows:
- i.
- Setting the iteration counter at , finding an initial estimates of regression coefficients using IW-Reg estimates.
- ii.
- The initial residuals are based on the link function that is given as in (2), and calculate an initial scale estimate .
- iii.
- An initial standardized residuals are calculated and used to calculate initial estimates for the weight function. Preliminary weights are .
- iv.
- Calculate Bayes estimates using a gamma prior with known parameters and zero-one or MGE loss functions.
- vii.
- Using weights from Steps i–iii and Step iv to find estimators in (A5).
- viii.
- Set ; then, go to Step ii. Steps ii to vii are repeated until the estimate of is stabilized from the previous iteration, which means: .
References
- McCullagh, P.; Nelder, J.A. Generalized Linear Models; Number 37 in Monographs on Statistics and Applied Probability; Chapman and Hall: London, UK, 1983. [Google Scholar]
- Nelder, J.A.; Pregibon, D. An extended quasi-likelihood function. Biometrika 1987, 74, 221–232. [Google Scholar] [CrossRef]
- Nelder, J.A.; Wedderburn, R.W.M. Generalized linear models. J. R. Stat. Soc. Ser. A 1972, 135, 370–384. [Google Scholar] [CrossRef]
- De Jong, P.; Heller, G.Z. Generalized Linear Models for Insurance Data; Cambridge University Press: New York, NY, USA, 2008. [Google Scholar]
- Yuan, K.H.; Bentler, P.M. Improving the convergence rate and speed of Fisher-scoring algorithm: Ridge and anti-ridge methods in structural equation modeling. Ann. Inst. Stat. Math. 2017, 69, 571–597. [Google Scholar] [CrossRef]
- Liao, T.F. Interpreting Probability Models: Logit, Probit, and Other Generalized Linear Models; SAGE Publications: Thousand Oaks, CA, USA, 1994. [Google Scholar]
- Keller, A.Z.; Kamath, K. Alternate reliability models for mechanical systems. In Proceedings of the 3rd International Conference on Reliability and Maintainability, Toulouse, France, 16–21 October 1982; pp. 411–415. [Google Scholar]
- Calabria, R.; Pulcini, G. Confidence limits for reliability and tolerance limits in the inverse Weibull distribution. Reliab. Eng. Syst. Saf. 1989, 24, 77–85. [Google Scholar] [CrossRef]
- Jiang, R.; Zuo, M.J.; Li, H.X. Weibull and inverse Weibull mixture models allowing negative weights. Reliab. Eng. Amnd Syst. Saf. 1999, 66, 227–234. [Google Scholar] [CrossRef]
- Mahmoud, M.A.W.; Sultan, K.S.; Amer, S.M. Order statistics from inverse Weibull distribution and associated inference. Comput. Stat. Data Anal. 2003, 42, 149–163. [Google Scholar] [CrossRef]
- Pasari, S.; Dikshit, O. Impact of three-parameter Weibull models in probabilistic assessment of earthquake hazards. Pure Appl. Geophys. 2014, 171, 1251–1281. [Google Scholar] [CrossRef]
- Jazi, M.A.; Lai, C.D.; Alamatsaz, M.H. A discrete inverse Weibull distribution and estimation of its parameters. Stat. Methodol. 2010, 7, 121–132. [Google Scholar] [CrossRef]
- Kundu, D.; Howlader, H. Bayesian inference and predication of the inverse Weibull distribution for Type-II censoring data. Comput. Stat. Data Anal. 2010, 54, 1547–1558. [Google Scholar] [CrossRef]
- Han, Q.; Li, L.; Gao, X. Statistical inference of the environment factor for inverse weibull distribution. In Proceedings of the 2010 The 2nd Conference on Environmental Science and Information Application Technology, Wuhan, China, 17–18 July 2010; Volume 3, pp. 613–616. [Google Scholar]
- Musleh, R.M.; Helu, A. Estimation of the inverse Weibull distribution based on progressively censored data: Comparative study. Reliab. Eng. Syst. Saf. 2014, 131, 216–227. [Google Scholar] [CrossRef]
- Akgul, F.; Senoglu, B.; Arslan, T. An alternative distribution to Weibull for modeling the wind speed data: Inverse Weibull distribution. Energy Convers. Manag. 2016, 114, 234–240. [Google Scholar] [CrossRef]
- Nassar, M.; Abo-Kasem, O.E. Estimation of the inverse Weibull parameters under adaptive type-II progressive hybrid censoring scheme. J. Comput. Appl. Math. 2017, 315, 228–239. [Google Scholar] [CrossRef]
- Cepeda, E.; Gamerman, D. Bayesian methodology for modeling parameters in the two parameter exponential family. Rev. Estad. 2005, 57, 93–105. [Google Scholar]
- Fahrmeir, L.; Tutz, G. Multivariate Statistical Modelling Based on Generalized Linear Models; Springer: Berlin/Heidelberg, Germany, 2013. [Google Scholar]
- Dey, D.K.; Ghosh, S.K.; Mallick, B.K. Generalized Linear Models: A Bayesian Perspective; CRC Press: New York, NY, USA, 2000. [Google Scholar]
- Olsson, U. Generalized Linear Models, An Applied Approach; Student Litteratur Lund.: Lund, Sweden, 2002. [Google Scholar]
- Dobson, A.J.; Barnett, A. An Introduction to Generalized Linear Models; CRC Press: Boca Raton, FL, USA, 2008. [Google Scholar]
- Calabria, R.; Pulcini, G. An engineering approach to Bayes estimation for the Weibull distribution. Microelectron. Reliab. 1994, 34, 789–802. [Google Scholar] [CrossRef]
- Dey, D.K.; Ghosh, M.; Srinivasan, C. Simultaneous estimation of parameters under entropy loss. J. Stat. Plan. Inference 1986, 15, 347–363. [Google Scholar] [CrossRef]
- Calabria, R.; Pulcini, G. Point estimation under asymmetric loss functions for left-truncated exponential samples. Commun. Stat. Theory Methods 1996, 25, 585–600. [Google Scholar] [CrossRef]
- Sano, N.; Suzuki, H.; Koda, M. A robust ensemble learning using zero-one loss function. J. Oper. Res. Soc. Jpn. 2008, 51, 95–110. [Google Scholar] [CrossRef][Green Version]
- Li, Y.; Hou, L.; Yang, Y.; Tong, J. Huber’s M-Estimation-Based Cubature Kalman Filter for an INS/DVL Integrated System. In Mathematical Problems in Engineering; Hindawi: London, UK, 2020. [Google Scholar]
- Sinova, B.; Van Aelst, S. Advantages of M-estimators of location for fuzzy numbers based on Tukey’s biweight loss function. Int. J. Approx. Reason. 2018, 93, 219–237. [Google Scholar] [CrossRef]
- Drapella, A. Complementary Weibull distribution: Unknown or just forgotten. Qual. Reliab. Eng. Int. 1993, 9, 383–385. [Google Scholar] [CrossRef]
- Mudholkar, G.S.; Kollia, G.D. Generalized Weibull family: A structural analysis. Commun. Stat. Theory Methods 1994, 23, 1149–1171. [Google Scholar] [CrossRef]
- Murthy, D.P.; Xie, M.; Jiang, R. Weibull Models; John Wiley and Sons: Hoboken, NJ, USA, 2004; Volume 505. [Google Scholar]
- Khan, M.S.; Pasha, G.R.; Pasha, A.H. Theoretical analysis of inverse Weibull distribution. WSEAS Trans. Math. 2008, 7, 30–38. [Google Scholar]
- De Gusmao, F.R.; Ortega, E.M.; Cordeiro, G.M. The generalized inverse Weibull distribution. Stat. Pap. 2011, 52, 591–619. [Google Scholar] [CrossRef]
- Singh, S.; Singh, U.; Sharma, V. Bayesian prediction of observations from inverse Weibull distribution based on Type-II hybrid censored sample. Int. J. Adv. Stat. Probab. 2013, 1, 32–43. [Google Scholar] [CrossRef][Green Version]
- Elbatal, I.; El Gebaly, Y.M.; Amin, E.A. The Beta Generalized Inverse Weibull Geometric Distribution. Pak. J. Stat. Oper. Res. 2017, 75–90. [Google Scholar] [CrossRef]
- McCullagh, P.; Nelder, J.A. Generalized Linear Models; CRC Press: Boca Raton, FL, USA, 1989; Volume 37. [Google Scholar]
- Muhammed, H.Z. Bivariate inverse Weibull distribution. J. Stat. Comput. Simul. 2016, 86, 2335–2345. [Google Scholar] [CrossRef]
- Ferrari, S.; Cribari-Neto, F. Beta regression for modelling rates and proportions. J. Appl. Stat. 2004, 31, 799–815. [Google Scholar] [CrossRef]
- Houston, W.M.; Woodruff, D.J. Empirical Bayes Estimates of Parameters from the Logistic Regression Model; ACT Research Report Series 97-6; ACT, Inc.: Iowa, IA, USA, 1997; 34p. [Google Scholar]
- Das, S.; Dey, D.K. On Bayesian Analysis of Generalized Linear Models: A New Perspective, Technical Report; University of Connecticut, Department of Statistics: Storrs, CT, USA, 2007; 33p. [Google Scholar]
- Das, S.; Dey, D.K. On Bayesian analysis of generalized linear models using the Jacobian technique. Am. Stat. 2006, 60, 264–268. [Google Scholar] [CrossRef]
- Tellinghuisen, J. Least squares with non-normal data: Estimating experimental variance functions. Analyst 2008, 133, 161–166. [Google Scholar] [CrossRef]
- Clarkson, E. Bayesian Fisher Information and Detection of a Small Change in a Parameter. In Proceedings of the 2020 54th Annual Conference on Information Sciences and Systems (CISS), Princeton, NJ, USA, 18–20 March 2020; pp. 1–5. [Google Scholar]
- Leuthold, R.M. On the use of Theil’s inequality coefficients. Am. J. Agric. Econ. 1975, 57, 344–346. [Google Scholar] [CrossRef]
- Niu, T.; Zhang, L.; Zhang, B.; Yang, B.; Wei, S. An Improved Prediction Model Combining Inverse Exponential Smoothing and Markov Chain. In Mathematical Problems in Engineering; Hindawi: London, UK, 2020; 11p. [Google Scholar]
- Faraway, J.J. Extending the Linear Model with R: Generalized Linear, Mixed Effects and Nonparametric Regression Models; CRC Press: Boca Raton, FL, USA, 2016. [Google Scholar]
- Fox, J.; Weisberg, S. An R Companion to Applied Regression; Sage Publications: Thousand Oaks, CA, USA, 2018. [Google Scholar]
- Soetaert, K. rootSolve: Nonlinear Root Finding, Equilibrium and Steady-State Analysis of Ordinary Differential Equations; R Package 1.6; 2009; Available online: https://cran.r-project.org/web/packages/rootSolve/index.html (accessed on 7 December 2020).
- Agresti, A. Foundations of Linear and Generalized Linear Models; John Wiley and Sons: Hoboken, NJ, USA, 2015. [Google Scholar]
- Diaz-Garcia, J.A.; Gonzalez-Farıas, G. A note on the Cook’s distance. J. Stat. Plan. Inference 2004, 120, 119–136. [Google Scholar] [CrossRef]
- Huber, P.J. Robust Statistics; John Wiley and Sons: Hoboken, NJ, USA, 1981. [Google Scholar]
- Huber, P.J. Robust estimation of a location parameter. Ann. Math. Stat. 1964, 35, 73–101. [Google Scholar] [CrossRef]
- Rousseeuw, P.J.; Leroy, A.M. Robust Regression and Outlier Detection; John Wiley and Sons: Hoboken, NJ, USA, 1987. [Google Scholar]
- Chang, L.; Hu, B.; Chang, G.; Li, A. Robust derivative-free Kalman filter based on Huber’s M-estimation methodology. J. Process Control 2013, 23, 1555–1561. [Google Scholar] [CrossRef]
- Maronna, R.A.; Martin, R.D.; Yohai, V.J. Robust Statistics: Theory and Methods; John Wiley and Sons: Hoboken, NJ, USA, 2006. [Google Scholar]
- Wen, F.; Liu, W. Iteratively reweighted optimum linear regression in the presence of generalized Gaussian noise. In Proceedings of the 2016 IEEE International Conference on Digital Signal Processing (DSP), Beijing, China, 16–18 October 2016; pp. 657–661. [Google Scholar]
- Kikuchi, H.; Yasunaga, H.; Matsui, H.; Fan, C.I. Efficient privacy-preserving logistic regression with iteratively Re-weighted least squares. In Proceedings of the 2016 11th Asia Joint Conference on Information Security (AsiaJCIS), Fukuoka, Japan, 4–5 August 2016; pp. 48–54. [Google Scholar]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).