# Bayesian Reference Analysis for the Generalized Normal Linear Regression Model

^{1}

^{2}

^{3}

^{4}

^{5}

^{*}

^{†}

## Abstract

**:**

## 1. Introduction

## 2. Generalized Normal Distribution

## 3. Generalized Normal Linear Regression Model

**Proposition**

**1.**

## 4. Objective Bayesian Analysis

**Proposition**

**2.**

**Corollary**

**1.**

#### 4.1. Reference Prior

#### 4.2. A Problem with the Jeffreys Prior

**Corollary**

**2.**

**Proof.**

**Corollary**

**3.**

**Proof.**

**Proposition**

**3.**

**Proof.**

## 5. Metropolis–Hastings Algorithm

- Set the values ${\mathit{\omega}}^{\left(0\right)}=\left(\right)open="("\; close=")">{\omega}_{1}^{\left(0\right)},{\omega}_{2}^{\left(0\right)},{\omega}_{3}^{\left(0\right)}$.
- Generate ${\omega}_{1}^{*}$ from the proposal distribution $q({\omega}_{1}^{\left(0\right)},{\omega}_{1}^{*})$.
- Sample u from a uniform distribution $U(0,1)$.
- If $u\le min\left(\right)open="\{"\; close="\}">1,exp\left(\right)open="["\; close="]">log{\pi}^{\mathsf{R}}\left(\right)open="("\; close=")">{\omega}_{1}^{*}|{\omega}_{2}^{\left(0\right)},{\omega}_{3}^{\left(0\right)},\mathbf{y}$ then update ${\omega}_{1}^{\left(1\right)}$ from ${\omega}_{1}^{*}$; otherwise, use the value of ${\omega}_{1}^{\left(0\right)}$, i.e., ${\omega}_{1}^{\left(1\right)}={\omega}_{1}^{\left(0\right)}$.
- Repeat the same steps above for ${\omega}_{2}^{\left(1\right)}$ and ${\omega}_{3}^{\left(1\right)}$.
- Repeat Steps 2–5 until we obtain the target sample size.

## 6. Selection Criteria for Models

## 7. Bayesian Case Influence Diagnostics

## 8. Applications

#### 8.1. Artificial Data

#### 8.2. Real Data Set

## 9. Discussion

## Author Contributions

## Funding

## Institutional Review Board Statement

## Informed Consent Statement

## Conflicts of Interest

## Appendix A

**Proof of Corollary**

**2.**

**Proof of Corollary**

**3.**

**Proof of Proposition**

**3.**

## References

- Subbotin, M.T. On the law of frequency of errors. Math. Sb.
**1923**, 31, 296–301. [Google Scholar] - Belsley, D.A.; Kuh, E.; Welsch, R.E. Regression Diagnostic: Identifying Influential Data and Sources of Collinearity; John Wiley: New York, NY, USA, 1980. [Google Scholar]
- Barnett, V.; Lewis, T. Outliers in Statistical Data; John Wiley: New York, NY, USA, 1994. [Google Scholar]
- Box, G.E.P.; Tiao, G.C. Bayesian Inference in Statistical Analysis; John Wiley & Sons: New York, NY, USA, 1973. [Google Scholar]
- Walker, S.G.; Gutiérrez-Peña, E. Robustifying Bayesian procedures. In Bayesian Statistics 6; Bernardo, J.M., Berger, J.O., Dawid, A.P., Smith, A.F.M., Eds.; Oxford University Press: Oxford, UK, 1999; pp. 685–710. [Google Scholar]
- Liang, F.; Liu, C.; Wang, N. A robust sequential Bayesian method for identification of differentially expressed genes. Stat. Sin.
**2007**, 17, 571–595. [Google Scholar] - Varanasi, M.k.; Aazhang, B. Parametric generalized Gaussian density estimation. J. Acoust. Soc. Am.
**1989**, 86, 1404–1415. [Google Scholar] [CrossRef] - Agro, G. Maximum likelihood estimation for the exponential power function parameters. Commun. Stat.-Simul. Comput.
**1995**, 24, 523–536. [Google Scholar] [CrossRef] - West, M. On scale mixtures of normal distributions. Biometrika
**1987**, 79, 646–648. [Google Scholar] [CrossRef] - Choy, S.T.B.; Smith, F.M. On Robust Analysis of a Normal Location Parameter. J. R. Stat. Soc. Ser. B
**1997**, 59, 463–474. [Google Scholar] [CrossRef] - Salazar, E.; Ferreira, M.A.; Migon, H.S. Objective Bayesian analysis for exponential power regression models. Sankhya Ser. B
**2012**, 74, 107–125. [Google Scholar] [CrossRef] - Bernardo, J.M. Reference Analysis. In Handbook of Statistics 25; Dey, D., Rao, C., Eds.; Elsevier: Amsterdam, The Netherlands, 2005; Volume 25, pp. 17–90. [Google Scholar]
- Bernardo, J.M. Reference Posterior Distributions for Bayesian-Inference. J. R. Stat. Soc. Ser. B-Methodol.
**1979**, 41, 113–147. [Google Scholar] [CrossRef] - Berger, J.O.; Bernardo, J.M. On the development of reference priors. In Bayesian Statistics 4; Bernardo, J.M., Berger, J.O., Dawid, A.P., Smith, A.F.M., Eds.; Oxford University Press: Oxford, UK, 1992; pp. 35–60. [Google Scholar]
- Fonseca, T.C.; Ferreira, M.A.; Migon, H.S. Objective Bayesian analysis for the Student-t regression model. Biometrika
**2008**, 95, 325–333. [Google Scholar] [CrossRef] - Northrop, P.J.; Attalides, N. Posterior propriety in Bayesian extreme value analyses using reference priors. Stat. Sin.
**2016**, 26, 721–743. [Google Scholar] [CrossRef][Green Version] - Ramos, P.L.; Louzada, F.; Ramos, E. Posterior Properties of the Nakagami-m Distribution Using Noninformative Priors and Applications in Reliability. IEEE Trans. Reliab.
**2018**, 67, 105–117. [Google Scholar] [CrossRef] - Ramos, E.; Ramos, P.L.; Louzada, F. Posterior properties of the Weibull distribution for censored data. Stat. Probab. Lett.
**2020**, 166, 108873. [Google Scholar] [CrossRef] - Tomazella, V.L.; de Jesus, S.R.; Louzada, F.; Nadarajah, S.; Ramos, P.L. Reference Bayesian analysis for the generalized lognormal distribution with application to survival data. Stat. Its Interface
**2020**, 13, 139–149. [Google Scholar] [CrossRef] - Ramos, P.L.; Mota, A.L.; Ferreira, P.H.; Ramos, E.; Tomazella, V.L.; Louzada, F. Bayesian analysis of the inverse generalized gamma distribution using objective priors. J. Stat. Comput. Simul.
**2021**, 91, 786–816. [Google Scholar] [CrossRef] - Nadarajah, S. A generalized normal distribution. J. Appl. Stat.
**2005**, 32, 685–694. [Google Scholar] [CrossRef] - Zhu, D.; Zinde-Walsh, V. Properties and estimation of asymmetric exponential power distribution. J. Econom.
**2009**, 148, 86–99. [Google Scholar] [CrossRef][Green Version] - Chen, M.H.; Shao, Q.M.; Ibrahim, J.G. Monte Carlo Methods in Bayesian Computation; Springer: New York, NY, USA, 2000. [Google Scholar]
- Zellner, A. Maximal Data Information Prior Distributions. Basic Issues Econom.
**1984**, 211–232. [Google Scholar] - Moala, F.A.; Dey, S. Objective and subjective prior distributions for the Gompertz distribution. Anais Acad. Bras. Ciências
**2018**, 90, 2643–2661. [Google Scholar] [CrossRef][Green Version] - Moala, F.A.; Rodrigues, J.; Tomazella, V.L.D. A note on the prior distributions of weibull parameters for the reliability function. Commun. Stat. Theory Methods
**2009**, 38, 1041–1054. [Google Scholar] [CrossRef] - Muller, P. A Generic Approach to Posterior Integration and Gibbs Sampling; Technical Report; Department of Statistics, Purdue University: West Lafayette, IN, USA, 1991. [Google Scholar]
- Geweke, J. Evaluating the accuracy of sampling-based approaches to the calculation of posterior moments. In Bayesian Statistics 4; Bernardo, J.M., Berger, J.O., Dawid, A.P., Smith, A.F.M., Eds.; Oxford University Press: Oxford, UK, 1992; pp. 625–631. [Google Scholar]
- Spiegelhalter, D.; Best, N.; Carlin, B.; van der Linde, A. Bayesian measures of model complexity and fit. J. R. Stat. Soc. Ser. B
**2002**, 64, 583–639. [Google Scholar] [CrossRef][Green Version] - Brooks, S.P. Discussion on the paper by Spiegelhalter, Best, Carlin, and van der Linde (2002). J. R. Stat. Soc. Ser. B-Stat. Methodol.
**2002**, 64, 616–618. [Google Scholar] - Gelfand, A.E.; Dey, D.K.; Chang, H. Model determination using predictive distributions with implementation via sampling-based methods (with discussion). In Bayesian Statistics; Kotz, Johnson, N.L., Eds.; Oxford University Press: New York, NY, USA, 1992; Volume 4, pp. 147–167. [Google Scholar]
- Gelfand, A.E.; Dey, K.D. Bayesian Model Choice: Asymptotics and Exact Calculations. J. R. Stat. Soc. Ser. B
**1994**, 56, 501–514. [Google Scholar] [CrossRef] - Cook, R.D.; Weisberg, S. Residuals and Influence in Regression; Chapman and Hall: Boca Raton, FL, USA, 1982. [Google Scholar]
- Peng, F.; Dey, D. Bayesian analysis of outlier problems using divergence measures. Can. J. Stat.
**1995**, 23, 199–213. [Google Scholar] [CrossRef] - R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2019. [Google Scholar]
- Alzaatreh, A.; Aljarrah, M.; Almagambetova, A.; Zakiyeva, N. On the Regression Model for Generalized Normal Distributions. Entropy
**2021**, 23, 173. [Google Scholar] [CrossRef] - García, V.J.; Gómez-Déniz, E.; Vázquez-Polo, F.J. A new skew generalization of the normal distribution: Properties and applications. Comput. Stat. Data Anal.
**2010**, 54, 2021–2034. [Google Scholar] [CrossRef] - Nascimento, D.C.; Ramos, P.L.; Elal-Olivero, D.; Cortes-Araya, M.; Louzada, F. Generalizing the normality: A novel towards different estimation methods for skewed information. ArXiv
**2021**, arXiv:2105.00031. [Google Scholar] - Abramowitz, M.; Stegun, I.A. Handbook of Mathematical Functions; Dover: New York, NY, USA, 1972; p. 256. [Google Scholar]

**Figure 1.**Density functions of the GN distribution with the parameters (

**a**) $\mu =0$ and $\sigma =1$ fixed and varying s and (

**b**) $\mu =0$ and $s=1.5$ fixed and varying $\sigma $.

**Figure 2.**Real data. Generalized normal linear regression model with $s=2.865$. (

**a**) Scatterplot of the data and fit generalized normal linear regression model. (

**b**) Quantile residuals versus fit values. (

**c**) Quantile residuals versus the index. (

**d**) Graph of the quantiles of the GN distribution for the residuals of the model.

**Figure 3.**Real data. Normal linear regression model. (

**a**) Scatterplot of the data and fit normal linear regression model. (

**b**) Quantile residuals versus fit values. (

**c**) Quantile residuals versus the index. (

**d**) Graph of the quantiles of the normal distribution for the residuals of the model.

**Table 1.**Artificial data. Posterior mean, median, standard deviation (SD), and $95\%$ HDP intervals for the parameters of the model.

Parameters | Mean | Median | SD | 95% HDP |
---|---|---|---|---|

${\mathit{\beta}}_{1}$ | $1.995$ | $1.996$ | $0.086$ | $(1.826;2.164)$ |

${\mathit{\beta}}_{2}$ | $-1.510$ | $-1.510$ | $0.032$ | $(-1.572;-1.447)$ |

$\sigma $ | $1.027$ | $1.028$ | $0.055$ | $(0.922;1.135)$ |

s | $2.657$ | $2.631$ | $0.320$ | $(2.042;3.293)$ |

**Table 2.**Artificial data. Posterior mean, $RV(\%)$, and $95\%$ HDP intervals for the parameters of the model.

Data Names | Perturbed Case | Parameters | Mean | RV (%) | 95% HDP |
---|---|---|---|---|---|

a | None | ${\beta}_{1}$ | $1.995$ | − | $(1.826;2.164)$ |

${\beta}_{2}$ | $-1.510$ | − | $(-1.572;-1.447)$ | ||

$\sigma $ | $1.027$ | − | $(0.922;1.135)$ | ||

s | $2.657$ | − | $(2.041;3.293)$ | ||

b | 50 | ${\beta}_{1}$ | $2.026$ | $1.554$ | $(1.841;2.211)$ |

${\beta}_{2}$ | $-1.517$ | $0.464$ | $(-1.585;-1.449)$ | ||

$\sigma $ | $0.970$ | $5.550$ | $(0.866;1.073)$ | ||

s | $2.188$ | $17.651$ | $(1.763;2.613)$ | ||

c | 250 | ${\beta}_{1}$ | $1.966$ | $1.454$ | $(1.775;2.154)$ |

${\beta}_{2}$ | $-1.495$ | $0.993$ | $(-1.564;-1.423)$ | ||

$\sigma $ | $0.916$ | $10.808$ | $(0.816;1.016)$ | ||

s | $1.867$ | $29.733$ | $(1.565;2.165)$ | ||

d | $\{50,250\}$ | ${\beta}_{1}$ | $2.002$ | $1.332$ | $(1.807;2.198)$ |

${\beta}_{2}$ | $-1.506$ | $0.265$ | $(-1.578;-1.436)$ | ||

$\sigma $ | $0.902$ | $12.171$ | $(0.801;1.003)$ | ||

s | $1.773$ | $33.271$ | $(1.496;2.052)$ |

Data Names | Case Number | $\mathit{K}(\mathit{\pi},{\mathit{\pi}}_{(-\mathit{i})})$ | Calibration |
---|---|---|---|

a | 50 | 0.0121 | 0.5774 |

250 | 0.0014 | 0.5262 | |

b | 50 | 16.1593 | 1.0000 |

c | 250 | 19.2236 | 1.0000 |

d | 50 | 2.8796 | 0.9992 |

250 | 18.2292 | 1.0000 |

**Table 4.**Real data. Posterior mean and $95\%$ HPD intervals for the parameters of the model and Bayesian comparison criteria.

Model | Parameters | Mean | 95% HDP | DIC | EBIC | B |
---|---|---|---|---|---|---|

${\beta}_{1}$ | 7.066 | (6.398; 7.724) | ||||

Generalized | ${\beta}_{2}$ | 0.948 | (0.907; 0.990) | 5829.180 | 5847.96 | −2914.595 |

Normal | $\sigma $ | 3.237 | (3.034; 3.440) | |||

s | 2.865 | (2.436; 3.305) | ||||

${\beta}_{1}$ | 7.002 | (6.331; 7.671) | ||||

Normal | ${\beta}_{2}$ | 0.949 | (0.907; 0.991) | 6248.652 | 6262.98 | −3126.614 |

$\sigma $ | 1.993 | (1.916; 2.068) |

Case Number | $\mathit{K}(\mathit{\pi},{\mathit{\pi}}_{(-\mathit{i})})$ | Calibration |
---|---|---|

335 | 0.1683 | 0.7673 |

479 | 0.1734 | 0.7707 |

803 | 0.4936 | 0.8960 |

**Table 6.**Posterior estimates and $RV(\%)$ for Eucalyptus height and diameter data following the removal of the influential case.

Cases Deletions | Parameter | Mean | $\mathit{RV}(\%)$ | 95% HDP |
---|---|---|---|---|

803 | ${\beta}_{1}$ | 7.067 | 0.014 | (6.375; 7.753) |

${\beta}_{2}$ | 0.948 | - | (0.905; 0.991) | |

$\sigma $ | 3.252 | 0.463 | (3.054; 3.454) | |

s | 2.936 | 2.478 | (2.486; 3.390) |

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

## Share and Cite

**MDPI and ACS Style**

Tomazella, V.L.D.; Jesus, S.R.; Gazon, A.B.; Louzada, F.; Nadarajah, S.; Nascimento, D.C.; Rodrigues, F.A.; Ramos, P.L.
Bayesian Reference Analysis for the Generalized Normal Linear Regression Model. *Symmetry* **2021**, *13*, 856.
https://doi.org/10.3390/sym13050856

**AMA Style**

Tomazella VLD, Jesus SR, Gazon AB, Louzada F, Nadarajah S, Nascimento DC, Rodrigues FA, Ramos PL.
Bayesian Reference Analysis for the Generalized Normal Linear Regression Model. *Symmetry*. 2021; 13(5):856.
https://doi.org/10.3390/sym13050856

**Chicago/Turabian Style**

Tomazella, Vera Lucia Damasceno, Sandra Rêgo Jesus, Amanda Buosi Gazon, Francisco Louzada, Saralees Nadarajah, Diego Carvalho Nascimento, Francisco Aparecido Rodrigues, and Pedro Luiz Ramos.
2021. "Bayesian Reference Analysis for the Generalized Normal Linear Regression Model" *Symmetry* 13, no. 5: 856.
https://doi.org/10.3390/sym13050856