On Comparing and Classifying Several Independent Linear and Non-Linear Regression Models with Symmetric Errors

Pan, Ji-Jun; Mahmoudi, Mohammad Reza; Baleanu, Dumitru; Maleki, Mohsen

doi:10.3390/sym11060820

Open AccessArticle

On Comparing and Classifying Several Independent Linear and Non-Linear Regression Models with Symmetric Errors

by

Ji-Jun Pan

¹,

Mohammad Reza Mahmoudi

^2,*

,

Dumitru Baleanu

³

and

Mohsen Maleki

⁴

¹

College of Mathematics, Dianxi Science and Technology, Normal University, Lincang 677000, China

²

Department of Statistics, Faculty of Science, Fasa University, Fasa 74616 86131, Iran

³

Department of Mathematics, Faculty of Art and Sciences, Cankaya University Balgat, Ankara 06530, Turkey

⁴

Department of Statistics, Faculty of Science, Shiraz University, Shiraz 71946 85115, Iran

^*

Author to whom correspondence should be addressed.

Symmetry 2019, 11(6), 820; https://doi.org/10.3390/sym11060820

Submission received: 30 May 2019 / Revised: 15 June 2019 / Accepted: 19 June 2019 / Published: 20 June 2019

(This article belongs to the Special Issue Symmetry in Applied Continuous Mechanics)

Download

Browse Figures

Versions Notes

Abstract

:

In many real world problems, science fields such as biology, computer science, data mining, electrical and mechanical engineering, and signal processing, researchers aim to compare and classify several regression models. In this paper, a computational approach, based on the non-parametric methods, is used to investigate the similarities, and to classify several linear and non-linear regression models with symmetric errors. The ability of each given approach is then evaluated using simulated and real world practical datasets.

Keywords:

comparison; Friedman test; linear regression; nonlinear regression; sign test; symmetric errors; Wilcoxon test

1. Introduction

In many situations, we aim to study the effects of variables

X_{1}, \dots, X_{k}

on variable

Y

. Simple and multiple regressions are data analysis techniques to model these effects. The authors of the references [1,2] applied simple and multiple linear regression models in different science fields, such as agriculture, biology, material, mechanical engineering, and signal processing. In many real world problems, scientists want to compare the relationship between the dependent variable and independent variables in several separate datasets.

The comparison of the correlation between the variables X and Y in two separate datasets, different techniques was provided by [3,4,5]. The comparison of the correlation between the variables X and Y in a dataset, and the correlation between the two variables X and W in another dataset, resulted in different methods developed by [6,7,8,9,10]. The correlation between the variables X and Y in a dataset, and the correlation between two variables W and Z in another dataset, were compared by different methods in [9,11,12]. The comparison and classification of two, and more simple linear regression models, have been considered in [13,14,15,16]. The comparison of two regression models has been reported in [14,15,16,17,18,19,20,21,22].

In the present research, we aim to compare and classify several linear and non-linear regression models that fitted on several independent datasets. The non-parametric methods are used to construct an approach to investigate the similarity and to classify the linear and non-linear regression models. A given approach is then evaluated using simulation and real world studies. The introduced approach is powerful and applicable in its ability to compare any linear or non-linear regression models.

2. Models Comparing and Classification

Assume

(X_{1 j}, \dots, X_{k j}, Y_{j}), j = 1, \dots, n_{i},

is a sample dataset of size

n_{i}

, from

(X_{1}, \dots, X_{k}, Y) .

The equations of m linear or non-linear regression models can be written by:

Y_{i j} = f_{i} (X_{1 j}, \dots, X_{k j}) + ε_{i j}, j = 1, \dots, n_{i}, i = 1, \dots, m,

(1)

such that for

i = 1, \dots, m,

ε_{i j}, j = 1, \dots, n_{i},

are zero-mean symmetric random variables with unknown and equal variance

σ_{i}^{2}

.

By considering Equation (1), consequently, the conditional expectation of Y based on

f_{i} (X_{1}, \dots, X_{k}),

that we show it by

θ_{i} (X_{1}, \dots, X_{k}),

is given by:

θ_{i} (X_{1}, \dots, X_{k}) = E (Y | f_{i} (X_{1}, \dots, X_{k})) = f_{i} (X_{1}, \dots, X_{k}) .

(2)

In real-word problems the aim is to test the hypothesis

H_{0} : θ_{1} (X_{1}, \dots, X_{k}) = θ_{2} (X_{1}, \dots, X_{k}) = \dots = θ_{m} (X_{1}, \dots, X_{k}) .

Under the rejection of

H_{0}

, we conclude that at least two models of the m regression models are not statistically similar, and if

H_{0}

is accepted then it can be concluded that the m regression models are statistically equal.

The regression equations can be represented by:

Y_{i} = f_{i} (X_{1}, \dots, X_{k}) + ε_{i}, i = 1, \dots, m,

(3)

such that

Y_{i} = {(y_{1}, \dots, y_{n_{i}})}^{T}, i = 1, \dots, m,

are the values for the dependent variable Y,

X_{1} = {(x_{11}, \dots, x_{1 n_{i}})}^{T}, \dots, X_{k} = {(x_{k 1}, \dots, x_{k n_{i}})}^{T}, i = 1, \dots, m

are the values for the independent variables

(X_{1}, \dots, X_{k})

,

f_{i} (X_{1}, \dots, X_{k}) = {(f_{i} (x_{11}, \dots, x_{k 1}), \dots, f_{i} (x_{1 n_{i}}, \dots, x_{k n_{i}}))}^{T},

and

ε_{i} = {(ε_{i 1}, \dots, ε_{i n_{i}})}^{T}, i = 1, \dots, m,

are zero-mean random variables with unknown and equal variance

σ_{i}^{2}

.

First, all m regression models are estimated by

{\hat{Y}}_{i} = {\hat{f}}_{i} (X_{1}, \dots, X_{k}), i = 1, \dots, m,

(4)

for all

n = d i s t i n c t p o i n t s (n_{1} \cup^{} n_{2} \cup^{} \dots \cup^{} n_{m})

values of

(X_{1}, \dots, X_{k})

, where

{\hat{Y}}_{i} = {({\hat{y}}_{i 1}, \dots, {\hat{y}}_{i n})}^{T}, i = 1, \dots, m,

are the estimated values for dependent variable

Y

, based on ith regression model. Since

ε_{i}, i = 1, \dots, m,

are zero-mean symmetric random variables, consequently,

{\hat{y}}_{i 1}, \dots, {\hat{y}}_{i n}, i = 1, \dots, m,

are unbiased estimators for

θ_{i} (X_{1}, \dots, X_{k}), i = 1, \dots, m,

respectively. In other words,

{\hat{y}}_{i 1}, \dots, {\hat{y}}_{i n}, i = 1, \dots, m,

are random variables with mean

θ_{i} (X_{1}, \dots, X_{k}), i = 1, \dots, m .

Remark 1.

n = d i s t i n c t p o i n t s (n_{1} \cup^{} n_{2} \cup^{} \dots \cup^{} n_{m})

means that the repeated points are assumed once.

Now, to compare the fitted regression models, the Friedman test [23,24,25,26] will be applied on n couples

({\hat{y}}_{11}, \dots, {\hat{y}}_{m 1}), \dots, ({\hat{y}}_{1 n}, \dots, {\hat{y}}_{m 1}) .

The Friedman test that is a non-parametric alternative to the repeated measures is used to compare related datasets (datasets that are repeated on the same subjects). This test is commonly applied when dataset do not follow the parametric conditions, such as normality assumption.

Classification

In previous discussion, if

H_{0}

is false, then we conclude that the mechanism of one model or mechanisms of some models are significantly different from the other models. However, to determine which models are significantly different from each other, the sign test or Wilcoxon test are applied in order to compare each of the regression model pairs.

3. Simulation Study

This section assesses the ability of the introduced approach simulation datasets. First, the different datasets from different regression models are produced. Then, we compute the values of the Estimated Type I error probability (

\hat{α}

) and the Estimated Power

(\hat{π})

of the introduced approach. For comparison, the Wilcoxon and Friedman tests are applied. The simulations are accomplished after 1000 runs and using the R 3.5.3 software (R Development Core Team, 2018) on a PC (Processor: Intel(R) CoreTM(2) Duo CPU T7100 @ 1.80GHz 1.80GHz, RAM: 2.00GB, System Type: 32-bit).

Example 1.

Assume the simple linear regression model:

Y = β X + ε,

(5)

such that

ε

and

X

are independent.

Example 2.

Let

Y = β_{0} + β_{1} X + β_{2} X ε,

(6)

such that

ε

and

X

are independent.

Example 3.

Assume:

Y = 1 + β X + ε,

(7)

such that

ε

and

X

are independent.

Example 4.

Assume the multiple linear regression model:

Y = β_{0} + β_{1} X_{1} + 2 β_{2} X_{2} + ε,

(8)

such that

ε

,

X_{1}

and

X_{2}

are independent.

Example 5.

For the first dataset, assume the simple nonlinear regression model:

Y = e^{X} + ε,

(9)

such that

ε

and

X

are independent.

For the second and the third datasets let

Y = {e^{X} + ε, 1 + β X + ε},

and

Y = {e^{X} + ε, 1 + β X + ε, 2 X + ε},

respectively.

Figure 1 and Figure 2 shows the density plots of the some parts of the response variable Y. As it can be seen in these figures, the density plots are symmetric, but not necessarily normal (Figure 2).

The values of

\hat{α}

(first four rows) and

\hat{π}

(other rows) for Examples 1 to 5 are summarized in Table 1, Table 2, Table 3, Table 4 and Table 5, respectively. As Table 1, Table 2, Table 3, Table 4 and Table 5 indicate the values of

\hat{α}

are very close to size test (

α =

0.05), and consequently the introduced approach can be controlled the type I error. Also the values of

\hat{π}

show that the given technique can distinguished between the null and alternative hypotheses.

4. Real Data

In this section, a practical real data is considered to study the power of the introduced approach in real world problems. Drought is a damaging natural phenomenon. To prevent this phenomenon, the hydrologists model and predict the drought datasets in a standard time period. In this research, the average monthly rainy days (1966–2010) at three Iranian synoptic stations (Fasa, Sarvestan, and Shiraz) was considered and modeled.

To model and forecast the average monthly rainy days, different polynomial regression models of orders 1 to 3 (linear, quadratic and cubic) and exponential model were fitted to datasets. The formulas of the considered models are as following:

Linear model:

Y = β_{0} + β_{1} X + ε

Quadratic model:

Y = β_{0} + β_{1} X + β_{2} X^{2} + ε .

(10)

Cubic model : Y = β_{0} + β_{1} X + β_{2} X^{2} + β_{3} X^{3} + ε .

(11)

Exponential model : Y = β_{0} + β_{1} e^{β_{2} X} + ε .

(12)

The numerical computations are done using the R 3.5.3 software (Library ‘nlstools’, lm() function for linear regression and nls() function for nonlinear regression) and Minitab 18 software.

The results of fitted regression models are summarized in Table 6. It can be observed that, for all of the stations, respectively, the polynomial regression of order 3 (cubic), and the exponential models, had the most R-square (R²) and the least root mean square error (RMSE) between all fitted models.

Now, we use the proposed approach to compare and classify these stations, for each model. The result of Friedman test is shown in Table 7. This table indicated that the fitted cubic and exponential models are significantly different in these stations (p < 0.05). Also, there is no significant difference between the fitted linear and quadratic models in these stations (p > 0.05).

As Table 8 indicates, we can classify the stations in two clusters, for cubic and exponential models. First cluster: Fasa and Sarvestan, and second cluster: Shiraz.

5. Conclusions

In many real world problems, researchers wish to compare and classify the regression models in several datasets. In this paper, the non-parametric methods were used to construct an approach to investigate the similarity of some linear and non-linear regression models with symmetric errors. Particular approaches were evaluated using simulation and practical datasets. A simulation study indicated that the introduced approach controlled the Type I error. Also the proposed technique distinguished well between null and alternative hypotheses. The introduced approach also had many advantages. First, it was powerful. Second, it was not too computational. Third, it could be applied to compare any linear or non-linear regression models. Fourth, this method did not need the normality of errors and could be applied for all models with symmetric errors.

Author Contributions

Conceptualization, J.-J.P., M.R.M. and D.B.; Formal analysis, M.R.M., D.B. and M.M.; Investigation, J.-J.P., M.R.M. and D.B.; Methodology, J.-J.P., M.R.M., D.B. and M.M.; Project administration, J.-J.P.; Software, J.-J.P., M.R.M., D.B. and M.M.; Supervision, J.-J.P., M.R.M. and D.B.; Validation, J.-J.P., M.R.M. and D.B.; Visualization, J.-J.P., M.R.M. and D.B.; Writing—Original Draft, J.-J.P. and M.R.M.; Writing—Review and Editing, J.-J.P., M.R.M., D.B. and M.M.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare no conflict of interest.

References

Wan, J.; Zhang, D.; Xu, W.; Guo, Q. Parameter Estimation of Multi Frequency Hopping Signals Based on Space-Time-Frequency Distribution. Symmetry 2019, 11, 648. [Google Scholar] [CrossRef]
Sajid, M.; Shafique, T.; Riaz, I.; Imran, M.; Jabbar Aziz Baig, M.; Baig, S.; Manzoor, S. Facial asymmetry-based anthropometric differences between gender and ethnicity. Symmetry 2018, 10, 232. [Google Scholar] [CrossRef]
Mahmouudi, M.R.; Maleki, M.; Pak, A. Testing the Difference between Two Independent Time Series Models. Iran. J. Sci. Technol. Trans. A Sci. 2017, 41, 665–669. [Google Scholar] [CrossRef]
Fisher, R.A. On the Probable Error of a Coefficient of Correlation Deduced from a Small Sample. Metron 1921, 1, 3–32. [Google Scholar]
Mahmoudi, M.R.; Mahmoodi, M. Inferrence on the Ratio of Correlations of Two Independent Populations. J. Math. Ext. 2014, 7, 71–82. [Google Scholar]
Howell, D.C. Statistical Methods for Psychology, 6th ed.; Thomson Wadsworth: Stamford, CT, USA, 2007. [Google Scholar]
Hotelling, H. The Selection of Variates for Use in Prediction with Some Comments on the General Problem of Nuisance Parameters. Ann. Math. Stat. 1940, 11, 271–283. [Google Scholar] [CrossRef]
Williams, E.G. The Comparison of Regression Variables. J. R. Stat. Soc. Ser. B 1959, 21, 396–399. [Google Scholar] [CrossRef]
Steiger, J.H. Tests for Comparing Elements of a Correlation Matrix. Psychol. Bull. 1980, 87, 245–251. [Google Scholar] [CrossRef]
Meng, X.; Rosenthal, R.; Rubin, D.B. Comparing Correlated Correlation Coefficients. Psychol. Bull. 1992, 111, 172–175. [Google Scholar] [CrossRef]
Peter, C.C.; Van Voorhis, W.R. Statistical Procedures and Their Mathematical Bases; McGraw-Hill: New York, NY, USA, 1940. [Google Scholar]
Raghunathan, T.E.; Rosenthal, R.; Rubin, D.B. Comparing Correlated but Nonoverlapping Correlations. Psychol. Methods 1996, 1, 178–183. [Google Scholar] [CrossRef]
Liu, W.; Jamshidian, M.; Zhang, Y. Multiple Comparison of Several Linear Regression Lines. J. R. Stat. Soc. Ser. B 2004, 99, 395–403. [Google Scholar]
Liu, W.; Hayter, A.J.; Wynn, H.P. Operability Region Equivalence: Simultaneous Confidence Bands for the Equivalence of Two Regression Models Over Restricted Regions. Biom. J. 2007, 49, 144–150. [Google Scholar] [CrossRef] [PubMed]
Liu, W.; Jamshidian, M.; Zhang, Y.; Bertz, F.; Han, X. Pooling Batches in Drug Stability Study by Using Constant-width Simultaneous Confidence Bands. Stat. Med. 2007, 26, 2759–2771. [Google Scholar] [CrossRef] [PubMed]
Liu, W.; Jamshidian, M.; Zhang, Y.; Bertz, F.; Han, X. Some New Methods for the Comparison of Two Linear Regression Models. J. Stat. Plan. Inference 2007, 137, 57–67. [Google Scholar] [CrossRef]
Hayter, A.J.; Liu, W.; Wynn, H.P. Easy-to-Construct Confidence Bands for Comparing Two Simple Linear Regression Lines. J. Stat. Plan. Inference 2007, 137, 1213–1225. [Google Scholar] [CrossRef]
Jamshidian, M.; Liu, W.; Bretz, F. Simultaneous Confidence Bands for all Contrasts of Three or More Simple Linear Regression Models over an Interval. Comput. Stat Data Anal. 2010, 54, 1475–1483. [Google Scholar] [CrossRef]
Marques, F.J.; Coelho, C.A.; Rodrigues, P.C. Testing the equality of several linear regression models. Comput. Stat. 2016, 32, 1453–1480. [Google Scholar] [CrossRef]
Mahmoudi, M.R.; Mahmoudi, M.; Nahavandi, E. Testing the Difference between Two Independent Regression Models. Commun. Stat. Theory Methods 2016, 45, 6284–6289. [Google Scholar] [CrossRef]
Mahmoudi, M.R. On Comparing Two Dependent Linear and Nonlinear Regression Models. J. Test. Eval. 2018, 47, 449–458. [Google Scholar] [CrossRef]
Mahmoudi, M.R.; Maleki, M.; Pak, A. Testing the Equality of Two Independent Regression Models. Commun. Stat. Theory Methods 2018, 47, 2919–2926. [Google Scholar] [CrossRef]
Conover, W.J. Practical Nonparametric Statistics, 3rd ed.; John Wiley: Hoboken, NJ, USA, 1980. [Google Scholar]
Friedman, M. The Use of Ranks to Avoid the Assumption of Normality Implicit in the Analysis of Variance. J. R. Stat. Soc. Ser. B 1937, 32, 675–701. [Google Scholar] [CrossRef]
Friedman, M. A Correction: The Use of Ranks to Avoid the Assumption of Normality Implicit in the Analysis of Variance. J. R. Stat. Soc. Ser. B 1939, 34, 109. [Google Scholar] [CrossRef]
Friedman, M. A Comparison of Alternative Tests of Significance for the Problem of m Rankings. Ann. Math. Stat. 1940, 11, 86–92. [Google Scholar] [CrossRef]

Figure 1. The density plots of the some parts of the response variable Y (Black or Pie: Normal (0,1); Red or Triangle: Normal (0,2); Green or Star: Normal (0,3); Blue or Plus: Normal (0,5)).

Figure 2. The density plots of the some parts of the response variable Y (Black or Pie: 0.5 – Beta (1.5,1.5); Red or Triangle: 0.5 – Beta (1.75,1.75); Green or Star: 0.5 – Beta (2,2); Blue or Plus: 0.5 – Beta (2.5,2.5)).

Table 1. The values of

\hat{α}

and

\hat{π}

for Example 1.

Table 1. The values of

\hat{α}

and

\hat{π}

for Example 1.

				(n₁, n₂, n₃)
ε	X	β		(10, 10, 10)	(20, 40, 60)	(50, 75, 100)	(75, 100, 150)
ε	X	Second	Third	(10, 10, 10)	(20, 40, 60)	(50, 75, 100)	(75, 100, 150)
$U n i f o r m (- 2, 2)$	$N o r m a l (0, 0.25)$	1	1	0.053	0.051	0.051	0.049
$U n i f o r m (- 2, 2)$	$E x p o n e n t i a l (5)$	1	1	0.052	0.052	0.051	0.048
$N o r m a l (0, 0.5)$	$N o r m a l (0, 0.25)$	1	1	0.053	0.052	0.050	0.049
$N o r m a l (0, 0.5)$	$E x p o n e n t i a l (5)$	1	1	0.053	0.052	0.050	0.049
$U n i f o r m (- 2, 2)$	$N o r m a l (0, 0.25)$	1	2	0.738	0.882	0.934	0.958
$U n i f o r m (- 2, 2)$	$E x p o n e n t i a l (5)$	1	2	0.753	0.801	0.950	0.981
$N o r m a l (0, 0.5)$	$N o r m a l (0, 0.25)$	1	2	0.754	0.854	0.945	0.972
$N o r m a l (0, 0.5)$	$E x p o n e n t i a l (5)$	1	2	0.749	0.889	0.913	0.970
$U n i f o r m (- 2, 2)$	$N o r m a l (0, 0.25)$	1	3	0.703	0.825	0.941	0.993
$U n i f o r m (- 2, 2)$	$E x p o n e n t i a l (5)$	1	3	0.710	0.859	0.917	0.975
$N o r m a l (0, 0.5)$	$N o r m a l (0, 0.25)$	1	3	0.728	0.824	0.910	0.984
$N o r m a l (0, 0.5)$	$E x p o n e n t i a l (5)$	1	3	0.707	0.864	0.934	0.953
$U n i f o r m (- 2, 2)$	$N o r m a l (0, 0.25)$	2	1	0.768	0.828	0.913	0.978
$U n i f o r m (- 2, 2)$	$E x p o n e n t i a l (5)$	2	1	0.703	0.824	0.928	0.951
$N o r m a l (0, 0.5)$	$N o r m a l (0, 0.25)$	2	1	0.794	0.846	0.930	0.968
$N o r m a l (0, 0.5)$	$E x p o n e n t i a l (5)$	2	1	0.794	0.800	0.903	0.955
$U n i f o r m (- 2, 2)$	$N o r m a l (0, 0.25)$	2	2	0.745	0.813	0.946	0.971
$U n i f o r m (- 2, 2)$	$E x p o n e n t i a l (5)$	2	2	0.718	0.858	0.937	0.981
$N o r m a l (0, 0.5)$	$N o r m a l (0, 0.25)$	2	2	0.784	0.866	0.901	0.953
$N o r m a l (0, 0.5)$	$E x p o n e n t i a l (5)$	2	2	0.726	0.821	0.944	0.999
$U n i f o r m (- 2, 2)$	$N o r m a l (0, 0.25)$	2	3	0.795	0.849	0.924	0.982
$U n i f o r m (- 2, 2)$	$E x p o n e n t i a l (5)$	2	3	0.755	0.856	0.928	0.961
$N o r m a l (0, 0.5)$	$N o r m a l (0, 0.25)$	2	3	0.763	0.845	0.936	0.988
$N o r m a l (0, 0.5)$	$E x p o n e n t i a l (5)$	2	3	0.710	0.865	0.914	0.975

Table 2. The values of

\hat{α}

and

\hat{π}

for Example 2.

Table 2. The values of

\hat{α}

and

\hat{π}

for Example 2.

				(n₁, n₂, n₃)
ε	X	(β₀, β₁, β₂)		(10, 10, 10)	(20, 40, 60)	(50, 75, 100)	(75, 100, 150)
ε	X	Second	Third	(10, 10, 10)	(20, 40, 60)	(50, 75, 100)	(75, 100, 150)
$U n i f o r m (- 1, 1)$	$N o r m a l (0, 1)$	$(2, 1, 2)$	$(2, 1, 2)$	0.052	0.052	0.051	0.049
$U n i f o r m (- 1, 1)$	$E x p o n e n t i a l (1)$	$(2, 1, 2)$	$(2, 1, 2)$	0.053	0.051	0.050	0.049
$N o r m a l (0, 2)$	$N o r m a l (0, 1)$	$(2, 1, 2)$	$(2, 1, 2)$	0.053	0.052	0.051	0.049
$N o r m a l (0, 2)$	$E x p o n e n t i a l (1)$	$(2, 1, 2)$	$(2, 1, 2)$	0.052	0.052	0.051	0.049
$U n i f o r m (- 1, 1)$	$N o r m a l (0, 1)$	$(2, 1, 2)$	$(0, 2, 1)$	0.770	0.843	0.903	0.981
$U n i f o r m (- 1, 1)$	$E x p o n e n t i a l (1)$	$(2, 1, 2)$	$(0, 2, 1)$	0.743	0.817	0.909	0.979
$N o r m a l (0, 2)$	$N o r m a l (0, 1)$	$(2, 1, 2)$	$(0, 2, 1)$	0.771	0.842	0.918	0.992
$N o r m a l (0, 2)$	$E x p o n e n t i a l (1)$	$(2, 1, 2)$	$(0, 2, 1)$	0.791	0.855	0.934	0.967
$U n i f o r m (- 1, 1)$	$N o r m a l (0, 1)$	$(2, 1, 2)$	$(3, 2, 1)$	0.737	0.891	0.941	0.997
$U n i f o r m (- 1, 1)$	$E x p o n e n t i a l (1)$	$(2, 1, 2)$	$(3, 2, 1)$	0.798	0.860	0.932	0.988
$N o r m a l (0, 2)$	$N o r m a l (0, 1)$	$(2, 1, 2)$	$(3, 2, 1)$	0.740	0.849	0.947	0.993
$N o r m a l (0, 2)$	$E x p o n e n t i a l (1)$	$(2, 1, 2)$	$(3, 2, 1)$	0.712	0.827	0.916	0.997
$U n i f o r m (- 1, 1)$	$N o r m a l (0, 1)$	$(0, 2, 1)$	$(2, 1, 2)$	0.782	0.837	0.932	0.966
$U n i f o r m (- 1, 1)$	$E x p o n e n t i a l (1)$	$(0, 2, 1)$	$(2, 1, 2)$	0.780	0.830	0.936	0.960
$N o r m a l (0, 2)$	$N o r m a l (0, 1)$	$(0, 2, 1)$	$(2, 1, 2)$	0.720	0.857	0.945	0.998
$N o r m a l (0, 2)$	$E x p o n e n t i a l (1)$	$(0, 2, 1)$	$(2, 1, 2)$	0.767	0.897	0.902	0.958
$U n i f o r m (- 1, 1)$	$N o r m a l (0, 1)$	$(0, 2, 1)$	$(0, 2, 1)$	0.790	0.809	0.921	0.992
$U n i f o r m (- 1, 1)$	$E x p o n e n t i a l (1)$	$(0, 2, 1)$	$(0, 2, 1)$	0.741	0.814	0.935	0.992
$N o r m a l (0, 2)$	$N o r m a l (0, 1)$	$(0, 2, 1)$	$(0, 2, 1)$	0.710	0.844	0.945	0.981
$N o r m a l (0, 2)$	$E x p o n e n t i a l (1)$	$(0, 2, 1)$	$(0, 2, 1)$	0.760	0.871	0.906	0.972
$U n i f o r m (- 1, 1)$	$N o r m a l (0, 1)$	$(0, 2, 1)$	$(3, 2, 1)$	0.776	0.807	0.919	0.969
$U n i f o r m (- 1, 1)$	$E x p o n e n t i a l (1)$	$(0, 2, 1)$	$(3, 2, 1)$	0.701	0.875	0.928	0.963
$N o r m a l (0, 2)$	$N o r m a l (0, 1)$	$(0, 2, 1)$	$(3, 2, 1)$	0.780	0.803	0.936	0.987
$N o r m a l (0, 2)$	$E x p o n e n t i a l (1)$	$(0, 2, 1)$	$(3, 2, 1)$	0.720	0.886	0.923	0.960

Table 3. The values of

\hat{α}

and

\hat{π}

for Example 3.

Table 3. The values of

\hat{α}

and

\hat{π}

for Example 3.

				(n₁, n₂, n₃)
ε	X	β		(10, 10, 10)	(20, 40, 60)	(50, 75, 100)	(75, 100, 150)
ε	X	Second	Third	(10, 10, 10)	(20, 40, 60)	(50, 75, 100)	(75, 100, 150)
$U n i f o r m (- 1, 1)$	$Geometric (0.4)$	1	1	0.053	0.051	0.050	0.049
$U n i f o r m (- 1, 1)$	$B i n o m i a l (2, 0.7)$	1	1	0.053	0.051	0.051	0.050
$N o r m a l (0, 0.5)$	$Geometric (0.4)$	1	1	0.053	0.051	0.051	0.050
$N o r m a l (0, 0.5)$	$B i n o m i a l (2, 0.7)$	1	1	0.053	0.051	0.051	0.050
$U n i f o r m (- 1, 1)$	$Geometric (0.4)$	1	2	0.724	0.846	0.924	0.996
$U n i f o r m (- 1, 1)$	$B i n o m i a l (2, 0.7)$	1	2	0.734	0.813	0.942	0.952
$N o r m a l (0, 0.5)$	$Geometric (0.4)$	1	2	0.737	0.818	0.914	0.959
$N o r m a l (0, 0.5)$	$B i n o m i a l (2, 0.7)$	1	2	0.764	0.819	0.949	0.998
$U n i f o r m (- 1, 1)$	$Geometric (0.4)$	1	5	0.797	0.808	0.904	0.959
$U n i f o r m (- 1, 1)$	$B i n o m i a l (2, 0.7)$	1	5	0.760	0.869	0.919	0.978
$N o r m a l (0, 0.5)$	$Geometric (0.4)$	1	5	0.793	0.843	0.917	0.988
$N o r m a l (0, 0.5)$	$B i n o m i a l (2, 0.7)$	1	5	0.765	0.876	0.910	0.983
$U n i f o r m (- 1, 1)$	$Geometric (0.4)$	2	1	0.742	0.868	0.934	0.954
$U n i f o r m (- 1, 1)$	$B i n o m i a l (2, 0.7)$	2	1	0.730	0.810	0.925	0.966
$N o r m a l (0, 0.5)$	$Geometric (0.4)$	2	1	0.725	0.867	0.911	0.981
$N o r m a l (0, 0.5)$	$B i n o m i a l (2, 0.7)$	2	1	0.769	0.868	0.930	0.996
$U n i f o r m (- 1, 1)$	$Geometric (0.4)$	2	2	0.763	0.816	0.905	0.982
$U n i f o r m (- 1, 1)$	$B i n o m i a l (2, 0.7)$	2	2	0.706	0.895	0.935	0.951
$N o r m a l (0, 0.5)$	$Geometric (0.4)$	2	2	0.723	0.866	0.909	0.981
$N o r m a l (0, 0.5)$	$B i n o m i a l (2, 0.7)$	2	2	0.765	0.857	0.903	0.974
$U n i f o r m (- 1, 1)$	$Geometric (0.4)$	2	5	0.710	0.867	0.910	0.950
$U n i f o r m (- 1, 1)$	$B i n o m i a l (2, 0.7)$	2	5	0.764	0.837	0.904	0.981
$N o r m a l (0, 0.5)$	$Geometric (0.4)$	2	5	0.778	0.891	0.933	0.987
$N o r m a l (0, 0.5)$	$B i n o m i a l (2, 0.7)$	2	5	0.726	0.819	0.946	0.967

Table 4. The values of

\hat{α}

and

\hat{π}

for Example 4.

Table 4. The values of

\hat{α}

and

\hat{π}

for Example 4.

				(n₁, n₂, n₃)
X₁	X₂	(β₀, β₁, β₂)		(10, 10, 10)	(20, 40, 60)	(50, 75, 100)	(75, 100, 150)
X₁	X₂	Second	Third	(10, 10, 10)	(20, 40, 60)	(50, 75, 100)	(75, 100, 150)
$U n i f o r m (0, 2)$	$E x p o n e n t i a l (5)$	$(2, 1, 2)$	$(2, 1, 2)$	0.052	0.052	0.050	0.049
$U n i f o r m (0, 2)$	$Geometric (0.3)$	$(2, 1, 2)$	$(2, 1, 2)$	0.053	0.052	0.050	0.049
$B i n o m i a l (3, 0.5)$	$E x p o n e n t i a l (5)$	$(2, 1, 2)$	$(2, 1, 2)$	0.052	0.052	0.051	0.049
$B i n o m i a l (3, 0.5)$	$Geometric (0.3)$	$(2, 1, 2)$	$(2, 1, 2)$	0.052	0.051	0.050	0.048
$U n i f o r m (0, 2)$	$E x p o n e n t i a l (5)$	$(2, 1, 2)$	$(0, 2, 1)$	0.734	0.893	0.923	0.961
$U n i f o r m (0, 2)$	$Geometric (0.3)$	$(2, 1, 2)$	$(0, 2, 1)$	0.787	0.887	0.947	0.964
$B i n o m i a l (3, 0.5)$	$E x p o n e n t i a l (5)$	$(2, 1, 2)$	$(0, 2, 1)$	0.766	0.813	0.943	0.973
$B i n o m i a l (3, 0.5)$	$Geometric (0.3)$	$(2, 1, 2)$	$(0, 2, 1)$	0.762	0.897	0.909	0.993
$U n i f o r m (0, 2)$	$E x p o n e n t i a l (5)$	$(2, 1, 2)$	$(3, 2, 1)$	0.706	0.866	0.936	0.966
$U n i f o r m (0, 2)$	$Geometric (0.3)$	$(2, 1, 2)$	$(3, 2, 1)$	0.746	0.882	0.946	0.960
$B i n o m i a l (3, 0.5)$	$E x p o n e n t i a l (5)$	$(2, 1, 2)$	$(3, 2, 1)$	0.716	0.875	0.948	0.975
$B i n o m i a l (3, 0.5)$	$Geometric (0.3)$	$(2, 1, 2)$	$(3, 2, 1)$	0.757	0.811	0.939	0.950
$U n i f o r m (0, 2)$	$E x p o n e n t i a l (5)$	$(0, 2, 1)$	$(2, 1, 2)$	0.792	0.866	0.936	0.985
$U n i f o r m (0, 2)$	$Geometric (0.3)$	$(0, 2, 1)$	$(2, 1, 2)$	0.768	0.824	0.902	0.995
$B i n o m i a l (3, 0.5)$	$E x p o n e n t i a l (5)$	$(0, 2, 1)$	$(2, 1, 2)$	0.773	0.841	0.933	0.983
$B i n o m i a l (3, 0.5)$	$Geometric (0.3)$	$(0, 2, 1)$	$(2, 1, 2)$	0.795	0.801	0.940	0.992
$U n i f o r m (0, 2)$	$E x p o n e n t i a l (5)$	$(0, 2, 1)$	$(0, 2, 1)$	0.790	0.891	0.912	0.953
$U n i f o r m (0, 2)$	$Geometric (0.3)$	$(0, 2, 1)$	$(0, 2, 1)$	0.784	0.855	0.924	0.951
$B i n o m i a l (3, 0.5)$	$E x p o n e n t i a l (5)$	$(0, 2, 1)$	$(0, 2, 1)$	0.739	0.842	0.908	0.961
$B i n o m i a l (3, 0.5)$	$Geometric (0.3)$	$(0, 2, 1)$	$(0, 2, 1)$	0.749	0.880	0.905	0.963
$U n i f o r m (0, 2)$	$E x p o n e n t i a l (5)$	$(0, 2, 1)$	$(3, 2, 1)$	0.745	0.854	0.918	0.956
$U n i f o r m (0, 2)$	$Geometric (0.3)$	$(0, 2, 1)$	$(3, 2, 1)$	0.739	0.825	0.946	0.955
$B i n o m i a l (3, 0.5)$	$E x p o n e n t i a l (5)$	$(0, 2, 1)$	$(3, 2, 1)$	0.743	0.883	0.926	0.960
$B i n o m i a l (3, 0.5)$	$Geometric (0.3)$	$(0, 2, 1)$	$(3, 2, 1)$	0.734	0.840	0.918	0.976

Table 5. The values of

\hat{α}

and

\hat{π}

for Example 5.

Table 5. The values of

\hat{α}

and

\hat{π}

for Example 5.

				(n₁, n₂, n₃)
ε	X	Y		(10, 10, 10)	(20, 40, 60)	(50, 75, 100)	(75, 100, 150)
ε	X	Second	Third	(10, 10, 10)	(20, 40, 60)	(50, 75, 100)	(75, 100, 150)
$U n i f o r m (- 2, 2)$	$N o r m a l (0, 0.5)$	$e^{X} + ε$	$e^{X} + ε$	0.052	0.051	0.051	0.048
$U n i f o r m (- 2, 2)$	$P o i s s o n (5)$	$e^{X} + ε$	$e^{X} + ε$	0.053	0.051	0.051	0.049
$N o r m a l (0, 0.25)$	$N o r m a l (0, 0.5)$	$e^{X} + ε$	$e^{X} + ε$	0.052	0.051	0.051	0.049
$N o r m a l (0, 0.25)$	$P o i s s o n (5)$	$e^{X} + ε$	$e^{X} + ε$	0.052	0.051	0.050	0.048
$U n i f o r m (- 2, 2)$	$N o r m a l (0, 0.5)$	$e^{X} + ε$	$1 + β X + ε$	0.787	0.895	0.901	0.965
$U n i f o r m (- 2, 2)$	$P o i s s o n (5)$	$e^{X} + ε$	$1 + β X + ε$	0.787	0.829	0.930	0.974
$N o r m a l (0, 0.25)$	$N o r m a l (0, 0.5)$	$e^{X} + ε$	$1 + β X + ε$	0.725	0.848	0.912	0.991
$N o r m a l (0, 0.25)$	$P o i s s o n (5)$	$e^{X} + ε$	$1 + β X + ε$	0.759	0.898	0.944	0.984
$U n i f o r m (- 2, 2)$	$N o r m a l (0, 0.5)$	$e^{X} + ε$	$2 X + ε$	0.734	0.891	0.949	0.962
$U n i f o r m (- 2, 2)$	$P o i s s o n (5)$	$e^{X} + ε$	$2 X + ε$	0.788	0.811	0.921	0.981
$N o r m a l (0, 0.25)$	$N o r m a l (0, 0.5)$	$e^{X} + ε$	$2 X + ε$	0.759	0.877	0.941	0.965
$N o r m a l (0, 0.25)$	$P o i s s o n (5)$	$e^{X} + ε$	$2 X + ε$	0.704	0.868	0.948	0.989
$U n i f o r m (- 2, 2)$	$N o r m a l (0, 0.5)$	$e^{X} + ε$	$e^{X} + ε$	0.798	0.845	0.908	0.956
$U n i f o r m (- 2, 2)$	$P o i s s o n (5)$	$e^{X} + ε$	$e^{X} + ε$	0.753	0.809	0.927	0.989
$N o r m a l (0, 0.25)$	$N o r m a l (0, 0.5)$	$e^{X} + ε$	$e^{X} + ε$	0.731	0.865	0.910	0.990
$N o r m a l (0, 0.25)$	$P o i s s o n (5)$	$e^{X} + ε$	$e^{X} + ε$	0.731	0.820	0.906	0.962
$U n i f o r m (- 2, 2)$	$N o r m a l (0, 0.5)$	$1 + β X + ε$	$1 + β X + ε$	0.723	0.897	0.934	0.960
$U n i f o r m (- 2, 2)$	$P o i s s o n (5)$	$1 + β X + ε$	$1 + β X + ε$	0.799	0.807	0.949	0.982
$N o r m a l (0, 0.25)$	$N o r m a l (0, 0.5)$	$1 + β X + ε$	$1 + β X + ε$	0.713	0.877	0.916	0.952
$N o r m a l (0, 0.25)$	$P o i s s o n (5)$	$1 + β X + ε$	$1 + β X + ε$	0.743	0.872	0.925	0.965
$U n i f o r m (- 2, 2)$	$N o r m a l (0, 0.5)$	$1 + β X + ε$	$2 X + ε$	0.725	0.892	0.901	0.996
$U n i f o r m (- 2, 2)$	$P o i s s o n (5)$	$1 + β X + ε$	$2 X + ε$	0.795	0.886	0.944	0.959
$N o r m a l (0, 0.25)$	$N o r m a l (0, 0.5)$	$1 + β X + ε$	$2 X + ε$	0.707	0.821	0.925	0.972
$N o r m a l (0, 0.25)$	$P o i s s o n (5)$	$1 + β X + ε$	$2 X + ε$	0.798	0.825	0.924	0.974

Table 6. Indices to evaluate the fitted regression models.

Model	Station	R Square	RMSE
Linear	Fasa	0.624	1.693
	Sarvestan	0.638	1.516
	Shiraz	0.689	1.501
Quadratic	Fasa	0.734	1.350
	Sarvestan	0.743	1.285
	Shiraz	0.767	1.265
Cubic	Fasa	0.895	0.910
	Sarvestan	0.899	0.855
	Shiraz	0.976	0.529
Exponential	Fasa	0.767	0.978
	Sarvestan	0.778	0.926
	Shiraz	0.876	0.713

Table 7. Friedman test to compare the stations.

Model	p
Linear	0.123
Quadratic	0.224
Cubic	<0.001
Exponential	<0.001

Table 8. Wilcoxon test to compare and classify the stations.

Model	Stations		p
Cubic	Pair 1	Shiraz - Fasa	0.011
	Pair 2	Shiraz - Sarvestan	0.003
	Pair 3	Fasa - Sarvestan	0.144
Exponential	Pair 1	Shiraz - Fasa	0.019
	Pair 2	Shiraz - Sarvestan	<0.001
	Pair 3	Fasa - Sarvestan	0.112

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Pan, J.-J.; Mahmoudi, M.R.; Baleanu, D.; Maleki, M. On Comparing and Classifying Several Independent Linear and Non-Linear Regression Models with Symmetric Errors. Symmetry 2019, 11, 820. https://doi.org/10.3390/sym11060820

AMA Style

Pan J-J, Mahmoudi MR, Baleanu D, Maleki M. On Comparing and Classifying Several Independent Linear and Non-Linear Regression Models with Symmetric Errors. Symmetry. 2019; 11(6):820. https://doi.org/10.3390/sym11060820

Chicago/Turabian Style

Pan, Ji-Jun, Mohammad Reza Mahmoudi, Dumitru Baleanu, and Mohsen Maleki. 2019. "On Comparing and Classifying Several Independent Linear and Non-Linear Regression Models with Symmetric Errors" Symmetry 11, no. 6: 820. https://doi.org/10.3390/sym11060820

APA Style

Pan, J.-J., Mahmoudi, M. R., Baleanu, D., & Maleki, M. (2019). On Comparing and Classifying Several Independent Linear and Non-Linear Regression Models with Symmetric Errors. Symmetry, 11(6), 820. https://doi.org/10.3390/sym11060820

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

On Comparing and Classifying Several Independent Linear and Non-Linear Regression Models with Symmetric Errors

Abstract

1. Introduction

2. Models Comparing and Classification

Classification

3. Simulation Study

4. Real Data

5. Conclusions

Author Contributions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI