Next Article in Journal
The Quasicrystal Model as a Framework for Order to Disorder Transitions in 2D Systems
Previous Article in Journal
A Sub-Zonal PMV-Based HVAC and Façade Control System for Curtain Wall Buildings
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Proceeding Paper

Fuzzy Linear Regression of Rainfall-Altitude Relationship †

by
Christos Tzimopoulos
,
Christos Evangelides
,
Christos Vrekos
and
Nikiforos Samarinas
*
Department of Rural and Surveying Engineering, Aristotle University of Thessaloniki, 54124 Thessaloniki, Greece
*
Author to whom correspondence should be addressed.
Presented at the 3rd EWaS International Conference on “Insights on the Water-Energy-Food Nexus”, Lefkada Island, Greece, 27–30 June 2018.
Proceedings 2018, 2(11), 636; https://doi.org/10.3390/proceedings2110636
Published: 27 August 2018
(This article belongs to the Proceedings of EWaS3 2018)

Abstract

:
Classical linear regression has been used to measure the relationship between rainfall data and altitude in different meteorological stations, in order to evaluate a linear relation. The values of rainfall are supposed as dependent variables and the values of elevation of each station as independent variables. It has long been known that a classical statistical relationship exists between annual rainfall and the station elevation which in many cases is linear as the one examined in this article. However classical linear regression makes rigid assumptions about the statistical properties of the model, accepting the error terms as random variables, and the violation of this assumption could affect the validity of the classical linear regression. Fuzzy regression assumes ambiguous and imprecise parameters and data. For this reason it may be more effective than classical regression. In this paper we evaluate the relationship between annual rainfall data and the elevation of each station in Thessaly’s meteorological stations, using fuzzy linear regression with trapezoidal membership functions. In this possibilistic model the dependent measured elevations are crisp, and the independent observed rainfall values as well as the parameters of the model are fuzzy.

1. Introduction

The scope of constructing in engineering a model is always to attempt to maximize its usefulness. This aim is closely connected with the relationship among three key characteristics of every systems model: complexity, credibility, and uncertainty. Our purpose in systems modelling is to estimate an optimal level of uncertainty for each modelling problem. In 1965 Zadeh in [1], introduced the theory of fuzzy sets, in which the fuzziness of a system incorporated all the types of uncertainty, in contrast with probability theory, which was capable of representing only one of several distinct types of uncertainty.
In Hydrology, rainfall measurement models have been extensively used in the design process of water resource projects such as hydrological prediction, spillway design, climatic change studies, rainfall and runoff correlation etc. Rainfall measurements in a specific area are commonly displayed in the form of time series, where recorded values can be either continuous or discrete. In many instances, there is a correlation between rainfall data that belong to different stations and comprise measurements with differing range and stations elevations. Correlation analysis is used to depict the relation between the independent variable (usually the meteorological station elevations) and the dependent variables (meteorological stations rainfall data).
In classical linear regression, the difference between measurement values and estimated values, is a random variable with normal distribution and is considered to be caused by measurement errors. Upper and lower bounds of the estimated value are calculated and the probability that the estimated value will lie between them represents the estimation confidence. According to this, classical regression is considered to be probabilistic and has many uses, but can be rendered problematic if the data set is small, if it’s hard to prove that error distribution is normal, if there is fuzziness between dependent and independent variables or if linearity acceptance is not proper [2].
Nowadays, new regression models have been introduced, based on fuzzy logic [3,4,5,6,7,8,9,10,11,12,13]. In fuzzy regression the difference between measurement values and estimated values is attributed to the inherent fuzziness of the system as well as to the fuzziness of input and output data. In contrast with classical regression analysis, fuzzy regression analysis uses fuzzy functions for the regression factors and usually meets one of the three cases: (a) Crisp input values x i j and crisp output values y j (CICO); (b) Crisp input values x i j and fuzzy output values y ˜ i (CIFO); and (c) Fuzzy input values x ˜ i j and fuzzy output values y ˜ i (FIFO). In all of these three cases, estimated values Y ˜ i are fuzzy.
Most of the cited above writers have used tridiagonal fuzzy functions for the formulation of the problem. However the need to use trapezoidal functions, results of the following reasons [14]: (a) we need to optimize the fuzziness of the model and (b) we need to restrict experimental data inside the estimated value range. Using trapezoidal membership functions for estimated values [14,15,16,17,18] allows us to achieve inclusion for output data with triangular membership functions y ˜ j and estimated values with trapezoidal membership functions Y ˜ j , for confidence level h = 1, for which the kernel is not minimized in a point: [ y ˜ j ] h = 1 [ Y ˜ j ] h = 1 . In addition, for a level of confidence h = 0, we can achieve inclusion:   [ y ˜ j ] h = 0 [ Y ˜ j ] h = 0 . Due to the linearity of the membership function, inclusion for those levels of confidence, allows us to ensure that inclusion is possible for every level of confidence: [ y ˜ j ] h [ Y ˜ j ] h ,   h [ 0 , 1 ] . Consequently, trapezoidal fuzzy parameters have higher capability to model varieties of fuzziness than triangular fuzzy parameters.
Trapezoidal membership function models have been used by [14,15,19,20,21,22,23,24]. Charfeddine in [19] extended Tanaka’s method for the case of trapezoidal membership functions with crisp measured input and output values. She used a fuzzy level function f ˜ , with four crisp level functions f a , f b , f c , f d For, f a , f b , she used Tanaka’s method [8], whereas for f b , f c , she used classical linear regression, which led to function f ˜ and standard deviation σ of the model. Based on this f b , f c , become: f b = f ^ λ σ ,   f c = f ^ + λ σ with λ being an adjustment factor. Ganesan and Veeramani in [22], described a fuzzy linear programming with symmetric trapezoidal fuzzy parameters. They proved fuzzy analogues of some important theorems of linear programming and they gave a numerical example, leading to a solution of fuzzy linear programming problems. Fung et al. in [25], proposed a new model for asymmetric trapezoidal fuzzy parameters. Kumar and Kaur in [24] presented a new method, called Mehar’s method, suitable for solving fuzzy linear programming problems with trapezoidal fuzzy parameters. They proved that this method is easy to apply to fuzzy linear programming problems, with respect to existing methods. Kheirfam and Verdegay in [23] extended the dual simplex method to fuzzy linear programming, with symmetric trapezoidal fuzzy parameters. They studied the variation of values to a certain limit, so that the fuzzy optimal solution remains invariant.
In this article, a possibilistic model (Tzimopoulos et al. model) is described, where membership functions are trapezoidal, measured input values are crisp and measured output values are fuzzy triangular [13]. In this model, a rather simple two-phase method was used, with one crisp measured input value and one fuzzy measured output value. During step 1, measured input and output values were considered crisp, while parameters were considered fuzzy and supports were estimated using Tanaka’s method. In step 2, estimated supports were considered to be the known kernel of the trapezoidal membership functions and in the inclusion they were transferred to the known terms. Supports for the trapezoidal membership functions of the estimation were calculated. Triangular membership functions were used for measured output values.
Further, we present one application of the above model, concerning a hydrological problem in the region of Central Thessaly (Greece), where twenty rainfall measurement stations of the region with rainfall data and their elevation have been used.

2. Mathematical Model

2.1. Model of Tzimopoulos et al. (2006)

2.1.1. Generalities

Measured input values x i j are considered crisp and measured output values are considered symmetrical triangular fuzzy numbers y ˜ j = ( y j , y j k , y j + ) (Figure 1), while estimated values are considered trapezoidal fuzzy numbers Y ˜ j = ( S j , K j , K j + , S j + ) (Figure 2). The above problem can be divided into the following two steps:

2.1.2. Step 1

According to the kernel inclusion constraint mentioned above, we have:
y j k [ Y j , Y j + ] h = 1
Namely, the kernel of experimental output values is within the kernel of estimated values. According to Shapiro et al. (2009), for the case of crisp output values (only the kernels of measured values), if we apply Tanaka’s method (Tanaka 1987), the range [ Y j , Y j + ] h = 1 = [ K j , K j + ] h = 1 encircles the kernels of experimental values. In this stage, only triangular functions A ˜ 0 ( r 0 , c 0 ) ,   A ˜ 1 ( r 1 , c 1 ) are applied for the method of Tanaka and the possibilistic model used is: Y ˜ j = A ˜ 0 + A ˜ 1 x 1 j . Thus, the problem of determining the estimated values in this step is:
min ( c ) = M c 0 + c 1 i = 1 M x 1 j
Y j + = K j + = i = 0 1 r i x i j + i = 0 1 c i x i j y j k
Y j = K j = i = 0 1 r i x i j i = 0 1 c i x i j y j k ,   x 0 j = 1
c 0 ,   c i 0
Through the solution of this system we get the surroundings [ Y j , Y j + ] h = 1 = [ K j , K j + ] h = 1 as follows:
Y j + = ( r 0 + c 0 ) + ( r 1 + c 1 ) x j
Y j = ( r 0 c 0 ) + ( r 1 c 1 ) x j
As long as they meet the same constraints, these surroundings coincide with the kernel of trapezoidal functions, resulting in the relations below:
Y j = K j = K A ˜ 0 + K A ˜ 1 x 1 j ,     j = 1 , , M Y j + = K j + = K A ˜ 0 + + K A ˜ 1 + x 1 j ,     j = 1 , , M

2.1.3. Step 2

We apply now support inclusion:
[ S y ˜ j , S y ˜ j + ] [ S Y ˜ j , S Y ˜ j + ]
where space [ S y ˜ j , S y ˜ j + ] is given as follows:
[ S y ˜ j = y j = y j   e j   ,     S y ˜ j + = y j + = y j   + e j   ]
Based on relations (3a, 3b, 3c) and (4a, 4b), we get:
K Y ˜ j i = 1 2 c i x i j y j ,         x 1 j = 1
K Y ˜ j + + i = 1 2 c i r x i j y j + ,         x 1 j = 1 ,       j = 1 , 2 , , M
where K Y ˜ j + and K Y ˜ j are known since they have been calculated during step 1.
The problem turns now into the following:
min ( c ) = M ( c 0 + c 0 r ) + ( c 1 + c 1 r ) j = 1 M x 1 j
s.t
K Y ˜ j i = 1 2 c i r x i j y ˜ j e j ,         x 0 j = 1 K Y ˜ j + + i = 1 2 c i r x i j y ˜ j + + e j ,         x 0 j = 1 ,       j = 1 , 2 , , M                             c 0 , c 0 r , c 1 , c 1 r 0                                                                                                            

3. Applications

3.1. Generalities

Given the following rainfall and elevation data of twenty meteorological stations of central Thessaly (Greece) (Table 1 and Table 2), with crisp input data and symmetric output fuzzy data, we want to assess the two step model of Tzimopoulos et al., using the fuzzy regression method, considering trapezoidal membership functions for the assessment.

3.2. Step 1

We apply Tanaka’s model, considering that output data are crisp. For, the model is written as follows:
min ( 20 c 0 + 10926 c 1 )
s.t
r 0 c 0 +                 90 r 1 90 c 1 740.7 r 0 c 0 +                 95 r 1 95 c 1 611.1 r 0 c 0 +       992 r 1 992 c 1 1594.1 r 0 c 0 + 1000 r 1 1000 c 1 1384.9 r 0 + c 0 +                 90 r 1 + 90 c 1 740.7 r 0 + c 0 +                 95 r 1 + 95 c 1 611.1 r 0 + c 0 +       992 r 1 + 992 c 1 1594.1 r 0 + c 0 + 1000 r 1 + 1000 c 1 1384.9
Solving this problem we get kernel equations:
[ Y j , Y j + ] = [ y , y u ] ,   or
y = 529.493 + 0.42706 x ,   y u = 800.989 + 0.79956 x

3.3. Step 2

In phase 2, the kernel is considered to be known and the model is written as follows:
min ( 20 c 0 + 10926 c 1 + 20 c 0 u + 10926 c 1 u )
s.t
c 0       90 c 1 61.70 . c 0 1000 c 1 220.5 c 0 u +                   90 c 1 u 21.10 . c 0 u +   1000 c 1 u 8.00
In Figure 3 calculation results are shown, while support equations are:
[ S y ˜ j , S y ˜ j + ] = [ y , y u ] ,   or
y = 450.069 + 0.3629 x ,   y u = 887.100 + 1.20226 x
In the Figure 4a illustrates the estimated output data, Figure 4b illustrates the measured and predicted values at Megali_Kerasia_Ypex meteorological station, while Figure 3a, b illustrate the predicted parameters A ˜ 0 and A ˜ 1 respectively.

4. Comments-Conclusions

The two-step model has the advantage of using only four unknown quantities during each phase, in contrast with Bisserier and Fung models, which use eight.
In all the above cases, the presented model can be applied for all confidence levels and have the following property: data kernels are included into estimated kernels and data supports are also included into estimated supports.
In the case of rainfall measurement observations, station association is achieved, even for small samples and we can extend the shorter time series, due to fuzzy correlation for more rainfall stations.

References

  1. Zadeh, L.A. Fuzzy Sets. Inf. Control 1965, 8, 338–353. [Google Scholar] [CrossRef]
  2. Savic, D.A.; Pedrycz, W. Evaluation of fuzzy linear regression models. Fuzzy Sets Syst. 1991, 39, 51–63. [Google Scholar] [CrossRef]
  3. Papadopoulos, B.; Sirpi, M. Similarities in Fuzzy Regression Models. J. Optim. Theory Appl. 1999, 102, 373–383. [Google Scholar] [CrossRef]
  4. Papadopoulos, B.; Sirpi, M. Similarities and Distances in Fuzzy Regression Models. Soft Comput. 2004, 8, 556–561. [Google Scholar] [CrossRef]
  5. Redden, D.T.; Woodall, W.H. Further examination of fuzzy linear regression. Fuzzy Sets Syst. 1996, 79, 203–211. [Google Scholar] [CrossRef]
  6. Shapiro, A.F.; Berry-Stölzle, T.R.; Koissi, M.-C. The Fuzziness in Regression Models. ARC2009_Shapiro_06 PDF. 2009; p. 6.
  7. Tanaka, H.; Uejima, S.; Asai, K. Linear regression analysis with fuzzy models. IEEE Trans. Syst. Man Cybern. 1982, 12, 903–907. [Google Scholar]
  8. Tanaka, H. Fuzzy data analysis by possibilistic linear models. Fuzzy Sets Syst. 1987, 24, 363–375. [Google Scholar] [CrossRef]
  9. Tanaka, H.; Watada, J. Possibilistic linear systems and their application to the linear regression model. Fuzzy Sets Syst. 1988, 27, 275–289. [Google Scholar] [CrossRef]
  10. Tanaka, H.; Hayashi, I.; Watada, J. Possibilistic linear regression analysis for fuzzy data. EJOR 1989, 40, 389–396. [Google Scholar] [CrossRef]
  11. Tanaka, H.; Ishibuchi, H. Identification of possibilistic linear systems by quadratic membership functions of fuzzy parameters. Fuzzy Sets Syst. 1991, 41, 145–160. [Google Scholar] [CrossRef]
  12. Tzimopoulos, C.; Papadopoulos, Β. Fuzzy Logic with Application in Engineering; Ziti: Thessaloniki, Greece, 2013; 627p. (In Greek) [Google Scholar]
  13. Tzimopoulos, C.; Papadopoulos, K.; Papadopoulos, B. Fuzzy Regression with Applications in Hydrology. IJEIT 2016, 5, 69–75. [Google Scholar]
  14. Bisserier, A. Une approche Paramétrique de la Régression Linéaire Floue-Formalisation par Intervalles. Ph.D. Thesis, Université de Savoie, Chambéry, France, 2010. [Google Scholar]
  15. Bisserier, A.; Boukezzoula, R.; Galichet, S. Linear Fuzzy Regression Using Trapezoidal Fuzzy Intervals. In Proceedings of the IPMU’08, Torremolinos, Spain, 22–27 June 2008; pp. 181–188. [Google Scholar]
  16. Bisserier, A.; Boukezzoula, R.; Galichet, S. Linear Fuzzy Regression Using Trapezoidal Fuzzy Intervals. JUS 2010, 4, 59–72. [Google Scholar]
  17. Bisserier, A.; Boukezzoula, R.; Galichet, S. A revisited approach to linear fuzzy regression using trapezoidal fuzzy intervals. Inf. Sci. 2010, 180, 3653–3673. [Google Scholar] [CrossRef]
  18. Bisserier, A.; Boukezzoula, R.; Galichet, S. Linear Fuzzy Regression Using Trapezoidal Fuzzy Intervals. In Foundations of Reasoning under Uncertainty; Bouchon-Meunier, B., Magdalena, L., Ojeda-Aciego, M., Verdegay, J.L., Yager, R., Eds.; Springer: Berlin, Germany, 2010; Volume 249, pp. 2–22. [Google Scholar]
  19. Charfeddine, S. Optimisation de l’ offre d’ une Compagnie Aérienne en Environnement Incertain. Thèse de Docteur; Université de Toulouse: Toulouse, France, 2004. [Google Scholar]
  20. Charfeddine, S.; Mora-Camino, F.; De Coligny, M. Fuzzy linear regression: Application to the estimation of air transport demand. In Proceedings of the International Conference on Fuzzy Sets and Soft Computing in Ecomomics and Finance, Saint-Petersburg, Russia, 17–20 June 2004; pp. 350–359. [Google Scholar]
  21. Charfeddine, S.; Zbidi, K.; Mora-Camino, F. Fuzzy Regression Analysis using Trapezoidal Fuzzy Numbers. In Proceedings of the EUSFLAT—LFA 2005, Barcelona, Spain, 7–9 September 2005; pp. 1213–1218. [Google Scholar]
  22. Ganesan, K.; Veeramani, P. Fuzzy Linear Programs with Trapezoidal Fuzzy Numbers. Ann. Oper. Res. 2006, 143, 305–315. [Google Scholar] [CrossRef]
  23. Kheirfam, B.; Verdegay, J.L. The dual simplex method and sensitivity analysis for fuzzy linear programming with symmetric trapezoidal numbers. Fuzzy Optim. Decis. Mak. 2013, 12, 171–189. [Google Scholar] [CrossRef]
  24. Kumar, A.; Kaur, J. A New Method for Solving Fuzzy Linear Programs with Trapezoidal Fuzzy Numbers. J. Fuzzy Set Valued Anal. 2011, 3, 103–118. [Google Scholar] [CrossRef]
  25. Fung, R.Y.K.; Chen, Y.; Tang, J. Estimating the functional relationships for quality function deployment under uncertainties. Fuzzy Sets Syst. 2006, 157, 98–120. [Google Scholar] [CrossRef]
Figure 1. (a) Triangular membership function, (b) Trapezoidal membership function.
Figure 1. (a) Triangular membership function, (b) Trapezoidal membership function.
Proceedings 02 00636 g001
Figure 2. Central Thessaly meteorological stations.
Figure 2. Central Thessaly meteorological stations.
Proceedings 02 00636 g002
Figure 3. (a) Predicted parameters A ˜ 0 , (b) Predicted parameters A ˜ 1 .
Figure 3. (a) Predicted parameters A ˜ 0 , (b) Predicted parameters A ˜ 1 .
Proceedings 02 00636 g003
Figure 4. (a) Estimated fuzzy output data, (b) Measured and predicted output at Meg._Ker._Ypex station.
Figure 4. (a) Estimated fuzzy output data, (b) Measured and predicted output at Meg._Ker._Ypex station.
Proceedings 02 00636 g004
Table 1. Elevation and Rainfall data from meteorological stations of Central Thessaly.
Table 1. Elevation and Rainfall data from meteorological stations of Central Thessaly.
M. StationsKallifonioGrammatikoKarditsomag.KarditsaTrikalaKalamapakaMouza. YpexRahoulaMeg_Ker_YpexAgiofyllo
X (m)909595103110202226330500581
Y (mm)7416116945737078676791065864823
e = 0.15Y111,1107191,66502743104,08928686,02214106,06393130,10625101,859934159,726129,6461372123,495
Table 2. Elevation and Rainfall data from meteorological stations of Central Thessaly.
Table 2. Elevation and Rainfall data from meteorological stations of Central Thessaly.
M. StationsLoutr._YpexAmarantosMolohaBathylakkosMalakasioBronteroRentinaChrisomiliaArgitheaKerasia
X (m)7307447908008428539039409921000
Y (mm)841117213481118104514751112129415941384,9
e = 0.15Y126,18743175,8080864202,244286167,6779156,731302221,27336166,768142194,169239,1214286207,7294
Notation: e means the spread of the rainfall data, which are symmetrical triangular fuzzy numbers.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Tzimopoulos, C.; Evangelides, C.; Vrekos, C.; Samarinas, N. Fuzzy Linear Regression of Rainfall-Altitude Relationship. Proceedings 2018, 2, 636. https://doi.org/10.3390/proceedings2110636

AMA Style

Tzimopoulos C, Evangelides C, Vrekos C, Samarinas N. Fuzzy Linear Regression of Rainfall-Altitude Relationship. Proceedings. 2018; 2(11):636. https://doi.org/10.3390/proceedings2110636

Chicago/Turabian Style

Tzimopoulos, Christos, Christos Evangelides, Christos Vrekos, and Nikiforos Samarinas. 2018. "Fuzzy Linear Regression of Rainfall-Altitude Relationship" Proceedings 2, no. 11: 636. https://doi.org/10.3390/proceedings2110636

APA Style

Tzimopoulos, C., Evangelides, C., Vrekos, C., & Samarinas, N. (2018). Fuzzy Linear Regression of Rainfall-Altitude Relationship. Proceedings, 2(11), 636. https://doi.org/10.3390/proceedings2110636

Article Metrics

Back to TopTop