Next Article in Journal
An Urban Governance Framework for Including Environmental Migrants in Sustainable Cities
Previous Article in Journal
Influence of Climate on Conflicts and Migrations in Southern Africa in the 19th and Early 20th Centuries
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Rainy Day Prediction Model with Climate Covariates Using Artificial Neural Network MLP, Pilot Area: Central Italy

by
Matteo Gentilucci
* and
Gilberto Pambianchi
School of Science and Technology, Geology Division, University of Camerino, 62032 Camerino, Italy
*
Author to whom correspondence should be addressed.
Climate 2022, 10(8), 120; https://doi.org/10.3390/cli10080120
Submission received: 19 July 2022 / Revised: 9 August 2022 / Accepted: 15 August 2022 / Published: 17 August 2022

Abstract

:
The reconstruction of daily precipitation data is a much-debated topic of great practical use, especially when weather stations have missing data. Missing data are particularly numerous if rain gauges are poorly maintained by their owner institutions and if they are located in inaccessible areas.In this context, an attempt was made to assess the possibility of reconstructing daily rainfall data from other climatic variables other than the rainfall itself, namely atmospheric pressure, relative humidity and prevailing wind direction.The pilot area for the study was identified in Central Italy, especially on the Adriatic side, and 119 weather stations were considered.The parameters of atmospheric pressure, humidity and prevailing wind direction were reconstructed at all weather stations on a daily basis by means of various models, in order to obtain almost continuous values rain gauge by rain gauge. The results obtained using neural networks to reconstruct daily precipitation revealed a lack of correlation for the prevailing wind direction, while correlation is significant for humidity and atmospheric pressure, although they explain only 10–20% of the total precipitation variance. At the same time, it was verified by binary logistic regression that it is certainly easier to understand when it will or will not rain without determining the amount. In this case, in fact, the model achieves an accuracy of about 80 percent in identifying rainy and non-rainy days from the aforementioned climatic parameters. In addition, the modelling was also verified on all rain gauges at the same time and this showed reliability comparable to an arithmetic average of the individual models, thus showing that the neural network model fails to prepare a model that performs better from learning even in the case of many thousands of data (over 400,000). This shows that the relationships between precipitation, relative humidity and atmospheric pressure are predominantly local in nature without being able to give rise to broader generalisations.

1. Introduction

1.1. Aim of the Study and State of the Art

The reconstruction of daily precipitation data is a topic that climatologists have always tried to solve. The difficulty lies in the fact that rainfall can be extremely localised, [1] and therefore, change radically a few kilometres away from the measurement, so it is complicated to reconstruct it on the basis of nearby rain gauges or terrain topography [2]. In an attempt to solve this problem, climatologists very often rely on satellite measurements in order to quantify daily precipitation over the entire globe [3,4]. Among the tools that enable these evaluations are the IMERG (Integrate Multi-satellitE Retrievals for GPM) algorithm or the TMPA (TRMM multi-satellite precipitation analyses) or PERSIANN (Precipitation Estimation from Remotely Sensed Information using Artificial Neural Networks), etc.; however, to date, it has been shown that they are still highly inaccurate in some areas, and therefore, not always very useful for quantifying rainfall [5,6,7]. On the other hand, there remains the possibility of trying to reconstruct rainfall values with neighbouring rain gauges through geostatistics, which although probably the most accurate and reliable method, with the obvious problems described above, for reasons of time and calculation is not very useful on a daily scale [8,9]. In this context, therefore, a path remains little explored to date, that of reconstructing precipitation from other climatic variables, such as atmospheric pressure, relative humidity and the prevailing wind direction in our case.In fact, it is very rare in the literature that other climate parameters are used to reconstruct precipitation on a daily scale [10]. This reconstruction procedure is very often neglected because weather stations are more often equipped with rain gauges than with hygrometers, anemometers or barometers, so it is complicated to reconstruct rainfall data from variables that are not continuous over the territory and the individual weather station.Sometimes pressure data are used to reconstruct precipitation in very old time series, due to the lack of rainfall data, which leads to not very reliable results [11]; while precipitation reconstructions with relative humidity and wind direction as covariates are almost absent in the literature.In order to obtain the aforementioned climate variables (atmospheric pressure, relative humidity and prevailing wind direction) for each weather station whose precipitation data are available, it is necessary in some cases to reconstruct them in full.Many methods for reconstructing climate data are reported in the literature, e.g., for wind, modelling based either on digital terrain models [12] or combined with satellite data (Lidar) is very often used [13]. Relative humidity, on the other hand, is very often spatialised, especially with the use of deterministic or geostatistical techniques, however, the timescale is rarely daily and more often monthly or yearly [14,15]. The key part of reconstructing precipitation data lies in the choice of the statistical method used, i.e., modelling. There are many ways to reconstruct data on the basis of the relationships observed between the dependent variable and the supposed independent variables. The time scale is certainly an important limitation, as geostatistical methods, such as kriging with its variants are based on the interpretation of the semivariogram, so in the case of hundreds of data to reconstruct, it is very time-consuming [16]. In these cases, techniques that need to estimate fewer parameters are required, among these are the various types of regression, linear, non-linear, logistic, or even artificial neural networks including radial basis functions or multilayer perceptrons [17,18]. Previous climate research shows a greater potential of the artificial neural network than multiple regression, allowing for a reduction in error [19]. There have been approaches to the reconstruction of climate variables completely based on machine learning, combining stochastic synthesis with a multilayer perceptron; however, serious doubts remain as to the comprehensibility of the model and its repeatability due to the two hidden layers having no physical or climatic variables as input parameters [20]. In the literature, there have also been examples of machine learning models applied to the reconstruction of daily rainfall that has good results, in particular, studies have verified the reconstruction potential of the multilayer perceptron with the support vector machine and the random forest [21]. These models developed in semi-arid areas perform well in terms of RMSE and R2, precisely because of the greater ease of predicting near-zero values and the resulting low variance [21]. This research aims to interpolate or reconstruct if necessary, the data of independent variables, such as relative humidity, atmospheric pressure and prevailing wind direction. Subsequently, the aim of the research is to assess the significance of the relationships between precipitation and other climatic variables, so as to be able to help explain a percentage of the variance in daily precipitation and to use this model together with other independent variables in order to obtain increasingly reliable results. At the same time, there is also the ambitious goal of understanding, by reducing precipitation to a binary variable that can always be assessed by means of multilayer perceptron analysis, how to predict a rainfall event and with what accuracy.Thus, the innovativeness of this research lies both in quantifying the relationship between variables, such as relative humidity, wind and atmospheric pressure for the prediction of daily rainfall, and in the possibility of understanding on the basis of these variables the presence or absence of a rainy day, all using neural networks as a modelling technique.In fact, there are no other examples in the literature that rely on three independent climate variables to reconstruct daily rainfall.

1.2. Geographical Framework

Central Italy is bordered by two seas, the Tyrrhenian Sea and the Adriatic Sea, arms of the Mediterranean Sea which, depending on their size, influence the climate of inland areas to a greater or lesser extent. The extent of the study area is approximately 13,400 km, and a total of 119 rain gauges are distributed over the area (Figure 1).
The study area encompasses an area that stretches from the Adriatic Sea to the Apennines with peaks over 2000 m and overlooks the Tyrrhenian slope for a few kilometres, thus it is a particularly heterogeneous area in terms of elevation

2. Materials and Methods

2.1. Weather Stations and Climate Data

Climate data were collected by four institutions: the Functional Multiple Risk Centre of the Civil Protection of the Marche Region, the hydrographic service of the Umbria Region, the national system for the processing and dissemination of climate data of the IstitutoSuperiore per la Protezione e la RicercaAmbientale (ISPRA) and the Experimental Geophysical Observatory of Macerata (OGSM). The total number of rain gauges is 119, collected from 2010 to 2021, a shorter period than the classic standard 30-year period prescribed by the WMO, which was chosen on the one hand due to the large amount of data (scale is daily), and on the other hand due to missing data not being too extensive for each weather station, what it would have been like collecting the previous 20 years. Regarding the prevailing wind direction, data were collected from 16 anemometers, while 42 weather stations in the study area are equipped with hygrometers and 12 weather stations with barometers (Table 1).

2.2. Interpolation and Extrapolation of Climate Data

In order to obtain the climatic values for atmospheric pressure, relative humidity and prevailing wind direction, interpolative procedures were required. For both atmospheric pressure and relative humidity, an inverse distance weighting (IDW) based on the square of the distance was performed, with the following formula:
z ^ ( x ) = i n w i z i i n w i
where
w i = | x x i | 2
| x x i | = Euclidean   distance
z ^ ( x ) = value z at location x, it is a weighted mean of nearby observations
IDW has been calculated for each day reconstructing missing values and testing errors by means of cross-correlation, which led on average to the resultsshown in Table 2 and Figure 2.
On the other hand, for the prevailing wind direction, the situation was more complex, both because of the variability to which the direction is usually subject on uneven topographical surfaces, and because of the few weather stations equipped with an anemometer.In light of these issues, in order to obtain the most correct values possible, wind direction and wind speed modelling software, Wind Ninja, was used [22]. This software was primarily designed to assess the development of fires; however, it also performs well in the climatic field.In this way, the values of wind from the weather stations were introduced into the software and the response was evaluated by interpolating the entire surface based on the topographic obstacles encountered by the wind. This procedure then made it possible to extrapolate the values obtained inall weather stations where wind direction values were missing (Figure 3).

2.3. Predictive Modelling

The main objective of the research is to obtain a model that can best exploit atmospheric pressure, relative humidity and prevailing wind direction to predict precipitation or at least explain some of the variance. Artificial neural networks were used to do this, which have the peculiarity of simulating the functioning of the animal brain by using non-linear data modelling to discover complex relationships between data (Figure 4). In particular, the algorithm called multilayer perceptron (MLP) was used, which has input layers and output layers, and in between the two, there are hidden layers with many stacked neurons [23].
The MLP algorithm is as follows:
f ( X ) = b 2 + W 2 ( f A ( b 1 + W 1 X ) )
where
W 1 = weight matrices of the hidden layer
W 2 = weight matrices of the output layer
b 1 = bias vector of the hidden layer
b 2 = bias vector of the output layer
f A = activation function
The goodness of the model was assessed using the relative error [24]:
R E v = y r v y ^ r v / y r v
where
y r v = source point
y ^ r v = predicted point
Subsequently, the possibility of creating a model that could predict whether it would rain or not was also considered. Again, the MLP technique was used, and precipitation was transformed into a binary categorical variable, with zero indicating no precipitation and one indicating the presence of precipitation. In this case, the success rate of the model was evaluated, corresponding to the ability to correctly predict a rainy or a non-rainy day. In order to assess the importance of each individual variable in the model, an analysis was carried out to assess how much the predicted values change as the independent variables change. In particular, the effect of a perturbation introduced into the model is measured and related to the measured data. It is calculated by measuring the effect this perturbation has by computing the Root Mean Square difference between the original y ^ and the perturbed y ^ i . A larger Root Mean Square difference means that the variable is “more important”. Finally, to assess the reliability of the model, it was divided into 30% data subject to training and 70% data subject to testing. For each of these two categories, the correctly predicted values were evaluated, i.e., those in which the model predicts rain or no rain, in agreement with the observed data, and this generates a percentage of model reliability.

3. Results

3.1. Quantitative Rainfall Prediction Model Based on MLP Technique

The MLP technique was used on all 119 available weather stations, with precipitation as the dependent variable and relative humidity, atmospheric pressure and prevailing wind direction as covariates. From the point of view of the architecture of the neural network, two hidden layers were chosen between the input and output layersand evaluatedon the basis of the decrease in relative error. For each weather station modelled, a 70% sample was taken for model training and a 30% sample for testing the model. This percentage was chosen by analysing the best performance of the model in relation to the various percentages of data taken for training and those taken for testing as can be seen in Table 3.
The results as can be seen in Table 4 are not such that daily precipitation can be predicted on the basis of the chosen covariates, however, some interesting indications were obtained. It was observed that especially atmospheric pressure and relative humidity possess a significant relationship with precipitation and thus could be exploited for introduction into other models together with other variables.
From Table 4, it can be seen that the model can estimate on average between 10 and 20% of the variance of the daily precipitation at each weather station.At the same time, the importance of each independent variable used in the model was tested for each weather station, and this revealed that both atmospheric pressure and relative humidity have a weight, while wind direction does not seem to influence precipitation(Table 5).In this case, the influence of each variable on precipitation was calculated through the independent variable importance, which is a measure of how much the value predicted by the model changes for different values of the independent variable(Table 5).
The most important variable in most cases is atmospheric pressure, followed by relative humidity, with wind being of little importance, although there are some weather stations for which wind is a significant variable.

3.2. Binary Forecast Model of Rainy and Non-Rainy Days Based on the MLP Technique

Observing the significance ofthe correlation between relative humidity, atmospheric pressure and precipitation, led us to draw up another model with the aim this time of predicting rainy and non-rainy days (Table 6).
In this case, the model developed with the MLP turned out to be much better when it does not have to assess the extent of rainfall but only discriminates between rainy and non-rainy days. The hyperbolic tangent was chosen as the hidden layer activation function and the softmax as the output layer activation function. In fact, the correctness of the model in the random samples chosen as tests for the model itself led to the model being considered correct, on average, in 79.7% of the cases for the training part of the data and 80% for the testing part of the data. The sensitivity and specificity of the test arealso reported in order to understand the actual validity of the model and how accurately it can reconstruct rainy and non-rainy days (Table 7).
For further confirmation, all observations were also tested simultaneously; however, this did not improve the model, which gave almost exactly the same results with a prediction accuracy of 80.1%.
For the purpose of completeness, here is the network diagram showing how wind direction has been treated as a categorical variable, while atmospheric pressure and relative humidity as numerical variables (Figure 5).

4. Discussion

This research had the absolutely innovative aim of using other climate variables as covariates to predict daily precipitation. In particular, there are many examples in the literature of multivariate analyses in which the independent variables are precipitation-related environmental variables spatialised in a GIS environment [17,25,26]. In contrast, there are no examples exploring the correlation between precipitation and other climatic variables, especially atmospheric pressure in the literature is never considered for this type of analysis, but rather is often correlated with teleconnective indices [27]. However, a good significance was observed in the correlation between daily precipitation and atmospheric pressure, so this was on average the most important variable for the model in the analysis.Regarding the prevailing wind direction, in some cases, it was used as a well-correlated variable for precipitation prediction [28,29], exactly the opposite of what was found in this analysis, where wind direction was observed to be of marginal importance compared to the other variables used.Finally, relative humidity is most often part of analyses that combine climate and pollution [30], although in this case, it shows a good correlation for precipitation prediction.From the point of view of model architecture, MLP-type artificial neural networks have been increasingly used for predictive purposes over the past 20 years, and they have also found quite interesting applications in rainfall forecasting [31]. Normally, however, predictive climate analyses of this type analyse monthly scales, whereas in this case the daily scale was evaluated. There have also been analyses that seem to be able to predict rainfall, again using machine learning techniques, but with one major difference, that these analyses were performed on arid climate zones that, therefore, do not possess the high variability of rainfall present in the pilot area that can be defined as a humid climate. This has obvious repercussions on the error of the model and it follows that it cannot be considered more correct if it is not applied to the same climate zone [21]. In the case of the present research, the input parameters were represented by threeclimate variables. The result of the quantitative analysis on the one hand made it clear that the threechosen variables are not sufficient to be able to indicate responsive precipitation values; however, they do allow us to discriminate those that could be exported into a new model by making it more refined.In particular, atmospheric pressure, relative humidity and prevailing wind direction can explain 10 to 20 percent of the variance, with a smaller role for the wind variable. On the other hand, a good model fit emerged with regard to the possibility of rain during the day, in fact, the correctness of the forecast reaches about 80 percent, which is certainly significant.

5. Conclusions

Quantitative prediction of daily precipitation is certainly not an easy subject to solve, and variables, such as atmospheric pressure, relative humidity and prevailing wind direction alone are not sufficient to guarantee a good model response. However, the relationships found, make it possible to think of developments through the addition of other variables that may have even greater weight. As far as the prediction of the possibility of daily precipitation is concerned, then in that case they show that they can result in a rather high-performance model.A final consideration must be made regarding the applicability of the relationships obtained, since having observed that a much larger number of samples does not improve the model, it is evident that the relationships between precipitation and atmospheric pressure or relative humidity are extremely local, so much so that not even in such a close and almost homogeneous environment do they lead to an improvement in the performance of the model. Therefore, a model, such as this must always be calibrated area by area and cannot be subject to extensive generalisations.

Author Contributions

Conceptualisation, M.G. and G.P.; methodology, M.G.; software, M.G.; validation, M.G. and G.P.; formal analysis, M.G.; investigation, G.P.; resources, M.G.; data curation, G.P.; writing—original draft preparation, G.P.; writing—review and editing, G.P.; visualisation, M.G.; supervision, M.G.; project administration, M.G.; funding acquisition, M.G. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Pelfini, M.; Santilli, M. Frequency of debris flows and their relation with precipitation: A case study in the Central Alps, Italy. Geomorphology 2008, 101, 721–730. [Google Scholar] [CrossRef]
  2. Kisaka, M.O.; Mucheru-Muna, M.; Ngetich, F.K.; Mugwe, J.N.; Mugendi, D.; Mairura, F. Rainfall Variability, Drought Characterization, and Efficacy of Rainfall Data Reconstruction: Case of Eastern Kenya. Adv. Meteorol. 2015, 2015, 380404. [Google Scholar] [CrossRef]
  3. Wu, L.; Zhai, P. Validation of daily precipitation from two high-resolution satellite precipitation datasets over the Tibetan Plateau and the regions to its east. Acta Meteorol. Sin. 2012, 26, 735–745. [Google Scholar] [CrossRef]
  4. Hong, Y.; Adler, R.; Huffman, G. Evaluation of the potential of NASA multi-satellite precipitation analysis in global landslide hazard assessment. Geophys. Res. Lett. 2006, 33, g1028010. [Google Scholar] [CrossRef]
  5. Dezfuli, A.K.; Ichoku, C.M.; Huffman, G.J.; Mohr, K.I.; Selker, J.S.; Van De Giesen, N.; Hochreutener, R.; Annor, F.O. Validation of IMERG precipitation in Africa. J. Hydrometeorol. 2017, 18, 2817–2825. [Google Scholar] [CrossRef] [PubMed]
  6. Gentilucci, M.; Pambianchi, G. Prediction of Snowmelt Days Using Binary Logistic Regression in the Umbria-Marche Apennines (Central Italy). Water 2022, 14, 1495. [Google Scholar] [CrossRef]
  7. Gentilucci, M.; Barbieri, M.; Pambianchi, G. Reliability of the IMERG product through reference rain gauges in Central Italy. Atmospheric Res. 2022, 278, 106340. [Google Scholar] [CrossRef]
  8. Gentilucci, M.; Barbieri, M.; Burt, P.; D’Aprile, F. Preliminary data validation and reconstruction of temperature and precip-itation in Central Italy. Geosciences 2018, 8, 202. [Google Scholar] [CrossRef]
  9. Tiwari, S.; Jha, S.K.; Sivakumar, B. Reconstruction of daily rainfall data using the concepts of networks: Accounting for spatial connections in neighborhood selection. J. Hydrol. 2019, 579, 124185. [Google Scholar] [CrossRef]
  10. Mishra, V.; Shah, R.; Azhar, S.; Shah, H.; Modi, P.; Kumar, R. Reconstruction of droughts in India using multiple land-surface models (1951–2015). Hydrol. Earth Syst. Sci. 2018, 22, 2269–2284. [Google Scholar] [CrossRef]
  11. Gimmi, U.; Luterbacher, J.; Pfister, C.; Wanner, H. A method to reconstruct long precipitation series using systematic de-scriptive observations in weather diaries: The example of the precipitation series for Bern, Switzerland (1760–2003). Theor. Appl. Climatol. 2007, 87, 185–199. [Google Scholar] [CrossRef]
  12. Rios, O.; Valero, M.M.; Pastor, E.; Planas, E. A data-driven fire spread simulator: Validation in Vall-Llobrega’s fire. Front. Mech. Eng. 2019, 5, 8. [Google Scholar] [CrossRef]
  13. Towers, P.; Jones, B.L. Real-time wind field reconstruction from LiDAR measurements using a dynamic wind model and state estimation. Wind. Energy 2014, 19, 133–150. [Google Scholar] [CrossRef]
  14. Nguyen, X.T.; Nguyen, B.T.; Do, K.P.; Bui, Q.H.; Nguyen, T.N.T.; Vuong, V.Q.; Le, T.H. Spatial interpolation of meteorologic variables in Vietnam using the Kriging method. J. Inf. Process. Syst. 2015, 11, 134–147. [Google Scholar]
  15. Li, L.; Zha, Y. Mapping relative humidity, average and extreme temperature in hot summer over China. Sci. Total Environ. 2018, 615, 875–881. [Google Scholar] [CrossRef]
  16. Chen, Y.-C.; Wei, C.; Yeh, H.-C. Rainfall network design using kriging and entropy. Hydrol. Process. 2007, 22, 340–346. [Google Scholar] [CrossRef]
  17. Gentilucci, M.; Bisci, C.; Burt, P.; Fazzini, M.; Vaccaro, C. Interpolation of Rainfall Through Polynomial Regression in the Marche Region (Central Italy). In the Annual International Conference on Geographic Information Science; Springer: Cham, Switzerland, 2018; pp. 55–73. [Google Scholar]
  18. Hamidi, O.; Poorolajal, J.; Sadeghifar, M.; Abbasi, H.; Maryanaji, Z.; Faridi, H.R.; Tapak, L. A comparative study of support vector machines and artificial neural networks for predicting precipitation in Iran. Theor. Appl. Climatol. 2014, 119, 723–731. [Google Scholar] [CrossRef]
  19. Mekanik, F.; Imteaz, M.; Gato-Trinidad, S.; Elmahdi, A. Multiple regression and Artificial Neural Network for long-term rainfall forecasting using large scale climate modes. J. Hydrol. 2013, 503, 11–21. [Google Scholar] [CrossRef]
  20. Rozos, E.; Dimitriadis, P.; Mazi, K.; Koussis, A. A Multilayer Perceptron Model for Stochastic Synthesis. Hydrology 2021, 8, 67. [Google Scholar] [CrossRef]
  21. Bellido-Jiménez, J.A.; Gualda, J.E.; García-Marín, A.P. Assessing Machine Learning Models for Gap Filling Daily Rainfall Series in a Semiarid Region of Spain. Atmosphere 2021, 12, 1158. [Google Scholar] [CrossRef]
  22. Forthofer, J.; Shannon, K.; Butler, B. Simulating diurnally driven slope winds with WindNinja. In Proceedings of the 8th Eighth Symposium on Fire and Forest Meteorology, Kalispell, MT, USA, 13–15 October 2009; American Meteorological Society: Boston, MA, USA, 2009. 13p. Available online: https://ams.confex.com/ams/8Fire/techprogram/paper_156275.htm (accessed on 1 February 2022).
  23. Pham, B.T.; Nguyen, M.D.; Bui, K.-T.T.; Prakash, I.; Chapi, K.; Bui, D.T. A novel artificial intelligence approach based on Multi-layer Perceptron Neural Network and Biogeography-based Optimization for predicting coefficient of consolidation of soil. Catena 2019, 173, 302–311. [Google Scholar] [CrossRef]
  24. Wong, W.; Xia, M.; Chu, W. Adaptive neural network model for time-series forecasting. Eur. J. Oper. Res. 2010, 207, 807–816. [Google Scholar] [CrossRef]
  25. Gouvas, M.; Sakellariou, N.; Xystrakis, F. The relationship between altitude of meteorological stations and average monthly and annual precipitation. Stud. Geophys. Geod. 2009, 53, 557–570. [Google Scholar] [CrossRef]
  26. Gentilucci, M.; Barbieri, M.; Burt, P. Climate and Territorial Suitability for the Vineyards Developed Using GIS Techniques. In Conference of the Arabian Journal of Geosciences; Springer: Cham, Switzerland, 2018; pp. 11–13. [Google Scholar] [CrossRef]
  27. Kim, J.-S.; Seo, G.-S.; Jang, H.-W.; Lee, J.-H. Correlation analysis between Korean spring drought and large-scale teleconnection patterns for drought forecasting. KSCE J. Civ. Eng. 2016, 21, 458–466. [Google Scholar] [CrossRef]
  28. Mello, L.; Pontes, M.S.; Fagundes, I.; Almeida, M.P.C.; Andrade, F.J.A. Modified rain attenuation prediction method considering the effect of wind direction. J. Microw. Optoelectron. Electromagn. Appl. 2014, 13, 254–267. [Google Scholar] [CrossRef]
  29. Jenamani, R.K.; Bhan, S.C.; Kalsi, S.R. Observational/forecasting aspects of the meteorological event that caused a record highest rainfall in Mumbai. Curr. Sci. 2006, 90, 1344–1362. [Google Scholar]
  30. Ng, D.H.L.; Li, R.; Raghavan, S.V.; Liong, S.-Y. Investigating the relationship between Aerosol Optical Depth and Precipitation over Southeast Asia with Relative Humidity as an influencing factor. Sci. Rep. 2017, 7, 13395. [Google Scholar] [CrossRef]
  31. Hung, N.Q.; Babel, M.S.; Weesakul, S.; Tripathi, N.K. An artificial neural network model for rainfall forecasting in Bangkok, Thailand. Hydrol. Earth Syst. Sci. 2009, 13, 1413–1425. [Google Scholar] [CrossRef]
Figure 1. Map of the study area and positioning of rain gauges.
Figure 1. Map of the study area and positioning of rain gauges.
Climate 10 00120 g001
Figure 2. Interpolation example performed with the IDW method and annual average data from 2010 to 2021: (a) Relative humidity (b) Atmospheric pressure.
Figure 2. Interpolation example performed with the IDW method and annual average data from 2010 to 2021: (a) Relative humidity (b) Atmospheric pressure.
Climate 10 00120 g002
Figure 3. Example of the influence of the topography of the study area on the surface wind direction in the case of a north-easterly wind.
Figure 3. Example of the influence of the topography of the study area on the surface wind direction in the case of a north-easterly wind.
Climate 10 00120 g003
Figure 4. Structure of the MLP [21].
Figure 4. Structure of the MLP [21].
Climate 10 00120 g004
Figure 5. Network diagram of the aggregate model of all weather stations, where var_p_A is atmospheric pressure; var_h is humidity; var_w is wind; var_p = 0 means no rainfall; var_p = 1 means rainfall.
Figure 5. Network diagram of the aggregate model of all weather stations, where var_p_A is atmospheric pressure; var_h is humidity; var_w is wind; var_p = 0 means no rainfall; var_p = 1 means rainfall.
Climate 10 00120 g005
Table 1. Name, name of the weather stations, Long., longitude, Lat., latitude, Alt., altitude, Sen., equipment of sensors to measure climatic variables r (rain gauge), p (barometer), w (anemometer), h (hygrometer).
Table 1. Name, name of the weather stations, Long., longitude, Lat., latitude, Alt., altitude, Sen., equipment of sensors to measure climatic variables r (rain gauge), p (barometer), w (anemometer), h (hygrometer).
IdNameLong.Lat.Alt.Sen.IdNameLong.Lat.Alt.Sen.
1Acqualagna12.743.6193r,h61Monte Grimano Terme12.543.8362r
2Acquasanta13.442.8392r,h62Monte Paganuccio12.843.6889r,h
3Agugliano13.443.5170r63Monte Prata13.242.91813r,p,w,h
4Amandola13.443.0550r64Montecchio12.843.943r
5Ancona Baraccola13.543.637r,h65Montefano13.443.4215r,h
6Ancona Regione13.543.691r,p,w,h66Montelabbate12.843.865r
7Apecchio12.443.6465r67Montemonaco13.342.9987r
8Appignano13.343.4199r68Mozzano13.542.8193r,p,h
9Arcevia12.943.5535r,h69Nocera Umbra12.843.1535r
10Arquata del Tronto13.342.8720r70Norcia13.142.8700r
11Badia Tedalda12.243.7756r71Osimo Monteragolo13.543.5123r,w,h
12Barbara13.043.6219r72Pennabilli12.343.8600r
13Bastia Umbra12.643.1214r73Pergola12.843.6306r,h
14Bettolelle13.243.726r74Pesaro 12.943.99r
15Bocca Serriola12.443.5730r75Piagge13.043.7201r
16Bolognola Pintura13.243.01352r,p,w,h76Pianello di Cagli12.643.5384r,w,h
17Bronzo12.543.8173r,h77Pievebovigliana13.143.1451r
18Ca’Mazzasette12.643.8112r78Piobbico12.543.6339r
19Camerino13.143.1664r,p,w,h79Pioraco13.043.2441r
20Campodiegoli12.843.3507r80Poggio San Vicino13.143.4580r,h
21Cantiano12.643.5360r81Ponte Felcino12.443.1205r
22Capodacqua13.242.7817r82Porto Sant’Elpidio13.843.23r,p,w,h
23Carestello12.543.3523r83Recanati13.543.4235r
24Carpegna12.343.8748r84Ripatransone13.843.0494r
25Cascia 13.042.7604r85Rostighello13.543.428r
26Case San Giovanni13.043.4620r86Rotella13.643.0385r
27Cingoli13.243.4790r,h87Sant’Angelo in Pontano13.443.1473r
28Citta di Castello12.343.5304r88Sant’Angelo in Vado 12.443.7359r,h
29Colle13.143.5350r,p,w,h89San Bendetto del Tronto13.942.96r,p,w,h
30Colleponi12.943.4254r,h90San Giovanni13.043.4625r
31Corinaldo13.043.6203r91San Lorenzo in Campo12.943.6209r
32Cupramontana13.143.4506r92Santa Maria di Pieca13.343.1467r
33Esanatoglia Convento12.943.3608r93Santa Maria Goretti13.743.0130r
34Fabriano 12.943.3357r,h94Santa Maria in Arzilla12.943.853r
35Fermo 13.743.2280r95San Severino Marche 13.243.2220r,h
36FiastraTrebbio13.243.0747r,h96Sassofeltrio12.543.9221r
37Filottrano13.343.4270r97Sassoferrato12.943.4312r
38Foligno12.743.0224r98Sassotetto13.243.01365r,p,w,h
39Fonte Avellana12.743.5689r,h99Scheggia12.743.4688r
40Foresta della Cesana12.743.7640r,h100Sefro13.043.2469r
41Forsivo13.042.8968r101Sellano12.942.9608r
42Fossombrone12.843.7116r102Senigallia13.243.75r,w,h
43Gallo12.743.8122r,h103Serravalle di Chienti13.043.1647r,h
44Gelagna Alta13.043.1711r104Servigliano13.543.1215r
45Grottammare13.943.04r105Sorti13.043.1672r,h
46Grottazzolina13.643.1200r106Spindoli12.943.2484r
47GualdoTadino12.843.2535r107Spinetoli13.842.952r
48Gubbio12.643.3473r108Svarchi13.643.56r
49Illice13.442.9760r109Tavoleto12.643.8426r
50Jesi13.243.596r,h110Tolentino 13.343.2244r,w,h
51Loreto 13.643.4127r,h111Trestina12.243.4257R
52Loro Piceno13.443.2435r,h112Umito13.442.7646R
53Lucrezia13.043.836r,h113Urbania12.543.7273R
54Macerata13.443.3294r,w,h114Urbino12.643.7451r,p,w,h
55Marotta13.143.7144r115Ussita13.143.0744r
56Metaurilia13.143.87r116Villa Fastiggi12.943.922r,p,w,h
57Moie13.143.5110r117Villa Potenza13.443.3133r,h
58Monte Bove Sud13.242.91917r,p,w,h118Villa San Filippo13.643.358r
59Monte Cavallo13.043.0960r119Vallo di Nera12.942.8310r
60Monte Cucco12.743.41092r
Table 2. Table of average errors in the model from 2010 to 2021; ME, mean error; RMSE, root mean square error.
Table 2. Table of average errors in the model from 2010 to 2021; ME, mean error; RMSE, root mean square error.
Error of a Model in the PredictionRelative HumidityAtmospheric Pressure
ME−0.14−0.01
RMSE4.861.79
Table 3. Evaluation of the correct prediction percentage of rainfall and non-rainfall events with different percentages of testing and training datasets.
Table 3. Evaluation of the correct prediction percentage of rainfall and non-rainfall events with different percentages of testing and training datasets.
Training PercentageTesting PercentageCorrect Percentage
505079.7
604079.8
703080.0
802079.9
901079.9
Table 4. Assessment of the relative error for the training (reTr) and for the testing (ret), for each weather station identified by the id code (Id).
Table 4. Assessment of the relative error for the training (reTr) and for the testing (ret), for each weather station identified by the id code (Id).
IdreTrretIdreTrretIdreTrretIdreTrretIdreTrretIdreTrret
10.830.83210.820.83410.890.89610.890.93810.860.811010.830.86
20.890.85220.840.87420.870.84620.870.88820.900.871020.870.89
30.830.86230.840.81430.860.82630.900.91830.830.841030.890.92
40.930.89240.850.88440.860.84640.860.85840.890.891040.900.91
50.840.88250.800.83450.810.87650.860.86850.870.831050.780.80
60.860.87260.880.88460.760.84660.900.86860.810.731060.880.84
70.850.86270.890.90470.900.89670.860.84870.910.901070.820.77
80.830.82280.860.90480.830.82680.820.73880.890.881080.910.92
90.900.91290.840.88490.840.83690.840.80890.880.961090.930.93
100.860.91300.830.83500.850.84700.880.85900.820.781100.850.86
110.880.87310.840.83510.880.88710.860.81910.920.941110.860.91
120.870.84320.870.87520.860.86720.840.85920.850.841120.880.90
130.890.86330.850.85530.870.87730.820.82930.880.881130.850.88
140.850.85340.810.85540.860.85740.840.87940.930.911140.850.83
150.890.90350.810.79550.860.83750.860.88950.810.881150.880.85
160.860.89360.860.84560.930.90760.850.85960.730.901160.840.85
170.910.87370.860.86570.810.83770.880.87970.850.901170.850.82
180.910.91380.880.85580.890.92780.880.88980.860.901180.890.86
190.840.82390.750.80590.800.82790.820.84990.870.881190.860.85
200.850.82400.830.82600.820.83800.850.811000.850.84
Table 5. Independent variable importance for all the weather stations; Id = code to identify the weather station; V. = independent variable (p, atmospheric pressure; h, relative humidity; w, wind direction); Imp = value of the independent variable importance.
Table 5. Independent variable importance for all the weather stations; Id = code to identify the weather station; V. = independent variable (p, atmospheric pressure; h, relative humidity; w, wind direction); Imp = value of the independent variable importance.
IdV.ImpIdV.ImpIdV.ImpIdV.ImpIdV.ImpIdV.ImpIdV.ImpIdV.Imp
1p0.6416p0.4631p0.5346p0.2961p0.33 p0.58 p0.84 p0.76
h0.34h0.44h0.44h0.62h0.6176h0.3491h0.10106h0.15
w0.02w0.10w0.03w0.09w0.06 w0.08 w0.06 w0.09
2p0.5517p0.4932p0.4947p0.7762p0.52 p0.66 p0.37 p0.06
h0.37h0.43h0.47h0.04h0.3577h0.1892h0.57107h0.85
w0.08w0.08w0.04w0.19w0.13 w0.16 w0.06 w0.09
3p0.4218p0.6733p0.7948p0.6163p0.16 p0.74 p0.51 p0.60
h0.49h0.26h0.11h0.32h0.5978h0.1393h0.45108h0.38
w0.09w0.07w0.10w0.07w0.25 w0.13 w0.04 w0.02
4p0.5919p0.5334p0.5549p0.2464p0.62 p0.49 p0.77 p0.97
h0.16h0.45h0.39h0.72h0.2879h0.4894h0.05109h0.02
w0.25w0.02w0.06w0.04w0.10 w0.03 w0.18 w0.01
5p0.4620p0.8035p0.4250p0.5365p0.61 p0.51 p0.64 p0.46
h0.42h0.13h0.53h0.44h0.3280h0.3895h0.32110h0.51
w0.12w0.07w0.05w0.03w0.07 w0.11 w0.04 w0.03
6p0.4821p0.5536p0.6551p0.6066p0.44 p0.48 p0.62 p0.87
h0.43h0.39h0.32h0.37h0.4181h0.4796h0.36111h0.05
w0.09w0.06w0.03w0.03w0.15 w0.05 w0.02 w0.08
7p0.6822p0.8037p0.4952p0.6167p0.43 p0.72 p0.48 p0.80
h0.18h0.06h0.46h0.37h0.5182h0.2397h0.47112h0.02
w0.14w0.14w0.05w0.02w0.06 w0.05 w0.05 w0.18
8p0.5323p0.8238p0.9153p0.5968p0.28 p0.46 p0.66 p0.50
h0.39h0.08h0.03h0.39h0.6883h0.5198h0.34113h0.46
w0.08w0.10w0.06w0.02w0.04 w0.03 w0.00 w0.04
9p0.6924p0.4939p0.5254p0.4569p0.74 p0.41 p0.82 p0.51
h0.24h0.49h0.36h0.44h0.0884h0.5799h0.03114h0.41
w0.07w0.02w0.12w0.11w0.18 w0.02 w0.15 w0.08
10p0.5225p0.5340p0.5655p0.4270p0.93 p0.52 p0.87 p0.61
h0.36h0.42h0.44h0.52h0.0585h0.48100h0.10115h0.38
w0.12w0.05w0.00w0.06w0.02 w0.00 w0.03 w0.01
11p0.6826p0.7141p0.9556p0.7171p0.60 p0.15 p0.92 p0.52
h0.20h0.22h0.02h0.19h0.3586h0.84101h0.04116h0.42
w0.12w0.07w0.03w0.10w0.05 w0.01 w0.04 w0.06
12p0.4727p0.5142p0.5557p0.3472p0.47 p0.59 p0.58 p0.61
h0.51h0.37h0.38h0.64h0.5087h0.16102h0.38117h0.34
w0.02w0.12w0.07w0.02w0.03 w0.25 w0.04 w0.05
13p0.7928p0.8143p0.6658p0.4873p0.47 p0.73 p0.44 p0.41
h0.18h0.11h0.32h0.51h0.4488h0.21103h0.41118h0.48
w0.03w0.08w0.02w0.01w0.09 w0.06 w0.15 w0.11
14p0.4829p0.5544p0.8259p0.5074p0.47 p0.55 p0.45 p0.85
h0.50h0.37h0.07h0.33h0.4389h0.38104h0.15119h0.14
w0.02w0.08w0.11w0.17w0.10 w0.07 w0.40 w0.01
15p0.9230p0.4245p0.3560p0.4275p0.50 p0.38 p0.63 p
h0.03h0.35h0.62h0.53h0.4390h0.54105h0.23120h
w0.05w0.23w0.03w0.05w0.07 w0.08 w0.14 w
Table 6. Error computation based on the percentage of correctly predicted rainy or non-rainy days, testing value in percentage (test), for each weather station identified by the id code (Id).
Table 6. Error computation based on the percentage of correctly predicted rainy or non-rainy days, testing value in percentage (test), for each weather station identified by the id code (Id).
IdtestIdtestIdtestIdtestIdtestIdtestIdtestIdtest
181.21677.63179.64680.26177.07678.99178.210676.6
276.81777.63283.14777.66276.97778.79275.910779.0
378.11881.23376.74876.96376.47876.39374.410879.3
475.31981.23480.64975.06479.17977.29479.310978.8
583.02080.33576.95083.76582.28080.19581.411078.6
681.72177.03676.95182.36678.88181.39681.711179.3
776.52276.73775.45278.86773.18280.29776.911275.9
883.02381.33881.05382.46882.48376.29877.611381.5
982.82478.23982.45481.46982.18479.59979.211481.0
1078.12580.24077.15578.97078.98578.810079.911577.8
1175.92683.34176.75678.87180.78675.910181.711682.6
1278.82777.74283.95782.27279.88776.810282.211781.4
1384.12879.04382.65876.37382.68877.210380.911877.3
1479.42980.34478.05982.77480.88983.110475.611979.1
1577.83081.54581.56073.57581.89077.010581.2120
Table 7. Sensitivity and specificity of the model, in the testing part, amounting to 30% of the total data. True Positive days with rain correctly predicted by the model; True Negative days without rain correctly predicted by the model; False Negative days without rain incorrectly predicted; False Positive days with rain incorrectly predicted.
Table 7. Sensitivity and specificity of the model, in the testing part, amounting to 30% of the total data. True Positive days with rain correctly predicted by the model; True Negative days without rain correctly predicted by the model; False Negative days without rain incorrectly predicted; False Positive days with rain incorrectly predicted.
Cases Cases
True Positive (rain)17,535True Negative (no rain)86,811
False Negative9570False Positive16,574
Sensitivity65%Specificity84%
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Gentilucci, M.; Pambianchi, G. Rainy Day Prediction Model with Climate Covariates Using Artificial Neural Network MLP, Pilot Area: Central Italy. Climate 2022, 10, 120. https://doi.org/10.3390/cli10080120

AMA Style

Gentilucci M, Pambianchi G. Rainy Day Prediction Model with Climate Covariates Using Artificial Neural Network MLP, Pilot Area: Central Italy. Climate. 2022; 10(8):120. https://doi.org/10.3390/cli10080120

Chicago/Turabian Style

Gentilucci, Matteo, and Gilberto Pambianchi. 2022. "Rainy Day Prediction Model with Climate Covariates Using Artificial Neural Network MLP, Pilot Area: Central Italy" Climate 10, no. 8: 120. https://doi.org/10.3390/cli10080120

APA Style

Gentilucci, M., & Pambianchi, G. (2022). Rainy Day Prediction Model with Climate Covariates Using Artificial Neural Network MLP, Pilot Area: Central Italy. Climate, 10(8), 120. https://doi.org/10.3390/cli10080120

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop