Assessment of Hellwig Method for Predictors’ Selection in Groundwater Level Time Series Forecasting

Effective groundwater planning and management should be based on the prediction of available water volume. The complex nature of groundwater systems makes this complicated and requires the use of complex methods. Data-driven models using computational intelligence are becoming increasingly popular in that field. The key issue in predictive modelling is the selection of input variables. Wrocław-Osobowice irrigation fields were a wastewater treatment plant until 2013. The monitoring of groundwater levels is being continued to assess the water relations in that area after the end of their exploitation. The aim of the study was to assess the Hellwig method for predictors’ selection in groundwater level forecasting with support vector regression models. Data covered the daily time series of groundwater level in the period 2015–2019. Obtained models with a root mean squared error (RMSE) of 0.024–0.292 m and r2 of 0.7–0.9 were considered as high quality. Moreover, they showed good prediction ability for high as well as low groundwater values. Additionally, the proposed method is simple, and its implementation only requires access to groundwater level measurement data. It may be useful in groundwater management and planning in terms of actual climate change and threat of water deficits.


Introduction
Precise groundwater level (GWL) forecasting is crucial for efficient groundwater planning and management. Due to the complex nature of groundwater systems, its prediction is a complicated task that requires complex methods. Among them, data-driven models using computational intelligence (CI) methods are becoming increasingly popular and have significant potential [1][2][3][4]. This results from the fact that, in contrast to creating physics-based models which are time-consuming and labor-intensive and require taking into account a large amount of data that describe the modelled phenomenon with the use of complex algorithms, they are much easier to implement. At the same time, the use of artificial intelligence in creating data-driven models consists of predicting the size of the phenomenon by searching for links in sets of historical data and recreating the most frequently occurring patterns [5].
Selection of input variables plays a crucial role in the efficiency of the predicted model [6][7][8]. Literature indicates that the past time series of GWL bring the most important information to GWL time series modelling [9].
Supplementing missing data with time series from neighboring measurement points is a common and recommended practice [10]. Nevertheless, in the field of groundwater level modelling, it has been stated that a unique and individual approach for the study area under consideration is needed. A model suitable for a particular zone may not necessarily be good even for a neighboring area [6]. 2 of 13 In the present research, daily time series were used to produce short-term GWL predictions. However, different applications require data with different time intervals (short-, mid-and long-term). Daily groundwater level forecasts are important, for example, in creating irrigation schedules in regions with water scarce, especially in the period when water consumption by plants is high [11].
In the literature concerning groundwater level modelling, researchers have used many methods of optimal variable selection. For example, Sharafati et al. [12] used the gamma test (GT) to obtain the best input combinations for monthly groundwater level prediction over the Rafsanjan aquifer in Iran by an ensemble machine learning method such as gradient boosting regression (GBR). Wu et al. [13] tested the Boruta method, which is 'a wrapper around the Random Forest classification algorithm', as a preprocessing technique in a hybrid model combining signal decomposition (VMD) with feature extraction (Boruta) and ELM (VMD-Boruta-ELM, called VBELM) for monthly GWL forecasting in the Zhangye Basin, northwest China. Rahman et al. [14] involved XGB (eXtreme Gradient Boosting) and RF (random forests) to create GWL predictions merging machine learning (ML) with wavelet transforms (WTs) for Kumamoto City in southern Japan. Sahoo et al. [8] used an input variable selection method based on singular spectrum analysis (SSA), mutual information, and genetic algorithms to model seasonal groundwater level changes with an automated hybrid artificial neural network (HANN) in the High Plains aquifer system (HPA) and the Mississippi River Valley alluvial aquifer (MRVA), U.S.A. On the other hand, for predictor selection in GWL forecasting, techniques based on correlations (sometimes based on analysis using cross-correlation, autocorrelation, and partial autocorrelation) between input and output variables are implemented in search of less complicated but not less efficient methods. Zhao et al. [15] studied the correlation between predictors and groundwater level before classification and regression tree (CART) modelling of the Shuping landslide in the Three Gorges Reservoir area, China. Similarly, Iqbal et al. [16] used CART modelling for the area located between Ravi River and Sutlej River, Pakistan, using artificial neural networks (ANNs). Osman et al. [17] used cross-correlation to choose the best input variables for extreme gradient boosting (Xgboost), artificial neural networks, and support vector regression predictive GWL models in Selangor, Malaysia. The Hellwig method also fits in correlation techniques for predictor selection. This method is otherwise known as the method of information capacity indicators [18]. Its high efficiency, confirmed in the literature, based on a comparison with other variable selection techniques (zero unitarization method, Technique for Order Preference by Similarity to Ideal Solution; TOPSIS) [19][20][21], prompted the authors to apply it in groundwater level modelling, which has not happened before.
The Osobowice irrigation fields are natural wetlands which have been used for wastewater treatment of the Wrocław agglomeration for over 100 years. Their use for wastewater treatment was one of the first ways to neutralize them. This technology appeared in the second half of the 19th century and consisted of flooding the field surfaces with sewage pretreated in mechanical settling tanks. Partially purified wastewater infiltrated the soil from the surface, and there it was treated by mechanical filtration and biological processes occurring in the soil environment, with the participation of microorganisms. The method was widely used until the end of the 19th century, when its share decreased as a result of modern methods for wastewater treatment in artificial conditions development [22].
The Wrocław-Osobowice wetlands for wastewater treatment are located in the northern part of Wrocław, on the right bank of the river Odra (Figure 1a). The choice of such a location was made taking into account factors related to landforms, ownership relationships, and soil conditions. The first devices were commissioned in 1881. The exploitation started on an area of about 560 ha. It was then systematically expanded to almost 1300 ha until 2013, when water treatment usage was ended. Presently, it is an ecological area, due to the occurrence of a rich fauna and flora. As a result of ceasing flooding the fields with sewage, the amount of water supplying this area significantly decreased. Preliminary analyses enable estimates that in the last years of the wetlands' operation, the amount of sewage discharged was similar to the amount of water from atmospheric precipitation. After 2013, the total amount of water supplying this area decreased as a consequence by one-half, which contributed to the decrease in groundwater level [22].

Hellwig Method
Data from eight of the twelve wells were taken to model groundwater levels in four piezometers where data gaps were noted (in N-8, P-34, and P-43 because of fire in August 2015, whereas in P-10 because of the GWL falling below the measuring instrument level). The choice of which predictors should be used for forecasting was difficult due to the large number of possible combinations to analyze. Eight wells (N-2, N-4, N-6, N-7, P-19, P-22, P-30, P-36) located in the vicinity were used to create forecasts in piezometers N-8, P-34, P-43 and P-10, of which 255 ( = 2 8 -1) possible combinations for each well were obtained, including single, double and triple combinations of predictors, etc., until all eight were used. Creating 255 models for each piezometer would be very work-intensive and timeconsuming, and that is why the Hellwig method was used to create the rank of neighboring well combinations which served as predictors to the forecasting models. The concept is to use explanatory variables strongly dependent on the explained variable and at the same time weakly correlated with each other. However, this is not a strict criterion for the variables' selection; in addition, there is a numerical criterion, the so-called integral capacity of information carriers' combination. In this case, the information carriers are all explanatory variables.
For all received combinations, individual capacity of information carriers was defined according to the following formula: In mid-August 2015, a fire burned several dozen hectares of the Osobowice irrigation field. Twenty-one fire brigades and two planes fought the fire, profusely pouring water on the area. The groundwater level response to this operation was observed in three of the 12 piezometers (N-8, P-43 and P-34). For N-8, its increase was very clear (1.5 m), while for P-43 and P-34 it responded less (0.5 m and 0.02 m, respectively) because of the greater distance from the fire area. Moreover, in the period December 2015-February 2016, in P-10, the GWL fell below the measuring instrument level.
Due to the above, there was a need for reconstruction of groundwater level time series in four wells on the basis of eight neighboring areas. For this purpose, a support vector regression (SVR) method was used. GWL prediction models were created based on daily time series from neighboring wells in the period 1 May 2015-31 August 2019. The aim of the study was to evaluate the Hellwig's method for selecting predictors (the best combinations of eight adjacent wells) for groundwater level forecasting in four other wells where data gaps were noted, using SVR models.
The main contributions of this paper are: 1.
The study of the performance of Hellwig's method for selection of the predictors for groundwater level modelling; Groundwater level prediction in wetlands after wastewater treatment exploitation; 4.
Supplementing the missing groundwater level time series.

Research Area
Groundwater monitoring, which provided data for the analysis contained in this work, was conducted by employees of Wrocław University of Environmental and Life Sciences on behalf of the Municipal Water and Sewage Company in Wrocław, Poland. It served to assess the water relations in the area of irrigation field in the period after termination of their exploitation as a wastewater treatment plant.
Groundwater levels were measured in a network of 31 wells in total, twelve of which were equipped with hydrostatic devices for automatic water level measurement and with recorders, saving data in nonvolatile memory, at a user-programmed time interval (every hour). In the other wells, readings were carried out with a hydrogeological whistle once a week. Only the results of measurements carried out using hydrostatic devices were used in the work. The locations of these 12 wells are shown in Figure 1a, and the location of the Osobowice irrigation field in Poland is presented in Figure 1b.
Research conducted by the Institute of Soil Science and Environmental Protection at Wrocław University of Environmental and Life Sciences showed the occurrence of alluvial soils in the analyzed area. These are mainly brown muds (in the upper layers made of clay, mainly light and medium, lined at a depth of 50-100 cm with sand or gravel) and, in lower layers, sandy muds (made of light loam sands, light loamy and strong loam sands, lined at a depth of 50-100 cm with loose sands and rarely clays). The long-term system of irrigation field exploitation, which consisted of flooding the surface of the area with wastewater rich in nutrients, did not force a deep overgrowth of plant roots deep into the profile. Shallow plants' rooting did not favor the formation of sufficiently deep humus levels. The current methods of field exploitation, without supplying sewage, has changed the soil moisture. Permeable layers' occurrence in the soil profile with deep groundwater levels causes water deficiency in the plants' root zones [23].

Hellwig Method
Data from eight of the twelve wells were taken to model groundwater levels in four piezometers where data gaps were noted (in N-8, P-34, and P-43 because of fire in August 2015, whereas in P-10 because of the GWL falling below the measuring instrument level). The choice of which predictors should be used for forecasting was difficult due to the large number of possible combinations to analyze. Eight wells (N-2, N-4, N-6, N-7, P-19, P-22, P-30, P-36) located in the vicinity were used to create forecasts in piezometers N-8, P-34, P-43 and P-10, of which 255 (S = 2 8 -1) possible combinations for each well were obtained, including single, double and triple combinations of predictors, etc., until all eight were used. Creating 255 models for each piezometer would be very work-intensive and time-consuming, and that is why the Hellwig method was used to create the rank of neighboring well combinations which served as predictors to the forecasting models. The concept is to use explanatory variables strongly dependent on the explained variable and at the same time weakly correlated with each other. However, this is not a strict criterion for the variables' selection; in addition, there is a numerical criterion, the so-called integral capacity of information carriers' combination. In this case, the information carriers are all explanatory variables.
For all received combinations, individual capacity of information carriers was defined according to the following formula: where j is the variable number in the combination under consideration, r j is the correlation coefficient of the potential explanatory variable number j with the explained variable (element of the vector of linear correlation coefficients between the explanatory variables and the explained variable R 0 ), and r ij is the correlation coefficient between the ith and jth potential explanatory variables (elements of the correlation coefficients matrix between potential explanatory variables R). H Q measures the amount of information a variable Xj adds about variable Y in the combination. H Q increases when r j increases, whereas it decreases when the more variable X j is correlated with the other explanatory variables.
The individual capacity of information carriers for all possible combination calculations of the integral capacity of information carriers was calculated according to the formula: where n is the number of predictors, j is the variable number in the combination under consideration, and h j is the individual capacity of information carriers. The integral capacity of information carriers for combination is the sum of the individual capacities of information carriers, which are part of this combination. It is the criterion for choosing the appropriate combination of explanatory variables, and chosen is the one with the highest H Q [18].

SVR Modelling
In the next step, for the four analyzed piezometers, with the two best combinations of predictors according to the Hellwig method, prediction models were created using support vector regression (SVR). The results were compared with the SVR models obtained for the worst combination which brought the least information, and with the model for which all eight predictors were used.
Literature mining indicates that SVM (support vector machine) is the modelling method that gives the best prediction ability, which refers to the single and combination (hybrid) of several models and techniques [9,[24][25][26][27].
SVR is an element of the SVMs derived from the classical perceptron, whose constructs are artificial neural networks. Both SVM and SVR are used to try and find a hyperplane that represents the nature of the data in the best way and control whether it meets the binding requirements of correct element classification for SVM and computational error in a specific range for SVR. In comparison to many other learning methods, the aim of both SVM and SVR is not only to adapt to the learning data, but to do so in such a way that it will enable prediction of the future behavior of the modelled phenomenon in the best way possible. For SVM, to introduce nonlinearity to the model and thus to improve its ability to cope with data that are not separated in a linear way, the so-called kernel trick is used as a standard approach [28]. The non-linear nature of the data may be caught as a result of using Radial Basis Function (RBF). The RBF kernel on two samples, x and y, represented as feature vectors in some input space, is defined as [29]: where x − y 2 is the squared Euclidean distance between the x and y vectors, and σ is a free parameter. For SVR, the values of the co-ordinates of the vector w and the value of the absolute term b are found by solving the following optimization problem: Minimum w maintaining: where w is the length of vector w, w is the weight vector determining the hyperplane that separates points belonging to different classes, b is the absolute term determined during optimization, x i is a set of data belonging to two classes defined by y i variables, y i is the actual value corresponding to the x i input vector, and ε is a non-optimized method parameter that defines the acceptable level of error, i.e., of the difference between the predicted values and those that exist in the learning data.
In the present research, the daily groundwater level was predicted with the use of an SVR supported by an RBF algorithm. The parameters of the SVR model that defined the width of the margin of trust and the maximum value of weight that may be determined for the given vector were: ε = 0.1 and C = 10.0 for all models. The RBF was determined by the gamma parameter γ (width of the kernel function), equaled to 0.5 and 0.33 for the best combinations of predictors, 1.0 for the worst, and 0.125 for all eight wells taken as input variables. To prevent the phenomenon of overlearning, the dataset was divided into learning (75%) and test (25%) subsets.
For comparison purposes and to evaluate the results obtained with SVR, they were compared with artificial neural network (ANN) models using the most popular multilayer perceptron (MLP) model [25]. From several created neural networks, one for each Hellwig combination corresponding to SVR models was chosen for comparison because of the best performance. Multilayer perceptron models with different architectures were created, as stated in Table 1.

Models' Quality Metrics
Models' performance was assessed in both subsets based on the value of squared correlation (r 2 ) for the actual and prognosed values, root mean squared error (RMSE), as well as mean absolute error (MAE) of the models according to the following formulas. Modelling was carried out using the MATLAB and Statistica programs.
where n is the number of observations, x i are the predicted data, and y i are the observed data. The greater the differences between the actual and predicted values were, the greater the error was, and more adjustment was required by the network [30]. Table 2 contains GWL modelling results in the analyzed wells. The following lines present the values of squared correlation (r 2 ) between the observed and predicted time series and the size of models' errors (RMSE, MAE). In columns, the results of modelling in the learning, testing and in both samples together for the two best combinations of predictors, selected by the Hellwig method, for the combination considered the least informative, and for the model based on all eight predictors are placed. The weakest modelling results were obtained for N-8. For the best predictor combination, according to the Hellwig method as well as for all eight predictors, r 2 in the testing subset equaled 0.710-0.774, whereas RMSE was 0.103-0.117 m. For N-8, the best combinations of predictors were two-element combinations N-4, P-22 and N-4, P-36. Results indicated that the least informative combination was a single piezometer N-6, which was also justified by the analysis of the correlation between groundwater levels in N-8 and in particular, individual wells (Table 3). In N-8, it was correlated the least with N-6 and N-2 (r equaled 0.4817 and 0.5919, respectively), and the most with N-4, although r was 0.7472. That was the well with the largest response to the firefighting action in August 2015, and therefore the measurement results differed the most from the other piezometers, which was also confirmed by the forecasting results. GWL at P-10 was predicted with RMSE in the testing subset from 0.049 to 0.099 m for all eight explanatory variables and for two combinations of predictors selected as the best by the Hellwig method (N-2, N-6 and N-2, N-6, P-30). Squared r 2 correlations in these cases were 0.910-0.977 for the testing sets. Groundwater level forecasting in this well, using only a single N-4 time series, provided the SVR model with the lowest quality (RMSE = 0.292 and r 2 = 0.142). Indeed, the measurement results in N-4 were the least correlated with P-10 (r = 0.3637), while the most with N-6 (r = 0.8710) ( Table 3).

Results and Discussion
In the case of P-34, the number of elements in combination with gathering explanatory variables, considered as the best, was larger-they were sets of three (N-4, P-22, P-36) and four elements (N-4, P-22, P-30, P-36). The quality of the forecasting models created on their basis and on the basis of all eight predictors, determined by r 2 and RMSE, was the highest among all those analyzed (0.980-0.986; 0.024-0.029, respectively). The correlation between GWL in P-34 and time series from individual wells was also relatively high, r equaled from 0.7502 to 0.932 (Table 3). The lowest value was obtained in the case of N-4 and the highest for P-22, which was explained by the different distance between piezometers (Figure 1a).
GWL time series in P-43 were forecasted with an accuracy of 0.875-0.936 (r 2 ) and 0.056-0.087 (RMSE) in the testing subsets for a combination based on all eight predictors and for the best combination according to Hellwig method: P-22, P-30, P-36 and P-22, P-36. Groundwater level time series from N-4 added the least amount of information to the prediction model and the correlation coefficient between N-4 and P-43 was 0.3608, while for P-36 and P-43 it was 0.9099 (Table 3). The model's performance (created on the basis of the time series from a single N-4) was very weak: r 2 = 0.084 and RMSE = 0.214.
In all analyzed cases, the results of GWL measurements in individual neighboring wells, relatively away from the well for which the forecast was made, were insufficient to create a prediction model of high quality. However, the results were consistent with the statement that a unique and individual approach for a single well was needed as the model suitable for one might not necessarily be good for neighboring points [6].
It should be emphasized that the use of all eight predictors for modelling provided predictive models of higher quality than the predictors' combinations selected as the best by Hellwig's method. Out of the 255 possible cases, for N-8 it was 116th, for P-10, 105th, 82nd for P-34, and the 114th combination in turn for P-43. This observation required verification in additional research.
Modern approaches to groundwater level prediction, based on computational intelligence methods, have resulted in improving the models' performance and minimization of error volume. The comparison of the results obtained with the use of the proposed method with the findings discussed in the subject literature demonstrated that the twostage approach to daily GWL time series prediction of the combined variables selection technique with SVR is reasonable and brings the expected results. However, using the Hellwig method for this purpose needs more research. Moreover, compared to the results presented below, which concern the similar models' performance, the quality of the created models might be considered as high (RMSE amounted to 0.024-0.292 m in the testing subset).
Due to the limited volume of the paper, the modelling results in graphic form were given only for the N-8 well, for which the highest increase in groundwater level was observed as a result of fire fighting in August 2015. Figures 2-4 summarize the results of GWL time series forecasting (for the testing sample) in N-8 for the best (a) and worst (b) combination of predictors selected by the Hellwig method. It was found that the model for N-8, obtained on the basis of GWL time series from N-4 and P-22, reflected the reality well, although it was misleading to obtain a relatively low coefficient of determination equaled to 0.710. This was due to the fact that the model did not match the peak of August 2015, which was intentional, because it was not the result of precipitation and had to be eliminated to recreate the natural course of the time series (Figure 2a). In addition, outliers were clearly visible in Figure 3a, which was also related to the situation in August 2015. The residual histogram indicates that most numerous (190 and 170, respectively) were the differences between observed and predicted values in the range (0.0; −0.1) and (−0.1; −0.2), and extreme differences were associated with underestimation of the model in the period after the fire (Figure 4a). 82nd for P-34, and the 114th combination in turn for P-43. This observation required verification in additional research.
Modern approaches to groundwater level prediction, based on computational intelligence methods, have resulted in improving the models' performance and minimization of error volume. The comparison of the results obtained with the use of the proposed method with the findings discussed in the subject literature demonstrated that the twostage approach to daily GWL time series prediction of the combined variables selection technique with SVR is reasonable and brings the expected results. However, using the Hellwig method for this purpose needs more research. Moreover, compared to the results presented below, which concern the similar models' performance, the quality of the created models might be considered as high (RMSE amounted to 0.024-0.292 m in the testing subset).
Due to the limited volume of the paper, the modelling results in graphic form were given only for the N-8 well, for which the highest increase in groundwater level was observed as a result of fire fighting in August 2015. Figures 2-4 summarize the results of GWL time series forecasting (for the testing sample) in N-8 for the best (a) and worst (b) combination of predictors selected by the Hellwig method. It was found that the model for N-8, obtained on the basis of GWL time series from N-4 and P-22, reflected the reality well, although it was misleading to obtain a relatively low coefficient of determination equaled to 0.710. This was due to the fact that the model did not match the peak of August 2015, which was intentional, because it was not the result of precipitation and had to be eliminated to recreate the natural course of the time series (Figure 2a). In addition, outliers were clearly visible in Figure 3a, which was also related to the situation in August 2015. The residual histogram indicates that most numerous (190 and 170, respectively) were the differences between observed and predicted values in the range (0.0; −0.1) and (−0.1; −0.2), and extreme differences were associated with underestimation of the model in the period after the fire (Figure 4a). In contrast to Emamgholizadeh et al. [31], Wang et al. [32] and Wunsh et al. [33] found an underestimation of high and overestimation of low GWL values; the scatter plot for N-8 presents exactly the opposite result. However, the observed under-and overestimations were minor, and they did not indicate the necessity of data divisions into low and high clusters.
The obtained results showed that the single time series coming from N-6 carried insufficient information for modelling the groundwater level in N-8 (Figures 2-4b).
To evaluate SVR results, they were compared with ANN multilayer perceptron models. Results obtained with MLP models were even better than those obtained with SVR. RMSE values for the best combinations suggested by Hellwig method and for all input variables equaled from 0.010 to 0.093 (learning and testing subsets), while r 2 results were 0.778-0.997 (Table 4).    In contrast to Emamgholizadeh et al. [31], Wang et al. [32] and Wunsh et al. [33] found an underestimation of high and overestimation of low GWL values; the scatter plot for N-8 presents exactly the opposite result. However, the observed under-and overestimations were minor, and they did not indicate the necessity of data divisions into low and high clusters.
The obtained results showed that the single time series coming from N-6 carried insufficient information for modelling the groundwater level in N-8 (Figures 2-4b).
To evaluate SVR results, they were compared with ANN multilayer perceptron models. Results obtained with MLP models were even better than those obtained with SVR. RMSE values for the best combinations suggested by Hellwig method and for all input variables equaled from 0.010 to 0.093 (learning and testing subsets), while r 2 results were 0.778-0.997 (Table 4).   In contrast to Emamgholizadeh et al. [31], Wang et al. [32] and Wunsh et al. [33] found an underestimation of high and overestimation of low GWL values; the scatter plot for N-8 presents exactly the opposite result. However, the observed under-and overestimations were minor, and they did not indicate the necessity of data divisions into low and high clusters.
The obtained results showed that the single time series coming from N-6 carried insufficient information for modelling the groundwater level in N-8 (Figures 2, 3 and 4b).
To evaluate SVR results, they were compared with ANN multilayer perceptron models. Results obtained with MLP models were even better than those obtained with SVR. RMSE values for the best combinations suggested by Hellwig method and for all input variables equaled from 0.010 to 0.093 (learning and testing subsets), while r 2 results were 0.778-0.997 (Table 4).

Conclusions
Due to the complex nature of groundwater systems, the prediction of its behavior is a complicated task that requires complex methods. Among them, data-driven models using computational intelligence techniques are becoming increasingly popular and have significant potential because they are an alternative for multidimensional physics-based models. Moreover, CI groundwater level forecasting constitutes a modern approach to supporting groundwater management and planning. In the present research, SVR, which is part of the CI method, was used to reconstruct the incomplete time series of wells located in Wrocław-Osobowice wetlands for wastewater treatment, Poland, on the basis of the measurement period: May 1, 2015-August 31, 2019. This study aimed to evaluate the Hellwig method and input select variables. To the best of the author's knowledge, this study is the first attempt of Hellwig method application for GWL time series implemented with an SVR model in order to find the best combination of predictors and to improve the prediction quality.
Results showed that the use of all eight predictors (wells) for modelling provided predictive models of higher quality than the predictors' combinations selected as the best by Hellwig's method. Out of the 255 possible cases, for N-8, it was 116th; for P-10, 105th; 82nd for P-34; and the 114th combination in turn for P-43. Those ambiguous result indicated that more research should be devoted to that area because this observation required verification in additional research.
It seems that the obtained RMSE values for the created models that equaled 0.024-0.292 m and r 2 of 0.7-0.9, might be subjectively considered as acceptable in the field of regional hydrology. Moreover, the created models worked well and showed good prediction ability for high as well as low daily groundwater values. Nevertheless, the comparison with MLP predictions proved to be even more accurate and led to the creation of models with better quality.
Moreover, data originated from a single well located relatively far from the analyzed piezometer are insufficient to create forecasts of high quality. Additionally, selecting the optimal input variables plays a crucial role in GWL time series reconstruction.
Additionally, the proposed method is simple, and its implementation requires access only to groundwater level measurement data. It may be useful in groundwater management and planning in terms of actual climate change and the threat of water deficits. In future studies, the authors are planning to complement the research with GWL predictions on the basis of meteorology data.