Next Article in Journal
A Weibull-Beta Prime Distribution to Model COVID-19 Data with the Presence of Covariates and Censored Data
Previous Article in Journal
On the Sampling Size for Inverse Sampling
Previous Article in Special Issue
Modeling Realized Variance with Realized Quarticity
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A New Predictive Algorithm for Time Series Forecasting Based on Machine Learning Techniques: Evidence for Decision Making in Agriculture and Tourism Sectors

by
Juan D. Borrero
1,*,
Jesús Mariscal
1 and
Alfonso Vargas-Sánchez
2
1
Agricultural Economics Research Group, Department of Management and Marketing, University of Huelva, Pza. de la Merced s/n, 21002 Huelva, Spain
2
Department of Management and Marketing, University of Huelva, Pza. de la Merced s/n, 21002 Huelva, Spain
*
Author to whom correspondence should be addressed.
Stats 2022, 5(4), 1145-1158; https://doi.org/10.3390/stats5040068
Submission received: 24 October 2022 / Revised: 7 November 2022 / Accepted: 9 November 2022 / Published: 16 November 2022
(This article belongs to the Special Issue Modern Time Series Analysis)

Abstract

:
Accurate time series prediction techniques are becoming fundamental to modern decision support systems. As massive data processing develops in its practicality, machine learning (ML) techniques applied to time series can automate and improve prediction models. The radical novelty of this paper is the development of a hybrid model that combines a new approach to the classical Kalman filter with machine learning techniques, i.e., support vector regression (SVR) and nonlinear autoregressive (NAR) neural networks, to improve the performance of existing predictive models. The proposed hybrid model uses, on the one hand, an improved Kalman filter method that eliminates the convergence problems of time series data with large error variance and, on the other hand, an ML algorithm as a correction factor to predict the model error. The results reveal that our hybrid models obtain accurate predictions, substantially reducing the root mean square and absolute mean errors compared to the classical and alternative Kalman filter models and achieving a goodness of fit greater than 0.95. Furthermore, the generalization of this algorithm was confirmed by its validation in two different scenarios.

1. Introduction

At both supply and demand levels, accurate time series prediction techniques are becoming crucial for modern decision support systems. Linear models, such as the autoregressive integrated moving average (ARIMA), are typically employed in many areas of study [1,2,3,4,5,6]. Specifically, seasonal ARIMA (SARIMA) [7] models are being used for forecasting in retail trade [8], tourism demand [9,10], financial sectors [11] or coronavirus disease predictions (COVID-19) [12], in which the seasonal component is manifest.
The state-space model Kalman filter technique has also often been applied for forecasting purposes in different economic fields, such as oil price [13], stock markets [14], COVID-19 [15], fisheries [16] and road traffic [17].
In addition to linear models, several nonlinear algorithms, such as machine learning (ML) techniques, have been developed recently to predict data from time series [18]. Thus, a nonlinear autoregressive (NAR) neural network is a generalization of the ARIMA model with a more complex structure, and therefore it is more robust than linear regressions. We find several applications for this algorithm, including in: the pricing of minerals [19], quantification of waste [20] or water levels [21], displacements in foundation pits [22], COVID-19 [23] and others [24,25,26]. In addition, numerous studies have used support vector regression (SVR) to make time series predictions [27] in different domains, including: energy consumption [28,29], the financial sector [30,31,32], weather [33,34], water [35,36,37,38,39], COVID-19 [40] and education [41].
Finally, recent research has used hybrid models to predict much more diverse situations, such as the demand for flights in the aviation industry [42], electrical energy consumption [43], weather prediction [44] and financial trends [45].
The literature reveals certain limitations due to the inability to adequately detect trend variations. Linear models such as ARIMA and time series forecasting models based on previous data often continue data trends without taking into account potential changes in the future values of the observed variable. For this reason, when a time series presents peaks and changes in trend, those models usually reveal a very characteristic deviation to the right [46,47,48,49,50], and the fit of the prediction curve for the observed variable is less than what is desired [51,52,53,54]. Furthermore, the use of the Kalman filter could lead to a refinement of our prediction, but the implementation of this algorithm requires the simulation of an error term that might cause convergence problems. Although some research analyzes similar issues [55,56,57], this study tries to solve this problem using a new model that modifies the Kalman filter [58] and hybridizes it with ML techniques. The resolution of these problems could be very useful for any time series with few data, without the possibility of capturing information from exogenous variables and convergence problems, as is the case for the agricultural and tourism time series.
Recent agricultural forecasting research has employed ARIMA models [59,60,61], the Kalman filter method [62], NAR [46,63], SVR [64,65,66,67] and hybridized models [68,69,70,71]. This research does not provide conclusive results, except that ML-based models are widely used when large datasets and numerous variables are available, such as those that capture remote sensing data [72,73,74].
On the other hand, time series models are also widely applied in tourism demand forecasting [75,76], typically using ARIMA and SARIMA models [77,78,79,80,81,82], the Kalman filter econometric-based methodology [83,84] and artificial intelligence-based methods [85,86] as [87] pointed out. To date, hybrid methods in tourism forecasting are not universally preferred [88].
The main purpose of this research is to use the same idea described in the hybrid models built over ARIMA processes but with a double correction: first, a correction of the prediction on the observed variable using a Kalman filter [58] and its modified version [89] to solve the converge problems, and second, a correction of the system error with a nonlinear component. Therefore, this study tries to correct the deviation to the right by applying a hybridization with ML techniques, thus developing an efficient prediction system through a simple process using only previously gathered data without exogenous variables or complex models.
To verify the advantages of our hybrid model, it is applied to berry production and tourism demand time series. These are two economic activities with a noticeable seasonal character. Although little is known about berry yield forecasting from a variable perspective over time, the berry sector also presents seasonal productions that have not been well-studied in the literature even though they are important in the raw material markets [62]. Additionally, when reviewing the literature on tourism and passenger transportation demand forecasting from 2007 to 2017, it was noticed that seasonality and contingencies are the main obstacles to achieving more accurate forecasts [75].
Finally, we compared the root mean square error (RMSE), mean absolute error (MAE), mean absolute scaled error (MASE) and goodness-of-fit R2 between hybridized and non-hybridized models.

2. Material and Methods

To demonstrate the full functionality of the new hybrid model developed, it is fitted using four datasets.

2.1. Data

Spain represents the highest proportion (40.1%) of land dedicated to fruit production in the European Union (EU-28) [90]. A daily berry yield dataset of strawberry and raspberry fruits was extracted from three large coops that operate in Huelva, situated in the southwestern part of Spain, corresponding to three periods, i.e., (2017–2018), (2018–2019) and (2019–2020). To ensure that the seasonal periods were equal, September 1 was chosen as the start date and July 15 of the following year as the end date for each agricultural season. The first two seasons were used as training datasets and the last as the test dataset to evaluate the results.
The dataset corresponding to the tourism sector of Huelva, extracted from the Hotel Occupancy Survey [91], comprises the number of travelers and monthly overnight stays, our predictor variables, for the period between 1999 and 2019 and contains 252 observations. Unfortunately, the year 2020 was not considered due to the exceptionality caused by the COVID-19 pandemic, with figures that are non-reflective of the normal evolution of the sector.
ARIMA models are especially useful for managing time series that exhibit seasonal patterns. Seasonality is a typical feature of both the agriculture and tourism industry and is quite pronounced in the case of Huelva. The berry yield time series datasets have a seasonal component with a lag of 317 days. With tourism highly marked by the sun and beach segment, the summer months are the ones in which tourist activity is most concentrated. Moreover, volumes of travelers and overnight stays in hotels are variables that are usually employed to quantify such seasonality and evolution. Taking the year 2019 as a reference, the seasonal ratio, or the quotient between the highest and the lowest number of overnight stays monthly reached in August and January, respectively [92], was 12.424. Again, the seasonality rate, the quotient between the sum of the overnight stays in the three months with the highest influx and the total number of overnight stays for the year, was 0.479, i.e., 47.9% of the overnight stays in hotels, ranked from July to September.

2.2. The ARIMA and Kalman Filter Linear Models

Firstly, an autoregressive integrated moving average (ARIMA) model was used as a base research model. Let Yt be a stationary stochastic process with a mean of zero and differentiated d ϵ N times. The ARIMA (p,d,q) process is defined as
Y t = i = 1 p ϕ i Y t i + j = 1 q θ j ϵ t j + ϵ t ,
where ϵ t ~ N (0, σ2) is a white noise process associated with the system, p, q ϵ N are the lags chosen for the model, and the parameters of the systems are ϕi, θj ϵ R.
To choose the appropriate values of p and q, we considered the ARIMA process, which minimizes the value of both the Akaike criterion (AIC) and the Bayesian criterion (BIC), automating the process with the R software. On the other hand, to determine the coefficients of the ARIMA process, we applied the Hannan–Rissanen algorithm [93].
Secondly, we used the Kalman filter [58], a recursive algorithm widely used in the fields of economics and econometrics based on a state-space system defined by the system state (2) and the output equation of the system (3):
y t =   Ay t 1 +   B ω t ,
z t =   H   y t +   ν t ,
where y t R n ϵ is the state vector, z t R n the value of the exit and ω t ~ Ns (0, Q) and ν t   ~ Nm (0, R) the white noise processes. Finally, A ϵ Rnxn, B ϵ Rnxs and H ϵ Rmxn are the parameters of the state systems.

2.3. Nonlinear Autoregressive (NAR) Neural Networks and Support Vector Regression (SVR)

The main idea behind a nonlinear autoregressive neural network (NAR) is to eliminate the existing linearity constraint in the ARIMA model looking for an approximation of the form (4)
X t = f   ( X t i , . . .   X t p ) + ϵ t ,
where f is an unknown nonlinear function, Xt is the time series data, p ϵ N is the number of lags used in the estimation, and ϵ t ~ N (0, σ2) is a white noise process.
As NAR neural networks are a type of multilayer perceptron (MLP) with a single output value, and consequently there is no specific method to determine their architecture, we selected a hidden layer [94], by defining our algorithm as the following expression:
X ^ t = φ 2 j = 1 M ω j o φ 1 i = 1 p ω i j + β j + β 0 ,
where p , M Z are the number of selected lags and the number of nodes in the hidden layer, respectively, which have been calculated using a node growth technique seeking to minimize the RMSE. We used the sigmoid function and the hyperbolic tangent function in the hidden layer and the identity function in the output layer as respective activation functions determined by φ 1 , φ 2 [20,22,23,45,95]. Finally, wij, wj0 are the selected weights from input i to hidden unit j and from hidden unit j to output o, respectively, and β j , β 0 . R are the respective biases units.
We chose the backpropagation algorithm to train our NAR neural network and to fix the weights.
Support vector regression (SVR) is a ML model based on finding a nonlinear regression between N pairs of elements x t , y t ,   where x t R n are the input vectors and y t R are their corresponding outputs, like the one described in Equation (6):
y ^ i = w Ψ x i + b ,
where Ψ is an unknown nonlinear function, w R n are the weights of the model and b R is its respective bias. To find the values of w , we trained our model using the following loss function:
min w   1 2   w 2 + C i = 1 N   y ^ i y i ε
with ϵ ,   C > 0 . This definition means that we consider the error to be zero for all estimated values with a distance from their true value of less than ϵ. Therefore, by including slack variables in the loss function and applying the Lagrange multipliers method, we obtain a new equation to minimize, described in Equation (8).
min α , α * 1 2 i = 1 N j = 1 N α i α i * α j α j * k x i , x j + i = 1 N   α i α i * y i + i = 1 N α i α i * ε ,
subject to
i = 1 N   α i α i * 0   α i , α i *   C
The values α i α i * are known as support vectors, α i , α i * are the Lagrange multipliers, and k functions as the kernel function. Finally, if we use the results obtained in (8) and (9) in Equation (6), our final approximation is represented by Equation (10):
y ^ i = j = 1 N   α j α j * k x i , x j + b ,
In our research, we used the Gaussian function as the kernel function
k x i , x j = e γ x i x j 2 ,
and we calculated the value of parameters γ, C, ε ϵ R using the package e1071 of R programming language [96].
Lastly, to determine the dimension of the input vector, we applied a growth technique like the one used in the NAR model, which involves increasing the number of input dimensions by one unit at each step and calculating the RMSE value until finding the input dimension that minimizes the RMSE value.

2.4. The Hybrid Models

In general, the introduction of an ARIMA process in the state space implies convergence problems when the variance of the associated white noise process is large. For this reason, in our study, we compared the classical method of introduction in the state-space [97,98], (KF), with the new inclusion method described in [89], named alternative Kalman filter (AFK), where converge problems are fixed, and hybridized it with ML techniques.
The main idea was to use the NAR and SVR models as correction factors to predict the error of the AKF model. So, let Yt be a time series and ϵ t ~N (0, σ2) its white noise process associated with the AKF model; the hybrid system is described by Equation (12):
Y ^ t + 1 = L ^ t + 1 + ϵ ^ t + 1 ,
where L ^ t+1 is the linear value associated with the AKF prediction and ϵ ^ t + 1 is an error estimation calculated by the NAR or the SVR model.

3. Results

In this section, weekly results from the two berry crops and the two tourism demand datasets are examined and analyzed, establishing a comparison among the (1) KF, (2) AKF, (3) AKF-SVR and (4) AKF-NAR hybrid models. The parameters on which each model depends are described in Section 2.
The criteria used for the comparisons were the RMSE, MAE, MASE and goodness-of-fit R2 as there was a particular interest in error minimization and a higher number of parameters was not penalized, which is a requirement of a nonlinear process.
Finally, all our experiments were programmed using a CPU with a 2.6GHz i-7 processor and 16 GB of ram memory. On the other hand, as mentioned in Section 2, all the algorithms were programmed in R software using mainly the e1071, forecast and tseries packages.

3.1. Results for Berry Datasets

The weekly strawberry season forecast was built on an ARIMA (3,1,1). For the AKF-SVR model, the Gaussian kernel function with γ = 0.1, C = 1 and ε = 0.1 was used. Moreover, eight inputs were considered. For the AKF-NAR model, the hyperbolic tangent activation function was employed in the hidden layer, η = 0.01, p = 4 and M = 1.
Table 1 summarizes the results obtained for the 2019–2020 season. It was noticed that the models that best approximated the real values of the strawberry season were the hybrid AKF-SVR and AKF-NAR.
Similarly, Table 2 summarizes the results for the raspberry time series data. This predictive system was built on an ARIMA (1,1,1) using the Gaussian kernel function with γ = 0.071, C = 1, ε = 0.1 and 14 inputs for the AKF-SVR model. However, for the AKF-NAR model, the hyperbolic tangent function as an activation function was used in the hidden layer, η = 0.1, p = 16 and M = 1. Following the same criteria, the best predictive system was the hybrids.
Figure 1 and Figure 2 demonstrate a comparison of the four predictive models with respect to the observed real variable for each crop.

3.2. Results for Tourism Datasets

To predict the number of nights spent in Huelva Province hotels (2017–2019), an ARIMA (1,1,1) was used. The Gaussian kernel function with γ = 0.1, C = 1 and ε = 0.1 with 10 inputs was employed for the AKF-SVR model. The sigmoid function served as the activation function in the hidden layer. η = 0.1, p = 10 and M = 1 for the AKF-NAR model.
The best model was still the hybrid AKF-SVR (Table 3 and Figure 3). Identical results were obtained for visitors (Table 4 and Figure 4).
In conclusion, for all the datasets analyzed, the hybrid models developed improved the results over the linear ones, with the AKF-SVR algorithm showing the best performance.

4. Discussion

Different forecasting systems were analyzed by combining an ARIMA model with more sophisticated algorithms, such as a new approach to the classical Kalman filter combined with ML processes.
The results reveal that hybrid systems combining the AKF with error correction through a NAR neural network or an SVR model achieved higher accuracy than classical forecasting models. These results are better in the four time series analyzed for the two economic sectors. This is a simple model not seen in the literature in which the classical Kalman filter algorithm is modified, and the model error is estimated by a machine learning algorithm.
Three interesting outcomes were identified when the model results were compared with similar research that used different prediction models over the time series, such as ARIMA, Kalman filter, SVR or NAR.
First, our algorithm showed a better fit to the real data than those used in other studies [4,20,46,52,69,99,100,101,102], as we can confirm by the values of R2.
Second, we can see that in Kalman-filter-based models and in other studies based on linear predictive models, prediction plots exhibit a deviation to the right since these predictive models do not quickly capture trend variation [1,16,20,28,49,50,71,103]. However, our hybrid models corrected for this standard deviation by capturing trend changes (Figure 1, Figure 2, Figure 3 and Figure 4).
Finally, after analyzing the time series data, it was evident that they had a relevant seasonal component. Consequently, a SARIMA model may be an excellent model to apply in time series with that peculiarity. However, when considering recent research in which that model has been used [8,9,10,11,12,104], it is seen that monthly or daily data with the short seasonal component are used; i.e., the seasonal component is not far from the data to be predicted. In the berry dataset, the seasonal component has a lag of approximately 317 days, making its use computationally unfeasible since the computation time outstrips the advantage of its usage.

5. Conclusions, Limitations and Future Research

In this study, different predictive methods used to obtain a weekly yield forecast of strawberries and raspberries during an agricultural season and monthly tourism forecasts for both visitors and overnight stays have been developed and compared.
An alternative method for introducing the ARIMA model in a state-space system that eliminates randomness in the algorithm and uses only the previous data and the error produced by the system was developed. In addition, ML techniques such as NAR and SVR were introduced in the model to correct the system error and improve performance.
First, based on the acquired results, it can be concluded that these new hybrid models that combine both techniques are the best for predicting weekly berry production and tourism demand, substantially reducing the RMSE, MAE and MASE and achieving a goodness of fit greater than 0.95 in all cases.
Secondly, predictive models applied on time series that, as in the case of the agriculture and tourism sectors analyzed in this research, have a seasonal component and convergence problems, show little reliability at the points of trend variation. Therefore, the time series forecasting method developed is also considered useful for time series data with a significant standard deviation in the white noise.
Finally, an increased computing capacity and automated data accessibility make it possible to implement machine-learning-based techniques to support decision making. However, for these algorithms to work properly they need a lot of data, and the advantage of the algorithm we present is that it does not need a lot of data or complex models with many variables, which makes it easy for practitioners to implement and use.
The models developed still have some weaknesses, such as poor forecasting of the maximum, minimum and turning points, which could be an idea for further research. Secondly, we only analyzed four time series from two sectors. We propose this predictive algorithm be tested with other time series data [105]. Thirdly, it could be interesting to add exogenous variables to the ARIMA model [106,107,108] or use a nonlinear autoregressive exogenous model (NARX) [109,110,111,112,113,114,115]. Finally, this approach could be compared with other research that uses different ML models than those presented here, such as extreme learning machines (ELMs), convolutional neural networks (CNNs), long short-term memory (LSTM) or singular-spectrum analysis (SSA), among others [116,117,118,119,120,121,122,123].

Author Contributions

Conceptualization, J.D.B.; methodology, J.D.B.; validation, J.D.B.; formal analysis, J.D.B.; investigation, J.D.B., J.M. and A.V.-S.; resources, J.D.B. and A.V.-S.; data curation, J.D.B., J.M. and A.V.-S.; writing—original draft preparation, J.D.B. and J.M.; writing—review and editing, J.D.B. and A.V.-S. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Restrictions apply to the availability of these data. Data were obtained from a third party. The data are not publicly available due to privacy concerns.

Acknowledgments

The authors acknowledge the support provided by the companies that released the data used for the analysis.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Benvenuto, D.; Giovanetti, M.; Vassallo, L.; Angeletti, S.; Ciccozzi, M. Application of the arima model on the covid-2019 epidemic dataset. Data Brief 2020, 29, 105340. [Google Scholar] [CrossRef] [PubMed]
  2. Grogger, J. Soda taxes and the prices of sodas and other drinks: Evidence from Mexico. Am. J. Agric. Econ. 2017, 99, 481–498. [Google Scholar] [CrossRef]
  3. Hernandez-Matamoros, A.; Fujita, H.; Hayashi, T.; Perez-Meana, H.H. Forecasting of covid19 per regions using arima models and polynomial functions. Appl. Soft Comput. 2020, 96, 106610. [Google Scholar] [CrossRef] [PubMed]
  4. Jamil, R. Hydroelectricity consumption forecast for Pakistan using ARIMA modeling and supply-demand analysis for the year 2030. Renew. Energy 2020, 154, 1–10. [Google Scholar] [CrossRef]
  5. Melchior, C.; Zanini, R.R.; Guerra, R.R.; Rockenbach, D.A. Forecasting Brazilian mortality rates due to occupational accidents using autoregressive moving average approaches. Int. J. Forecast. 2020, 37, 825–837. [Google Scholar] [CrossRef]
  6. Yang, H.; O’Connell, J.F. Short-term carbon emissions forecast for aviation industry in shanghai. J. Clean. Prod. 2020, 275, 122734. [Google Scholar] [CrossRef]
  7. Geurts, M.D.; Ibrahim, I. Comparing the box-jenkins approach with the exponentially smoothed forecasting model application to Hawaii tourists. J. Mark. Res. 1975, 12, 182–188. [Google Scholar] [CrossRef]
  8. García, J.R.; Pacce, M.; Rodrigo, T.; de Aguirre, P.R.; Ulloa, A.C. Measuring and forecasting retail trade in real time using card transactional data. Int. J. Forecast. 2021, 37, 1235–1246. [Google Scholar] [CrossRef]
  9. Guizzardi, A.; Pons, F.M.E.; Angelini, G.; Ranieri, E. Big data from dynamic pricing: A smart approach to tourism demand forecasting. Int. J. Forecast. 2021, 37, 1049–1060. [Google Scholar] [CrossRef]
  10. He, K.; Ji, L.; Wu, C.W.D.; Tso, K.F.G. Using SARIMA–CNN–LSTM approach to forecast daily tourism demand. J. Hosp. Tour. Manag. 2021, 49, 25–33. [Google Scholar] [CrossRef]
  11. Li, D.; Jiang, F.; Chen, M.; Qian, T. Multi-step-ahead wind speed forecasting based on a hybrid decomposition method and temporal convolutional networks. Energy 2022, 238, 121981. [Google Scholar] [CrossRef]
  12. Sekadakis, M.; Katrakazas, C.; Michelaraki, E.; Kehagia, F.; Yannis, G. Analysis of the impact of COVID-19 on collisions, fatalities and injuries using time series forecasting: The case of Greece. Accid. Anal. Prev. 2021, 162, 106391. [Google Scholar] [CrossRef] [PubMed]
  13. Aamir, M.; Shabri, A. Modelling and forecasting monthly crude oil price of Pakistan: A comparative study of arima, garch and arima kalman model. AIP Conf. Proc. 2016, 1750, 060015. [Google Scholar] [CrossRef] [Green Version]
  14. Das, S.; Barai, P. Time-varying industry beta in Indian stock market and forecasting errors. Int. J. Emerg. Mark. 2015, 10, 521–534. [Google Scholar] [CrossRef]
  15. Muhammad, A. Using the kalman filter with arima for the covid-19 pandemic dataset of Pakistan. Data Brief 2020, 31, 105854. [Google Scholar] [CrossRef]
  16. Selvaraj, J.J.; Arunachalam, V.; Coronado-Franco, K.V.; Romero-Orjuela, L.V.; Ramírez-Yara, Y.N. Time-series modeling of fishery landings in the Colombian Pacific Ocean using an arima model. Reg. Stud. Mar. Sci. 2020, 39, 101477. [Google Scholar] [CrossRef]
  17. Emami, A.; Sarvi, M.; Bagloee, S.A. Using Kalman filter algorithm for short-term traffic flow prediction in a connected vehicle environment. J. Mod. Transp. 2019, 27, 222–232. [Google Scholar] [CrossRef] [Green Version]
  18. Storm, H.; Baylis, K.; Heckelei, T. Machine learning in agricultural and applied economics. Eur. Rev. Agric. Econ. 2020, 47, 849–892. [Google Scholar] [CrossRef] [Green Version]
  19. Wang, Z.-X.; Zhao, Y.; He, L. Forecasting the monthly iron ore import of china using a model combining empirical mode decomposition, non-linear autoregressive neural network, and autoregressive integrated moving average. Appl. Soft Comput. 2020, 94, 106475. [Google Scholar] [CrossRef]
  20. Sunayana, C.; Kumar, S.; Kumar, R. Forecasting of municipal solid waste generation using non-linear autoregressive (nar) neural models. Waste Manag. 2021, 121, 206–214. [Google Scholar] [CrossRef]
  21. Alsumaiei, A.A.; Alrashidi, M.S. Hydrometeorological drought forecasting in hyper-arid climates using nonlinear autoregressive neural networks. Water 2020, 12, 2611. [Google Scholar] [CrossRef]
  22. Sun, Z.; Li, K.; Li, Z. Prediction of horizontal displacement of foundation pit based on nar dynamic neural network. IOP Conf. Ser. Mater. Sci. Eng. 2020, 782, 042032. [Google Scholar] [CrossRef] [Green Version]
  23. Khan, E.; Mohammad, F.; Gupta, R. Arima and nar based prediction model for time series analysis of COVID-19 cases in India. J. Saf. Sci. Resil. 2020, 1, 12–18. [Google Scholar] [CrossRef]
  24. Rodrigues, C.P.; Awe, O.O.; Pimentel, J.S.; Mahmoudvand, R. Modelling the Behaviour of Currency Exchange Rates with Singular Spectrum Analysis and Artificial Neural Networks. Stats 2020, 3, 12. [Google Scholar] [CrossRef]
  25. Taheri, S.; Brodie, G.; Gupta, D. Optimised ANN and SVR models for online prediction of moisture content and temperature of lentil seeds in a microwave fluidised bed dryer. Comput. Electron. Agric. 2021, 182, 106003. [Google Scholar] [CrossRef]
  26. Yu, Z.; Yang, K.; Luo, Y.; Shang, C. Spatial-temporal process simulation and prediction of chlorophyll-a concentration in dianchi lake based on wavelet analysis and long-short term memory network. J. Hydrol. 2020, 582, 124488. [Google Scholar] [CrossRef]
  27. Ju, X.; Cheng, M.; Xia, Y.; Quo, F.; Tian, Y. Support vector regression and time series analysis for the forecasting of bayannur’s total water requirement. Procedia Comput. Sci. 2014, 31, 523–531. [Google Scholar] [CrossRef] [Green Version]
  28. Valente, J.M.; Maldonado, S. Svr-ffs: A novel forward feature selection approach for high-frequency time series forecasting using support vector regression. Expert Syst. Appl. 2020, 160, 113729. [Google Scholar] [CrossRef]
  29. Yu, L.; Liang, S.; Chen, R.; Lai, K.K. Predicting monthly biofuel production using a hybrid ensemble forecasting methodology. Int. J. Forecast. 2019, 38, 3–20. [Google Scholar] [CrossRef]
  30. Chen, W.; Xu, H.; Jia, L.; Gao, Y. Machine learning model for bitcoin Exchange rate prediction using economic and technology determinants. Int. J. Forecast. 2021, 37, 28–43. [Google Scholar] [CrossRef]
  31. Hess, A.; Spinler, H.S.; Winkenbach, M. Real-time demand forecasting for an urban delivery platform. Transp. Res. Part E Logist. Transp. Rev. 2021, 145, 102147. [Google Scholar] [CrossRef]
  32. Jin, Z.; Guo, K.; Sun, Y.; Lai, L.; Liao, Z. The industrial asymmetry of the stock price prediction with investor sentiment: Based on the comparison of predictive effects with svr. J. Forecast. 2020, 39, 1166–1178. [Google Scholar] [CrossRef]
  33. Das, P.; Chanda, K. Bayesian network based modeling of regional rainfall from multiple local meteorological drivers. J. Hydrol. 2020, 591, 125563. [Google Scholar] [CrossRef]
  34. Dhiman, H.S.; Deb, D.; Guerrero, J.M. Hybrid machine intelligent svr variants for wind forecasting and ramp events. Renew. Sustain. Energy Rev. 2019, 108, 369–379. [Google Scholar] [CrossRef]
  35. Abbasi, M.; Farokhnia, A.; Bahreinimotlagh, M.; Roozbahani, R. A hybrid of random forest and deep auto-encoder with support vector regression methods for accuracy improvement and uncertainty reduction of long-term streamflow prediction. J. Hydrol. 2020, 597, 125717. [Google Scholar] [CrossRef]
  36. Rahim, B.; Aalami, M.T.; Adamowski, J. Coupling a hybrid cnn-lstm deep learning model with a boundary corrected maximal overlap discrete wavelet transform for multiscale lake water level forecasting. J. Hydrol. 2021, 598, 126196. [Google Scholar] [CrossRef]
  37. Lee, T.; Shin, J.-Y.; Kim, J.-S.; Singh, V.P. Stochastic simulation on reproducing long-term memory of hydroclimatological variables using deep learning model. J. Hydrol. 2020, 582, 124540. [Google Scholar] [CrossRef]
  38. Piri, J.; Pirzadeh, B.; Keshtegar, B.; Givehchi, M. A hybrid statistical regression technical for prediction wastewater inflow. Comput. Electron. Agric. 2021, 184, 106115. [Google Scholar] [CrossRef]
  39. Wu, T.; Zhang, W.; Jiao, X.; Guo, W.; Hamoud, Y.A. Evaluation of stacking and blending ensemble learning methods for estimating daily reference evapotranspiration. Comput. Electron. Agric. 2021, 184, 106039. [Google Scholar] [CrossRef]
  40. Balli, S. Data analysis of covid-19 pandemic and short-term cumulative case forecasting using machine learning time series methods. Chaos Solit. Fract. 2021, 142, 110512. [Google Scholar] [CrossRef]
  41. Pimentel, S.J.; Ospina, R.; Ara, A. Learning Time Acceleration in Support Vector Regression: A Case Study in Educational Data Mining. Stats 2021, 4, 41. [Google Scholar] [CrossRef]
  42. Xu, S.; Chan, H.K.; Zhang, T. Forecasting the demand of the aviation industry using hybrid time series sarima-svr approach. Transport. Res. Part E Logist. Transport. Rev. 2018, 122, 169–180. [Google Scholar] [CrossRef]
  43. Nichiforov, C.; Stamatescu, I.; Fagarasan, I.; Stamatescu, G. Energy consumption forecasting using arima and neural network models. In Proceedings of the 2017 5th International Symposium on Electrical and Electronics Engineering (ISEEE), Galati, Romania, 20–22 October 2017; pp. 1–4. [Google Scholar] [CrossRef]
  44. Khan, M.; Hayet, M.M.; Muhammad, N.S.; El-Shafie, A. Wavelet based hybrid ann-arima models for meteorological drought forecasting. J. Hydrol. 2020, 590, 125380. [Google Scholar] [CrossRef]
  45. Li, Z.; Han, J.; Song, A.Y. On the forecasting of high frequency financial time series based on arima model improved by deep learning. J. Forecast. 2020, 39, 1081–1097. [Google Scholar] [CrossRef]
  46. Abraham, E.R.; Mendes dos Reis, J.G.; Vendrametto, O.; de Oliveira Costa Neto, P.L.; Toloi, R.C.; de Souza, A.G.; de Oliveira Morais, M. Time series prediction with artificial neural networks: An analysis using brazilian soybean production. Agriculture 2020, 10, 475. [Google Scholar] [CrossRef]
  47. Chu, X.; Li, Y.; Tian, D.; Feng, J.; Mu, W. An optimized hybrid model based on artificial intelligence for grape price forecasting. Br. Food J. 2019, 121, 3247–3265. [Google Scholar] [CrossRef]
  48. Mahto, A.K.; Alam, M.A.; Biswas, R.; Ahmed, J.; Alam, S.I. Short-term forecasting of agriculture commodities in context of Indian market for sustainable agriculture by using the artificial neural network. J. Food Qual. 2021, 5, 9906. [Google Scholar] [CrossRef]
  49. Maldaner, L.F.; Corredo, L.d.; Canata, T.F.; Molin, J.P. Predicting the sugarcane yield in real-time by harvester engine parameters and machine learning approaches. Comput. Electron. Agric. 2021, 181, 105945. [Google Scholar] [CrossRef]
  50. Yin, H.; Jin, D.; Gu, Y.H.; Park, C.J.; Han, S.K.; Yoo, S.J. Stl-attlstm: Vegetable price forecasting using stl and attention mechanism-based lstm. Agriculture 2020, 10, 612. [Google Scholar] [CrossRef]
  51. Abbas, F.; Afzaal, H.; Farooque, A.A.; Tang, S. Crop yield prediction through proximal sensing and machine learning algorithms. Agronomy 2020, 10, 1046. [Google Scholar] [CrossRef]
  52. Castillo, C.; Pérez, R.; Vallejo-Orti, M. The impact of recent gully filling practices on wheat yield at the campiña landscape in southern Spain. Soil Tillage Res. 2021, 212, 105041. [Google Scholar] [CrossRef]
  53. Esfandiarpour, I.; Karimi, E.; Shirani, H.; Esmaeili, M.; Mosleh, Z. Yield prediction of apricot using a hybrid particle swarm optimizationimperialist competitive algorithm- support vector regression (pso-ica-svr) method. Sci. Hortic. 2019, 257, 108756. [Google Scholar] [CrossRef]
  54. Gomez, D.; Salvador, P.; Sanz-Justo, J.; Casanova, J. Regional estimation of garlic yield using crop, satellite and climate data in Mexico. Comput. Electron. Agric. 2021, 181, 105943. [Google Scholar] [CrossRef]
  55. Rajae, R.; Mélard, G. Autoregressive Models with Time-Dependent Coefficients—A Comparison between Several Approaches. Stats 2022, 5, 46. [Google Scholar] [CrossRef]
  56. Hyoung, L.B. Bootstrap Prediction Intervals of Temporal Disaggregation. Stats 2022, 5, 13. [Google Scholar] [CrossRef]
  57. Macedo, D.P.; Marques, A.C.; Damette, O. The role of electricity flows and renewable electricity production in the behaviour of electricity prices in Spain. Econ. Anal. Policy 2020, 76, 885–900. [Google Scholar] [CrossRef]
  58. Kalman, R.E. A new approach to linear filtering and prediction problems. J. Basic Eng. 1960, 82, 35–45. [Google Scholar] [CrossRef] [Green Version]
  59. Mehmood, Q.; Sial, M.; Riaz, M.; Shaheen, N. Forecasting the production of sugarcane crop of Pakistan for the year 2018-2030, usign box-jenkings methodology. J. Anim. Plant Sci. 2019, 29, 1396–1401. [Google Scholar]
  60. Tatarintsev, M.; Korchagin, S.; Nikitin, P.; Gorokhova, R.; Bystrenina, I.; Serdechnyy, D. Analysis of the Forecast Price as a Factor of Sustainable Development of Agriculture. Agronomy 2021, 11, 1235. [Google Scholar] [CrossRef]
  61. Wang, M. Short-term forecast of pig price index on an agricultural internet platform. Agribusiness 2019, 35, 492–497. [Google Scholar] [CrossRef]
  62. Ewald, C.-O.; Zou, Y. Analytic formulas for futures and options for a linear quadratic jump diffusion model with seasonal stochastic volatility and convenience yield: Do fish jump? Eur. J. Oper. Res. 2021, 294, 801–815. [Google Scholar] [CrossRef]
  63. Xu, X.; Zhang, Y. Corn cash price forecasting with neural networks. Comput. Electron. Agric. 2021, 184, 106120. [Google Scholar] [CrossRef]
  64. Dubois, A.; Teytaud, F.; Verel, S. Short term soil moisture forecasts for potato crop farming: A machine learning approach. Comput. Electron. Agric. 2021, 180, 105902. [Google Scholar] [CrossRef]
  65. Liu, Y.; Duan, Q.; Wang, D.; Zhang, Z.; Liu, C. Prediction for hog prices based on similar sub-series search and support vector regression. Comput. Electron. Agric. 2019, 157, 581–588. [Google Scholar] [CrossRef]
  66. Priyadarshi, R.; Panigrahi, A.; Routroy, S.; Garg, G.K. Demand forecasting at retail stage for selected vegetables: A performance analysis. J. Model. Manag. 2019, 14, 1042–1063. [Google Scholar] [CrossRef]
  67. Shao, Y.; Xiong, T.; Li, M.; Hayes, D.; Zhang, W.; Xie, W. China’s missing pigs: Correcting china’s hog inventory data using a machine learning approach. Am. J. Agric. Econ. 2020, 103, 1082–1098. [Google Scholar] [CrossRef]
  68. Fang, Y.; Guan, B.; Wu, S.; Heravi, S. Optimal forecast combination based on ensemble empirical mode decomposition for agricultural commodity futures prices. J. Forecast. 2020, 39, 877–886. [Google Scholar] [CrossRef]
  69. Gopal, P.; Bhargavi, R. A novel approach for efficient crop yield prediction. Comput. Electron. Agric. 2019, 165, 104968. [Google Scholar] [CrossRef]
  70. Sujjaviriyasup, T.; Komkrit, P. Hybrid arima-support vector machine model for agricultural production planning. Appl. Math. Sci. 2013, 7, 2833–2840. [Google Scholar] [CrossRef]
  71. Wang, B.; Liu, P.; Chao, Z.; Junmei, W.; Chen, W.; Cao, N.; O’Hare, G.; Wen, F. Research on hybrid model of garlic short-term price forecasting based on big data. Comput. Mater. Cont. 2018, 57, 283–296. [Google Scholar] [CrossRef]
  72. Huang, H.; Huang, J.; Feng, Q.; Liu, J.; Li, X.; Wang, X.; Niu, Q. Developing a Dual-Stream Deep-Learning Neural Network Model for Improving County-Level Winter Wheat Yield Estimates in China. Remote Sens. 2022, 14, 5280. [Google Scholar] [CrossRef]
  73. Wang, J.; Si, H.; Gao, Z.; Shi, L. Winter Wheat Yield Prediction Using an LSTM Model from MODIS LAI Products. Agriculture 2022, 12, 1707. [Google Scholar] [CrossRef]
  74. Yli-Heikkila, M.; Wittke, S.; Luotamo, M.; Puttonen, E.; Sulkava, M.; Pellikka, P.; Heiskanen, J.; Klami, A. Scalable Crop Yield Prediction with Sentinel-2 Time Series and Temporal Convolutional Network. Remote Sens. 2022, 14, 4193. [Google Scholar] [CrossRef]
  75. Ghalehkhondabi, I.; Ardjmand, E.; Young, A.W.; Weckman, R.G. A review of demand forecasting models and methodological developments within tourism and passenger transportation industry. J. Tour. Futures 2019, 5, 75–93. [Google Scholar] [CrossRef] [Green Version]
  76. Song, H.; Qiu, R.T.R.; Park, J. A review of research on tourism demand forecasting. Ann. Tour. Res. 2019, 75, 338–362. [Google Scholar] [CrossRef]
  77. Cabrer-Borrás, B.; Iranzo-Pérez, D. El efecto de los atentados del 11-S sobre el turismo en España. Estud. Econ. Apl. 2007, 25, 365–386. [Google Scholar]
  78. Cava, J.A.; Millán, M.G.; Dancausa, M.G. Enotourism in Southern Spain: The Montilla-Moriles PDO. Int. J. Environ. Res. Public Health 2022, 19, 3393. [Google Scholar] [CrossRef]
  79. Gričar, S.; Bojnec, S. Did Human Microbes Affect Tourist Arrivals before the COVID-19 Shock? Pre-Effect Forecasting Model for Slovenia. Int. J. Environ. Res. Public Health 2022, 19, 13482. [Google Scholar] [CrossRef]
  80. Kulendran, N.; Witt, S.F. Forecasting the Demand for International Business Tourism. J. Travel Res. 2003, 41, 265–271. [Google Scholar] [CrossRef]
  81. Safarov, B.; Al-Smadi, H.M.; Buzrukova, M.; Janzakov, B.; Ilieş, A.; Grama, V.; Ilieș, D.C.; Vargáné, K.C.; Dávid, L.D. Forecasting the Volume of Tourism Services in Uzbekistan. Sustainability 2022, 14, 7762. [Google Scholar] [CrossRef]
  82. Turtureanu, A.-G.; Pripoaie, R.; Cretu, C.-M.; Sirbu, C.; Marinescu, E.Ş.; Talaghir, L.; Chițu, F. A Projection Approach of Tourist Circulation under Conditions of Uncertainty. Sustainability 2022, 14, 1964. [Google Scholar] [CrossRef]
  83. Okutani, I.; Yorgos, S.J. Dynamic prediction of traffic volume through Kalman filtering theory. Transp. Res. Part B Methodol. 1984, 18, 1–11. [Google Scholar] [CrossRef]
  84. Qiao, W.; Haghani, A.; Hamedi, M. A Nonparametric Model for Short-Term Travel Time Prediction Using Bluetooth Data. J. Intell. Transp. Syst. Technol. Plan. Oper. 2013, 17, 165–175. [Google Scholar] [CrossRef]
  85. Gričar, S.; Bojnec, S.; Baldigara, T. Insight into Predicted Shocks in Tourism: Review of an Ex-Ante Forecasting. J. Risk Financ. Manag. 2022, 15, 436. [Google Scholar] [CrossRef]
  86. Nguyen, L.Q.; Fernandes, P.O.; Teixeira, J.P. Analyzing and Forecasting Tourism Demand in Vietnam with Artificial Neural Networks. Forecasting 2022, 4, 3. [Google Scholar] [CrossRef]
  87. Goh, C.; Law, R. The Methodological Progress of Tourism Demand Forecasting: A Review of Related Literature. J. Travel Tour. Mark. 2011, 28, 296–317. [Google Scholar] [CrossRef]
  88. Zhang, Y.; Choo, W.C.; Ho, J.S.; Wan, C.K. Single or Combine? Tourism Demand Volatility Forecasting with Exponential Weighting and Smooth Transition Combining Methods. Computation 2022, 10, 137. [Google Scholar] [CrossRef]
  89. Borrero, J.D.; Mariscal, J. Predicting Time SeriesUsing an Automatic New Algorithm of the Kalman Filter. Mathematics 2022, 10, 2915. [Google Scholar] [CrossRef]
  90. De Cicco, A. The Fruit and Vegetable Sector in the EU–A Statistical Overview. EU. Available online: https://ec.europa.eu/eurostat/statistics-explained/index.php?title=The_fruit_and_vegetable_sector_in_the_EU_-_a_statistical_overview (accessed on 24 September 2022).
  91. INE, Instituto Nacional de Estadística. Encuesta de Ocupación Hotelera. Available online: https://www.ine.es/dynt3/inebase/en/index.htm?padre=239 (accessed on 16 March 2022).
  92. De Cantis, S.; Ferrante, M.; Vaccina, F. Seasonal Pattern and Amplitude–a Logical Framework to Analyse Seasonality in Tourism: An Application to Bed Occupancy in Sicilian Hotels. Tour. Econ. 2011, 17, 655–675. [Google Scholar] [CrossRef]
  93. Brockwell, P.J.; Davis, R.A. Introduction to Time Series and Forecasting. In Springer Texts in Statistics, 2nd ed.; Springer: New York, NY, USA, 2006. [Google Scholar]
  94. Trenn, S. Multilayer perceptrons: Approximation order and necessary number of hidden units. IEEE Trans. Neural Netw. IEEE Neural Netw. Counc. 2008, 19, 836–844. [Google Scholar] [CrossRef] [Green Version]
  95. Wongsathan, R.; Isaravuth, S. A hybrid arima and neural networks model for pm-10 pollution estimation: The case of Chiang Mai city moat area. Procedia Comput. Sci. 2016, 86, 273–276. [Google Scholar] [CrossRef]
  96. Meyer, D.; Dimitriadou, E.; Hornik, K.; Weingessel, A.; Leisch, F.; Chang, C.; Lin, C. Misc Functions of the Department of Statistics, Probability Theory Group (Formerly: E1071). 2021; p. e1071. Available online: https://cran.r-project.org/web/packages/e1071/index.html (accessed on 12 October 2022).
  97. Hamilton, J.D. Chapter 50 State-Space Models. In Volume 4 of Handbook of Econometrics; Elsevier: Amsterdam, Netherlands, 1994; pp. 3039–3080. [Google Scholar]
  98. Harvey, A.C. Forecasting, Structural Time Series Models and the Kalman Filter; Cambridge University Press: Cambridge, UK, 1990. [Google Scholar] [CrossRef]
  99. Feng, L.; Wang, Y.; Zhang, Z.; Du, Q. Geographically and temporally weighted neural network for winter wheat yield prediction. Remote Sens. Environ. 2021, 262, 112514. [Google Scholar] [CrossRef]
  100. Youssef, K.; Gökçekus, H.; Alassi, E. Identifying most influencing input parameters for predicting cereal production using an artificial neural network model. Model. Earth Syst. Environ. 2021, 3, 1–14. [Google Scholar] [CrossRef]
  101. Piekutowska, M.; Niedbała, G.; Piskier, T.; Lenartowicz, T.; Pilarski, K.; Wojciechowski, T.; Pilarska, A.A.; Czechowska-Kosacka, A. The application of multiple linear regression and artificial neural network models for yield prediction of very early potato cultivars before harvest. Agronomy 2021, 11, 885. [Google Scholar] [CrossRef]
  102. Shafiee, S.; Lied, L.M.; Burud, I.; Dieseth, J.A.; Alsheikh, M.; Lillemo, M. Sequential forward selection and support vector regression in comparison to lasso regression for spring wheat yield prediction based on uav imagery. Comput. Electron. Agric. 2021, 183, 106036. [Google Scholar] [CrossRef]
  103. Khiem, N.; Minh, N.; Takahashi, Y.; Dong, K.T.P.; Yasuma, H.; Kimura, N. Predicting the price of Vietnamese shrimp products exported to the US market using machine learning. Fish. Sci. 2021, 87, 411–423. [Google Scholar] [CrossRef]
  104. Paredes-Garcia, J.W.; Ocampo-Velázquez, R.V.; Torres-Pacheco, I.; Cedillo-Jiménez, C.A. Price Forecasting and Span Commercialization Opportunities for Mexican Agricultural Products. Agronomy 2019, 9, 826. [Google Scholar] [CrossRef] [Green Version]
  105. Makridakis, S.E.; Assimakopoulos, V. The M4 Competition: 100,000 time series and 61 forecasting methods. Int. J. Forecast. 2020, 36, 54–74. [Google Scholar] [CrossRef]
  106. Abolghasemi, M.; Beh, E.; Tarr, G.; Gerlach, R. Demand forecasting in supply chain: The impact of demand volatility in the presence of promotion. Comput. Ind. Eng. 2020, 142, 106380. [Google Scholar] [CrossRef] [Green Version]
  107. Chiu, L.-Y.; Rustia, D.J.A.; Lu, C.-Y.; Lin, T.-T. Modelling and forecasting of greenhouse whitefly incidence using time-series and arimax analysis. IFAC-Pap. Online 2019, 52, 196–201. [Google Scholar] [CrossRef]
  108. Wiwik, A.; Mahananto, F.; Qamar, A.; Zaini, S.Z.; Boga, K.; Sumaryant, A. Forecasting the price of Indonesias rice using hybrid artificial neural network and autoregressive integrated moving average (hybrid nns-arimax) with exogenous variables. Procedia Comput. Sci. 2019, 161, 677–686. [Google Scholar] [CrossRef]
  109. Alarcon, V.J. Hindcasting and forecasting total suspended sediment concentrations using a narx neural network. Sustainability 2021, 13, 363. [Google Scholar] [CrossRef]
  110. Bucci, A. Cholesky-ann models for predicting multivariate realized volatility. J. Forecast. 2020, 39, 865–876. [Google Scholar] [CrossRef] [Green Version]
  111. Canchala, T.; Alfonso-Morales, W.; Carvajal-Escobar, Y.; Cerón, L.W.; Caicedo-Bravo, E. Monthly rainfall anomalies forecasting for Southwestern Colombia using artificial neural networks approaches. Water 2020, 12, 2628. [Google Scholar] [CrossRef]
  112. Heidari, E.; Daeichian, A.; Sobati, M.A.; Movahedirad, S. Prediction of the droplet spreading dynamics on a solid substrate at irregular sampling intervals: Nonlinear auto-regressive exogenous artificial neural network approach (narx-ann). Chem. Eng. Res. Des. 2020, 156, 263–272. [Google Scholar] [CrossRef]
  113. Ma, Q.; Liu, S.; Fan, X.; Chai, C.; Wang, Y.; Yang, K. A time series prediction model of foundation pit deformation based on empirical wavelet transform and narx network. Mathematics 2020, 8, 1535. [Google Scholar] [CrossRef]
  114. Mustapa, F.R.; Dahlan, N.Y.; Yassin, A.I.M.; Nordin, A.H.M. Quantification of energy savings from an awareness program using narx-ann in an educational building. Energy Build. 2020, 215, 109899. [Google Scholar] [CrossRef]
  115. Yetkin, M.; Kim, Y. Time series prediction of mooring line top tension by the narx and volterra model. Appl. Ocean Res. 2019, 88, 170–186. [Google Scholar] [CrossRef]
  116. Fen, Z.; Niu, W.; Tang, T.; Xu, Y.; Zhang, H. Evolutionary artificial intelligence model via cooperation search algorithm and extreme learning machine for multiple scales nonstationary hydrological time series prediction. J. Hydrol. 2021, 595, 126062. [Google Scholar] [CrossRef]
  117. Gu, Y.H.; Jin, D.; Yin, H.; Zheng, R.; Piao, X.; Yoo, S.J. Forecasting Agricultural Commodity Prices Using Dual Input Attention LSTM. Agriculture 2022, 12, 256. [Google Scholar] [CrossRef]
  118. Hennig, M.; Grafinger, M.; Hofmann, R.; Gerhard, D.; Dumss, S.; Rosenberger, P. Introduction of a time series machine learning methodology for the application in a production system. Adv. Eng. Inform. 2021, 47, 101197. [Google Scholar] [CrossRef]
  119. Kolidakis, S.; Botzoris, G.; Profillidis, V.; Lemonakis, P. 2019. Road traffic forecasting—A hybrid approach combining Artificial Neural Network with Singular Spectrum Analysis. Econ. Anal. Policy 2019, 64, 159–171. [Google Scholar] [CrossRef]
  120. Larrea Porto, M.A.; Irigoyen, E.; Barragán, A.J.; ManuelAndújar, J. Extreme learning machine ensemble model for time series forecasting boosted by pso: Application to an electric consumption problem. Neurocomputing 2020, 452, 140. [Google Scholar] [CrossRef]
  121. Milunovich, G. Forecasting Australia’s real house price index: A comparison of time series and machine learning methods. J. Forecast. 2020, 39, 1098–1118. [Google Scholar] [CrossRef]
  122. Qin, X.; Yin, D.; Dong, X.; Chen, D.; Zhang, S. Passenger Flow Prediction of Scenic Spots in Jilin Province Based on Convolutional Neural Network and Improved Quantile Regression Long Short-Term Memory Network. ISPRS Int. J. Geo-Inf. 2022, 11, 509. [Google Scholar] [CrossRef]
  123. Wang, H.; Zhang, M.; Yang, Y. Machine learning for multiphase flowrate estimation with time series sensing data. Measur. Sens. 2020, 10–12, 100025. [Google Scholar] [CrossRef]
Figure 1. Prediction charts for strawberries: 2019–2020 (weekly).
Figure 1. Prediction charts for strawberries: 2019–2020 (weekly).
Stats 05 00068 g001
Figure 2. Prediction charts for raspberries: 2019–2020 (weekly).
Figure 2. Prediction charts for raspberries: 2019–2020 (weekly).
Stats 05 00068 g002
Figure 3. Prediction charts for total overnight visitors in Huelva: 2017–2019 (monthly).
Figure 3. Prediction charts for total overnight visitors in Huelva: 2017–2019 (monthly).
Stats 05 00068 g003
Figure 4. Prediction charts for total visitors in Huelva: 2017–2019 (monthly).
Figure 4. Prediction charts for total visitors in Huelva: 2017–2019 (monthly).
Stats 05 00068 g004
Table 1. Results for strawberry time series: 2019–2020 (weekly).
Table 1. Results for strawberry time series: 2019–2020 (weekly).
ModelRMSEMAEMASER2
KF90,847.0152,906.001.1470.899
AKF62,269.7436,901.740.8000.953
AKF-NAR61,311.9935,640.530.7730.954
AKF-SVR61,339.7335,147.710.7700.954
Table 2. Results for raspberry time series: 2019–2020 (weekly).
Table 2. Results for raspberry time series: 2019–2020 (weekly).
ModelRMSEMAEMASER2
KF25,323.0012,576.670.8020.938
AKF23,419,5011,037.770.7040.947
AKF-NAR20,425.929324.950.5950.960
AKF-SVR14,248.646772.140.4320.980
Table 3. Results for total overnight visitors in Huelva: 2017–2019 (monthly).
Table 3. Results for total overnight visitors in Huelva: 2017–2019 (monthly).
ModelRMSEMAEMASER2
KF116,602.982,500.060.7080.739
AKF93,020.3370,959.720.6090.834
AKF-NAR98,669.0772,939.550.6260.813
AKF-SVR44,416.7934,270.450.2940.962
Table 4. Results for total visitors in Huelva: 2017–2019 (monthly).
Table 4. Results for total visitors in Huelva: 2017–2019 (monthly).
ModelRMSEMAEMASER2
KF23,131.5417,501.850.7590.775
AKF20,542.4816,267.910.7060.822
AKF-NAR21,328.916,844.930.7310.808
AKF-SVR11,433.179350.80.4060.945
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Borrero, J.D.; Mariscal, J.; Vargas-Sánchez, A. A New Predictive Algorithm for Time Series Forecasting Based on Machine Learning Techniques: Evidence for Decision Making in Agriculture and Tourism Sectors. Stats 2022, 5, 1145-1158. https://doi.org/10.3390/stats5040068

AMA Style

Borrero JD, Mariscal J, Vargas-Sánchez A. A New Predictive Algorithm for Time Series Forecasting Based on Machine Learning Techniques: Evidence for Decision Making in Agriculture and Tourism Sectors. Stats. 2022; 5(4):1145-1158. https://doi.org/10.3390/stats5040068

Chicago/Turabian Style

Borrero, Juan D., Jesús Mariscal, and Alfonso Vargas-Sánchez. 2022. "A New Predictive Algorithm for Time Series Forecasting Based on Machine Learning Techniques: Evidence for Decision Making in Agriculture and Tourism Sectors" Stats 5, no. 4: 1145-1158. https://doi.org/10.3390/stats5040068

APA Style

Borrero, J. D., Mariscal, J., & Vargas-Sánchez, A. (2022). A New Predictive Algorithm for Time Series Forecasting Based on Machine Learning Techniques: Evidence for Decision Making in Agriculture and Tourism Sectors. Stats, 5(4), 1145-1158. https://doi.org/10.3390/stats5040068

Article Metrics

Back to TopTop