Evaluation of Heuristics for Taken’s Theorem Hyper-Parameters Optimization in Time Series Forecasting Tasks †
Abstract
1. Introduction
2. Theoretical Background
2.1. Mutual Information
2.2. Dimension Congruence
2.3. Random Search
2.4. Grid Search
2.5. Fitness Function
2.5.1. MAPE
2.5.2. MSE
2.5.3.
2.6. Support Vector Regression Algorithm (SVR)
- , and are the parallel hyperplanes.
- is located at a distance above .
- is located at a distance below .
- Then and form the hypertube
- The quantity is the minimum possible, subject to
3. Experiments and Results
3.1. SARS-CoV-2 in Mexico (COV)
3.1.1. Bitcoin Price on Bitfinex (BIT)
3.1.2. Air Temperature in Acuitzio del Canje (TEM)
3.1.3. S&P500 Index
3.1.4. Seismic Activity in Michoacán
3.1.5. Atmospheric Carbon Dioxide Concentration
3.2. Experimental Setup
- Set contained the last of the data to be used for testing and calculating the model’s fitness.
- Set contained the last of the data once set had been removed, to be used for hyper-parameter tuning.
- Set consisted of the remaining data to be used for training the parameters.
- Grid search was used to find the C and for all SVR models.
- The grid for C values was in .
- The grid for values was .
- The sets , contained all possible values of and m for each time series, and each procedure (Random Search and Grid Search) had 20 elements (for computational capacity reasons).
- The infimum of these sets was always 2.
- The supremum was always (so that ).
- The distribution used for the random search was always uniform.
- Mutual information + dimension congruence:
- 1.
- Find using the mutual information function from Equation (2) on , and take the minimum.
- 2.
- Find the embedding dimension by selecting the first m that satisfies in Equation (6) with the obtained on .
- 3.
- Train all possible SVRs determined by the elements of on .
- 4.
- Select the model having MAPE on which is the minimum.
- 5.
- Measure the goodness of the selected model using MAPE on .
- Random search and grid search:
- 1.
- Use each element of to train models on .
- 2.
- Select the model having MAPE on which is the minimum.
- 3.
- Measure the goodness of the selected model using MAPE on .
3.3. Results
3.4. Future Work
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Takens, F. Detecting strange attractors in turbulence. In Dynamical Systems and Turbulence, Warwick; Springer: Berlin/Heidelberg, Germany, 1980; pp. 366–381. [Google Scholar]
- Cao, L. Practical method for determining the minimum embedding dimension of a scalar time series. Phys. D Nonlinear Phenom. 1997, 110, 43–50. [Google Scholar] [CrossRef]
- Bergstra, J.; Bengio, Y. Random Search for Hyper-Parameter Optimization; Universite de Montreal: Montreal, QC, Canada, 2012; Available online: https://www.jmlr.org/papers/volume13/bergstra12a/bergstra12a.pdf (accessed on 10 May 2023).
- Yu, S.; Pritchard, M.; Ma, P.L.; Singh, B.; Silva, S. Two-step hyperparameter optimization method: Accelerating hyperparameter search by using a fraction of a training dataset. arXiv 2023, arXiv:2302.03845. [Google Scholar]
- De Myttenaere, A.; Golden, B.; Le Gr, B.; Rossi, F. Mean Absolute Percentage Error for regression models. Neurocomputing 2016, 192, 38–48. [Google Scholar] [CrossRef]
- Lehmann, E.L.; Casella, G. Theory of Point Estimation, 2nd ed.; Springer: New York, NY, USA, 1998; ISBN 978-0-387-98502-2. [Google Scholar]
- Yin, P.; Fan, X. Estimating R2 Shrinkage in Multiple Regression: A Comparison of Different Analytical Methods. J. Exp. Educ. 2001, 69, 203–224. [Google Scholar] [CrossRef]
- Rivas-Perea, P.; Cota-Ruiz, J.; Chaparro, D.G.; Venzor, J.A.P.; Carreón, A.Q.; Rosiles, J.G. Support Vector Machines for Regression: A Succinct Review of Large-Scale and Linear Programming Formulations. Int. J. Intell. Sci. 2013, 3, 5–14. [Google Scholar] [CrossRef]
Series | MAPE-RS | MAPE-GS | MAPE-IC | -RS | -GS | -IC | MSE-RS | MSE-GS | MSE-IC |
---|---|---|---|---|---|---|---|---|---|
COV | 3.6714 | −3.4295 | 0.0005 | ||||||
BIT | 0.1566 | −1.026 | 0.0023 | ||||||
TEM | 0.02007 | −0.3787 | 0.011 | ||||||
S&P | 0.508 | −5.336 | 0.152 | ||||||
TEL | 0.1395 | −0.1061 | 0.001800 | ||||||
CO2 | 0.0491 | 0.3706 | 0.004 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Hernandez-Mazariegos, R.; Ortiz-Bejar, J.; Ortiz-Bejar, J. Evaluation of Heuristics for Taken’s Theorem Hyper-Parameters Optimization in Time Series Forecasting Tasks. Eng. Proc. 2023, 39, 71. https://doi.org/10.3390/engproc2023039071
Hernandez-Mazariegos R, Ortiz-Bejar J, Ortiz-Bejar J. Evaluation of Heuristics for Taken’s Theorem Hyper-Parameters Optimization in Time Series Forecasting Tasks. Engineering Proceedings. 2023; 39(1):71. https://doi.org/10.3390/engproc2023039071
Chicago/Turabian StyleHernandez-Mazariegos, Rodrigo, Jose Ortiz-Bejar, and Jesus Ortiz-Bejar. 2023. "Evaluation of Heuristics for Taken’s Theorem Hyper-Parameters Optimization in Time Series Forecasting Tasks" Engineering Proceedings 39, no. 1: 71. https://doi.org/10.3390/engproc2023039071
APA StyleHernandez-Mazariegos, R., Ortiz-Bejar, J., & Ortiz-Bejar, J. (2023). Evaluation of Heuristics for Taken’s Theorem Hyper-Parameters Optimization in Time Series Forecasting Tasks. Engineering Proceedings, 39(1), 71. https://doi.org/10.3390/engproc2023039071