This study aimed to evaluate how well stochastic processes and machine learning algorithms predict stock index movements across different sectors. The results revealed significant findings regarding the effectiveness of various modeling techniques in financial forecasting. This discussion section explores the implications of these results, emphasizing important insights and considerations for practitioners and researchers in financial analysis. The results shed light on important observations. Firstly, the metrics of the stochastic processes used in prediction exhibit very high values, approaching 100% for the three sectors studied, except for the GBM model, which fails to surpass the 50% threshold (
Table 1,
Table 2 and
Table 3 and
Table 7,
Table 8 and
Table 9). The lower performance of the GBM model compared to the Vasicek and CIR models can be attributed to the fact that these latter models are improvements upon the GBM. Both the Vasicek and CIR models incorporate the mean-reversion property, allowing them to better capture long-term trends in financial data. Unlike the GBM, which does not explicitly incorporate this feature, the Vasicek and CIR models are specifically designed to model interest rates and other financial phenomena that exhibit a tendency to revert to a historical mean. This integration of the mean-reversion property can, therefore, lead to higher performance for the Vasicek and CIR models compared to the GBM in certain financial contexts. Particularly, the metrics of the CIR model stand out with a value of 99% for all three sectors, exceeding the Vasicek model, which is natural considering that the CIR model is an improvement over the Vasicek model. On the other hand, the metrics of the machine learning algorithms hover around 70%, with a slight advantage for the SVM model, as highlighted by
Bouasabah (
2024) (
Table 4,
Table 5,
Table 6,
Table 7,
Table 8 and
Table 9). It is noteworthy that this study focuses on predicting the next day’s trend of stock indices (rise with +1, fall with −1). For the machine learning algorithms, predicting the target variable is based on three variables (features): OpenClose, HighLow, and DiffVolume. The model is trained on a test dataset, enabling it to predict the next day’s value for different combinations of feature variables. In contrast, prediction based on stochastic processes uses today’s real value to predict tomorrow’s value, which explains the high metrics observed for stochastic processes, as the true value of today’s index is already known. Therefore, the Vasicek and CIR models are excellent if today’s index value is known and one seeks to predict tomorrow’s value. Consequently, these high metric values must be interpreted with caution. It is also important to note that all three stochastic processes provide the predicted value of the index, not just the trend, which sets them apart from machine learning algorithms that only provide the trend. Furthermore, it is worth noting that the parameters of machine learning algorithms are not static and can be dynamically adjusted, unlike the parameters of stochastic models, which are static and calculated for each sample. Another observation we made in this study is that the metrics of stochastic processes reach high values, largely due to the use of the real and current value of the index to predict the future value of the next day. It would be interesting, in a future study, to explore the impact on these metrics if we were to use not the real and current value (today’s real value) of the index, but rather a value predicted by the model (based on the real value of the previous day) to estimate the value of the next day. Similarly, instead of using a real value to estimate the value of the previous day, we could use another predicted value, and so on. By adopting a recursive approach, we could go back in time to determine when it is optimal to stop using the real value of the index without compromising the performance of the stochastic process metrics too much. This is, therefore, a compromise to be established between “the memory of the stochastic model” and thresholds to set for the metrics, an approach that could enrich our understanding of underlying processes and prediction strategies. In summary, confirming the superiority of stochastic processes over machine learning algorithms in absolute terms is difficult to ascertain, but what the study confirms is their performance and superiority only when the value of today is known and one seeks to predict that of tomorrow. Furthermore, considering hyperparameter optimization for machine learning algorithms emerges as a crucial perspective to achieve even higher performance levels
Yang and Shami (
2020). This strategic approach leverages the inherent flexibility of machine learning, providing a pathway to fine-tune models to suit specific market dynamics. In the end, to take advantage of each approach, one can combine stochastic processes and machine learning algorithms as follows:
(1) Utilize stochastic processes if the number of predicted values does not exceed “the model’s memory”: It is worth noting that, in this study, “the memory of the stochastic process” is defined as the number of future index values that the model can predict between day j and day without using real values from the previous day in the prediction, but only the successive predicted values from a given real value at day j without lowering the metric values beyond a certain preset threshold.
(2) Utilize machine mearning algorithms if the number of values to predict exceeds the memory of the stochastic processes used in the prediction. Finally, the practical application of the study’s findings is to guide traders in selecting the appropriate model for predicting the future value of a tracker. Based on our study, the choice of the prediction model is clear and well justified. By following this method, we can leverage both approaches for better prediction quality. As a perspective, a general and absolute comparison of a test dataset could confirm the superiority of one approach over the other. Additionally, extending the analysis to other sectors could show if there is a sector effect on model performance.