Previous Issue
Volume 7, September
 
 

Forecasting, Volume 7, Issue 4 (December 2025) – 6 articles

  • Issues are regarded as officially published after their release is announced to the table of contents alert mailing list.
  • You may sign up for e-mail alerts to receive table of contents of newly released issues.
  • PDF is the official format for papers published in both, html and pdf forms. To view the papers in pdf format, click on the "PDF Full-text" link, and use the free Adobe Reader to open them.
Order results
Result details
Section
Select all
Export citation of selected articles as:
19 pages, 769 KB  
Article
Can Simple Balancing Algorithms Improve School Dropout Forecasting? The Case of the State Education Network of Espírito Santo, Brazil
by Guilherme Armando de Almeida Pereira and Kiara de Deus Demura
Forecasting 2025, 7(4), 59; https://doi.org/10.3390/forecast7040059 (registering DOI) - 18 Oct 2025
Abstract
This study evaluates the effect of simple data-level balancing techniques on predicting school dropout across all state public high schools in Espírito Santo, Brazil. We trained Logistic Regression with LASSO (LR), Random Forest (RF), and Naive Bayes (NB) models on first-quarter data from [...] Read more.
This study evaluates the effect of simple data-level balancing techniques on predicting school dropout across all state public high schools in Espírito Santo, Brazil. We trained Logistic Regression with LASSO (LR), Random Forest (RF), and Naive Bayes (NB) models on first-quarter data from 2018–2019 and forecasted dropouts for 2020, with additional validation in 2022. Facing strong class imbalance, we compared three balancing methods—RUS, SMOTE, and ROSE—against models trained on the original data. Performance was assessed using accuracy, sensitivity, specificity, precision, F1, AUC, and G-mean. Results show that the imbalance severely harmed RF and NB trained without balancing, while Logistic Regression remained more stable. Overall, balancing techniques improved most metrics: RUS and ROSE were often superior, while SMOTE produced mixed results. Optimal configurations varied by year and metric, and RUS and ROSE made up most of the best combinations. Although most configurations benefited from balancing, some decreased performance; therefore, we recommend systematic testing of multiple balancing strategies and further research into SMOTE variants and algorithm-level approaches. Full article
Show Figures

Figure 1

25 pages, 6191 KB  
Article
Machine Learning Forecasting of Direct Solar Radiation: A Multi-Model Evaluation with Trigonometric Cyclical Encoding
by Latif Bukari Rashid, Shahzada Zaman Shuja and Shafiqur Rehman
Forecasting 2025, 7(4), 58; https://doi.org/10.3390/forecast7040058 - 17 Oct 2025
Abstract
As the world is shifting toward cleaner energy sources, accurate forecasting of solar radiation is critical for optimizing the performance and integration of solar energy systems. In this study, we explore eight machine learning models, namely, Random Forest Regressor, Linear Regression Model, Artificial [...] Read more.
As the world is shifting toward cleaner energy sources, accurate forecasting of solar radiation is critical for optimizing the performance and integration of solar energy systems. In this study, we explore eight machine learning models, namely, Random Forest Regressor, Linear Regression Model, Artificial Neural Network, k-Nearest Neighbors, Support Vector Regression, Gradient Boosting Regressor, Gaussian Process Regression, and Deep Learning, as to their use in forecasting direct solar radiation across six climatically diverse regions in the Kingdom of Saudi Arabia. The models were evaluated using eight statistical metrics along with time-series and absolute error analyses. A key contribution of this work is the introduction of Trigonometric Cyclical Encoding, which has significantly improved temporal representation learning. Comparative SHAP-based feature-importance analysis revealed that Trigonometric Cyclical Encoding enhanced the explanatory power of temporal features by 49.26% for monthly cycles and 53.30% for daily cycles. The findings show that Deep Learning achieved the lowest root mean square error, as well as the highest coefficient of determination, while Artificial Neural Network demonstrated consistently high accuracy across the sites. Support Vector Regression performed optimally but was less reliable in some regions. Error and time-series analyses reveal that Artificial Neural Network and Deep Learning maintained stable prediction accuracy throughout high solar radiation seasons, whereas Linear Regression, Random Forest Regressor, and k-Nearest Neighbors showed greater fluctuations. The proposed Trigonometric Cyclical Encoding technique further enhanced model performance by maintaining the overall fitness of the models, which ranged between 81.79% and 94.36% in all scenarios. This paper supports the effective planning of solar energy and integration in challenging climatic conditions. Full article
(This article belongs to the Topic Solar and Wind Power and Energy Forecasting, 2nd Edition)
Show Figures

Figure 1

17 pages, 887 KB  
Article
Comparison of Linear and Beta Autoregressive Models in Forecasting Nonstationary Percentage Time Series
by Carlo Grillenzoni
Forecasting 2025, 7(4), 57; https://doi.org/10.3390/forecast7040057 - 13 Oct 2025
Viewed by 82
Abstract
Positive percentage time series are present in many empirical applications; they take values in the continuous interval (0,1) and are often modeled with linear dynamic models. Risks of biased predictions (outside the admissible range) and problems of heteroskedasticity in the presence of asymmetric [...] Read more.
Positive percentage time series are present in many empirical applications; they take values in the continuous interval (0,1) and are often modeled with linear dynamic models. Risks of biased predictions (outside the admissible range) and problems of heteroskedasticity in the presence of asymmetric distributions are ignored by practitioners. Alternative models are proposed in the statistical literature; the most suitable is the dynamic beta regression which belongs to generalized linear models (GLM) and uses the logit transformation as a link function. However, owing to the Jensen inequality, this approach may also not be optimal in prediction; thus, the aim of the present paper is the in-depth forecasting comparison of linear and beta autoregressions. Simulation experiments and applications to nonstationary time series (the US unemployment rate and BR hydroelectric energy) are carried out. Rolling regression for time-varying parameters is applied to both linear and beta models, and a prediction criterion for the joint selection of model order and sample size is defined. Full article
(This article belongs to the Special Issue Feature Papers of Forecasting 2025)
Show Figures

Figure 1

39 pages, 5604 KB  
Article
Prediction of 3D Airspace Occupancy Using Machine Learning
by Cristian Lozano Tafur, Jaime Orduy Rodríguez, Pedro Melo Daza, Iván Rodríguez Barón, Danny Stevens Traslaviña and Juan Andrés Bermúdez
Forecasting 2025, 7(4), 56; https://doi.org/10.3390/forecast7040056 - 8 Oct 2025
Viewed by 356
Abstract
This research introduces a system designed to predict three-dimensional airspace occupancy over Colombia using historical Automatic Dependent Surveillance-Broadcast (ADS-B) data and machine learning techniques. The goal is to support proactive air traffic management by estimating future aircraft positions—specifically their latitude, longitude, and flight [...] Read more.
This research introduces a system designed to predict three-dimensional airspace occupancy over Colombia using historical Automatic Dependent Surveillance-Broadcast (ADS-B) data and machine learning techniques. The goal is to support proactive air traffic management by estimating future aircraft positions—specifically their latitude, longitude, and flight level. To achieve this, four predictive models were developed and tested: K-Nearest Neighbors (KNN), Random Forest, Extreme Gradient Boosting (XGBoost), and Long Short-Term Memory (LSTM). Among them, the LSTM model delivered the most accurate results, with a Mean Absolute Error (MAE) of 312.59, a Root Mean Squared Error (RMSE) of 1187.43, and a coefficient of determination (R2) of 0.7523. Compared to the baseline models (KNN, Random Forest, XGBoost), these values represent an improvement of approximately 91% in MAE, 83% in RMSE, and an eighteen-fold increase in R2, demonstrating the substantial advantage of the LSTM approach. These metrics indicate a significant improvement over the other models, particularly in capturing temporal patterns and adjusting to evolving traffic conditions. The strength of the LSTM approach lies in its ability to model sequential data and adapt to dynamic environments—making it especially suitable for supporting future Trajectory-Based Operations (TBO). The results confirm that predicting airspace occupancy in three dimensions using historical data are not only possible but can yield reliable and actionable insights. Looking ahead, the integration of hybrid neural network architectures and their deployment in real-time systems offer promising directions to enhance both accuracy and operational value. Full article
(This article belongs to the Topic Short-Term Load Forecasting—2nd Edition)
Show Figures

Figure 1

29 pages, 1977 KB  
Article
From Market Volatility to Predictive Insight: An Adaptive Transformer–RL Framework for Sentiment-Driven Financial Time-Series Forecasting
by Zhicong Song, Harris Sik-Ho Tsang, Richard Tai-Chiu Hsung, Yulin Zhu and Wai-Lun Lo
Forecasting 2025, 7(4), 55; https://doi.org/10.3390/forecast7040055 - 2 Oct 2025
Viewed by 360
Abstract
Financial time-series prediction remains a significant challenge, driven by market volatility, nonlinear dynamic characteristics, and the complex interplay between quantitative indicators and investor sentiment. Traditional time-series models (e.g., ARIMA and GARCH) struggle to capture the nuanced sentiment in textual data, while static deep [...] Read more.
Financial time-series prediction remains a significant challenge, driven by market volatility, nonlinear dynamic characteristics, and the complex interplay between quantitative indicators and investor sentiment. Traditional time-series models (e.g., ARIMA and GARCH) struggle to capture the nuanced sentiment in textual data, while static deep learning integration methods fail to adapt to market regime transitions (bull markets, bear markets, and consolidation). This study proposes a hybrid framework that integrates investor forum sentiment analysis with adaptive deep reinforcement learning (DRL) for dynamic model integration. By constructing a domain-specific financial sentiment dictionary (containing 16,673 entries) based on the sentiment analysis approach and word-embedding technique, we achieved up to 97.35% accuracy in forum title classification tasks. Historical price data and investor forum sentiment information were then fed into a Support Vector Regressor (SVR) and three Transformer variants (single-layer, multi-layer, and bidirectional variants) for predictions, with a Deep Q-Network (DQN) agent dynamically fusing the prediction results. Comprehensive experiments were conducted on diverse financial datasets, including China Unicom, the CSI 100 index, corn, and Amazon (AMZN). The experimental results demonstrate that our proposed approach, combining textual sentiment with adaptive DRL integration, significantly enhances prediction robustness in volatile markets, achieving the lowest RMSEs across diverse assets. It overcomes the limitations of static methods and multi-market generalization, outperforming both benchmark and state-of-the-art models. Full article
Show Figures

Figure 1

43 pages, 4605 KB  
Article
Unveiling the Dynamics of Wholesale Sales and Business Cycle Impacts in Japan: An Extended Moving Linear Model Approach
by Koki Kyo and Hideo Noda
Forecasting 2025, 7(4), 54; https://doi.org/10.3390/forecast7040054 - 26 Sep 2025
Viewed by 264
Abstract
Wholesale sales value is one of the key elements included in the coincident indicator series of the indexes of business conditions in Japan. The objectives of this study are twofold. The first is to comprehend features of dynamic structure of various components for [...] Read more.
Wholesale sales value is one of the key elements included in the coincident indicator series of the indexes of business conditions in Japan. The objectives of this study are twofold. The first is to comprehend features of dynamic structure of various components for 12 business types of the wholesale sales in Japan, focusing on the period from January 1980 to December 2022. The second is to elucidate effect of business cycles on the behavior of each business type of wholesale sales. Specifically, we utilize our moving linear model approach to decompose monthly time-series data of wholesale sales into a seasonal component, an unusually varying component containing outliers, a constrained component, and a remaining component. Additionally, we construct a distribution-free dynamic linear model and examine the time-varying relationship between the decomposed remaining component, which contains cyclical variation, in each business type of the wholesale sales and that in the coincident composite index. Our proposed approach reveals complex dynamics of various components of time series on wholesale sales. Furthermore, we find that different business types of the wholesale sales exhibit diverse responses to business cycles, which are influenced by macroeconomic conditions, government policies, or exogenous shocks. Full article
Show Figures

Figure 1

Previous Issue
Back to TopTop