Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

Search Results (12)

Search Parameters:
Keywords = GRU-KAN model

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
28 pages, 520 KB  
Article
A Delta-Targeted Hybrid Deep Learning Architecture for Short-Term Scrap Steel Price Forecasting: A Comparative Study
by Nihan Sena Cifci, Melike Karatay, Yasemin Demirel, Yesim Aygul and Onur Ugurlu
Appl. Sci. 2026, 16(10), 4981; https://doi.org/10.3390/app16104981 - 16 May 2026
Viewed by 210
Abstract
Forecasting scrap steel prices is crucial for the economic sustainability of recycling operations, yet it remains challenging due to inherent volatility and non-stationary behavior. In this study, we develop and evaluate a delta-targeted Hybrid forecasting pipeline for short horizons of 1, 3, and [...] Read more.
Forecasting scrap steel prices is crucial for the economic sustainability of recycling operations, yet it remains challenging due to inherent volatility and non-stationary behavior. In this study, we develop and evaluate a delta-targeted Hybrid forecasting pipeline for short horizons of 1, 3, and 7 days. We benchmark classical baselines (Naive, Seasonal Autoregressive Integrated Moving Average (SARIMA), and Exponential Smoothing (ETS)) against recurrent deep learning models (Simple Recurrent Neural Network (RNN), Gated Recurrent Unit (GRU), and Long Short-Term Memory (LSTM)) and recent neural forecasting baselines, including Decomposition-Linear (DLinear), Convolutional Kolmogorov–Arnold Network (C-KAN), and Neural Basis Expansion Analysis for Time Series (N-BEATS), using real-world daily scrap steel price data. The results indicate that delta-targeting generally yields more stable predictive performance than direct raw-price forecasting as the prediction horizon increases. For example, at the 7-day horizon, the predictive fit improves from approximately R20.87 for raw-price LSTM to around R20.90 for delta-trained recurrent models. At the same horizon, a delta-based RNN achieves the lowest Mean Absolute Percentage Error (MAPE) among the evaluated models (approximately 1.39%), while the proposed Hybrid model remains competitive across all tested horizons and maintains a goodness-of-fit of approximately R20.90 without uniformly minimizing point error relative to the best-performing recurrent baseline. Attention profiling and permutation-based feature importance analyses indicate that the model places relatively higher weight on calendar-related inputs, consistent with the presence of weekly patterns in the data; these results should be interpreted as sensitivity diagnostics rather than causal evidence. Overall, the findings suggest that delta-transformed targets provide a more suitable prediction space than raw-price targets for short-horizon scrap steel forecasting, while the Hybrid design offers a balanced combination of predictive performance and diagnostic interpretability for operational decision support. Full article
Show Figures

Figure 1

27 pages, 26393 KB  
Article
Oil Production Forecasting Under Asymmetric Temporal Dynamics Using Signature-Weighted Kolmogorov–Arnold Network
by Zhidan Yang, Chaoran Zhang, Jiaqi Bian, Jian Zou and Zhong Chen
Symmetry 2026, 18(5), 818; https://doi.org/10.3390/sym18050818 (registering DOI) - 9 May 2026
Viewed by 158
Abstract
Accurate production forecasting of oil wells is of great significance for reservoir management, production optimization, and investment decisions. However, complex subsurface dynamics and sudden operational interventions frequently break the temporal symmetry of production sequences, generating highly asymmetric data distributions. Standard deep sequence architectures [...] Read more.
Accurate production forecasting of oil wells is of great significance for reservoir management, production optimization, and investment decisions. However, complex subsurface dynamics and sudden operational interventions frequently break the temporal symmetry of production sequences, generating highly asymmetric data distributions. Standard deep sequence architectures often suffer from severe phase lag and limited adaptability when modeling such asymmetric regime transitions. To resolve these bottlenecks, we introduce the Signature-Weighted Kolmogorov–Arnold Network with Gated Recurrent Units (SigKAN-GRU). The architecture replaces static node activations with adaptive edge–spline mappings, enabling robust approximation of asymmetric nonlinearities. Path signatures compress high-order asymmetric temporal trajectories into invariant geometric features, a learnable gating kernel filters critical variations, and a final GRU layer enforces explicit sequential memory. This integration bridges long-term depletion trends with abrupt asymmetrical perturbations while maintaining structurally controlled complexity and an interpretable decomposition of nonlinear response and temporal weighting. Validated on two real-world wells with contrasting data characteristics, SigKAN-GRU consistently minimizes absolute error metrics and phase distortions against prevailing baselines. In addition, event-sensitive evaluations further confirm its reliability in peak regions and abrupt shock intervals. The resulting framework translates erratic historical data into robust deterministic forecasts, offering a rigorous quantitative tool for field-level reservoir optimization. Full article
(This article belongs to the Section Computer)
Show Figures

Figure 1

28 pages, 13123 KB  
Article
A Generative Augmentation and Physics-Informed Network for Interpretable Prediction of Mining-Induced Deformation from InSAR Data
by Yuchen Han, Jiajia Yuan, Mingzhi Sun and Lu Liu
Remote Sens. 2026, 18(7), 987; https://doi.org/10.3390/rs18070987 - 25 Mar 2026
Viewed by 647
Abstract
Accurate forecasting of mining-induced surface deformation is critical for coal-mine safety assessment and hazard mitigation. InSAR deformation time series are often short, temporally sparse, and strongly nonlinear. These characteristics can make purely data-driven predictors unreliable in small-sample settings. To address this issue, we [...] Read more.
Accurate forecasting of mining-induced surface deformation is critical for coal-mine safety assessment and hazard mitigation. InSAR deformation time series are often short, temporally sparse, and strongly nonlinear. These characteristics can make purely data-driven predictors unreliable in small-sample settings. To address this issue, we propose a generation–prediction–interpretation framework that combines generative augmentation with physics-informed forecasting. We first develop a TCN-TimeGAN model to synthesize high-fidelity deformation sequences and expand the training set. Recurrent modules in the generator and discriminator are replaced with causal TCN residual blocks, and a temporal self-attention layer is further stacked on top of the TCN backbone to adaptively reweight informative time steps. We then construct a physics-informed Kolmogorov–Arnold Network, termed PI-KAN. Subsidence-consistency and smoothness priors are embedded in the learning objective to promote physically plausible predictions while retaining spline-based interpretability. Experiments on SBAS-InSAR deformation series from the Guqiao coal mine show that the framework achieves an RMSE of 0.825 mm and an R2 of 0.968. It outperforms TGAN-KAN, CNN-BiGRU, and BiGRU under the same evaluation protocol. Visualizations of the learned spline-based edge functions further reveal stronger nonlinear responses for lagged inputs closer to the forecast horizon, providing interpretable evidence of short-term temporal sensitivity under sparse observations. Full article
Show Figures

Figure 1

17 pages, 4808 KB  
Article
Predicting Groundwater Depth Using Historical Data Trend Decomposition: Based on the VMD-LSTM Hybrid Deep Learning Model
by Jie Yue, Hong Guo, Deng Pan, Huanxiang Wang, Yawen Xin, Furong Yu, Yingying Shao and Rui Dun
Water 2026, 18(6), 689; https://doi.org/10.3390/w18060689 - 15 Mar 2026
Viewed by 438
Abstract
Groundwater is a critical natural and strategic economic resource, and the accurate prediction of groundwater depth dynamics is essential for the rational development and utilization of water resources. However, under the combined influence of climate variability, human activities, and complex hydrogeological conditions, groundwater [...] Read more.
Groundwater is a critical natural and strategic economic resource, and the accurate prediction of groundwater depth dynamics is essential for the rational development and utilization of water resources. However, under the combined influence of climate variability, human activities, and complex hydrogeological conditions, groundwater level time series exhibit strong nonlinear and non-stationary characteristics, posing great challenges to the accurate prediction of groundwater level dynamics. Most existing prediction models rely on sufficient hydro-meteorological and exploitation data that are difficult to obtain in water-scarce regions, or fail to effectively decouple the multi-scale features of non-stationary groundwater level signals, resulting in limited prediction accuracy and insufficient generalization ability. To address these research gaps, this study takes Zhengzhou, a typical water-deficient city in the Yellow River Basin, as the study area, and proposes a hybrid deep learning framework combining Variational Mode Decomposition (VMD) and Long Short-Term Memory (LSTM) neural network for predicting shallow and intermediate-deep groundwater level changes. Kolmogorov–Arnold Networks (KANs) and Gated Recurrent Units (GRUs) are selected as benchmark models to verify the superior performance of the proposed framework. In this framework, the non-stationary groundwater level signal is adaptively decomposed into Intrinsic Mode Functions (IMFs) with distinct frequency characteristics via VMD. An independent LSTM model is constructed for each IMF to capture its unique temporal variation pattern, and the final groundwater level prediction is obtained by linearly reconstructing the predicted results of all IMFs. The results show that the coefficient of determination (R2) of the VMD-LSTM model exceeds 0.90 for all monitoring datasets, with low Mean Absolute Error (MAE) and Mean Squared Error (MSE). It significantly outperforms the benchmark models in handling nonlinear and non-stationary time series features. Using only historical groundwater level data as input, the proposed framework effectively overcomes the limitation of insufficient driving variables in data-scarce regions and fully explores the multi-scale evolution of groundwater dynamics through the synergistic effect of multi-scale decomposition and deep learning. The method presented in this study provides a novel and reliable technical approach for groundwater level prediction in water-deficient and data-limited areas, and also offers scientific support for the rational management and sustainable utilization of regional groundwater resources. Future research will incorporate driving factors such as meteorology and exploitation to further improve the model’s ability to capture abrupt changes in groundwater level dynamics. Full article
Show Figures

Figure 1

22 pages, 2272 KB  
Article
Short-Term Photovoltaic Power Prediction Using a DPCA–CPO–RF–KAN–GRU Hybrid Model
by Mingguang Liu, Ying Zhou, Yusi Wei, Weibo Zhao, Min Qu, Xue Bai and Zecheng Ding
Processes 2026, 14(2), 252; https://doi.org/10.3390/pr14020252 - 11 Jan 2026
Viewed by 456
Abstract
In photovoltaic (PV) power generation, the intermittency and uncertainty caused by meteorological factors pose challenges to grid operations. Accurate PV power prediction is crucial for optimizing power dispatching and balancing supply and demand. This paper proposes a PV power prediction model based on [...] Read more.
In photovoltaic (PV) power generation, the intermittency and uncertainty caused by meteorological factors pose challenges to grid operations. Accurate PV power prediction is crucial for optimizing power dispatching and balancing supply and demand. This paper proposes a PV power prediction model based on Density Peak Clustering Algorithm (DPCA)–Crested Porcupine Optimizer (CPO)–Random Forest (RF)–Gated Recurrent Unit (GRU)–Kolmogorov–Arnold Network (KAN). First, the DPCA is used to accurately classify weather conditions according to meteorological data such as solar radiation, temperature, and humidity. Then, the CPO algorithm is established to optimize the factor screening characteristic variables of the RF. Subsequently, a hybrid GRU model with a KAN layer is introduced for short-term PV power prediction. The Shapley Additive Explanation (SHAP) method values evaluating feature importance and the impact of causal features. Compared with other contrast models, the DPCA-CPO-RF-KAN-GRU model demonstrates better error reduction capabilities under three weather types, with an average fitting accuracy R2 reaching 97%. SHAP analysis indicates that the combined average SHAP value of total solar radiation and direct solar radiation contributes more than 70%. Finally, the Kernel Density Estimation (KDE) is utilized to verify that the KAN-GRU model has high robustness in interval prediction, providing strong technical support for ensuring the stability of the power grid and precise decision-making in the electricity market. Full article
(This article belongs to the Section Energy Systems)
Show Figures

Figure 1

20 pages, 6013 KB  
Article
A GRU-KAN Surrogate Model with Genetic Algorithm Uniform Sampling for Active Magnetic Bearings–Rotor Critical Speed Prediction
by Jiahang Cui, Jianghong Li, Feichao Cai, Zhenmin Zhao and Yuxi Liu
Sensors 2025, 25(18), 5680; https://doi.org/10.3390/s25185680 - 11 Sep 2025
Cited by 1 | Viewed by 1125
Abstract
With the development of active magnetic bearings (AMBs) toward higher speeds, understanding high-speed rotor dynamics has become a crucial focus in AMB research. Traditional finite element modeling (FEM) methods, however, are unable to rapidly and comprehensively uncover the complex interplay between controller parameters [...] Read more.
With the development of active magnetic bearings (AMBs) toward higher speeds, understanding high-speed rotor dynamics has become a crucial focus in AMB research. Traditional finite element modeling (FEM) methods, however, are unable to rapidly and comprehensively uncover the complex interplay between controller parameters and dynamic behavior. To address this limitation, a surrogate modeling approach based on a hybrid gated recurrent unit–Kolmogorov–Arnold network (GRU-KAN) is introduced to mathematically capture the effects of coupled control gains on rotor dynamics. To enhance model generalization, a genetic algorithm-driven uniform design sampling strategy is also implemented. Comparative studies against support vector regression and Kriging surrogates indicate a higher coefficient of determination (R2=0.9887) and lower residuals for the proposed approach. Experimental validation across multiple controller parameter combinations shows that the resulting machine learning surrogate predicts the critical speed with a mean absolute error of only 38.51 rpm and a mean absolute percentage error of 1.56×101%, while requiring merely 1.14×104 s per evaluation—compared to 201 s for traditional FEM. These findings demonstrate the surrogate’s efficiency, accuracy, and comprehensive predictive capabilities, offering an effective method for rapid critical speed estimation in AMB–rotor systems. Full article
(This article belongs to the Section Physical Sensors)
Show Figures

Figure 1

25 pages, 6752 KB  
Article
Hybrid Deep Learning Combining Mode Decomposition and Intelligent Optimization for Discharge Forecasting: A Case Study of the Baiquan Karst Spring
by Yanling Li, Tianxing Dong, Yingying Shao and Xiaoming Mao
Sustainability 2025, 17(18), 8101; https://doi.org/10.3390/su17188101 - 9 Sep 2025
Viewed by 1047
Abstract
Karst springs play a critical strategic role in regional economic and ecological sustainability, yet their spatiotemporal heterogeneity and hydrological complexity pose substantial challenges for flow prediction. This study proposes FMD-mGTO-BiGRU-KAN, a four-stage hybrid deep learning architecture for daily spring flow prediction that integrates [...] Read more.
Karst springs play a critical strategic role in regional economic and ecological sustainability, yet their spatiotemporal heterogeneity and hydrological complexity pose substantial challenges for flow prediction. This study proposes FMD-mGTO-BiGRU-KAN, a four-stage hybrid deep learning architecture for daily spring flow prediction that integrates multi-feature signal decomposition, meta-heuristic optimization, and interpretable neural network design: constructing an Feature Mode Decomposition (FMD) decomposition layer to mitigate modal aliasing in meteorological signals; employing the improved Gorilla Troops Optimizer (mGTO) optimization algorithm to enable autonomous hyperparameter evolution, overcoming the limitations of traditional grid search; designing a Bidirectional Gated Recurrent Unit (BiGRU) network to capture long-term historical dependencies in spring flow sequences through bidirectional recurrent mechanisms; introducing Kolmogorov–Arnold Networks (KAN) to replace the fully connected layer, and improving the model interpretability through differentiable symbolic operations; Additionally, residual modules and dropout blocks are incorporated to enhance generalization capability, reduce overfitting risks. By integrating multiple deep learning algorithms, this hybrid model leverages their respective strengths to adeptly accommodate intricate meteorological conditions, thereby enhancing its capacity to discern the underlying patterns within complex and dynamic input features. Comparative results against benchmark models (LSTM, GRU, and Transformer) show that the proposed framework achieves 82.47% and 50.15% reductions in MSE and RMSE, respectively, with the NSE increasing by 8.01% to 0.9862. The prediction errors are more tightly distributed, and the proposed model surpasses the benchmark model in overall performance, validating its superiority. The model’s exceptional prediction ability offers a novel high-precision solution for spring flow prediction in complex hydrological systems. Full article
Show Figures

Figure 1

28 pages, 7608 KB  
Article
A Forecasting Method for COVID-19 Epidemic Trends Using VMD and TSMixer-BiKSA Network
by Yuhong Li, Guihong Bi, Taonan Tong and Shirui Li
Computers 2025, 14(7), 290; https://doi.org/10.3390/computers14070290 - 18 Jul 2025
Viewed by 994
Abstract
The spread of COVID-19 is influenced by multiple factors, including control policies, virus characteristics, individual behaviors, and environmental conditions, exhibiting highly complex nonlinear dynamic features. The time series of new confirmed cases shows significant nonlinearity and non-stationarity. Traditional prediction methods that rely solely [...] Read more.
The spread of COVID-19 is influenced by multiple factors, including control policies, virus characteristics, individual behaviors, and environmental conditions, exhibiting highly complex nonlinear dynamic features. The time series of new confirmed cases shows significant nonlinearity and non-stationarity. Traditional prediction methods that rely solely on one-dimensional case data struggle to capture the multi-dimensional features of the data and are limited in handling nonlinear and non-stationary characteristics. Their prediction accuracy and generalization capabilities remain insufficient, and most existing studies focus on single-step forecasting, with limited attention to multi-step prediction. To address these challenges, this paper proposes a multi-module fusion prediction model—TSMixer-BiKSA network—that integrates multi-feature inputs, Variational Mode Decomposition (VMD), and a dual-branch parallel architecture for 1- to 3-day-ahead multi-step forecasting of new COVID-19 cases. First, variables highly correlated with the target sequence are selected through correlation analysis to construct a feature matrix, which serves as one input branch. Simultaneously, the case sequence is decomposed using VMD to extract low-complexity, highly regular multi-scale modal components as the other input branch, enhancing the model’s ability to perceive and represent multi-source information. The two input branches are then processed in parallel by the TSMixer-BiKSA network model. Specifically, the TSMixer module employs a multilayer perceptron (MLP) structure to alternately model along the temporal and feature dimensions, capturing cross-time and cross-variable dependencies. The BiGRU module extracts bidirectional dynamic features of the sequence, improving long-term dependency modeling. The KAN module introduces hierarchical nonlinear transformations to enhance high-order feature interactions. Finally, the SA attention mechanism enables the adaptive weighted fusion of multi-source information, reinforcing inter-module synergy and enhancing the overall feature extraction and representation capability. Experimental results based on COVID-19 case data from Italy and the United States demonstrate that the proposed model significantly outperforms existing mainstream methods across various error metrics, achieving higher prediction accuracy and robustness. Full article
Show Figures

Figure 1

33 pages, 10136 KB  
Article
Carbon Price Forecasting Using a Hybrid Deep Learning Model: TKMixer-BiGRU-SA
by Yuhong Li, Nan Yang, Guihong Bi, Shiyu Chen, Zhao Luo and Xin Shen
Symmetry 2025, 17(6), 962; https://doi.org/10.3390/sym17060962 - 17 Jun 2025
Cited by 6 | Viewed by 3077
Abstract
As a core strategy for carbon emission reduction, carbon trading plays a critical role in policy guidance and market stability. Accurate forecasting of carbon prices is essential, yet remains challenging due to the nonlinear, non-stationary, noisy, and uncertain nature of carbon price time [...] Read more.
As a core strategy for carbon emission reduction, carbon trading plays a critical role in policy guidance and market stability. Accurate forecasting of carbon prices is essential, yet remains challenging due to the nonlinear, non-stationary, noisy, and uncertain nature of carbon price time series. To address this, this paper proposes a novel hybrid deep learning framework that integrates dual-mode decomposition and a TKMixer-BiGRU-SA model for carbon price prediction. First, external variables with high correlation to carbon prices are identified through correlation analysis and incorporated as inputs. Then, the carbon price series is decomposed using Variational Mode Decomposition (VMD) and Empirical Wavelet Transform (EWT) to extract multi-scale features embedded in the original data. The core prediction model, TKMixer-BiGRU-SA Net, comprises three integrated branches: the first processes the raw carbon price and highly relevant external time series, and the second and third process multi-scale components obtained from VMD and EWT, respectively. The proposed model embeds Kolmogorov–Arnold Networks (KANs) into the Time-Series Mixer (TSMixer) module, replacing the conventional time-mapping layer to form the TKMixer module. Each branch alternately applies the TKMixer along the temporal and feature-channel dimensions to capture dependencies across time steps and variables. Hierarchical nonlinear transformations enhance higher-order feature interactions and improve nonlinear modeling capability. Additionally, the BiGRU component captures bidirectional long-term dependencies, while the Self-Attention (SA) mechanism adaptively weights critical features for integrated prediction. This architecture is designed to uncover global fluctuation patterns in carbon prices, multi-scale component behaviors, and external factor correlations, thereby enabling autonomous learning and the prediction of complex non-stationary and nonlinear price dynamics. Empirical evaluations using data from the EU Emission Allowance (EUA) and Hubei Emission Allowance (HBEA) demonstrate the model’s high accuracy in both single-step and multi-step forecasting tasks. For example, the eMAPE of EUA predictions for 1–4 step forecasts are 0.2081%, 0.5660%, 0.8293%, and 1.1063%, respectively—outperforming benchmark models and confirming the proposed method’s effectiveness and robustness. This study provides a novel approach to carbon price forecasting with practical implications for market regulation and decision-making. Full article
(This article belongs to the Section Computer)
Show Figures

Figure 1

22 pages, 9495 KB  
Article
SOH Estimation Method for Lithium-Ion Batteries Using Partial Discharge Curves Based on CGKAN
by Shengfeng He, Wenhu Qin, Zhonghua Yun, Chao Wu and Chongbin Sun
Batteries 2025, 11(5), 167; https://doi.org/10.3390/batteries11050167 - 23 Apr 2025
Cited by 5 | Viewed by 2441
Abstract
Accurate estimation of the state of health (SOH) of lithium-ion batteries is essential to ensure the safe and stable operation of equipment such as electric vehicles. To address the limitations in the accuracy and robustness of existing methods under complex operating conditions, a [...] Read more.
Accurate estimation of the state of health (SOH) of lithium-ion batteries is essential to ensure the safe and stable operation of equipment such as electric vehicles. To address the limitations in the accuracy and robustness of existing methods under complex operating conditions, a CNN-BiGRU-KAN (CGKAN) method for SOH estimation based on partial discharge curves is proposed. Firstly, random forest analysis is applied to extract features highly correlated with battery health from the partial discharge curve data. Next, a SOH estimation framework based on the CGKAN model is developed, where 1-Dimensional-Convolutional Neural Networks (1D-CNN) are used to extract deep features from the original data, Bidirectional Gated Recurrent Unit (BiGRU) captures the bidirectional dependencies of the time series, and Kolmogorov–Arnold Networks (KAN) enhances the modeling of complex nonlinear features through its nonlinear mapping capabilities, thereby improving the accuracy of SOH estimation. Finally, multiple experiments under different conditions are conducted, and the results demonstrate that the proposed CGKAN method, by integrating the individual advantages of 1D-CNN, BiGRU, and KAN, efficiently captures complex nonlinear patterns in battery health features and maintains stable performance across various operating conditions. Full article
Show Figures

Graphical abstract

15 pages, 3745 KB  
Article
Indoor Microclimate Monitoring and Forecasting: Public Sector Building Use Case
by Ruslans Sudniks, Arturs Ziemelis, Agris Nikitenko, Vasco N. G. J. Soares and Andis Supe
Information 2025, 16(2), 121; https://doi.org/10.3390/info16020121 - 8 Feb 2025
Cited by 3 | Viewed by 1985
Abstract
This research aims to demonstrate a machine learning (ML) algorithm-based indoor air quality (IAQ) monitoring and forecasting system for a public sector building use case. Such a system has the potential to automate existing heating/ventilation systems, therefore reducing energy consumption. One of Riga [...] Read more.
This research aims to demonstrate a machine learning (ML) algorithm-based indoor air quality (IAQ) monitoring and forecasting system for a public sector building use case. Such a system has the potential to automate existing heating/ventilation systems, therefore reducing energy consumption. One of Riga Technical University’s campus buildings, equipped with around 128 IAQ sensors, is used as a test bed to create a digital shadow including a comparison of five ML-based data prediction tools. We compare the IAQ data prediction loss using Root Mean Square Error (RMSE) and Mean Absolute Error (MAE) error metrics based on real sensor data. Gated Recurrent Unit (GRU) and Kolmogorov–Arnold Networks (KAN) prove to be the most accurate models regarding the prediction error. Also, GRU proved to be the most efficient model regarding the required computation time. Full article
Show Figures

Figure 1

18 pages, 4375 KB  
Article
Research on Oil Well Production Prediction Based on GRU-KAN Model Optimized by PSO
by Bo Qiu, Jian Zhang, Yun Yang, Guangyuan Qin, Zhongyi Zhou and Cunrui Ying
Energies 2024, 17(21), 5502; https://doi.org/10.3390/en17215502 - 4 Nov 2024
Cited by 25 | Viewed by 3165
Abstract
Accurately predicting oil well production volume is of great significance in oilfield production. To overcome the shortcomings in the current study of oil well production prediction, we propose a hybrid model (GRU-KAN) with the gated recurrent unit (GRU) and Kolmogorov–Arnold network (KAN). The [...] Read more.
Accurately predicting oil well production volume is of great significance in oilfield production. To overcome the shortcomings in the current study of oil well production prediction, we propose a hybrid model (GRU-KAN) with the gated recurrent unit (GRU) and Kolmogorov–Arnold network (KAN). The GRU-KAN model utilizes GRU to extract temporal features and KAN to capture complex nonlinear relationships. First, the MissForest algorithm is employed to handle anomalous data, improving data quality. The Pearson correlation coefficient is used to select the most significant features. These selected features are used as input to the GRU-KAN model to establish the oil well production prediction model. Then, the Particle Swarm Optimization (PSO) algorithm is used to enhance the predictive performance. Finally, the model is evaluated on the test set. The validity of the model was verified on two oil wells and the results on well F14 show that the proposed GRU-KAN model achieves a Root Mean Square Error (RMSE), Mean Absolute Error (MAE), Mean Absolute Percentage Error (MAPE) and Coefficient of Determination (R2) values of 11.90, 9.18, 6.0% and 0.95, respectively. Compared to popular single and hybrid models, the GRU-KAN model achieves higher production-prediction accuracy and higher computational efficiency. The model can be applied to the formulation of oilfield-development plans, which is of great theoretical and practical significance to the advancement of oilfield technology levels. Full article
Show Figures

Figure 1

Back to TopTop