Next Article in Journal
A High-Precision Segmentation Method for Photovoltaic Modules Integrating Transformer and Improved U-Net
Previous Article in Journal
Industry 4.0 and Process Innovation: What Lies Ahead?
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

Optimizing Intermittent Pumping Duration with a Physics–Data Dual-Driven CatBoost Model Enhanced by Bayesian and Attention Mechanisms

1
Key Laboratory of Enhanced Oil and Gas Recovery, Ministry of Education, Northeast Petroleum University, Daqing 163318, China
2
Xinli Oil Production Plant of Jilin Oilfield Branch of PetroChina, Songyuan 138000, China
3
School of Electrical and Information Engineering, Northeast Petroleum University, Daqing 163318, China
*
Author to whom correspondence should be addressed.
Processes 2025, 13(12), 4012; https://doi.org/10.3390/pr13124012
Submission received: 10 November 2025 / Revised: 29 November 2025 / Accepted: 4 December 2025 / Published: 11 December 2025
(This article belongs to the Section Petroleum and Low-Carbon Energy Process Engineering)

Abstract

Traditional oilfields face challenges such as high energy consumption, imprecise control, and lax management in mid-to-late development stages, leading to increased costs and reduced efficiency. To address these issues, this work aims to develop an intelligent optimization framework for intermittent pumping by explicitly integrating physical mechanisms with data-driven modeling. Specifically, we propose a data–physics dual-driven method that combines physics-based parameters derived from seepage mechanics with data-driven feature selection using Pearson correlation analysis to identify nine key production factors. An improved CatBoost regression framework is developed through systematic preprocessing, including data cleaning, cubic polynomial feature expansion, F-value screening, and Z-score normalization. The model is further enhanced using Bayesian hyperparameter optimization, a weight adaptation mechanism, and an attention-based multi-level architecture. The novelty of this work lies in the unified dual-driven optimization strategy and the enhanced CatBoost framework that jointly improve prediction accuracy and model generalization. Experimental results demonstrate that the proposed method can accurately predict pumping operation times. Compared with the original CatBoost model, the MAE of the large-interval model decreases by 56.94%, while that of the small-interval model decreases by 16.23%. In addition, the accuracy of the large-interval model increases by 4.1%, and that of the small-interval model increases by 1.22%. These improvements show that the enhanced CatBoost model significantly strengthens predictive performance. This approach provides a reliable basis for optimizing pumping schedules, reducing energy consumption, and promoting intelligent and refined oilfield management.
Keywords: pumping regime optimization; dual-driven strategy; mathematical modelling; physical mechanisms; CatBoost regression framework pumping regime optimization; dual-driven strategy; mathematical modelling; physical mechanisms; CatBoost regression framework

Share and Cite

MDPI and ACS Style

Zhang, C.; Feng, F.; Zhang, C.; Li, S.; Xie, J. Optimizing Intermittent Pumping Duration with a Physics–Data Dual-Driven CatBoost Model Enhanced by Bayesian and Attention Mechanisms. Processes 2025, 13, 4012. https://doi.org/10.3390/pr13124012

AMA Style

Zhang C, Feng F, Zhang C, Li S, Xie J. Optimizing Intermittent Pumping Duration with a Physics–Data Dual-Driven CatBoost Model Enhanced by Bayesian and Attention Mechanisms. Processes. 2025; 13(12):4012. https://doi.org/10.3390/pr13124012

Chicago/Turabian Style

Zhang, Chengming, Fuping Feng, Cong Zhang, Shiyuan Li, and Junzhuzi Xie. 2025. "Optimizing Intermittent Pumping Duration with a Physics–Data Dual-Driven CatBoost Model Enhanced by Bayesian and Attention Mechanisms" Processes 13, no. 12: 4012. https://doi.org/10.3390/pr13124012

APA Style

Zhang, C., Feng, F., Zhang, C., Li, S., & Xie, J. (2025). Optimizing Intermittent Pumping Duration with a Physics–Data Dual-Driven CatBoost Model Enhanced by Bayesian and Attention Mechanisms. Processes, 13(12), 4012. https://doi.org/10.3390/pr13124012

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.
Back to TopTop