Prediction of Water Saturation in Lacustrine Tight Reservoirs of Chang8 in the Central Ordos Basin—Based on the PSO+LightGBM Model
Abstract
1. Introduction
- (1)
- Traditional water saturation determination methods (laboratory measurements and log-based models) fail to meet the requirements of full-area and full-interval coverage and high-precision prediction for lacustrine tight reservoirs with strong heterogeneity and micro-nano pore throats (e.g., Chang8 of the Ordos Basin).
- (2)
- Existing machine learning-based water saturation prediction studies lack integration of hyperparameter optimization algorithms (e.g., PSO), leading to suboptimal computational efficiency and vulnerability to overfitting.
- (3)
- Most machine learning models for reservoir parameter prediction are opaque ‘black-box’ models without interpretability analysis, making it difficult for geological researchers to validate and accept the prediction results.
- (4)
- Few studies have targeted the specific geological characteristics of the Chang 8 lacustrine tight reservoirs in the central Ordos Basin, resulting in a lack of tailored high-precision prediction models for this key exploration target.
- (1)
- To address the limitations of traditional methods, leverage the inherent advantages of machine learning in handling complex nonlinear relationships between geological/engineering parameters and water saturation, and develop a high-precision prediction model suitable for lacustrine tight reservoirs.
- (2)
- To improve model performance by introducing PSO for hyperparameter optimization of LightGBM, XGBoost, and MERF, thereby solving overfitting and slow convergence issues in tight reservoir data training.
- (3)
- To enhance the credibility and geological acceptability of the model by integrating the SHAP value method for visual interpretability analysis of the optimized model.
- (4)
- To provide reliable technical support for the efficient exploration and development of tight oil resources in the Chang 8 Member of the central Ordos Basin, and alleviate China’s energy supply-demand contradiction.
2. Geological Setting
3. Experiments and Methods (Shaanxi, Xi’an, China)
3.1. Workflow
3.2. Data Source and Analysis
3.3. Xgboost Algorithm
3.4. Lightgbm
3.5. Mixed Effects Random Forest Algorithm (Merf)
3.6. Particle Swarm Optimization Algorithm (Pso)
3.7. Shap Algorithm
3.8. Hyperparameter Optimization and Evaluation Metrics
4. Results and Discussion
4.1. Analysis of Core Water Saturation
4.2. Model Prediction Results
4.3. Interpretability Evaluation
5. Model Application
6. Conclusions
- (1)
- The PSO algorithm can quickly determine the optimal hyperparameter combination for the LightGBM model and establish a nonlinear mapping between water saturation and logging parameters. Evaluation metrics based on RMSE, R2, and Swa indicate that the PSO+LightGBM algorithm outperforms PSO+XGBoost and PSO+MERF in predicting water saturation in tight reservoirs.
- (2)
- The SHAP (Shapley Additive EXPlanations) algorithm was used to conduct an interpretability analysis of the constructed PSO+LightGBM model. The results indicate that five logging parameters—SP (Self-Potential), M2R3, DEN (Density Log), DT (Acoustic Travel Time Log), and CN (Neutron Capture Cross-Section Log)—have the most significant impact on the water saturation prediction model.
- (3)
- Application results demonstrate that the PSO+LightGBM water saturation prediction model constructed in this study exhibits excellent generalization performance in low-permeability tight reservoirs, making it a reliable and efficient prediction method. This model has considerable practical potential and offers a novel technical approach for evaluating tight sandstone reservoirs.
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Wang, Z.; Fan, Z.; Zhang, X.; Liu, B.; Chen, X. Status, Trends and Enlightenment of Global Oil and Gas Development in 2021. Pet. Explor. Dev. 2022, 49, 1210–1228. [Google Scholar] [CrossRef]
- Qu, J.; Ding, X.; Zha, M.; Chen, H.; Gao, C.; Wang, Z. Geochemical Characterization of Lucaogou Formation and Its Correlation of Tight Oil Accumulation in Jimsar Sag of Junggar Basin, Northwestern China. J. Pet. Explor. Prod. Technol. 2017, 7, 699–706. [Google Scholar] [CrossRef] [PubMed][Green Version]
- Fu, J.; Wang, L.; Chen, X.; Liu, J.; Hui, X.; Cheng, D. Progress and prospects of shale oil exploration and development in the seventh member of Yanchang Formation in Ordos Basin. China Pet. Explor. 2023, 28, 1–14. (In Chinese) [Google Scholar]
- Fu, S.; Fu, J.; Niu, X.; Li, S.; Wu, Z.; Zhou, X.; Liu, J. Accumulation conditions and key exploration and development technologies of Qingcheng Oilfield. Acta Pet. Sin. 2020, 41, 777–795. (In Chinese) [Google Scholar]
- Tao, S.; Hu, S.; Wang, J.; Bai, B.; Pang, Z.; Wang, M.; Chen, Y.; Chen, Y.; Yang, Y.; Jin, X.; et al. Formation conditions, enrichment regularities and resource potentials of continental tight oil in China. Acta Pet. Sin. 2023, 44, 1222–1239. (In Chinese) [Google Scholar]
- Zhou, X.; Zhang, C.; Zhang, Z.; Zhang, R.; Zhu, L.; Zhang, C. A Saturation Evaluation Method in Tight Gas Sandstones Based on Diagenetic Facies. Mar. Pet. Geol. 2019, 107, 310–325. [Google Scholar] [CrossRef]
- Pan, H.-J.; Wei, C.; Yan, X.-F.; Li, X.-M.; Yang, Z.-F.; Gui, Z.-X.; Liu, S.-X. 3D Rock Physics Template-Based Probabilistic Estimation of Tight Sandstone Reservoir Properties. Pet. Sci. 2024, 21, 3090–3101. [Google Scholar] [CrossRef]
- Wu, J.; Luo, R.; Lei, C.; Yin, J.; Chen, X. Prediction of water saturation in tight sandstone reservoirs from well log data based on the large language models (LLMs). Nat. Gas Ind. 2024, 44, 77–87. (In Chinese) [Google Scholar]
- Ding, S.; Yang, S.; Lu, W.; Luo, R.; Zhu, L.; Gu, Y.; Chen, X. Robust prediction for water saturation based on strategy of light gradient boosting machine. Prog. Geophys. 2023, 38, 185X200. (In Chinese) [Google Scholar] [CrossRef]
- Yang, G.; Ren, Z.; Qi, K. Research on Diagenetic Evolution and Hydrocarbon Accumulation Periods of Chang 8 Reservoir in Zhenjing Area of Ordos Basin. Energies 2022, 15, 3846. [Google Scholar] [CrossRef]
- Wang, X.; Yang, S.; Zhao, Y.; Wang, Y. Improved Pore Structure Prediction Based on MICP with a Data Mining and Machine Learning System Approach in Mesozoic Strata of Gaoqing Field, Jiyang Depression. J. Pet. Sci. Eng. 2018, 171, 362–393. [Google Scholar] [CrossRef]
- Lai, F.; Li, Z.; Zhang, W.; Dong, H.; Kong, F.; Jiang, Z. Investigation of Pore Characteristics and Irreducible Water Saturation of Tight Reservoir Using Experimental and Theoretical Methods. Energy Fuels 2018, 32, 3368–3379. [Google Scholar] [CrossRef]
- Fu, J.; Chen, M.; Chen, L.; Shao, R.; Li, Y.; Chen, Z.; Xin, J.; Pan, Y. Reservoir Permeability Prediction Method Based on Fuzzy Clustering and Machine Learning. Chem. Technol. Fuels Oils 2025, 60, 1518–1527. [Google Scholar] [CrossRef]
- Behdad, A.; Cuddy, S. Water Saturation Modeling in Carbonate Reservoirs Using the Bulk Volume Water Approach. J. Pet. Explor. Prod. Technol. 2025, 15, 105. [Google Scholar] [CrossRef]
- Okon, A.N.; Adewole, S.E.; Uguma, E.M. Artificial Neural Network Model for Reservoir Petrophysical Properties: Porosity, Permeability and Water Saturation Prediction. Model. Earth Syst. Environ. 2021, 7, 2373–2390. [Google Scholar] [CrossRef]
- Gad, M.; Mahmoud, A.A.; Panagopoulos, G.; Kiomourtzi, P.; Kirmizakis, P.; Elkatatny, S.; bin Waheed, U.; Soupios, P. Predicting Water Saturation in a Greek Oilfield with the Power of Artificial Neural Networks. ACS Omega 2025, 10, 557–566. [Google Scholar] [CrossRef]
- Baziar, S.; Shahripour, H.B.; Tadayoni, M.; Nabi-Bidhendi, M. Prediction of Water Saturation in a Tight Gas Sandstone Reservoir by Using Four Intelligent Methods: A Comparative Study. Neural Comput. Appl. 2018, 30, 1171–1185. [Google Scholar] [CrossRef]
- Sutiadi, A.; Taufiq Fathaddin, M. Estimating the Porosity and Initial Water Saturation in South Structure of X Field Using Artificial Neural Network. IOP Conf. Ser. Earth Environ. Sci. 2025, 1451, 012032. [Google Scholar] [CrossRef]
- Shehata, A.A.; Ahmed, M.; Kassem, A.A.; Abdelrehim, R.; Tsuji, T.; Ismail, A. Optimizing Permeability and Porosity Prediction with Advanced Machine Learning: A Case Study Unlocking the Complexities of Late Cretaceous Reservoirs, Gulf of Suez, Egypt. J. Afr. Earth Sci. 2025, 228, 105670. [Google Scholar] [CrossRef]
- Singh, H.; Seol, Y.; Myshakin, E.M. Prediction of Gas Hydrate Saturation Using Machine Learning and Optimal Set of Well-Logs. Comput. Geosci. 2021, 25, 267–283. [Google Scholar] [CrossRef]
- Akbari, A.; Rahimi, M. Estimation of water saturation in an oil reservoir using nine different machine learning techniques: A case study. Geosystem Eng. 2025, 1–27. [Google Scholar] [CrossRef]
- Wang, Y.; Zheng, L.; Chen, G.; Kong, M.; Yuan, L.; Wang, B.; Hu, L.; Jiang, T.; Zhou, F. A Genetic Particle Swarm Optimization with Policy Gradient for Hydraulic Fracturing Optimization. SPE J. 2024, 30, 560–572. [Google Scholar] [CrossRef]
- Li, M.; Li, W.; Gu, M.; Wu, S.; Wang, P.; Wang, Y.; Cao, Q.; Xu, Z.; Hao, Y. Reservoir Characteristics and Shale Oil Enrichment of Shale Laminae in the Chang 7 Member, Ordos Basin. Energies 2025, 18, 5342. [Google Scholar] [CrossRef]
- Pang, Q.; Hu, G.; Hu, C.; Meng, F.; Wang, B.; Zhang, J. The Lithofacies of Sandstones Interbedded with Shales: Implication for Organic Matter Accumulation of Triassic Deep Lacustrine Setting, Southern Ordos Basin. ACS Omega 2024, 9, 23266–23282. [Google Scholar] [CrossRef]
- Yang, Z.; Wu, S.; Zhang, J.; Zhang, K.; Xu, Z. Diagenetic Controls on the Reservoir Quality of Tight Reservoirs in Digitate Shallow-Water Lacustrine Delta Deposits: An Example from the Triassic Yanchang Formation, Southwestern Ordos Basin, China. Mar. Pet. Geol. 2022, 144, 105839. [Google Scholar] [CrossRef]
- Wang, W.; Dang, H.; Kang, S.; Xiao, Q.; Ding, L.; Shi, L. Porosity Prediction of Tight Oil Reservoirs Based on LightGBM and SHAP Algorithms. Oil Gas Geol. Recovery 2025, 32, 90–99. (In Chinese) [Google Scholar] [CrossRef]
- Zhang, Y.; Shi, B.; Zhang, Y.; Shi, H.; Wen, W.; Zhang, Y. Application of Machine Learning for Porosity Estimation of Beach and Bar Sand Bodies in a Lacustrine Basin: A case study of the Lower Cretaceous strata in Chepaizi area, Junggar Basin, NW China. Acta Sedimentol. Sin. 2023, 41, 1559–1567. [Google Scholar]
- Davoudi, A.; Kalantariasl, A.; Parsaei, R.; Parsaei, H. Estimating Permeability Impairment Due to Asphaltene Deposition during the Natural Oil Depletion Process Using Machine Learning Techniques. Geoenergy Sci. Eng. 2023, 230, 212225. [Google Scholar] [CrossRef]
- Chen, T. XGBoost: A Scalable Tree Boosting System; Cornell University: Ithaca, NY, USA, 2016. [Google Scholar]
- Mahayana, D. Data-Driven LightGBM Controller for Robotic Manipulator. IEEE Access 2024, 12, 40883–40893. [Google Scholar] [CrossRef]
- Mwakipunda, G.C.; Komba, N.A.; Kouassi, A.K.F.; Ayimadu, E.T.; Mgimba, M.M.; Ngata, M.R.; Yu, L. Prediction of Hydrogen Solubility in Aqueous Solution Using Modified Mixed Effects Random Forest Based on Particle Swarm Optimization for Underground Hydrogen Storage. Int. J. Hydrog. Energy 2024, 87, 373–388. [Google Scholar] [CrossRef]
- Krennmair, P.; Schmid, T. Flexible Domain Prediction Using Mixed Effects Random Forests. J. R. Stat. Soc. Ser. C Appl. Stat. 2022, 71, 1865–1894. [Google Scholar] [CrossRef]
- Wang, D.; Tan, D.; Liu, L. Particle Swarm Optimization Algorithm: An Overview. Soft Comput. 2018, 22, 387–408. [Google Scholar] [CrossRef]
- Chu, C.C.F.; Chan, D.P.K. Feature Selection Using Approximated High-Order Interaction Components of the Shapley Value for Boosted Tree Classifier. IEEE Access 2020, 8, 112742–112750. [Google Scholar] [CrossRef]
- Dai, Z.; Li, S.; Hu, B.; Kong, X.; Zhang, J.; Zhu, B.; Wei, Q. Machine Learning-Based Prediction of the Migration Range of Dissolved CO2 in Deep Saline Aquifers: SHAP Interpretation and Engineering Insights. Energy Fuels 2025, 39, 18924–18934. [Google Scholar] [CrossRef]
- He, Z.; Yang, Y.; Fang, R.; Zhou, S.; Zhao, W.; Bai, Y.; Li, J.; Wang, B. Integration of Shapley Additive Explanations with Random Forest Model for Quantitative Precipitation Estimation of Mesoscale Convective Systems. Front. Environ. Sci. 2023, 10, 1057081. [Google Scholar] [CrossRef]
- Nazari, H.; Hajizadeh, F. Prediction of Oil Reservoir Porosity Using Petrophysical Data and a New Intelligent Hybrid Method. Pure Appl. Geophys. 2023, 180, 4261–4274. [Google Scholar] [CrossRef]
- Jialong, L.; Yuanku, M. Machine learning applications in distinguishing granite genesis types. China J. Geol. 2025, 60, 1509–1529. [Google Scholar] [CrossRef]











| Model | Core Parameters | Search Range | Optimal Value | Training Duration(s) |
|---|---|---|---|---|
| PSO+Xgboost | Learning-rate | 0.01~0.5 | 0.1013 | 0.070 |
| Max-depth | 2~256 | 57 | ||
| Minimum of samples per leaf | 1~100 | 49.3 | ||
| Number of trees | 50–500 | 345 | ||
| Gamma | 0~10 | 9.035 | ||
| PSO+Lightgbm | Learning-rate | 0.01~0.5 | 0.1466 | 0.034 |
| Max-depth | 2~256 | 169 | ||
| Minimum of samples per leaf | 1~100 | 8 | ||
| L1 Regularization term | 0~20 | 17.818 | ||
| L2 Regularization term | 0~20 | 13.46 | ||
| PSO+MERF | Max-depth | 2~256 | 241 | 2.184 |
| Minimum of Samples per Leaf | 1~100 | 24 | ||
| Number of trees | 50–500 | 241 |
| Model | Training Set-Swa (%) | Testing Set-Swa (%) |
|---|---|---|
| PSO+Xgboost | 89.4 | 81.5 |
| PSO+Lightgbm | 91.8 | 85.2 |
| PSO+MERF | 94.9 | 83.3 |
| Model | Core Plugs/Piece | Water Saturation from Core Analysis (%) | Model-Predicted Water Saturation (%) | R2 | Swa | ||
|---|---|---|---|---|---|---|---|
| Range | Mean | Range | Mean | ||||
| PSO+XGBoost | 118 | 26.65–83.85 | 50.43 | 33.9–74.5 | 56.54 | 80.3 | 76.8 |
| PSO+LightGBM | 28.7–80.7 | 50.64 | 88.9 | 82.3 | |||
| PSO+MERF | 33.4–78.3 | 50.51 | 87.8 | 81.4 | |||
| Archie | 22.4–89.6 | 55.22 | 72.8 | 67.5 | |||
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Li, L.; Tan, C.; Xiao, L.; Wei, Q.; Dang, H.; Kang, S.; Liang, W.; Dong, X.; Liu, L. Prediction of Water Saturation in Lacustrine Tight Reservoirs of Chang8 in the Central Ordos Basin—Based on the PSO+LightGBM Model. Processes 2026, 14, 42. https://doi.org/10.3390/pr14010042
Li L, Tan C, Xiao L, Wei Q, Dang H, Kang S, Liang W, Dong X, Liu L. Prediction of Water Saturation in Lacustrine Tight Reservoirs of Chang8 in the Central Ordos Basin—Based on the PSO+LightGBM Model. Processes. 2026; 14(1):42. https://doi.org/10.3390/pr14010042
Chicago/Turabian StyleLi, Lusheng, Chengqian Tan, Ling Xiao, Qinlian Wei, Hailong Dang, Shengsong Kang, Weiwei Liang, Xu Dong, and Ling Liu. 2026. "Prediction of Water Saturation in Lacustrine Tight Reservoirs of Chang8 in the Central Ordos Basin—Based on the PSO+LightGBM Model" Processes 14, no. 1: 42. https://doi.org/10.3390/pr14010042
APA StyleLi, L., Tan, C., Xiao, L., Wei, Q., Dang, H., Kang, S., Liang, W., Dong, X., & Liu, L. (2026). Prediction of Water Saturation in Lacustrine Tight Reservoirs of Chang8 in the Central Ordos Basin—Based on the PSO+LightGBM Model. Processes, 14(1), 42. https://doi.org/10.3390/pr14010042

