Research on Wellbore Trajectory Optimization and Drilling Control Based on the TD3 Algorithm
Abstract
1. Introduction
2. Related Work
2.1. Deep Q Network
2.2. Deep Deterministic Strategy Gradient
2.3. Double-Delay Depth Deterministic Policy Gradient Algorithm
2.4. TD3 Network
3. Reinforcement Learning Environment Modeling and Design
3.1. State Space Design
3.2. Action Space Design
3.3. Reward Function Design
4. Experimental Design and Result Analysis
4.1. Experimental Settings
4.2. Three-Stage Wellbore Trajectory Design Based on Reinforcement Learning
4.3. Borehole Design Under Multi-Source Obstacle Coupling Based on Reinforcement Learning
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Wang, P.T. Research on Wellbore Trajectory Prediction and Optimized Design Method of Future Drilling Trajectory. Master’s Thesis, China University of Petroleum (Beijing), Beijing, China, 2022. [Google Scholar]
- Du, X. Research on Intelligent Optimization Method of Drilling Parameters Based on Machine Learning. Master’s Thesis, China University of Petroleum (Beijing), Beijing, China, 2022. [Google Scholar]
- Huang, M.; Zhou, K.; Wang, L.; Zhou, J. Application of long short-term memory network for wellbore trajectory prediction. Pet. Sci. Technol. 2023, 44, 3185–3204. [Google Scholar] [CrossRef]
- Dai, X.P. Research on an Intelligent Guidance Method for 3D Wellbore Trajectory Based on Deep Learning. Master’s Thesis, China University of Petroleum (Beijing), Beijing, China, 2022. [Google Scholar]
- Gao, Y.; Wang, N.; Ma, Y. L2.SSA-LSTM prediction model of steering drilling wellbore trajectory. IEEE Access 2023, 12, 450–461. [Google Scholar] [CrossRef]
- Gao, Y.; Wang, N.; Ma, Y. Wellbore Trajectory Prediction Method, System, Device and Medium Based on Deep Learning and Digital Twin. CN202410081486, 7 November 2024. [Google Scholar]
- Li, Z.; Song, X.; Wang, Z.; Jiang, Z.; Pan, T.; Zhu, Z. Real-Time Prediction of Wellbore Trajectory with a Dual-Input GRU (Di-GRU) Model. In Proceedings of the Offshore Technology Conference Asia; OTC: Houston, TX, USA; Kuala Lumpur, Malaysia, 2024; D021S015R003. [Google Scholar]
- Jiang, S.Z. Research on Optimal Control Model, Algorithm, and Application of Non-Straight Well Trajectory. Ph.D. Thesis, Dalian University of Technology, Dalian, China, 2002. [Google Scholar]
- Yang, H.C. Optimization and Control Technology for Unconventional Oil and Gas Well Trajectories Under the “Well Factory” Model. Inn. Mong. Petrochem. Ind. 2013, 39, 114–115. [Google Scholar]
- Yin, S. Research on the Optimization Design of Horizontal Wellbore Trajectory for Shale Gas in Southern Sichuan. Master’s Thesis, Southwest Petroleum University, Chengdu, China, 2014. [Google Scholar]
- Li, C.F.; Ran, F.; Wen, S.Z.; Ke, G.G.; Chen, L. Optimization Technology and Application of Horizontal Well Trajectory for Ultra-Deep Thin Reservoirs in Yuanba Gas Field, Sichuan Basin. Glob. Geol. 2021, 40, 354–363+374. [Google Scholar]
- Gong, F.J.; Wu, J.Z.; Wang, W. Multi-Objective Optimization of Well Trajectory Design in Geologically Uncertain Formations. Petrochem. Appl. 2016, 35, 18–20. [Google Scholar]
- Liu, M.S.; Fu, J.H.; Bai, J. Optimization Design and Application of Shale Gas Dual-Dimensional Horizontal Well Trajectory. Spec. Oil Gas Reserv. 2016, 23, 147–150+158. [Google Scholar]
- Bai, J.P. Modeling and Application Research on the Optimization Design of 3D Wellbore Trajectory Based on Factory Operation. Master’s Thesis, Xi’an Shiyou University, Xi’an, China, 2017. [Google Scholar]
- Zhang, L.; Zhang, Y.C.; Dong, P.H.; Yue, M.; Hou, X.X. Research on Key Technologies for Drilling of Shallow Large Displacement Horizontal Wells in Bohai Oilfield. Unconv. Oil Gas 2022, 9, 10–17. [Google Scholar]
- Huang, W.D. Multi-Objective Optimization of Geological Drilling Trajectories with Nonlinear Constraints and Parameter Uncertainty. Ph.D. Thesis, China University of Geosciences, Beijing, China, 2022. [Google Scholar]
- Liu, X.L.; Qiang, Z.Z.; Huang, Y.G.; Fei, S.X.; Wang, J.C.; Cui, Y.H.; Wang, Z.B.; Zhang, Z.T. Application of 3D Geological Modeling Technology for Horizontal Well Based on Data Fusion of Multiple Sources. In Proceedings of the International Field Exploration and Development Conference, Xi’an, China, 16–18 November 2022; Springer: Singapore, 2022. [Google Scholar]
- Fang, C.; Wang, Q.; Jiang, H.; Chen, Z.W.; Wang, Y.; Zhai, W.B. Shale Wellbore Stability and Well Trajectory Optimization: A Case Study from Changning, Sichuan, China. Pet. Sci. Technol. 2022, 41, 564–585. [Google Scholar] [CrossRef]
- Qin, Z.L. Research on the Orbit Design and Trajectory Control of Horizontal Wells in Loose Sandstone-Mudstone Formations in Niger. Master’s Thesis, China University of Petroleum (Beijing), Beijing, China, 2023. [Google Scholar]
- Liu, H. Research on an Intelligent Downhole Full Closed-Loop Guided Drilling Method Based on Reinforcement Learning. Master’s Thesis, China University of Petroleum (Beijing), Beijing, China, 2020. [Google Scholar]
- Fan, C. Research on an Intelligent Obstacle Avoidance Guided Drilling Algorithm Incorporating Spatial Attention Mechanism. Master’s Thesis, China University of Petroleum (Beijing), Beijing, China, 2022. [Google Scholar]
- Wang, F. A Method of Adaptive Trajectory Tracking Control of Wellbore Based on Reinforcement Learning. Master’s Thesis, China University of Petroleum (Beijing), Beijing, China, 2022. [Google Scholar]
- Jian, Z. Research on the Optimization of Horizontal Well Trajectory Based on Drilling, Logging and Seismic Results. Master’s Thesis, Northeast Petroleum University, Daqing, Chian, 2023. [Google Scholar]
- Peshkov, G.; Pavlov, M.; Katterbauer, K.; Shehri, A.A. Real-Time AI Geosteering for Horizontal Well Trajectory Optimization. In Proceedings of the SPE Annual Caspian Technical Conference, Baku, Azerbaijan, 21–23 November 2023; SPE: Richardson, TX, USA, 2023. D031S017R007. [Google Scholar]
- Yu, Y.; Chen, W.; Liu, Q.; Chau, M.; Vesselinov, V.; Meehan, R. Training an Automated Directional Drilling Agent with Deep Reinforcement Learning in a Simulated Environment. In Proceedings of the SPE/IADC International Drilling Conference and Exhibition, Stavanger, Rogaland, Norway, 9–11 March 2021. [Google Scholar]
- Vishnumolakala, N.; Kesireddy, V.; Dey, S.; Gildin, E.; Losoya, E.Z. Optimizing Well Trajectory Navigation and Advanced Geo-Steering Using Deep-Reinforcement Learning. In Proceedings of the SPE Annual Caspian Technical Conference, Baku, Azerbaijan, 21–23 November 2023; SPE: Richardson, TX, USA, 2023. D021S012R003. [Google Scholar]
- Zhu, D.; Xu, Q.; Wang, F.; Chen, D.; Ye, Z.; Zhou, H.; Zhang, K. A Target-Aware Well Path Control Method Based on Transfer Reinforcement Learning. SPE J. 2024, 29, 1730–1741. [Google Scholar] [CrossRef]
- Sutton, R.S.; Barto, A.G. Reinforcement learning: An introduction. IEEE Trans. Neural Netw. 1998, 9, 1054. [Google Scholar] [CrossRef]
- Mnih, V.; Kavukcuoglu, K.; Silver, D.; Rusu, A.A.; Veness, J.; Bellemare, M.G.; Graves, A.; Riedmiller, M.; Fidjeland, A.K.; Ostrovski, G. Human-level control through deep reinforcement learning. Nature 2015, 518, 529–533. [Google Scholar] [CrossRef] [PubMed]
- Lillicrap, T.P.; Hunt, J.J.; Pritzel, A.; Heess, N.; Erez, T.; Tassa, Y.; Silver, D.; Wierstra, D. Continuous control with deep reinforcement learning. arXiv 2015, arXiv:1509.02971. [Google Scholar] [CrossRef]
- Arulkumaran, K.; Deisenroth, M.P.; Brundage, M.; Bharath, A.A. Deep reinforcement learning: A brief survey. IEEE Signal Process. Mag. 2017, 34, 26–38. [Google Scholar] [CrossRef]
- Zhang, F.; Li, J.; Li, Z. A TD3-based multi-agent deep reinforcement learning method in mixed cooperation-competition environment. Neurocomputing 2020, 411, 206–215. [Google Scholar] [CrossRef]
- Yuan, X.; Wang, Y.; Zhang, R.; Gao, Q. Reinforcement learning control of hydraulic servo system based on TD3 algorithm. Machines 2022, 10, 1244. [Google Scholar] [CrossRef]
- Yin, C.; Huang, Z. Application of TD3 algorithm with adaptive. In Proceedings of the 2nd International Conference on the Frontiers of Robotics and Software Engineering (FRSE 2024), Guiyang, China, 14–16 June 2024; Springer Nature: Cham, Switzerland, 2024. [Google Scholar]
- Fujimoto, S.; Hoof, H.; Meger, D. Addressing function approximation error in actor-critic methods. In Proceedings of the International Conference on Machine Learning, Stockholm, Sweden, 10–15 July 2018; PMLR: New York, NY, USA, 2018. [Google Scholar]
- Tan, H. Reinforcement learning with deep deterministic policy gradient. In Proceedings of the 2021 International Conference on Artificial Intelligence, Big Data and Algorithms (CAIBDA), Xi’an, China, 28–30 May 2021; IEEE: Piscataway, NJ, USA, 2021. [Google Scholar]
- Silver, D.; Lever, G.; Heess, N.; Degris, T.; Wierstra, D.; Riedmiller, M. Deterministic policy gradient algorithms. In Proceedings of the International Conference on Machine Learning, Beijing, China, 21–26 June 2014; PMLR: New York, NY, USA, 2014. [Google Scholar]
Characteristics | DDPG | TD3 | Theoretical Advantages |
---|---|---|---|
Q value estimation | Single network, overestimation | Dual network, min suppression | Deviation reduction |
Strategy update frequency | Real-time update | Update with a delay of steps | Variance reduction |
Target stability | Hard update | Soft update () | Tracking error reduction 40% |
Convergence rate | The sample efficiency is increased by times. |
Serial Number | Category | Configuration |
---|---|---|
1 | Processor | Intel Core i7-12700K |
2 | Memory | 32GB RAM |
3 | Storage | 1TB SSD |
4 | Operating System | Windows 11/Ubuntu 20.04 |
5 | Programming Language | Python 3.9 |
6 | Libraries Used | baselines3, gym, NumPy, Matplotlib |
7 | Learning Rate | 0.0003 |
8 | Adjustment Step Size | 1024 |
9 | Batch Size | 64 |
10 | clip_range | 0.2 |
Serial Number | North | East | Height | Serial Number | North | East | Height |
---|---|---|---|---|---|---|---|
1 | 0 | 0 | 0 | 14 | 531.86 | 443.22 | 800 |
2 | 0 | 0 | 215 | 15 | 600.99 | 500.83 | 800 |
3 | 1.15 | 0.96 | 244.96 | 16 | 670.12 | 558.44 | 800 |
4 | 11.47 | 9.56 | 333.87 | 17 | 739.26 | 616.05 | 800 |
5 | 31.89 | 26.57 | 419.77 | 18 | 808.39 | 673.66 | 800 |
6 | 61.93 | 51.61 | 500.73 | 19 | 877.52 | 731.27 | 800 |
7 | 100.94 | 84.11 | 574.94 | 20 | 946.66 | 788.88 | 800 |
8 | 148.02 | 123.35 | 640.73 | 21 | 1015.79 | 846.49 | 800 |
9 | 202.13 | 168.44 | 696.62 | 22 | 1084.92 | 904.1 | 800 |
10 | 262.05 | 218.37 | 741.36 | 23 | 1154.05 | 961.71 | 800 |
11 | 326.43 | 272.02 | 773.94 | 24 | 1181.71 | 984.76 | 800 |
12 | 393.82 | 328.19 | 793.64 | 25 | 1188.62 | 990.52 | 800 |
13 | 462.73 | 385.61 | 800 | 26 | 1200 | 1000 | 800 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Gu, H.; Wu, Y.; Li, X.; Hou, Z. Research on Wellbore Trajectory Optimization and Drilling Control Based on the TD3 Algorithm. Appl. Sci. 2025, 15, 7258. https://doi.org/10.3390/app15137258
Gu H, Wu Y, Li X, Hou Z. Research on Wellbore Trajectory Optimization and Drilling Control Based on the TD3 Algorithm. Applied Sciences. 2025; 15(13):7258. https://doi.org/10.3390/app15137258
Chicago/Turabian StyleGu, Haipeng, Yang Wu, Xiaowei Li, and Zhaokai Hou. 2025. "Research on Wellbore Trajectory Optimization and Drilling Control Based on the TD3 Algorithm" Applied Sciences 15, no. 13: 7258. https://doi.org/10.3390/app15137258
APA StyleGu, H., Wu, Y., Li, X., & Hou, Z. (2025). Research on Wellbore Trajectory Optimization and Drilling Control Based on the TD3 Algorithm. Applied Sciences, 15(13), 7258. https://doi.org/10.3390/app15137258