Research on the Supporting Dynamics and Adaptive Intelligent Control Method for Hydraulic Support
Abstract
1. Introduction
2. Kinematic Analysis of Hydraulic Supports and the Supporting Environment
2.1. Kinematics Analysis of Hydraulic Support
2.2. Support Environment of Hydraulic Support
3. Reinforcement Learning Model for the Supporting Pose Based on Markov Decision Process
3.1. Supporting Action Space
3.2. Supporting State Space
3.3. Supporting Reward Function
- (1)
- Support Height Reward Function
- (2)
- Support Angle Reward Function
- (3)
- Support Resistance Reward Function
- (4)
- Resultant Force Action Point Reward Function
- (5)
- Comprehensive Reward Function Design
4. Autonomous Decision-Making and Control Method for the Supporting Pose
4.1. Policy Gradient-Based Proximal Policy Optimization
4.2. Progressive Neural Network-Based Transfer Policy
4.3. Network Structure of Proposed Algorithm
5. Experiment
5.1. Experiment Platform Construction
5.2. Training in Simulation and Transfer to Reality
5.3. Result Analysis and Discussion
- (1)
- The accuracy of the MDP for the support pose. This paper is the first in the field of hydraulic support research to define the MDP of the hydraulic support process and to propose a support pose control model based on an MDP. The supporting action space is defined based on the kinematic theory of hydraulic supports, the supporting state space is defined based on dynamic theory, and the reward function is defined according to the dynamic coupling relationship between the hydraulic support and the coal seam roof. As this is the first proposal, there remains substantial room for further research and optimization of the support pose MDP in the future.
- (2)
- Modeling errors in the Gazebo-based hydraulic support simulator. Modeling hydraulic supports in Gazebo requires simplifying key factors such as hydraulic dynamics (e.g., pressure–flow coupling and delays), structural flexibility, contact loads, and friction. Reinforcement learning strategies are optimized on this idealized model and often exploit these unrealistic assumptions. When transferred to the real environment, due to modeling errors such as nonlinearity, time delays, uncertain loads, and structural deformation in the actual system, the control performance of the strategy may degrade, manifesting as response lag, oscillation, insufficient accuracy, or even support instability, thereby affecting safety and reliability. To address this issue, this paper adopts randomization and robust training to cover uncertainties during training, including randomization of friction, time delay, load, and roof stiffness. However, experimental results still show a gap. Therefore, there remains room for further exploration and improvement in simulation and training methods.
- (3)
- PNN reuses existing features through lateral connections while learning to compensate for differences in the real system (such as hydraulic lag, friction, and load uncertainty), thereby largely mitigating modeling errors and improving the adaptability and stability of the strategy under complex real working conditions. However, the mining environment in which hydraulic supports operate is highly dynamic and complex, and PPO exhibits issues with exploration efficiency in such environments. To address this, this paper mainly mixes a batch of pre-collected stable-operation support data during training to reduce ineffective exploration. Therefore, from the perspective of algorithm optimization, there is still room to improve the final reward through other approaches.
6. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Wang, H.; Cao, Y.; Wang, H. Research and practice on key technologies for intelligentization of coal mine. Coal Geol. Explor. 2023, 51, 44–54. [Google Scholar]
- Peng, S.S.; Du, F.; Cheng, J.; Li, Y. Automation in U.S. longwall coal mining: A state-of-the-art review. Int. J. Min. Sci. Technol. 2019, 29, 151–159. [Google Scholar] [CrossRef]
- Pang, Y.; Shi, Y. Intelligent control algorithms for posture and height control of four-leg hydraulic supports. Sci. Rep. 2025, 15, 3010. [Google Scholar] [CrossRef]
- Lindsay, J.; Hall, A.; Cai, M.; Simser, B. Mitigating geotechnical challenges in deep mining: Lessons learned from shaft station excavations at extreme depths. Can. Geotech. J. 2025, 62, 1–23. [Google Scholar] [CrossRef]
- Shakeri, J.; Ghorbani, E.; Taheri, A. A review of deep mining challenges, hazard mitigation, design approaches, and support methods. Undergr. Space 2026, 28, 86–136. [Google Scholar] [CrossRef]
- Peng, S.; Cheng, Z.; Che, L.; Zheng, Y.; Cao, S. Kinematic performance analysis of a parallel mechanism for loading test of hydraulic support. Mech. Mach. Theory 2022, 168, 104592. [Google Scholar] [CrossRef]
- Ren, H.; Zhao, G.; Zhou, J.; Wen, Z.; Ding, Y.; Li, S. Key technologies of all position and orientation monitoring and virtual simulation and control for smart mining equipment. J. China Coal Soc. 2020, 45, 956–971. [Google Scholar]
- Bai, J.; Hou, C. Control principle of surrounding rocks in deep roadway and its application. J. China Univ. Min. Technol. 2006, 35, 145–148. [Google Scholar]
- Gwiazda, A. Design of the roof support with strait-Line mechanism. Adv. Mater. Res. 2014, 1036, 553–558. [Google Scholar] [CrossRef]
- Xu, Y.; Wang, G.; Ren, H. Theory of coupling relationship between surrounding rocks and powered support. J. China Coal Soc. 2015, 40, 2528–2533. [Google Scholar]
- Lu, X.; Wang, Z.; Tan, C.; Yan, H.; Si, L.; Wei, D. A portable support attitude sensing system for accurate attitude estimation of hydraulic support based on unscented kalman filter. Sensors 2020, 20, 5459. [Google Scholar] [CrossRef]
- Ma, K.; Xie, J.; Guo, X.; Wang, X.W.; Wang, X.S.; Wang, L.J. A pose monitoring method for floating connection mechanism of a hydraulic support based on virtual-real integration. Gong-Kuang Zidonghua 2025, 51, 158–163. [Google Scholar]
- Mei, Z.; Wang, X.; Xie, J.; Li, S.; Liu, J. A sensing system and solving method for dynamic detection of relative pose of hydraulic support group. Measurement 2025, 243, 116145. [Google Scholar] [CrossRef]
- You, X.; Ge, S. Research on decision-making and control technology for hydraulic supports based on digital twins. Symmetry 2025, 16, 1316. [Google Scholar] [CrossRef]
- Feng, Z.; Xie, J.; Yan, Z.; Mei, Z.; Zheng, Z.; Li, T. An information processing method of software and hardware coupling for VR monitoring of hydraulic support groups. Multimed. Tools Appl. 2023, 82, 19067–19089. [Google Scholar] [CrossRef]
- Yao, Y.; Zhang, J.; Xiong, W. Method of tail beam posture prediction of top coal caving hydraulic support based on LSTM. Coal Sci. Technol. 2025, 53, 362–371. [Google Scholar]
- Zhang, Y.; Zhang, H.; Gao, K.; Zeng, Q.; Meng, F.; Cheng, J. Research on intelligent control system of hydraulic support based on position and posture detection. Machines 2023, 11, 33. [Google Scholar] [CrossRef]
- Sutton, R.; Barto, A. Reinforcement Learning: An Introduction; MIT Press: Cambridge, MA, USA, 1998. [Google Scholar]
- Yang, S.; Zhang, M.; Feng, X.; Hua, Y.; Cao, Y. Deep reinforcement learning-based knowledge graph reasoning for autonomous driving systems. IEEE Trans. Ind. Inform. 2026, 22, 3341–3351. [Google Scholar] [CrossRef]
- Zhang, H.; Wang, J.-Q.; Zhang, S.; Jiang, Y.; Li, M.; Yong, B.; Zhou, Q.; Zhou, X. Generative policy-driven HAC reinforcement learning for autonomous driving incident response. Futur. Gener. Comput. Syst. 2026, 175, 108106. [Google Scholar] [CrossRef]
- Leonardo, L.R.; Leticia, B.; Paula, P.C.; Marcos, V.S.A.; Luiz, M.G.G. Dual or unified: Optimizing drive-based reinforcement learning for cognitive autonomous robots. Cogn. Syst. Res. 2026, 95, 101422. [Google Scholar]
- Kim, T.; Choi, M.; Choi, S.; Yoon, T.; Choi, D. Deep reinforcement learning control of a hexapod robot. Actuators 2026, 15, 33. [Google Scholar] [CrossRef]
- Barros, P.; Yalçın, Ö.N.; Tanevska, A.; Sciutti, A. Incorporating rivalry in reinforcement learning for a competitive game. Neural Comput. Appl. 2022, 35, 16739–16752. [Google Scholar] [CrossRef]
- Florian, R.; Manuel, E.; Kai, E. Simulation-driven balancing of competitive game levels with reinforcement learning. IEEE Trans. Games 2024, 16, 903–913. [Google Scholar] [CrossRef]
- Parnada, A.; Qu, M.; Castellani, M.; Chang, H.J.; Wang, Y. Towards cost-effective and safe contact-rich robotic manipulation with reinforcement learning: A review of techniques for future industrial automation. Proc. Inst. Mech. Eng. Part I J. Syst. Control. Eng. 2026, 240, 3–35. [Google Scholar] [CrossRef]
- ElMenshawy, M.; Wu, L.; Gue, B.; AbouRizk, S. Automating pipe spool fabrication shop scheduling for modularized industrial construction projects using reinforcement learning. J. Comput. Civ. Eng. 2025, 39, 04025013. [Google Scholar] [CrossRef]
- Wang, G.; Peng, J.; Guan, C.; Chen, J.; Guo, B. Multi-drone collaborative shepherding through multi-task reinforcement learning. IEEE Robot. Autom. Lett. 2024, 9, 10311–10318. [Google Scholar] [CrossRef]
- Rusu, A.; Vecerik, M.; Rothörl, T.; Pascanu, R.; Hadsell, R. Sim-to-Real Robot Learning from Pixels with Progressive Nets. arXiv 2016. [Google Scholar] [CrossRef]
- Rusu, A.; Rabinowitz, N.; Desjardins, G.; Soyer, H.; Kirkpatrick, J.; Kavukcuoglu, K.; Hadsell, R. Progressive Neural Networks. arXiv 2016, arXiv:1606.04671. [Google Scholar] [CrossRef]
- Zhu, Z.; Lin, K.; Jain, A.; Zhou, J. Transfer learning in deep reinforcement learning: A survey. IEEE Trans. Pattern Anal. Mach. Intell. 2023, 45, 13344–13362. [Google Scholar] [CrossRef] [PubMed]
- Tiwari, R.; Khapre, S.; Singh, A. Reinforcement learning in robotic systems: A review on sim-to-real transfer. Robot. Auton. Syst. 2026, 198, 105327. [Google Scholar] [CrossRef]
- Qian, M.; Miao, X.; He, F. Mechanism of coupling effect between supports in the workings and the rocks. J. China Coal Soc. 1996, 21, 21. [Google Scholar]
- Ma, Q.; Zhao, X.; Song, Z. Break of main roof ahead of workface and ground pressure. J. China Coal Soc. 2001, 26, 473–477. [Google Scholar]
- Tan, Y.; Wu, S.; Yin, Z. Mining Pressure and Strata Control; China Coal Industry Publishing House: Beijing, China, 2011. [Google Scholar]
- Deshpande, S.; Walambe, R.; Kotecha, K.; Selvachandran, G.; Abraham, A. Advances and applications in inverse reinforcement learning: A comprehensive review. Neural Comput. Appl. 2025, 37, 11071–11123. [Google Scholar] [CrossRef]















| Stabilizing Ram Thrust | Constant | Increase | Decrease | |
|---|---|---|---|---|
| Leg Thrust | ||||
| Constant | ||||
| Increase | ||||
| Decrease | ||||
| State Variable | Support Height | Support Angle | Resultant Force | Support Resistance Action Point |
|---|---|---|---|---|
| support height | 0.5 | 0.7 | 0.9 | 0.7 |
| support angle | 0.3 | 0.5 | 0.7 | 0.6 |
| resultant force | 0.1 | 0.3 | 0.5 | 0.2 |
| support resistance action point | 0.3 | 0.4 | 0.8 | 0.5 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Lu, X.; Zhang, L.; Wei, D. Research on the Supporting Dynamics and Adaptive Intelligent Control Method for Hydraulic Support. Machines 2026, 14, 496. https://doi.org/10.3390/machines14050496
Lu X, Zhang L, Wei D. Research on the Supporting Dynamics and Adaptive Intelligent Control Method for Hydraulic Support. Machines. 2026; 14(5):496. https://doi.org/10.3390/machines14050496
Chicago/Turabian StyleLu, Xuliang, Lin Zhang, and Dong Wei. 2026. "Research on the Supporting Dynamics and Adaptive Intelligent Control Method for Hydraulic Support" Machines 14, no. 5: 496. https://doi.org/10.3390/machines14050496
APA StyleLu, X., Zhang, L., & Wei, D. (2026). Research on the Supporting Dynamics and Adaptive Intelligent Control Method for Hydraulic Support. Machines, 14(5), 496. https://doi.org/10.3390/machines14050496

