Abstract
This study explores a stochastic guaranteed cost control (GCC) method for time-varying systems with random parameters and asymmetric saturation actuators by employing the integral reinforcement learning (IRL) method in the dynamic event-triggered (DET) mode. Firstly, a modified Hamilton–Jacobi–Isaacs (HJI) equation is formulated, from which the worst-case disturbance policy and the asymmetric saturation optimal control signal are obtained. Secondly, the multivariate probabilistic collocation method (MPCM) is used to evaluate the value function at designated sampling points; the MPCM is introduced to reduce the computational complexity of stochastic dynamic programming (SDP) methods. Furthermore, the DET mode is utilized in solving the SDP problem to reduce the burden on communication resources. Finally, the Lyapunov stability theorem is applied to analyze the stability of the time-varying systems, and simulation results demonstrate the feasibility of the designed method.
1. Introduction
Over the past few decades, stochastic control has been extensively studied across numerous practical domains, including robotic systems [1], microgrid setups [2], and intelligent transportation networks [3]. Compared with deterministic control, stochastic control faces issues such as computational complexity and model mismatch, which make its design challenging [4]. In [5], the stochastic control problem was studied in data-driven model predictive control by using neural networks (NNs) or by establishing a stochastic sampling model. A stochastic control approach applicable to discrete-time nonlinear systems was put forward in [6]. The security issues of stochastic systems were also addressed in [7]. A novel stochastic control method was designed for a linear system with multiplicative noise and input constraints in [8]. For stochastic discrete-time linear systems under bounded input constraints, a receding horizon control technique was formulated in [9]. However, none of the studies mentioned above considered optimal control under asymmetric saturation actuators (ASAs).
Actuator saturation is a common nonlinear problem that can degrade system stability and performance and also poses significant challenges to controller design. In [10], a robust adaptive fuzzy fault-tolerant path planning control method was proposed to solve the actuator saturation problem. In [11], an adaptive compensation technique integrated into the backstepping framework was proposed for controlled systems with actuator saturation. In [12], an adaptive NN control method was proposed for systems with input saturation nonlinearities. These methods consider symmetric saturation actuators; to further optimize control strategies, researchers have proposed ASA control strategies to ensure the safety and stability of the controlled system. In [13], a composite adaptive control method was proposed for tethered aircraft systems with ASAs. Ref. [14] proposed a sliding mode control method with ASAs for vehicle systems. In [15], an asymptotic stabilization method was proposed for unmanned surface vessels with actuator dead zones and yaw constraints. However, the methods mentioned above do not consider the optimal performance of the controlled system. The adaptive dynamic programming (ADP) algorithm, which combines dynamic programming (DP) with reinforcement learning (RL), addresses the “curse of dimensionality” issue in conventional DP [16,17,18,19]. For Itô-type stochastic systems, the authors employed the ADP method to address the tracking control issue in [20]. In [21], a self-learning optimal operation control method based on a compensator and RL was proposed for input-constrained dual-time-scale systems. A Q-learning-based fault-tolerant control method was designed without using knowledge of the system functions in [22]. In [23], an RL-based optimal control (OC) method was devised for linear stochastic systems, with the least squares approach adopted to derive approximate optimal solutions. Further, an integral-RL-based model-free tracking approach was developed for linear stochastic systems in [24]. In [25], an ADP-based GCC method was presented for nonlinear systems. By introducing the GCC approach, one can ensure that the control performance index remains below a certain bound. The above methods may cause unnecessary waste of communication resources because they are designed based on time-triggered control mechanisms.
Event-triggered control (ETC) is a non-periodic control strategy based on triggering conditions; the core idea of ETC is to determine when to update control signals or communicate through preset event-triggered conditions rather than through traditional periodic sampling or time triggering [26,27]. In [28], for Itô-type time-delayed stochastic systems subject to uncertainties, an OC strategy under the ETC mechanism was designed by using the ADP method and constructing an actor–critic architecture. Moreover, an optimal consensus control (OCC) strategy was proposed by using fuzzy technology for stochastic multi-agent systems (MASs) with time delays in [29]. To further decrease the control input update frequency and increase communication resource utilization, the dynamic event-triggered control (DETC) mechanism serves as an upgraded version of the conventional ETC approach; it is a dynamically adjustable mechanism that further improves resource usage, system performance, and adaptability [30]. In [31], under the DETC mechanism, the H2/H∞ problem was explored for partially unknown nonlinear stochastic systems by employing the ADP algorithm. In [32], focusing on uncertain microgrids, control inputs and external disturbances were treated as two players in a zero-sum differential game. Under the DETC mechanism, the authors put forward a frequency recovery control approach based on integral reinforcement learning (IRL). In [33], a dynamically adjustable triggering condition was designed based on the OC method for stochastic nonlinear systems by using backstepping technology and the RL algorithm in the DETC mode.
In addition, it is difficult to obtain accurate mathematical models for actual control systems, and estimating system dynamics through extensive simulation experiments is a conventional approach [34,35]. If random time-varying parameters exist in the dynamics of the actual nonlinear controlled system, the expected control performance function is usually estimated by uncertainty evaluation methods [36], such as the Monte Carlo (MC) approach and its extended versions, which are implemented through simulation experiments [37]. However, approximating the expected control performance index requires a large number of simulation experiments when using the MC method and its extended versions [38]. In [39], to reduce the computational complexity, the MPCM, which is an effective technique for estimating uncertainties, was used to estimate the expected value of the cost function by taking values at certain specific sampling points. Stochastic GCC approaches have been put forward by using RL (or IRL) and MPCM methods in [36,40]. However, the issue of ASAs remains unaddressed. In our design, we put forward a GCC approach in the DETC mode for stochastic systems subject to ASAs by employing the MPCM integrated with IRL algorithms. The main contributions are as follows:
- 1. This study develops an innovative DETC-based GCC approach for stochastic systems by using IRL algorithms with the MPCM. This approach can ensure that the system performance index is less than a certain upper bound.
- 2. By solving an improved HJI equation derived from a modified long-term performance cost function, the control inputs under ASAs can be obtained via an actor–critic–disturbance NN structure.
- 3. Through the introduction of dynamic parameters into the triggering conditions of the DETC mechanism, the update rule for control inputs can be adjusted dynamically, which can reduce the computational complexity of sampling data.
This paper’s structure is arranged as follows. Section 2 presents the issue description. Section 3 develops stochastic systems’ optimal GCC design via a modified HJI equation. A stochastic GCC method is designed by using the MPCM and IRL in Section 4. Section 5 presents the event-triggered structure for optimal GCC. A simulation example is shown to verify the feasibility of the strategy in Section 6. The conclusion is shown in Section 7.
2. Problem Statement
Consider the following stochastic system [36,41]:
where represents a time-varying matrix related to the stochastic vector . Matrices and are the system functions. The disturbance policy, control input, and system state are , and , respectively. The control signal satisfies , , for . and stand for the smallest and largest constraint limits of control inputs . is the sample function of the pth element in the random vector , and the dynamics of the sample system (1) are ordinary differential equations; exhibits good behavior.
Assumption 1.
The system (1) satisfies Lipschitz continuity over the compact set [36,42,43]. In addition, , where is a positive constant.
Remark 1.
Unlike general stochastic models with drift and diffusion terms, Ref. [41] focused on zero-sum game studies of time-varying stochastic systems (1) with environment-influenced random variables. The system model (1) is applicable to practical scenarios. For example, aircraft systems have been modeled as , where K is a weather-dependent random variable, refers to regulated thrust, and is the interference force. Other fields also explore such system dynamics [44].
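To make the role of the random parameter concrete, the following is a minimal Python sketch of simulating one sampled realization of a time-varying system whose drift depends on a random variable drawn from a known pdf, in the spirit of the aircraft example above. The matrices, the pdf, and the coupling chosen here are illustrative placeholders, not the actual model of system (1).

```python
import numpy as np

def sample_trajectory(theta, x0, u_fn, d_fn, dt=0.01, steps=500):
    """Simulate one realization of a system x_dot = A(theta) x + B u + C d.

    theta       : realization of the random parameter (e.g., a weather-dependent gain)
    u_fn, d_fn  : control and disturbance policies, functions of (t, x)
    The matrices below are placeholders used only for illustration.
    """
    A = np.array([[0.0, 1.0], [-1.0, -theta]])   # drift depends on the random parameter
    B = np.array([[0.0], [1.0]])                 # control input matrix (placeholder)
    C = np.array([[0.0], [0.5]])                 # disturbance input matrix (placeholder)
    x = np.array(x0, dtype=float)
    traj = [x.copy()]
    for k in range(steps):
        t = k * dt
        u = u_fn(t, x)
        d = d_fn(t, x)
        x = x + dt * (A @ x + B @ np.atleast_1d(u) + C @ np.atleast_1d(d))  # Euler step
        traj.append(x.copy())
    return np.array(traj)

# One sampled experiment: theta drawn from a uniform pdf (illustrative choice)
theta = np.random.uniform(0.5, 1.5)
traj = sample_trajectory(theta, x0=[1.0, -0.5],
                         u_fn=lambda t, x: -0.5 * x[1],
                         d_fn=lambda t, x: 0.0)
```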
Assumption 2.
Let denote a positive constant; the disturbance approach is such that , with .
The system (1) with nominal form becomes
The positive definite matrix is represented as , and the long-term control performance function is
with
where . For , .
By calculation, we have
where and .
Remark 2.
Ref. [36] studied the GCC problem of stochastic systems under an event-triggered mechanism. However, some practical systems face the nonlinear control problem of ASAs [13,14,15], which limits the application of the method in [36]. Therefore, this paper proposes a GCC strategy for stochastic systems under the ASA condition with . By introducing a non-quadratic function related to the control input ρ to relax the asymmetric constraints and incorporating dynamic parameters into the triggering condition, the number of communications can be effectively reduced, and the utilization rate of communication resources can be further improved.
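As a rough numerical illustration of how a non-quadratic integrand can encode asymmetric input bounds, the sketch below uses a shifted-tanh parameterization of the admissible input interval, which is one common choice in the saturation-ADP literature; the actual function ρ and weighting used in this paper are those defined above, so the specific form here should be read as an assumption.

```python
import numpy as np
from scipy.integrate import quad

u_min, u_max = -0.8, 1.0                 # asymmetric actuator limits (illustrative values)
u_c = 0.5 * (u_max + u_min)              # center of the admissible interval
lam = 0.5 * (u_max - u_min)              # half-width of the interval
R = 1.0                                  # control weighting (scalar case for illustration)

def saturation_cost(u):
    """Non-quadratic control cost W(u) = int_{u_c}^{u} 2 * lam * atanh((v - u_c)/lam) * R dv.

    The integrand grows without bound as v approaches either limit, which is what
    penalizes control signals that come close to the asymmetric saturation bounds.
    """
    integrand = lambda v: 2.0 * lam * np.arctanh((v - u_c) / lam) * R
    val, _ = quad(integrand, u_c, u)
    return val

for u in (0.0, 0.6, 0.95):
    print(f"u = {u:5.2f}  cost = {saturation_cost(u):8.4f}")
```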
Within our design framework, the primary objective is to establish a GCC strategy for systems with ASAs by solving the HJI equation, which can ensure that the cost function is less than a certain upper bound and guarantee the stability and optimal control performance of stochastic systems under the DETC mechanism.
3. Stochastic Optimal GCC Design
In our design, an auxiliary policy is designed as ð, and the auxiliary system is established as
Let , where represents a pre-specified disturbance attenuation level.
The modified cost function is
The value function is
The derivative value of is
Let ; inspired by [36,41], the Hamiltonian function is designed in the form of a mean value and is given by
Bellman’s equation expresses the optimality principle of DP problems. In light of Bellman’s optimality principle, the following holds:
with , , and being the worst-case disturbance policy, optimal control signal, and optimal value function, respectively. By calculation, we have
where .
Based on (8), (12) and (13), the HJI equation is restructured as
where is an identity matrix, which satisfies and .
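For intuition only, the following sketch shows how a saturated control signal can be read off from the gradient of the value function under the shifted-tanh assumption used above; the symbols (B, R, the critic gradient) and their shapes are placeholders, and the exact expressions of this paper are those given in (10) and (11).

```python
import numpy as np

def saturated_control(grad_V, B, R_inv, u_min, u_max):
    """Map the value-function gradient to a control inside [u_min, u_max].

    Assumes u* = u_c - lam * tanh( R^{-1} B^T grad_V / (2 * lam) ),
    a standard form for saturation-aware ADP controllers (illustrative here).
    """
    u_c = 0.5 * (u_max + u_min)
    lam = 0.5 * (u_max - u_min)
    raw = R_inv @ (B.T @ grad_V)          # unconstrained stationarity term
    return u_c - lam * np.tanh(raw / (2.0 * lam))

# Example with placeholder dimensions: 2 states, 1 input
grad_V = np.array([0.8, -1.2])
B = np.array([[0.0], [1.0]])
u = saturated_control(grad_V, B, R_inv=np.array([[1.0]]), u_min=-0.8, u_max=1.0)
print(u)   # always stays within the asymmetric bounds
```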
Assumption 3.
and are denoted as positive numbers. is continuously differentiable, which satisfies .
Remark 3.
Due to the existence of uncertainty and random parameters, the OC problem of system (1) is difficult to solve. The basic idea of the GCC problem is to obtain an upper bound on the cost function of system (1). This article constructs an auxiliary system; by solving for the optimal value function of the auxiliary system, the guaranteed cost (GC) function of the original system (1) can be obtained. That is, the value function of the auxiliary system is the optimal GC function of the original system. A related theorem is given as follows.
Theorem 1.
Considering system (1), define the cost function of auxiliary system (4) as (5); then stochastic system (1) is asymptotically stable in the mean if a differentiable value function satisfies the HJI equation in (14) and the asymmetric constrained control signal in (10) and the auxiliary policy in (11) are utilized.
Proof.
Choose the candidate Lyapunov function as
Through the addition and substitution of , we derive
Using Assumption 2 and the inequality , one obtains
Integrating Equation (18) over the interval , we have
4. Stochastic GCC Method Design via MPCM and IRL Algorithm
In this section, a novel stochastic GCC method is proposed for the controlled system with asymmetric constrained inputs by using the MPCM and IRL algorithms.
Using (7), we derive the integral Bellman equation based on the optimality principle as with . For any admissible control policies , the value function satisfies the condition that
Accordingly, one obtains
where is the Bellman error.
Remark 4.
Equation (21) will be employed in the subsequent on-policy IRL design procedure. In our framework, the IRL algorithm serves to drive the Bellman error in (21) toward zero.
As , converges to after several iterations in the learning process, and concurrently, approaches .
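A minimal sketch of the integral Bellman residual that the critic update drives toward zero: for a candidate value estimate, the cost accumulated over a short window [t, t+T] plus the estimated value at t+T should match the estimated value at t. The cost integrand and the value estimator below are placeholders for the quantities defined above.

```python
import numpy as np

def integral_bellman_error(V_hat, running_cost, x_traj, t_grid):
    """Residual e = int_t^{t+T} cost(x,u,d) ds + V_hat(x(t+T)) - V_hat(x(t)).

    V_hat        : callable, current value-function estimate
    running_cost : array of instantaneous cost values along the trajectory
    x_traj       : array of states sampled on t_grid over one IRL window
    The IRL update tunes the critic weights to push this residual toward zero.
    """
    window_cost = np.trapz(running_cost, t_grid)     # integral term over the window
    return window_cost + V_hat(x_traj[-1]) - V_hat(x_traj[0])

# Toy usage with a quadratic value estimate (illustrative only)
t_grid = np.linspace(0.0, 0.1, 11)
x_traj = np.stack([np.exp(-t_grid), -0.5 * np.exp(-t_grid)], axis=1)
running_cost = np.sum(x_traj**2, axis=1)             # placeholder state cost
V_hat = lambda x: float(x @ np.diag([2.0, 1.0]) @ x)
print(integral_bellman_error(V_hat, running_cost, x_traj, t_grid))
```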
4.1. On-Policy GCC Design
By designing an NN that includes the activation function, the expected weight vector and the approximation error are denoted as , , and , respectively. We approximate the value function as
Let denote the estimated value for . At time step , the estimated value of is
Based on [40] (Th.2), we adopt the MPCM to reduce the number of simulations from to , which enables the prediction of the output mean of the system mapping . In this part, the GCC algorithm for random systems with stochastic uncertainty is presented in Algorithm 1.
The specific steps of Algorithm 1 include the following.
(1) Select a set of sampling points for the uncertain variable based on the MPCM and calculate the cumulative cost function for the future time period at each sampling point. This function is composed of the output estimate of the next evaluation critic NN and the integral term containing control energy consumption and state cost in the current time period. (2) Calculate the mathematical expectation of the function values at all sampling points to obtain the updated value function estimate for this iteration, denoted as . With this estimated value as the objective, update the weight vector of the evaluation NN by solving the HJI equation so that its output accurately matches the value function estimate. (3) Using the gradient information of the updated value function, design control strategy and disturbance strategy , respectively.
Algorithm 1 Model-based GCC algorithm for stochastic uncertain system (1).
Initialization: Set initial admissible policies and .
Step 1: Select a set of sampling points for the uncertain variable on the basis of the MPCM ([40], Section II), and compute the value of at each sampling point.
Step 2: Compute by taking the mean value of .
Step 3: Update by solving the HJI equation.
Step 4: Design the control pairs via . Let s be updated as . If , where is a small positive number, stop at Step 4; otherwise, return to Step 1.
Here, we define a new system function involving the uncertain parameter as . From (20), it follows that . Specifically, select a set of samples according to the probability density functions (pdfs) of the uncertain parameters , and then compute the value of via simulating at these chosen samples. Suppose the degree of each uncertain variable is updated to ; can be expressed as
Using the MPCM, a low-order mapping is used to approximate as follows:
On the basis of [40] (Th.2), one obtains .
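The following is a rough sketch of the MPCM idea under the assumption of independent uncertain parameters: the cost mapping is evaluated only at a small set of collocation points chosen from each parameter's pdf, and the values are combined with the matching weights to estimate the expectation, instead of averaging over many Monte Carlo samples. The Gauss-quadrature construction below is one standard way to obtain such points and weights and is used here purely for illustration.

```python
import numpy as np
from itertools import product

def mpcm_expectation(cost_fn, point_weight_pairs):
    """Estimate E[cost(theta_1, ..., theta_m)] from per-parameter collocation rules.

    point_weight_pairs : list of (points, weights) tuples, one per uncertain parameter,
                         with weights summing to one (e.g., Gauss quadrature on the pdf).
    cost_fn            : mapping from a parameter tuple to the sampled cost value.
    """
    total = 0.0
    for combo in product(*[zip(p, w) for p, w in point_weight_pairs]):
        pts = [c[0] for c in combo]
        wt = np.prod([c[1] for c in combo])
        total += wt * cost_fn(*pts)
    return total

# Illustrative rule: 2 Gauss-Legendre points per uniform parameter on [a, b]
def uniform_rule(a, b):
    nodes, weights = np.polynomial.legendre.leggauss(2)   # nodes on [-1, 1]
    return 0.5 * (b - a) * nodes + 0.5 * (a + b), 0.5 * weights

rules = [uniform_rule(0.5, 1.5), uniform_rule(-0.2, 0.2)]  # two placeholder parameters
est = mpcm_expectation(lambda k1, k2: (1.0 + k1) ** 2 + k2 ** 2, rules)
print(est)   # only 2 x 2 = 4 evaluations instead of many Monte Carlo runs
```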
Lemma 1.
Proof.
Using [40] (Th.2) and the relation , the control pairs can be derived from (10) and (11). To establish this lemma, we need to show that the optimal solution obtained by evaluating is identical to that derived from computing the reduced-order mapping . The equivalence of these two optimal solutions is proven by contradiction, as detailed in [39] (Th.1). □
4.2. IRL-Based GCC Design with Asymmetric Constrained Inputs
Unlike Algorithm 1, an IRL-based GCC method is developed for the system (1) without relying on knowledge of the functions B and C. Based on [45], two exploration signals and are, respectively, added to the control policies and . Consequently, the system (4) becomes
The specific steps of Algorithm 2 include the following.
Algorithm 2 IRL-based GCC algorithm for system (1) with asymmetric constrained control.
Initialization: Set the initial admissible policies and .
Step 1: Choose a set of sampling points for the uncertain variable based on the MPCM ([40], Section II), and compute the value of at each sampling point.
Step 2: Compute by calculating the mean of .
Step 3: Update , and by solving the HJI equation. Set s to . If , where is a chosen positive number, stop at Step 3; otherwise, return to Step 1.
(1) Based on the MPCM, select a set of sampling points for the uncertain variable and calculate the cumulative cost function for the future time period at each sampling point. This function consists of the integral term containing the state penalty and control cost over the current time period, as well as the output estimate of the critic NN at the next time instant. (2) By calculating the mathematical expectation of the function values at all sampling points, the updated value function estimate for this iteration, , is obtained. (3) Solve an HJI equation embedded with the exploration signals and , while updating the value function , control strategy , and disturbance strategy .
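As an informal illustration of the model-free ingredient in Algorithm 2, the sketch below adds small exploration signals to the behavior policies while data are recorded over each IRL window; the learning update then works on the logged state/input data without requiring the input matrices B and C. The probing-signal form, the function names, and the logging layout are assumptions, not this paper's specification.

```python
import numpy as np

def collect_irl_window(step_fn, x0, u_policy, d_policy, t0, T, dt=0.01):
    """Roll the system forward over one IRL window with exploration noise injected.

    step_fn : black-box simulator x_next = step_fn(x, u, d, dt); no model knowledge
              is used beyond the sampled data it returns.
    Returns the final state and the logged (t, x, u, d) tuples that feed the
    integral Bellman equation of Section 4.
    """
    log = []
    x = np.array(x0, dtype=float)
    for k in range(int(round(T / dt))):
        t = t0 + k * dt
        e_u = 0.05 * np.sin(7.0 * t) + 0.03 * np.cos(11.0 * t)   # probing signal (assumed form)
        e_d = 0.02 * np.sin(5.0 * t)
        u = u_policy(x) + e_u
        d = d_policy(x) + e_d
        log.append((t, x.copy(), u, d))
        x = step_fn(x, u, d, dt)
    return x, log

# Toy usage with a placeholder simulator; the critic/actor/disturbance weights
# would be updated after each collected window.
step = lambda x, u, d, dt: x + dt * (np.array([[0.0, 1.0], [-1.0, -1.0]]) @ x
                                     + np.array([0.0, 1.0]) * u
                                     + np.array([0.0, 0.5]) * d)
xT, data = collect_irl_window(step, [1.0, 0.0], lambda x: -x[1], lambda x: 0.0, t0=0.0, T=0.1)
```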
Based on (34), one obtains
Algorithm 2 presents the GCC approach via the MPCM and IRL algorithm, where the target control policies can be approximated through
where the activation functions are denoted as and , the expected weight vectors are denoted as and , and approximation errors are denoted as and . The subscripts and ð represent the actor NN and disturbance NN, respectively.
The control pairs in actual situations are
Theorem 2.
The auxiliary system of controlled system (1) is given as (4). The value function and control pairs are approximated via (23), (38) and (39), respectively. Assume Algorithm 2 converges. At each iteration, the MPCM is used to sample from the uncertain parameters; , and given by (33) will ultimately converge to the optimal values , , and .
Proof.
In [46] (Th. 3), the solution obtained using the IRL-based GCC algorithm has been proven to be the same as that obtained using the IRL-based GCC algorithm with asymmetric constrained control. The prior conclusion derived via the MPCM for each sample point remains valid as well. Therefore, the two approaches achieve identical optimal value functions. □
5. Event-Triggered Construction of Optimal GCC
Time-triggered control strategies involve substantial data transmission, which causes computational overload. In contrast to time-triggered methods, this section presents a GCC approach in the DETC mode for the stochastic system (1) based on Theorems 1 and 2 and Lemma 1, aiming to decrease the unnecessary consumption of communication resources.
5.1. Event-Triggered GCC Design
The control pairs will update when the triggering condition is violated under the DETC strategy. At each triggering instant, , where stands for the jth sampling moment. The event-triggered error is
For , holds when . Based on (40), at triggering instants, the ETC policies become
By applying (41) and (42), and considering system (4) for all , the Hamiltonian function in (8) becomes
where and .
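A minimal sketch of the zero-order-hold behavior implied by the event-triggered policies: between triggering instants, the control pair computed at the last sampled state is held, and the gap (event-triggered error) between the held state and the current state is what the triggering rule monitors. The class name and interface below are illustrative.

```python
import numpy as np

class HeldController:
    """Hold the control pair computed at the latest triggering instant (ZOH)."""

    def __init__(self, u_fn, d_fn, x0):
        self.u_fn, self.d_fn = u_fn, d_fn
        self.resample(np.asarray(x0, dtype=float))

    def resample(self, x):
        # A triggering event: recompute the policies at the sampled state.
        self.x_sampled = x.copy()
        self.u_held = self.u_fn(x)
        self.d_held = self.d_fn(x)

    def gap(self, x):
        # Event-triggered error between the last sampled state and the current state.
        return np.linalg.norm(self.x_sampled - x)

    def act(self, x, trigger_violated):
        if trigger_violated:          # condition checked by the DET rule below
            self.resample(x)
        return self.u_held, self.d_held
```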
Substituting (44) and (45) into (43), the result is
Assumption 4.
The control pairs are Lipschitz-continuous, satisfying
where and are positive constants, and .
Lemma 2.
Given that Assumption 4 is valid, the following inequality holds:
where is given in (52).
Proof.
Subtracting (14) from (43), we have
Based on (47), we can get
where with . Using Young’s inequality, we get .
Based on (50), we have
where . □
Theorem 3.
Assuming that is a continuous function satisfying the improved HJI Equation (14) and the OC pairs are given as (44) and (45), then system (1) is mean asymptotically stable if the following condition holds:
where α is the designed parameter, , , and is the trigger threshold. is given in (52). The internal dynamic signal is
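For intuition, here is a rough sketch of a dynamic event-trigger check of the type used in Theorem 3: a static threshold on the event error is relaxed by an internal dynamic variable that decays over time, so events fire less often. The specific threshold expression, the decay law, and the parameter names below are stand-ins for the quantities in (53) and (54).

```python
import numpy as np

class DynamicEventTrigger:
    """Trigger when  gap^2 > threshold(x) + eta / alpha  (illustrative DET rule)."""

    def __init__(self, alpha=2.0, beta=0.5, eta0=1.0, decay=1.0):
        self.alpha = alpha          # weighting of the internal dynamic signal
        self.beta = beta            # scale of the state-dependent static threshold
        self.eta = eta0             # internal dynamic variable, kept nonnegative
        self.decay = decay

    def static_threshold(self, x):
        # Placeholder for the state-dependent term in the triggering condition.
        return self.beta * float(np.dot(x, x))

    def check(self, x, gap, dt):
        thr = self.static_threshold(x)
        violated = gap ** 2 > thr + self.eta / self.alpha
        # Internal signal dynamics: decays, and is replenished by the unused margin.
        self.eta += dt * (-self.decay * self.eta + thr - gap ** 2)
        self.eta = max(self.eta, 0.0)
        return violated
```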
Proof.
The Lyapunov function is , where and .
In light of (43), the following holds:
By substituting from (44) and from (45) into (1), and relying on (55) and Lemma 2, one obtains
where denotes the smallest eigenvalue of Q.
According to (54), we obtain
Taking (56) and (57) into account together, if the condition (53) holds, we derive , for any . According to [36] (Lemma 1) and Assumption 3, the system (1) achieves mean asymptotic stability. □
Theorem 4.
Consider system (2); when adopting the optimal ETC policy from (44) and employing the OC signal , the following result is obtained:
Proof.
With respect to any admissible control function , by referring to (4), (6), and (7) and calculating the system dynamics in (1), one obtains
On the basis of (8), one obtains
Based on (60), we have
According to (61), Assumption 2, the Cauchy–Schwarz inequality, and , we obtain
Applying to (62), one obtains the result (58). □
5.2. NN-Based Control Design
Based on Theorem 4, we have , and the GCC algorithm using IRL is used to achieve an approximation of . In the proposed method, is approximated by (22). Based on the work in [47], the optimal GCC pairs are given as
where and denote the event-sampled approximation errors.
Based on (14), (63) and (64), the HJI equation takes the form
where . , and .
Assumption 5.
The actor, critic, and disturbance NNs, the NN weight vectors, activation function, reconstruction errors, and gradient of relevant parameters are bounded, which satisfy ; , , , , , , and , respectively, where , , , , , , , , , , , and are positive constants.
The estimates for , and ð are expressed as
where , and stand for the estimates of , and , respectively.
Using (23) and (31)–(33), the residual error can be given as
where and . Design a new augmented weight vector as and the target weight vector . Denote
Then, we have
with .
Let , denote an adaptive gain parameter; the weight adjustment rule is developed as
Assumption 6.
is continuously excited within the interval of , and one obtains
where , , , and and are constants.
Considering and , we define the augmented system state as , we have
Furthermore, one obtains
Here, , where .
Remark 5.
The residual error , which requires minimization in (67), is derived from the expectation of the function . Detailed procedures for calculating this expectation are provided in [40].
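As a loose illustration of the kind of normalized gradient-descent tuning law typically used to minimize such a residual, the sketch below updates the critic weights in the direction that decreases the squared Bellman residual, with a normalization term to keep the adaptation bounded. The actual weight-update law of this paper is the one stated in (70); the names, normalization, and gain below are assumptions.

```python
import numpy as np

def critic_weight_update(W_hat, phi, residual, gain=1.0, dt=0.01):
    """One step of a normalized gradient-descent critic update.

    W_hat    : current critic weight estimate
    phi      : regressor (e.g., difference/integral of activation-function values
               over the IRL window) so that residual = W_hat @ phi + measured terms
    residual : current Bellman residual to be driven toward zero
    """
    norm = (1.0 + phi @ phi) ** 2            # normalization against large regressors
    W_dot = -gain * phi * residual / norm    # steepest descent on 0.5 * residual**2
    return W_hat + dt * W_dot

# Toy usage: shrink the residual for a fixed regressor direction
W = np.zeros(3)
phi = np.array([0.4, -0.2, 0.1])
target = 1.5                                  # stands in for the measured cost terms
for _ in range(2000):
    r = W @ phi - target
    W = critic_weight_update(W, phi, r, gain=50.0, dt=0.01)
print(W @ phi)   # approaches the target, i.e., the residual approaches zero
```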
5.3. Stability Analysis
Theorem 5.
The ETC policies are given as (66b) and (66c) for the dynamical system (4), and the weight adjustment law is designed as (70). The system states and NN weight estimation errors are uniformly bounded (UB) in the mean. The triggering condition is designed as
where and are defined in (81), . The internal dynamic signal is given by
Proof.
A Lyapunov function is chosen as
with , , and .
Case 1: When , based on (4), (66b), and (66c), one obtains
Based on (22), we can obtain
According to and Assumption 5, we get
Substituting (77) into (10), one obtains
where and with , . , is selected between and and is an approximate value of .
According to , (79a) and (79b), we get
As proved in [48], , and one has
where and .
According to (12), we get
By calculating Formula (69), the residual error is re-expressed as follows:
Based on (65), one obtains
where
where , with being a positive number.
Denote , and , where T is a small positive number. Then, we have , and . Based on Assumption 5, we have with .
Then, we have
where .
Based on (70) and (83), we have
where , and .
Using Equations (81) and (91), becomes
If the triggering conditions (74) and (75) hold, one obtains
where .
We select the interval T and the parameter under the condition that . Let the parameter T be set such that ; then, given that
Case 2: For any , the derivation of is given by
From Case 1, decreases strictly monotonically over the interval , which implies for all . Taking limits on both sides of this inequality, we have . Given this result, it follows that
Furthermore, for , , according to Case 1, constitutes a continuous difference function, so it follows that . Based on (94) and (95), we derive .
Based on the above analysis, the proof is completed. □
6. Simulation
Consider the following stochastic system [36]:
with the system states and control policy denoted as and , respectively. d is the disturbance policy. The control policy satisfies . The uncertain term is with , and . , is a uniformly distributed random variable, and its pdf is the same as that of Example 1 in the simulation in [36].
Through calculation, one obtains . Set the following auxiliary system as
where denotes the auxiliary disturbance policy. The original state is and . The activation function is defined as follows: , ; the weight vectors are , , and . The sample size used by the MPCM is 16. Table 1 shows the simulation parameters.
Table 1.
Parameter settings.
The weight curves of the NNs are shown in Figure 1 and Figure 2. The weight updates of the critic, actor, and disturbance NNs gradually approach , , and , respectively. Based on the converged critic NN weights, the guaranteed cost of the original system is obtained as 17.86.
Figure 1.
Weight trajectories of the critic NN in our design.
Figure 2.
Weight curves of actor and disturbance NNs in our design.
Figure 3 shows the system state curves. Evidently, all state trajectories are capable of eventually converging to the equilibrium point.
Figure 3.
System state curves in our design.
Figure 4 gives the variation curves of the control pairs under the DETC mode. Compared with the baseline, the control policy obtained by the designed method stays within the range of −0.8 to 1, which demonstrates the effectiveness of the designed method under ASAs.
Figure 4.
The change curves of the event-triggered optimization control strategy and worst-case disturbance strategy in our design.
Figure 5 shows the trajectories of and . By using the DETC method, the number of controller updates can be greatly reduced. The sampling period of ETC is given in Figure 6. It displays the time interval between the previous and the current triggering moment. The triggering frequency is dynamically adjusted according to the triggering condition.
Figure 5.
The triggering condition of and in our design.
Figure 6.
Sampling period of the learning stage in our design.
Figure 7 illustrates the evolution of the internal dynamic signal within the DETC framework. It can be seen that the internal dynamic parameter remains positive and decreases monotonically.
Figure 7.
The change curve of the dynamic signal in the triggering condition in our design.
To demonstrate the control performance of the approach proposed in our design, a comparative simulation using the method proposed in [36] is given as follows.
Figure 8 and Figure 9 show the convergence of the NN weight vectors. The weight updates of the critic, actor and disturbance NNs gradually approach , and , respectively.
Figure 8.
Weight trajectories of the critic NN by using the method in [36].
Figure 9.
Weight curves of actor and disturbance NNs by using method in [36].
Figure 10 shows the system state curves obtained by using the method in [36].
Figure 10.
System state curves by using method in [36].
Figure 11 shows the variation curves of the control pairs under the ETC mode obtained by using the method in [36]. The blue line denotes the control input, which exceeds the range of −0.8 to 1, whereas the control input in Figure 4 remains within this range. Comparing Figure 4 and Figure 11 makes it clear that the control strategy in our design exhibits better performance.
Figure 11.
The change curves of the event-triggered optimization control signal and worst-case disturbance signal by using the method in [36].
Figure 12 shows the triggering conditions of and under the event-triggering mode. The ETC scheme requires 2574 state samples, while the DETC strategy proposed in this paper uses only 2028 state samples. Hence, the DETC proposed in this article reduces the number of controller updates by approximately 21.2%, i.e., (2574 − 2028)/2574 ≈ 21.2%.
Figure 12.
The triggering condition of and by using the method in [36].
7. Conclusions
In our design, we have proposed a dynamic event-triggered, IRL-based GCC approach for stochastic systems with random parameters and ASAs. Firstly, a modified HJI equation has been formulated; applying the actor–critic–disturbance NN architecture, both the OC signal under ASAs and the worst-case disturbance strategy have been derived by solving this improved HJI equation. Secondly, the MPCM has been employed to estimate the value function at specific sampling points, which reduces the computational complexity of sampling data in simulation experiments. Thirdly, through the introduction of dynamic parameters into the triggering conditions, the control input has been adjusted dynamically; moreover, when the triggering condition is met, the weight values of the NNs are tuned synchronously, which significantly enhances the efficiency of communication resource usage. Furthermore, the Lyapunov stability theorem has been employed to analyze and verify the stability of the time-varying system. Finally, simulation results have confirmed the efficacy of the developed approach.
Author Contributions
Conceptualization, Y.L.; software, Y.L.; methodology, Y.L.; validation, M.X.; writing—original draft preparation, M.X. and Y.L.; rigorous analysis, J.Z.; supervision, Z.G.; data curation, J.Z. and Z.M.; writing—review and editing, M.X.; funding acquisition, Y.L.; visualization, J.Z. All authors have read and agreed to the published version of the manuscript.
Funding
This work was supported by the National Natural Science Foundation of China (62403329).
Data Availability Statement
The simulation data supporting the findings of this article are available from the corresponding author upon reasonable request.
Conflicts of Interest
The authors declare no conflicts of interest.
References
- Xie, M.; Shakoor, A.; Wu, Z.; Jiang, B. Optical manipulation of biological cells with a robot-tweezers system: A stochastic control approach. IEEE Trans. Circuits Syst. II Express Briefs 2020, 67, 3232–3236. [Google Scholar] [CrossRef]
- Bazmohammadi, N.; Tahsiri, A.; Anvari-Moghaddam, A.; Guerrero, J.M. Stochastic predictive control of multi-microgrid systems. IEEE Trans. Ind. Appl. 2019, 55, 5311–5319. [Google Scholar] [CrossRef]
- Dai, M.; Wu, C.; Wen, J. Vehicle longitudinal stochastic control for connected and automated vehicle platooning in highway systems. IEEE Trans. Intell. Transp. Syst. 2025, 26, 9563–9578. [Google Scholar] [CrossRef]
- Liu, J.; Xu, J.; Zhang, H.; Fu, M. Stochastic LQ optimal control with initial and terminal constraints. IEEE Trans. Autom. Control 2024, 69, 6261–6268. [Google Scholar]
- Sun, H.Y.; Mu, H.R.; Fu, S.J.; Han, H.G. Data-driven model predictive control for unknown nonlinear NCSs with stochastic sampling intervals and successive packet dropouts. IEEE Trans. Cybern. 2025, 55, 2899–2909. [Google Scholar] [CrossRef]
- Xu, J.; Xie, L.; Zhang, H. Solution to discrete-time linear FBSDEs with application to stochastic control problem. IEEE Trans. Autom. Control 2017, 62, 6602–6607. [Google Scholar]
- Li, Y.; Voos, H.; Darouach, M.; Hua, C. An application of linear algebra theory in networked control systems: Stochastic cyber-attacks detection approach. IMA J. Math. Control Inf. 2016, 33, 1081–1102. [Google Scholar] [CrossRef]
- Cetinkaya, A.; Kishida, M. Instabilizability conditions for continuous-time stochastic systems under control input constraints. IEEE Control Syst. Lett. 2021, 6, 1430–1435. [Google Scholar] [CrossRef]
- Chatterjee, D.; Hokayem, P.; Lygeros, J. Stochastic receding horizon control with bounded control inputs: A vector space approach. IEEE Trans. Autom. Control 2011, 56, 2704–2710. [Google Scholar]
- Nguyen, X.P.; Dang, X.K.; Do, V.D.; Corchado, J.M.; Truong, H.N. Robust adaptive fuzzy-free fault-tolerant path planning control for a semi-submersible platform dynamic positioning system with actuator constraints. IEEE Trans. Intell. Transp. Syst. 2023, 24, 12701–12715. [Google Scholar]
- Wang, F.; Xie, X.; Zhou, C. Locally expanded constraint-boundary-based adaptive composite control of a constrained nonlinear system with time-varying actuator fault. IEEE Trans. Fuzzy Syst. 2023, 31, 4121–4136. [Google Scholar] [CrossRef]
- Sun, W.; Diao, S.; Su, S.F.; Sun, Z.Y. Fixed-time adaptive neural network control for nonlinear systems with input saturation. IEEE Trans. Neural Netw. Learn. Syst. 2021, 34, 1911–1920. [Google Scholar] [CrossRef]
- Zhang, F.; Song, M.; Huang, B.; Huang, P. Adaptive tracking control for tethered aircraft systems with actuator nonlinearities and output constraints. IEEE Trans. Aerosp. Electron. Syst. 2024, 60, 3582–3597. [Google Scholar] [CrossRef]
- Gao, Z.; Zhang, Y.; Guo, G. Adaptive fixed-time sliding mode control of vehicular platoons with asymmetric actuator saturation. IEEE Trans. Veh. Technol. 2023, 72, 8409–8423. [Google Scholar] [CrossRef]
- Guo, G.; Zhang, P. Asymptotic stabilization of USVs with actuator dead-zones and yaw constraints based on fixed-time disturbance observer. IEEE Trans. Veh. Technol. 2019, 69, 302–316. [Google Scholar] [CrossRef]
- Wang, D.; Gao, N.; Liu, D.; Li, J.; Lewis, F.L. Recent progress in reinforcement learning and adaptive dynamic programming for advanced control applications. IEEE/CAA J. Autom. Sin. 2023, 11, 18–36. [Google Scholar] [CrossRef]
- Zhang, Y.; Wang, Y.; Cai, Y. Value iteration-based distributed adaptive dynamic programming for multi-player differential game with incomplete information. IEEE/CAA J. Autom. Sin. 2025, 12, 436–447. [Google Scholar] [CrossRef]
- Wei, Q.; Yang, Z.; Su, H.; Wang, L. Online adaptive dynamic programming for optimal self-learning control of VTOL aircraft systems with disturbances. IEEE Trans. Autom. Sci. Eng. 2022, 21, 343–352. [Google Scholar] [CrossRef]
- Wei, Q.; Chen, W.; Tan, X.; Xiao, J.; Dong, Q. Observer-based optimal Backstepping security control for nonlinear systems using reinforcement learning strategy. IEEE Trans. Cybern. 2024, 54, 7011–7023. [Google Scholar] [CrossRef]
- Ming, Z.; Zhang, H.; Li, W.; Luo, Y. Neurodynamic programming and tracking control for nonlinear stochastic systems by PI algorithm. IEEE Trans. Circuits Syst. II Express Briefs 2022, 69, 2892–2896. [Google Scholar] [CrossRef]
- Li, J.; Yang, M.; Lewis, F.L.; Zheng, M. Compensator-based self-learning: Optimal operational control for two-time-scale systems with input constraints. IEEE Trans. Ind. Inform. 2024, 20, 9465–9475. [Google Scholar] [CrossRef]
- Shi, H.; Gao, W.; Jiang, X.; Su, C.; Li, P. Two-dimensional model-free Q-learning-based output feedback fault-tolerant control for batch processes. Comput. Chem. Eng. 2024, 182, 108583. [Google Scholar] [CrossRef]
- Pang, B.; Jiang, Z.P. Reinforcement learning for adaptive optimal stationary control of linear stochastic systems. IEEE Trans. Autom. Control 2022, 68, 2383–2390. [Google Scholar] [CrossRef]
- Zhang, K.; Peng, Y. Model-free tracking control for linear stochastic systems via integral reinforcement learning. IEEE Trans. Autom. Sci. Eng. 2025, 22, 10835–10844. [Google Scholar] [CrossRef]
- Zhang, H.; Qu, Q.; Xiao, G.; Cui, Y. Optimal guaranteed cost sliding mode control for constrained-input nonlinear systems with matched and unmatched disturbances. IEEE Trans. Neural Netw. Learn. Syst. 2018, 29, 2112–2126. [Google Scholar] [CrossRef]
- Liu, T.; Jiang, Z.P. Event-based control of nonlinear systems with partial state and output feedback. Automatica 2015, 53, 10–22. [Google Scholar] [CrossRef]
- Lu, J.; Han, L.; Wei, Q.; Wang, X.; Dai, X.; Wang, F.Y. Event-triggered deep reinforcement learning using parallel control: A case study in autonomous driving. IEEE Trans. Intell. Veh. 2023, 8, 2821–2831. [Google Scholar] [CrossRef]
- Zhang, G.; Zhu, Q. Event-triggered optimized control for nonlinear delayed stochastic systems. IEEE Trans. Circuits Syst. I Regul. Pap. 2021, 68, 3808–3821. [Google Scholar] [CrossRef]
- Zhang, G.; Liang, C.; Zhu, Q. Adaptive fuzzy event-triggered optimized consensus control for delayed unknown stochastic nonlinear multi-agent systems using simplified ADP. IEEE Trans. Autom. Sci. Eng. 2025, 22, 11780–11793. [Google Scholar] [CrossRef]
- Xue, S.; Zhang, W.; Luo, B.; Liu, D. Integral reinforcement learning-based dynamic event-triggered nonzero-sum games of USVs. IEEE Trans. Cybern. 2025, 55, 1706–1716. [Google Scholar] [CrossRef]
- Ming, Z.g.; Zhang, H.; Tong, X.; Yan, Y. Mixed H2/H∞ control with dynamic event-triggered mechanism for partially unknown nonlinear stochastic systems. IEEE Trans. Autom. Sci. Eng. 2022, 20, 1934–1944. [Google Scholar]
- Tong, X.; Ma, D.; Wang, R.; Xie, X.; Zhang, H. Dynamic event-triggered-based integral reinforcement learning algorithm for frequency control of microgrid with stochastic uncertainty. IEEE Trans. Consum. Electron. 2023, 69, 321–330. [Google Scholar]
- Zhu, H.Y.; Li, Y.X.; Tong, S. Dynamic event-triggered reinforcement learning control of stochastic nonlinear systems. IEEE Trans. Fuzzy Syst. 2023, 31, 2917–2928. [Google Scholar] [CrossRef]
- Liu, M.; Wan, Y.; Lewis, F.L. Adaptive optimal decision in multi-agent random switching systems. IEEE Control Syst. Lett. 2019, 4, 265–270. [Google Scholar]
- Liu, T.; Qin, Z.; Hong, Y.; Jiang, Z.P. Distributed optimization of nonlinear multiagent systems: A small-gain approach. IEEE Trans. Autom. Control 2021, 67, 676–691. [Google Scholar]
- Liang, Y.; Zhang, H.; Zhang, J.; Ming, Z. Event-triggered guarantee cost control for partially unknown stochastic systems via explorized integral reinforcement learning strategy. IEEE Trans. Neural Netw. Learn. Syst. 2022, 35, 7830–7844. [Google Scholar]
- Yuan, R.; Ma, J.; Su, P.; Dong, Y.; Cheng, J. Monte-Carlo integration models for multiple scattering based optical wireless communication. IEEE Trans. Commun. 2019, 68, 334–348. [Google Scholar] [CrossRef]
- Wang, J.; Gao, X.; Cao, R.; Sun, Z. A multilevel Monte Carlo method for performing time-variant reliability analysis. IEEE Access 2021, 9, 31773–31781. [Google Scholar] [CrossRef]
- Xie, J.; Wan, Y.; Mills, K.; Filliben, J.J.; Lewis, F.L. A scalable sampling method to high-dimensional uncertainties for optimal and reinforcement learning-based controls. IEEE Control Syst. Lett. 2017, 1, 98–103. [Google Scholar]
- Zhou, Y.; Wan, Y.; Roy, S.; Taylor, C.; Wanke, C.; Ramamurthy, D.; Xie, J. Multivariate probabilistic collocation method for effective uncertainty evaluation with application to air traffic flow management. IEEE Trans. Syst. Man Cybern. Syst. 2014, 44, 1347–1363. [Google Scholar]
- Liu, M.; Wan, Y.; Lewis, F.L.; Lopez, V.G. Adaptive optimal control for stochastic multiplayer differential games using on-policy and off-policy reinforcement learning. IEEE Trans. Neural Netw. Learn. Syst. 2020, 31, 5522–5533. [Google Scholar] [CrossRef]
- Jin, Z. Global asymptotic stability analysis for autonomous optimization. IEEE Trans. Autom. Control 2025, 70, 6953–6960. [Google Scholar] [CrossRef]
- Jin, Z.; Li, H.; Qin, Z.; Wang, Z. Gradient-free cooperative source-seeking of quadrotor under disturbances and communication constraints. IEEE Trans. Ind. Electron. 2024, 72, 1969–1979. [Google Scholar] [CrossRef]
- Shi, K.; Tang, Y.; Zhong, S.; Yin, C.; Huang, X.; Wang, W. Nonfragile asynchronous control for uncertain chaotic lurie network systems with bernoulli stochastic process. Int. J. Robust Nonlinear Control 2018, 28, 1693–1714. [Google Scholar] [CrossRef]
- Cui, X.; Zhang, H.; Luo, Y.; Jiang, H. Adaptive dynamic programming for H∞ tracking design of uncertain nonlinear systems with disturbances and input constraints. Int. J. Adapt. Control Signal Process. 2017, 31, 1567–1583. [Google Scholar] [CrossRef]
- Zhang, H.; Cui, X.; Luo, Y.; Jiang, H. Finite-horizon H∞ tracking control for unknown nonlinear systems with saturating actuators. IEEE Trans. Neural Netw. Learn. Syst. 2017, 29, 1200–1212. [Google Scholar] [PubMed]
- Sahoo, A.; Jagannathan, S. Stochastic optimal regulation of nonlinear networked control systems by using event-driven adaptive dynamic programming. IEEE Trans. Cybern. 2016, 47, 425–438. [Google Scholar] [CrossRef]
- Yasini, S.; Naghibi Sitani, M.B.; Kirampor, A. Reinforcement learning and neural networks for multi-agent nonzero-sum games of nonlinear constrained-input systems. Int. J. Mach. Learn. Cybern. 2016, 7, 967–980. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).