Next Article in Journal
A Break-Regime Score-Driven Model for Tail-Risk Forecasting in China’s Carbon Market Under Policy Shifts
Previous Article in Journal
SGMT with S-PACE: A Framework for Temporal Alignment and Quality-Aware Multimodal Fusion in Emotion Recognition
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

TD3-Based Reinforcement Learning for Adaptive PID-like Control of Uncertain Dynamical Systems

1
Faculty of Engineering and Natural Sciences, Sivas University of Science and Technology, Sivas 58000, Turkey
2
Department of Computer Science, College of Computer Sciences and Information Technology, King Faisal University, Al-Ahsa 36362, Saudi Arabia
*
Author to whom correspondence should be addressed.
Mathematics 2026, 14(10), 1744; https://doi.org/10.3390/math14101744
Submission received: 10 April 2026 / Revised: 14 May 2026 / Accepted: 15 May 2026 / Published: 19 May 2026

Abstract

This paper presents a TD3-based reinforcement learning framework for adaptive PID-like control of uncertain dynamical systems. Although proportional–integral–derivative (PID) control remains widely used because of its simplicity, interpretability, and practical effectiveness, fixed-gain PID controllers often experience performance degradation in the presence of external disturbances, parameter variations, and changing operating conditions. To address this limitation, the control task is formulated as a continuous-action reinforcement learning problem in which the observation vector is constructed from PID-related error components, namely the tracking error, its integral, and its derivative. Based on these error-derived observations, a Twin Delayed Deep Deterministic Policy Gradient (TD3) agent learns a bounded continuous control policy through interaction with the environment while preserving a PID-like structural interpretation. The proposed framework is evaluated on a representative mass–spring–damper system under three challenging scenarios: external disturbance, parametric uncertainty, and their simultaneous presence. Its performance is further examined for both constant-reference regulation and sinusoidal reference tracking. The simulation results show that the learned controller achieves stable and accurate tracking, fast transient response, and robust behavior across varying operating conditions. These findings demonstrate the potential of TD3-based reinforcement learning as an effective adaptive PID-like control strategy for uncertain dynamical systems.
Keywords: adaptive PID control; reinforcement learning; policy learning; TD3 algorithm; robust tracking; parametric uncertainty; continuous control adaptive PID control; reinforcement learning; policy learning; TD3 algorithm; robust tracking; parametric uncertainty; continuous control

Share and Cite

MDPI and ACS Style

Demircioğlu, U.; Bakır, H.; Almarri, B.; Abdul Hafez, A.H. TD3-Based Reinforcement Learning for Adaptive PID-like Control of Uncertain Dynamical Systems. Mathematics 2026, 14, 1744. https://doi.org/10.3390/math14101744

AMA Style

Demircioğlu U, Bakır H, Almarri B, Abdul Hafez AH. TD3-Based Reinforcement Learning for Adaptive PID-like Control of Uncertain Dynamical Systems. Mathematics. 2026; 14(10):1744. https://doi.org/10.3390/math14101744

Chicago/Turabian Style

Demircioğlu, Ufuk, Halit Bakır, Badar Almarri, and A. H. Abdul Hafez. 2026. "TD3-Based Reinforcement Learning for Adaptive PID-like Control of Uncertain Dynamical Systems" Mathematics 14, no. 10: 1744. https://doi.org/10.3390/math14101744

APA Style

Demircioğlu, U., Bakır, H., Almarri, B., & Abdul Hafez, A. H. (2026). TD3-Based Reinforcement Learning for Adaptive PID-like Control of Uncertain Dynamical Systems. Mathematics, 14(10), 1744. https://doi.org/10.3390/math14101744

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop