Triple Phase Shift Modulation for Active Bridge Converter: Deep Reinforcement Learning-Based Efficiency Optimization
Abstract
1. Introduction
| Algorithms | Action Space | Continuous Variable | Nonlinear Handling | Model Dependency | ZVS Constraints |
|---|---|---|---|---|---|
| GA [21] | Continuous | Good | Moderate | High | difficult |
| PSO [24] | Continuous | Good | Moderate | High | difficult |
| NR [22] | Continuous | Good | Weak | High | difficult |
| DQN [25,26] | Discrete | Poor | Moderate | Low | Good |
| RL-ANN [27] | Continuous | Moderate | Strong | Low | Good |
| DDPG [32,33,34] | Continuous | Good | Strong | Low | Good |
2. TAB Converter Circuit Topology and Modulation Strategy
2.1. TAB Converter Circuit Topology
2.2. TPS Modulation Strategy
2.3. ZVS Constraints
2.4. Stability Modeling and Analysis
3. Losses Distribution
3.1. Losses Model of Power Switches
3.2. Power Losses Model in Magnetic Components
4. Algorithm Optimization of TAB Converter Under TPS Modulation
4.1. DDPG Algorithm
4.2. Reward Function for Minimizing the Power Losses
4.3. Training of the DDPG Algorithm
- Step 1: Randomly initialize the actor network θμ, critic network θQ and softly initialize their target networks.
- Step 2: For each training episode, observe the initial system state st.
- Step 3: In each training step, select action by adding exploration noise nt to the actor network output.
- Step 4: Execute αt, obtain the reward rt and next state st+1, then store the experience tuple (st, αt, rt, st+1) into the replay buffer.
- Step 5: Sample a mini-batch of experience data and compute the target value.
- Step 6: Update the critic network by minimizing the loss function shown in Equation (39).
- Step 7: Update the actor network using the deterministic policy gradient in Equation (41).
- Step 8: Soft update the target networks as shown in Equation (40) and repeat until convergence.
4.4. DDPG Algorithm Optimization Results
5. Experimental Verifications
5.1. Normal Mode
5.2. Switch from Normal Mode to Fault-Tolerant Mode
5.3. Fault-Tolerant Mode
5.4. ZVS and Converter Efficiency
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Aldarsi, E.; Singh, R.; Zhang, J. A Case Study of a Stand-Alone AC and DC Power Network in the Red Sea New City, Kingdom of Saudi Arabia. Electronics 2026, 15, 1077. [Google Scholar] [CrossRef]
- Liu, J.; Ming, Z.; Shi, H.; Zhang, H.; Liu, Z. A Hierarchical Distributed Method for Source-Grid-Load-Storage Coordinated Power and Energy Balance in Distribution Networks. Electronics 2026, 15, 1054. [Google Scholar] [CrossRef]
- Liu, Y.; Tang, F.; Liu, Z.; Zuo, L. Research on the Optimal Transient Power Angle Control Strategy for New Energy Transmission Systems in Energy Storage Enhancement Areas. Sustainability 2026, 18, 1636. [Google Scholar] [CrossRef]
- IEEE Std 1709-2018 (Revision of IEEE Std 1709-2010); IEEE Recommended Practice for 1 kV to 35 kV Medium-Voltage DC Power Systems on Ships. IEEE: Piscataway, NJ, USA, 2018; pp. 1–54.
- Baltazar, P.; Barros, J.D.; Gomes, L. A Distributed Electric Vehicles Charging System Powered by Photovoltaic Solar Energy with Enhanced Voltage and Frequency Control in Isolated Microgrids. Electronics 2026, 15, 418. [Google Scholar] [CrossRef]
- Chen, Q.; Su, Y.; Hu, B.; Shao, C.; Xu, L.; Huang, C. Dynamic State Estimation for Sustainable Distribution Systems Considering Data Correlation and Noise Adaptiveness. Sustainability 2026, 18, 1693. [Google Scholar] [CrossRef]
- Maevsky, D.; Kharchenko, V.; Bardis, N.; Stetsiuk, D.; Maevskaya, E.; Kryvda, V. Dependability Model of Electric Power Systems for Assessing Smart City Energy Sustainability. Sustainability 2026, 18, 1512. [Google Scholar] [CrossRef]
- Chen, Y.; Ma, J.; Zhu, M.; Liu, J. Dual-Mode Wide-Voltage-Range Operation of Hybrid Triple Active Bridge Converter for Bipolar DC Distribution Systems. IEEE Trans. Ind. Appl. 2024, 60, 8998–9014. [Google Scholar] [CrossRef]
- Cao, T.; Zhu, J.; Guo, Y.; Han, Y.; Wu, B.; Li, D. Utilizing the Intrinsic CC/CV Characteristics of a CLLC Converter for Battery Charging with ZVS Operation. Electronics 2026, 15, 1128. [Google Scholar] [CrossRef]
- Kang, X.; Li, S.; Smedley, K.M. Decoupled PWM Plus Phase-Shift Control for a Dual-Half-Bridge Bidirectional DC–DC Converter. IEEE Trans. Power Electron. 2018, 33, 7203–7213. [Google Scholar] [CrossRef]
- Zhang, H.; Dong, D.; Liu, W.; Ren, H.; Zheng, F. Systematic Synthesis of Multiple-Input and Multiple-Output DC–DC Converters for Nonisolated Applications. IEEE J. Emerg. Sel. Top. Power Electron. 2022, 10, 6470–6481. [Google Scholar] [CrossRef]
- Ma, J.; Chen, Y.; Shen, X.; Qiu, Y. Fault-Tolerant Multiport Active Bridge Converter for Resilient Energy Storage Integration in Zonal Shipboard DC System. J. Mar. Sci. Eng. 2025, 13, 654. [Google Scholar] [CrossRef]
- Yang, W.; Ma, J.; Zhu, M.; Hu, C. Open-Circuit Fault Diagnosis and Tolerant Method of Multiport Triple Active-Bridge DC-DC Converter. IEEE Trans. Ind. Appl. 2023, 59, 5473–5487. [Google Scholar] [CrossRef]
- Sato, Y.; Uno, M.; Nagata, H. Nonisolated Multiport Converters Based on Integration of PWM Converter and Phase-Shift-Switched Capacitor Converter. IEEE Trans. Power Electron. 2020, 35, 455–470. [Google Scholar] [CrossRef]
- Ma, J.; Zhu, M.; Li, Y.; Cai, X. Monopolar Fault Reconfiguration of Bipolar Half Bridge Converter for Reliable Load Supply in DC Distribution System. IEEE Trans. Power Electron. 2022, 37, 11305–11318. [Google Scholar] [CrossRef]
- Shih, L.; Liu, Y.; Chiu, H. A novel hybrid mode control for a phase-shift full-bridge converter featuring high efficiency over a full-load range. IEEE Trans. Power Electron. 2019, 34, 2794–2804. [Google Scholar] [CrossRef]
- Guo, Z. Modulation scheme of dual active bridge converter for seamless transitions in multiworking modes compromising ZVS and conduction loss. IEEE Trans. Ind. Electron. 2020, 67, 7399–7409. [Google Scholar] [CrossRef]
- Tang, Y.; Shen, S.; Mi, C.; Wang, Y.; Chen, S. Reinforcement learning based efficiency optimization scheme for the DAB DC–DC converter with triple-phase-shift modulation. IEEE Trans. Ind. Electron. 2021, 68, 7350–7361. [Google Scholar] [CrossRef]
- Bhattacharjee, A.K.; Batarseh, I. Optimum hybrid modulation for improvement of efficiency over wide operating range for triple-phase-shift dual-active-bridge converter. IEEE Trans. Power Electron. 2020, 35, 4804–4818. [Google Scholar] [CrossRef]
- Sha, D.; Sun, T.; Zhang, J. Varying switching frequency control for current-fed dual-active bridge DC–DC converter with constant flux density change for transformers. IEEE Trans. Power Electron. 2020, 35, 3766–3777. [Google Scholar] [CrossRef]
- Hou, N.; Li, Y. Overview and comparison of modulation and control strategies for non-resonant single-phase dual-active-bridge DC-DC converter. IEEE Trans. Power Electron. 2020, 35, 3148–3172. [Google Scholar] [CrossRef]
- Meng, L.; Dragicevic, T.; Vasquez, J.C.; Guerrero, J.M. Tertiary and secondary control levels for efficiency optimization and system damping in droop-controlled DC–DC converters. IEEE Trans. Smart Grid. 2015, 6, 2615–2626. [Google Scholar] [CrossRef]
- Du, Z.; Tolbert, L.M.; Chiasson, J.N.; Ozpineci, B. Reduced switchingfrequency active harmonic elimination for multilevel converters. IEEE Trans. Ind. Electron. 2008, 55, 1761–1770. [Google Scholar] [CrossRef]
- Cao, D.; Hu, W.; Zhao, J.; Zhang, G.; Zhang, B.; Liu, Z.; Chen, Z.; Blaabjerg, F. Reinforcement learning and its applications in modern power and energy systems: A review. J. Modern Power Syst. Clean Energy. 2020, 8, 1029–1042. [Google Scholar] [CrossRef]
- Dilokthanakul, N.; Kaplanis, C.; Pawlowski, N.; Shanahan, M. Feature control as intrinsic motivation for hierarchical reinforcement learning. IEEE Trans. Neural Netw. Learn. Syst. 2019, 30, 3409–3418. [Google Scholar] [CrossRef] [PubMed]
- Nwachukwu, S.E.; Folly, K.A.; Awodele, K.O. Soft Actor-Critic-Based MPPT Control of Solar PV Systems Under Partial Shading Conditions. IEEE Open Access J. Power Energy 2025, 12, 194–208. [Google Scholar] [CrossRef]
- Tang, Y.; Hu, W.; Xiao, J. A Deep Q-Network based optimized modulation scheme for Dual-Active-Bridge converter to reduce the RMS current. Energy Rep. 2020, 6, 1192–1198. [Google Scholar] [CrossRef]
- Tang, Y.; Shen, S.; Mi, C.; Wang, Y.; Chen, S. RL-ANN-Based Minimum-Current-Stress Scheme for the Dual-Active-Bridge Converter with Triple-Phase-Shift Control. IEEE J. Emerg. Sel. Top. Power Electron. 2022, 10, 673–689. [Google Scholar] [CrossRef]
- Arulkumaran, K.; Deisenroth, M.P.; Brundage, M.; Bharath, A.A. Deep reinforcement learning: A brief survey. IEEE Signal Process. Mag. 2017, 34, 26–38. [Google Scholar] [CrossRef]
- Cao, D.; Hu, W.; Zhao, J.; Huang, Q.; Chen, Z.; Blaabjerg, F. A multi-agent deep reinforcement learning based voltage regulation using coordinated PV inverters. IEEE Trans. Power Syst. 2020, 35, 4120–4123. [Google Scholar] [CrossRef]
- Qiu, C.; Hu, Y.; Chen, Y.; Zeng, B. Deep deterministic policy gradient (DDPG)-Based energy harvesting wireless communications. IEEE Internet Things J. 2019, 6, 8577–8588. [Google Scholar] [CrossRef]
- Xu, H.; Sun, H.; Nikovski, D.; Kitamura, S.; Mori, K.; Hashimoto, H. Deep reinforcement learning for joint bidding and pricing of load serving entity. IEEE Trans. Smart Grid. 2019, 10, 6366–6375. [Google Scholar] [CrossRef]
- Tang, Y.; Shen, S.; Mi, C.; Wang, Y.; Chen, S. Artificial intelligence-aided minimum reactive power control for the DAB converter based on harmonic analysis method. IEEE Trans. Power Electron. 2021, 36, 9704–9710. [Google Scholar] [CrossRef]
- Sun, L.; Pan, Y.; Wang, H. Assessing the inertial response time of grid-forming converters: Estimation and optimization. Electr. Power Syst. Res. 2026, 253, 112470. [Google Scholar] [CrossRef]
- Tang, Y.; Cao, D.; Xiao, J. AI-aided power electronic converters automatic online real-time efficiency optimization method. Fundam. Res. 2025, 5, 1111–1116. [Google Scholar] [CrossRef]
















| Parameters | Value |
|---|---|
| Input voltage in traditional parallel topology (Ui_parallel) | 48 V |
| Input voltage in traditional series topology (Ui_series) | 48 V |
| Input voltage of battery pack in TAB topology (Ui_TAB) | 48 V |
| Transformer turns ratio of parallel topology (n) | 1:2 |
| Transformer turns ratio of series topology (n) | 1:1 |
| Transformer turns of TAB topology (n) | 1:1 |
| Output voltage (Uo) | 100 V |
| Rated power (Po) | 216 W |
| Capacitor (Co) | 220 μF |
| Switching frequency (fs) | 20 kHz |
| Leakage inductance (Lr) | 45 μH |
| Parameters | Value |
|---|---|
| Actor network learning rate (θμ) | 0.0003 |
| Critic network learning rate (θQ) | 0.003 |
| Soft update rate (τ) | 0.005 |
| Penalty coefficient (α) | 150 |
| Penalty coefficient (ω) | ω = 1.5 (ZVS), ω = 15 (None-ZVS) |
| Discount factor (γd) | 0.98 |
| Noise parameters (σ) | 0.01 |
| Memory pool size | 50,000 |
| Number of episodes | 10,000 |
| Step size of each episode | 20 |
| Reward stability | Fluctuation < 1 % |
| Items | Figure 2a Architecture | Figure 2b Architecture | TPS Architecture |
|---|---|---|---|
| Input Switches | BSZ440N15NS3G | BSC070N10NS5 | BSC070N10NS5 |
| Output Switches | BSZ440N15NS3G | BSC320N20NS3G | BSC320N20NS3G |
| Input Capacitors | B41858C9227M000 | B41858C9227M000 | B41858C9227M000 |
| Output Capacitors | B41858C9227M000 | B43504A2477M000 | B43504A2477M000 |
| Transformer (core) | B66375G0000X187 | ||
| Inductor (core) | 74435581000 |
| Parameters | Series-EPS | Parallel-SPS | TAB-TPS |
|---|---|---|---|
| Fault-tolerant | × | × | √ |
| ZVS | Partial ZVS | Partial ZVS | Full ZVS |
| DOF | 4 | 2 | 3 |
| Capacitors | 4 | 4 | 3 |
| Switches | 16 | 16 | 12 |
| Transformers | 2 | 2 | 2 |
| Inductors | 2 | 2 | 1 |
| Parameters | Value |
|---|---|
| Input voltage of battery pack i = 1 (U1) | 48 V |
| Input voltage of battery pack i = 2 (U2) | 48 V |
| Input voltage of battery pack i = 3 (U3) | 48 V |
| Transformer turns ratio (n) | 1:1 |
| Output voltage (Uo) | 100 V |
| Rated power (Po) | 216 W |
| Capacitor (Co) | 110 μF |
| Switching frequency (fs) | 20 kHz |
| Leakage inductance (Lr) | 45 μH |
| Mode and Strategy | 40 (W) | 80 (W) | 120 (W) | 160 (W) | 200 (W) |
|---|---|---|---|---|---|
| TPS in normal mode | 93% | 92.9% | 92.35% | 91.56% | 91.31% |
| DDPG-TPS in normal mode | 96.9% | 95.8% | 95.2% | 94.44% | 94.28% |
| TPS in fault mode | 91.4% | 91.31% | 90.67% | 89.78% | 89.25% |
| DDPG-TPS in fault mode | 96.36% | 95.2% | 94.65% | 93.8% | 93.18% |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Huang, Y.; Zhao, Q.; Zhu, M.; Wen, S.; Zhang, B. Triple Phase Shift Modulation for Active Bridge Converter: Deep Reinforcement Learning-Based Efficiency Optimization. Electronics 2026, 15, 1563. https://doi.org/10.3390/electronics15081563
Huang Y, Zhao Q, Zhu M, Wen S, Zhang B. Triple Phase Shift Modulation for Active Bridge Converter: Deep Reinforcement Learning-Based Efficiency Optimization. Electronics. 2026; 15(8):1563. https://doi.org/10.3390/electronics15081563
Chicago/Turabian StyleHuang, Yiqi, Qiang Zhao, Miao Zhu, Shuli Wen, and Bing Zhang. 2026. "Triple Phase Shift Modulation for Active Bridge Converter: Deep Reinforcement Learning-Based Efficiency Optimization" Electronics 15, no. 8: 1563. https://doi.org/10.3390/electronics15081563
APA StyleHuang, Y., Zhao, Q., Zhu, M., Wen, S., & Zhang, B. (2026). Triple Phase Shift Modulation for Active Bridge Converter: Deep Reinforcement Learning-Based Efficiency Optimization. Electronics, 15(8), 1563. https://doi.org/10.3390/electronics15081563

