Optimal Weighting Factors Design for Model Predictive Current Controller for Enhanced Dynamic Performance of PMSM Employing Deep Reinforcement Learning
Abstract
:1. Introduction
2. Mathematical Modeling
2.1. Modeling of a Three-Phase Two-Level Voltage Source Inverter
2.2. Modeling of Surface Permanent Magnet Synchronous Motor
2.3. Decision Function
3. Proposed Intelligent Weighting Factors Design Approach
3.1. MOGA-Based NN Weighting Factor Tuning Approach
- The total number of transitions over the simulation, i.e., the sum of the ;
- The ITAE of the q-axis current where N is the total number of time steps in the simulation.
Parameter | Description | Value |
---|---|---|
Sampling period | ||
Switching frequency reference | ||
Stator resistance | ||
Stator inductance | ||
Flux linkage | ||
Torque constant | ||
p | Pole pairs | 4 |
J | Rotor inertia | |
Speed proportional gain | ||
Speed integral gain |
3.2. Proposed Weighting Factor Tuning Scheme Based on DRL
3.2.1. Generalities
- The state space corresponding to the set of all possible values of the states describing the environment with which the agent interacts;
- The action space corresponding to the set of all possible choices the agent can make to act on the environment;
- The transition dynamics characterizing the dynamics of the environment by giving the probability of obtaining a state at time knowing a state-action pair at time k;
- The reward function quantifying the desirability for the agent to perform action when the environment is in some state ;
- The discount factor determining the relative importance of future rewards compared to immediate ones, therefore influencing the agent’s consideration of long-term consequences when selecting an action to perform.
3.2.2. Twin Delayed Deep Deterministic Policy Gradient
3.2.3. Comparison of TD3 with Other Reinforcement Learning Algorithms
3.2.4. Problem Formulation
- The number of commutations of the VSI’s legs is directly linked to the switching frequency;
- The information contained in the q-axis current and its error is contained, at least partly, in the SPMSM’s phase a current, its error, and its harmonic distortion.
- The current tracking reward which incentivizes the SPMSM’s phase a current to follow its reference within a given tolerance margin and is expressed as
- The switching frequency reward ensuring that the switching frequency of the VSI follows its reference and is expressed as
- The specific goal reward increasing the reward if some specific performance thresholds are achieved, that is expressed as
- The weighting factor variation penalty discourages rapid changes in the weighting factors’ values to increase stability and is expressed as
- The control unbalance penalty penalizing the gap between the values of the two weighting factors, having for expression
3.2.5. Learning Scheme
Algorithm 1: TD3 training algorithm. |
4. Test Results and Discussion
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
ANN | Artificial neural network |
CCS | Continuous control set |
DDPG | Deep deterministic policy gradient |
DRL | Deep reinforcement learning |
FCS | Finite control set |
IAE | Integral absolute error |
ISE | Integral squared error |
ITAE | Integral time-weighted absolute error |
ITSE | Integral time-weighted squared error |
MOGA | Multi-objective genetic algorithm |
MDP | Markov decision process |
MPC | Model predictive control |
MPCC | Model predictive current control |
NN | Neural network |
PI | Proportional–integral |
PMSM | Permanent magnet synchronous motor |
RL | Reinforcement learning |
SAC | Soft Actor Critic |
SMPC | Sequential model predictive control |
SPMSM | Surface permanent magnet synchronous motor |
TD3 | Twin Delayed Deep Deterministic Gradient |
THD | Total harmonic distortion |
VSI | Voltage source inverter |
References
- Nam, K.H. AC Motor Control and Electrical Vehicle Applications, 1st ed.; CRC Press: Boca Raton, FL, USA, 2010. [Google Scholar]
- Xu, W.; Ismail, M.M.; Islam, M.R. Permanent Magnet Synchronous Machines and Drives: Flux Weakening Advanced Control Techniques; CRC Press: Boca Raton, FL, USA, 2024. [Google Scholar]
- Bida, V.M.; Samokhvalov, D.V.; Al-Mahturi, F.S. PMSM vector control techniques–A survey. In Proceedings of the 2018 IEEE Conference of Russian Young Researchers in Electrical and Electronic Engineering, Moscow, Russia, 29 January–1 February 2018; pp. 577–581. [Google Scholar]
- Yao, H.; Yan, Y.; Shi, T.; Zhang, G.; Wang, Z.; Xia, C. A novel SVPWM scheme for field-oriented vector-controlled PMSM drive system fed by cascaded H-bridge inverter. IEEE Trans. Power Electron. 2021, 36, 8988–9000. [Google Scholar] [CrossRef]
- Sun, X.; Diao, K.; Lei, G.; Guo, Y.; Zhu, J. Direct torque control based on a fast modeling method for a segmented-rotor switched reluctance motor in HEV application. IEEE J. Emerg. Sel. Top. Power Electron. 2021, 9, 232–241. [Google Scholar] [CrossRef]
- Yu, Z.; Long, J. Review on advanced model predictive control technologies for high-power converters and industrial drives. Electronics 2024, 13, 4969. [Google Scholar] [CrossRef]
- Mahmoud, H.; Mohamed, M.A.; Hassan, A.A.; Mossa, M.A. Optimal predictive voltage control of a wind driven five phase PMSG system feeding an isolated load. e-Prime Adv. Electr. Eng. Electron. Energy 2024, 9, 100697. [Google Scholar] [CrossRef]
- Zhang, Z.; Li, Z.; Kazmierkowski, M.P.; Rodríguez, J.; Kennel, R. Robust predictive control of three-level NPC back-to-back power converter PMSG wind turbine systems with revised predictions. IEEE Trans. Power Electron. 2018, 33, 9588–9598. [Google Scholar] [CrossRef]
- Zhang, Y.; Liu, X.; Li, H.; Zhang, Z. A model independent predictive control of PMSG wind turbine systems with a new mechanism to update variables. Energies 2023, 16, 3764. [Google Scholar] [CrossRef]
- Karimi, M.H.; Rostami, M.A.; Zamani, H. Continuous control set model predictive control for the optimal current control of permanent magnet synchronous motors. Control Eng. Pract. 2023, 138, 105590. [Google Scholar] [CrossRef]
- Karamanakos, P.; Geyer, T. Guidelines for the design of finite control set model predictive controllers. IEEE Trans. Power Electron. 2020, 35, 7434–7450. [Google Scholar] [CrossRef]
- Tomlinson, M.; du Toit Mouton, H.; Kennel, R.; Stolze, P. A fixed switching frequency scheme for finite-control-set model predictive control – Concept and algorithm. IEEE Trans. Ind. Electron. 2016, 63, 7662–7670. [Google Scholar] [CrossRef]
- Xu, S.; Sun, Z.; Yao, C.; Zhang, H.; Hua, W.; Ma, G. Model predictive control with constant switching frequency for three-level T-type inverter-fed PMSM drives. IEEE Trans. Ind. Electron. 2022, 69, 8839–8850. [Google Scholar] [CrossRef]
- Mossa, M.A.; Mohamed, R.A.; Al-Sumaiti, A.S. Performance enhancement of a grid connected wind turbine-based PMSG using effective predictive control algorithm. IEEE Access 2025, 13, 64160–64185. [Google Scholar] [CrossRef]
- Usama, M.; Kim, J. Model-free current control solution employing intelligent control for enhanced motor drive performance. Sci. Rep. 2025, 15, 60. [Google Scholar] [CrossRef] [PubMed]
- Dragičević, T.; Novak, M. Weighting factor design in model predictive control of power electronic converters: An artificial neural network approach. IEEE Trans. Ind. Electron. 2019, 66, 8870–8880. [Google Scholar] [CrossRef]
- Zhang, Y.; Zhang, Z.; Babayomi, O.; Li, Z. Weighting factor design techniques for predictive control of power electronics and motor drives. Symmetry 2023, 15, 1219. [Google Scholar] [CrossRef]
- Cortes, P.; Kouro, S.; La Rocca, B.; Vargas, R.; Rodriguez, J.; Leon, J.I.; Vazquez, S.; Franquelo, L.G. Guidelines for weighting factors design in Model Predictive Control of power converters and drives. In Proceedings of the 2009 IEEE International Conference on Industrial Technology, Churchill, Australia, 10–13 February 2009. [Google Scholar]
- Rodriguez, J.; Garcia, C.; Mora, A.; Davari, S.A.; Rodas, J.; Valencia, D.F.; Elmorshedy, M.; Wang, F.; Zuo, K.; Tarisciotti, L.; et al. Latest advances of model predictive control in electrical drives—Part II: Applications and benchmarking with classical control methods. IEEE Trans. Power Electron. 2022, 37, 5047–5061. [Google Scholar] [CrossRef]
- Caseiro, L.M.A.; Mendes, A.M.S.; Cruz, S.M.A. Dynamically weighted optimal switching vector model predictive control of power converters. IEEE Trans. Ind. Electron. 2019, 66, 1235–1245. [Google Scholar] [CrossRef]
- Li, X.; Zhang, H.; Shadmand, M.B.; Balog, R.S. Model predictive control of a voltage-source inverter with seamless transition between islanded and grid-connected operations. IEEE Trans. Ind. Electron. 2017, 64, 7906–7918. [Google Scholar] [CrossRef]
- Norambuena, M.; Rodriguez, J.; Zhang, Z.; Wang, F.; Garcia, C.; Kennel, R. A very simple strategy for high-quality performance of AC machines using model predictive control. IEEE Trans. Power Electron. 2019, 34, 794–800. [Google Scholar] [CrossRef]
- Zhang, J.; Li, L.; Norambuena, M.; Rodriguez, J.; Dorrell, D.G. Sequential model predictive control of direct matrix converter without weighting factors. In Proceedings of the IECON 2018-44th Annual Conference of the IEEE Industrial Electronics Society, Washington, DC, USA, 21–23 October 2018; pp. 1477–1482. [Google Scholar]
- Arshad, M.H.; Abido, M.A.; Salem, A.; Elsayed, A.H. Weighting factors optimization of model predictive torque control of induction motor using NSGA-II with TOPSIS decision making. IEEE Access 2019, 7, 177595–177606. [Google Scholar] [CrossRef]
- Li, S.; Won, H.; Fu, X.; Fairbank, M.; Wunsch, D.C.; Alonso, E. Neural-network vector controller for permanent-magnet synchronous motor drives: Simulated and hardware-validated results. IEEE Trans. Cybern. 2020, 50, 3218–3230. [Google Scholar] [CrossRef]
- Fu, X.; Li, S. A novel neural network vector control technique for induction motor drive. IEEE Trans. Energy Convers. 2015, 30, 1428–1437. [Google Scholar] [CrossRef]
- Guazzelli, P.R.U.; de Andrade Pereira, W.C.; de Oliveira, C.M.R.; de Castro, A.G.; de Aguiar, M.L. Weighting factors optimization of predictive torque control of induction motor by multiobjective genetic algorithm. IEEE Trans. Power Electron. 2019, 34, 6628–6638. [Google Scholar] [CrossRef]
- Zanchetta, P. Heuristic multi-objective optimization for cost function weights selection in finite states model predictive control. In Proceedings of the 2011 Workshop on Predictive Control of Electrical Drives and Power Electronics, Munich, Germany, 14–15 October 2011; pp. 70–75. [Google Scholar]
- Villarroel, F.; Espinoza, J.R.; Rojas, C.A.; Rodriguez, J.; Rivera, M.; Sbarbaro, D. Multiobjective switching state selector for finite-states model predictive control based on fuzzy decision making in a matrix converter. IEEE Trans. Ind. Electron. 2013, 60, 589–599. [Google Scholar] [CrossRef]
- Novak, M.; Xie, H.; Dragicevic, T.; Wang, F.; Rodriguez, J.; Blaabjerg, F. Optimal cost function parameter design in predictive torque control (PTC) using artificial neural networks (ANN). IEEE Trans. Ind. Electron. 2021, 68, 7309–7319. [Google Scholar] [CrossRef]
- Vazquez, S.; Marino, D.; Zafra, E.; Peña, M.D.V.; Rodríguez-Andina, J.J.; Franquelo, L.G.; Manic, M. An artificial intelligence approach for real-time tuning of weighting factors in FCS-MPC for power converters. IEEE Trans. Ind. Electron. 2022, 69, 11987–11998. [Google Scholar] [CrossRef]
- Fujimoto, S.; Hoof, H.; Meger, D. Addressing function approximation error in actor-critic methods. In Proceedings of the 35th International Conference on Machine Learning, Stockholm, Sweden, 10–15 July 2018; PMLR: New York, NY, USA, 2018; pp. 1587–1596. [Google Scholar]
- Deb, K. Multi-Objective Optimization Using Evolutionary Algorithms; John Wiley & Sons: Chichester, UK, 2001. [Google Scholar]
- Atkinson, K.E. An Introduction to Numerical Analysis; John Wiley & Sons: Hoboken, NJ, USA, 1989. [Google Scholar]
- Zhang, Y.; Xu, D.; Liu, J.; Gao, S.; Xu, W. Performance improvement of model-predictive current control of permanent magnet synchronous motor drives. IEEE Trans. Ind. Appl. 2017, 53, 3683–3695. [Google Scholar] [CrossRef]
- Sutton, R.S.; Barto, A.G. Reinforcement Learning: An Introduction, 2nd ed.; MIT Press: Cambridge, MA, USA, 2018. [Google Scholar]
- Lillicrap, T.P.; Hunt, J.J.; Pritzel, A.; Heess, N.; Erez, T.; Tassa, Y.; Silver, D.; Wierstra, D. Continuous control with deep reinforcement learning. arXiv 2019, arXiv:1509.02971. [Google Scholar]
- Haarnoja, T.; Zhou, A.; Abbeel, P.; Levine, S. Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor. In Proceedings of the 35th International Conference on Machine Learning, Stockholm, Sweden, 10–15 July 2018; PMLR: New York, NY, USA, 2018. [Google Scholar]
000 | 0 | 0 | 0 |
100 | |||
110 | |||
010 | |||
011 | |||
001 | |||
101 | |||
111 | 0 | 0 | 0 |
Input Features | Error Metrics | Error Metrics | ||||||
---|---|---|---|---|---|---|---|---|
ISE | ITSE | IAE | ITAE | ISE | ITSE | IAE | ITAE | |
0.08091 | ||||||||
Algorithm | Stability | Sample Efficiency | Computational Cost | Control Performance |
---|---|---|---|---|
DDPG | Low | Moderate | Low | Moderate |
SAC | High | High | High | High |
TD3 | High | Moderate | Moderate | High |
Aspect | DRL | MOGA-Based ANN |
---|---|---|
Learning approach | Trial-and-error interaction | Offline optimization followed by supervised learning |
Adaptability | Dynamically adjusting to changing conditions | Pre-trained for specific scenarios |
Computational load | High during training and deployment | High for training but low for online use |
Multi-objective handling | Implicit in reward design | Explicit via Pareto-optimal solutions |
Implementation complexity | High (requires advanced DRL setup) | Moderate (MOGA and ANN integration) |
Robustness to variations | Strong but need proper selection of state and reward | Dependent on training dataset |
Parameter | Description | Value |
---|---|---|
, | Lower and upper bound for the weighting factor | and 20 |
, | Lower and upper bound for the weighting factor | and |
MOGA population size | 120 | |
Number of MOGA generations | 50 | |
Number of RL training episodes | 800 | |
Episode steps | ||
Tolerance margin for the current error | ||
Size of the mini-batch in Algorithm 1 | 256 | |
Size of the replay buffer | ||
Learning rate | ||
Weight for average update of the target critics | ||
Agent discount factor |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Usama, M.; Salaje, A.; Chevet, T.; Langlois, N. Optimal Weighting Factors Design for Model Predictive Current Controller for Enhanced Dynamic Performance of PMSM Employing Deep Reinforcement Learning. Appl. Sci. 2025, 15, 5874. https://doi.org/10.3390/app15115874
Usama M, Salaje A, Chevet T, Langlois N. Optimal Weighting Factors Design for Model Predictive Current Controller for Enhanced Dynamic Performance of PMSM Employing Deep Reinforcement Learning. Applied Sciences. 2025; 15(11):5874. https://doi.org/10.3390/app15115874
Chicago/Turabian StyleUsama, Muhammad, Amine Salaje, Thomas Chevet, and Nicolas Langlois. 2025. "Optimal Weighting Factors Design for Model Predictive Current Controller for Enhanced Dynamic Performance of PMSM Employing Deep Reinforcement Learning" Applied Sciences 15, no. 11: 5874. https://doi.org/10.3390/app15115874
APA StyleUsama, M., Salaje, A., Chevet, T., & Langlois, N. (2025). Optimal Weighting Factors Design for Model Predictive Current Controller for Enhanced Dynamic Performance of PMSM Employing Deep Reinforcement Learning. Applied Sciences, 15(11), 5874. https://doi.org/10.3390/app15115874