Experimental Implementation of Reinforcement Learning Applied to Maximise Energy from a Wave Energy Converter
Abstract
:1. Introduction
2. Experimental Set-Up
3. Mathematical Model
3.1. Hydrodynamic Model
3.2. Resistive Control
3.3. Model Validation
4. Reinforcement Learning (RL)
RL on Resistive Control
- State Space: This corresponds to the vector that is formed based on the dimension of the vector that stores all the values of the angle as shown in Equation (7):
- Action Space: Considering the selected state space, the action space is defined in Equation (9):
- Reward Function: This represents the objective that is expected to be maximised by the controller. Therefore, in this case, the reward function needs to be a function of the absorbed power. To prevent the device from making excessive movements in extreme wave conditions that could disrupt the proper functioning of the system, the agent is penalised with a reward of if the established limit value, denoted as , is exceeded. If it is equal to or less than this value, the average power raised to a factor u is obtained, as in [17], which aims to amplify the value because the difference between the average power obtained between neighbouring values of can be very low and must be evident to avoid convergence problems in the algorithm.
- Exploration Strategy: To address the exploration–exploitation dilemma [18], an -greedy strategy is followed, which defines the action taken at each time step as follows:
5. Experimental Procedure
6. Results and Discussion
7. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
WEC | Wave Energy Converter |
MPC | Model Predictive Control |
PTO | Power Take-Off |
RL | Reinforcement Learning |
LPT | Linear Potential Theory |
BEM | Boundary Element Method |
PLA | Polylactic Acid |
DC | Direct Current |
References
- Salter, S. Wave power. Nature 1974, 7720, 249–260. [Google Scholar] [CrossRef]
- Mørk, G.; Barstow, S.; Kabuth, A.; Pontes, M. T Assessing the global wave energy potential. In Proceedings of the International Conference on Offshore Mechanics and Arctic Engineering-OMAE, Shanghai, China, 6–11 June 2010; Volume 3, pp. 447–454. [Google Scholar]
- Golbaz, D.; Asadi, R.; Amini, E.; Mehdipour, H.; Nasiri, M.; Nezhad, M.M.; Naeeni, S.T.O.; Neshat, M. Ocean Wave Energy Converters Optimization: A Comprehensive Review on Research Directions. arXiv 2021, arXiv:2105.07180. [Google Scholar]
- Coe, R.; Bacelli, G.; Wilson, D.; Abdelkhalik, O.; Korde, U.; Robinett, R., III. A comparison of control strategies for wave energy converters. Int. J. Mar. Energy 2017, 20, 45–63. [Google Scholar] [CrossRef]
- Maria-Arenas, A.; Garrido, A.; Rusu, E.; Garrido, I. Control Strategies Applied to Wave Energy Converters: State of the Art. Energies 2019, 12, 3115. [Google Scholar] [CrossRef]
- Anderlini, E.; Forehand, D.I.; Stansell, P.; Xiao, Q.; Abusara, M. Control of a point absorber using reinforcement learning. IEEE Trans. Sustain. Energy 2016, 7, 1681–1690. [Google Scholar] [CrossRef]
- Anderlini, E.; Forehand, I.D.; Bannon, E.; Xiao, Q.; Abusara, M. Reactive control of a two-body point absorber using reinforcement learning. Ocean Eng. 2018, 148, 650–658. [Google Scholar] [CrossRef]
- Budal, K.; Falnes, J. Optimum Operation of Wave Power Converter; Internal Report; Norwegian University of Science and Technology: Trondheim, Norway, 1976; pp. 1–12. [Google Scholar]
- Hasankhani, A.; Tang, Y.; Van Zwieten, J.; Sultan, C. Comparison of Deep Reinforcement Learning and Model Predictive Control for Real-Time Depth Optimization of a Lifting Surface Controlled Ocean Current Turbine. In Proceedings of the 2021 IEEE Conference on Control Technology and Applications (CCTA), San Diego, CA, USA, 8–11 August 2021. [Google Scholar] [CrossRef]
- Tiron, R.; Pinck, C.; Reynaud, E.G.; Dias, F. Is Boufouling a Critical Issue For Wave Energy Converters? In Proceedings of the 22nd International Offshore and Polar Engineering Conference, Rhodes, Greece, 17–23 June 2012. [Google Scholar]
- Bruzzone, L.; Fanghella, P.; Berselli, G. Reinforcement Learning control of an onshore oscillating arm Wave Energy Converter. Ocean Eng. 2020, 206, 107346. [Google Scholar] [CrossRef]
- Zou, S.; Zhou, X.; Khan, I.; Weaver, W.W.; Rahman, S. Optimization of the electricity generation of a wave energy converter using deep reinforcement learning. Ocean. Eng. 2022, 244, 110363. [Google Scholar] [CrossRef]
- Anderlini, E.; Husain, S.; Parker, G.G.; Abusara, M.; Thomas, G. Towards real-time reinforcement learning control of a wave energy converter. J. Mar. Sci. Eng. 2020, 8, 845. [Google Scholar] [CrossRef]
- Chen, K.; Huang, X.; Lin, Z.; Xiao, X.; Han, Y. Control of a Wave Energy Converter Using Model-Free Deep Reinforcement Learning. In Proceedings of the 2024 UKACC 14th International Conference on Control (CONTROL), Winchester, UK, 10–12 April 2024; pp. 1–6. [Google Scholar] [CrossRef]
- Chen, K.; Huang, X.; Lin, Z.; Xiao, X.; Han, Y. Design and Tank Testing of Reinforcement Learning Control for Wave Energy Converters. IEEE Trans. Sustain. Energy 2024, 15, 2534–2546. [Google Scholar] [CrossRef]
- Pierart, P.; Rubilar, M.; Rothen, J. Experimental Validation of Damping Adjustment Method with Generator Parameter Study for Wave Energy Conversion. Energies 2023, 16, 5298. [Google Scholar] [CrossRef]
- Anderlini, E.; Forehand, D.I.; Bannon, E.; Abusara, M. Control of a Realistic Wave Energy Converter Model Using Least-Squares Policy Iteration. IEEE Trans. Sustain. Energy 2017, 8, 1618–1628. [Google Scholar] [CrossRef]
- Sutton, R.S.; Barto, A.G. Reinforcement Learning; MIT Press: Cambridge, MA, USA, 2018. [Google Scholar]
h | l | ||||
---|---|---|---|---|---|
0.252 | 0.5916 | 0.48 | 5.40 | 1.33 | 0.21 |
km (V/rad/s) | (H) | D (kgm2)/s | J (kgm2) | |
---|---|---|---|---|
0.050 | 7.22 | 0.01 | 3.9 × 10−5 | 0.43 |
Initial Power (mW) | Final Power (mW) | Efficiency (%) (Initial) | Efficiency (%) (Final) | |
---|---|---|---|---|
Theoretical | 1.16 | 4.01 | 4.9 | 16.9 |
Experimental | 1.531 | 4.198 | 6.4 | 17.7 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Pierart, F.G.; Campos, P.G.; Basoalto, C.E.; Rohten, J.; Davey, T. Experimental Implementation of Reinforcement Learning Applied to Maximise Energy from a Wave Energy Converter. Energies 2024, 17, 5087. https://doi.org/10.3390/en17205087
Pierart FG, Campos PG, Basoalto CE, Rohten J, Davey T. Experimental Implementation of Reinforcement Learning Applied to Maximise Energy from a Wave Energy Converter. Energies. 2024; 17(20):5087. https://doi.org/10.3390/en17205087
Chicago/Turabian StylePierart, Fabian G., Pedro G. Campos, Cristian E. Basoalto, Jaime Rohten, and Thomas Davey. 2024. "Experimental Implementation of Reinforcement Learning Applied to Maximise Energy from a Wave Energy Converter" Energies 17, no. 20: 5087. https://doi.org/10.3390/en17205087
APA StylePierart, F. G., Campos, P. G., Basoalto, C. E., Rohten, J., & Davey, T. (2024). Experimental Implementation of Reinforcement Learning Applied to Maximise Energy from a Wave Energy Converter. Energies, 17(20), 5087. https://doi.org/10.3390/en17205087