Next Article in Journal
Numerical Study of the Effects of Roughness Coupled with Inclination on a Turbulent Flow around an Obstacle
Previous Article in Journal
Optimization of Cellulase-Assisted Extraction of Total Flavonoids from Equisetum via Response Surface Methodology Based on Antioxidant Activity
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Development of an MCTS Model for Hydrogen Production Optimisation

1
Department of Computer Systems, Faculty of Information Technologies, Latvia University of Life Sciences and Technologies, Liela iela 2, LV-3001 Jelgava, Latvia
2
Institute of Electronics and Computer Science, 14 Dzerbenes Street, LV-1006 Riga, Latvia
*
Author to whom correspondence should be addressed.
Processes 2023, 11(7), 1977; https://doi.org/10.3390/pr11071977
Submission received: 25 April 2023 / Revised: 25 June 2023 / Accepted: 26 June 2023 / Published: 30 June 2023

Abstract

:
Hydrogen has the potential to revolutionize the energy industry due to its clean-burning and versatile properties. It is the most abundant element in the universe and can be produced through a variety of methods, including electrolysis. The widespread adoption of hydrogen faces various challenges, including the high cost of production; thus, it is important to optimise the production processes. This research focuses on development of models for hydrogen production optimisation based on various external factors and parameters. Models based on electricity prices are developed and compared between different market situations. To run hydrogen production more effectively, it is required to use renewable energy sources for the production process. Adding the solar power component to the economic evaluation model outcome is more positive. The Monte Carlo tree search (MCTS) algorithm is adapted to effectively control the electrolysis process. MCTS schedule optimization was performed for a 24 h time horizon applying two time-resolution settings—1 h and 15 min. The results demonstrate the potential of the MCTS algorithm for finding good schedules for water electrolyser devices by taking into account variable environmental factors. Whereas the MCTS with a 15 min resolution ensures mathematically better results, it requires more computational power to solve the decision tree.

1. Introduction

The unstoppable growth of the population and the world economy, together with shifting from rural to urban living has led to a significant increase in energy demand [1,2]. Utilisation of fossil fuels as the main source of energy caused an increase in the carbon dioxide (CO2) in the atmosphere, which is the main cause of global warming [3]. It would be beneficial for future energy sustainability, resilience and global security to decarbonise the energy supply by using renewable energy sources [4]. One of the options is to use hydrogen (H2). H2 has received a great deal of attention as an energy carrier from both scientists and policymakers [5]. Hydrogen energy is a clean and low-carbon energy, and its combustion product is water that has no carbon emissions [6]. It is predicted that H2 will play a significant role in shifting the global energy system towards a sustainable energy system by 2050 [7]. In 2019, the entire worldwide hydrogen production was around 500 billion m3 per year [8]. Hydrogen can be produced from different resources, including fossil fuels and renewable energy (RE) resources [9]; however, only the type produced using renewable energy is considered as “green” hydrogen [10]. Additionally, nuclear power could bring a significant contribution to hydrogen supplies in the future [11]). Nowadays, 96% of H2 is produced from non-renewables and the remaining 4 % by water electrolysis [12]. Thus, RE resources will play a significant role in the transition process towards a clean and sustainable energy system worldwide [13]. Unfortunately, renewable energy (such as solar or wind) is intermittent and can be random [14], so it is not possible to fully account for it while the hydrogen production process is ongoing. In addition, it should be mentioned that renewable energy characteristics can significantly influence electrolyser performance [15]. To increase the economic efficiency of the hydrogen production process, it is required to control the process based on various parameters, including the availability of renewable sources, actual electricity prices, demand for hydrogen, etc. In the long term, a hydrogen-based economy could contribute towards reducing carbon dioxide emissions and local environmental pollution, securing energy independence and achieving a sustainable energy supply [16].
Two primary methods are used to produce H2: steam methane reforming of natural gas and electrolysis of water. Within this study, the latter method is considered, which contributes to green and sustainable energy production policy. Electrolysis is an electrochemical reaction that splits water into H2 and O2 with electricity input and it is considered the most developed and commercial technology for hydrogen production [17].
There are different hydrogen production systems available for the electrolysis of water; the most common are alkaline (AEL) and polymer electrolyte membrane (PEMEL) systems. Alkaline electrolysers have good durability and maturity but have a low partial load range and low current density [18]. Compared with AEL systems, polymer PEMEL enables flexible operation, requires less than 1 min for start-up and seconds to ramp up from an idle state to the maximum rated power [19]. A fast response to dynamic power supplies allows PEMEL to provide services in the control reserves market and to take advantage of dynamic electricity prices as it is one of the important issues in hydrogen production.
The aim of this study was to develop different models based on electricity prices and the Monte Carlo tree search (MCTS) algorithm to effectively control the electrolysis process by changing the operation modes of the electrolyser (switching on/off).
The MCTS was selected as it provides capabilities to solve decision trees with high branching factors. Additionally, the MCTS is not dependent upon the starting position (root) of the decision tree and can be used ad hoc when it is needed. The authors found a new opportunity for this algorithm application, which is usually limited to gaming environments.
In regard to electricity price modelling, various methods of electricity pricing such as critical peak pricing (CPP), time-of-use pricing (TUP), real-time pricing (RTP) and so forth have been presented. RTP is reportedly one of the best methods [20].
Optimal control strategies can significantly boost the efficiency of hydrogen electrolysers [21]. Control strategies for hydrogen-producing systems that use solar energy are important and required because several parameters (e.g., the number of electrolyser on/off cycles, system component power levels) affect the performance of the whole system [22,23]. Regarding the control strategies, [23] proposed a novel model that aims to schedule the production of hydrogen by an alkaline water electrolysis system and using renewable energy. The developed model was a mixed-integer linear program that incorporates several variables describing operational states and transitions, as well as taking into account the hydrogen flow. A scheduling approach using a combination of solar power and power from the grid was also proposed by [24], where various aspects within the alkaline water electrolysis system were considered, such as electricity price, costs related to the operation of the electrolyser, the solar radiation profile, etc. The control of the scheduling was based on solving optimisation formulas that include the aforementioned aspects. In a study by [25], a segmented fuzzy control method was proposed for scheduling the alkaline electrolyser by using a wind power system. The implementation of fluctuating and highly intermittent renewable energy for the electrolyser operation is challenging, as each component of a hydrogen energy system needs to be optimised to increase the operation time and system efficiency [26].
It is important to note that the field of electrolysis process control is dynamic and that new methods and technologies can emerge in the future. Some of the methods used for the control of electrolysis processes are:
  • Advanced Process Control (APC): APC techniques involve using sophisticated algorithms and models to optimise the performance of electrolysis processes [27]. These methods take into account various process parameters and use real-time data to adjust operating conditions and optimise energy efficiency.
  • Model Predictive Control (MPC): MPC is an advanced control strategy that uses mathematical models of the process to predict its behaviour and optimise control actions [28]. By considering process constraints and objectives, MPC can determine the optimal setpoints for various process variables, such as temperature, current density and electrolyte flow rate.
  • Machine Learning and Artificial Intelligence (AI): Machine learning and AI techniques have gained significant attention in process control applications. These methods involve developing models and algorithms that can learn from data and make predictions or control decisions. By utilizing historical data and real-time measurements, machine learning algorithms can optimise the electrolysis process and improve its performance [29].

2. Materials and Methods

2.1. Electricity-Price-Based Electrolyser Control

In the first stage, simple models based on electricity prices are developed to showcase different scenarios for hydrogen production in various markets. For the comparison, four countries are selected: Latvia, Germany, Spain and Romania, as these countries are partners in the ERA-NET project “New technology to produce hydrogen from Renewable Energy Sources based on AI with optimized costs for environmental applications” (HydroG(re)EnergY-Env).
From the available technoeconomic analysis, it was found that the cost of electricity constitutes up to 40–57% of the levelised cost of hydrogen. However, this cost can be reduced by considering different geographical locations and smart operation strategies. Additionally, the carbon footprint of the electricity source must be considered to ensure that the hydrogen produced has a minimal emission of CO2.
For this study, electricity prices are taken from the “Nord Pool” market (https://www.nordpoolgroup.com/ (accessed on 8 February 2023) but data are acquired from the EU Transparency platform ENTSO-E (https://newtransparency.entsoe.eu/ (accessed on 8 February 2023) using the provided application programming interface (API).
For calculation purposes, a small hydrogen production facility is considered with a power consumption of 58 kWh (which could produce approx. 1 kg of H2). For this approach (electricity-price-based electrolyser control), only H2 prices are considered: 6.25 EUR/kg.
For model demonstration purposes, the time frame for the calculations is taken as one week, but if required it can be changed and adapted based on calculation and analysis needs.
To increase the profit of the production, it is advisable to switch off the technological process when the market electricity price is high and costs for the production exceeds the potential income from the hydrogen. A threshold of 150 EUR/MWh was selected, which corresponds to the daily income for H2 production—at this threshold H2 production becomes profitable (24 kg of H2 per day with a H2 price of 6.25 EUR/kg).

2.2. Electrolyser Schedule Optimisation Using the MCTS Algorithm

At the next stage, the aforementioned approach was extended with additional input parameters and variables and the Monte Carlo tree search (MCTS) algorithm was adopted to optimise the electrolyser operation schedule.
By the definition, the MCTS is a heuristic best-first search algorithm that combines Monte Carlo simulation with searching combinatorial spaces represented by decision trees. In such trees, nodes denote states and edges denote transitions (actions) from one state to another [30]. The MCTS iteratively explores the space by repeating random samples according to Monte Carlo simulation technique [31] and gradually biassing the exploration toward the most promising regions of the decision tree [32].
Each iteration of the MCTS algorithm usually involves four phases: selection, expansion, simulation and backpropagation [33]. Although intermediate states do not need to be evaluated, the MCTS is very good at shrinking large-sized problems by only taking the reward from the terminal state at the end of the simulation process, then balancing the exploitation and exploration [34].
The MCTS has been widely used in planning problems and video and board games such as Go, Poker, Settlers of Catan, Civilization, Sokoban, etc. [35,36,37,38,39,40].
In the authors case, a flow chart for the execution of the MCTS for the problem solution is depicted in Figure 1:
The execution of the MCTS algorithm involves updating of the prices for electricity, hydrogen and oxygen in the expenses model when available. The current state of the electrolyser is taken into account as one of the model parameters. The MCTS algorithm resolves the decision tree defined by the model and produces a schedule that can be applied for changing the operational modes of the electrolyser.
Several MCTS algorithm varieties have been developed over the years for specific-use cases. In particular, single-player MCTS was proposed by [41]. The main differences between SP-MCTS and MCTS are the range of values for the problem states—for two-player environments the reward value alternates depending on which of the players makes the decision (the best move for a player is the worst move for his opponent). Additionally, SP-MCTS has no uncertainty about the opponent’s action, as there are no opponents.

2.3. Optimisation Problem Setup

An algorithm adaptation based on the ideas from the SP-MCTS was used for finding the optimal control strategy of a H2-producing electrolyser considering variable external parameters. The authors used a simplified model for the H2 production process via electrolysis and only critical parts of it were considered for the demonstration of the MCTS approach. The possible states and corresponding parameters of the modelled electrolyser are depicted in Figure 2.
A distinctive feature of electrolysers is the characteristic process inertia—H2 production does not start immediately upon powering up the electrolyser, nor stop immediately after powering it down—electrolysers go through specific “StartUp” and “Shutdown” sequences that can take up to 1 h (used in authors model). The time for these sequences can vary for different electrolysers, as it is dependent on the required electrolyte temperature for maximum efficiency as well as on the membrane types used. Moreover, due to internal component degradation (mostly membrane), electrolysers have a limited number of “On/Off” cycles before maintenance is needed. The distinct state sequences were introduced into the model with the associated energy consumption profiles. The fixed costs associated with the Start/Shutdown state cycles represent the maintenance costs due to internal component degradation.
Upon reaching the production state, the electrolyser load can be varied within a limited range. The product output (H2 and O2) changes according to the applied electrical power. Discrete low/medium/high production states are used within the model. The assumptions are made based on small-scale electrolysers available on the market (e.g., https://pureenergycentre.com/hydrogen-products-pure-energy-centre/hydrogen-electrolyser/ (accessed on 8 February 2023), taking an hourly H2 production of 1 kg and an energy consumption of ~60 kWh.
The model variables (see Table 1) and parameters (see Table 2) of the modelled electrolyser are summarised in below. The selected parameters are not taken from any source and are assumed by the authors based on the descriptions of small-scale electrolysers available on market and used for demonstration purposes only.
Electrolysers are not used as stand-alone devices but rather as a part of a larger infrastructure that provides access to all needed supplies (electricity and water), storage and other facilities. Within the model, such infrastructure components are considered as external parameters and are modelled in the form of costs of supplies and the value of output products. The following components are included in the model:
  • An array of photovoltaic solar (PV) cells with a maximum capacity of 100 kW of energy. This represents a source of renewable energy with available excess power that varies over the day.
  • The relative solar irradiation is modelled for a 12 h long light day, which roughly corresponds to the spring or fall equinoxes. Depending on the actual day of the year and the latitude the solar declination angle will change further, affecting the solar irradiation captured by the PV cells.
  • The power grid connection represents a source of energy with virtually unlimited capacity but with prices changing every hour. Real “Nord Pool” spot market prices for 8 February 2023 were used for modelling (see Figure 3). The model assumes that all available renewable energy is used first and that only the remaining demand is covered by the power grid.
  • Output products (H2 and O2) are considered as income for the production process and modelled as sold for a fixed price. The H2 price can vary based on various factors and production methods [42]. The oxygen generated by electrolysis can be used for some medical and niche applications [43], so can also provide additional income. The prices considered for the model were: PH2 for 5.00 EUR/kg and PO2 for 0.10 EUR/kg.
The objective function (OF) is defined as (1) and aims to maximize revenue from H2 and O2 production for a 24 h schedule by taking into account varying energy prices and available renewable energy sources.
O F = t = 1 T Q t H 2 · P H 2 + Q t O 2 · P O 2 m a x E t E t P V , 0 · P t E C t
where:
  • T is the timespan;
  • Qt is the quantity of products produced;
  • P is the price of products;
  • Et is the energy consumed by the electrolyser;
  • E t P V is the energy provided by the PV cells;
  • P t E is the grid energy price;
  • Ct is the fixed maintenance cost (component degradation).

2.4. MCTS Adaptation for Electrolyser Schedule Optimisation

The MCTS algorithm finds the best solution by exploring the most promising branches of decision space. For the electrolyser schedule optimisation, the tree nodes represent possible electrolyser states and each tree level corresponds to a discrete time step. The MCTS algorithm starts from the root of the decision tree and selects child nodes based on their value. The root of the tree corresponds to the current state of the system, which in the case of electrolyser was selected as the off state at midnight.
Upon reaching a leaf node (unexplored branch of the decision tree), the MCTS algorithm expands the tree by adding possible transitions to the next states. The next one of the newly added states is selected and simulated: uniform random actions are applied to the electrolyser until it reaches the end of the timespan. At the end of the simulation stage, the objective function value OF is calculated.
Finally, the obtained OF value is back propagated to all previously selected nodes of the decision tree via updating their value. This gives an estimate of how valuable the previously selected branch of the decision tree is, and the MCTS algorithm balances between exploring new branches and exploiting the most promising branches that are already known. The MCTS workflow is depicted in Figure 4.
The MCTS algorithm usually evaluates nodes using the value of upper confidence bound 1 applied to trees (UCT), which also ensures balancing between exploration and exploitation strategies. However, the UCT was originally designed for zero-sum two-player games and specifically considers binary results of the simulation—either a win or a loss—which keeps the average value for the node in the range of [0; 1].
For the electrolyser, the OF values represent the costs and revenues of production and are not bound to specific ranges. To overcome this peculiarity, the authors applied sigmoid scaling on the average OF value prior to the UCT calculation. The parameters of the sigmoid (see Equations (2) and (3) and Figure 5) are experimentally selected to fit the range [−250; 150] from the electrolyser model (the parameters of the model have been biased towards expenses, thus most common OF values are negative).
f v = v v m i n + v m a x v m i n 2 10 v m a x v m i n
where:
  • v is the raw value of the objective function OF
  • vmin is the lower range bound (−250)
  • vmax is the upper range bound (150)
σ v = 1 1 + e f v
The final UCT formula used in the MCTS algorithm is (4).
U C T = σ v n + c ln N n
where:
  • v is the total value of the node;
  • n is the total number of simulations from the node;
  • N is the total number of simulations from the node’s parent;
  • c is the exploration parameter; a value of 1.5 is used by the authors.

3. Results

3.1. The Economic Results of Electricity-Price-Based Electrolyser Control

Figure 6 demonstrates the possible scenario of hydrogen production operation for one week (in four countries), taking into account the costs for electricity and the potential income from the H2. The specification of the electrolyser should be considered as there could be some limitations to the number of on/off switching cycles. Modern electrolysers did not require much time to start production at the maximal rate and almost instantly start up.
The production costs and income calculations results are as follows:
  • 24 × 6.25 = EUR 150 (income per day when using a ~60 kWh electrolyser that produces ~1 kg of H2 per h) as a threshold (shown in Figure 6);
  • The overall income per week = 150 × 7 = EUR 1050;
  • The overall costs for H2 production for the selected countries are: LV—EUR 1160.71; RO—EUR 1468.17; DE—EUR 1439.10; ES—EUR 1256.10.
It can be concluded that hydrogen production only using the grid power is not economically feasible in all the selected countries for the selected week (4–10 February 2023), as the costs for electricity are higher than the potential income.
With the dynamic market price for electricity, the operating modes for hydrogen production can be changed and production can potentially be stopped during hours when the electricity price is higher than the defined threshold. In Figure 7, the below electricity price for one specific date is demonstrated and it can be concluded that hydrogen production can be stopped from 8 to 10 and from 14 to 18, thus minimising the overall production costs. Obviously, the amount of hydrogen produced will also be decreased. In this case, hydrogen production will be for 16 h out of 24 h and costs will be EUR 80.67, but the income from H2 = EUR 100. Therefore, it is profitable in this case, with a profit of EUR 19.33.
Calculating the income and costs for one week (but excluding hours with high electricity prices) leads to the following results: H2 production hours: 120 h; total energy used: 6960 kWh; costs: EUR 665.30; income: EUR 780; profit: EUR 114.70.
To run hydrogen production more effectively, it is required to use renewable energy sources for the production process. Adding the solar power component to the economic evaluation model outcome is more positive. A solar system of 100 kW is taken as a reference for calculations.
Figure demonstrates the usage of grid and solar power within a one-day timeframe. The bars in Figure 8 represent the electricity market price.
The calculations for the Latvian market prices for the one-week scenario presented are as follows: costs = EUR 718.95; however, without solar power it was EUR 1160.71, so the costs are decreased by 441.76 (38%). The usage of solar power would decrease the operational costs of hydrogen production, but investments in the solar system also should be taken into consideration in real projects.

3.2. MCTS Results

MCTS schedule optimisation was performed for a 24 h time horizon. The schedule started at midnight with the electrolyser switched off. Two time-resolution settings were used in the experiments—1 h and 15 min. The first implies a 24-level deep decision tree; the latter has a depth of 96 levels. Figure 9 and Figure 10 show the best solutions found by the MCTS algorithm alongside a few manually created schedules for comparison. The summaries for each schedule include the total energy consumed E, hydrogen produced QH2, oxygen produced QO2 and profit F.
The results demonstrate the potential of the MCTS algorithm to find good schedules for water electrolyser devices by considering variable environmental factors. Although the MCTS with a 15 min resolution ensures mathematically better results (the estimated profit is EUR 47.24 vs. EUR 47.37), it requires more computational power to solve the decision tree. The electrolyser state model has an average branching factor of 3, thus the number of possible states to evaluate is around 324 and 396 for the 1 h and 15 min resolutions, respectively.
An additional advantage of the MCTS is that it is able to search for solutions from any state of the electrolyser and thus can be used ad hoc. Figure 11 shows the best schedules found by the MCTS algorithm considering the initial state of the electrolyser. Such a feature allows the use of the MCTS in a reactive manner when external factors change in order to adapt to the actual situation. Another useful feature of the MCTS is partial results. MCTS, at its core, investigates a decision tree and gradually builds confidence in the most promising branches. After a number of iterations, even if the time horizon is not yet reached by the algorithm, the latest branching option has already converged to a solution and can thus be used for decision making.

4. Discussion

The described hydrogen production scheduling problem can be also solved by other approaches and methods. For instance, reinforcement learning (RL) can be applied. RL is a branch of machine learning that incorporates human-level control [44]. As a branch of machine learning, RL is gaining popularity beyond the machine learning community because of its broader applicability. As reinforcement learning does not require a previous data set for training and the agent can learn instantly, it has a strong appeal for real-time applications, for example for UAV task scheduling [45], for building-energy management [46] and for robot manipulations [47]; it can also be applied in intelligent healthcare applications [48], etc. The general model for RL consists of the agent, environment, actions, states and rewards. The purpose of RL is for the agent to maximise the reward by continuously taking actions in response to an environment. The next action can be defined based on the rewards and exploration–exploitation strategies [49].
Both RL and MCTS share some important similarities in their approach to decision-making problems, but they also have some key differences in their algorithms and applications. Both of these approaches involve learning from experience and involve exploration and the use of value functions. Additionally, RL and MCTS can be combined with deep learning. Deep RL combines reinforcement learning with deep neural networks to handle high-dimensional state spaces. Deep MCTS combines MCTS with neural networks to improve the search process.
Another platform that can be used for such a problem is Google’s OR-Tools (Operations Research) (https://developers.google.com/optimization (accessed on 20 March 2023). OR-Tools is an open-source software suite for optimisation that is tuned for tackling the world’s toughest problems in vehicle routing, flows, integer and linear programming and constraint programming.
The achieved modelling results cannot be truly validated and directly compared with other experiments in the literature, as a similar setup is not found in sources and industrial experiments. However, the main outcome of this study is the proof-of-concept validation of the MCTS algorithm and its potential application for the control of the electrolysis process. In the future, we plan to extend the model with additional parameters and to apply this method for controlling the real electrolysis process in an experimental plant.

5. Conclusions

Hydrogen can be produced from a variety of sources, including fossil fuels, renewable energy sources and water. Cost reductions in renewable energy sources such as solar, wind and electrolysers will increase hydrogen production competitiveness over time.
Due to its unique properties and potential applications, hydrogen is the subject of active research and development in many fields.
Simple models based on electricity prices can be used to demonstrate different scenarios for hydrogen production in various markets. It is found that to gain economic benefits, the usage of solar or other renewable energy power is mandatory.
The study of this algorithm, which is based on SP-MCTS ideas, is used to define the optimal control strategy for the electrolyser in terms of the production of H2.
One of the advantages of MCTS is that it is able to search for problem solutions when the electrolyser is in any state and can thus be used ad hoc.

Author Contributions

Conceptualization, A.Z. and V.K.; methodology, A.Z. and A.K.; software, V.K. and A.K. validation, all authors, writing—original draft preparation, A.Z., V.K. and A.K.; writing—review and editing, K.O., A.N. and K.S. All authors have read and agreed to the published version of the manuscript.

Funding

The scientific research and publication of this article were supported by the ERA-NET Project “New technology to produce hydrogen from Renewable Energy Sources based on AI with optimized costs for environmental applications” (HydroG(re)EnergY-Env, No. 112068).

Data Availability Statement

Data available on request.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Alanne, K.; Cao, S. An overview of the concept and technology of ubiquitous energy. Appl. Energy 2019, 238, 284–302. [Google Scholar] [CrossRef]
  2. Dawood, F.; Anda, M.; Shafiullah, G.M. Hydrogen Production for Energy: An Overview. Int. J. Hydrogen Energy 2020, 45, 3847–3869. [Google Scholar] [CrossRef]
  3. Boudellal, M. Power-to-Gas: Renewable Hydrogen Economy for the Energy Transition; Walter de Gruyter GmbH & Co KG: Berlin, Germany, 2023. [Google Scholar]
  4. Strielkowski, W.; Civín, L.; Tarkhanova, E.; Tvaronavičienė, M.; Petrenko, Y. Renewable Energy in the Sustainable Development of Electrical Power Sector: A Review. Energies 2021, 14, 8240. [Google Scholar] [CrossRef]
  5. Marbán, G.; Valdés-Solís, T. Towards the hydrogen economy? Int. J. Hydrogen Energy 2007, 32, 1625–1637. [Google Scholar] [CrossRef] [Green Version]
  6. Shatnawi, M.; Al Qaydi, N.; Aljaberi, N.; Aljaberi, M. Hydrogen-based energy storage systems: A review. In Proceedings of the 2018 7th International Conference on Renewable Energy Research and Applications (ICRERA), Paris, France, 14–17 October 2018; pp. 697–700. [Google Scholar]
  7. Staffell, I.; Scamman, D.; Abad, A.V.; Balcombe, P.; Dodds, P.E.; Ekins, P.; Shah, N.; Ward, K.R. The role of hydrogen and fuel cells in the global energy system. Energy Environ. Sci. 2019, 12, 463–491. [Google Scholar] [CrossRef] [Green Version]
  8. Kumar, S.S.; Himabindu, V. Hydrogen production by PEM water electrolysis—A review. Mater. Sci. Energy Technol. 2019, 2, 442–454. [Google Scholar]
  9. El-Shafie, M.; Kambara, S.; Hayakawa, Y. Hydrogen Production Technologies Overview. J. Power Energy Eng. 2019, 7, 107–154. [Google Scholar] [CrossRef] [Green Version]
  10. Oliveira, A.M.; Beswick, R.R.; Yan, Y. A green hydrogen economy for a renewable energy society. Curr. Opin. Chem. Eng. 2021, 33, 100701. [Google Scholar] [CrossRef]
  11. El-Emam, R.S.; Özcan, H. Comprehensive review on the techno-economics of sustainable large-scale clean hydrogen production. J. Clean. Prod. 2019, 220, 593–609. [Google Scholar] [CrossRef]
  12. Kannah, R.Y.; Kavitha, S.; Karthikeyan, O.P.; Kumar, G.; Dai-Viet, N.V.; Banu, J.R. Techno-economic assessment of various hydrogen production methods—A review. Bioresour. Technol. 2021, 319, 124175. [Google Scholar] [CrossRef]
  13. Gielen, D.; Boshell, F.; Saygin, D.; Bazilian, M.D.; Wagner, N.; Gorini, R. The role of renewable energy in the global energy transformation. Energy Strategy Rev. 2019, 24, 38–50. [Google Scholar] [CrossRef]
  14. Ji, Y.; Wang, J.; Xu, J.; Fang, X.; Zhang, H. Real-Time Energy Management of a Microgrid Using Deep Reinforcement Learning. Energies 2019, 12, 2291. [Google Scholar] [CrossRef] [Green Version]
  15. Kojima, H.; Nagasawa, K.; Todoroki, N.; Ito, Y.; Matsui, T.; Nakajima, R. Influence of renewable energy power fluctuations on water electrolysis for green hydrogen production. Int. J. Hydrogen Energy 2023, 48, 4572–4593. [Google Scholar] [CrossRef]
  16. Mehmeti, A.; Angelis-Dimakis, A.; Arampatzis, G.; McPhail, S.J.; Ulgiati, S. Life Cycle Assessment and Water Footprint of Hydrogen Production Methods: From Conventional to Emerging Technologies. Environments 2018, 5, 24. [Google Scholar] [CrossRef] [Green Version]
  17. Gondal, I.A.; Masood, S.A.; Khan, R. Green hydrogen production potential for developing a hydrogen economy in Pakistan. Int. J. Hydrogen Energy 2018, 43, 6011–6039. [Google Scholar] [CrossRef]
  18. Schmidt, O.; Gambhir, A.; Staffell, I.; Hawkes, A.; Nelson, J.; Few, S. Future cost and performance of water electrolysis: An expert elicitation study. Int. J. Hydrogen Energy 2017, 42, 30470–30492. [Google Scholar] [CrossRef]
  19. IRENA. Hydrogen from renewable power: Technology outlook for the energy transition. Int. Renew. Energy Agency Abu Dhabi 2018, 2018, 52. [Google Scholar]
  20. Motamedi, A.; Zareipour, H.; Rosehart, W.D. Electricity market price forecasting in a price-responsive smart grid environment. In Proceedings of the IEEE PES General Meeting, Minneapolis, MN, USA, 25–29 July 2010; pp. 1–4. [Google Scholar]
  21. Arsad, A.; Hannan, M.; Al-Shetwi, A.Q.; Hossain, M.; Begum, R.; Ker, P.J.; Salehi, F.; Muttaqi, K. Hydrogen electrolyser for sustainable energy production: A bibliometric analysis and future directions. Int. J. Hydrogen Energy 2023, 48, 4960–4983. [Google Scholar] [CrossRef]
  22. Ulleberg, Ø. The importance of control strategies in PV–hydrogen systems. Sol. Energy 2004, 76, 323–329. [Google Scholar] [CrossRef]
  23. Varela, C.; Mostafa, M.; Zondervan, E. Modeling alkaline water electrolysis for power-to-x applications: A scheduling approach. Int. J. Hydrogen Energy 2021, 46, 9303–9313. [Google Scholar] [CrossRef]
  24. Yang, Y.; De La Torre, B.; Stewart, K.; Lair, L.; Phan, N.L.; Das, R.; Gonzalez, D.; Lo, R.C. The scheduling of alkaline water electrolysis for hydrogen production using hybrid energy sources. Energy Convers. Manag. 2022, 257, 115408. [Google Scholar] [CrossRef]
  25. Hong, Z.; Wei, Z.; Han, X. Optimization scheduling control strategy of wind-hydrogen system considering hydrogen production efficiency. J. Energy Storage 2022, 47, 103609. [Google Scholar] [CrossRef]
  26. Brauns, J.; Turek, T. Alkaline Water Electrolysis Powered by Renewable Energy: A Review. Processes 2020, 8, 248. [Google Scholar] [CrossRef] [Green Version]
  27. Bauer, M.; Craig, I.K. Economic assessment of advanced process control—A survey and framework. J. Process. Control. 2008, 18, 2–18. [Google Scholar] [CrossRef]
  28. Shin, Y.; Smith, R.; Hwang, S. Development of model predictive control system using an artificial neural network: A case study with a distillation column. J. Clean. Prod. 2020, 277, 124124. [Google Scholar] [CrossRef]
  29. Salari, A.; Shakibi, H.; Habibi, A.; Hakkaki-Fard, A. Optimization of a solar-based PEM methanol/water electrolyzer using machine learning and animal-inspired algorithms. Energy Convers. Manag. 2023, 283, 116876. [Google Scholar] [CrossRef]
  30. Świechowski, M.; Godlewski, K.; Sawicki, B.; Mańdziuk, J. Monte Carlo Tree Search: A review of recent modifications and applications. Artif. Intell. Rev. 2023, 56, 2497–2562. [Google Scholar] [CrossRef]
  31. Bai, F.; Ju, X.; Wang, S.; Zhou, W.; Liu, F. Wind farm layout optimization using adaptive evolutionary algorithm with Monte Carlo Tree Search reinforcement learning. Energy Convers. Manag. 2022, 252, 115047. [Google Scholar] [CrossRef]
  32. Rakotoarison, H.; Schoenauer, M.; Sebag, M. Automated machine learning with monte-carlo tree search. arXiv 2019, arXiv:1906.00170. [Google Scholar]
  33. Chaslot, G.; Bakkes, S.; Szita, I.; Spronck, P. Monte-carlo tree search: A new framework for game ai. In Proceedings of the AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment, Palo Alto, CA, USA, 22–24 October 2008; Volume 4, pp. 216–217. [Google Scholar]
  34. Liu, X.; Fotouhi, A. Formula-E race strategy development using artificial neural networks and Monte Carlo tree search. Neural Comput. Appl. 2020, 32, 15191–15207. [Google Scholar] [CrossRef] [Green Version]
  35. Ponsen, M.J.; Gerritsen, G.; Chaslot, G. Integrating Opponent Models with Monte-Carlo Tree Search in Poker. In Proceedings of the 3rd AAAI Conference on Interactive Decision Theory and Game Theory; AAAI Press: Washington, DC, USA, 2010; pp. 37–42. [Google Scholar]
  36. Robles, D.; Rohlfshagen, P.; Lucas, S.M. Learning non-random moves for playing Othello: Improving Monte Carlo tree search. In Proceedings of the 2011 IEEE Conference on Computational Intelligence and Games (CIG’11), Seoul, Republic of Korea, 31 August–3 September 2011; pp. 305–312. [Google Scholar]
  37. Szita, I.; Chaslot, G.; Spronck, P. Monte-carlo tree search in settlers of catan. In Proceedings of the Advances in Computer Games: 12th International Conference, ACG 2009, Pamplona, Spain, 11–13 May 2009; Revised Papers 12. Springer: Berlin/Heidelberg, Germany, 2009; pp. 21–32. [Google Scholar]
  38. Silver, D.; Huang, A.; Maddison, C.J.; Guez, A.; Sifre, L.; van den Driessche, G.; Schrittwieser, J.; Antonoglou, I.; Panneershelvam, V.; Lanctot, M.; et al. Mastering the game of Go with deep neural networks and tree search. Nature 2016, 529, 484–489. [Google Scholar] [CrossRef]
  39. Brown, N.; Bakhtin, A.; Lerer, A.; Gong, Q. Combining deep reinforcement learning and search for imperfect-information games. Adv. Neural Inf. Process. Syst. 2020, 33, 17057–17069. [Google Scholar]
  40. Crippa, M.; Lanzi, P.L.; Marocchi, F. An analysis of Single-Player Monte Carlo Tree Search performance in Sokoban. Expert Syst. Appl. 2022, 192, 116224. [Google Scholar] [CrossRef]
  41. Schadd, M.P.; Winands, M.H.; Van Den Herik, H.J.; Chaslot, G.M.B.; Uiterwijk, J.W. Single-player monte-carlo tree search. In Proceedings of the Computers and Games: 6th International Conference, CG 2008, Beijing, China, 29 September–1 October 2008; Proceedings 6. Springer: Berlin/Heidelberg, Germany, 2008; pp. 1–12. [Google Scholar]
  42. Gandhi, K.; Apostoleris, H.; Sgouridis, S. Catching the hydrogen train: Economics-driven green hydrogen adoption potential in the United Arab Emirates. Int. J. Hydrogen Energy 2022, 47, 22285–22301. [Google Scholar] [CrossRef]
  43. Nicita, A.; Maggio, G.; Andaloro, A.; Squadrito, G. Green hydrogen as feedstock: Financial analysis of a photovoltaic-powered electrolysis plant. Int. J. Hydrogen Energy 2020, 45, 11395–11408. [Google Scholar] [CrossRef]
  44. Mnih, V.; Kavukcuoglu, K.; Silver, D.; Rusu, A.A.; Veness, J.; Bellemare, M.G.; Graves, A.; Riedmiller, M.; Fidjeland, A.K.; Ostrovski, G.; et al. Human-level control through deep reinforcement learning. Nature 2015, 518, 529–533. [Google Scholar] [CrossRef] [PubMed]
  45. Yang, J.; You, X.; Wu, G.; Hassan, M.M.; Almogren, A.; Guna, J. Application of reinforcement learning in UAV cluster task scheduling. Futur. Gener. Comput. Syst. 2019, 95, 140–148. [Google Scholar] [CrossRef]
  46. Mason, K.; Grijalva, S. A review of reinforcement learning for autonomous building energy management. Comput. Electr. Eng. 2019, 78, 300–312. [Google Scholar] [CrossRef] [Green Version]
  47. Nguyen, H.; La, H. Review of deep reinforcement learning for robot manipulation. In Proceedings of the 2019 Third IEEE International Conference on Robotic Computing (IRC), Naples, Italy, 25–27 February 2019; pp. 590–595. [Google Scholar]
  48. Coronato, A.; Naeem, M.; De Pietro, G.; Paragliola, G. Reinforcement learning for intelligent healthcare applications: A survey. Artif. Intell. Med. 2020, 109, 101964. [Google Scholar] [CrossRef]
  49. Hsu, R.C.; Liu, C.-T.; Chen, W.-Y.; Hsieh, H.-I.; Wang, H.-L. A Reinforcement Learning-Based Maximum Power Point Tracking Method for Photovoltaic Array. Int. J. Photoenergy 2015, 2015, 496401. [Google Scholar] [CrossRef]
Figure 1. Flow chart of the execution of the MCTS algorithm.
Figure 1. Flow chart of the execution of the MCTS algorithm.
Processes 11 01977 g001
Figure 2. State diagram of a simplified electrolyser for H2 production.
Figure 2. State diagram of a simplified electrolyser for H2 production.
Processes 11 01977 g002
Figure 3. Grid energy prices and available PV energy for each hour of one specific day.
Figure 3. Grid energy prices and available PV energy for each hour of one specific day.
Processes 11 01977 g003
Figure 4. Four phases of the MCTS algorithm.
Figure 4. Four phases of the MCTS algorithm.
Processes 11 01977 g004
Figure 5. Sigmoid scaling of the OF value.
Figure 5. Sigmoid scaling of the OF value.
Processes 11 01977 g005
Figure 6. Possible scenario of hydrogen production based on fluctuating costs of the electricity.
Figure 6. Possible scenario of hydrogen production based on fluctuating costs of the electricity.
Processes 11 01977 g006
Figure 7. Possible scenario of hydrogen production based on dynamic electricity prices (green color—hours of production and red—hours of idle state of the electrolyser).
Figure 7. Possible scenario of hydrogen production based on dynamic electricity prices (green color—hours of production and red—hours of idle state of the electrolyser).
Processes 11 01977 g007
Figure 8. Grid and solar power usage for H2 production.
Figure 8. Grid and solar power usage for H2 production.
Processes 11 01977 g008
Figure 9. Comparison of electrolyser schedules (1 h resolution). The manually created schedules show the electrolyser running on (a) constant low power, (b) constant high power for the whole day and (c) power adjusted for daylight hours. (d) The best schedule found by the MCTS algorithm after taking into account available renewable energy sources and grid electricity prices.
Figure 9. Comparison of electrolyser schedules (1 h resolution). The manually created schedules show the electrolyser running on (a) constant low power, (b) constant high power for the whole day and (c) power adjusted for daylight hours. (d) The best schedule found by the MCTS algorithm after taking into account available renewable energy sources and grid electricity prices.
Processes 11 01977 g009
Figure 10. Comparison of electrolyser schedules (15 min resolution). The manually created schedules show the electrolyser running on (a) constant high power for the whole day and (b) power adjusted for daylight hours. (c) The schedule transferred from the best results for the MCTS with a 1 h resolution (see Figure 9d). (d) The best schedule found by an MCTS with a 15 min resolution.
Figure 10. Comparison of electrolyser schedules (15 min resolution). The manually created schedules show the electrolyser running on (a) constant high power for the whole day and (b) power adjusted for daylight hours. (c) The schedule transferred from the best results for the MCTS with a 1 h resolution (see Figure 9d). (d) The best schedule found by an MCTS with a 15 min resolution.
Processes 11 01977 g010
Figure 11. The best electrolyser schedules depend on initial state. The electrolyser starts from the (a) off and (b) high-production states.
Figure 11. The best electrolyser schedules depend on initial state. The electrolyser starts from the (a) off and (b) high-production states.
Processes 11 01977 g011
Table 1. List of nomenclature.
Table 1. List of nomenclature.
SymbolPropertyUnits
RH2H2 production ratekg/h
RO2O2 production ratekg/h
CFixed costsEUR
EElectrical energykWh
TTimespans
PH2Price of H2EUR
PO2Price of O2EUR
PEPrice of electricity per kWhEUR
FProfitEUR
QH2Quantity of H2 producedkg
QO2Quantity of O2 producedkg
Table 2. Summary of the electrolyser states and their parameters.
Table 2. Summary of the electrolyser states and their parameters.
Electrolyser StateElectrical Energy
E, kWh
Fixed Costs
C, EUR
RH2, kg/hRO2, kg/hRemarks
Start2.03.00--Flat maintenance costs represent membrane degradation over time
Purge5.0---Electrolyser purging (water pumping) before starting production
Heating15.0---Energy consumption for reaching optimal electrolyte temperature
BuildUp20.0---Pressure build-up phase with no product output
Shutdown2.03.00--Flat maintenance costs represent membrane degradation over time
Low production20.0-0.21.6H2 production on low energy profile
Mid production40.0-0.64.8H2 production on medium energy profile
High production60.0-1.08.0H2 production on high energy profile
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Komasilovs, V.; Zacepins, A.; Kviesis, A.; Ozols, K.; Nikulins, A.; Sudars, K. Development of an MCTS Model for Hydrogen Production Optimisation. Processes 2023, 11, 1977. https://doi.org/10.3390/pr11071977

AMA Style

Komasilovs V, Zacepins A, Kviesis A, Ozols K, Nikulins A, Sudars K. Development of an MCTS Model for Hydrogen Production Optimisation. Processes. 2023; 11(7):1977. https://doi.org/10.3390/pr11071977

Chicago/Turabian Style

Komasilovs, Vitalijs, Aleksejs Zacepins, Armands Kviesis, Kaspars Ozols, Arturs Nikulins, and Kaspars Sudars. 2023. "Development of an MCTS Model for Hydrogen Production Optimisation" Processes 11, no. 7: 1977. https://doi.org/10.3390/pr11071977

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop