Stochastic Control for Sustainable Hydrogen Generation in Standalone PV–Battery–PEM Electrolyzer Systems

Aatabe, Mohamed; Jenkal, Wissam; Mosaad, Mohamed I.; Hussien, Shimaa A.

doi:10.3390/en18153899

by Mohamed Aatabe ¹,*, Wissam Jenkal ¹, Mohamed I. Mosaad ² and Shimaa A. Hussien ³

¹ LISTI, National School of Applied Sciences, Ibn Zohr University, Agadir B.P. 1136, Morocco

² Royal Commission Yanbu Colleges & Institutes, Yanbu Industrial College, Yanbu 46452, Saudi Arabia

³ Electrical Department, College of Engineering, Princess Nourah bint Abdulrahman University, Riyadh 11671, Saudi Arabia

* Author to whom correspondence should be addressed.

Energies 2025, 18(15), 3899; https://doi.org/10.3390/en18153899

Submission received: 8 June 2025 / Revised: 4 July 2025 / Accepted: 14 July 2025 / Published: 22 July 2025

(This article belongs to the Section A2: Solar Energy and Photovoltaic Systems)

Abstract

Standalone photovoltaic (PV) systems offer a viable path to decentralized energy access but face limitations during periods of low solar irradiance. While batteries provide short-term storage, their capacity constraints often restrict the use of surplus energy, highlighting the need for long-duration solutions. Green hydrogen, generated via proton exchange membrane (PEM) electrolyzers, offers a scalable alternative. This study proposes a stochastic energy management framework that leverages a Markov decision process (MDP) to coordinate PV generation, battery storage, and hydrogen production under variable irradiance and uncertain load demand. The strategy dynamically allocates power flows, ensuring system stability and efficient energy utilization. Real-time weather data from Goiás, Brazil, is used to simulate system behavior under realistic conditions. Compared to the conventional perturb and observe (P&O) technique, the proposed method significantly improves system performance, achieving a 99.9% average efficiency (vs. 98.64%) and a drastically lower average tracking error of 0.3125 (vs. 9.8836). This enhanced tracking accuracy ensures faster convergence to the maximum power point, even during abrupt load changes, thereby increasing the effective use of solar energy. As a direct consequence, green hydrogen production is maximized while energy curtailment is minimized. The results confirm the robustness of the MDP-based control, demonstrating improved responsiveness, reduced downtime, and enhanced hydrogen yield, thus supporting sustainable energy conversion in off-grid environments.

Keywords: standalone PV microgrid; PEM hydrogen production; stochastic energy management; Markov decision process

1. Introduction

Hydrogen has emerged as a versatile and clean energy carrier, widely recognized for its high energy density, ease of storage, and lack of harmful emissions [1,2,3]. Traditional hydrogen production methods, such as natural gas reforming and coal gasification, are associated with significant carbon and pollutant emissions [4,5,6]. In contrast, water electrolysis powered by renewable energy sources offers a sustainable and environmentally friendly alternative—green hydrogen. Among the various electrolyzer technologies, proton exchange membrane (PEM) electrolyzers stand out due to their compact design, high current density, and ability to produce high-purity hydrogen [7,8,9]. PEM electrolysis is particularly well-suited for integration with renewable energy sources like solar and wind power, addressing the need for green hydrogen production.

Integrating renewable energy, particularly photovoltaic (PV) systems, into hydrogen production through electrolysis presents a promising pathway for sustainable energy solutions. Unlike conventional methods reliant on fossil fuels, this approach significantly reduces environmental impact while leveraging the declining cost of solar electricity [10,11]. However, due to natural fluctuations, PV generation has inherent intermittency, requiring hybrid energy storage systems for dependable hydrogen production. Integrating PV systems with PEM electrolyzers allows two-phase energy storage (short-term battery buffering and long-term hydrogen storage), thereby assuring uninterrupted operation [12,13].

Despite their promising potential, standalone PV–Battery–PEM electrolyzer systems for green hydrogen production present both advantages and challenges that must be carefully considered. One of the primary advantages is the ability to directly utilize intermittent solar energy to generate hydrogen, enabling clean and decentralized energy storage that can decouple energy generation from consumption in time and space. The integration of batteries provides short-term energy buffering, smoothing fluctuations in PV output, and enhancing system reliability, while PEM electrolyzers offer fast dynamic response and high efficiency at varying loads, making them well-suited to cope with solar variability. Furthermore, PEM electrolyzers produce high-purity hydrogen without the need for extensive downstream purification, and their modular design allows for scalability and rapid startup/shutdown cycles, which align well with the variable nature of renewable energy.

However, these systems also face several challenges that limit their widespread deployment. The high capital costs of PEM electrolyzers and batteries remain a significant barrier, alongside operational costs related to maintenance and component degradation, particularly the limited cycle life of batteries under frequent charge–discharge cycles. Moreover, the variability and intermittency of solar energy impose complex control and energy management requirements to optimize hydrogen production without excessive energy curtailment or premature battery aging. The efficiency of the entire PV–Battery–PEM system depends heavily on accurate forecasting and dynamic control strategies to balance energy flows among components while meeting variable load demands. Additionally, PEM electrolyzers require stable operating conditions, and frequent power fluctuations can accelerate membrane degradation, reducing system lifetime and increasing replacement costs.

With advanced energy management and control strategies in place to address these limitations, the standalone DC PV microgrid plays a pivotal role in bridging the gap between the intermittent nature of solar energy and the stable power demands of electrical loads and hydrogen production. The system relies on solar panels to generate DC electricity, which is then regulated by DC-DC converters to maintain optimal voltage levels for different components [14,15]. An integrated energy management system (EMS) dynamically allocates power between direct consumption, battery storage for short-term fluctuations, and hydrogen production for long-term energy storage. Additionally, charge controllers ensure efficient energy flow to storage units, preventing overcharging and optimizing battery lifespan [16,17].

Effective power optimization is crucial to improving the efficiency and reliability of PV-PEM microgrids, where solar energy is the primary source for both electrical loads and hydrogen production. A well-designed control strategy ensures the optimal power distribution between the load demand, energy storage system (ESS), and electrolyzer, enhancing the overall system stability [18,19]. Due to PV power’s intermittent nature, fluctuations in solar irradiance can lead to sudden variations in power output, which, if not properly managed, can negatively impact system performance.

To address these challenges, DC-DC converters play a critical role in regulating the power flow from the PV array to the loads, batteries, and electrolyzer. The converter adjusts the voltage and current levels to ensure efficient energy transfer while mitigating the effects of power fluctuations [20,21]. Maximum power point tracking (MPPT) control further enhances system performance by dynamically adjusting the operating point of the PV array to extract the maximum available power under varying solar irradiance conditions [22,23]. By integrating an advanced MPPT control, the system can quickly respond to changes in solar input and optimize power allocation. This ensures stable operation and prevents excessive voltage deviations that could compromise the efficiency of the ESS or the electrolyzer. Efficient coordination of the energy storage system, in turn, smooths power fluctuations.

When solar generation exceeds demand, surplus energy can be stored in batteries, ensuring optimal energy utilization. Conversely, the stored energy can be dispatched during periods of low solar availability to maintain a stable and reliable power supply. Additionally, surplus energy can be strategically allocated to power electrolyzers for hydrogen production, optimizing resource utilization and reducing energy wastage. By dynamically managing power flow within the PV-PEM microgrid, the EMS enhances energy efficiency, reduces operational costs, and improves the overall sustainability of hydrogen production systems.

Despite significant advancements in renewable energy technologies, a critical challenge in the real-world deployment of standalone PV systems is the unpredictable nature of load demand. Traditional deterministic energy management strategies often rely on fixed forecasts or average load profiles, making them inadequate in handling abrupt, stochastic variations in consumption. These limitations lead to inefficient energy allocation, increased reliance on batteries, and underutilization of surplus solar energy. Consequently, there is a pressing need for energy management approaches that can anticipate and adapt to random load fluctuations in real time. This research addresses this gap by proposing a stochastic EMS framework based on a Markov decision process (MDP), designed to optimize energy flow within a PV–Battery–PEM electrolyzer system and ensure sustainable hydrogen production under uncertain operating conditions.

1.1. Literature Review

Recent research has demonstrated a growing interest in standalone renewable energy systems for green hydrogen production. The authors of [24] propose an autonomous hybrid wind–solar system designed for green hydrogen production and water treatment, focusing on optimal sizing and economic feasibility to support hydrogen refueling stations. Similarly, Ref. [25] addresses the adaptability of renewable energy systems to various demand profiles, such as rural, institutional, and medical needs, by using optimization algorithms to ensure economically and environmentally sound configurations. Moreover, an operational optimization method tailored to off-grid hydrogen systems is designed in [26], emphasizing the reduction of design inefficiencies and improving long-term operational reliability.

Furthermore, the authors of [27] explore optimal hybrid microgrid configurations by performing extensive techno-economic simulations, with a focus on cost minimization, renewable penetration, and hydrogen production potential. Their study provides valuable insights into long-term system planning and highlights hydrogen’s role in sustainable energy strategies. However, their approach is purely design-oriented and relies on deterministic scenarios with averaged resource and load profiles. In contrast, this work shifts the focus to real-time stochastic control that dynamically manages energy flows under uncertain and time-varying solar generation and load demand.

In terms of control strategies and energy flow management, various studies have explored advanced optimization techniques for hydrogen-integrated microgrids. The authors of [17] implement a model predictive control (MPC) scheme to manage a standalone PV-hydrogen-battery system, effectively reducing battery cycling and prioritizing hydrogen production during energy surplus periods. Ref. [28] introduces a hierarchical economic MPC framework that coordinates short- and long-term control layers to maintain system flexibility and economic performance. In the same context, an MPC-based management system for an on-site hydrogen refueling station is designed in [29], considering dynamic constraints and operational scheduling to maximize hydrogen production and distribution efficiency. Furthermore, the study presented in [30] proposes a neural network-based predictive control system capable of smoothing power fluctuations in a solar-wind hybrid system with hydrogen and battery storage, thereby advancing intelligent control applications in hydrogen energy systems.

Several researchers have also focused on improving system reliability through enhanced power electronics and hybrid storage solutions. Ref. [20] proposes a two-stage DC–DC conversion architecture combining a resonant frequency converter and a partial power regulation unit to facilitate MPPT and efficient electrolyzer operation in off-grid PV systems. The resonant frequency converter and partial power regulation unit enhance MPPT and electrolyzer functionality; nonetheless, the study’s dependence on idealized irradiance profiles neglects actual transient situations, such as rapid cloud cover. The intricacy of the suggested topology, necessitating synchronized management of several conversion steps, may elevate maintenance costs and diminish scalability for field deployments. These constraints highlight the need for resilient control systems that can adjust to erratic solar fluctuations.

The authors of [31] advocate for sliding mode control and bidirectional converters to improve operational flexibility and storage efficiency in standalone PV systems. An integer linear programming formulation combined with MPC was proposed in [32] to optimize hydrogen-battery hybrid storage, aiming to reduce component degradation and enhance economic performance. Ref. [33] takes a complementary approach by proposing a hybrid energy storage system, optimized using a fractional gradient descent algorithm, that integrates batteries, fuel cells, and supercapacitors; together, these reduce stress on any single storage element and extend the operational life of the system.

Finally, the authors of [21] propose a simplified architecture for standalone PV-powered hydrogen generation by eliminating traditional power converters and implementing a degradation-aware control strategy. Their approach maintains constant electrolyzer power despite irradiance variations, achieving impressive efficiency metrics through indirect PV control and strategic battery use. However, unlike the proposed stochastic control framework, their method lacks real-time adaptability to random load fluctuations. The direct electrical coupling between the PV array and electrolyzer limits the system’s flexibility, particularly under unpredictable load conditions.

Ref. [34] focuses on long-term performance estimation through machine learning, identifying the most suitable forecasting models for hydrogen production across different geographic and climatic conditions based on extensive weather datasets. These works emphasize the value of predictive and simplified design methodologies in making green hydrogen systems more accessible, resilient, and scalable for off-grid and distributed energy applications. Yet the methodology of [34] requires high-resolution input data (1-min irradiance/wind measurements), which may be unfeasible for remote off-grid locations with inadequate monitoring infrastructure. These restrictions indicate a need for adaptive algorithms that balance prediction accuracy with practical operating limits.

Despite the comprehensive advancements in the design, control, and optimization of the PV-PEM microgrid via optimized power electronics, streamlined architectures, and machine learning forecasting, a critical limitation across much of the existing literature lies in the limited consideration of demand-side uncertainty, particularly in scenarios where load behavior exhibits random variations. Many studies emphasize the optimization of generation resources, energy storage, and power electronics, yet they often operate under assumptions of predictable or averaged load profiles. Such simplifications can significantly underestimate the operational challenges faced in real-world off-grid systems, where consumption patterns may fluctuate unpredictably due to user behavior, seasonal changes, or application-specific demands. This oversight leaves a gap in ensuring robust energy management strategies that can dynamically adapt to random load variations. Addressing this gap requires control models that optimize energy flow and incorporate stochastic representations of load demand to reflect actual operating conditions and enhance the resilience of autonomous PV microgrids.

Incorporating stochastic modeling into the control architecture of PV-PEM microgrids is essential for capturing the inherent randomness associated with real-world load consumption. Unlike deterministic methods that rely on fixed or average values, stochastic models use probability distributions and random variables to describe fluctuations in energy demand more realistically. This approach allows for anticipating a wide range of possible consumption scenarios, rather than a single expected outcome, enabling the system to prepare for both typical and extreme load conditions. By integrating uncertainty directly into forecasting and decision-making processes, stochastic modeling enhances the responsiveness and flexibility of energy management strategies. As a result, PV-PEM microgrids are better equipped to allocate resources, schedule storage use, and maintain supply-demand balance, even under erratic or rapidly changing consumption patterns. This capability is especially critical in isolated or autonomous systems, where forecasting errors can lead to power shortages, unnecessary cycling of storage devices, or system instability.
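To make the idea of describing load demand with random variables concrete, the following is a minimal sketch, assuming the load takes one of three discrete levels and evolves as a Markov chain. The load levels and the transition matrix `P` are hypothetical illustration values, not data from this study.

```python
import random

# Hypothetical load levels (W) and transition probabilities (illustrative only).
LOAD_LEVELS_W = [50.0, 150.0, 300.0]
P = [
    [0.80, 0.15, 0.05],  # from low load
    [0.10, 0.80, 0.10],  # from medium load
    [0.05, 0.15, 0.80],  # from high load
]

def simulate_load(n_steps, start_state=0, seed=0):
    """Sample one load trajectory (W) from the Markov chain."""
    rng = random.Random(seed)
    state = start_state
    profile = []
    for _ in range(n_steps):
        profile.append(LOAD_LEVELS_W[state])
        # Draw the next state using the current state's row of P as weights.
        state = rng.choices(range(len(P)), weights=P[state])[0]
    return profile

def expected_next_load(state):
    """One-step-ahead expected load (W) given the current state."""
    return sum(p * w for p, w in zip(P[state], LOAD_LEVELS_W))
```

A controller can use `expected_next_load` as a point forecast, or sample many trajectories with `simulate_load` to evaluate a dispatch rule over a range of demand scenarios rather than a single average profile.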

MDP offers a structured and adaptable approach for addressing uncertainties in energy management, particularly in standalone PV microgrids dedicated to green hydrogen production. In such systems, the unpredictable nature of load demand and hydrogen production dynamics introduces complexity that requires probabilistic modeling. MDPs are well-suited to this context, as they represent decision-making in environments where the outcome of each action depends only on the current system state and a set of probabilistic transitions. This property makes them ideal for modeling sequential decision-making under uncertainty, where actions, such as charging batteries, powering electrolyzers, or shedding loads, must be taken in response to fluctuating inputs. Integrating MDPs into the control strategy makes it possible to evaluate each decision’s long-term impact on system performance, balancing hydrogen production efficiency, storage stability, and energy availability. The ability of MDPs to generate optimal policies over time enhances the resilience and intelligence of the microgrid, enabling it to adapt dynamically to both forecasted and unforeseen changes in its operational environment.
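As an illustration of how an MDP yields a policy for such decisions, the sketch below runs value iteration on a toy problem. It is not the formulation used in this work: the three battery levels, the two actions, the surplus probability, the rewards, and the discount factor are all hypothetical.

```python
# Toy MDP (all numbers hypothetical): state = battery level 0..2,
# action 0 = send surplus to the battery, action 1 = send it to the electrolyzer.
N_SOC = 3
ACTIONS = (0, 1)
P_SURPLUS = 0.6          # probability that PV exceeds the load in a step
GAMMA = 0.9              # discount factor
H2_REWARD = 1.0          # reward for one step of hydrogen production
UNSERVED_PENALTY = -5.0  # penalty when a deficit meets an empty battery

def outcomes(s, a):
    """Return [(probability, next_state, reward), ...] for state s, action a."""
    if a == 0:
        surplus = (P_SURPLUS, min(s + 1, N_SOC - 1), 0.0)
    else:
        surplus = (P_SURPLUS, s, H2_REWARD)
    if s > 0:
        deficit = (1.0 - P_SURPLUS, s - 1, 0.0)  # battery covers the load
    else:
        deficit = (1.0 - P_SURPLUS, 0, UNSERVED_PENALTY)
    return [surplus, deficit]

def q_value(s, a, V):
    """Expected discounted return of taking action a in state s."""
    return sum(p * (r + GAMMA * V[s2]) for p, s2, r in outcomes(s, a))

def value_iteration(tol=1e-9):
    """Iterate the Bellman optimality update until the values converge."""
    V = [0.0] * N_SOC
    while True:
        V_new = [max(q_value(s, a, V) for a in ACTIONS) for s in range(N_SOC)]
        converged = max(abs(x - y) for x, y in zip(V, V_new)) < tol
        V = V_new
        if converged:
            break
    policy = [max(ACTIONS, key=lambda a: q_value(s, a, V)) for s in range(N_SOC)]
    return V, policy
```

With these illustrative numbers, the resulting policy charges the battery while it is not full and routes surplus to the electrolyzer once it is, which mirrors the priority order described for the microgrid.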

1.2. Main Contribution

This research introduces a stochastic energy management approach designed to enhance the utilization of excess energy in standalone PV microgrids, with the primary objective of maximizing green hydrogen production. The proposed framework leverages an MDP integrated with energy management control to anticipate unpredictable load consumption and optimize the distribution of generated power between local consumption, battery storage, and a PEM electrolyzer. Unlike conventional strategies that rely on deterministic assumptions, this method incorporates the stochastic nature of load behavior to make informed, real-time decisions that prevent energy wastage. By accurately forecasting when energy demand will be low, the system intelligently channels surplus power toward hydrogen production rather than letting it go unused or overcharging the storage systems. This ensures a more efficient exploitation of solar energy, even under fluctuating environmental and consumption conditions, thereby improving the reliability and sustainability of standalone microgrids designed for hydrogen generation.

The main contributions of this work can be summarized as follows:

A novel integration of MDP and energy management control is proposed to manage power flows in a standalone PV–battery–electrolyzer system, specifically focusing on converting excess solar energy into hydrogen by anticipating future load behavior.
The method enhances forecasting accuracy of energy consumption patterns, enabling the system to proactively allocate available power to hydrogen production when predicting low demand.
The approach ensures continuous and efficient hydrogen generation by maintaining operational stability despite random solar input and load profile variations.
Battery lifetime is extended through smart control of charge–discharge cycles, reducing unnecessary cycling by prioritizing hydrogen production during energy surplus periods.

The remainder of this paper is structured as follows. Section 2 outlines the architecture of the standalone PV–battery–electrolyzer microgrid, including its main components and the integration of DC loads. Section 3 details the proposed stochastic energy management strategy, combining energy management control with an MDP for optimized power distribution. In Section 4, simulation results are presented to evaluate the performance of the developed method under realistic operating conditions. Finally, Section 5 summarizes the key findings and offers concluding insights.

2. Overview of the Standalone PV–PEM–Microgrid System

A standalone DC PV-PEM microgrid is an autonomous energy system engineered to function independently from the main power grid, with the primary goal of generating and storing renewable electricity for green hydrogen production. It integrates energy conversion, regulation, storage, and consumption subsystems in a coordinated manner to ensure reliable and continuous operation. As shown in Figure 1, the system comprises a PV array, DC-DC converters, a hydrogen production unit based on a PEM electrolyzer, and energy storage components. The PV modules are the main energy source, converting solar radiation into DC electricity through the PV effect. Their output depends on irradiance and ambient temperature, and is inherently variable throughout the day.

Power conditioning stages are employed to manage these fluctuations and match the power requirements of the PEM electrolyzer and storage units. A DC-DC converter is essential to stabilize the voltage and current levels, thereby improving power transfer efficiency and safeguarding sensitive components. These converters also integrate MPPT algorithms, which continuously adjust the operating point of the PV array to extract the highest possible power under changing solar conditions [22,23].
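For reference, the perturb and observe (P&O) technique used as the conventional baseline in this study can be sketched as follows. This is a textbook-style update with a fixed voltage step, not the implementation evaluated here; the step size and the toy power curve are hypothetical.

```python
def perturb_and_observe(v, p, v_prev, p_prev, step=0.5):
    """Return the next PV voltage reference from two consecutive samples.

    If the last perturbation increased power, keep moving in the same
    direction; otherwise reverse it.
    """
    dp = p - p_prev
    dv = v - v_prev
    direction = 1.0 if dp * dv > 0 else -1.0
    return v + direction * step

def toy_pv_power(v):
    """Hypothetical concave P-V curve with its maximum at 30 V."""
    return 900.0 - (v - 30.0) ** 2

def track(v_start=20.0, n_iter=100, step=0.5):
    """Run P&O on the toy curve and return the final voltage reference."""
    v_prev, v = v_start, v_start + step
    for _ in range(n_iter):
        v_next = perturb_and_observe(v, toy_pv_power(v),
                                     v_prev, toy_pv_power(v_prev), step)
        v_prev, v = v, v_next
    return v
```

Because the step is fixed, the operating point keeps oscillating around the maximum power point instead of settling on it, which is one source of the steady-state tracking error associated with P&O.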

Since electrolyzers require a stable and continuous DC power input to operate efficiently, the regulated power from the PV array is first used to meet the energy demand of all connected DC loads. When solar availability is high, the PV system also charges the batteries until they reach their maximum state of charge (SoC). Once the load demand is satisfied and the storage system is fully charged, surplus energy is directed to the PEM electrolyzer to produce green hydrogen. This approach prioritizes supplying critical loads and maximizing energy storage before utilizing excess renewable energy for hydrogen generation. The battery system serves as a buffer, storing energy during high-generation periods and supplying power when solar input is insufficient, thereby maintaining energy balance and supporting continuous hydrogen production when direct PV power is not available.

2.1. PV Conversion System

The PV system is designed to harness solar energy by utilizing PV modules that generate DC electricity when exposed to sunlight. These modules are connected to a DC-DC converter, which plays a crucial role in conditioning the output power to meet the requirements of various DC loads. As illustrated in Figure 2, the converter adjusts the voltage and current from the PV array through a duty cycle control $u(t)$, which modulates the switching of the MOSFET, ensuring efficient power delivery to the connected loads. This setup allows the PV generator to supply power to different components, including batteries and a PEM electrolyzer, while maintaining optimal operation.

The nonlinear state-space model governs the PV conversion system [35]:

$$\dot{x}(t) = f(x(t))\, x(t) + g(x(t))\, u(t), \quad t \geq 0, \quad x_{0} \in \mathbb{R}^{3}, \tag{1}$$

where $x(t) = [v_{pv}(t), i(t), v(t)]' \in \mathbb{R}^{3}$ is the state vector representing the system states, $v_{pv}(t)$ is the voltage at the PV terminals, $i(t)$ is the inductor current, and $v(t)$ is the output capacitor voltage of the DC-DC converter. The system matrices are given as follows:

$$f(x(t)) = \begin{bmatrix} \frac{1}{C_{pv}} \frac{i_{pv}}{v_{pv}} & -\frac{1}{C_{pv}} & 0 \\ \frac{1}{L} & -\frac{R_{L} + R_{D} + \frac{R_{C} R_{load}}{R_{C} + R_{load}}}{L} & -\frac{R_{load}}{L (R_{C} + R_{load})} \\ 0 & \frac{R_{load}}{C (R_{C} + R_{load})} & -\frac{1}{C (R_{C} + R_{load})} \end{bmatrix},$$

$$g(x(t)) = \begin{bmatrix} 0 \\ \frac{-R_{M} + R_{D} + \frac{R_{C} R_{load}}{R_{C} + R_{load}}}{L}\, i(t) + \frac{R_{load}}{L (R_{C} + R_{load})}\, v(t) \\ -\frac{R_{load}}{C (R_{C} + R_{load})}\, i(t) \end{bmatrix}.$$
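The model in Equation (1) can be evaluated numerically as sketched below. The component values are hypothetical, and the reading of $R_{L}$, $R_{M}$, $R_{D}$, and $R_{C}$ as the inductor, MOSFET, diode, and capacitor series resistances is an assumption, since the text does not define them. The PV current `i_pv` must be supplied by a separate PV characteristic, which is not modeled here.

```python
from dataclasses import dataclass

@dataclass
class ConverterParams:
    # Hypothetical component values; not taken from the study.
    C_pv: float = 1e-3    # PV-side capacitance (F)
    L: float = 1e-3       # inductance (H)
    C: float = 1e-3       # output capacitance (F)
    R_L: float = 0.05     # assumed: inductor series resistance (ohm)
    R_M: float = 0.02     # assumed: MOSFET on-resistance (ohm)
    R_D: float = 0.03     # assumed: diode resistance (ohm)
    R_C: float = 0.01     # assumed: capacitor series resistance (ohm)
    R_load: float = 20.0  # load resistance (ohm)

def state_derivative(x, u, i_pv, p):
    """Evaluate dx/dt = f(x) x + g(x) u for x = [v_pv, i, v]."""
    v_pv, i, v = x
    r_par = p.R_C * p.R_load / (p.R_C + p.R_load)  # R_C in parallel with R_load
    k = p.R_load / (p.R_C + p.R_load)
    # f(x) x: the first row simplifies because (i_pv / v_pv) * v_pv = i_pv.
    fx = [
        (i_pv - i) / p.C_pv,
        (v_pv - (p.R_L + p.R_D + r_par) * i - k * v) / p.L,
        (k * i - v / (p.R_C + p.R_load)) / p.C,
    ]
    # g(x): the part of the dynamics multiplied by the duty cycle u.
    gx = [
        0.0,
        ((-p.R_M + p.R_D + r_par) * i + k * v) / p.L,
        -k * i / p.C,
    ]
    return [a + b * u for a, b in zip(fx, gx)]

def euler_step(x, u, i_pv, p, dt=1e-6):
    """Advance the state by one forward-Euler step of length dt (s)."""
    dx = state_derivative(x, u, i_pv, p)
    return [xi + dt * dxi for xi, dxi in zip(x, dx)]
```

A quick sanity check on the structure: with $u = 1$ the inductor equation reduces to $L\,\dot{i} = v_{pv} - (R_{L} + R_{M})\, i$ and the output capacitor simply discharges into the load, which is the expected switch-on behavior of a boost-type stage.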

2.2. Battery Energy Storage System

Battery energy storage systems (BESSs) are a cornerstone of energy management strategies in standalone PV microgrids. They function as dynamic buffers, capturing surplus electrical energy for deferred use, thereby ensuring a continuous supply during periods of low solar generation. The operational behavior of BESSs is governed by electrochemical mechanisms within the cells, which define the dynamics of charging and discharging, as well as the overall efficiency and service life of the storage unit [36].

Energy exchange between the battery and the rest of the microgrid is handled by a power conversion interface, typically a bidirectional DC-DC converter. This system is tasked with managing energy flows in both directions, absorbing energy during surplus generation and supplying it during deficits, while minimizing conversion losses and enhancing the overall reliability of the system. Control strategies embedded within this converter also serve to monitor the state of charge (SoC), prevent battery degradation from overcharging or deep discharging, and maintain load balance [37].

The SoC is a critical indicator of the battery’s remaining usable capacity and is commonly tracked using a Coulomb counting technique. This method estimates the SoC by integrating the current entering or leaving the battery over time, accounting for the direction of flow. Mathematically, the SoC at time $t$ is calculated as follows:

$$SoC(t) = SoC(t_{0}) + \frac{Q(t)}{C_{bat}}, \tag{2}$$

where $C_{bat}$ denotes the nominal capacity of the battery, and $Q(t)$ is the net charge accumulated or discharged between the initial time $t_{0}$ and time $t$, given by the following:

$$Q(t) = \int_{t_{0}}^{t} i_{bat}(\tau)\, d\tau, \tag{3}$$

with $i_{bat}$ representing the instantaneous current. A positive value indicates charging, while a negative value corresponds to discharging.
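Equations (2) and (3) can be discretized for sampled current measurements as in the sketch below. It assumes a fixed sampling interval, a capacity given in ampere-hours, and an SoC expressed as a fraction clamped to [0, 1]; these conventions and the function name are illustrative choices rather than details from the study.

```python
def coulomb_count(soc0, currents_a, dt_s, capacity_ah):
    """Track SoC (fraction, 0..1) from sampled battery current.

    currents_a: current samples in A (positive = charging).
    dt_s: sampling interval in seconds.
    capacity_ah: nominal capacity C_bat in ampere-hours.
    """
    capacity_c = capacity_ah * 3600.0  # convert Ah to coulombs
    soc = soc0
    trace = [soc]
    for i_bat in currents_a:
        # Rectangular approximation of the integral in Equation (3).
        soc += i_bat * dt_s / capacity_c
        # Clamping is an added safeguard, not part of Equation (2).
        soc = min(1.0, max(0.0, soc))
        trace.append(soc)
    return trace
```

For example, charging a 100 Ah battery at 10 A for one hour adds 10 Ah, so the SoC rises by 0.1. In practice, Coulomb counting drifts with current-sensor offset, so it is usually combined with periodic recalibration.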

In a standalone PV system, the bidirectional converter operates in two main configurations, as illustrated in Figure 3:

Buck mode (charging phase): The converter steps down the higher DC bus voltage $V_{bus}$ to align with the lower battery voltage $V_{bat}$, facilitating safe energy storage.
Boost mode (discharging phase): When the system needs energy, the converter increases the battery voltage $V_{bat}$ to match the bus voltage $V_{bus}$, ensuring adequate power delivery to the loads and other subsystems.

This flexible operation allows the BESS to actively contribute to energy autonomy, load regulation, and stability in the standalone PV microgrid architecture.
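The two operating modes can be related to the converter duty cycle using the standard ideal relations for lossless converters in continuous conduction. The sketch below applies those textbook formulas; it is not derived from the converter model of this study, and the voltage values in the usage note are hypothetical.

```python
def buck_duty(v_bus, v_bat):
    """Ideal buck duty cycle (charging): V_bat = D * V_bus."""
    if not 0.0 < v_bat <= v_bus:
        raise ValueError("buck mode requires 0 < v_bat <= v_bus")
    return v_bat / v_bus

def boost_duty(v_bus, v_bat):
    """Ideal boost duty cycle (discharging): V_bus = V_bat / (1 - D)."""
    if not 0.0 < v_bat <= v_bus:
        raise ValueError("boost mode requires 0 < v_bat <= v_bus")
    return 1.0 - v_bat / v_bus
```

With a 48 V bus and a 12 V battery, the ideal duty cycles are 0.25 in buck mode and 0.75 in boost mode. Real converters deviate from these values because of conduction and switching losses.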

2.3. PEM Electrolyzer System

The PEM electrolyzer is a critical subsystem in the standalone PV-based microgrid, enabling the conversion of electrical energy into hydrogen via electrolysis. The electrolyzer dissociates water molecules into hydrogen and oxygen gases when powered by DC electricity. This process is highly sensitive to the input voltage and current, which the upstream power electronics regulate to ensure safe and efficient operation [38].

The dynamic behavior of the PEM electrolyzer can be described by the relationship between the input current and the produced hydrogen flow rate. The molar flow of hydrogen $\dot{n}_{H_{2}}(t)$ is directly proportional to the electrolyzer current $i_{el}(t)$, and can be expressed as follows:

$$\dot{n}_{H_{2}}(t) = \frac{\eta_{F} \cdot i_{el}(t)}{2F}, \tag{4}$$

where $\eta_{F}$ is the Faraday efficiency, and $F$ is the Faraday constant ($F \approx 96{,}485$ C/mol). The Faraday efficiency accounts for losses due to side reactions and non-idealities in the electrochemical process.
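Equation (4) translates directly into a short calculation. In the sketch below, the default Faraday efficiency of 0.99 is a hypothetical value, and the conversion to grams per hour uses the molar mass of hydrogen (about 2.016 g/mol).

```python
FARADAY_C_PER_MOL = 96485.0  # Faraday constant, as quoted in the text
M_H2_G_PER_MOL = 2.016       # molar mass of H2

def h2_molar_flow(i_el_a, eta_f=0.99):
    """Hydrogen molar flow (mol/s) from Equation (4).

    i_el_a: electrolyzer current in A.
    eta_f: Faraday efficiency (the default is a hypothetical value).
    """
    return eta_f * i_el_a / (2.0 * FARADAY_C_PER_MOL)

def h2_mass_rate_g_per_h(i_el_a, eta_f=0.99):
    """Hydrogen production rate in grams per hour."""
    return h2_molar_flow(i_el_a, eta_f) * M_H2_G_PER_MOL * 3600.0
```

The factor of 2 in the denominator reflects the two electrons transferred per hydrogen molecule, so doubling the current doubles the production rate at a fixed Faraday efficiency.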

The terminal voltage of the PEM electrolyzer $v_{el}(t)$ can be modeled as the sum of the thermodynamic voltage $E_{rev}$ and the overpotentials resulting from activation, ohmic, and concentration losses. A simplified dynamic expression is as follows:

$$v_{el}(t) = E_{rev} + v_{act}(t) + v_{ohmic}(t) + v_{con}(t), \tag{5}$$

where $E_{rev}$ is the reversible voltage dependent on temperature and pressure, $v_{act}(t)$ is the activation overvoltage caused by reaction kinetics, $v_{ohmic}(t)$ accounts for voltage losses due to membrane and electrode resistance, and $v_{con}(t)$ is the concentration overvoltage arising at high current densities.

2.4. Control Objectives for Energy Management

Energy management in autonomous PV systems integrating a PEM electrolyzer and operating under variable and uncertain load profiles requires a robust and forward-looking control strategy to ensure optimal performance and system sustainability. In this context, stochastic energy management (SEM) offers a suitable framework for real-time decision-making under uncertainty [39,40,41], enabling the optimization of power flows while preserving system reliability and operational efficiency. The primary control objective is to maximize the utilization of PV-generated power, prioritizing local consumption, without the need for external energy sources. A secondary, but equally critical, objective is to maintain the battery’s SoC within predefined safety and performance thresholds to prevent degradation and ensure energy availability during low irradiance periods.

To meet these goals, the control strategy must account for the inherent stochasticity in solar irradiance and load demand, as well as the nonlinear dynamics of system components, such as the PV generator, battery energy storage system, and PEM electrolyzer. The SEM algorithm incorporates models of these uncertainties, often using an MDP to forecast possible future states of the system, including energy generation and consumption scenarios. These forecasts inform control decisions, enabling proactive and adaptive power dispatch.

Operational constraints are integrated into the control model, including the maximum and minimum SoC limits, the efficiency maps of the PV panels, and the safe operating range of the electrolyzer, particularly its allowable input current and voltage ranges. The DC-DC converters interfacing the PV array with both the battery and electrolyzer are modulated accordingly, ensuring that power flows are dynamically regulated in real-time. This prevents both overcharging and deep discharging of the battery, and maintains the electrolyzer within its high-efficiency operating zone.

Through this SEM approach, the system ensures optimal energy sharing between the battery and the electrolyzer, while adapting to real-time variations in generation and load. As a result, the PV-microgrid operates autonomously and efficiently, producing green hydrogen when surplus energy is available, maintaining battery health, and ensuring a reliable and continuous energy supply even in the face of unpredictable demand patterns.

The following section will demonstrate how the proposed SEM-MDP framework achieves these objectives.

3. MDP-Driven Approach to Optimizing Hydrogen Production

This section presents a stochastic energy management architecture designed for an off-grid DC PV-PEM microgrid integrating multiple loads, battery storage, and a PEM electrolyzer. The system configuration, shown in Figure 4, illustrates the proposed energy management strategy aimed at enhancing hydrogen production by intelligently managing surplus solar energy. This framework represents a novel contribution, as it integrates MDP-based forecasting directly into the energy management loop for real-time, adaptive decision-making under uncertainty.

The proposed EMS orchestrates the operation of the microgrid by dynamically supervising power flows among the PV generator, load demands, battery storage, and the electrolyzer. Central to this EMS is the MDP-based stochastic controller, which anticipates future variations in load consumption, enabling proactive and optimal energy allocation.

A power management controller (PMC) receives probabilistic forecasts from the MDP and determines the optimal power dispatch strategy. It continuously assesses the instantaneous load demand, battery SoC, and PV generation. Under normal conditions, priority is given to supplying local loads and charging the battery. However, when the load is fully met and the battery reaches its maximum SoC, any remaining surplus PV energy is automatically redirected to the PEM electrolyzer.

This approach ensures that no solar energy is wasted, as excess power is effectively transformed into green hydrogen. In doing so, the system not only enhances hydrogen production but also avoids unnecessary battery cycling, thereby preserving battery lifespan and maximizing overall system efficiency and sustainability, even under fluctuating environmental and unpredictable load conditions.

3.1. MDP-Driven Load Consumption Forecasting

The overall power consumption in a standalone PV microgrid that supplies diverse loads often exhibits random and unpredictable patterns. This variability arises from factors such as user behavior, intermittent appliance usage, and the non-uniform power profiles of individual loads. For instance, some appliances may operate cyclically or have usage peaks at specific times (e.g., day vs. night), leading to significant temporal fluctuations in total load demand. Moreover, when several loads with distinct power profiles are involved, their combined consumption behavior becomes highly stochastic, complicating the task of real-time energy allocation. These fluctuations can undermine the stability, efficiency, and reliability of the microgrid if not properly anticipated and managed. Designing an EMS capable of predicting and adapting to these changes is, therefore, a key requirement for a sustainable and resilient PV-PEM microgrid.

To address these challenges, this work introduces a novel stochastic control strategy based on an MDP, which constitutes a key contribution of this study. Unlike conventional methods, the MDP-based approach enables real-time modeling of random fluctuations in load demand and facilitates probabilistic forecasting of power consumption across multiple users. This probabilistic load model becomes the cornerstone of the EMS, allowing for anticipatory decisions that optimize energy dispatch across the PV generator, battery storage, and hydrogen production unit.

The MDP captures the discrete and dynamic nature of load states, where each load may be either active or inactive at any moment. This abstraction leads to a finite but possibly large number of system states, each corresponding to a unique combination of load activities and associated power demand. The stochastic behavior is then modeled as a continuous-time Markov chain, denoted as $\{θ_{t}, t \geq 0\}$, where transitions between states are governed by transition probabilities reflecting the likelihood of switching from one load configuration to another.
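As a rough illustration of such a chain, the sketch below samples one load trajectory by drawing exponential holding times and then jumping according to the relative rates. The three-state generator `Q_demo`, the load powers `P_load_demo`, and the function name `simulate_ctmc` are hypothetical placeholders chosen for this example, not values or code from the study.

```python
import numpy as np

def simulate_ctmc(Q, theta0, t_end, rng):
    """Sample one path of a continuous-time Markov chain with generator Q.

    Returns the jump times and the state entered at each time (0-based indices).
    """
    Q = np.asarray(Q, dtype=float)
    times, states = [0.0], [theta0]
    t, state = 0.0, theta0
    while True:
        exit_rate = -Q[state, state]
        if exit_rate <= 0.0:                      # absorbing state: no further jumps
            break
        t += rng.exponential(1.0 / exit_rate)     # exponential holding time
        if t >= t_end:
            break
        jump_probs = Q[state].copy()
        jump_probs[state] = 0.0
        jump_probs /= exit_rate                   # embedded jump-chain probabilities
        state = int(rng.choice(len(jump_probs), p=jump_probs))
        times.append(t)
        states.append(state)
    return np.array(times), np.array(states)

# Hypothetical 3-state generator (rates per hour); each row sums to zero.
Q_demo = np.array([[-3.0, 2.0, 1.0],
                   [1.0, -4.0, 3.0],
                   [2.0, 2.0, -4.0]])
P_load_demo = np.array([20.0, 45.0, 80.0])        # hypothetical load power per state, W

rng = np.random.default_rng(0)
times, states = simulate_ctmc(Q_demo, theta0=0, t_end=2.0, rng=rng)
load_path = P_load_demo[states]                   # piecewise-constant load trajectory
```

Each entry of `load_path` is the load power held from the corresponding jump time until the next one, which is the piecewise-constant profile that a forecasting layer of this kind would have to track.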

By incorporating this stochastic load model into the EMS, the system is able to predict future demand trajectories and optimize energy use accordingly. For instance, during low-demand intervals or when the battery reaches its maximum SoC, the EMS proactively redirects surplus PV energy to the PEM electrolyzer, thereby producing green hydrogen instead of curtailing generation or over-cycling the battery. This predictive logic improves system efficiency, prolongs battery lifespan, and ensures maximum utilization of renewable energy resources.

The MDP framework applied in this study is structured as follows:

State space definition: The state space includes all possible combinations of the on/off statuses of $n$ loads, leading to $2^{n}$ distinct states $θ_{t}$, each associated with a deterministic load power $P_{load}(θ_{t})$. These states capture the stochastic load profile dynamics over time [35,42,43].
Transition probabilities: Given a state space $S = \{1, \dots, r\}$, the probability of transitioning from state $i$ to state $j$ after a short time $ℏ$ is as follows:

$$\Pr[θ_{t + ℏ} = j \mid θ_{t} = i] = \begin{cases} π_{ij} ℏ + o(ℏ), & \text{if } i \neq j, \\ 1 + π_{ii} ℏ + o(ℏ), & \text{if } i = j, \end{cases}$$

(6)

where $π_{ij} \geq 0$ for $i \neq j$ are the transition rates, and $π_{ii} = -\sum_{j \neq i} π_{ij}$ ensures that the transition probabilities out of each state sum to one.
Real-time updating: As new measurements are acquired, the transition matrix is updated to reflect observed consumption patterns. For example, if a load becomes more active than predicted, the model adapts by increasing the transition rates toward higher-power states, thereby enhancing future prediction accuracy.
Continuous forecasting loop: The algorithm operates in a loop, continuously updating state probabilities and outputting a real-time forecast of $P_{load}(θ_{t})$. This forecast feeds directly into the EMS, informing the power dispatch decisions of the PV system, battery, and PEM electrolyzer.
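The four elements above can be sketched in a few lines. The example below enumerates the $2^{n}$ on/off states for $n = 3$ loads, builds the first-order transition matrix $I + Qℏ$ implied by Equation (6), and propagates the state distribution one step to obtain an expected load. The rated powers, the randomly generated rates, the step `h`, and all function names are assumptions made for illustration only.

```python
import itertools
import numpy as np

# Hypothetical rated powers of n = 3 switchable loads, in W.
rated_power = np.array([15.0, 30.0, 60.0])

# Each state is one on/off combination, so there are 2**n states.
states = list(itertools.product([0, 1], repeat=len(rated_power)))
P_load = np.array([np.dot(s, rated_power) for s in states])

def one_step_matrix(Q, h):
    """First-order approximation of Eq. (6): P(h) ~ I + Q*h, valid for small h."""
    P = np.eye(Q.shape[0]) + Q * h
    if np.any(P < 0.0):
        raise ValueError("h is too large for the first-order approximation")
    return P

def random_generator(r, rng):
    """Arbitrary valid generator: pi_ij >= 0 off the diagonal, rows sum to zero."""
    Q = rng.uniform(0.0, 1.0, size=(r, r))
    np.fill_diagonal(Q, 0.0)
    np.fill_diagonal(Q, -Q.sum(axis=1))
    return Q

rng = np.random.default_rng(1)
Q = random_generator(len(states), rng)   # placeholder rates, not the paper's values
P = one_step_matrix(Q, h=0.01)

p_now = np.zeros(len(states))
p_now[0] = 1.0                           # chain currently in state 0 (all loads off)
p_next = p_now @ P                       # state distribution after one step
expected_load = p_next @ P_load          # probabilistic forecast of the load power
```

The guard in `one_step_matrix` reflects the fact that the linear approximation only yields valid probabilities when $ℏ$ is small relative to the largest exit rate; for longer horizons the matrix exponential of $Qℏ$ would be the exact alternative.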

By leveraging this MDP-based stochastic modeling approach, the EMS gains the ability to make optimal decisions under uncertainty, ensuring that PV generation is dynamically matched to actual and forecasted load demands. The integration of this mechanism into the microgrid control strategy enables a predictive and robust energy management system. Compared with traditional MPPT and rule-based energy scheduling, this strategy adds probabilistic foresight, which improves operational efficiency and allows excess PV energy to be used for sustainable hydrogen production when conventional storage and consumption pathways are saturated.

3.2. PV Power Optimization Under Unpredictable Load Consumption

To ensure MPPT under uncertain and time-varying load demands, a robust control framework based on the $H_{\infty}$ technique is proposed as an alternative to model optimization control. This approach exploits real-time load consumption forecasts provided by an MDP, capturing the stochastic behavior of load consumption.

Unpredictable load demand fluctuations significantly affect the PV array’s output power. This relationship can be characterized through a nonlinear dependence on the load state $θ_{t}$ and the converter duty cycle $u(t)$, where the ratio between PV and load currents modulates the efficiency of effective power transfer. More formally, the output power of the PV generator under stochastic load conditions is expressed as follows:

$$P_{pv} = P_{load}(θ_{t}) \left(\frac{i_{pv}}{i_{load}}\right)^{2} (1 - u(t))^{2}.$$

(7)

To design a control scheme capable of responding to such uncertainties, we adopt a robust stochastic optimization framework that directly incorporates the PV system’s nonlinear electrical characteristics. According to established PV modeling approaches [35,44], the instantaneous power extracted from the PV panel can also be described as follows:

$$\begin{aligned} P_{pv} &= v_{pv} i_{pv} \\ &= n_{p} I_{ph} v_{pv} - n_{p} I_{rs} v_{pv} \left(\exp\left(\frac{k_{pv} v_{pv}}{n_{s}}\right) - 1\right), \end{aligned}$$

(8)

where $I_{ph}$ is the photocurrent, $I_{rs}$ is the reverse saturation current, and $n_{p}$ and $n_{s}$ are the number of PV cells in parallel and series, respectively. The parameter $k_{pv} = \frac{q}{η k T}$ represents the inverse thermal voltage of the cell, encapsulating the temperature dependence of the PV diode equation.

These expressions are integrated into the $H_{\infty}$-based MPPT controller design, ensuring that the PV array operates near its maximum power point despite variations in load and environmental conditions. The combined power equations and stochastic control formulation enable the system to robustly track optimal power levels, maintaining efficient energy delivery in dynamic and uncertain scenarios.

The regulated output is taken as the slope of the PV power–voltage characteristic, which vanishes at the MPP. Differentiating Equation (8) with respect to $v_{pv}$ gives the following expression:

$$\begin{aligned} y(t) = \frac{d P_{pv}}{d v_{pv}} &= i_{pv} - \frac{n_{p} k_{pv}}{n_{s}} I_{rs} v_{pv} \exp\left(\frac{k_{pv} v_{pv}}{n_{s}}\right) \\ &= \begin{bmatrix} \dfrac{i_{pv}}{v_{pv}} - \dfrac{n_{p} k_{pv}}{n_{s}} I_{rs} \exp\left(\dfrac{k_{pv} v_{pv}}{n_{s}}\right) & 0 & 0 \end{bmatrix} x(t) \\ &= C x(t). \end{aligned}$$

(9)
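To make Equations (8) and (9) concrete, the sketch below evaluates the PV power and its slope and locates the voltage at which the slope crosses zero by bisection. The parameter values in `PARAMS` are illustrative assumptions (they are not the values of Table A1), and the function names are hypothetical.

```python
import math

Q_E = 1.602176634e-19   # elementary charge, C
K_B = 1.380649e-23      # Boltzmann constant, J/K

# Illustrative parameters only (assumptions, not the values of Table A1).
PARAMS = {
    "n_p": 1,            # cells in parallel
    "n_s": 36,           # cells in series
    "i_ph": 4.8,         # photocurrent, A
    "i_rs": 1.5e-8,      # reverse saturation current, A
    "k_pv": Q_E / (1.2 * K_B * 298.15),  # q / (eta k T) with eta = 1.2, T = 298.15 K
}

def pv_current(v, n_p, n_s, i_ph, i_rs, k_pv):
    """PV current implied by Eq. (8), i.e. P_pv / v_pv."""
    return n_p * i_ph - n_p * i_rs * (math.exp(k_pv * v / n_s) - 1.0)

def pv_power(v, **p):
    """PV power of Eq. (8)."""
    return v * pv_current(v, **p)

def pv_power_slope(v, n_p, n_s, i_ph, i_rs, k_pv):
    """dP_pv/dv_pv, first line of Eq. (9)."""
    i_pv = pv_current(v, n_p, n_s, i_ph, i_rs, k_pv)
    return i_pv - (n_p * k_pv / n_s) * i_rs * v * math.exp(k_pv * v / n_s)

def mpp_voltage(p, tol=1e-9):
    """Bisection on the slope between 0 V and the open-circuit voltage."""
    lo = 0.0
    hi = p["n_s"] / p["k_pv"] * math.log(p["i_ph"] / p["i_rs"] + 1.0)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if pv_power_slope(mid, **p) > 0.0:
            lo = mid     # still on the rising side of the P-V curve
        else:
            hi = mid
    return 0.5 * (lo + hi)

v_mpp = mpp_voltage(PARAMS)
p_mpp = pv_power(v_mpp, **PARAMS)
```

Bisection is used here only because the slope is strictly decreasing in $v_{pv}$ for this model, so the zero crossing is unique; it stands in for, and is not, the $H_{\infty}$ tracking law developed in this section.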

The stochastic $H_{\infty}$ controller is designed to minimize the worst-case impact of unpredictable disturbances, such as random load fluctuations, on the ability of the PV system to operate at its maximum power point. By combining (1) and (9), the dynamics of the controlled system are modeled as a linear time-varying process influenced by the state of the MDP $θ_{t}$ and are expressed as follows:

$$\begin{aligned} \dot{\bar{x}}(t) &= \bar{f}(θ_{t}) \bar{x}(t) + \bar{g}(θ_{t}) u(t), \\ y(t) &= \bar{C} \bar{x}(t), \end{aligned}$$

(10)

where

$$\bar{f}(θ_{t}) = \begin{bmatrix} f(θ_{t}) & 0 \\ C & 0 \end{bmatrix}, \quad \bar{g}(θ_{t}) = \begin{bmatrix} g(θ_{t}) \\ 0 \end{bmatrix}, \quad \bar{C} = \begin{bmatrix} C & 0 \end{bmatrix}.$$

Here, $\bar{x}(t) = [x(t) \;\; e(t)]^{\top}$ is the augmented system state, and $u(t)$ is the control input (duty cycle of the DC-DC converter). The desired optimal operating trajectory $x_{d}(t)$ corresponds to the conditions under which the PV output reaches the MPP. To regulate tracking, the error signal is defined as follows:

$$e(t) = x(t) - x_{d}(t),$$

(11)

with the goal of ensuring $\lim_{t \to \infty} e(t) = 0$, i.e., asymptotic convergence to the MPP. The robust feedback control law is formulated as follows:

$$u(t) = K_{1}(θ_{t}) x(t) + K_{2}(θ_{t}) e(t),$$

(12)

where $K_{1}(θ_{t})$ and $K_{2}(θ_{t})$ are the state-feedback and error-feedback gain matrices, adapted to each MDP state. The regulated output is defined as $y(t) = C x(t)$. The $H_{\infty}$ control problem seeks to minimize the worst-case energy gain from the disturbance associated with the stochastic load demand $θ_{t}$ to the performance output $y(t)$, thereby ensuring robust stability and optimal tracking performance.

This is achieved by minimizing the following cost functional:

$$J_{\infty} = E\left[\int_{0}^{\infty} \left(e^{\top}(t) e(t) - δ^{2} x_{d}^{\top}(t) x_{d}(t)\right) dt\right],$$

(13)

where $δ > 0$ is a predefined robustness margin. The design guarantees that tracking error energy remains bounded and attenuated despite random demand deviations. Moreover, physical constraints are imposed on the converter’s duty cycle and the battery’s state-of-charge to ensure operational safety:

$$u_{min} \leq u(t) \leq u_{max}, \qquad SoC_{min} \leq SoC(t) \leq SoC_{max}.$$

(14)
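A minimal sketch of how Equations (11) to (14) fit together at run time is given below: the gain pair is selected by the current MDP state, the duty cycle is saturated to its admissible range, and one realization of the cost in Equation (13) is approximated by a Riemann sum. The gain values, duty-cycle bounds, state dimension, and function names are all hypothetical; the actual gains would come from the $H_{\infty}$ synthesis, which is not reproduced here.

```python
import numpy as np

def control_input(theta, x, x_d, K1, K2, u_min=0.05, u_max=0.95):
    """Mode-dependent feedback of Eq. (12), clipped to the bound of Eq. (14)."""
    e = x - x_d                                    # tracking error, Eq. (11)
    u = float(K1[theta] @ x + K2[theta] @ e)
    return min(max(u, u_min), u_max)

def h_inf_cost(e_traj, xd_traj, dt, delta):
    """Riemann-sum approximation of one realization of Eq. (13).

    e_traj and xd_traj have shape (steps, state_dim). Averaging this value over
    many simulated load paths would estimate the expectation.
    """
    e_energy = np.sum(e_traj ** 2) * dt
    ref_energy = np.sum(xd_traj ** 2) * dt
    return e_energy - delta ** 2 * ref_energy

# Hypothetical gains for two MDP states and a three-dimensional state vector.
K1 = {0: np.array([0.01, 0.0, 0.005]), 1: np.array([0.02, 0.0, 0.005])}
K2 = {0: np.array([-0.2, -0.1, 0.0]), 1: np.array([-0.3, -0.1, 0.0])}

x = np.array([17.0, 4.0, 24.0])      # hypothetical measured state
x_d = np.array([17.5, 4.2, 24.0])    # hypothetical MPP reference
u = control_input(theta=0, x=x, x_d=x_d, K1=K1, K2=K2)
```

A non-positive value of `h_inf_cost` for a given path indicates that the tracking-error energy stayed below $δ^{2}$ times the reference energy on that path, which is the attenuation property the cost functional encodes.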

By embedding the $H_{\infty}$ controller within a stochastic energy management framework based on MDP forecasts, the PV microgrid gains the capability to robustly and adaptively track its optimal operating point. This strategy enhances energy capture, improves dynamic response, and maintains overall system stability, even under high variability and uncertainty. It is particularly effective in autonomous settings where environmental and load unpredictability are prominent.

3.3. Stochastic Power Flow Management

Algorithm 1 implements a predictive and adaptive control strategy within the PV microgrid to ensure optimal energy utilization in real time. At each control interval, the algorithm begins by forecasting future load consumption using an MDP, which captures the stochastic behavior of user demand through probabilistic transitions between discrete load states. Concurrently, the PV subsystem is governed by a robust $H_{\infty}$-based MPPT controller that computes the optimal duty cycle required to track the maximum power point under environmental uncertainties.

This control loop minimizes the tracking error while enhancing system robustness against unpredictable load changes. The predicted load demand and generated PV power are then used to evaluate the power balance and determine the appropriate operational mode. If the PV generation exceeds the load and the battery is not fully charged, the surplus energy is directed to charge the battery.

However, when the SoC reaches its maximum, any excess energy is intelligently routed to a hydrogen production unit via an electrolyzer, thereby preventing energy curtailment and contributing to long-term energy storage. Conversely, if the PV power is insufficient to meet demand, the battery discharges according to its SoC level to supply the load partially or fully.

The real-time execution of the proposed MDP-based energy management algorithm proceeds as follows:

1. Stochastic load forecasting: At each time step $t$, the MDP predicts the next load level $θ_{t + ℏ}$ based on the current state $θ_{t}$ and the transition rate matrix $Q$. This provides a probabilistic estimation of $P_{load}(t + ℏ)$ without relying on historical datasets. The transition matrix is constructed offline and can be updated periodically using recent operational data.
2. Measurement and state update: Real-time data such as irradiance $G(t)$, temperature $T(t)$, the current system state $x(t)$, and battery SoC are measured. These serve as inputs for evaluating the PV model and updating the system state prediction for $t + ℏ$.
3. Robust power optimization: The controller solves the $H_{\infty}$ optimization problem by minimizing the cost functional $J_{\infty}$, yielding the control input $u(t)$. The robustness parameter $δ$ is tuned (e.g., $δ = 0.006$) to achieve optimal performance. The control sampling period is set to $ℏ = 60$ s, which is sufficient for capturing solar and load dynamics without inducing excessive computational load.
4. Control execution: The control input $u(t)$ is applied to adjust the PV operating point, ensuring maximum power extraction under uncertainties while driving the power tracking error $e(t) \to 0$.
5. Supervisory energy dispatch: Based on the predicted surplus or deficit between $P_{pv}$ and $P_{load}$, and the current SoC level, the controller activates one of five operational modes: charging the battery (Mode 1), routing power to hydrogen production (Mode 2), discharging the battery to supply full or partial load (Modes 3 and 4), or relying on direct PV supply if the battery is depleted (Mode 5). This ensures safe SoC management and effective hydrogen utilization without violating system constraints.

Algorithm 1 Stochastic $H_{\infty}$-based energy management for PV microgrid with hydrogen production

1: Identify discrete load levels $\{P_{load}(1), P_{load}(2), \dots, P_{load}(r)\}$
2: Define MDP states $S := \{1, 2, \dots, r\}$
3: Construct transition rate matrix $Q = [π_{ij}]$
4: Initialize load state $θ_{0}$, system state $x_{0}$, and battery SoC $SoC_{0}$
5: Set control time step $ℏ$
6: for $t = 0$ to $t_{f}$ do
7:     Load forecasting via MDP
8:     Update transition probabilities $\Pr[θ_{t + ℏ} = j \mid θ_{t} = i]$
9:     Identify next load state $θ_{t + ℏ}$ and get predicted $P_{load}(t + ℏ)$
10:     PV optimization using stochastic $H_{\infty}$ MPPT
11:     Measure current irradiance $G(t)$ and temperature $T(t)$
12:     Compute optimal reference $(V_{pv,opt}, I_{pv,opt})$
13:     Evaluate system dynamics via PV model $x(t + ℏ)$
14:     Minimize $H_{\infty}$ cost functional $J_{\infty}$ to derive $u(t)$
15:     Apply control $u(t)$ to maximize $P_{pv}$ while ensuring $\lim_{t \to \infty} e(t) = 0$
16:     Energy management decision
17:     Update $SoC(t + ℏ)$
18:     if $P_{pv} > P_{load}$ then
19:         $P_{excess} = P_{pv} - P_{load}$
20:         if $SoC < SoC_{max}$ then
21:             Mode 1: Charge the battery with $P_{excess}$
22:         else
23:             Mode 2: Route $P_{excess}$ to the hydrogen production system
24:             Set $P_{H_{2}} = P_{excess}$
25:         end if
26:     else
27:         $P_{deficit} = P_{load} - P_{pv}$
28:         if $SoC > SoC_{min}$ then
29:             if $P_{pv} = 0$ then
30:                 Mode 3: Discharge battery to supply total load
31:                 $P_{bat} = P_{load}$
32:             else
33:                 Mode 4: Discharge battery to meet deficit
34:                 $P_{bat} = P_{deficit}$
35:             end if
36:         else
37:             Mode 5: Battery off, PV supplies as much as possible
38:             $P_{bat} = 0$
39:         end if
40:     end if
41: end for
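The supervisory branch of Algorithm 1 (lines 18 to 40) translates almost directly into code. The sketch below is one possible rendering; the `Dispatch` container, the function name, and the sign convention for battery power in Mode 1 (negative means charging) are assumptions added for the example, since the algorithm does not assign $P_{bat}$ in that mode. The default SoC limits are the 10% and 90% bounds used in the simulations.

```python
from dataclasses import dataclass

@dataclass
class Dispatch:
    mode: int      # operating mode, 1 to 5
    p_bat: float   # battery power, W (positive = discharging, negative = charging)
    p_h2: float    # power routed to the electrolyzer, W

def dispatch(p_pv, p_load, soc, soc_min=0.10, soc_max=0.90):
    """Select the operating mode from PV power, load power, and battery SoC."""
    if p_pv > p_load:
        p_excess = p_pv - p_load
        if soc < soc_max:
            return Dispatch(mode=1, p_bat=-p_excess, p_h2=0.0)   # charge battery
        return Dispatch(mode=2, p_bat=0.0, p_h2=p_excess)        # produce hydrogen
    p_deficit = p_load - p_pv
    if soc > soc_min:
        if p_pv == 0:
            return Dispatch(mode=3, p_bat=p_load, p_h2=0.0)      # battery covers full load
        return Dispatch(mode=4, p_bat=p_deficit, p_h2=0.0)       # battery covers deficit
    return Dispatch(mode=5, p_bat=0.0, p_h2=0.0)                 # battery off, PV only

# Example call with hypothetical values: 60 W of PV, 40 W of load, full battery.
decision = dispatch(p_pv=60.0, p_load=40.0, soc=0.90)
```

Keeping the decision a pure function of three measured quantities makes each branch easy to test in isolation, independently of the forecasting and MPPT layers that supply its inputs.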

4. Simulation Results and Discussion

This section presents simulation results that highlight the performance of the proposed stochastic control-based EMS tailored for standalone PV–PEM microgrids dedicated to green hydrogen production. The proposed strategy effectively manages uncertainties from fluctuating solar irradiance and stochastic load demand by dynamically coordinating energy flows between PV generation, battery storage, and the PEM electrolyzer.

Figure 5 illustrates the real-time implementation architecture of the EMS for the standalone DC PV microgrid. The proposed EMS integrates three key functional blocks: (i) a stochastic $H_{\infty}$ controller for robust MPPT operation, (ii) an MDP-based load consumption forecaster, and (iii) a decision-making block that orchestrates power flow among the system components. This architecture ensures that the PV system’s output is continuously optimized under uncertainty by dynamically adjusting the duty cycle $u_{opt}$ of the DC–DC boost converter, allowing the PV generator to track its MPP despite unpredictable changes in irradiance and load conditions.

Unlike traditional fixed-rule EMS schemes, this architecture supports adaptive control and predictive decision-making. The EMS defines multiple operational modes governed by the SoC of the battery, the priority of the load, and the status of the hydrogen production. During high solar generation periods, if the battery reaches its SoC upper limit, the EMS triggers hydrogen production mode, redirecting excess PV power to the PEM electrolyzer via switch $S_{4}$. In contrast, during low irradiance periods, if the SoC remains above a defined threshold, the EMS enables battery discharge mode through switch $S_{3}$ to maintain load supply. Switches $S_{1}$, $S_{3}$, and $S_{4}$, respectively, represent battery charging, DC load supply, and electrolyzer activation. These transitions are coordinated by a central logic controller, as shown in Figure 5, which dynamically adjusts power routing based on real-time conditions and predictive inputs.

This clarified architecture underscores the novelty of the proposed EMS: it combines stochastic control, load forecasting, and intelligent mode switching in a unified framework to ensure reliable operation, high PV utilization, and optimized hydrogen production under varying environmental and load profiles.

The simulation is conducted on a representative off-grid PV–PEM system comprising a Siemens SP75 solar panel, a high-efficiency DC–DC boost converter, a lithium-ion battery for short-term storage, and a PEM electrolyzer for green hydrogen production. These components mirror real-world deployment scenarios and offer practical insight into EMS performance. Detailed specifications are provided in Table A1 in Appendix A.

An MDP represents the stochastic behavior of the load profiles within the microgrid. This approach allows for a realistic modeling of load consumption patterns, which are inherently uncertain and time-varying. The transitions between different discrete load states are governed by a transition rate matrix $Q$, which encodes the rates of switching from one consumption level to another over time. This matrix is constructed based on simulated load scenarios that mimic typical user demand patterns, generated through a combination of consumption statistics and random sampling to capture variability. Statistical analysis of these scenarios is then used to estimate the transition rates, ensuring that $Q$ reflects the temporal dynamics of real-world load fluctuations. By integrating this probabilistic framework, the energy management strategy can anticipate likely future load scenarios and proactively adjust control actions. The following rate matrix exemplifies the modeled transitions:

$$Q = \begin{bmatrix}
-150 & 12 & 22 & 48 & 8 & 40 & 8 & 12 \\
30 & -189 & 30 & 33 & 12 & 18 & 30 & 36 \\
48 & 44 & -268 & 48 & 8 & 24 & 40 & 56 \\
6 & 18 & 33 & -153 & 48 & 6 & 30 & 12 \\
64 & 32 & 32 & 40 & -340 & 40 & 52 & 80 \\
66 & 30 & 39 & 12 & 18 & -213 & 6 & 42 \\
7 & 24 & 9 & 17 & 13 & 27 & -122 & 25 \\
48 & 92 & 16 & 88 & 16 & 48 & 56 & -364
\end{bmatrix}.$$
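A few properties of this rate matrix can be checked numerically. The sketch below verifies that each row sums to zero (as required by the definition of $π_{ii}$ in Equation (6)), computes the stationary distribution by solving $πQ = 0$ with $\sum_i π_i = 1$, and derives the mean holding time and the jump probabilities of each state. The time unit of the rates is not stated in the text, so the holding times are expressed in the reciprocal of that unit; variable names are illustrative.

```python
import numpy as np

Q = np.array([
    [-150, 12, 22, 48, 8, 40, 8, 12],
    [30, -189, 30, 33, 12, 18, 30, 36],
    [48, 44, -268, 48, 8, 24, 40, 56],
    [6, 18, 33, -153, 48, 6, 30, 12],
    [64, 32, 32, 40, -340, 40, 52, 80],
    [66, 30, 39, 12, 18, -213, 6, 42],
    [7, 24, 9, 17, 13, 27, -122, 25],
    [48, 92, 16, 88, 16, 48, 56, -364],
], dtype=float)

# Generator check: off-diagonal rates are non-negative and every row sums to zero.
row_sums = Q.sum(axis=1)

# Stationary distribution: solve pi @ Q = 0 together with sum(pi) = 1.
r = Q.shape[0]
A = np.vstack([Q.T, np.ones(r)])
b = np.zeros(r + 1)
b[-1] = 1.0
pi, *_ = np.linalg.lstsq(A, b, rcond=None)

# Mean holding time of each state, in the reciprocal of the (unstated) rate unit.
exit_rates = -np.diag(Q)
mean_holding = 1.0 / exit_rates

# Embedded jump chain: probability of moving to state j when leaving state i.
jump = Q / exit_rates[:, None]
np.fill_diagonal(jump, 0.0)
```

The stationary vector `pi` gives the long-run fraction of time spent in each load state, which, combined with the load levels of Table 1, would yield the long-run average demand.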

Through this formulation, the PV–PEM microgrid dynamically adapts its energy flow, not only to meet immediate consumption needs but also to prioritize long-term sustainability by converting excess renewable energy into hydrogen.

4.1. Performance Evaluation and Interpretation of Key Findings

The simulation study was carried out under two distinct scenarios to evaluate the performance of the proposed EMS strategy. The first scenario considered synthetically generated weather profiles under standard test conditions. The second scenario employed real-time meteorological data obtained from the weather monitoring station at the School of Electrical, Mechanical, and Computer Engineering (EMC), the Federal University of Goiás (UFG), located in Goiânia, Brazil. This dataset, which includes measurements of solar irradiance and ambient temperature, is publicly accessible via https://sites.google.com/site/sfvemcufg/weather-station (accessed on 1 May 2025).

4.1.1. Scenario 1: Synthetic Weather Profiles

In the first scenario, simulations were carried out using synthesized weather data to replicate dynamic environmental conditions. This setup enables a rigorous evaluation of the proposed EMS under standard test conditions ($T = 25$ °C and $G = 1000$ W/m$^{2}$), which are critical for accurately modeling the stochastic nature of load demand in off-grid PV systems.

These simulations are crucial for validating the system’s performance when subjected to unpredictable load consumption, offering insights into its ability to track the maximum power point, manage load demands, and regulate the battery SoC efficiently. Furthermore, the controller’s capability to handle energy surpluses through hydrogen production using the PEM electrolyzer was also evaluated, ensuring that excess PV power is utilized productively when the battery reaches its maximum capacity.

In our simulations, the PV system was configured to supply energy to a set of DC loads with time-varying power demand. To account for the stochastic nature of load behavior, we defined eight distinct consumption scenarios, as presented in Table 1, each corresponding to a specific level of load demand. These scenarios were synthetically generated to reflect a wide range of realistic load conditions and abrupt consumption variations commonly observed in standalone PV systems. They were modeled using an MDP, represented by the state variable $θ_{t}$, allowing us to capture the probabilistic switching between load profiles based on a predefined transition rate matrix $Q$.

To manage the uncertainties associated with time-varying load behavior, an MDP-based energy management strategy was implemented to estimate, in real-time, the global consumption state. This strategy relies on the transition matrix Q, which defines the probabilistic evolution between eight predefined load scenarios, as presented in Figure 6.

The MDP framework enables the control system to dynamically adjust energy distribution according to the anticipated load level by continuously evaluating the current operating state and forecasting likely transitions. This adaptability ensures that energy flows are optimally balanced between the PV source, the storage system, the DC loads, and the hydrogen production.

As depicted in Figure 7, the proposed MDP-based mechanism effectively tracks abrupt and unpredictable variations in consumption, demonstrating a high level of responsiveness and robustness. Its predictive capability enhances decision-making under uncertainty, allowing the system to proactively compensate for demand fluctuations while maintaining operational efficiency.

Figure 8a depicts the temporal evolution of the PV power output under the supervision of the proposed control strategy, previously outlined in Figure 5. The results clearly show that the controlled system effectively follows the reference power profile $P_{ref}$, even in the presence of sudden shifts in load demand. This capacity to promptly track dynamic setpoints demonstrates the responsiveness and precision of the control algorithm during both start-up and steady-state conditions. These results are achieved by minimizing the robust control cost defined in Equation (13), where the robustness parameter $δ$ was tuned to an optimal value of $0.007$. This setting ensures reliable tracking of $P_{ref}$ under stochastic load variations, balancing robustness and performance.

Furthermore, the proposed control scheme demonstrates remarkable resilience to stochastic variation induced by MDP-based load forecasting, ensuring stable and reliable power delivery under dynamic and unpredictable conditions. Compared to the conventional Perturb and Observe (P&O) technique, the proposed strategy achieves a significantly lower average tracking error of 0.3125, versus 9.8836 for P&O. It also maintains a higher average energy conversion efficiency of 99.9%, compared to 98.64% for the P&O method. As illustrated in Figure 8, the proposed controller responds much faster to abrupt load changes, quickly converging to the new optimal operating point, while the P&O technique exhibits slower adaptation and larger oscillations. This rapid and accurate convergence, coupled with stable voltage and current behavior, confirms the effectiveness of the proposed method for real-time energy management in standalone PV systems. Overall, these results emphasize the superior performance, robustness, and adaptability of the proposed control strategy in managing uncertain and time-varying energy flows without compromising system stability.

The temporal profile of the battery’s SoC is illustrated in Figure 9, highlighting the effectiveness of the proposed stochastic EMS in real-time operation. Throughout the simulation, the SoC remains consistently within the predefined safety margins, namely $SoC_{min} = 10\%$ and $SoC_{max} = 90\%$, despite the presence of unpredictable variations in load demand. This indicates that the control system successfully anticipates fluctuations and allocates available energy accordingly. When the battery approaches its upper charge threshold, the control algorithm intelligently diverts excess photovoltaic power, rather than curtailing it, toward the PEM electrolyzer for green hydrogen production. This coordinated mechanism ensures optimal utilization of solar resources, prevents battery overcharging, and promotes sustainable energy storage through hydrogen generation.

Figure 10 and Figure 11 illustrate the cumulative mass of hydrogen produced and the corresponding water consumption over the simulation horizon. These results highlight the effectiveness of the proposed energy management strategy in harnessing surplus PV energy for sustainable hydrogen generation once the battery reaches its maximum SoC.

As shown, hydrogen production increases progressively during periods of high solar availability and low load demand, indicating that excess energy is efficiently diverted to the PEM electrolyzer rather than being curtailed. The associated water consumption profile in Figure 11 mirrors the hydrogen production trend, reflecting the relationship between water electrolysis and hydrogen output.
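The proportionality between the two curves follows from stoichiometry: each mole of hydrogen produced consumes one mole of water. As a back-of-the-envelope sketch, assuming hydrogen production follows Faraday's law with an ideal (unit) Faraday efficiency by default, the cumulative hydrogen mass and water consumption can be estimated from the electrolyzer current as shown below. The current profile, cell count, and function names are hypothetical, and this is not necessarily the electrolyzer model used in the simulations.

```python
import numpy as np

FARADAY = 96485.33212    # Faraday constant, C/mol
M_H2 = 2.01588e-3        # molar mass of H2, kg/mol
M_H2O = 18.01528e-3      # molar mass of H2O, kg/mol

def cumulative_hydrogen_kg(current_a, dt_s, n_cells=1, faraday_eff=1.0):
    """Cumulative H2 mass from Faraday's law (two electrons per H2 molecule)."""
    current_a = np.asarray(current_a, dtype=float)
    charge_c = np.cumsum(current_a) * dt_s             # charge passed per cell, C
    mol_h2 = faraday_eff * n_cells * charge_c / (2.0 * FARADAY)
    return mol_h2 * M_H2

def water_consumed_kg(m_h2_kg):
    """Water mass consumed: one mole of H2O per mole of H2 produced."""
    return np.asarray(m_h2_kg, dtype=float) / M_H2 * M_H2O

# Hypothetical electrolyzer current samples (A) at a 60 s step, 10 cells in series.
current = np.array([0.0, 5.0, 10.0, 10.0, 0.0])
m_h2 = cumulative_hydrogen_kg(current, dt_s=60.0, n_cells=10)
m_h2o = water_consumed_kg(m_h2)
```

Under these assumptions the water curve is the hydrogen curve scaled by the molar-mass ratio of roughly 8.9, which is why the two profiles share the same shape.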

Overall, the results validate the controller’s capability to intelligently coordinate energy flows between the battery, the DC loads, and the PEM electrolyzer in accordance with the operational modes defined in Figure 12. By dynamically adjusting power allocation based on the battery’s SoC, the real-time load demand, and the availability of PV generation, the proposed strategy ensures efficient energy utilization under varying conditions. This adaptive management not only prevents battery overcharge or deep discharge but also enables the productive use of surplus solar energy for green hydrogen production.

4.1.2. Scenario 2: Real-Time Weather Data

In this scenario, the performance of the proposed multi-objective stochastic control strategy is evaluated under real-time solar irradiance and temperature conditions, as depicted in Figure 13. These realistic, time-varying environmental profiles, with operating temperatures ranging from 19 °C to 55 °C, make it possible to evaluate the controller’s adaptability and robustness in managing energy flows within the microgrid.

Figure 14 illustrates the temporal evolution of the MDP. Based on the defined state space in Table 2, the MDP effectively tracks and forecasts variations in load consumption under realistic weather conditions. The system achieved precise real-time estimation of the total load demand by implementing the MDP-based stochastic forecasting strategy, as shown in Figure 15. These predictive insights allowed the energy management algorithm to take anticipatory and well-informed actions in distributing the available energy resources.

Figure 16 illustrates the optimized PV power output achieved under real-time weather conditions. The results demonstrate the ability of the proposed stochastic control strategy to continuously adapt PV generation in response to fluctuating solar irradiance and sudden shifts in load demand. By efficiently tracking the available solar resource and reallocating power accordingly, the controller ensures optimal energy utilization. This adaptability highlights the robustness and responsiveness of the control approach in dynamic operational environments.

The battery’s SoC evolution under real-time weather conditions is depicted in Figure 17, offering insights into its operational performance within the proposed microgrid architecture. Throughout the simulation, the battery exhibited consistent behavior—charging during periods of high solar availability and discharging when the PV output was insufficient to meet load demand. The observed SoC peaks align with midday periods characterized by strong irradiance, while decreases correspond to cloudy intervals or load surges. Notably, once the SoC approached its upper threshold, surplus energy was redirected to the electrolyzer for hydrogen production, thereby preventing overcharging. The battery operated reliably within the predefined bounds, ensuring smooth transitions and avoiding excessive cycling. This controlled SoC profile confirms the robustness of the proposed stochastic control framework in maintaining energy balance and safeguarding system stability under varying real-world conditions.

The results presented in Figure 18 and Figure 19 provide a comprehensive overview of the hydrogen production performance and corresponding water consumption under real-time weather conditions. The mass of hydrogen generated follows a variable trend that reflects both the availability of excess PV energy and the battery’s SoC. When the battery reaches its upper SoC threshold, surplus solar energy is intelligently redirected to power the electrolyzer, resulting in efficient green hydrogen production.

This process is inherently linked to the fluctuating solar irradiance levels observed throughout the day, as well as the stochastic load demand predicted by the MDP-based strategy. Simultaneously, the evolution of the water consumption curve shows a direct proportional relationship to hydrogen generation, confirming the expected electrolysis behavior. According to the operating modes illustrated in Figure 20, these results validate the capacity of the energy management system to optimize renewable energy utilization and support sustainable hydrogen production, even under dynamically changing weather and load conditions.
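The proportionality between water consumption and hydrogen generation follows from the stoichiometry of electrolysis, in which one mole of water yields one mole of hydrogen. The sketch below uses the standard form of Faraday's law with the stack parameters from Table A1 (50 cells, 1.8 V per cell, hydrogen molar mass 2.016 g/mol). Several points are assumptions rather than the paper's model: the cells are taken to be in series at nominal voltage, the efficiency is passed in as a generic factor `efficiency`, and the molar mass of water (18.015 g/mol) and the Faraday constant are standard values not listed in the source.

```python
FARADAY_C_PER_MOL = 96485.332   # Faraday constant (standard value)
M_H2_G_PER_MOL = 2.016          # Table A1
M_H2O_G_PER_MOL = 18.015        # standard molar mass of water (not in Table A1)
N_CELLS = 50                    # Table A1
V_CELL = 1.8                    # Table A1, nominal voltage per cell (V)


def hydrogen_and_water(p_el_w, duration_s, efficiency=1.0):
    """Return (hydrogen mass, water mass) in grams for a constant power input.

    Assumes a series stack at nominal cell voltage, so the stack current is
    P / (N_CELLS * V_CELL). `efficiency` is a generic scaling factor.
    """
    current_a = p_el_w / (N_CELLS * V_CELL)
    # Each cell produces I * t / (2 F) moles of H2 (two electrons per molecule).
    mol_h2 = efficiency * N_CELLS * current_a * duration_s / (2 * FARADAY_C_PER_MOL)
    # One mole of water is consumed per mole of hydrogen produced.
    return mol_h2 * M_H2_G_PER_MOL, mol_h2 * M_H2O_G_PER_MOL


m_h2, m_h2o = hydrogen_and_water(p_el_w=180.0, duration_s=3600.0)
print(f"H2: {m_h2:.2f} g, H2O: {m_h2o:.2f} g, ratio {m_h2o / m_h2:.2f}")
```

Under these assumptions the water-to-hydrogen mass ratio is fixed at roughly 8.9, regardless of the power level, which is why the two curves have the same shape.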

4.2. Benchmarking Against Existing Control Methods

This section provides a comparative analysis with existing studies to better contextualize the proposed MDP-based energy management strategy for green hydrogen production. Table 3 highlights key distinctions in power optimization techniques, load forecasting approaches, and data requirements. Many conventional methods employ deterministic optimization, such as adaptive control or model predictive control, often based on simplified or repetitive load profiles. Other recent approaches adopt stochastic optimization techniques, including Monte Carlo simulations (MCS) and deep reinforcement learning. Although powerful, these techniques typically demand significant computational resources and large volumes of historical data for training and scenario generation.

In contrast, the proposed method leverages a finite-state MDP to model and manage random load fluctuations in real time. This eliminates the need for large-scale data collection or computationally heavy prediction engines. The key contributions of our approach are summarized as follows:

Stochastic load adaptation: By using a Markov model with discrete load states and transition probabilities, the control system can respond effectively to abrupt and unpredictable changes in load without relying on historical consumption data.
Efficient energy allocation: The MDP policy optimizes the distribution of PV power between battery charging and hydrogen production, ensuring safe battery operation and improved hydrogen yield under dynamic conditions.
Low computational overhead: The proposed strategy provides a practical alternative to high-complexity algorithms, making it well-suited for real-time applications in remote areas where computational resources may be limited.
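As a rough illustration of why a finite-state Markov model keeps forecasting cheap, a one-step-ahead load estimate only needs one row of the transition matrix. The sketch below uses the load levels of Table 2. The transition row for state 3 is a made-up example, and the helper names are hypothetical; the source does not list its transition probabilities in this section.

```python
# Load levels from Table 2 (W), indexed by Markov state 1..8.
LOAD_LEVELS_W = [100, 50, 300, 10, 150, 75, 250, 80]


def expected_next_load(transition_row, levels):
    """Expected load at the next step given the current state's transition row."""
    return sum(p * w for p, w in zip(transition_row, levels))


def most_likely_next_state(transition_row):
    """Most probable next state, reported as 1..n as in Table 2."""
    best_index = max(range(len(transition_row)), key=lambda j: transition_row[j])
    return best_index + 1


# Hypothetical transition probabilities out of state 3 (300 W); they sum to 1.
row_state_3 = [0.05, 0.0, 0.7, 0.0, 0.1, 0.0, 0.15, 0.0]

print(expected_next_load(row_state_3, LOAD_LEVELS_W))
print(most_likely_next_state(row_state_3))
```

Each forecast is a single weighted sum over eight states, which is the kind of operation that fits comfortably on a small embedded controller.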

5. Conclusions

This study aimed to develop a robust stochastic energy management strategy for standalone PV systems dedicated to green hydrogen production, focusing on the intelligent exploitation of excess solar energy under unpredictable load conditions. Recognizing the limitations of traditional deterministic control methods, the proposed approach integrates optimization control with an MDP to forecast short-term load consumption and guide optimal power dispatch between DC loads, battery storage, and a PEM electrolyzer.

The simulation results validated the effectiveness of the proposed strategy under realistic and dynamic operating scenarios. The MDP-based controller successfully adapted to fluctuations in both solar irradiance and load behavior, enabling more efficient use of surplus energy for hydrogen production. Compared to baseline strategies without stochastic modeling, the proposed method achieved a power optimization efficiency of 99.6%, which consequently enhanced the continuity of hydrogen generation. Additionally, intelligent scheduling of battery operations contributed to a potential extension of battery lifespan by reducing cycling stress, as reflected in decreased depth-of-discharge fluctuations.

One of the key findings of this work is the demonstration that incorporating stochastic load forecasting enables proactive and adaptive energy management, even in highly variable off-grid environments. This makes the proposed framework particularly relevant for autonomous energy systems in remote or infrastructure-limited regions, defined here as locations where grid connectivity is unavailable or unreliable and where access to maintenance resources, fuel supply chains, or technical support is severely constrained. In such regions, consistent hydrogen production and system resilience are critical.

Despite these promising results, the current model assumes idealized sensor measurements without explicit consideration of measurement noise or hardware limitations. Furthermore, the Faraday efficiency and electrolyzer performance parameters were treated as constants, which may limit accuracy under varying operating conditions. These assumptions represent limitations that will be addressed in future work.

Future research will explore the integration of additional renewable energy sources, such as wind or biomass, to further enhance the adaptability and autonomy of the standalone microgrid system, enabling more robust and resilient operation under diverse environmental conditions. To bridge the gap between simulation and practical deployment, we also plan to conduct real-time hardware-in-the-loop (HIL) validation to assess the feasibility and effectiveness of the proposed stochastic energy management strategy on actual control hardware.

Implementing the MDP-based controller in real hardware environments presents several challenges, including computational requirements for real-time load forecasting and decision-making, as well as the necessity for continuous, accurate monitoring of load demand and system states. Addressing these challenges will involve optimizing the algorithm for embedded platforms with limited processing power and memory, and developing efficient data acquisition systems capable of reliable, low-latency measurement of system variables.

Author Contributions

Conceptualization, M.A. and W.J.; methodology, M.A.; software, M.A.; validation, M.A., W.J., and M.I.M.; formal analysis, M.A. and M.I.M.; investigation, W.J. and M.I.M.; resources, S.A.H.; writing—original draft preparation, M.A.; writing—review and editing, W.J. and M.I.M.; project administration, S.A.H.; funding acquisition, S.A.H. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Princess Nourah bint Abdulrahman University Researchers Supporting Project number (PNURSP2025R827), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

BESS	Battery Energy Storage System
EMS	Energy Management System
ESS	Energy Storage System
HIL	Hardware-In-the-Loop
MDP	Markov Decision Process
MCS	Monte Carlo Simulation
MPPT	Maximum Power Point Tracking
MPC	Model Predictive Control
PEM	Proton Exchange Membrane
PMC	Power Management Controller
P&O	Perturb and Observe
PV	Photovoltaic
SEM	Stochastic Energy Management
SoC	State of Charge

Appendix A

Table A1. Specifications of the PV-PEM-Battery DC Microgrid.

Parameters	Value	Unit
PV module SP75
Series-parallel cells, $(N_{s}, N_{p})$	(36, 1)
Maximum power, $P_{max}$	74.8	W
Voltage at maximum power, $V_{mp}$	17	V
Current at maximum power, $I_{mp}$	4.26	A
Open circuit voltage, $V_{oc}$	21.6	V
Short circuit current, $I_{sc}$	4.7	A
Reverse saturation current, $I_{rr}$	$1.5885 \times 10^{-8}$	A
Temperature coefficient, $K_{I}$	2.06	mA/°C
Ideality factor, $η$	1.2
Lithium-ion battery
Nominal voltage, $v_{bat}$	52.2	V
Battery capacity, $C_{bat}$	100	Ah
Efficiency of battery charge, $η_{c}$	0.9
Efficiency of battery discharge, $η_{d}$	0.9
PEM electrolyzer
Nominal voltage per cell, $v_{PEM,cell}$	1.8	V
Number of cells, $N_{PEM,cell}$	50
PEM electrolyzer efficiency, $η_{PEM}$	0.7
Molar mass, $M_{H_{2}}$	2.016	g/mol
DC-DC boost converter
Input capacitor, $C_{pv}$	1	mF
Output capacitor, $C$	100	μF
Output capacitor resistance, $R_{C}$	0.162	Ω
Inductance, $L$	10	mH
Inductance resistance, $R_{L}$	0.48	mΩ
Internal resistance of MOSFET, $R_{M}$	0.27	Ω
Internal resistance of diode, $R_{D}$	0.24	Ω

References

1. Qin, H.; Tang, S.; Xu, L.; Li, A.; Lv, Q.; Dong, J.; Liu, L.; Ding, X.; Jiang, N.; Luo, R.; et al. Alkaline functional chromium carbide: Immobilization of ultrafine ruthenium copper nanoparticles for efficient hydrogen evolution from ammonia borane hydrolysis. J. Colloid Interface Sci. 2025, 697, 137897.
2. Wen, J.; Tang, S.; Ding, X.; Yin, Y.; Song, F.; Yang, X. In Situ Raman Study of Layered Double Hydroxide Catalysts for Water Oxidation to Hydrogen Evolution: Recent Progress and Future Perspectives. Energies 2024, 17, 5712.
3. Liang, J.; Li, H.; Chen, L.; Ren, M.; Fakayode, O.A.; Han, J.; Zhou, C. Efficient hydrogen evolution reaction performance using lignin-assisted chestnut shell carbon-loaded molybdenum disulfide. Ind. Crops Prod. 2023, 193, 116214.
4. Chatterjee, P.; Ambati, M.S.K.; Chakraborty, A.K.; Chakrabortty, S.; Biring, S.; Ramakrishna, S.; Wong, T.K.S.; Kumar, A.; Lawaniya, R.; Dalapati, G.K. Photovoltaic/photo-electrocatalysis integration for green hydrogen: A review. Energy Convers. Manag. 2022, 261, 115648.
5. Indrajith, B.; Gunawardane, K. Navigating the Intersection of Microgrids and Hydrogen: Evolutionary Trends, Challenges, and Future Strategies. Energies 2025, 18, 614.
6. Shanmugasundaram, S.; Thangaraja, J.; Rajkumar, S.; Ashok, S.D.; Sivaramakrishna, A.; Shamim, T. A review on green hydrogen production pathways and optimization techniques. Process Saf. Environ. Prot. 2025, 197, 107070.
7. Folgado, F.J.; González, I.; Calderón, A.J. Simulation platform for the assessment of PEM electrolyzer models oriented to implement digital Replicas. Energy Convers. Manag. 2022, 267, 115917.
8. Wallnöfer-Ogris, E.; Grimmer, I.; Ranz, M.; Höglinger, M.; Kartusch, S.; Rauh, J.; Macherhammer, M.G.; Grabner, B.; Trattner, A. A review on understanding and identifying degradation mechanisms in PEM water electrolysis cells: Insights for stack application, development, and research. Int. J. Hydrogen Energy 2024, 65, 381–397.
9. Pouya Beigzadeh, A.; Ataollah, N.; Ombretta, P. Parametric Sensitivity of a PEM Electrolyzer Mathematical Model: Experimental Validation on a Single-Cell Test Bench. Energies 2025, 18, 2217.
10. Arias, I.; Battisti, F.G.; Romero-Ramos, J.A.; Pérez, M.; Valenzuela, L.; Cardemil, J.; Escobar, R. Assessing system-level synergies between photovoltaic and proton exchange membrane electrolyzers for solar-powered hydrogen production. Appl. Energy 2024, 368, 123495.
11. Shi, R.; Guo, X.; Ren, D. How to extend the photovoltaic value chain? A blockchain-based strategy for photovoltaic-storage-hydrogen integration with green electricity trading. Energy 2025, 315, 134394.
12. Ratib, M.K.; Muttaqi, K.M.; Islam, M.R.; Sutanto, D.; Agalgaonkar, A.P. Large-scale production of green hydrogen from solar energy in Australia: Operation and control of a multi-unit PEM electrolyser system. Int. J. Hydrogen Energy 2025, 98, 873–886.
13. Karthikeyan, B.; Kumar, G.P.; Basa, S.; Sinha, S.; Tyagi, S.; Kamat, P.; Prabakaran, R.; Kim, S.C. Strategic optimization of large-scale solar PV parks with PEM Electrolyzer-based hydrogen production, storage, and transportation to minimize hydrogen delivery costs to cities. Appl. Energy 2025, 377, 124758.
14. Tebibel, H. Off grid PV system for hydrogen production using PEM methanol electrolysis and an optimal management strategy. Int. J. Hydrogen Energy 2017, 42, 19432–19445.
15. Kumar, R.K.; Samuel, P. Designing a hydrogen generation system through PEM water electrolysis with the capability to adjust fast fluctuations in photovoltaic power. Int. J. Hydrogen Energy 2024, 82, 1–10.
16. Dahbi, S.; Aziz, A.; Messaoudi, A.; Mazozi, I.; Kassmi, K.; Benazzi, N. Management of excess energy in a photovoltaic/grid system by production of clean hydrogen. Int. J. Hydrogen Energy 2018, 43, 5283–5299.
17. Cecilia, A.; Carroquino, J.; Roda, V.; Costa-Castelló, R.; Barreras, F. Optimal energy management in a standalone microgrid, with photovoltaic generation, short-term storage, and hydrogen production. Energies 2020, 13, 1454.
18. Hossain, M.A.; Islam, M.R.; Hossain, M.A.; Hossain, M. Control strategy review for hydrogen-renewable energy power system. J. Energy Storage 2023, 72, 108170.
19. Alharbi, A.G.; Olabi, A.; Rezk, H.; Fathy, A.; Abdelkareem, M.A. Optimized energy management and control strategy of photovoltaic/PEM fuel cell/batteries/supercapacitors DC microgrid system. Energy 2024, 290, 130121.
20. Renaudineau, H.; Llor, A.M.; Hernandez, M.S.; Concha, D.; Wilson-Veas, A.H.; Kouro, S. Photovoltaic to electrolysis off-grid green hydrogen production with DC–DC conversion. Renew. Energy 2024, 237, 121687.
21. Sánchez-Squella, A.; Flores, R.; Burgos, R.; Morales, F.; Nader, A.; Valdivia-Lefort, P. 99.6% efficiency DC-DC coupling for green hydrogen production using PEM electrolyzer, photovoltaic generation and battery storage operating in an off-grid area. Renew. Energy 2024, 237, 121781.
22. Liu, H.D.; Huang, B.J.; Chen, W.Y.; Shih, J.W. A solar energy system with a dual-input power converter and global MPPT for off-grid applications. Electr. Power Syst. Res. 2025, 243, 111497.
23. Rezzak, D.; Beddiaf, Y.; Boudjerda, N.; Kihal, M.C.; Arbid, M. Reset limited integral hysteresis sliding mode and hybrid MPPT/DC-Bus controls with limits supervision in photovoltaic/valve-regulated lead-acid battery system. J. Energy Storage 2025, 112, 115507.
24. Rizk-Allah, R.M.; Hassan, I.A.; Snasel, V.; Hassanien, A.E. An optimal standalone wind-photovoltaic power plant system for green hydrogen generation: Case study for hydrogen refueling station. Results Eng. 2024, 22, 102234.
25. Koholé, Y.W.; Ngopgang, B.R.; Fohagui, F.C.V.; Ngouleu, C.A.W.; Tchuen, G. Green hydrogen production and storage via excess energy derived from a hybrid power system under different climatic conditions: Cameroon case study. Energy Convers. Manag. 2025, 325, 119418.
26. Kookos, I.K. Systematic optimization of off-grid green hydrogen production systems. Int. J. Hydrogen Energy 2024, 79, 1299–1312.
27. Alturki, A.A. Optimal design for a hybrid microgrid-hydrogen storage facility in Saudi Arabia. Energy Sustain. Soc. 2022, 12, 24.
28. Guo, X.; Gu, F.; Liu, H.; Yu, Y.; Li, R.; Wang, J. Sustainable PV-hydrogen-storage microgrid energy management using a hierarchical economic model predictive control framework. Energy Inform. 2025, 8, 18.
29. Cardona, P.; Costa-Castelló, R.; Roda, V.; Carroquino, J.; Valiño, L.; Serra, M. Model predictive control of an on-site green hydrogen production and refuelling station. Int. J. Hydrogen Energy 2023, 48, 17995–18010.
30. Syed, M.A.; Khalid, M. An intelligent model predictive control strategy for stable solar-wind renewable power dispatch coupled with hydrogen electrolyzer and battery energy storage. Int. J. Energy Res. 2023, 2023, 4531054.
31. Battula, S.; Panda, A.K.; Garg, M.M. Stand-alone PV connected system with energy storage with flexible operation. Electr. Eng. 2024, 106, 2893–2907.
32. Holtwerth, A.; Xhonneux, A.; Müller, D. Model Predictive Control of a Stand-Alone Hybrid Battery-Hydrogen Energy System: A Case Study of the PHOEBUS Energy System. Energies 2024, 17, 4720.
33. Younis, R.A.; Touti, E.; Aoudia, M.; Zahrouni, W.; Omar, A.I.; Elmetwaly, A.H. Innovative hybrid energy storage systems with sustainable integration of green hydrogen and energy management solutions for standalone PV microgrids based on reduced fractional gradient descent algorithm. Results Eng. 2024, 24, 103229.
34. Urhan, B.B.; Erdoğmuş, A.; Dokuz, A.Ş.; Gökçek, M. Predicting green hydrogen production using electrolyzers driven by photovoltaic panels and wind turbines based on machine learning techniques: A pathway to on-site hydrogen refuelling stations. Int. J. Hydrogen Energy 2025, 101, 1421–1438.
35. Aatabe, M.; El Guezar, F.; Vargas, A.N.; Bouzahir, H. A novel stochastic maximum power point tracking control for off-grid standalone photovoltaic systems with unpredictable load demand. Energy 2021, 235, 121272.
36. Moncecchi, M.; Brivio, C.; Mandelli, S.; Merlo, M. Battery energy storage systems in microgrids: Modeling and design criteria. Energies 2020, 13, 2006.
37. Zhang, C.; Li, P.; Guo, Y. Bidirectional DC/DC and SOC drooping control for DC microgrid application. Electronics 2020, 9, 225.
38. Yodwong, B.; Guilbert, D.; Phattanasak, M.; Kaewmanee, W.; Hinaje, M.; Vitale, G. Proton exchange membrane electrolyzer modeling for power electronics control: A short review. C 2020, 6, 29.
39. Aatabe, M.; El Abbadi, R.; Vargas, A.N.; Bouzid, A.E.M.; Bawayan, H.; Mosaad, M.I. Stochastic energy management strategy for autonomous PV–microgrid under unpredictable load consumption. IEEE Access 2024, 12, 84401–84419.
40. Tighirt, A.; Aatabe, M.; El Guezar, F.; Bouzahir, H.; Vargas, A.N. Stochastic power management strategy for an autonomous wind energy conversion system with battery storage under random load consumption using Markov process. J. Energy Storage 2025, 114, 115812.
41. Aatabe, M.; Latif, R.; Mosaad, M.I.; Hussien, S.A. Stochastic energy management of DC photovoltaic microgrids using Markov decision process. Results Eng. 2025, 27, 105835.
42. Aatabe, M.; El Guezar, F.; Bouzahir, H.; Vargas, A.N. Constrained stochastic control of positive Takagi-Sugeno fuzzy systems with Markov jumps and its application to a DC-DC boost converter. Trans. Inst. Meas. Control 2020, 42, 3234–3242.
43. Tighirt, A.; Aatabe, M.; El Guezar, F.; Bouzahir, H.; Vargas, A.N.; Neretti, G. A New Stochastic Controller for Efficient Power Extraction from Small-Scale Wind Energy Conversion Systems under Random Load Consumption. Energies 2024, 17, 4927.
44. El Abbadi, R.; Aatabe, M.; Bouzid, A.E.M. Wireless Diagnosis and Control of DC–DC Converter for Off-Grid Photovoltaic Systems. Sustainability 2024, 16, 3252.
45. Kumar, M.; Singh, R.; Arora, P.; Bhosale, A. An adaptive control strategy for DC-DC buck converter for a small-scale distributed green hydrogen production unit using SPV-battery-based off-grid system. Renew. Energy 2025, 255, 123697.
46. Buchibabu, P.; Somlal, J. Green energy management in DC microgrids enhanced with robust model predictive control and muddled tuna swarm MPPT. Electr. Eng. 2024, 106, 2799–2819.
47. Zhang, Y.; Wei, W. Model construction and energy management system of lithium battery, PV generator, hydrogen production unit and fuel cell in islanded AC microgrid. Int. J. Hydrogen Energy 2020, 45, 16381–16397.
48. Dong, W.; Sun, H.; Mei, C.; Li, Z.; Zhang, J.; Yang, H. Forecast-driven stochastic optimization scheduling of an energy management system for an isolated hydrogen microgrid. Energy Convers. Manag. 2023, 277, 116640.

Figure 1. Scheme of the standalone PV–PEM–microgrid system.

Figure 2. Schematics of the PV generator system.

Figure 3. Schematic of a bidirectional DC-DC converter system with battery integration.

Figure 4. Structure of the DC PV-PEM microgrid with the energy management system.

Figure 5. Stochastic control-based energy management for PV–PEM microgrid operation.

Figure 6. Estimated load consumption under standard test conditions.

Figure 7. Markov chain evolution under standard test conditions.

Figure 8. PV power by applying: (a) stochastic $H_{\infty}$ controller and (b) P&O technique.

Figure 9. Battery SoC under standard test conditions.

Figure 10. Mass of hydrogen produced under standard test conditions.

Figure 11. Mass of water consumed under standard test conditions.

Figure 12. Operating modes of the microgrid under standard test conditions.

Figure 13. Five days of real-time data for (a) temperature and (b) irradiance.

Figure 14. Markov chain evolution under real-time weather conditions.

Figure 15. Estimated load consumption under real-time weather conditions.

Figure 16. PV power by applying the stochastic $H_{\infty}$ controller under real-time weather conditions.

Figure 17. Battery SoC under real-time weather conditions.

Figure 18. Mass of hydrogen produced under real-time weather conditions.

Figure 19. Mass of water consumed under real-time weather conditions.

Figure 20. Operating modes of the microgrid under real-time weather conditions.

Table 1. Levels of the global load consumption under standard test conditions.

$θ_{t}$	1	2	3	4	5	6	7	8
$P_{load}(θ_{t})$ (W)	250	100	450	400	150	200	350	300

Table 2. Levels of the global load consumption under real-time weather conditions.

$θ_{t}$	1	2	3	4	5	6	7	8
$P_{load}(θ_{t})$ (W)	100	50	300	10	150	75	250	80

Table 3. Comparison between this study and related works in the literature.

Literature	Power Optimization Aspect	Load Forecasting	Previous Data Requirement
[45]	Deterministic	–	No
[46]	Deterministic	–	No
[47]	Deterministic	–	No
[48]	Stochastic	Stochastic	Yes
This paper	Stochastic	MDP	No

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
