3.5.1. Objective Function
The MPC objective function is designed to balance multiple, and potentially conflicting, performance goals, including power tracking accuracy, SoC regulation, and mitigation of component stress. Accordingly, the cost function minimized at each control step is formulated as [
36,
38,
39]:
where
denotes the electrical load demand at the DC bus,
represents the desired battery state-of-charge reference,
is the battery current, and
corresponds to the rate of change in the fuel cell power. The weighting coefficients
and
are non-negative scalars that reflect the relative importance of power tracking performance, SoC regulation, battery stress reduction, and fuel cell ramp-rate limitation, respectively [
36,
38,
39,
61]. Regarding the admissible values of these weights, they are non-negative scalars (
w ≥ 0) that encode only the relative importance of the competing objectives, and they are not subject to any normalization constraint; in particular, their sum is not required to equal unity. Because the cost function in Equation (16) is minimized, multiplying all weights by a common positive factor leaves the optimal solution unchanged, so only their ratios are meaningful. For this reason the power-tracking weight is fixed to
= 1.00 as a reference, and the remaining weights are expressed relative to it. Finally, the weights are held constant throughout the entire mission and are not varied dynamically as a function of the flight phase; the use of flight-situation-dependent (adaptive) weighting is identified as a direction for future work.
The weighting coefficients used in the objective function were determined through iterative sensitivity-oriented tuning trials conducted under representative UAV mission conditions. During the tuning process, different weighting combinations were evaluated in terms of DC-bus power balance performance, hydrogen utilization, battery SoC regulation, PEMFC power smoothness, and transient battery current behavior. The power tracking term
was assigned the highest priority to guarantee continuous load supply during dynamically varying flight phases. The SoC regulation term
was selected to prevent excessive battery depletion, while the battery current penalty term
and PEMFC ramp-rate penalty term
were introduced to reduce abrupt transient electrical stress and improve operational smoothness of the hybrid power system. The final weighting coefficients were selected as a compromise between power continuity, component stress mitigation, operational stability, and computational tractability within the considered mission profile [
36,
38,
39,
61].
The power tracking term enforces DC-bus power balance and minimizes load mismatch, ensuring uninterrupted operation of the propulsion and onboard systems. The SoC regulation term prevents excessive battery depletion or overcharging, while the battery current penalty limits high current excursions that may accelerate battery degradation. In addition, Penalizing rapid variations in fuel cell power mitigates degradation mechanisms such as oxygen starvation and membrane stress, while limiting battery current reduces thermal and electrochemical aging effects. These terms collectively act as degradation proxies, enabling health-conscious operation without requiring an explicit aging model [
36,
38,
39,
62].
Through this formulation, the MPC achieves a systematic trade-off between endurance maximization and component health preservation under dynamic flight conditions. Unlike conventional rule-based or heuristic energy management strategies that rely on predefined thresholds and often induce abrupt switching between power sources, the predictive and optimization-based nature of MPC enables smooth and continuous power sharing based on system dynamics and future load evolution [
36,
38]. As a result, component stress is proactively managed rather than reactively addressed, making the proposed MPC framework particularly well-suited for hybrid PEMFC-powered quadcopter platforms operating under highly transient flight profiles [
39,
61].
3.5.2. Battery SoC Reference Strategy
Unlike conventional charge-sustaining strategies that enforce a fixed SoC reference, a charge-depleting SoC reference is adopted in this study. This choice is motivated by the mission-oriented nature of UAV applications, where the primary objective is to maximize flight endurance rather than preserve battery energy for indefinite operation. By allowing the battery SoC to gradually decrease within predefined safety limits, the PEMFC can operate more steadily, reducing transient stress and improving overall system efficiency [
30,
36,
63].
The SoC reference trajectory is therefore selected within a bounded interval
,
, ensuring that the battery remains within a safe and functional operating region throughout the mission. This formulation provides a practical compromise between endurance maximization and battery health preservation [
39].
The MPC optimization problem explicitly incorporates a set of operational constraints reflecting the physical and safety limitations of the hybrid power system:
These constraints ensure safe operation of both the fuel cell and the battery, while preventing excessive current draw, overcharging, deep discharging, and abrupt fuel cell power ramps. By embedding these limits directly into the optimization problem, constraint satisfaction is guaranteed by design rather than enforced through heuristic post-processing [
29,
39,
58].
At each sampling instant, the finite-horizon optimization problem was solved using an in-house MATLAB/Simulink implementation developed for the proposed hybrid UAV energy management framework. The controller evaluates admissible PEMFC power trajectories within the defined operational constraints over the prediction horizon and selects the control sequence minimizing the objective function. The adopted implementation follows a constrained finite-horizon search strategy suitable for the relatively low-dimensional control problem considered in this study. No artificial intelligence-based optimizer or external commercial MPC solver was employed. This implementation was selected to maintain transparency, computational simplicity, and direct controllability of the optimization procedure within the considered UAV energy management application.
The upper and lower operational limits defined within the MPC framework are determined by taking into account manufacturer data, electrochemical constraints, and operating ranges proposed in the literature to ensure the safe and efficient operation of the fuel cell. Particularly in PEMFC-based UAV systems, the limited power density and slow dynamic response characteristics of fuel cells make it difficult to meet sudden power demands on their own, thus necessitating the use of hybrid energy systems [
64,
65]. In the present work, this hybrid energy system is not treated only as a general concept from the literature but is explicitly modeled: the specific configuration considered and simulated here is a PEMFC–battery hybrid architecture, in which the PEMFC is described by the semi-empirical polarization model (
Section 3.2), the battery by a Thevenin equivalent circuit with coulomb-counting state of charge (
Section 3.3), and the two sources are coupled through the DC-bus power balance together with the balance-of-plant loads (
Section 3.1). The operational limits discussed here are precisely the constraints imposed on this modeled hybrid system within the MPC formulation.
In this context, MPC offers an effective approach for the management of fuel cell-based UAV energy systems with complex and constrained operating conditions due to its ability to directly incorporate constraints such as maximum power, power ramp rate, and system dynamics into the model. Determining the power limits in PEMFC-based UAV systems is critical to prevent operating conditions that shorten cell life, such as overloading and high current density. The literature indicates that continuous operation in high-power regions can adversely affect fuel cell performance and accelerate degradation processes. Furthermore, it is stated that commercial PEMFC UAV systems generally prefer more simplified operating condition management systems, and therefore these systems are more sensitive to external factors such as ambient temperature, pressure, and humidity. This situation increases the importance of appropriate constraint definitions and advanced energy management strategies, particularly highlighting MPC-based approaches [
65].
Similarly, the maximum battery current and PEMFC power ramp-rate limits have been determined by considering the transient response characteristics of the hybrid power system, the durability of the electrochemical components, and long-term operational reliability. The literature indicates that PEMFC systems are subject to different degradation mechanisms, especially under start/stop, idle, nominal load, and high-power operating conditions, and that dynamic load variations have a decisive effect on system lifetime [
66]. During rapid load changes, short-term reactant starvation due to reactant transport limitations within the PEMFC, along with temperature and humidity gradients and sudden voltage fluctuations, can occur. This situation accelerates performance loss by creating additional stresses on the internal water/thermal distribution, catalyst layer stability, and membrane integrity. It has been reported that sudden load increases, especially under high current density, cause effects such as cathode catalyst layer degradation, Pt agglomeration, membrane fatigue, and reduction in the electrochemical active surface area [
67]. It has been shown that voltage drop, instability during load transients, and catalyst layer damage become more pronounced as the ramp rate increases, whereas more controlled load transients can reduce PEMFC degradation. Additionally, it is stated that short-term reactant starvation and flooding effects occurring during dynamic load cycles accelerate irreversible damage mechanisms such as carbon corrosion, catalyst dissolution, and membrane degradation [
66]. On the battery side, safe current limits have been defined considering that high current draws may increase thermal stress and aging effects. The operational limits determined in this context have been selected to limit sudden load changes on the PEMFC, reduce battery current stress, and maintain the safe operating region of the system components. Thus, the MPC-based energy management structure has been configured as a multi-objective framework that limits not only hydrogen consumption and SoC regulation but also rapid power changes that could accelerate PEMFC degradation.
Accordingly, the numerical constraint values adopted in this study were selected to remain within the manufacturer-recommended operating region of the Aerostak A-500 PEMFC stack and the safe operating limits of the Li-Po battery system. In particular, the PEMFC maximum power limit was restricted to the nominal rated operating range of the stack, the minimum battery SoC threshold was selected to avoid excessive deep discharge conditions, and the PEMFC ramp-rate constraint was chosen to reduce abrupt transient loading associated with electrochemical and thermal stress. The selected limits therefore represent a compromise between power availability, component protection, degradation mitigation, and real-time operational stability under representative UAV mission conditions.
To ensure robust convergence and real-time feasibility, the MPC optimization problem is solved using a brute-force search over a discretized set of admissible fuel cell power levels. Although this approach is computationally simple, it provides a global optimum within the predefined candidate set and avoids issues related to non-convexity and local minima that may arise in nonlinear hybrid system models. The prediction horizon Np is selected to balance control performance and computational burden, capturing the dominant dynamics of the fuel cell–battery system while remaining compatible with real-time execution constraints. Overall, this MPC formulation provides a systematic and transparent framework for hybrid energy management in PEMFC-powered quadcopters, enabling efficient power allocation, constraint satisfaction, and endurance-oriented operation under realistic flight conditions [
36,
38,
68].
From a computational perspective, the proposed MPC framework maintains relatively low algorithmic complexity because only a single control variable, namely the fuel cell power reference, is optimized over a finite discretized candidate set at each sampling instant. Let Nc denote the number of admissible candidate control inputs and Np represent the prediction horizon length. Under the adopted brute-force search formulation, the computational complexity scales approximately with O(Nc × Np), which remains computationally tractable for the prediction horizon and discretization levels considered in this study. Owing to the low-dimensional structure of the optimization problem, the proposed formulation is considered compatible with real-time embedded implementation constraints of UAV-class onboard processors. Nevertheless, detailed execution-time benchmarking and hardware-level latency analysis on practical embedded platforms remain important topics for future hardware-in-the-loop validation studies.
To further clarify the practical deployability of the proposed strategy on resource-constrained UAV platforms, the model-simplification choices, the rationale behind the prediction-horizon selection, the optimization solver, and an analytical real-time feasibility assessment for typical onboard processors are detailed below.
Model simplification. The control-oriented model deliberately replaces high-fidelity, spatially distributed electrochemical and multi-physics PEMFC representations with a lumped semi-empirical polarization model (Equations (9)–(11)), augmented by a single first-order transient voltage term (Equations (3) and (4)) and a coulomb-counting battery model (Equations (14) and (15)). This reduces the predicted system state to a low-dimensional set, namely the battery SoC, the filtered PEMFC operating point, and the DC-bus power balance, thereby eliminating the stiff partial-differential dynamics and large state vectors that render detailed electrochemical models computationally prohibitive for embedded prediction. The adopted simplification preserves the dominant low-frequency dynamics that govern power sharing and endurance, which are precisely the quantities the supervisory controller must regulate, while the high-frequency transient component is delegated to the fast-response battery buffer.
Horizon and discretization selection. A sampling time of Δt = 1 s and a prediction horizon of Np = 10 steps, corresponding to a 10 s look-ahead, were selected to match the dominant time scales of the supervisory energy-management problem rather than the fast electrical transients, which are buffered by the battery. The 1 s sampling interval is consistent with the slow PEMFC dynamics (effective time constant on the order of seconds) and with the ramp-rate-limited fuel-cell power command ( = 20 W/s); a finer interval would therefore yield no additional control authority over the fuel cell while increasing the computational load. The 10-step horizon is sufficiently long to anticipate battery SoC drift and mission-phase transitions, yet short enough to keep the candidate search space small. During tuning, horizons beyond this value produced negligible changes in endurance and power-continuity metrics while increasing computation approximately linearly. No separate control horizon is employed, since a single control input, namely the PEMFC power reference, is optimized at each sampling instant.
Optimization solver. At each sampling instant, the controller performs an exhaustive (brute-force) evaluation over a discretized candidate set of admissible PEMFC power levels. The admissible range [
] is discretized at a fixed resolution, yielding Nc candidates; for the present configuration (0–500 W at a 5 W resolution), this corresponds to Nc ≈ 100 candidates. Each candidate is propagated over the Np-step horizon and scored using the cost function in Equation (16), subject to the operational constraints defined in
Section 3.5.2. This procedure guarantees the global optimum within the candidate set and avoids the convergence, initialization, and local-minimum issues associated with gradient-based or interior-point solvers applied to the non-convex hybrid model, at the cost of a deterministic and bounded computational budget.
Real-time feasibility analysis. The per-step computational cost scales as O(Nc × Np). For the adopted configuration (Nc ≈ 100, Np = 10), this amounts to on the order of 10
3 candidate-step evaluations per control update, each consisting of a small number of algebraic floating-point operations from the lumped model. This workload is several orders of magnitude below the throughput of modern UAV-class flight-control processors. Typical autopilot platforms are built on ARM Cortex-M-class microcontrollers equipped with hardware floating-point units operating at hundreds of MHz (e.g., the STM32H7-based controllers used in Pixhawk-class autopilots), which sustain tens to hundreds of millions of floating-point operations per second. Consequently, the estimated worst-case execution time per control update remains a small fraction of the 1 s sampling period, leaving a substantial real-time margin for the sensing, state-estimation, and flight-control tasks that run concurrently. Comparable control-oriented predictive energy-management formulations have been reported to execute within embedded real-time constraints on similar hardware, which supports the feasibility of the proposed strategy [
36,
45,
61]. It should nonetheless be emphasized that the present assessment is analytical; direct execution-time benchmarking and hardware-in-the-loop latency measurement on a target embedded platform remain part of the planned validation work described in
Section 5.
In the present implementation, the MPC framework does not incorporate an explicit disturbance forecasting or wind prediction module within the prediction horizon. Instead, stochastic wind disturbances are introduced as external time-varying load perturbations affecting the propulsion power demand, and the controller reacts to these variations based on the updated system states at each sampling instant. Therefore, the proposed formulation should be interpreted as a disturbance-reactive MPC strategy rather than a disturbance-predictive framework. Incorporating mission-aware load forecasting and wind prediction mechanisms into the predictive layer represents an important direction for future research.
In this study, health awareness is incorporated in an implicit manner rather than through an explicit state-of-health (SoH) model. Specifically, degradation-related effects are approximated by penalizing fuel cell power ramp rates and battery current levels within the MPC cost function. These quantities are strongly correlated with known degradation mechanisms such as membrane stress, catalyst degradation, and battery aging. Therefore, the proposed formulation can be interpreted as an implicit degradation-aware stress-mitigation strategy, where component stress is regulated through degradation-related proxy variables rather than through direct electrochemical aging estimation.
It should be noted that the degradation-aware capability considered in this study is implemented in an implicit and control-oriented manner through stress-related proxy terms, namely fuel cell power ramp-rate limitation and battery current penalization. Although these variables are closely associated with known degradation mechanisms such as membrane stress, catalyst aging, and electrochemical loading effects, the proposed framework does not include a detailed electrochemical aging model or reversible voltage recovery dynamics. In practical PEMFC systems, degradation behavior is governed by complex coupled physicochemical mechanisms that may exhibit both reversible and irreversible characteristics over long-term operation. Therefore, the present formulation should be interpreted as a computationally efficient stress-mitigation strategy rather than a full physics-based health prediction framework. Incorporating higher-fidelity electrochemical degradation and reversible aging models into the predictive control architecture remains an important direction for future research.