Abstract
Fuel cell stack configuration optimization is known to be a problem that, in addition to presenting engineering challenges, is computationally hard. This paper presents an improved computational heuristic for solving the problem. The problem addressed in this paper is one of constrained optimization, where the goal is to seek optimal (or near-optimal) values of (i) the number of proton exchange membrane fuel cells (PEMFCs) to be connected in series to form a group, (ii) the number of such groups to be connected in parallel, and (iii) the cell area, such that the PEMFC assembly delivers the rated voltage at the rated power while the cost of building the assembly is as low as possible. Simulation results show that the proposed method outperforms four of the best-known methods in the literature. The improvement in performance afforded by the proposed algorithm is validated with statistical tests of significance.
1. Introduction
Energy has been identified as “humanity’s number one problem for the next 50 years” [1]. Fuel cells [2,3,4,5,6], which are electrochemical devices that convert chemical energy to electrical energy, offer a viable alternative to fossil-fuel-based sources of energy. Of the various types of fuel cells (see, for example, references [6,7] for an overview), proton exchange membrane (or polymer electrolyte membrane) fuel cells, or PEMFCs, provide a relatively low-cost, low-temperature, high-efficiency and near-zero-pollution energy source for both stationary and portable applications. This paper uses a machine learning approach to address a PEM fuel cell problem of practical interest.
Fuel cells are complex systems whose electrochemistry, thermodynamics and engineering have not yet been fully understood. The complexity and non-linearity of the underlying processes in fuel cells (and many other areas of energy research) are often too hard for physics-, chemistry-, or engineering-based methods to build and interpret models that offer a balance between theoretical rigor and practical usefulness. It is here that data-driven or machine learning approaches come in handy, providing an alternative route to predictive analysis and parameter optimization.
The literature on the use of machine learning in energy research is vast; a few representative examples are mentioned below. A recent example of the use of deep learning (e.g., [8]) in nuclear fusion energy is found in [9] where a method has been developed for predicting disruptive instabilities in controlled fusion plasmas in magnetic-confinement tokamak reactors. Related work on the same type of problem has used machine learning strategies such as neural network [10,11,12], fuzzy logic and regression trees [13], support vector machine classification [14], and genetic algorithms [15]. Deep neural network is shown to outperform linear regression and (shallow) neural network for a short-term natural gas load forecasting application [16]. A problem of allocating optical links for connecting automatic circuit breakers in a utility power grid has been solved using a multi-objective genetic algorithm (NSGA-II) [17] in [18]. Energy optimization under performance constraints in chip multiprocessor systems has been addressed in [19] where deep neural network is shown to outperform reinforcement learning (e.g., [20]) and Kalman filtering (e.g., [21]). Training a neural network on weather and turbine data, Google’s DeepMind system predicted “wind power output 36 hours ahead of actual generation … [and] boosted the value of … wind energy by roughly 20 percent” [22]. A study of the prediction of hydrogen production via biomass gasification is undertaken in [23] where the following four algorithms (e.g., [24,25]) are used: linear regression, K-nearest neighbors regression, support vector machine regression, and decision tree regression.
Modeling, simulation, design, development and control of fuel cells and fuel cell-based systems present a variety of challenges, some of which have, in recent years, been formulated as classification, clustering, regression or optimization and addressed with computational (algorithmic) approaches based on machine learning. Examples abound; just a few illustrative ones are mentioned here. Genetic programming (e.g., [26,27]) is used in a supervised learning mode for static and dynamic (load-following) modeling of solid oxide fuel cells (SOFCs) [28] and also in the optimization of the forming process of bipolar plates of PEMFCs [29]. Differential evolution (e.g., [30,31]) is used to optimize seven parameters for modeling the polarization curve of a PEMFC stack [32]. An adaptive-neuro-fuzzy-inference-system-based maximum-power-point-tracking controller is designed [33] for a proton exchange membrane fuel cell system used in electric vehicle applications. Fault classification of PEMFCs that are used in trams is achieved by employing a hidden Markov model (HMM) approach (e.g., [25]) in [34] where K-means clustering (e.g., [24]) is used to eliminate singular data points as part of the preprocessing step before the HMM is applied; classification results produced by the HMM are shown to be better than those produced by the support vector machine method. Deep learning is applied [35] to sequence fault diagnosis of a PEMFC water management subsystem. Particle swarm optimization [36] is used in component sizing for a PEMFC-battery hybrid system for locomotive applications [37]. Adaptive neuro-fuzzy controllers are designed [38] for performance enhancement of PEMFCs. A prediction method for PEMFC performance degradation is proposed [39] using long short-term memory (LSTM) recurrent neural network [40] along with the auto-regressive integrated moving average (ARIMA) method.
In practical applications, for meeting specific output voltage, current or power requirements, a number of fuel cells are typically assembled in series and/or parallel connections. The design of the stack configuration of fuel cells for use as stand-alone power-supply systems presents engineering challenges and is a non-trivial computational problem (e.g., [41,42,43]). This paper addresses the design problem considered in [41,42,43]. Specifically, the problem is to find optimal (or near-optimal) values for (i) the number of PEMFCs to be connected in series in a group, (ii) the number of such groups to be connected in parallel, and (iii) the cell area, such that the assembly delivers the rated voltage at the rated power at the minimum possible cost. The current delivered by a cell depends on, among other factors, its active surface area. The total number of cells and the cell area determine the physical size of the stack and thus the space needed to install the stack in a household setting. A second motivation for seeking to reduce the total stack area is that Pt used in the electrodes is expensive [43]. In references [41,42], computational heuristics were used to solve this design optimization problem, with the heuristic of [42] shown to be better than the genetic algorithm of [41]. In the present paper, we develop an algorithm (heuristic), based on ideas from the population-based evolutionary algorithm (e.g., [44]) and the Jaya algorithm [45], producing results better than those reported in [41,42,43] and those produced by the Jaya algorithm [45].
The remainder of this paper is organized as follows. Section 2 provides the background along with a detailed description of the problem. Section 3 develops the new method and derives the computational complexity of the algorithm. Simulation results are presented in Section 4 where performance comparisons of the competing algorithms are provided using statistical tests of significance. Conclusions are drawn in Section 5.
2. The Problem
2.1. Theoretical Background
Part of this subsection is taken from [46,47]. The reversible thermodynamic potential or equilibrium voltage or open-circuit electromotive force (EMF) of the fuel cell is given by the Nernst equation, which is generally considered to be the cornerstone of fuel cell thermodynamics [2,46,47]:
where is the reference (standard) EMF at unit activity and atmospheric pressure, i and j are the numbers of reactant and product species, a represents the activity, is the stoichiometric coefficient of species i, R is the universal gas constant, F is Faraday’s constant, n is the number of electrons transferred for each molecule of the fuel participating in the reaction, and T is the temperature [46,47]. For a hydrogen-oxygen fuel cell (e.g., SOFC or PEMFC), the reactants are hydrogen and oxygen, and water (steam) is the product. (This paper uses many of the notations of [42,46]. A nomenclature is provided at the end of the paper.)
The reference EMF, , depends on temperature T:
where is the standard EMF at temperature , and is the change in entropy. The activity a of an ideal gas is given in terms of its pressure (or partial pressure) p:
where is the standard-state pressure (1 atm).
When the fuel cell is operated below 100 °C, so that liquid water is produced (as in proton exchange membrane fuel cells), the activity of water can be taken to be unity ( = 1). In that case, the Nernst equation takes the form [46,47]
where use has been made of the fact that for a hydrogen fuel cell.
The terminal (output) voltage is generally obtained by subtracting from the following types of losses (or “irreversibilities”) [46,47]:
- Activation loss
- Concentration loss
- Ohmic loss
- Fuel crossover and internal current loss
Defining the current density, , as
where represents the cell active area, we can express the activation loss, caused by the slowness of the electrochemical reactions taking place on the surface of the electrode [6], as
where is the electron transfer coefficient, is the current density, and the exchange current density (with , so that the logarithm is positive). Since the Tafel constant (Tafel slope) A [2] is given by
this loss can be expressed as
Concentration loss or mass transport loss results from the decrease in concentration of the reactants at the triple-phase-boundaries as the fuel is used:
where B is a parametric coefficient (V), and (>) is the limiting current density.
Ohmic loss is caused by the electrical resistance of the electrodes, the polymer membrane, and the conducting resistance between the membrane and the electrodes [2]:
where is the area-specific resistance.
Losses also occur because of the leakage of fuel through the electrolyte and because of internal currents [48]. This type of loss is usually modeled by an additional component of current, called the fuel crossover and internal current.
As mentioned above, the terminal voltage of a single cell is obtained by subtracting the four losses from . This simple, lumped, zero-dimensional [46] model leaves out the issues of, for example, parasitic power, aspect ratio, heat diffusion, water blockage, and flow channel effects.
2.2. Problem Statement
When a number of (identical) fuel cells are connected in series, the equivalent voltage of the group is the sum of the voltages of the individual cells, while the same amount of current flows through each cell. For a parallel connection of the cells, on the other hand, the total current is given by the sum of the individual currents, while the voltage of the group is the same as that of an individual cell.
Consider a group of cells connected in series and such groups connected in parallel. Let the entire assembly be called a stack. If represents the load current density for the entire assembly (stack), the stack terminal voltage is given by [42]
where is an additional component of current density brought into the equation to account for the combined effect of fuel crossover and internal current. The fuel crossover and internal current (the “leakage” or “lost” current) does not “flow” through the same path that the “regular” current follows [48], and therefore the ohmic loss for a cell should be modeled by the expression . For conformity with the treatment in Ref. [42] (this is a primary reference against which comparative results are studied later in this paper), however, the expression for in Equation (9) retains the inexact inclusion [48] of in the calculation of the ohmic loss term. Table 1 shows the values of the cell parameters used in this problem.
Table 1.
PEMFC single cell parameters (from Refs. [41,42]).
Given the cell parameters (except for the cell area) and the lower and upper bounds on the number of cells and also on the cell area, the problem is to produce an optimal stack design that minimizes the number of cells in series in each group, the number of such groups in parallel, and the cell area such that the rated load voltage at the maximum power point of the stack is 12 V and the maximum power is at least 200 W (corresponding to 730 kWh per year) [41,42]. These requirements come from a “research project aimed to design a power supply system to provide dc electricity to a single dwelling in a remote area of a developing country” [41]. As in references [41,42,43], the rated load voltage and the rated power come from the problem statement.
2.3. Previous Work on This Problem
A simple selection-crossover-mutation genetic algorithm was applied [41] to find (near-)optimal values of the three variables , and . The linear ranking selection used in that work favored trial solutions with a low difference between the rated voltage (12 V) and the trial solution’s maximum-power-point voltage (the sampling method accompanying the selection strategy was stochastic universal sampling). A penalty was applied to trial solutions not meeting the stipulated power of 200 W. The exact definitions of the objective function and the penalty function, however, are not provided in that paper.
A stochastic heuristic was developed in [42] to solve this problem. That method searched for the three variables , and by minimizing an objective function that linearly combined several terms, favoring low values for all the three variables, in addition to favoring small absolute differences between the problem-specified rated voltage and the trial solution’s output voltage at the maximum power point. An additive component of the objective function was a penalty term applicable to trial solutions producing a maximum power below the stipulated power (200 W). Two different penalty schemes were used: static (stationary) and dynamic (time-variant). For varying the relative importance of the different components of the objective function, different numerical constants were used, with values chosen heuristically. Unlike the genetic algorithm or any other member of the evolutionary computation family, that heuristic was point-based, not population-based.
Techniques from qualimetric engineering and extremal analysis were employed [43] to attack this problem by minimizing the total stack area . Unlike the previous two methods (and unlike the present paper’s approach), that approach used the data provided in the problem statement on the rated voltage and the rated power to simplify (reduce) the problem to one where the load current at the maximum power point was calculated simply as 200 W/12 V ≈ 16.667 A and was held fixed at this value for the rest of the computation (never revised to obtain the true current at the maximum power point). A second major simplification was achieved by using the Taguchi method [49] to set to its minimum possible value, namely unity. Finally, values of and were determined by using a quasi-analytical approach involving a mix of numerical estimation, approximation and curve-fitting.
The authors of [41,42] obtained a trial solution’s maximum-power-point voltage numerically from Equation (9), by iterating over current. That iterative computation was avoided in [43] through the use of the fixed value of 16.667 A as an estimate of the current corresponding to the maximum power point.
2.4. Cost Function
Following reference [42], the objective or cost function to be minimized is given by
where = 12 V represents the rated output terminal voltage of the PEMFC stack; stands for the output voltage at the maximum power point of the stack; = 200 W is the rated output power of the stack; is the maximum output power of the stack; , , and = 0.001 are (heuristically chosen) positive constants [42] that allow us to vary the relative weights of the different components of the cost function; and the penalty term is zero if is at least but positive otherwise. and are obtained numerically from Equation (9), via iterations over current (see Section 4.2).
The cost function can be considered an anti-fitness function where the fitness function is traditionally maximized in the evolutionary computation literature. For the values in Table 1 and Table 2, the cost function is positive. Fewer cells and smaller cell areas lead to lower costs. A larger deviation (in either direction) of the maximum-power-point voltage from the rated voltage makes the cost higher (worse). For a penalty-free solution vector () (i.e., one for which ), the above cost function is some measure of the monetary cost of building the stack. The present paper uses the adaptive penalty method (Section 3.6 of [42]) along with the coefficients/constants (Table 3 of [42]) of [42].
Table 2.
Bounds of the design variables (from references [41,42]).
3. The Improved Algorithm
The proposed algorithm is based on the general concept of evolutionary algorithms (e.g., [44]) and particularly on the Jaya algorithm [45]. The pseudocode of the original Jaya [45] is shown in Algorithm 1.
| Algorithm 1: Jaya. |
![]() |
The improved method modifies the Jaya algorithm in two ways. The first of these two modifications causes a major change in the algorithm design, while the second is relatively minor. The outline pseudocode of the improved algorithm is presented in Algorithm 2.
| Algorithm 2: Pseudocode of the improved algorithm. |
![]() |
The first modification introduces a new policy for updating (and using) the best and the worst individuals. In a population of size N, the individuals (trial solutions) can be thought of as occupying N consecutive positions (slots or locations) in an array or some other data structure. Each individual is a vector of three variables (parameters): an integer representing the number of cells in series in a group (), a second integer for the number of parallel groups (), and a floating-point number representing the cell area (). Population initialization is done by choosing values for each of the three problem parameters (, , ) uniformly randomly from the interval defined by their respective lower and upper bounds (Table 2).
A new individual is produced from the current individual at generation g (see Line 9 in Algorithm 2) as follows:
for i = 1 to 3:
where , i = 1 to 3, represent the three variables (, , ) to be optimized; and are random numbers between 0.0 and 1.0; and and represent, respectively, the best and the worst individual in the population at the time of the creation of from . If happens to fall outside its bounds, it is clamped at the appropriate (lower or upper) bound.
In the standard Jaya algorithm, the population’s best individual and the worst individual are determined once at every new generation and are not updated during the course of a generation; that is, they are determined again at the next generation.
In the improved algorithm, however, whenever a new individual replaces an existing individual, we check whether the new individual is better (less costly) than the current best individual, and if it is, it (the new individual) becomes the current best, outsmarting the previous (most recent) best individual. By (conditionally) updating the best individual as soon a new individual appears in the population, our method is able to utilize the new (hitherto unavailable) information at the earliest possible opportunity, without having to wait for the entire (new) population to be created. This demonstrably leads to a better search for the (near-)optimal solution.
A similar strategy can in principle be used to continually update the worst individual in the population. That, however, turns out to be computationally expensive because finding the worst individual is an operation (where N is the population size), which would entail a total additional cost of per generation. We bypass this problem by putting, at each generation, the population’s worst individual at a location (position in the population) that we know will be the last one to be accessed during the course of any given generation. Since, at any generation, the individuals in a population are processed sequentially, starting with the first and ending with the last, the last position is a sensible choice for holding the worst individual. At each generation, just before the beginning of the loop over all individuals, we swap the population’s worst individual with the one at the last location (position) in the population.
The proposed algorithm’s policy of continual update of the best and the worst individuals in the population renders the concept of the generation unnecessary or irrelevant. The proposed method thus implements a “steady-state” [50,51] mechanism of sorts. The reasons we still keep the generation loop are twofold: (i) we need to keep the worst individual at the last position; and (ii) the Jaya algorithm resets the random parameters (, etc.) exactly once every generation, and therefore for a head-to-head comparison with Jaya, the proposed algorithm needs to do the same.
The second modification consists in altering the acceptance criterion for the new individual. The standard Jaya algorithm accepts the new individual (the child) only if it is better than the original individual (the parent). The proposed method accepts the new individual if it is better than or equal in cost to the original. This keeps the search process moving even on a flat landscape ([31,52]).
Note that inside the for-loop spanning Lines 8–22 in Algorithm 2, we do not need to check if the “incoming individual” is worse than the current individual because the “incoming individual” is guaranteed to be at least as good as the one that it replaces. Note also that the if-statement on Lines 25–27 is required because, after the completion of the for-loop (on Lines 8–22), the best individual in the entire population may reside at the “last position” and thus take part in the swap operation (on Line 24). This is essentially the same reason why the finding of the best individual and the initialization of bestPosition to the location (position) of the best individual (Line 4) must follow, not precede, the swap statement on Line 3 of the pseudocode.
Computational Complexity
The improved method (Algorithm 2) incurs nominal extra computational cost compared to Jaya. The additional computation involves:
- the swap operation in Line 3;
- the assignment in Line 7;
- the two conditional updates of bestPosition (Lines 16–18) and newWorstPosition (Lines 19–21) inside the for-loop; and
- the if-statement (Lines 23–28), with the swap (Line 24) and the conditional update of bestPosition (Lines 25–27) within it.
For an entire generation, therefore, the total additional cost is no more than linear in the population size, or .
4. Simulation Results
The relative effectiveness of five methods in optimizing the stack design are studied in this section. We present simulation results of head-to-head comparisons between the proposed algorithm and Jaya, between the proposed algorithm and the stochastic heuristic of [42], between the proposed algorithm and the genetic algorithm of [41], between the proposed algorithm and the quasi-analytical method of [43], and between Jaya and the heuristic of [42].
For a comparative analysis, the performances of the competing algorithms were observed at the same number of objective function evaluations; this number is given by the parameter maxEvals, which is the product of the number of generations (maxGen) for which the algorithm was run and the population size (popSize). Thirteen configurations were randomly chosen to cover maxEvals values from 200 to 10,000 (see Table 3).
Table 3.
Parameter settings used in different runs of the competing algorithms.
4.1. Proposed Algorithm vs. Jaya
The proposed algorithm and Jaya were coded in Java, and for each of the thirteen configurations in Table 3, 100 independent runs of either method were executed. The best, mean, and standard deviation of the best-of-run costs from 100 independent runs of either algorithm are presented in Table 4 for each of the thirteen configurations. Since this is a problem of minimization, numerically lower values of the cost function indicate better performance.
Table 4.
Comparison of the solution quality between Jaya and the proposed method (each row corresponds to 100 independent runs).
The proposed method outperforms Jaya in the majority of cases on each of the following three metrics: (a) the mean of the 100 best-of-run costs; (b) the best of the 100 best-of-run costs; and (c) the standard deviation of the hundred best-of-run costs.
Results of large-sample unpaired tests between the two means of the best-of-run costs in Table 4 are shown in Table 5, where the difference was obtained by subtracting the proposed method’s cost from the corresponding cost produced by the Jaya algorithm. The sample size is 100 for either algorithm for each of the thirteen configurations. For all configurations but one, the proposed method is apparently better, judging by the sign of the z-score. The p-values (not shown in this table) corresponding to many of the z-scores are small, but it is arguable whether they are small enough for the difference to be statistically significant (much depends on the choice, arguably arbitrary, of the level of significance). For the lone negative z-score, the relatively high value of p implies that the proposed method is certainly not inferior to Jaya.
Table 5.
Large-sample unpaired tests between the two means of best-of-run costs in Table 4.
That each configuration in Table 3 represents a certain fixed set of algorithm parameter values implies that a performance comparison confined to any one configuration presents only a fragmented picture of the capabilities of the competing algorithms. For a more meaningful analysis, therefore, a configuration ensemble must be studied. This is what is done in Table 6 where Wilcoxon signed-rank tests are performed on the data of Table 4. The thirteen configurations yield as many independent samples of each algorithm’s performance in Table 6 where two performance metrics are considered separately: the average of the 100 best-of-run costs and the best of the 100 best-of-run costs.
Table 6.
Wilcoxon signed rank tests on the data in Table 4 (sample size = 13).
A comparison of the means of the best-of-run costs of the two methods (the “Mean” column under Jaya versus the “Mean” column under “Proposed Algorithm” in Table 4) is shown in the second column of Table 6 where a two-sample paired test was performed to test the null hypothesis
against the one-sided alternativeJaya mean - Proposed algorithm mean = 0
Jaya mean > Proposed algorithm mean.
In this table, n denotes the effective sample size, obtained by ignoring the zero differences, if any. The W-statistic was obtained as min(). The critical W was obtained from standard tables [53]. The z-score was obtained (using the normal distribution approximation) as
where
and
The p-value was calculated from standard tables of the unit normal distribution. The data in the second column of Table 6 show that, for the given sample size and the given significance level, the W-statistic (=13) is less than the critical W (=21), a fact that allows us to reject the null hypothesis in favor of the alternate hypothesis. Again, the p-value is less than 0.05; thus, the difference, Jaya mean—Proposed algorithm mean, is significant at the 5% significance level.
The last column of Table 6 shows the comparison between the two best values. The effective number of samples, n, is 12 in this case. Unlike the comparison in the second column of the table, the W statistic (=24) in the last column is larger than the corresponding critical W (=17), which means that the null hypothesis cannot be rejected at the 5% level. The z-score and the p-value, however, do not establish any superiority of Jaya over the proposed method at the 5% level.
The results in Table 4, Table 5 and Table 6 are all about the quality (cost) of the solution without any reference to the computation time needed to obtain that quality. In the next three tables, we present a comparative study of the time (as measured by the number of cost function evaluations) taken by the algorithms to obtain solutions of a given quality. Specifically, we set a cut-off or target cost of 13.5 (following [42]) and record, for each run of the algorithm, the number of cost evaluations needed to produce, for the very first time in the run, a solution better (that is, numerically smaller) than or equal to that target cost. Let us denote by firstHitEvals the number of evaluations needed by an algorithm to hit (meet or beat) the target for the very first time in a given run (execution) of the algorithm. Given that the algorithms discussed here are really heuristics, once an algorithm hits the target for the very first time in a run, it is generally likely (but never guaranteed) that it will produce better solutions than the target value if the run is allowed to proceed further, that is, if the run consumes more evaluations than firstHitEvals. Note also that it is possible for an algorithm to never hit the target in a given (finite) number of evaluations.
Comparative results on the firstHitEvals metric are presented in Table 7 where each row was obtained from 100 independent runs of either algorithm. For each of the two competing algorithms, the #Success column shows the number of runs (out of the 100) that did hit the target within the maximum quota of maxEvals evaluations (recall that the maxEvals values for different configurations are given in Table 3). The proposed method’s success rate is better than its competitor’s in all thirteen cases. Again, the mean of firstHitEvals is smaller (better) for the proposed method in the majority of configurations; the pattern with the standard deviation of firstHitEvals is similar.
Table 7.
Mean and standard deviation of firstHitEvals (each row is obtained from 100 independent runs for each algorithm).
Table 8 shows the results of large-sample unpaired tests on the two means of firstHitEvals in Table 7. Unlike Table 5, which has a sample size of 100 for either algorithm for each of the 13 configurations, Table 8 does not in general have the same sample size for the two algorithms for a given configuration, and the sample sizes are not always 100 (this is simply because not all runs in Table 7 were successful). For example, the sample sizes for the second configuration in Table 7 (and also in Table 8) are 34 and 42.
Table 8.
Large-sample unpaired tests on the two means of firstHitEvals from Table 7.
The use of the sample standard deviation in lieu of the population standard deviation in the calculation of the z-statistic in Table 8 is not inappropriate because the sample sizes, with the exception of the first configuration, are greater than 30, so that the central limit theorem may be used to justify the normal distribution approximation [54].
With a few exceptions, the p-values corresponding to the z-scores in Table 8 cannot be used to show the superiority (in a statistically significant way) of the proposed method for individual configurations. As discussed earlier, the overall algorithm behavior across different configurations can be captured by the configuration ensemble. This is done in Table 9 where Wilcoxon signed rank tests conducted on the data in Table 7 are presented. The results in Table 9 show that not only does the proposed algorithm succeed in hitting the target significantly more often than its competitor, it also needs significantly fewer evaluations to achieve that feat.
Table 9.
Wilcoxon signed rank tests on the data in Table 7 (sample size = 13).
The next three tables show the performance of the algorithms on the metric of firstHitCost, which is the cost of the solution obtained at firstHitEvals; clearly, firstHitCost is less than or equal to the target cost. The mean and standard deviation of firstHitCost values obtained from 100 independent runs (these runs are the same as the ones used to obtain firstHitEvals in Table 7) are shown for each of the two competing methods in Table 10.
Table 10.
Mean and standard deviation of firstHitCost (each row is obtained from the same set of 100 independent runs used in Table 7 for each algorithm).
Table 11 shows results of large-sample unpaired tests on the two means of firstHitCost in Table 10. As before, the one-tailed p-values (not shown in Table 11) produced by the z-scores fail to establish a statistically significant outcome with respect to individual configurations, a fact that leads us to the statistical analysis of the configuration ensemble in Table 12, which presents the results of the Wilcoxon signed rank test on the set of 13 samples in Table 10. Table 12 shows that the average cost of the solution produced after consuming firstHitEvals evaluations is statistically significantly better (at the 5% level) for the proposed method.
Table 11.
Large-sample unpaired tests on the two means of firstHitCost in Table 10.
Table 12.
Wilcoxon signed rank test on the data in Table 10 (sample size = 13).
The best stack designs produced by Jaya and the proposed algorithm, given by the solution vectors (, , ) and the corresponding , and cost, are shown in Table 13 where each row corresponds to a given configuration (popSize-maxGen combination). A solution in this table represents, for the corresponding algorithm and the corresponding configuration, the best of all the solutions produced in the 100-run suite. While the proposed method’s best solution produces a better (lower) cost in the majority of cases in Table 13, neither algorithm is statistically significantly better than the other on this particular metric (as seen in the results of Table 6). The mean best solution of the proposed method, however, was found to be statistically significantly better than that of Jaya (Table 6).
Table 13.
Best of best-of-run solutions (stack designs).
4.2. Effect of Current Step Size on Numerical Calculations of and
The maximum power point corresponding to a given stack configuration vector () and a given set of cell-parameter values (, etc.) cannot be obtained analytically. In other words, the problem of finding
with given by Equation (9), is not analytically solvable. A numerical, iterative method has been used in this paper, where a loop over current values is executed in order to numerically find an approximation to the maximum power point. Since the maximum power point needs to be determined for every single cost (objective function) evaluation, the total computation time to complete all such iterations for all runs in a test suite may not be trivial. It is, therefore, good to be able to use a relatively large step size in incrementing the current value in the loop. A large step size, however, reduces the accuracy of the computed and . The results presented so far in this paper used a current step of 50 mA. To study whether the step size affects the conclusions about the relative performance of the algorithms, we obtain a new set of results using a step size of 1 mA. These results, arguably more accurate than their 50 mA counterparts, are presented in Table 14, Table 15 and Table 16. A target cost of 13.62 is used in Table 15 because the previously used target of 13.5 was met by no run in the new set of results. Results of Wilcoxon signed-rank tests on the new set of data are shown in Table 17 which establishes the proposed method as significantly better than Jaya on the success rate as well as on the mean of best-of-run costs. Jaya is significantly better on the best of best-of-run costs metric, while the methods are statistically tied (the one-tail p-value is close to 0.5) on the remaining two metrics. Given the vagaries of chance, for stochastic heuristics, the average performance rather than the single best performance is generally considered to be a reliable indicator of a method’s performance. The new data, therefore, do not offer a convincing reason to argue that Jaya performs better than the proposed method. Combining the new results with the old ones for all five performance metrics, we conclude that the proposed heuristic meets or beats the Jaya algorithm.
Table 14.
Comparison of the solution quality between Jaya and the proposed method. Each row corresponds to 100 independent runs. Current step size = 1 mA.
Table 15.
Comparison between Jaya and the proposed method on firstHitEvals and firstHitCost (each row corresponds to 100 independent runs). Current step size = 1 mA, target cost = 13.62.
Table 16.
Best of best-of-run solutions (stack designs). Current step size = 1 mA.
4.3. Proposed Algorithm vs. Point-Based Stochastic Heuristic
We now present performance comparisons of the proposed method against the heuristic of [42]. Each of the four maxEvals values reported in Table 9 of [42] corresponds to two different configurations in Table 3 of the present paper (see Table 18).
Table 18.
Correspondence between [42] and the present paper on algorithm parameter settings.
The best-cost solution vector from Table 4 on p. 536 of reference [42] is shown in Table 19 where the values of , and are copied from [42], and are computed by the iterative numerical method described in Section 4.2 (using two different values of the current step size), and the cost is computed from Equation (10).
Table 19.
Best solution (stack design) of [42].
The first row of Table 19 shows that the current step of 50 mA produces a close agreement of , and cost values in this table with the corresponding values in Table 4 of [42]. For the rest of this subsection, therefore, results corresponding to only the 50 mA step size are used.
Table 9 of [42] and Table 4 and Table 10 of the present paper show that the proposed algorithm outperforms the algorithm of [42] in all of the following metrics: (i) best of the 100 best-of-run costs; (ii) mean of the 100 best-of-run costs; (iii) standard deviation of the 100 best-of-run costs; (iv) count (out of the one hundred) of successful runs; and (v) mean firstHitCost obtained from the successful runs (the mean firstHitCost of [42] is slightly better for one of the two cases—Configuration 11—of maxEvals = 4000).
To investigate the statistical significance, if any, of the difference between the means of the best-of-run costs of the two algorithms, we performed unpaired t-tests, the results of which are presented in Table 20. We tested the null hypothesis against the one-sided alternative , where and are the (population) means of the method in [42] and the proposed method, respectively. We chose a level of significance of 0.05. Since the sample variances differ by a factor of about 10,000, the standard two-sample t-test cannot be used. We, therefore, used the Smith–Satterthwaite test [32,54] corresponding to unequal variances. The test statistic is given by
and its sampling distribution can be approximated by the t-distribution with
degrees of freedom (rounded down to the nearest integer), where and represent the two sample means, and are the two sample standard deviations, and and are the two sample sizes (100 each).
Table 20.
The method in [42] versus the proposed method: results of Smith–Satterthwaite tests for comparing the means of 100 best-of-run costs for each configuration.
Table 20 presents, for each configuration, the following:
- t-statistic;
- degrees of freedom (d.f.);
- the critical t value (obtained from standard tables of t-distribution) at 95% (right-tail probability of 0.05) and for the specified degrees of freedom; and
- 95% confidence interval for the difference of the two means,at the specified degrees of freedom .
The results show that, for each case, the t-statistic exceeds for the relevant degrees of freedom. Therefore, the null hypothesis is rejected in all of these cases. Furthermore, none of the confidence intervals contains zero. We thus conclude that the improvements produced by the proposed method are statistically significant.
Smith–Satterthwaite tests were also performed for comparing [42] against Jaya, and the results (see Table 21) establish Jaya as significantly better than [42].
Table 21.
The method in [42] versus Jaya: results of Smith–Satterthwaite tests for comparing the means of 100 best-of-run costs for each configuration.
4.4. Proposed Algorithm vs. Genetic Algorithm
The best solution produced by the genetic algorithm in [41] recommends the following stack design: = 21, = 1, = 156.25 cm2. Plugged into Equations (9) and (10), this design yields the maximum power point value and the corresponding cost shown in Table 22 where two sets of calculations (for the two step sizes for current) were used. The proposed method’s best solutions outperform both cases in Table 22 not only on the objective function (cost) but also on the output voltage at the maximum power point: the values in Table 22 are less than the rated voltage of 12 V.
Table 22.
Best solution (stack design) of [41].
4.5. Proposed Algorithm vs. Quasi-Analytical Approach
The best solution produced by the quasi-analytical method in [43] yields the following stack design: = 22, = 1, = 151.4 cm2. Plugging these values into Equations (9) and (10) gives the maximum power point values and the corresponding costs in Table 23. All of the best solutions of the proposed method in Table 13 and Table 16 have a better (lower) cost than those in Table 23. The objective function minimized in [43] is the total stack area given by , and yet the proposed method’s best solutions produce better (smaller) total areas, while meeting the requirements of the rated voltage and rated power.
Table 23.
Best solution (stack design) of [43].
The characterization in [43] of the design vector (22, 1, 154.16) as the best design in [42] does not seem to be correct. The vector (22, 1, 154.16) is mentioned in [42] as a “typical” design, not the best or optimal design. In fact, several of the designs in Table 4 of [42] have cell areas that are smaller than 154.16 cm2, with two of them even smaller than the “optimal” area of 151.4 cm2 in [43]. The design (22, 1, 149.597) from Ref. [42] outperforms the “optimal” design of (22, 1, 151.4) of [43] on the cell area metric as well as on the objective (cost) function for both the step sizes (see Table 19 and Table 23).
4.6. Polarization and Power Characteristics
Figure 1 shows the polarization curve for the stack design produced by the best of best-of-run solutions of the proposed algorithm for Configuration 13 (the last row of Table 16). The corresponding power characteristics are plotted in Figure 2. The nature of these plots agrees well with that of polarization and power characteristics of typical PEM fuel cells in the literature.
Figure 1.
Stack voltage vs. load current plot for the stack given by = 22, = 1, = 148.443337 cm2, and the single-cell parameter values from Table 1.
Figure 2.
Power vs. load current (top) and power vs. voltage (bottom) plots corresponding to the polarization in Figure 1. = 200.003419 W, = 12.246727 V.
5. Conclusions
A PEMFC stack design problem of practical importance has been addressed in this paper. This constrained optimization problem requires finding optimal values of three parameters of the stack configuration (namely, the number of cells in series, the number of groups in parallel, and the cell area) such that the configuration delivers the rated voltage at the rated power (the load voltage at the maximum power point is to be 12 V and the maximum power is to be at least 200 W), while keeping the total cost at a minimum. A new approach has been developed, based on the Jaya algorithm and ideas from evolutionary computation, and the new approach has been empirically (that is, via numerical simulation) compared with the Jaya algorithm and with the methods in [41,42,43], using the following performance measures: (a) the best of best-of-run costs from 100 independent runs; (b) the average of the 100 best-of-run costs; (c) the standard deviation of the 100 best-of-run costs; (d) the number of cost function evaluations needed to reach a pre-determined target cost for the very first time in a run of the algorithm; and (e) the cost of the very first (earliest) solution in a run that meets (or beats) the target cost. The proposed method has been shown to outperform the three methods in [41,42,43] and is competitive with Jaya. The improvement in performance provided by the proposed algorithm has been substantiated with statistical tests of significance.
Following the authors of [41,42,43], this paper considered only a stand-alone PEM fuel cell stack. Auxiliary systems, such as heat exchangers, compressors, etc. that often significantly affect the mass, size and cost of the entire system, were not considered; the membrane type, the type of the bipolar plate material, and many operating and constructive parameters were not considered, either. Inclusion of some of these issues would be in the agenda for future research.
Funding
This research was funded in part by United States National Science Foundation Grant IIS-1115352.
Acknowledgments
Three anonymous reviewers provided detailed comments on an earlier version of this paper.
Conflicts of Interest
The author declares no conflict of interest.
Nomenclature
| Number of series cells in a group | |
| Number of groups in parallel | |
| Nernst potential (open-circuit EMF) of a single cell, V | |
| Standard (reference) EMF of a single cell, V | |
| Standard (reference) EMF of a single cell at temperature , V | |
| T | Temperature, K |
| n | Number of electrons transferred |
| a | Activity |
| Activity of hydrogen | |
| Activity of oxygen | |
| Activity of water vapor (steam) | |
| Change in entropy, J/(mol K) | |
| p | Pressure or partial pressure, atm |
| Standard-state pressure, atm | |
| A | Tafel slope, V |
| B | Concentration loss constant, V |
| Output terminal voltage of the PEMFC stack, V | |
| Rated output terminal voltage of the PEMFC stack, V | |
| Output voltage at maximum power point of the PEMFC stack, V | |
| Rated output power of the PEMFC stack, W | |
| Maximum output power of the PEMFC stack, W | |
| Cell area, cm2 | |
| Current density, A/cm2 | |
| Exchange current density, A/cm2 | |
| Load current density, A/cm2 | |
| Limiting current density, A/cm2 | |
| Crossover and internal current density, A/cm2 | |
| Area-specific resistance of a single cell, KΩ cm2 | |
| Partial pressure of hydrogen, atm | |
| Partial pressure of oxygen, atm | |
| Partial pressure of water vapor, atm | |
| Charge transfer coefficient | |
| Coefficients in the objective function | |
| Activation loss, V | |
| Concentration loss, V | |
| Ohmic loss, V | |
| N | Population size used in the algorithm |
| R | Universal gas constant, J/(mol K) |
| F | Faraday’s constant, Coulombs/mol |
References
- Rajeshwar, K. The Hydrogen Rush: Boom or Bust? Electrochem. Soc. Interface 2006, 15, 43. [Google Scholar]
- Larminie, J.; Dicks, A. Fuel Cell Systems Explained, 2nd ed.; Wiley: Chichester, UK, 2003. [Google Scholar]
- Singhal, S.C.; Kendall, K. (Eds.) High-Temperature Solid Oxide Fuel Cells: Fundamentals, Design and Applications; Elsevier: Amsterdam, The Netherlands, 2003. [Google Scholar]
- Barbir, F. PEM Fuel Cells: Theory and Practice, 2nd ed.; Elsevier: Amsterdam, The Netherlands, 2013. [Google Scholar]
- O’hayre, R.; Cha, S.W.; Colella, W.; Prinz, F.B. Fuel Cell Fundamentals, 3rd ed.; John Wiley: Hoboken, NJ, USA, 2016. [Google Scholar]
- Fuel Cell Handbook, 7th ed.; EG&G Technical Services, Inc.: Dahlgren, VA, USA, 2004.
- Types of Fuel Cells. Available online: https://www.energy.gov/eere/fuelcells/types-fuel-cells (accessed on 28 July 2019).
- Goodfellow, I.; Bengio, Y.; Courville, A. Deep Learning; MIT Press: Cambridge, MA, USA, 2016. [Google Scholar]
- Kates-Harbeck, J.; Svyatkovskiy, A.; Tang, W. Predicting disruptive instabilities in controlled fusion plasmas through deep learning. Nature 2019, 568, 526–531. [Google Scholar] [CrossRef] [PubMed]
- Wroblewski, D.; Jahns, G.; Leuer, J. Tokamak disruption alarm based on a neural network model of the high-beta limit. Nucl. Fusion 1997, 37, 725. [Google Scholar] [CrossRef]
- Cannas, B.; Fanni, A.; Marongiu, E.; Sonato, P. Disruption forecasting at JET using neural networks. Nucl. Fusion 2004, 44, 68. [Google Scholar] [CrossRef]
- Windsor, C.G.; Pautasso, G.; Tichmann, C.; Buttery, R.J.; Hender, T.C.; Contributors, J.E. A cross-tokamak neural network disruption predictor for the JET and ASDEX upgrade tokamaks. Nucl. Fusion 2005, 45, 337. [Google Scholar] [CrossRef]
- Murari, A.; Vagliasindi, G.; Arena, P.; Fortuna, L.; Barana, O.; Johnson, M.; JET-EFDA Contributors. Prototype of an adaptive disruption predictor for JET based on fuzzy logic and regression trees. Nucl. Fusion 2008, 48, 035010. [Google Scholar] [CrossRef]
- Vega, J.; Dormido-Canto, S.; López, J.M.; Murari, A.; Ramírez, J.M.; Moreno, R.; Ruiz, M.; Alves, D.; Felton, R.; JET-EFDA Contributors. Results of the JET real-time disruption predictor in the ITER-like wall campaigns. Fusion Eng. Des. 2013, 88, 1228–1231. [Google Scholar] [CrossRef]
- Rattá, G.A.; Vega, J.; Murari, A.; Dormido-Canto, S.; Moreno, R.; Contributors, J.E.T. Global optimization driven by genetic algorithms for disruption predictors based on APODIS architecture. Fusion Eng. Des. 2016, 112, 1014–1018. [Google Scholar] [CrossRef]
- Merkel, G.D.; Povinelli, R.J.; Brown, R.H. Short-term load forecasting of natural gas with deep neural network regression. Energies 2018, 11, 2008. [Google Scholar] [CrossRef]
- Deb, K.; Pratap, A.; Agarwal, S.; Meyarivan, T. A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE Trans. Evol. Comput. 2002, 6, 182–197. [Google Scholar] [CrossRef]
- Corrêa, H.P.; de Carvalho Vaz, R.R.; Vieira, F.H.T.; de Araújo, S.G. Reliability Based Genetic Algorithm Applied to Allocation of Fiber Optics Links for Power Grid Automation. Energies 2019, 12, 2039. [Google Scholar] [CrossRef]
- Moghaddam, M.G.; Guan, W.; Ababei, C. Dynamic energy optimization in chip multiprocessors using deep neural networks. IEEE Trans. Multi-Scale Comput. Syst. 2018, 4, 649–661. [Google Scholar] [CrossRef]
- Mitchell, T. Machine Learning; McGraw Hill: New York, NY, USA, 1997. [Google Scholar]
- Welch, G.; Bishop, G. An Introduction to the Kalman Filter; Technical Report; University of North Carolina at Chapel Hill: Chapel Hill, NC, USA, 1995. [Google Scholar]
- Witherspoon, S.; Fadrhonc, W. Machine Learning Can Boost the Value of Wind Energy. Published 26 February 2019. Available online: https://blog.google/technology/ai/machine-learning-can-boost-value-wind-energy/ (accessed on 20 June 2019).
- Ozbas, E.E.; Aksu, D.; Ongen, A.; Aydin, M.A.; Ozcan, H.K. Hydrogen production via biomass gasification, and modeling by supervised machine learning algorithms. Int. J. Hydrogen Energy 2019, 44, 17260–17268. [Google Scholar] [CrossRef]
- Tan, P.-N.; Steinbach, M.; Karpatne, A.; Kumar, V. Introduction to Data Mining, 2nd ed.; Pearson: London, UK, 2018. [Google Scholar]
- Russell, S.J.; Norvig, P. Artificial Intelligence: A Modern Approach, 3rd ed.; Pearson: London, UK, 2009. [Google Scholar]
- Koza, J. Genetic Programming: On the Programming of Computers by Means of Natural Selection; MIT Press: Cambridge, MA, USA, 1992. [Google Scholar]
- Banzhaf, W.; Olson, R.S.; Tozier, W.; Riolo, R. (Eds.) Genetic Programming Theory and Practice XV; Springer: Cham, Switzerland, 2019. [Google Scholar]
- Chakraborty, U.K. Static and dynamic modeling of solid oxide fuel cell using genetic programming. Energy 2009, 34, 740–751. [Google Scholar] [CrossRef]
- Huang, Y.; Garg, A.; Asghari, S.; Peng, X.; Le, M.L.P. Robust model for optimization of forming process for metallic bipolar plates of cleaner energy production system. Int. J. Hydrogen Energy 2018, 43, 341–353. [Google Scholar] [CrossRef]
- Chakraborty, U.K. (Ed.) Advances in Differential Evolution; Springer: Heidelberg, Germany, 2008. [Google Scholar]
- Storn, R. Differential evolution research-trends and open questions. In Advances in Differential Evolution; Chakraborty, U.K., Ed.; Springer: Berlin/Heidelberg, Germany, 2008; Chapter 1. [Google Scholar]
- Chakraborty, U.K.; Abbott, T.; Das, S.K. PEM fuel cell modeling using differential evolution. Energy 2012, 40, 387–399. [Google Scholar] [CrossRef]
- Reddy, K.J.; Sudhakar, N. ANFIS-MPPT control algorithm for a PEMFC system used in electric vehicle applications. Int. J. Hydrogen Energy 2019, 44, 15355–15369. [Google Scholar] [CrossRef]
- Liu, J.; Li, Q.; Chen, W.; Cao, T. A discrete hidden Markov model fault diagnosis strategy based on K-means clustering dedicated to PEM fuel cell systems of tramways. Int. J. Hydrogen Energy 2018, 43, 12428–12441. [Google Scholar] [CrossRef]
- Liu, J.; Li, Q.; Yang, H.; Han, Y.; Jiang, S.; Chen, W. Sequence Fault Diagnosis for PEMFC Water Management Subsystem Using Deep Learning with t-SNE. IEEE Access 2019, 7, 92009–92019. [Google Scholar] [CrossRef]
- Kennedy, J.; Eberhart, R. Particle swarm optimization. In Proceedings of the ICNN’95-IEEE International Conference on Neural Networks, Perth, Australia, 27 November–1 December 1995. [Google Scholar]
- Sarma, U. Determination of the component sizing for the PEM fuel cell-battery hybrid energy system for locomotive application using particle swarm optimization. J. Energy Storage 2018, 19, 247–259. [Google Scholar] [CrossRef]
- Sreedharan, D.; Paul, V.; Thottungal, R. Mathematical modelling of polymer electrolyte membrane fuel cell and fuzzy-based intelligent controllers for performance enhancement. Comput. Electr. Eng. 2019, 77, 354–365. [Google Scholar] [CrossRef]
- Ma, R.; Li, Z.; Breaz, E.; Liu, C.; Bai, H.; Briois, P.; Gao, F. Data-Fusion Prognostics of Proton Exchange Membrane Fuel Cell Degradation. IEEE Trans. Ind. Appl. 2019, 55, 4321–4331. [Google Scholar] [CrossRef]
- Hochreiter, S.; Schmidhuber, J. Long Short-Term Memory. Neural Comput. 1997, 9, 1735–1780. [Google Scholar] [CrossRef]
- Mohamed, I.; Jenkins, N. Proton exchange membrane (PEM) fuel cell stack configuration using genetic algorithms. J. Power Sources 2004, 131, 142–146. [Google Scholar] [CrossRef]
- Chakraborty, U. A new stochastic algorithm for proton exchange membrane fuel cell stack design optimization. J. Power Sources 2012, 216, 530–541. [Google Scholar] [CrossRef]
- Besseris, G.J. Using qualimetric engineering and extremal analysis to optimize a proton exchange membrane fuel cell stack. Appl. Energy 2014, 128, 15–26. [Google Scholar] [CrossRef]
- Michalewicz, Z. Genetic Algorithms + Data Structures = Evolution Programs; Springer: Heidelberg, Germany, 1996. [Google Scholar]
- Rao, R.V. Jaya: A simple and new optimization algorithm for solving constrained and unconstrained optimization problems. Int. J. Ind. Eng. Comput. 2016, 7, 19–34. [Google Scholar]
- Chakraborty, U.K. A new model for constant fuel utilization and constant fuel flow in fuel cells. Appl. Sci. 2019, 9, 1066. [Google Scholar] [CrossRef]
- Chakraborty, U.K. Reversible and irreversible potentials and an inaccuracy in popular models in the fuel cell literature. Energies 2018, 11, 1851. [Google Scholar] [CrossRef]
- Chakraborty, U. Fuel crossover and internal current in proton exchange membrane fuel cell modeling. Appl. Energy 2016, 163, 60–62. [Google Scholar] [CrossRef]
- Taguchi, G.; Chowdhury, S.; Wu, Y. Taguchi’s Quality Engineering Handbook; Wiley: Hoboken, NJ, USA, 2004. [Google Scholar]
- Syswerda, G. A study of reproduction in generational and steady state genetic algorithms. In Foundations of Genetic Algorithms 1; Rawlins, G.J.E., Ed.; Morgan Kaufmann: San Mateo, CA, USA, 1991; pp. 94–101. [Google Scholar]
- Chakraborty, U.K.; Deb, K.; Chakraborty, M. Analysis of selection algorithms: A Markov chain approach. Evol. Comput. 1996, 4, 133–167. [Google Scholar] [CrossRef]
- Jansen, T.; Wegener, I. Evolutionary algorithms-how to cope with plateaus of constant fitness and when to reject strings of the same fitness. IEEE Trans. Evol. Comput. 2001, 5, 589–599. [Google Scholar] [CrossRef]
- Critical Values for the Wilcoxon Signed-Rank Test. Available online: https://math.ucalgary.ca/files/math/wilcoxon_signed_rank_table.pdf (accessed on 20 June 2019).
- Johnson, R.A. Miller & Freund’s Probability and Statistics for Engineers, 6th ed.; Pearson: London, UK, 2000. [Google Scholar]
© 2019 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

