2.1. Description and Modeling of Ship SOFC System
Due to the high cost, fault-related risks, and long experimental cycle of standalone SOFC power generation systems, this study investigates a shipboard standalone SOFC power generation system by means of dynamic modeling. Based on the available experimental data and literature, a system-level dynamic model is established in the MATLAB (Version R2022b)/Simulink environment according to the Nernst equation, energy conservation, and species conservation, and is then validated using experimental data to provide a basis for subsequent fault simulation and diagnosis [
21].
The SOFC system consists of the stack and the balance-of-plant (BOP). The BOP mainly includes heat exchangers, a combustor, a blower, and mass flow controllers (MFCs) [
22]. Among these components, the SOFC stack is the core part of the whole system. In this study, the stack model is divided into five small stack modules (nodes) using a finite-volume-like discretization idea. Each node can operate independently, and the five nodes are connected in series to form the complete stack model of the SOFC system. Each node is further divided into a solid control unit, a fluid control unit, and an electrochemical performance module. The solid control unit includes the metal interconnect and the PEN structure, while the fluid control unit includes the fuel channel and the air channel. The state variables of each control unit are calculated based on conservation laws, and the output of one node serves as the input of the next node. The overall process configuration of the standalone SOFC power generation system is shown in
Figure 2.
- i.
Electrochemical performance model
The electrochemical model is used to describe the stack output voltage and its polarization loss characteristics [
23]. The stack voltage
is expressed as the product of the single-cell voltage
and the number of cells, as given by
where
is the number of cells. The single-cell voltage is defined as the open-circuit voltage minus the various polarization losses [
24]:
where
denotes the Gibbs free energy at temperature
, and
denotes the partial pressure of each gas species. The terms
,
, and
represent the activation loss, ohmic loss, and concentration loss, respectively. The activation loss mainly results from the kinetic limitation of the electrochemical reactions at the electrode surface and is usually related to current density and exchange current density [
25]. The ohmic loss is mainly caused by the electrolyte, interconnect, and contact resistance, and is usually related to material conductivity and operating temperature [
26]. The concentration loss reflects the mass-transfer limitation caused by the concentration difference between the electrode surface and the main gas channel [
27]. The overall stack output voltage is obtained by iterating the single-cell voltage of each node, thereby characterizing the voltage distribution and performance variation in the stack along the flow direction.
- ii.
Energy conservation equation
The energy conservation equation is used to describe the temperature dynamics of the solid and fluid units in each node. For the solid control unit, the temperature variation is determined by electrochemical reaction heat, Joule heat, convective heat transfer with the fluid, and heat conduction between adjacent nodes [
28]. For the fluid control unit, the temperature variation is governed by the enthalpy difference in the gas flow, heat exchange with the solid unit, and the thermal effect induced by gas-phase reactions. The general form of the energy conservation equation is written as follows [
29]:
where
is the flow rate of the control volume,
is the constant-volume heat capacity,
is the temperature of the control volume,
and
represent the inlet and outlet enthalpy flow rates, respectively, and
is the internal heat source term. By establishing coupled energy balances between the solid and fluid control units, the spatiotemporal temperature evolution inside the SOFC stack can be dynamically captured.
- iii.
Species conservation equation
The species conservation equation is used to describe dynamic variations in the molar amounts and mole fractions of each component in the fuel and air channels. Since gas inflow, gas outflow, and electrochemical consumption or generation simultaneously occur on both the fuel side and the air side of the SOFC system, a separate conservation equation is established for each gas component. Its general form is shown as follows [
30]:
where
is the molar amount of species
in the control volume,
and
are the inlet and outlet molar flow rates of species
, respectively, and
is the generation or consumption term caused by electrochemical reactions or reforming reactions. Based on the molar amounts of each component, the gas mole fractions, partial pressures, and total pressure can be further calculated and coupled with the Nernst voltage, reaction rate, and heat generation, thereby enabling the modeling of gas distribution and dynamic response inside the SOFC [
31].
- iv.
BOP modeling equations
In addition to the stack, the SOFC system also includes BOP components such as heat exchangers, a combustor, a blower, and a mixer. To improve the model’s ability to represent the actual operating process, dynamic or quasi-steady-state models are established for each BOP component. Their mechanisms are described through flow, pressure, and temperature equations, while still following species conservation or energy conservation laws. The heat exchanger model is established based on the energy exchange relationship between hot and cold fluids and is used to describe the variation in gas temperature at the inlet and outlet. The combustor model is established according to the combustion reaction of the unreacted fuel components and the corresponding energy conservation relationship, and is used to calculate the heat release and outlet temperature of the exhaust gas combustion. The heat exchanger and combustor models both follow the energy conservation law. The gas mixing process in the mixer is also modeled through thermal interaction among gas streams, and its principle is similar to that of the heat exchanger. The blower power model is developed based on the isentropic efficiency model. The MFC model is realized using a first-order inertial link together with a time-delay element. All BOP components are coupled with the stack model through variables such as flow rate, temperature, species concentration, and pressure, thereby forming the complete dynamic model of the SOFC system.
In
Table 1,
and
represent the fuel heat and convective heat transfer, respectively. In the blower power model,
denotes the air flow rate,
denotes the constant-pressure specific heat of air,
denotes the blower efficiency,
and
denote the inlet and outlet air pressures of the blower, respectively, and
denotes the specific heat ratio of air. In the MFC mechanism equation,
denotes the inertial time constant, and
denotes the time delay.
2.2. Fault Simulation Mechanisms and Equations
Fuel leakage is one of the most influential typical faults in SOFC systems. It mainly changes the concentration, pressure, and molar flow rate of the fuel gas at the stack inlet, thereby weakening the electrochemical reaction intensity and reducing the output performance of the system. Since hydrogen leakage in practical systems usually occurs in the connecting pipelines between different modules, the fuel leakage fault is introduced between the second heat exchanger and the stack in this study so as to better represent realistic operating conditions. To characterize the process of crack initiation, propagation, and eventual stabilization, a fuel leakage fault factor
is introduced into the model. This factor varies from 0 to 1, and its evolution trend is used to simulate the process in which a small pipeline crack gradually expands and then tends to stabilize under the dynamic balance between leakage and pressure. In this way, the effect of fuel leakage on the performance evolution of the SOFC system can be represented. The mechanism equation for fuel leakage can be expressed as follows:
where
is the fuel molar flow rate entering the stack under normal operating conditions, and
is the actual fuel molar flow rate under the fuel leakage condition. The schematic diagram of the fuel leakage fault is shown in
Figure 3.
Electrode delamination is one of the main causes of increased ohmic resistance and performance degradation in SOFC stacks. This fault is usually caused by non-uniform material distribution or material degradation in the electrode, which leads to delamination or cracks inside the electrode. As a result, the transport resistance of gas, ions, and electrons increases, and local temperature non-uniformity occurs, which further aggravates thermal stress and thermal imbalance. The essential effect of this fault is the destruction of the effective reaction region of the electrode, which reduces the effective contact area between the anode and cathode and consequently weakens the electrochemical reaction capability. Since the actual formation of electrode delamination is relatively slow, whereas the simulation time is short, this study assumes that the fault can be formed within a short period. An electrode delamination fault factor
is therefore introduced into the SOFC model, and the fault is simulated by reducing the effective reaction area of a single stack, so as to represent its influence on system performance degradation. The expression and evolution trend of
are given as follows:
where
, in Equation (13), is the Gaussian-type evolution function used to describe the development trend of electrode delamination,
is the fault-evolution variable, and
is the parameter used to determine the standard deviation of the Gaussian function. In this study,
is set to 10. In Equation (14),
represents the accumulated delamination effect, obtained by integrating the evolution function over the interval
, where
denotes the evolution step. Based on the normalized accumulated effect, the fault factor
is constructed in Equation (15), where
is the terminal value of
. In this way,
is used to characterize the gradual reduction in the effective reaction area caused by electrode delamination, thereby representing its progressive influence on SOFC system performance degradation.
Through the above formulation, the gradual evolution of electrode delamination can be described in the model, and its impact on SOFC system performance can be analyzed.
2.3. Fault Diagnosis Strategy
To achieve fault identification for SOFC systems under multiple operating conditions, a partially monotone decision tree (PMDT) is adopted in this study to construct the fault diagnosis model. SOFC fault data exhibit clear complexity: some features show monotonic trends with changes in fault severity, whereas other features exhibit non-monotonic behavior due to the strong thermo-electro-gas coupling of the system. Traditional classification methods usually fail to make full use of such latent ordered information, and therefore their diagnostic performance and generalization ability are limited under complex operating conditions. PMDT can simultaneously handle mixed ordered classification problems composed of monotonic and non-monotonic features, and is therefore suitable for multi-condition SOFC fault diagnosis.
Let the ordered classification sample set be denoted as
, where
is the sample set,
is the feature set, and
is the decision label set with ordinal relationships. The feature set is further divided into a monotonic feature subset
and a non-monotonic feature subset
. If the following conditions are satisfied as
then the dataset can be regarded as a mixed ordered decision set. For mixed ordered features, the mixed dominance set with respect to feature
is defined as
On this basis, the number of consistent samples of feature
with respect to feature
is defined as
and the total number of consistent samples with respect to feature
is further obtained as
Equations (17)–(21) are used to measure the consistency relationship between features and decision labels, and are then employed for feature evaluation and screening. By removing irrelevant or weakly correlated features, the model complexity can be reduced and the robustness and generalization ability of the subsequent classifier can be improved.
In the decision tree construction stage, PMDT takes the ordered classification sample
and the stopping criterion
as inputs. First, the monotonicity of the features is determined based on the screened feature set. If all samples at the current node belong to the same class, the split is terminated. If the impurity reduction condition is satisfied, the optimal split
is generated according to Equation (22):
Otherwise, the optimal split
is generated according to Equation (23):
When , the splitting process is terminated; otherwise, new child nodes are recursively generated until all samples have been classified. In this way, the resulting PMDT model can simultaneously exploit monotonic and non-monotonic feature information and extract diagnostic rules suitable for mixed ordered classification problems. Compared with conventional decision tree methods, PMDT is more suitable for handling the nonlinearity, strong coupling, and multi-condition characteristics widely present in SOFC fault data. Therefore, it is adopted in this study as the fault classification model, and point-biserial correlation is further combined to analyze feature importance, so as to improve the accuracy and robustness of single-fault, compound-fault, and cross-condition fault diagnosis. The diagnostic dataset was generated from the validated SOFC model under different fault scenarios. In the present study, the total dataset size was set to 20,000 samples. The dataset was randomly divided into a training set and a test set at a ratio of 70%/30%, where the training set was used for model construction and the test set was used for performance evaluation.
The PMDT-based fault diagnosis procedure used in this study is summarized in
Table 2. To improve the robustness of the classifier and reduce the dependence on a specific data partition, K-fold cross-validation (with K = 10) was adopted during the training stage. Specifically, the training dataset was randomly divided into 10 non-overlapping subsets, and the final validation performance was obtained by averaging the results over all folds. By using this procedure, the overall performance of the model on the available dataset could be evaluated more reliably, while avoiding overdependence on a particular validation subset.
To further justify the selection of PMDT, additional comparisons with several representative baseline classifiers were conducted for two single-fault diagnosis tasks, namely fuel leakage and electrode delamination. The considered baseline methods include KNN, SVM, Logistic Regression, and Naive Bayes. The comparison results are summarized in
Table 3 and
Table 4.
As shown in
Table 3 and
Table 4, PMDT achieves the highest diagnostic accuracy among all the considered classifiers for both single-fault cases. For fuel leakage diagnosis, PMDT yields a training accuracy of 98.7% and a test accuracy of 95.4%, which are significantly higher than those of KNN, SVM, Logistic Regression, and Naive Bayes. A similar trend is observed for electrode delamination diagnosis, where PMDT also outperforms the other baseline methods, reaching a training accuracy of 89.3% and a test accuracy of 84.2%. These results indicate that PMDT is more suitable for the SOFC fault data considered in this study, because it can better exploit the mixed monotonic and non-monotonic feature relationships embedded in the diagnostic features.
To further quantify the correlation between each fault level and the feature variables, point-biserial correlation (PBC) is introduced for feature analysis. PBC is suitable for measuring the linear correlation between a binary categorical variable and a continuous variable, and can essentially be regarded as a special case of the Pearson correlation coefficient in a binary-classification scenario. In this study, whether a certain fault level occurs is represented as a binary variable, while the measured state parameters or performance indicators of the system are taken as continuous variables, and the correlation between them is then calculated. The expression is given as
where
denotes the binary fault label corresponding to the
-th sample and takes a value of 0 or 1;
denotes the continuous feature variable corresponding to the
-th sample; and
and
are the mean values of the binary variable and the continuous variable, respectively. The larger the absolute value of
, the stronger the correlation between the feature and the corresponding fault level, while its sign reflects the direction of the correlation. Based on Equation (24), the point-biserial correlation coefficient between different fault levels and each feature variable is calculated to evaluate the sensitivity and discriminative ability of each feature for fault identification, thereby providing a basis for subsequent feature selection and interpretation of the diagnostic results.