Research on Data-Driven Performance Assessment and Fault Early Warning of Marine Diesel Engine

Wang, Haiyan; Wang, Zihan; Shi, Biao

doi:10.3390/app15116299

Open AccessArticle

Research on Data-Driven Performance Assessment and Fault Early Warning of Marine Diesel Engine

by

Haiyan Wang

¹

,

Zihan Wang

^1,*

and

Biao Shi

²

¹

Merchant Marine College, Shanghai Maritime University, Shanghai 201306, China

²

SAIC Motor R&D Innovation Headquarters, SAIC Motor Corporation Limited, Shanghai 201804, China

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2025, 15(11), 6299; https://doi.org/10.3390/app15116299

Submission received: 26 April 2025 / Revised: 29 May 2025 / Accepted: 30 May 2025 / Published: 4 June 2025

(This article belongs to the Special Issue Applications and Advances in Marine Traffic Engineering, Maritime Transportation and Offshore Exploitation)

Download

Browse Figures

Versions Notes

Abstract

To enable proactive prediction of marine diesel engine failure time and root causes, thereby reserving sufficient time for maintenance, this study proposes a data-driven multi-algorithm integration framework for performance assessment and fault early warning in marine diesel engines. By integrating the SSD (steady-state detection) algorithm, a data-driven CLIQUE clustering algorithm was chosen for automatic multi-parameter high-dimensional running condition partitioning. This innovative approach overcomes the limitations of traditional single-parameter approaches or dimensionality reduction techniques, significantly enhancing state classification accuracy. The improved classification results subsequently increase the reliability of Mahalanobis distance as a performance indicator for marine diesel engine condition assessment. Finally, the cumulative anomaly method combined with the Yamamoto test was employed for anomaly detection analysis, enabling precise identification of fault occurrence time and establishing an effective early-warning mechanism. The study demonstrates that this technique effectively characterizes the overall performance of marine diesel engines and captures their performance degradation features. Implemented on a 6RT-flex82T marine diesel engine dataset, the method achieved precise prediction of fault occurrence time with early warnings, providing approximately 20 days advance notice for maintenance planning. Furthermore, comparative analyses with existing studies revealed its superior capability in pinpointing the anomaly to the jacket cooling water outlet temperature of cylinder #2. These results confirm the method’s effectiveness in both performance assessment and fault early warning for marine diesel engines, offering a novel approach for intelligent maintenance of shipboard equipment.

Keywords:

performance assessment; data-driven; high-dimensional running condition partitioning; mutation detection; fault early warning

1. Introduction

As the core power generation unit of marine vessels, marine main diesel engines are prone to frequent failures due to prolonged high-load operations under harsh marine conditions. Performance evaluation of these engines enables early prediction of impending failures and proactive warnings, thereby allocating sufficient time for corrective maintenance or preventive interventions. However, the structural complexity and variable operational conditions of marine diesel main engines present significant challenges in achieving reliable performance assessments. Conventional onboard data acquisition systems typically record limited operational parameters that insufficiently reflect the engine’s actual performance state, while critical deep-seated condition parameters remain unmonitored. All measurable parameters exhibit operational condition dependency, with early-stage performance degradation often manifesting as subtle fluctuations in these parameters. Such incipient anomalies are frequently obscured by strong background noise interference, rendering traditional threshold-based warning methods inadequate due to insufficient sensitivity and high false-alarm rates.

With the advancement of big data and communication network technologies, new requirements have been proposed for the intelligent operation and maintenance of marine diesel engines. The concept of the “smart ship” was first introduced by Det Norske Veritas (DNV) in its Future Shipping Industry report in 2014, which significantly promoted global research in this field [1]. In 2015, the China Classification Society (CCS) released the world’s first smart ship code, marking a major step forward for China in the development of smart ship technologies [2]. Subsequently, in 2017, Lloyd’s Register published the Design Rules for Intelligent Ship Systems, further providing technical guidance for this domain [3]. To accelerate the development of intelligent ships, Prognostics and Health Management (PHM) technologies have been gradually introduced into the maritime industry [4]. Data-driven PHM techniques establish fault prediction models by analyzing large volumes of historical data and use real-time equipment data for health management. From early sensor and data logging systems to today’s cloud computing and artificial intelligence technologies, PHM has continuously improved the reliability and availability of equipment [5] while becoming increasingly intelligent and autonomous. Li et al. investigated the application of PHM in marine diesel engines, described the characteristics of traditional fault diagnosis methods, proposed a system architecture for diesel engine PHM, summarized the key enabling technologies, and provided insights into data analysis, fault diagnosis, early warning, and health management [6]. It is evident that health assessment and fault diagnosis of marine diesel engines based on monitoring data, intelligent algorithms, and condition monitoring have become an inevitable trend in the intelligent operation and maintenance of ship propulsion systems. The core objective of intelligent marine diesel engine management lies in the real-time monitoring, evaluation, and analysis of engine health and operational performance to enable timely fault prediction. This technological pathway primarily relies on big data and information technologies to uncover latent correlations among performance parameters through data mining, allowing for accurate identification of both visible and latent fault features. Such approaches significantly enhance the level of intelligent decision making for marine diesel engines [7]. Based on this objective, this study conducts research utilizing monitoring data to analyze and assess the health condition of marine main diesel engines. It quantifies the overall engine performance to visually observe operational trend variations and proposes a fault early-warning technique according to the degradation trend of holistic engine performance. This technique can issue pre-failure alerts before abnormal performance parameters emerge, thereby preventing potential malfunctions. The implementation of this technology enhances navigation safety and reliability while providing theoretical foundations and technical tools for intelligent ship maintenance systems.

During ship navigation, the vessel is not always in a steady state. Monitored data from navigation states such as acceleration, deceleration, and shutdown cannot characterize the overall performance of marine diesel main engines but rather reduce the accuracy of performance evaluation. Therefore, stable running intervals must be identified prior to analysis, with only steady-state navigation data used for performance assessment. Current mainstream steady-state detection algorithms fall into quantitative and qualitative categories. Qualitative methods include the confidence interval method, R-test, combined statistical testing, and SSD algorithm, while quantitative approaches encompass physics-informed neural networks (PINN) and modified adaptive polynomial filtering algorithms. The combined statistical test and confidence interval method proposed by Narasimhan [8] in the 1980s detects steady states by comparing dual means and variances. However, its assumption of zero-mean normal distribution makes it vulnerable to interference in practical applications with insufficient robustness. The R-test improves F-test through variance ratio analysis, yet its threshold is significantly affected by filtering parameters and shows sensitivity to random noise with poor stability. Physics-informed neural networks integrate physical equations (e.g., partial differential equations, conservation laws, etc.) into loss functions to jointly optimize data fitting and physical constraints but exhibit convergence instability for high-dimensional nonlinear problems [9]. The robust adaptive polynomial filtering algorithm (ASF) based on M-estimation detects states through threshold parameter comparison, offering advantages of fewer parameters and strong noise resistance, yet requires manual parameter setting and shows poor adaptability to steady-state fluctuations [10]. In contrast, Kelly et al. used the SSD algorithm, hich constructs threshold criteria using Student’s t-test within window widths and requires only two parameters (window width and significance level), while demonstrating superior performance in multivariate collaborative detection [11]. Given that marine diesel main engine stability identification involves coupled analysis of multiple thermodynamic parameters, this study adopted the SSD algorithm to identify stable operating intervals. Its concise parameter system and multivariate compatibility effectively support steady-state discrimination requirements under complex operating conditions.

Current research on fault prediction can generally be categorized into two approaches: state prediction and state classification. State prediction, constrained by human factors and data quality, suffers from insufficient model accuracy due to challenges in realistically simulating diesel engine operating conditions and quantitative/qualitative modeling with scarce samples. For instance, Liu et al. developed a CNN-BiGRU-based exhaust temperature prediction model for marine diesel engines in which only exhaust temperature data were utilized for prediction. By artificially linearly adjusting the dataset to simulate abnormal exhaust temperature changes when potential faults occur in diesel engines, the model revealed prediction errors [12]. Su et al. innovatively proposed a PCA-CNN-BiLSTM hybrid model, which employs only two parameters (after PCA dimensionality reduction) to represent overall engine performance and considers solely stable operating conditions of a specific diesel engine model with minimal load variations [13]. Consequently, such prediction models struggle to establish corresponding fault prediction frameworks for complex, multivariate, and correlated fault systems [14]. This study addresses this issue by adopting state classification with statistical data analysis methods to process multi-dimensional real-ship monitoring data, capturing temporal variations in statistical features while eliminating subjective influences, followed by autonomous operating condition partitioning. Current mainstream data analysis methods include principal component analysis (PCA), data mining clustering algorithms, and time series analysis. For example, Su et al. used PCA to extract two feature parameters that explained more than half of the characteristics in the sample data [13], while Liu et al. derived performance parameters through coefficient and correlation efficiency definitions, established initial diesel engine performance maps via surface fitting, and tracked performance degradation trends by comparing real-time and initial performance maps [15]. Fault warning methodologies are divided into physics-based models and data-driven approaches. Physics-based methods require precise mathematical or physical models to describe equipment operation [14]. However, under dynamic load conditions, these models often produce larger errors, resulting in systemic biases in the data. In contrast, data-driven approaches construct analytical models based on historical operational data and utilize algorithms to mine potential fault features, effectively circumventing the complexities of physical modeling. With the advancement of intelligent algorithms, data-driven methods have become the dominant research direction in fault prediction for marine diesel engines.

The technical methods employed in this study include data mining clustering algorithms and time series analysis. Clustering algorithms are particularly suitable for solving complex operating condition partitioning in marine main diesel engines. For instance, Zheng et al. utilized the K-means clustering algorithm to partition engine operating intervals, followed by quadratic polynomial fitting to derive relationships between engine load and fuel consumption rates [16]. Perera et al. applied the EM algorithm to identify the frequent operating ranges of marine diesel engines and analyzed engine performance using PCA [17,18]. Erik Vanem et al. explored methods including K-means clustering, Gaussian mixture models, density-based clustering, self-organizing maps, and support vector machines after data dimensionality reduction, concluding that dimensionality reduction may cause information loss. Although these methods are straightforward to implement, they require parameter tuning and validation prior to practical application [19]. Time series analysis investigates statistical regularities in the temporal evolution of sequential data, enabling prediction and control of future events, such as preventing fault occurrences. Zhang et al. integrated time–domain analysis with deep learning algorithms, focusing on temporal characteristics of faults and resulting in higher diagnostic accuracy compared to traditional methods [20]. Je-Gal H et al. proposed combining time–domain and frequency–domain analyses, achieving superior accuracy over frequency–domain methods alone [21].

In current research on marine diesel engine fault prediction, the use of single or limited data sources to evaluate overall engine performance often results in incomplete assessments and introduces systematic bias into model predictions. Moreover, existing studies that focus solely on predicting stable operating conditions for a specific diesel engine model fail to adequately support the evolving dynamic requirements of intelligent ship engine operation and maintenance. To address these limitations, this study selected 22 performance parameters of a marine diesel engine over an extended time span. By incorporating the SSD steady-state detection algorithm, a data-driven approach using the CLIQUE clustering algorithm is herein proposed for autonomous classification of multi-parameter operating conditions. This approach overcomes the constraints of traditional single-variable or dimensionality-reduced methods, enabling objective and precise condition partitioning. The Mahalanobis distance was computed across different time points to construct a time series that provides a more comprehensive and accurate foundation for evaluating engine performance. Finally, a time–domain analysis method was employed for mutation detection in the performance degradation curve, thereby improving the accuracy of fault warning timing. The effectiveness of the proposed methodology was validated through experimental data.

2. Introduction to Performance Assessment and Fault Warning Methods

2.1. Steady-State Interval Identification Algorithm

Ships operate under various navigation states, such as acceleration, deceleration, and stoppage, resulting in the collection of substantial non-steady-state data by sensors. These transient datasets introduce significant interference in the performance evaluation of marine diesel main engines. The SSD (slope sign detection) steady-state detection algorithm, which requires minimal manual parameter configuration and demonstrates strong capability in multi-variable steady-state detection, is particularly suitable for identifying the stable operating ranges of marine diesel main engines [11]. The fundamental principle of the SSD algorithm is as follows:

The SSD algorithm is based on the assumption that the system exhibits dynamic drift. The univariate process signal y(t) of the system is constructed by multiplying the relative time by a non-zero slope, with the mathematical expression formulated as follows:

y_{t} = m t + μ + a_{t}

(1)

In the equation,

m t

represents the dynamic drift component;

t

denotes the cycle index;

μ

is the mean under the assumed steady-state condition, whose value equals the arithmetic mean or sample mean of the window when the slope is zero;

a_{t}

corresponds to a white noise or random error sequence following a normal distribution (0,

σ_{a}

).

The difference between the signal

y_{t - 1}

at the previous time step and the signal

y_{t}

at the current time step is

m = \frac{1}{n} \sum_{i = 1}^{n} (y_{t} - y_{t - 1})

(2)

The steady-state judgment criterion is constructed using Student’s t-test, with the mathematical expression formulated as follows:

|y_{t} - μ| \leq t_{c r i t} σ_{a}

(3)

In the equation,

t_{c r i t}

is the critical value of Student’s t-test, determined from statistical tables based on the significance level

a

and degrees of freedom v; when the significance level is set to 0.05, it implies a 5% Type I error rate (i.e., 5 non-steady-state data points out of 100 tested);

σ_{a}

represents the standard deviation of the window sample, calculated as follows:

σ_{a} = \sqrt{\frac{1}{n - 2} \sum_{t = 1}^{n} {(x_{t} - m t - μ)}^{2}}

(4)

2.2. Operational Condition Classification Algorithm

Clustering algorithms demonstrate advantages in analyzing unlabeled marine diesel engine monitoring data through their capacity to automatically determine optimal cluster numbers via performance metrics, eliminating the need for manual intervention. The grid-based CLIQUE (Clustering In QUEst) algorithm, also recognized as a high-dimensional clustering approach, exhibits particular suitability for this study due to its inherent strengths in autonomous processing of high-dimensional datasets and independence from operator-specific domain expertise.

The fundamental principle of CLIQUE involves partitioning the multidimensional data space into non-overlapping rectangular cells of equal size. Cluster formation operates through density-based evaluation, where a cell exceeding a predefined density threshold qualifies as a dense unit. Cell density is defined by the number of data points contained within it. The final clusters emerge as maximally connected regions composed of adjacent dense cells [22,23].

A Priori Properties of the CLIQUE Clustering Algorithm:

Theorem 1.

Anti-Monotonicity: If a k-dimensional spatial unit is dense, its projection onto any (k − 1)-dimensional subspace is also dense.

Theorem 2.

If any projection of a k-dimensional spatial unit onto a (k − 1)-dimensional subspace contains one or more sparse units, the k-dimensional unit is necessarily non-dense.

Three-Step Procedure for CLIQUE Clustering in Multidimensional Space:

Step 1: Identification of All Dense Subspaces Containing Clusters

The multidimensional space is partitioned into disjoint rectangular units. Dense units are identified using the a priori properties via a bottom-up approach:

The multidimensional space is first partitioned into disjoint rectangular units, and the a priori properties are then utilized to determine whether these units are dense. The identification of dense units employs a “bottom-up” approach, which incrementally processes data from lower to higher dimensions. For instance, once the set of dense units Dk − 1 in the (k − 1)-dimensional space is identified, the candidate dense unit set

D_{k}'

for the k-dimensional space can be derived using the a priori properties. If

D_{k}'

is empty, the highest-dimensional subspace

S_{k - 1} = D_{1} \times D_{2} \times \dots \times D_{k - 1}

containing clusters is obtained. If

D_{k}'

is non-empty, the k-dimensional data are traversed again to eliminate non-dense units in

D_{k}'

using the a priori properties, yielding the dense unit set

D_{k}

for the k-dimensional space. Subsequently, the candidate dense unit set

D_{k + 1}'

for the (k + 1)-dimensional space is generated. This process iterates until the candidate dense unit set for a certain dimension becomes empty.

Step 2: Cluster Identification

Input: A set D of dense units in a k-dimensional subspace.

Output: A partition

\{D_{1}, D_{2}, \dots D_{q}\}

of D into connected components, where no two distinct components

u_{i} \in D_{i}, u_{j} \in D_{j} (i \neq j)

are interconnected.

This step is analogous to searching for connected components in a graph, where the units in the multidimensional space are treated as vertices, and an edge exists between two units if they are connected.

Step 3: Cluster Description Generation

Input: A dense unit set in a k-dimensional subspace, forming a cluster C.

Output: A collection R within the same subspace, where R ∈ C, and every unit in C belongs to at least one member of R. For each cluster, generate concise descriptions using the minimal covering hyper-rectangles of R.

From the analysis of the above clustering process, the CLIQUE algorithm implementation for marine diesel engine operational mode classification proceeds through the following systematized workflow:

(1): Dimensionality Specification: Determine the dimensionality of the multidimensional space according to the number of performance parameters required for operational mode differentiation;
(2): Grid Partitioning: Define parameter λ to partition each dimension into λ equal-length intervals, establishing the fundamental grid structure;
(3): Initial Candidate Generation: At k = 1 (where k denotes subspace dimensionality), designate all rectangular units as candidate dense units;
(4): Density Computation: Systematically examine each k-dimensional subspace to calculate unit densities, quantified as the number of data points contained within individual units;
(5): Dense Unit Identification: Apply density threshold τ to classify units exceeding this value as k-dimensional dense units;
(6): Subspace Pruning: Eliminate subspaces failing to meet minimum density criteria through iterative validation;
(7): Dimensionality Expansion: Generate (k + 1)-dimensional candidate dense units from the intersection of k-dimensional dense unit sets;
(8): Cluster Extraction: Identify final clusters as maximally connected regions within k-dimensional dense unit assemblies;
(9): Data Archiving: Store derived cluster characteristics and metadata in designated databases for subsequent operational mode analysis.

2.3. Performance Evaluation Metrics—Mahalanobis Distance

Marine diesel main engine performance parameters exhibit strong interdependencies with heterogeneous dimensionality and variation ranges. To address this multivariate challenge, the Mahalanobis distance (MD)—a covariance-based metric proposed by P.C. Mahalanobis—was adopted for its efficacy in quantifying similarity between multidimensional datasets. Consider a sample_x. The Mahalanobis distance from X to another sample set can be mathematically expressed as

ρ_{M a h} (x) = \sqrt{{(x - μ)}^{T} S^{- 1} (x - μ)}

(5)

In the equation,

μ

denotes the sample mean of the dataset, and

S

represents the covariance matrix. When

S

reduces to an identity matrix, the Mahalanobis distance becomes equivalent to the Euclidean distance. Two critical properties emerge from this formulation: First, the Mahalanobis distance calculation is scale-invariant [24], meaning it remains unaffected by measurement unit variations across parameters in different datasets. Second, through explicit incorporation of the covariance matrix, this metric inherently eliminates parameter correlations and incorporates the overall distribution characteristics of the dataset.

2.4. Fault Early Warning Method

The cumulative anomaly curve method and Yamamoto detection method were employed for abrupt change identification in the Mahalanobis distance time series.

2.4.1. Cumulative Anomaly Curve Method

The cumulative anomaly method calculates cumulative anomaly values for each temporal node, generating a time-indexed curve that visually represents trend variations. The curve’s inflection points, determined by slope direction reversals (ascending-to-descending or vice versa), approximate the timing of abrupt changes in the target time series [25]. For a given time series X, the cumulative anomaly at time t is defined as

x_{t} = \sum_{i = 1}^{t} (x_{i} - \bar{x})

(6)

In the equation,

\bar{x}

denotes the mean value of series _x.

2.4.2. Yamamoto Detection Method

The Yamamoto detection method identifies abrupt changes by assessing the statistical significance of mean differences between pre- and post-test point subsequences [26]. This methodology conceptualizes abrupt change detection as a hypothesis-testing problem concerning population mean equivalence. For a given series _x, let

n_{1}

and

n_{2}

represent the sample sizes of the pre- and post-test subsequences, respectively. The signal-to-noise ratio (SNR) is defined as

S N R = \frac{|\bar{x_{1}} - \bar{x_{2}}|}{s_{1} + s_{2}}

(7)

In the equation,

\bar{x_{1}}

and

\bar{x_{2}}

represent the mean values of the pre- and post subsequences, respectively, with s₁ and s₂ denoting their standard deviations.

If the signal-to-noise ratio (SNR) is greater than 1.0, it is considered that an abrupt change occurs at that moment. If the SNR exceeds 2.0, the abrupt change is classified as a strong mutation. If the value of the SNR is set to 10, it corresponds to a confidence level of 95%, while a value of 14 corresponds to a confidence level of over 99.5%.

3. Performance Degradation Trend Analysis and Fault Early Warning of Marine Diesel Main Engines

3.1. Data Processing

3.1.1. Data Sources

The monitoring data were collected from a 6RT-flex82T marine diesel main engine (Wärtsilä, Helsinki, Finland) installed on a commercial vessel, comprising 43,186 performance samples spanning 9 June 2020 06:20 to 14 May 2021 08:10. Key technical specifications of the engine are detailed in Table 1.

The monitoring parameters were collected and stored using a dedicated data acquisition system with sampling intervals of 1 s, 1 min, 10 min, and 1 h. Each recorded sample includes timestamps, data types, parameter values, and parameter classifications. As shown in Figure 1, the 10 min interval dataset spanning 16 June 2020 05:30 to 23 June 2020 04:00 exhibited significant anomalies, necessitating preprocessing prior to analysis.

3.1.2. Performance Parameter Selection

Standard thermodynamic parameters of marine diesel main engines effectively reflect operational health and performance, with most parameters now being automatically monitored, collected, and stored in real-time [27]. Performance degradation manifests as deviations of these parameters from factory-set baselines, where the magnitude of deviation serves as the primary evaluation metric.

Rational parameter selection forms the foundation of engine performance assessment. This study selected 22 parameters encompassing power output, fuel type, rotational speed, scavenge air temperature, fuel flow rate, specific fuel oil consumption (SFOC), exhaust temperatures of cylinders #1–6, jacket cooling water outlet temperatures of cylinders #1–6, lubricating oil inlet temperature, jacket cooling water inlet temperature, scavenge air pressure, and jacket cooling water inlet pressure. Due to sensor limitations, engine room ambient temperature was approximated using scavenge air temperature, while sea condition influences were excluded through operational condition partitioning. As a critical indicator of fuel efficiency, SFOC was calculated using

S F O C = \frac{Q}{P}

(8)

In the equation,

Q

denotes fuel flow rate (kg/h), and P represents engine power (kW). The selected parameters comprehensively cover major engine subsystems, ensuring holistic performance characterization. The performance parameter table is as shown in Table 2. Storage specifications for all parameters except SFOC are detailed in Table 3.

3.1.3. Data Preprocessing

Data preprocessing is one of the critical steps to ensure the accuracy of marine diesel main engine performance evaluation, aiming to eliminate abnormal performance samples. The primary causes of abnormal data include the following: First, sensor malfunctions during data acquisition lead to invalid values such as negative numbers, constant values, or abrupt fluctuations. Second, data packet loss occurs due to communication issues, resulting in parsing errors during data reception. Third, inconsistent sampling frequencies among sensors, causing missing parameters in monitoring data. The main categories of abnormal data are as follows:

(1) Missing parameters: One or more performance parameters are absent in a sample, preventing the calculation of performance evaluation metrics;

(2) Parameter anomalies: One or more parameters exhibit abrupt deviations or values far from the mean under current operating conditions (e.g., rotational speed below the minimum stable threshold, abnormally low shaft power, or fuel flow rate below operational limits). These anomalies are caused by sensor failures or data parsing errors and do not reflect the actual performance of the marine diesel main engine.

Criteria for abnormal performance samples are as follows:

Any performance parameter in the sample is null.

Rotational speed falls below the minimum stable threshold (20 RPM).

Power output is lower than 10 kW.

Fuel flow rate is less than 0.0001 kg/h.

Table 4 summarizes the statistical characteristics of the preprocessed dataset. As visualized in Figure 2, the postprocessed data (red curve) effectively eliminates subthreshold rotational speed anomalies (black raw data segments) through this protocol.

3.2. Steady-State Interval Identification

This paper employs the SSD steady-state detection algorithm with the first parameter (window width) set to 12,the window size is selected based on practical scenarios and; determined through multiple experiments. The second parameter is the critical value of Student’s t-test. The critical value is derived from statistical tables based on degrees of freedom ν and significance level α. Configured with α = 0.025, the degrees of freedom are calculated as

ν = n - 2

(9)

In the equation, ν represents the number of independent parameters in statistical estimation, and n denotes the window width.

Steady-state operation is confirmed when thermodynamic parameters satisfy the stability criterion, thereby indicating nominal engine operation. The identified stable intervals are statistically summarized in Table 5.

Figure 3 illustrates the rotational speed trend before and after steady-state detection. The black curve represents the preprocessed rotational speed profile, while the red curve denotes the post-detection steady-state sequence. As evident from the raw data (black curve), two pronounced fluctuations occur, likely corresponding to acceleration/deceleration maneuvers or shutdown events. The steady-state detection protocol successfully eliminates these transient disturbances (red curve), yielding a stabilized operational profile that better reflects nominal engine behavior.

3.3. Operational Condition Classification

The 37,026 steady-state processed sample data were used as the data source. Performance parameters such as fuel type, power, speed, scavenge air temperature, and scavenge air pressure were selected as the grid dimensions (i.e., input dimensions) to establish a five-dimensional sample space. The remaining performance parameters were used as output samples, with the output sample dimensions being 17. The dimensional information of the sample space is shown in Table 6. The density threshold is used to determine whether a cell is a dense unit. In this study, the density threshold was set to 0.002. If the number of samples within a cell exceeds the product of the total number of samples and the density threshold, the cell is considered a dense unit; otherwise, it is classified as a non-dense unit. The maximum connected region of adjacent dense units is defined as a cluster. The clustering results are shown in Figure 4, which generated 33 operating conditions (clusters), with condition 1 containing 23,720 performance samples, the largest cluster, and condition 33 containing only 8 performance samples. When a ship is sailing at sea, the engine speed is usually set according to the shipowner’s requirements, ensuring that the main engine’s operating condition remains stable for a certain period of time or a single voyage. The operating condition that the ship’s main engine frequently operates under is called the common operating range, and the cluster with the largest number of samples represents the common operating range condition, which is condition 1 in Figure 4. The basic information of the common operating range condition is shown in Table 7, with 23,720 performance samples under the common operating range condition, spanning from 19:00 on 7 July 2020 to 07:40 on 14 May 2021.

3.4. Performance Degradation Analysis and Fault Early Warning

3.4.1. Performance Evaluation Metrics—Calculation of Mahalanobis Distance

Following operational condition partitioning of marine diesel main engines using the CLIQUE clustering algorithm, a time series of Mahalanobis distances was established by calculating the distances between performance samples at different time points under identical operational conditions. This study established an initial performance dataset

Y = {[Y_{1}, Y_{2}, T_{3}, \dots, Y_{100}]}^{T}

comprising the first 1000 sets of performance samples, where each synchronized parameter configuration constitutes a performance sample. We subsequently calculated the Mahalanobis distance between temporal performance samples and this reference dataset. For a given performance sample X at time t, its Mahalanobis distance to dataset Y is formulated as

M D = \sqrt{(x - μ) S^{- 1} {(x - μ)}^{T}}

(10)

In the equation, µ denotes the mean vector of the initial performance dataset Y, and S represents its covariance matrix. These statistical descriptors are formally defined as

μ = \frac{1}{n} \sum_{i = 1}^{n} Y_{i}

(11)

S = \frac{1}{n - 1} \sum_{i = 1}^{n} (Y_{i} - μ) {(Y_{i} - μ)}^{T}

(12)

In the equation, n = 1000, denoting the sample size of the reference dataset. The Mahalanobis distance serves as a robust metric for marine diesel engine performance assessment, quantitatively characterizing the engine’s degradation trajectory. A monotonic relationship exists between the distance magnitude and degradation severity: larger Mahalanobis distances indicate more pronounced performance deterioration, while smaller values correspond to preserved operational integrity.

3.4.2. Performance Evaluation and Degradation Trend Analysis

Using the frequently encountered operational condition range as a case study, comprehensive performance evaluation and degradation analysis were conducted for marine diesel main engines. The first 1000 performance samples (collected from 19:00 on 7 July 2020 to 07:20 on 6 September 2020) were selected as the baseline performance sample space, with their statistical means summarized in Table 8. Subsequent calculation of Mahalanobis distances between performance samples from different time intervals and this baseline space yielded a time series comprising 22,720 data points. The resultant performance degradation trend curve is presented in Figure 5.

Figure 5 demonstrates that the Mahalanobis distance exhibited a gradual increase from September 2020 to late January 2021, aligning with empirical patterns of marine diesel main engine performance degradation. However, a sharp surge and pronounced fluctuations in Mahalanobis distances were observed between mid-January and March 2021, indicative of acute systemic performance deterioration. After March 2021, the growth rate of Mahalanobis distances stabilized, demonstrating restoration of normative degradation patterns. This temporal pattern suggests anomalous parameter deviations during the mid-January–March 2021 period, potentially attributable to transient mechanical faults. The subsequent normalization of degradation trends implies successful fault mitigation through operational interventions.

To validate this hypothesis, sequential elimination of individual performance parameters was conducted with subsequent Mahalanobis distance recalculations to reconstruct degradation trend curves. The results demonstrate that removal of the “No. 2 cylinder jacket cooling water outlet temperature” parameter yielded a gradual degradation profile devoid of abrupt surges or significant fluctuations (Figure 6). In contrast, elimination of other parameters maintained degradation trends consistent with Figure 5 (Figure 7 and Figure 8). Subsequent analysis revealed substantial amplitude variations in the No. 2 cylinder jacket cooling water temperature profile between late February and March 2021 (Figure 9), suggesting probable mechanical fault occurrence within this period. The localized temperature anomalies indicate fault origination proximate to the No. 2 cylinder jacket cooling water outlet, consistent with the observed performance deviations.

3.4.3. Fault Early Warning

Fault early warning aims to provide alerts before a failure occurs. In practical navigation, if a warning is only issued after a significant change in the performance degradation trend, it may be too late. This paper proposes a fault early warning technique based on time–domain analysis. The core idea is to analyze the performance degradation curve of the ship’s main engine, identify the time of the first abrupt change in the curve as the fault warning time, and analyze the cause of the anomaly to prevent potential failures.

The cumulative anomaly curve method and Yamamoto detection method were employed for abrupt change identification in the Mahalanobis distance time series. The effectiveness of this technique was validated through experimental verification. As shown in Figure 10, between late February and March 2021, the outlet temperature of the jacket cooling water for cylinder #2 exhibited abnormal fluctuations, accompanied by a significant increase in performance variation. However, from late January to early February 2021, the performance degradation curve of the marine diesel engine already showed the first abrupt change, while the “outlet temperature of the cylinder jacket cooling water for cylinder #2” only slightly deviated from the normal mean without showing any significant anomaly. Therefore, by identifying the first abrupt change in the performance degradation curve during this period, fault warning can be effectively triggered.

Implementing time–domain analysis for precise abrupt change identification involves a two-stage methodology: preliminary temporal range estimation via the cumulative anomaly method, followed by Yamamoto-based exact localization. The cumulative anomaly curve provides visual detection of degradation trend transitions. As shown in Figure 11, the Mahalanobis distances exhibit progressive growth over time, accompanied by decreasing cumulative anomaly values—a pattern consistent with typical marine diesel main engine degradation. However, a marked inflection in the cumulative anomaly curve occurred in late January 2021, indicating an abrupt change in the degradation trajectory. This initial inflection phase was temporally bounded between January 23 and 26 February 2021.

Subsequent refinement using the Yamamoto method entailed SNR calculations for performance sequences within this critical window, with results fitted to a validation curve (Figure 12 and Figure 13). The detection threshold (SNR > 1.0) was first breached at 18:10 on 25 January 2021, establishing this timestamp as “The definitive onset of degradation curve destabilization; The optimal fault early-warning trigger point”.

Figure 14, Figure 15 and Figure 16 further demonstrate a significant temporal separation between the initial inflection of the engine’s overall performance degradation curve and subsequent parametric anomalies, providing additional validation of the effectiveness and timeliness of the degradation-based fault early-warning technology.

4. Conclusions

This study proposes a data-driven methodology for marine diesel engine performance assessment, effectively revealing performance degradation trends and enabling proactive fault warning through degradation pattern analysis. The results demonstrate the capability to detect and issue warnings for impending faults 20–30 days prior to measurable parameter anomalies. Key findings are summarized as follows:

(1) Utilizing 43,186 operational data points collected over one year, 37,026 stable samples were obtained through preprocessing and steady-state detection. The CLIQUE clustering algorithm automatically partitioned these samples into 33 operational conditions, with Condition 1 (containing 23,720 samples) identified as the dominant operational range. These results quantitatively validate the algorithm’s efficacy in automatic operational condition classification;

(2) Analysis of the Mahalanobis distances within the dominant operational range yielded a fitted performance degradation curve. While the initial phase followed empirical degradation patterns, subsequent abrupt changes and fluctuations indicated systemic anomalies. Sequential parameter elimination localized the anomaly near the No. 2 cylinder jacket cooling water outlet, with temporal analysis confirming abnormal parameter deviations.

(3) Comparative analysis revealed that degradation curve destabilization preceded measurable temperature fluctuations at the No. 2 cylinder outlet by approximately 20 days, establishing a critical early-warning window. The time–domain analysis-based warning system demonstrated operational validity, with precise warning triggering at 18:10 on 25 January 2021.

The proposed methodology provides novel insights for marine diesel engine health management while offering methodological references for intelligent marine equipment condition classification and abrupt change detection. Future research will incorporate additional conventional thermodynamic parameters to enhance holistic engine performance characterization. Concurrently, algorithmic refinements to the SSD framework will implement adaptive window width configuration, establishing a data-driven foundation for integrated ship–shore performance diagnostics.

Author Contributions

H.W., writing—review and editing, methodology, and funding acquisition; Z.W., writing—review and editing, software, and data curation; B.S., writing—original draft and investigation. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by Shanghai Engineering Research Center of Ship Intelligent Maintenance and Energy Efficiency Control under Grant 20DZ2252300.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data care contained within the article.

Conflicts of Interest

Author Biao Shi was employed by the company Shanghai Ocean Shipping Co., Ltd. The remaining author declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

Mauro, F.; Kana, A.A. Digital twin for ship life-cycle: A critical systematic review. Ocean Eng. 2023, 269, 113479. [Google Scholar] [CrossRef]
Ahn, Y.G.; Kim, T.; Kim, B.R.; Lee, M.K. A Study on the Development Priority of Smart Shipping Items-Focusing on the Expert Survey. Sustainability 2022, 14, 6892. [Google Scholar] [CrossRef]
Chen, W.Y.; Xin, B.; Zhu, Z.Q. Real-time Collision Avoidance for Unmanned Ships under Constraints of Rules. In Proceedings of the 2019 IEEE 16th International Conference on Networking, Sensing and Control (ICNSC 2019), Banff, AB, Canada, 9–11 May 2019; pp. 80–85. [Google Scholar]
Hu, Y.; Miao, X.W.; Si, Y.; Pan, E.S.; Zio, E. Prognostics and health management: A review from the perspectives of design, development and decision. Reliab. Eng. Syst. Saf. 2022, 217, 108063. [Google Scholar] [CrossRef]
Hou, W.K.; Yan, J.F.; Yao, G.P. A Comprehensive Validation Approach of PHM System’s Diagnosis and Verification. In Proceedings of the 2014 Prognostics and System Health Management Conference (PHM-2014 Hunan), Zhangjiajie, China, 24–27 August 2014; pp. 199–202. [Google Scholar]
Liu, Z.C.; Zhang, C.X.; Dong, E.Z.; Wang, R.C.; Li, S.Y.; Han, Y.M. Research Progress and Development Trend of Prognostics and Health Management Key Technologies for Equipment Diesel Engine. Processes 2023, 11, 1972. [Google Scholar] [CrossRef]
Zhan, Y.; Zeng, J. Implementation of intelligent operation and maintenance for ships supported by big data. J. Shanghai Marit. Univ. 2019, 40, 62–66. [Google Scholar]
Narasimhan, S.; Kao, C.S.; Mah, R.S.H. Detecting changes of steady states using the mathematical theory of evidence. AIChE J. 1987, 33, 1930–1932. [Google Scholar] [CrossRef]
Raissi, M.; Perdikaris, P.; Karniadakis, G.E. Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. J. Comput. Phys. 2019, 378, 686–707. [Google Scholar] [CrossRef]
Jeong, J.J.; Kim, S. Robust Adaptive Filter Algorithms Against Impulsive Noise. Circuits Syst. Signal Process 2019, 38, 5651–5664. [Google Scholar] [CrossRef]
Kelly, J.D.; Hedengren, J.D. A steady-state detection (SSD) algorithm to detect non-stationary drifts in processes. J. Process Control 2013, 23, 326–331. [Google Scholar] [CrossRef]
Liu, B.; Gan, H.; Chen, D.; Shu, Z. Research on Fault Early Warning of Marine Diesel Engine Based on CNN-BiGRU. J. Mar. Sci. Eng. 2023, 11, 56. [Google Scholar] [CrossRef]
Su, Y.; Gan, H.; Ji, Z. Research on multi-parameter fault early warning for marine diesel engine based on PCA-CNN-BiLSTM. J. Mar. Sci. Eng. 2024, 12, 965. [Google Scholar] [CrossRef]
Zhang, B.S.; Zhang, L.; Zhang, B.; Yang, B.; Zhao, Y. A Fault Prediction Model of Adaptive Fuzzy Neural Network for Optimal Membership Function. IEEE Access 2020, 8, 101061–101067. [Google Scholar] [CrossRef]
Yifan, L.; Lianzhong, H.; Peiting, S.; Ranqi, M. Performance evaluation method for marine main diesel engines in real ship data environments. Trans. Intern. Combust. Engines 2018, 36, 182–191. [Google Scholar]
Zheng, Q.; Lin, J.; Zhao, Z.; Yu, S. Performance evaluation of marine main engine frequent operating zones based on clustering analysis. J. Quanzhou Norm. Univ. 2021, 39, 34–39. [Google Scholar]
Perera, L.P.; Mo, B. Data analysis on marine engine operating regions in relation to ship navigation. Ocean Eng. 2016, 128, 163–172. [Google Scholar] [CrossRef]
Perera, L.P.; Mo, B. Marine engine-centered data analytics for ship performance monitoring. J. Offshore Mech. Arct. Eng. 2017, 139, 021301. [Google Scholar] [CrossRef]
Vanem, E.; Brandsæter, A. Unsupervised anomaly detection based on clustering methods and sensor data on a marine diesel engine. J. Mar. Eng. Technol. 2019, 20, 217–234. [Google Scholar] [CrossRef]
Zhang, J.; Hu, D.; Yang, T.; Zhou, H.; Li, X. A time series and deep fusion framework for rotating machinery fault diagnosis. Eng. Appl. Artif. Intell. 2024, 128, 107456. [Google Scholar] [CrossRef]
Je-Gal, H.; Lee, S.-J.; Yoon, J.-H.; Lee, H.-S.; Yang, J.-H.; Kim, S. A Novel Time–Frequency Feature Fusion Approach for Robust Fault Detection in a Marine Main Engine. J. Mar. Sci. Eng. 2023, 11, 1577. [Google Scholar] [CrossRef]
Ma, F.; Wang, C.; Huang, J.; Zhong, Q.; Zhang, T. Key grids based batch-incremental CLIQUE clustering algorithm considering cluster structure changes. Inf. Sci. 2024, 660, 120109. [Google Scholar] [CrossRef]
Deng, G.; Liu, C.; Xiong, Y. Research and implementation of the CLIQUE clustering algorithm based on grid and density. Comput. Mod. 2008, 12, 8–11. [Google Scholar]
Fu, C.; Liang, X.; Li, Q.; Lu, K.; Gu, F.; Ball, A.D.; Zheng, Z. Comparative study on health monitoring of a marine engine using multivariate physics-based models and unsupervised data-driven models. Machines 2023, 11, 557. [Google Scholar] [CrossRef]
Jiang, Y.; Xu, Z.; Wang, J. A comparative study of five trend detection methods based on annual runoff series. J. Hydraul. Eng. 2020, 51, 845–857. [Google Scholar]
Zhou, J. Study on the Runoff Variation Trend and Prediction in the Baoji Section of the Weihe River Basin. Master’s Thesis, Zhejiang University of Technology, Hangzhou, China, 2017; pp. 10–15. [Google Scholar]
Stoumpos, S.; Theotokatos, G. A novel methodology for marine dual fuel engines sensors diagnostics and health management. Int. J. Engine Res. 2022, 23, 974–994. [Google Scholar] [CrossRef]

Figure 1. Variation curves of certain monitoring parameters (monitoring curve subplot).

Figure 2. Trend Variation of the Engine Speed over a Specific Period After Preprocessing.

Figure 3. Trend changes in rotational speed post-steady-state detection.

Figure 4. Clustering information.

Figure 5. Degradation trend curve of performance.

Figure 6. Performance Degradation Trend without “Cooling Water Outlet Temperature of Cylinder #2 Liner”.

Figure 7. Performance degradation trend excluding “cooling water outlet temperature of cylinder #1 liner”.

Figure 8. Performance degradation trend without “inlet temperature of cylinder jacket cooling water”.

Figure 9. Trend of the performance parameter “outlet temperature of jacket cooling water for cylinder #2”.

Figure 10. Comparison of overall performance with the outlet temperature of jacket cooling water for cylinder #2.

Figure 11. Cumulative anomaly curve.

Figure 12. Yamamoto test curve.

Figure 13. Enlarged view of the abrupt change point in the Yamamoto test curve.

Figure 14. Comparison of cumulative anomaly with the trend of outlet temperature of jacket cooling water for cylinder #2.

Figure 15. Comparison of Yamamoto test with the trend of outlet temperature of jacket cooling water for cylinder #2.

Figure 16. Enlarged view of the abrupt change point for comparison.

Table 1. Technical parameters of the 6RT-flex82T host.

Technical Parameter	Parameter Value
Number of Cylinders	6
Bore Diameter/mm	820
Stroke/mm	3375
Firing Order	1-5-3-4-2-6
Rated Power/kW	25,200
Rated Speed/(r/min)	76
Rated Specific Fuel Consumption (g/kW·h)	176.7
Control Modes	Bridge Control-ECR Control-Local Control
Service Power	22,680
Service Speed (r/min)	72
Service Fuel Consumption (g/kW·h)	200.02
Critical Speed Range (r/min)	34–45
Number of Turbochargers	2
Engine Type	Vertical Two-Stroke

Table 2. Performance parameters information.

Performance Parameter	The Performance of a Marine Diesel Main Engine Reflects
Speed, Power	The power performance of the marine diesel main engine
Fuel Type, Fuel Flow Rate, SFOC	The performance of the fuel system
Cylinder Exhaust Temperature (Six-cylinder)	The performance of the combustion system
Lubricating Oil Inlet Temperature	The performance of the lubrication system
Scavenge Air Temperature, Scavenge Air Pressure	The performance of the turbocharging system
Jacket Cooling Water Inlet Temperature, Jacket Cooling Water Inlet Pressure Jacket Cooling Water Outlet Temperature (Six-cylinder)	The performance of the cooling system

Table 3. Performance parameters store information.

Performance Parameter	Data Type	Unit	Nullable	Lower Limit	Upper Limit
Speed	float (8, 3)	RPM	Yes	−200	200
Power	float (8, 1)	kW	Yes	−20,000	20,000
Fuel Tcype	int (11)	1: HFO, 2: MGO, 3: LSHFO, 4: MDO	Yes	\	\
Fuel Flow Rate	float (12, 2)	kg/h	Yes	\	\
Cylinder Exhaust Temperature (Six-cylinder)	float (8, 3)	°C	Yes	0	600
Lubricating Oil Inlet Temperature	float (8, 3)	°C	Yes	0	200
Jacket Cooling Water Inlet Temperature	float (8, 3)	°C	Yes	0	200
Jacket Cooling Water Inlet Pressure	float (8, 3)	Mpa	Yes	−0.1	0.6
Jacket Cooling Water Outlet Temperature (Six-cylinder)	float (8, 3)	°C	Yes	0	200
Scavenge Air Temperature	float (8, 3)	°C	Yes	0	200
Scavenge Air Pressure	float (8, 3)	Mpa	Yes	0	0.6

Table 4. Sample basic information.

Feature	Basic Information
Data Acquisition Period	9 June 2020 06:20–14 May 2021 08:10
Sample Size	43,186
Sampling Interval	10 min

Table 5. Basic information of samples after steady-state identification.

Feature	Basic Information
Data Acquisition Period	14 June 2020 3:00–14 May 2021 7:40
Sample Size	37,026
Sampling Interval	10 min

Table 6. Information of each dimension of sample space.

Grid Dimension	Minimum	Maximum	Segments
Fuel Type	0	4	8
Speed/RPM	0	20,000	500
Power/kW	20	80	240
Scavenge Air Temperature/°C	20	80	240
Scavenge Air Pressure/MPa	0	0.6	60

Table 7. Basic information about common working conditions.

Parameter		Range
Input Dimension Parameters	Fuel Type	Low-Sulfur Heavy Fuel Oil (LSHFO)
	Power (kW)	10,520–12,240
	Rotational Speed (RPM)	57.5–58.75
	Scavenge Air Temperature (°C)	46.75–52
	Scavenge Air Pressure (MPa)	0.07–0.1
Output Dimension Parameters	Cylinder 1 Exhaust Temperature (°C)	352.71–361.28
	Cylinder 2 Exhaust Temperature (°C)	350.05–360.57
	Cylinder 3 Exhaust Temperature (°C)	351.51–361.32
	Cylinder 4 Exhaust Temperature (°C)	346.1–356.28
	Cylinder 5 Exhaust Temperature (°C)	355.71–363.86
	Cylinder 6 Exhaust Temperature (°C)	353.67–361.62
	Lubricating Oil Inlet Temperature (°C)	45.19–46.34
	Cylinder 1 Jacket Cooling Water Outlet Temperature (°C)	88.29–89.69
	Cylinder 2 Jacket Cooling Water Outlet Temperature (°C)	88.21–89.87
	Cylinder 3 Jacket Cooling Water Outlet Temperature (°C)	87.86–89.31
	Cylinder 4 Jacket Cooling Water Outlet Temperature (°C)	88.02–89.37
	Cylinder 5 Jacket Cooling Water Outlet Temperature (°C)	88.6–90.01
	Cylinder 6 Jacket Cooling Water Outlet Temperature (°C)	88.32–89.74
	Jacket Cooling Water Inlet Pressure (MPa)	0.4685–0.4791
	Fuel Flow Rate (kg/h)	1935.52–2022.81
	Specific Fuel Consumption (g/kWh)	172.42–182.04
	Jacket Cooling Water Inlet Temperature (°C)	84.05 (Fixed)

Table 8. Mean Values of Performance Parameters in the Initial Performance Sample Space for the Common Operating Range Condition.

Performance Parameter	Mean Value
Cylinder Jacket Cooling Water Inlet Temperature (°C)	83.773101089477535
Cylinder 1 Jacket Cooling Water Outlet Temperature (°C)	89.696554016113282
Cylinder 2 Jacket Cooling Water Outlet Temperature (°C)	89.845977897644048
Cylinder 3 Jacket Cooling Water Outlet Temperature (°C)	89.163089996337888
Cylinder 4 Jacket Cooling Water Outlet Temperature (°C)	89.092101951599119
Cylinder 5 Jacket Cooling Water Outlet Temperature (°C)	89.833513893127446
Cylinder 6 Jacket Cooling Water Outlet Temperature (°C)	89.516971015930181
Cylinder Jacket Cooling Water Inlet Pressure (MPa)	0.47260199856758117
Cylinder 1 Exhaust Temperature (°C)	358.57207760620116
Cylinder 2 Exhaust Temperature (°C)	352.28291729736327
Cylinder 3 Exhaust Temperature (°C)	351.78961108398437
Cylinder 4 Exhaust Temperature (°C)	343.9371090698242
Cylinder 5 Exhaust Temperature (°C)	356.3969108276367
Cylinder 6 Exhaust Temperature (°C)	352.38188513183593
Lubricating Oil Inlet Temperature (°C)	45.5445690536499
Scavenge Air Temperature (°C)	51.23264501571655
Scavenge Air Pressure (MPa)	0.084497000284492974
Speed (RPM)	58.234216999053956
Power (kW)	11,815.623912109375
Fuel Flow Rate (kg/h)	2035.4115
Specific Fuel Consumption (g/kW·h)	172.31603485696255
Fuel Type	3

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Wang, H.; Wang, Z.; Shi, B. Research on Data-Driven Performance Assessment and Fault Early Warning of Marine Diesel Engine. Appl. Sci. 2025, 15, 6299. https://doi.org/10.3390/app15116299

AMA Style

Wang H, Wang Z, Shi B. Research on Data-Driven Performance Assessment and Fault Early Warning of Marine Diesel Engine. Applied Sciences. 2025; 15(11):6299. https://doi.org/10.3390/app15116299

Chicago/Turabian Style

Wang, Haiyan, Zihan Wang, and Biao Shi. 2025. "Research on Data-Driven Performance Assessment and Fault Early Warning of Marine Diesel Engine" Applied Sciences 15, no. 11: 6299. https://doi.org/10.3390/app15116299

APA Style

Wang, H., Wang, Z., & Shi, B. (2025). Research on Data-Driven Performance Assessment and Fault Early Warning of Marine Diesel Engine. Applied Sciences, 15(11), 6299. https://doi.org/10.3390/app15116299

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Research on Data-Driven Performance Assessment and Fault Early Warning of Marine Diesel Engine

Abstract

1. Introduction

2. Introduction to Performance Assessment and Fault Warning Methods

2.1. Steady-State Interval Identification Algorithm

2.2. Operational Condition Classification Algorithm

2.3. Performance Evaluation Metrics—Mahalanobis Distance

2.4. Fault Early Warning Method

2.4.1. Cumulative Anomaly Curve Method

2.4.2. Yamamoto Detection Method

3. Performance Degradation Trend Analysis and Fault Early Warning of Marine Diesel Main Engines

3.1. Data Processing

3.1.1. Data Sources

3.1.2. Performance Parameter Selection

3.1.3. Data Preprocessing

3.2. Steady-State Interval Identification

3.3. Operational Condition Classification

3.4. Performance Degradation Analysis and Fault Early Warning

3.4.1. Performance Evaluation Metrics—Calculation of Mahalanobis Distance

3.4.2. Performance Evaluation and Degradation Trend Analysis

3.4.3. Fault Early Warning

4. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI