Next Article in Journal
Liquid Reservoir Weld Defect Detection Based on Improved YOLOv8s
Previous Article in Journal
MSRA-Net: A Multi-Task Learning Model for Soil Texture Prediction with Dynamic Weighting and Prior Knowledge Soft Constraints
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Probabilistic Prognostics and Health Management of Power Transformers Using Dissolved Gas Analysis Sensor Data and Duval’s Polygons

by
Fabio Norikazu Kashiwagi
1,
Miguel Angelo de Carvalho Michalski
1,*,
Gilberto Francisco Martha de Souza
1,
Halley José Braga da Silva
1 and
Hyghor Miranda Côrtes
1,2
1
Department of Mechatronics and Mechanical Systems Engineering, Polytechnic School, University of São Paulo, Avenida Professor Mello de Moraes 2231, São Paulo 05508-030, SP, Brazil
2
EDP Brasil, Avenida Cassiano Ricardo, 1973—Jardim Alvorada, São José dos Campos 12240-540, SP, Brazil
*
Author to whom correspondence should be addressed.
Sensors 2025, 25(21), 6520; https://doi.org/10.3390/s25216520
Submission received: 22 August 2025 / Revised: 7 October 2025 / Accepted: 15 October 2025 / Published: 23 October 2025
(This article belongs to the Section Fault Diagnosis & Sensors)

Highlights

What are the main findings?
  • A probabilistic framework is proposed for transformer fault detection, diagnosis, and prognosis using Dissolved Gas Analysis (DGA) sensor data.
  • The method integrates self-adaptive ARIMA forecasting with probabilistic extensions of Duval’s polygons, enabling uncertainty-aware fault classification and failure risk estimation.
What is the implication of the main findings?
  • The framework improves reliability of transformer condition monitoring by providing early warnings and robust fault evolution tracking.
  • It supports risk-based maintenance decisions in smart grid environments, enhancing operational safety and asset lifetime.

Abstract

Power transformers are critical assets in modern power grids, where failures can lead to significant operational disruptions and financial losses. Dissolved Gas Analysis (DGA) is a key sensor-based technique widely used for condition monitoring, but traditional diagnostic approaches rely on deterministic thresholds that overlook uncertainty in degradation dynamics. This paper proposes a probabilistic framework for Prognostics and Health Management (PHM) of power transformers, integrating self-adaptive Auto Regressive Integrated Moving Average modeling with a probabilistic reformulation of Duval’s graphical methods. The framework enables automated estimation of fault types and failure likelihood directly from DGA sensor data, without requiring labeled datasets or expert-defined rules. Dissolved gas dynamics are forecasted using time-series models with residual-based uncertainty quantification, allowing probabilistic fault inference from predicted gas trends without assuming deterministic persistence of a specific fault type. A sequential pipeline is developed for real-time fault tracking and reliability assessment, aligned with IEC, IEEE, and CIGRE standards. Two case studies validate the method: one involving gas loss in an experimental setup and another examining thermal degradation in a 345 kV transformer. Results show that the framework improves diagnostic reliability, supports early fault detection, and enhances predictive maintenance strategies. By combining probabilistic modeling, time-series forecasting, and sensor-based diagnostic inference, this work contributes a practical and interpretable PHM solution for sensor-enabled monitoring environments in modern power grids.

1. Introduction

Power transformers are critical assets in electrical substations and modern power grids, where failures can lead to significant operational disruptions, financial losses, and safety risks [1,2]. Their operational lifespan is primarily determined by the condition of the insulation system, which gradually degrades due to electrical, environmental, and mechanical stresses [3].
Beyond operational disruptions, transformer failures can also result in substantial financial losses. For example, distribution-level studies have estimated aggregate impacts of several hundred million euros annually [4], while analyses at the transmission level show that failures of high-voltage units are associated with the highest interruption costs for end-users [5]. From a life-cycle perspective, failures also entail both direct recovery expenses and indirect social and economic losses [6], and reviews consistently highlight the significant financial risks and prolonged outages linked to such events [7]. These figures reinforce the practical relevance of accurate diagnostics and prognostics for such critical assets.
Dissolved Gas Analysis (DGA) is a sensor-based diagnostic technique widely adopted for monitoring oil-immersed transformers, enabling early fault detection through the evaluation of gas concentrations and their evolution over time [8,9,10]. Standard diagnostic models, such as Doernenburg’s method, Rogers’ ratios, and the IEC 60599 and IEEE C57.104 guides, use deterministic gas thresholds to classify fault types [11,12]. Structured graphical approaches like Duval’s Triangle [13,14,15] and Pentagon [16,17,18] offer domain-informed decision support, but still rely on fixed concentration boundaries. This rigidity limits their capacity to capture fault progression dynamics and to provide probabilistic insight into failure risk.
With the advent of online monitoring systems and embedded sensors [19,20], continuous measurement of gas concentrations has become feasible, making real-time prognostics increasingly viable. However, conventional DGA-based classification methods remain categorical and threshold-driven, offering limited support for uncertainty quantification, an essential requirement in modern Prognostics and Health Management (PHM) strategies [21,22].
Artificial intelligence (AI) techniques such as Artificial Neural Networks (ANNs) [23,24,25], fuzzy logic [26,27], Support Vector Machines (SVMs) [28,29,30], and hybrid models [31,32,33] have been proposed to enhance diagnostic performance. While often effective in static classification, these models generally require large labeled datasets, expert-defined rules, or high computational resources. As such, they are not always suitable for continuous or adaptive PHM in industrial settings.
This work introduces a probabilistic framework for transformer health assessment, integrating fault detection, diagnosis, and failure prognosis under uncertainty. The framework combines self-adaptive Auto Regressive Integrated Moving Average (ARIMA) forecasting with a reformulated, uncertainty-aware version of Duval’s diagnostic polygons. By eliminating the need for labeled training data or heuristic rules, it extends existing DGA interpretation methods while remaining compatible with international standards. Gas concentration trends are modeled using dynamically selected ARIMA processes via the Hyndman–Khandakar algorithm [34,35], which enables the probabilistic estimation of fault states and future degradation.
Although ARIMA and its variants have been previously applied to forecast dissolved gas concentrations in transformers, such as SARIMA-based trend prediction considering temperature effects [36], wavelet-ARIMA with non-parametric kernel density estimation for interval forecasts [37], and ensemble learning optimized by Dempster–Shafer theory in comparison with SARIMA [38], these works remain focused on concentration forecasting, and their integration into a structured diagnostic framework has scarcely been addressed. The key innovation of the present study lies in embedding adaptive ARIMA forecasting into Duval’s diagnostic polygons, reformulated as probabilistic models. This enables dynamic fault classification with quantified uncertainty and supports risk-informed prognosis by estimating the probability of failure within a defined time horizon, contributing to reliability-centered maintenance planning.
While some recent studies explore probabilistic models for transformer diagnostics using DGA [39,40], they do not combine time-series forecasting with structured inference, nor do they offer a unified PHM-oriented methodology grounded in industrial practice. By integrating probabilistic forecasting with uncertainty-aware diagnostic inference, the proposed framework supports robust, data-driven decision-making while maintaining computational efficiency. Its continuous adaptability to new operating conditions makes it well-suited for real-time monitoring, offering a practical and scalable solution for transformer health management in operational environments.
The main contributions of this work are as follows:
  • A unified probabilistic reformulation of Duval’s diagnostic structures;
  • Integration of adaptive time-series forecasting and residual-based uncertainty modeling;
  • A sequential and interpretable pipeline encompassing detection, diagnosis, and risk-informed prognosis;
  • Compatibility with international reliability standards (IEC, IEEE, CIGRE), ensuring industrial applicability.
The proposed method is validated using published real-world transformer fault data, demonstrating its suitability for operational PHM environments. The remainder of this paper is organized as follows: Section 2 reviews DGA fundamentals, Duval’s methods, and ARIMA modeling. Section 3 details the proposed probabilistic framework. Section 4 presents validation results. Section 5 concludes with key findings and future directions.

2. Theoretical Background

This section outlines the core concepts of the proposed framework, including DGA for transformer monitoring, conventional classification methods (e.g., gas ratios and Duval’s polygons), and the fundamentals of self-adaptive ARIMA for gas trend forecasting.

2.1. DGA in Transformer Monitoring

The reliability and lifespan of power transformers depend on the condition of their cellulose-based insulation immersed in mineral oil. With proper maintenance, these assets can operate for 25 to 50 years [22]. However, internal stresses (e.g., insulation degradation, partial discharges, localized overheating) and external factors (e.g., lightning, switching surges, overloads) accelerate aging and increase failure risks [22,41].
DGA enables early fault detection by analyzing gas concentrations dissolved in insulating oil. It supports preventive maintenance by identifying gas evolution patterns indicative of insulation aging and fault severity [40]. Recent advances in sensor technology allow continuous online monitoring, improving responsiveness and diagnostic accuracy, while traditional practices rely on periodic oil sampling and laboratory analysis [20].
DGA-based diagnostic frameworks, such as IEC 60599 [11] and IEEE C57.104 [12], classify transformer faults based on both gas composition and concentration levels. They provide structured methodologies to associate gas patterns with specific fault types, such as thermal degradation, electrical discharges, and chemical anomalies.
According to IEC 60599 [11], thermal faults result from localized or prolonged overheating, whereas electrical faults involve ionization of the insulating medium under high-voltage stress [42,43]. Thermal faults (T1–T3) escalate in severity, from minor aging effects (<300 °C) to severe carbonization and insulation damage (>700 °C). Electrical discharges are similarly classified: low-energy (D1) events cause limited insulation breakdown, whereas high-energy (D2) discharges lead to extensive carbonization and metal erosion, posing immediate risks [11]. The IEEE C57.104 standard [12] complements this framework by incorporating gas trend analysis to monitor fault evolution and proximity to critical thresholds.
Table 1 summarizes the fault types according to gas behavior and temperature thresholds, consistent with the categories currently adopted in international standards and industry practice.
Recent studies [17,44,45] have introduced additional fault subtypes, particularly new sub-zones in Duval’s Triangles and Pentagons to differentiate arcing in paper versus oil and to identify carbonization of paper insulation. These valuable refinements, however, are not yet consolidated in IEC 60599 (2022) [11] or IEEE C57.104 (2019) [12], and remain outside current industry standards.
DGA interpretation is guided by standardized methodologies that associate gas thresholds and temporal patterns with fault types. While IEEE and IEC rely on absolute concentration limits and trend analysis, CIGRE enhances predictive capability by incorporating historical failure data and probabilistic thresholds [11,12,46]. Despite sharing core principles, IEEE, IEC, and the CIGRE Joint Task Force D1.01/A2.11 differ in key aspects:
  • Concentration Thresholds: IEEE and IEC use decision matrices; CIGRE integrates historical data to estimate failure probabilities.
  • Rate-of-change Analysis: IEEE emphasizes baseline trends; IEC focuses on distinguishing aging from abnormal degradation; CIGRE introduces explicit sampling intervals and slope thresholds.
  • Pre-Failure Limits: Only CIGRE defines pre-failure gas concentration thresholds, allowing RUL estimation. IEEE and IEC recommend increased monitoring when elevated gas levels are detected [11,12,15,46].
Among the three, CIGRE uniquely offers a structured methodology for pre-failure risk assessment, supporting long-term asset management and enabling utilities to intervene before transformers reach critical conditions [46].

2.2. DGA Fault Classification Methods

The classification of transformer faults using DGA has been a central topic in diagnostic research for decades, leading to the development of ratio-based and graphical methods. These techniques analyze the composition and proportions of dissolved gases to detect failures early and support maintenance planning strategies [22].
Gas ratio methods are among the earliest and most established approaches. They rely on predefined relationships between gases such as hydrogen (H2), methane (CH4), ethane (C2H6), ethylene (C2H4), and acetylene (C2H2) to classify faults, including thermal overheating and electrical discharges [11,42]. Despite their simplicity and field applicability, methods like the Doernenburg Ratio Method (DRM), Rogers Ratio Method (RRM), and IEC Ratio Method (IRM) often yield inconclusive results in mixed or evolving fault conditions due to rigid thresholds and limited sensitivity [43].
To overcome these limitations, graphical techniques were developed to map gas concentrations into predefined fault zones. The most widely adopted is the Duval Triangle Method (DTM 1), which uses C2H2, C2H4, and CH4 to distinguish between thermal and electrical faults [13]. However, to improve fault resolution, additional models were introduced. Duval Triangle 4 (DTM 4) includes H2, CH4, and C2H6 for better detection of low-energy faults, while Triangle 5 (DTM 5) focuses on high-temperature events using C2H2, C2H4, and C2H6 [12].
Duval Pentagon models further expand diagnostic resolution. Pentagon 1 (DPM 1) incorporates all five gases to improve differentiation of low-energy discharges and mild overheating [44]. Pentagon 2 (DPM 2) enables more granular classification of thermal faults (T3-H, C, O, S) and discharge types (PD, D1, D2), making it particularly valuable for aging assessment [16,47].
Comparative studies confirm the superiority of Duval’s graphical methods over ratio-based ones. Liu et al. (2015) [48] reported 93.94% accuracy for DTM 1, while Faiz & Soleimani (2017) [49] showed over 90% performance in arcing fault classification using Duval Triangles. Despite its robustness, DTM 1 lacks sensitivity to certain gases like C2H6 and H2, a limitation addressed by the extended models [50].
Overall, the Duval Triangle and Pentagon series remain the most interpretable and accurate tools for traditional DGA. Their structured design, validation history, and adaptability to refinements make them ideal candidates for integration into hybrid or probabilistic diagnostic frameworks [10,22].
In parallel, recent advances in machine learning and AI have improved DGA-based classification, especially in borderline or ambiguous cases. These data-driven models learn from historical patterns and real-time monitoring, reducing reliance on static thresholds and enhancing adaptability [20,51]. Neural networks have demonstrated superior performance in identifying nonlinear gas-fault relationships, outperforming conventional methods in fault differentiation [8,23,24,25,52,53,54].
Despite their high accuracy, purely AI-based models often lack interpretability, require extensive labeled datasets, and involve significant computational overhead, limiting their applicability in continuous monitoring of critical infrastructure. These constraints are particularly relevant in industrial contexts where transparency, standard compliance, and scalability are essential.
Consequently, there is growing interest in hybrid approaches that balance data-driven adaptability with structured diagnostic reasoning grounded in expert knowledge. These approaches, such as AI-Duval models, integrate graphical tools with adaptive algorithms to dynamically refine decision boundaries while preserving interpretability [39,40,55].
Following this direction, this study introduces a framework that integrates probabilistic modeling, uncertainty quantification, and adaptive machine learning to combine the strengths of statistical learning and structured expert-based inference. The result is a scalable, interpretable, and risk-informed solution for transformer health assessment.

2.3. Self-Adaptive ARIMA Modeling

Accurate prediction of gas concentration trends in DGA is essential for assessing the severity and progression of power transformer failures. In this sense, ARIMA models offer a robust statistical framework for time series modeling and can dynamically adapt to changes in gas behavior after failure initiation [36,37,38,56,57].
ARIMA extends the Autoregressive Moving Average (ARMA) model by incorporating differencing to handle non-stationary data. Unlike Markovian stochastic models, where future states depend solely on the current state, ARIMA captures dependencies across both past and present observations [58]. This ability to model autocorrelation differentiates ARIMA from simpler methods like Exponential Smoothing, which may fail to capture long-term sequential dependencies.
An ARIMA model is defined by three parameters: (p, d, q), where p represents the number of autoregressive terms (dependence on past observations), d is the degree of differencing required for stationarity, and q is the order of the moving average component (dependence on past forecast errors) [59]. If seasonal components are considered, the model extends to ARIMA (p, d, q)(P, D, Q)m, where (P, D, Q) defines seasonal parameters and m denotes periodicity [34].
Each component of the ARIMA model serves a specific function: the autoregressive (AR) term models linear dependencies on past values, the integrated (I) component ensures stationarity through differencing, and the moving average (MA) term captures dependencies on past forecast errors, improving adaptability. These elements form a flexible framework, where any unexplained variance is treated as additive noise.
A generalized ARMA model can be expressed as presented by Equation (1),
y t i = 1 p φ i   y t i = ε t + j = 1 q θ j   ε t j
where yt is the time series value at time t, ji are the autoregressive coefficients, qj are the moving average coefficients, and et is white noise. For ARIMA models, differencing is applied using the lag operator L(·), extending Equation (1) to Equation (2), where j(L) is the autoregressive component, given by Equation (3), and q(L) is the moving average component, given by Equation (4) [60].
φ L 1 L d y t = θ L ε t
φ L = 1 i = 1 p φ i   L i
θ L = 1 + j = 1 q θ j   L j
To improve stationarity and adherence to normality and homoscedasticity assumptions, the Box–Cox transformation is applied, as shown in Equation (5):
y t λ = y t λ 1 λ       ,       λ 0 ln y t       ,       λ = 0
where λ is optimized to maximize log-likelihood. Adjustments are made when zero or negative values occur [61], for instance, by adopting the shifted version of the Box–Cox transformation [62]. In DGA forecasting, this transformation is beneficial for stabilizing variance, especially when gas levels exhibit exponential growth or abrupt shifts following fault detection, thereby enhancing the performance of the ARIMA model.
Traditional ARIMA models require manual parameter selection, but self-adaptive ARIMA employs automated hyperparameter optimization, enabling dynamic updates as new data arrive. This adaptability improves real-time forecasting and enhances fault prognosis accuracy. In this framework, the Hyndman-Khandakar algorithm [35] automates model selection through an iterative process (Figure 1). It begins by testing stationarity via unit root tests and applying differencing as needed to determine d. The Autocorrelation Function (ACF) and Partial Autocorrelation Function (PACF) assist in selecting p and q, followed by model evaluation using the corrected Akaike Information Criterion (AICc) [63]. Residual diagnostics are then used to verify that white noise assumptions are met, with parameter refinement applied if autocorrelation persists [58].
Once optimized, the ARIMA model generates forecasts for a defined horizon h beyond the current time t, following Equation (6):
1 L d y τ + h | τ = i = 1 p φ i   L i 1 L d y τ + h | τ + j = 1 q θ j   L j ε τ + h | τ       ;       h q i = 1 p φ i   L i 1 L d y τ + h | τ       ;       h > q
where future residuals et+h|t are unknown, impacting long-term accuracy when h < q. Prediction intervals (e.g., 95% confidence) account for uncertainty, increasing as h extends. If residuals deviate from normality, bootstrap resampling is applied to generate robust intervals under constant variance and uncorrelated assumptions [64,65].
Despite certain limitations, such as sensitivity to stationarity, reliance on linear assumptions, and reduced performance under heteroscedastic or nonlinear conditions [66], ARIMA remains an efficient and interpretable choice for forecasting, especially considering short-term horizons. Due to this efficiency, hybrid models combining ARIMA with nonlinear approaches have been proposed to enhance adaptability while preserving interpretability. Compared to more sophisticated techniques, such as Long Short-Term Memory (LSTM) networks that capture complex temporal dependencies, ARIMA is lightweight and better suited for real-time monitoring in data-limited industrial environments [37,67].

Comparing ARIMA with Alternative Forecasting Methods for DGA

Accurate DGA forecasting is essential for tracking gas accumulation and assessing fault progression. Forecasting models must balance process constraints, reliability, and interpretability. Key requirements include:
  • Autocorrelation: Gas behavior follows degradation patterns rather than random fluctuations. Models must capture autocorrelation caused by thermal decomposition and electrical discharges, as well as structured decreases after remediation [68].
  • Interpretability: As DGA supports diagnosis and maintenance planning, models must provide clear insights into gas trends. Black-box approaches limit transparency and hinder operational adoption.
  • Computational Efficiency: Real-time monitoring demands models with acceptable accuracy and low computational cost. Lightweight statistical methods are often preferable to data-hungry deep learning techniques.
Different methods address these trade-offs with varying success. Holt-Winters exponential smoothing is efficient but fails to model autocorrelation, limiting its utility for DGA [34]. Bayesian Structural Time Series (BSTS) captures dependencies, but adds complexity and interpretability issues [69]. Gaussian Process Regression (GPR) supports probabilistic forecasts but demands large datasets rarely available in practice [70].
Deep learning models, such as LSTMs, GRUs, and Transformers, model complex temporal dependencies but require substantial data and high computational power, making them less suitable for real-time, data-limited environments [71]. While LSTMs often outperform ARIMA in long-term forecasts, ARIMA offers a balanced solution for short-term horizons due to its efficiency and transparency [71]. Recent studies confirm ARIMA’s strong performance in time series forecasting under constraints of limited data and high interpretability [37,38].
Table 2 summarizes these methods based on their ability to model autocorrelation, handle small datasets, provide interpretability, and support real-time applications, highlighting trade-offs between model complexity and operational feasibility. Given these considerations, Self-Adaptive ARIMA offers a well-balanced solution, combining adequate accuracy, transparent interpretation, and low computational cost, making it particularly suitable for real-time DGA forecasting in industrial settings where adaptability and clarity are essential

3. Proposed Framework

The proposed framework comprises eight stages, integrating time-series forecasting and probabilistic fault inference for transformer condition assessment. It supports risk-informed decisions by processing DGA data from acquisition to maintenance planning. Figure 2 provides an overview of the full pipeline.

3.1. Data Collection and Loading

The monitoring process involves the periodic acquisition of dissolved gas concentrations (H2, CH4, C2H2, C2H4, and C2H6) through continuous online sensors. Each data point is time-stamped and stored in a structured database, ensuring traceability for further analysis. Before modeling, the dataset undergoes preprocessing to remove missing values and detect inconsistencies, preserving statistical robustness. To optimize computational efficiency, only the five primary gases and their timestamps are retained.

3.2. Operational Parameter Definition

The framework defines key operational parameters for fault detection, confirmation, and prognosis, ensuring a structured and standardized analysis. These thresholds classify transformers as operating normally, developing faults, or at high failure risk.
Detection thresholds, indicating initial gas concentration anomalies, are derived from IEEE C57.104 [12], IEC 60599 [11], or CIGRE Joint Task Force D1.01/A2.11 [46]. Typically associated with Typical Gas Concentration (TGC) values, these thresholds represent the maximum acceptable gas levels, exceeded by only 10% of transformers, prioritizing maintenance for the most at-risk units [11].
Confirmation thresholds, which determine whether an anomaly evolves into a fault, are based on Typical Rates of Gas Increase (TRGI), ensuring consistency across regulatory standards. For failure prognosis, the Pre-Failure Gas Concentration (PFGC), defined by CIGRE [46], identifies transformers at high risk over future time horizons. While CIGRE also introduces Alarm Gas Concentration (AGC) values, these are not considered in this framework. A comprehensive reference for threshold values and degradation limits is provided by Duval (2008) [15], facilitating comparative analysis and improving risk assessments.
Beyond these thresholds, additional parameters must be defined to support the forecasting and diagnostic procedures. The analysis window (w), which determines the number of past observations used for time series modeling, should be selected to ensure statistical robustness while minimizing the influence of short-term fluctuations. Its length may vary depending on data resolution and fault progression dynamics. In practice, windows of around 50 observations or more provide a useful estimate of the autocorrelation function, which is a fundamental step in ARIMA model identification [72]. The projection window (h) defines the forecasting horizon for failure prognosis and predictive maintenance planning. Confidence levels, typically set at 95%, establish probabilistic bounds, supporting a balanced assessment of false positives and false negatives in transformer fault evaluation.

3.3. Fault Detection

Anomalies are detected when gas concentrations exceed predefined detection thresholds. Upon the first occurrence of an abnormal reading, the system records the detection time (tdetection) and triggers an alert for further verification.

3.4. Fault Confirmation

Fault confirmation is conducted by analyzing gas growth rates over time. A fault is confirmed when the growth rate exceeds the confirmation threshold, defined by the TRGI for each gas. Since TRGI values are specified on an annual basis, whereas data collection typically occurs hourly, appropriate scaling is necessary.
Dividing the annual TRGI by 8766 (the number of hours in a year) provides an hourly equivalent, but applying this to single-step comparisons yields thresholds that are too narrow, resulting in unreliable assessments. To address this, fault confirmation leverages the same analysis window (w) defined in Section 3.2, aggregating multiple observations and computing a more stable estimate of gas growth.
Upon detecting an anomaly, the system retrospectively evaluates the last (w) observations before tdetection. If at least one gas exhibits a growth rate exceeding its adjusted TRGI value, the fault is confirmed, and the confirmation time is recorded as tconfirmation. If no fault is detected, the transformer enters enhanced monitoring mode, where each new observation triggers a re-evaluation until confirmation occurs.
Once a fault is confirmed, the reference time (τ) for gas concentration forecasting is established, as defined in Equation (7), and the monitored gas concentration data are extrapolated for further analysis. This definition ensures that forecasting begins only after both fault detection and confirmation have occurred, thereby preventing premature extrapolation from unverified signals and providing a reliable basis for prognostic analysis.
τ = max t detection , t confirmation

3.5. Data Forecast

Once a fault is confirmed, ARIMA models are employed to forecast gas concentration trends. The process begins by selecting a historical data window (w) extending backward from the reference fault time τ.
A Box–Cox transformation is applied when necessary to stabilize variance and improve normality, optimizing λ to maximize likelihood. The Hyndman-Khandakar algorithm then automates ARIMA model selection, incorporating unit root tests, differencing operations, and AICc-based model evaluation. A stepwise search method [73,74] ensures computational efficiency while exploring optimal parameter configurations.
Once the model is defined, it is trained using data up to t, followed by residual diagnostics to confirm adherence to white noise properties and absence of autocorrelation. The Shapiro–Wilk test [75] evaluates normality, ensuring the residuals fit an expected distribution. If residuals deviate from normality, corrective measures such as Box–Cox transformations are applied to improve forecast suitability. The transformed residuals are then used to define kernel density distributions for probabilistic forecasting of gas concentrations.
Additional diagnostic tests ensure robustness:
  • The Ljung–Box test [76,77] verifies residual independence, confirming that the ARIMA model effectively captures temporal dependencies. The number of lags is set to w to maintain diagnostic robustness without overfitting.
  • The Lagrange Multiplier (LM) test for ARCH [78] assesses variance stability, identifying potential heteroskedasticity or volatility clustering. The number of lags is also set to w , consistent with the Ljung–Box test.
Once validated, the ARIMA model forecasts gas concentration values over the defined horizon h, ensuring accuracy in industrial applications. The forecasting process adapts to different time horizons:
  • For short-term horizons (hq), predictions incorporate past residuals and autoregressive terms, capturing short-term deviations.
  • For longer horizons (h > q), the model relies exclusively on autoregressive components, iteratively using prior forecasts as inputs to generate subsequent values.
For non-stationary series (d < 0), inverse differencing restores forecasts to the original scale, preserving consistency with historical data. Confidence intervals (typically 95%) quantify uncertainty, with prediction ranges widening over extended horizons. These intervals support maintenance planning, providing probabilistic bounds within which actual concentrations are expected to fall.
If a Box–Cox transformation was applied during training, forecasts are adjusted back to the original scale using Equation (8).
y ^ t = λ y ^ t λ + 1 1 λ       ,       λ 0 exp y ^ t λ       ,       λ = 0
where y ^ t λ is the forecasted value in the transformed space. However, since this naïve retransformation can lead to systematic underestimation in the presence of residual variance, bias-corrected versions are recommended.
Specifically, for λ = 0 (log transformation), bias correction via residual variance prevents systematic underestimation (Equation (9)).
y ^ t = e x p l o g y ^ t λ + 1 + σ ε 2 2
where σ ε 2 is the residual variance.
For λ ≠ 0, a Taylor expansion adjustment (Equation (10)) accounts for nonlinearity.
y ^ t = λ y ^ t λ + 1 1 λ e x p λ 2 σ ε 2 2
These corrections ensure that forecasts accurately represent the mean in the original space, particularly in high-variance scenarios, mitigating systematic biases. In practice, Equation (8) is rarely used alone; bias-corrected forms (Equations (9) and (10)) are recommended whenever residual variance is non-negligible.
Following the ARIMA-based forecast, probabilistic distributions of gas concentrations are constructed for each time step in h, aligning residual kernels with ARIMA projections. If normality is not confirmed, the Box–Cox transformation is reapplied, using the same optimized λ parameter from the ARIMA model selection. While this transformation does not guarantee normality, it ensures that the residuals are statistically suitable for kernel density estimation, aligning them with the ARIMA forecasts. If normality is confirmed, the residuals are used directly.
To align the residual kernel with ARIMA projections, key statistical parameters—including the mean and quantiles (e.g., 2.5% and 97.5%) of the ARIMA forecasts—are computed at each time step within h, defining the lower and upper bounds of the confidence interval. The residual distributions are then scaled and translated to match these characteristics, ensuring they accurately capture both uncertainty in residuals and the forecast dynamics of gas concentrations over time.
For a given Confidence Level (CL), the Lower Bound (LB) and Upper Bound (UB) of the forecast interval over h steps are computed using Equations (11) and (12). When CL = 95%, these correspond to Q2.5% and Q97.5% of the ARIMA forecast distribution.
L B = Q 1 CL 2
U B = Q 1 + CL 2
The adjustment process includes two key transformations:
  • Scaling to match the spread of the ARIMA projections, where the scale factor (SF) at time t is given by Equation (13).
S F t = U B ARIMA , t L B ARIMA , t U B Residuals L B Residuals
where UBARIMA,t and LBARIMA,t represent the upper and lower quantiles of the ARIMA projections at time t, and UBResiduals and LBResiduals are the corresponding quantiles from the residual kernel distribution.
  • Translation to align the residual kernel’s mean with the ARIMA forecast mean at each t, using the translation factor (TF) from Equation (14).
T F t = Mean ARIMA , t Mean Residuals S F t
where MeanARIMA,t is the mean of the ARIMA forecast at time t, and MeanResiduals is the mean of the residual kernel.
The final adjusted kernel distribution at t is obtained using Equation (15), ensuring that it accurately captures both residual variability and ARIMA forecast trends. These adjusted distributions form the basis for failure probability estimation and probabilistic fault classification using Duval’s methods.
Adjusted   Kernel t = Residual   Kernel S F t + T F t

3.6. Fault Prognosis

After forecasting gas concentrations, failure prognosis estimates the probability of each monitored gas exceeding its Pre-Failure Gas Concentration (PFGC) over the forecast horizon h. Unlike deterministic approaches based solely on projected mean values, this methodology employs a probabilistic assessment, leveraging the adjusted kernel distributions constructed in the previous step.
At each future time step, failure probability is calculated as the likelihood that the gas concentration surpasses its PFGC threshold. This probability is derived using the Empirical Cumulative Distribution Function (ECDF), evaluated at the PFGC value for each gas, as given by Equation (16):
P failure 1 F ^ PFGC
where F ^ PFGC represents the ECDF evaluated at the PFGC value of the respective gas.
For each time step within h, this calculation involves:
  • Generating the ECDF from the adjusted kernel distribution at time t.
  • Evaluating the ECDF at the predefined PFGC value.
  • Computing the complement of the ECDF to determine the probability of gas concentration exceeding PFGC.
This process generates a probabilistic failure risk profile, enabling a time-based reliability assessment of the transformer. Failure probabilities are stored alongside their corresponding forecasted time points, ensuring robust estimates by aligning the residual kernel with ARIMA projections. By incorporating uncertainty and temporal dependencies, this approach supports data-driven predictive maintenance and failure mitigation strategies.

3.7. Fault Diagnosis

Transformer fault diagnosis is performed using a probabilistic extension of Duval’s methods [14,16,17,44], classifying failure modes based on relative gas concentrations. Unlike traditional approaches that rely on fixed thresholds, this framework treats gas concentration ratios as random variables, employing Monte Carlo simulations to estimate probability distributions of fault types over time.

3.7.1. Duval’s Triangles Probabilistic Analysis

Duval’s Triangles classify failure types based on three-gas ratios, with each triangle using different gas combinations, as previously outlined. Given the probabilistic nature of gas forecasts, these ratios are treated as random variables, ensuring that uncertainty in gas concentrations is incorporated into the diagnostic process.
For each time step t, the normalized gas proportions Ri,t are computed as given in Equation (17):
R i , t = C i . t C 1 . t + C 2 . t + C 3 . t
where Ci,t represents the concentration of gas i at time t. These proportions satisfy the constraint in Equation (18), ensuring interdependence among the three gas ratios. Since one proportion directly determines the others, assuming independence would lead to incorrect probability estimates. Therefore, a joint probability approach must be used to accurately compute the likelihood of failure modes.
R 1 . t + R 2 . t + R 3 . t = 1
Each Monte Carlo iteration generates a set of gas proportions, forming probability distributions at each time step. The failure mode probability PF,t is then computed as the likelihood that a sampled gas proportion satisfies the classification boundaries of a given failure mode. Mathematically, this is determined using the Joint Empirical Cumulative Distribution Function (JECDF), as given in Equation (19):
P F , t F ^ R 1 , t , R 2 , t , R 3 , t I 1 , I 2 , I 3
where F ^ R 1 , t , R 2 , t , R 3 , t I 1 , I 2 , I 3 represents the JECDF of the gas proportions, and Ii denotes the classification boundaries for each failure mode within the Duval Triangle model.
A complete formulation of failure mode probabilities for Duval’s Triangles, considering gas-specific classification regions, is provided in Appendix A, ensuring a rigorous probabilistic fault classification approach.

3.7.2. Duval’s Pentagons Probabilistic Analysis

The probabilistic approach used for Duval’s Triangles extends to Duval’s Pentagons, which classify failure modes based on five gas proportions: H2, CH4, C2H2, C2H4, and C2H6. These pentagon-based models provide greater differentiation for low-energy discharges, high-temperature thermal faults, and carbonization effects, introducing an additional dimension to the diagnostic space.
Unlike the triangle-based analysis, which directly compares gas concentrations to threshold limits, the pentagon method maps gas concentrations onto a two-dimensional coordinate system. This transformation allows failure modes to be evaluated based on geometric regions within the pentagon space.
For each time step t, the coordinates (xg,t, yg,t) for each gas g are computed using Equations (20) and (21):
x g , t = 100 R g , t cos θ g
y g , t = 100 R g , t sin θ g
where Rg,t is the normalized gas proportion (Equation (22)), and θg is the angular position of gas g in the pentagon, defined in Table 3.
R g , t = C g , t i = 1 N C i , t
where Cg,t represents the concentration of gas g, and N = 5 is the total number of gases considered in the pentagon model.
Once gas proportions are mapped, a Monte Carlo simulation estimates the probabilistic distribution of the centroid position within the pentagon. For each iteration, gas concentrations are sampled using empirical kernel density estimates, and their coordinates are computed using the transformation equations.
The pentagon is divided into four triangular subregions, preserving the gas order defined in Table 3 (H2 → C2H6 → CH4 → C2H4 → C2H2). The area of each sub-triangle is computed using the determinant method (Equation (23)), and the total pentagon area is calculated in Equation (24):
A i = x i y i + 1 x i + 1 y i         ,           for     i = 1 , 2 , 3 , 4
A total = 1 2 i = 1 4 A i
where the last gas (i = 5) wraps around to connect with i = 1.
The centroid coordinates (Cx, Cy) are then computed using Equations (25) and (26):
C x = 1 6 A total i = 1 4 x i + x i + 1 A i
C y = 1 6 A total i = 1 4 y i + y i + 1 A i
This ensures that the centroid’s position accounts for the relative gas proportions in a statistically robust manner.
After computing the probabilistic distribution of (Cx, Cy), fault classification is performed by determining the probability that the centroid falls within predefined fault regions in Duval’s Pentagon. Unlike deterministic classification, this probabilistic approach captures interdependencies between centroid coordinates, which arise due to gas proportion normalization.
To model these correlations accurately, the probability of a failure mode occurring at time step t is computed using the JECDF, as given in Equation (27):
P F , t F ^ C x , t , C y , t I x , I y
where F ^ C x , t , C y , t I x , I y represents the JECDF of the gas proportions, and Ix and Iy are the classification region boundaries for each Duval Pentagon model.
The complete formulation of failure mode probabilities for Duval’s Pentagons, considering gas-specific classification regions, is provided in Appendix A.

3.8. Decision Support and Maintenance Planning

Accurate DGA interpretation is crucial for assessing power transformer health, requiring structured fault classification that accounts for measurement uncertainty and probabilistic centroid estimation. To ensure reliable diagnosis, Duval’s graphical models follow a hierarchical approach, applied in the following sequence:
  • Primary Fault Diagnosis (Duval’s Triangle 1 & Pentagon 1)—Identifies six fundamental failure modes:
    PD: Partial discharges (corona);
    D1: Low-energy discharges;
    D2: High-energy discharges;
    T1: Low-temperature thermal faults (<300 °C);
    T2: Intermediate-temperature thermal faults (300–700 °C);
    T3: High-temperature thermal faults (>700 °C).
If the diagnosis is clear and consistent, further analysis is not required.
2.
Refinement for Mild Overheating (Duval’s Triangle 4 & Pentagon 2)—Applied if T1, T2, or PD is detected to differentiate:
S: Stray gassing (<200 °C);
O: Overheating of paper or oil (<250 °C, without significant degradation);
PD: Specific types of partial discharges.
This step helps exclude misleading diagnoses in cases of mild overheating.
3.
Refinement for High-Temperature Faults (Duval’s Triangle 5 & Pentagon 2)—Applied if T2 or T3 is detected to distinguish:
C: Paper carbonization (>300 °C);
T3-H: High-temperature thermal fault affecting only the insulating oil.
This step ensures the correct differentiation between failures involving solid insulation degradation and those occurring exclusively in the insulating fluid.
4.
Convergence Analysis & Multiple Fault Detection—After applying graphical models, results are assessed for consistency:
Single failure: If all models agree on the same failure mode.
Multiple failures: If discrepancies exist between triangles and pentagons, requiring further analysis of centroid distributions and interactions.
Once failure modes are identified, their probabilities are estimated, moving beyond deterministic classification to probabilistic assessment. Based on failure probability, different maintenance strategies are adopted:
  • Low probability: Predictive maintenance with continued DGA monitoring.
  • Moderate probability: Transformer inspections, visual assessments, and potential offline testing.
  • High probability: Corrective actions, including component replacement or scheduled shutdown.
The final decision considers transformer criticality within the power grid and the dissolved gas evolution rate, as outlined in IEC 60599 [11,42]. The combined use of Duval’s Triangles and Pentagons enhances failure prediction accuracy, providing a robust framework for transformer maintenance and operational decision-making.

4. Validation and Results

The proposed probabilistic framework is validated through two case studies based on real failure data. Gas concentration values were extracted from graphical representations and discretized into hourly intervals to create structured datasets for modeling and fault assessment. Since the datasets were already preprocessed, the application of the framework begins at the Operational Parameter Definition stage. Additionally, while the analysis in both cases is conducted retrospectively, considering all available observations from the dataset’s start to the fault confirmation point, in industrial settings, the framework would operate sequentially, using only the data available at each time step as new measurements are acquired.
The first case investigates gas losses in an experimental oil tank, showing how diffusion in free-breathing systems can distort DGA-based classification. The second involves a thermal fault in a 345 kV transformer, with paper insulation carbonization confirmed after shutdown.
In the case study, all available observations are analyzed retrospectively, from the dataset’s beginning up to the fault detection and confirmation points. In practice, however, industrial applications would perform confirmation sequentially, using only the data available at each time step as new measurements are acquired.

4.1. Case Study 1: Gas Losses in an Experimental Oil Tank Setup

The first case study analyzes gas loss mechanisms in an experimental oil tank setup, as reported by Riedmann (2023) [41]. It examines dissolved gas behavior in free-breathing transformer systems, where gas diffusion affects DGA-based fault classification. The study identifies an electrical arcing fault, characterized by significant C2H2 generation, highlighting how gas losses impact fault interpretation and condition assessment. Gas concentrations were recorded over the last 80 h of operation before the shutdown, as presented in Figure 3.
The first step in the case analysis involves defining the operational parameters used for detection and confirmation. Thresholds were based on Level 2 limits for gas concentration (ppm) and gas increase rate (ppm/year), representative of an average CIGRE/IEC power transformer. Pre-failure gas concentration limits (ppm) were also defined according to CIGRE standards [15], as summarized in Table 4.
The analysis window (w) was set to 48 observations, corresponding to 48 h of historical data. This choice balances statistical robustness and responsiveness: it provides enough data points to allow reliable ARIMA identification, close to the guideline of approximately 50 observations commonly recommended for a useful estimate of the autocorrelation function [72], while remaining short enough to capture relevant variations in gas behavior without introducing excessive lag.
The projection window (h) was set to 32 h, constrained by the available data and the timing of fault detection within the dataset.
Regarding fault detection, the first gas to exceed the concentration threshold was C2H2, at time step 15. For fault confirmation, the first gas to surpass the rate-of-increase threshold was also C2H2, at time step 48. Thus, the reference time for failure prognosis was established at step 48. As discussed in Section 3.4, fault confirmation relies on the analysis window (w) to mitigate distortions introduced when converting annual gas growth rates into hourly equivalents.
Once the fault is confirmed, the framework proceeds to the Data Forecast stage. Due to data limitations, the ARIMA model was trained starting at time step 48, using a single reference point for forecasting. Table 5 presents the selected ARIMA parameters and results of residual diagnostics.
The ARIMA model parameters comprise the order (p, d, q) and the Box–Cox transformation parameter (λ), employed to stabilize variance. Model adequacy is assessed by the log-likelihood, where higher values denote superior fit, and by the Root Mean Square Error (RMSE), where lower values indicate greater forecasting accuracy. Residual properties are further examined through statistical tests: the first-order autocorrelation (ACF1), expected to approximate zero for white noise; the Shapiro–Wilk test, where p-values above 0.05 confirm residual normality; the Box-Ljung test, where high p-values indicate the absence of significant autocorrelation; and the ARCH LM test, where high p-values suggest homoscedasticity, while low values reveal the presence of volatility clustering.
The ARIMA models exhibit varying levels of complexity and performance across the monitored gases. H2 and CH4 include drift or a non-zero mean, reflecting underlying trends, while C2H2 requires second-order differencing (d = 2), indicating greater non-stationarity. The Box–Cox transformation (λ) is close to 1 for H2, suggesting near-normal behavior, whereas CH4 has the lowest λ (0.07754), requiring a stronger transformation to stabilize the variance.
In terms of model fit, C2H2 achieves the highest log-likelihood (–31.22), indicating a better fit, while CH4 presents the lowest (–152.4), suggesting greater noise or structural shifts. Forecasting accuracy, assessed through RMSE, is lowest for CH4 (0.3077), indicating a more predictable trend, whereas C2H4 exhibits the highest RMSE (6.9917), reflecting greater uncertainty in predictions.
Residual diagnostics confirm that C2H2 has well-behaved residuals (ACF1 = −0.0123), while CH4 shows residual correlation (ACF1 = 0.2443). The Shapiro–Wilk test confirms normality for all gases except CH4, reinforcing the need for variance stabilization in methane forecasts. The Box-Ljung test indicates no significant autocorrelation, confirming that the models sufficiently capture temporal dependencies.
The ARCH LM test suggests mild heteroscedasticity for C2H6 (p-value = 0.04912) and C2H4 (p-value = 0.04526), though values remain above 0.01, making the models statistically valid but requiring careful interpretation. Overall, the models effectively capture gas concentration trends, supporting predictive analysis, though minor limitations in CH4 and variance instability in C2H6 and C2H4 should be considered in practical applications.
Figure 4 presents the gas concentration forecasts generated by the ARIMA models. Dots represent the training data, small squares indicate the mean forecasts, and shaded areas denote the prediction intervals. The original dataset is shown as a solid line for comparison.
After forecasting, the analysis proceeds to Fault Prognosis, estimating the probability of failure within 32 h, and Fault Diagnosis, assessing fault type likelihood. Figure 5 displays the forecasted gas concentrations, failure thresholds, and associated probabilities. Since none of the gases exceed CIGRE’s pre-failure limits [15], the predicted failure probability is zero. Figure 6 and Figure 7 present the fault classification results using Duval’s Triangle and Pentagon analyses.
Duval Triangle 1 (Figure 6a,b) indicates that the D1 failure mode (low-energy electrical discharge) remains at 100% probability, while T1 (low-temperature thermal fault) also appears with a significant probability (>60%) throughout most of the forecast horizon. The probability of D2 (high-energy discharge) remains marginal, suggesting that a severe discharge is unlikely.
Given the presence of T1, further assessment using Duval Triangle 4 was conducted, but no substantial probability of fault evolution was detected (Figure 6c,d), reinforcing the absence of a developing thermal fault trend. Although Triangle 5 is not typically used in this scenario, its projections suggest a notable probability of T2 (intermediate-temperature thermal fault) and C (carbonization of insulation), as presented in Figure 6e,f.
Duval Pentagon 1 (Figure 7a–c) confirms D1 and D2 as dominant failure modes, both reaching 100% probability. Since Pentagon 1 alone is sufficient to establish a diagnosis in this scenario, Pentagon 2 (Figure 7a,b,d) serves only as a confirmation, rather than introducing new insights. According to best practices, similar failure types should be interpreted collectively, considering the hierarchical severity of failure modes. The most probable diagnosis is a D2 fault (high-energy discharge), followed by D1 (low-energy discharge), aligning with probabilistic trends across the different Duval methodologies.
The results are consistent with Riedmann (2023) [41], who observed that gas loss mechanisms in free-breathing transformers could alter fault classifications over time, impacting condition assessments. Their study highlighted a transition in Duval Triangle 1 from D1 to D2 after 57 h, reinforcing the importance of time-dependent dissolved gas diagnostics. The current analysis corroborates these findings, as D1 remains dominant throughout the forecast period, with T1 maintaining a high probability. However, unlike Riedmann’s study, no significant shift to D2 was detected within the analyzed time horizon when using triangles alone. The Pentagon analysis, however, suggests an early indication of D2, revealing a possible fault evolution trend not immediately apparent in Triangle-based assessments.
Additionally, Duval Triangle 4 (Figure 6c,d) does not indicate strong thermal fault evolution, countering the hypothesis that gas loss effects lead to misinterpretation of thermal faults due to differential degassing rates. The combined analysis of Triangle 4 and Pentagons suggests that the fault is more likely an evolving electrical arc rather than a developing thermal fault. While Triangle 5 indicates possible T2 and C faults, these results should not be considered in this case, following best-practice classification guidelines.
Riedmann [41] also emphasized that gas loss effects could distort Duval Triangle classifications, potentially masking failure modes. While the present study does not focus explicitly on gas loss corrections, the Pentagon-based fault evolution analysis confirms that D1 and D2 remain the most probable mechanisms, reinforcing the robustness of the classification approach.
This highlights a fundamental advantage of the proposed probabilistic framework: even in the presence of potential masking effects, the method successfully predicts fault evolution, quantifies uncertainties, and provides a comprehensive assessment of equipment health. Unlike deterministic approaches that classify faults solely based on observed data, this method enables early fault detection, enhances risk assessment, and supports proactive decision-making. However, its application requires careful evaluation by engineers and technical teams, as demonstrated in this case study.

4.2. Case Study 2: Thermal Fault in a 345 kV Power Transformer

This case study evaluates a thermal fault in a 345 kV power transformer, based DGA monitoring data, as reported by Riedmann (2022) [79]. The fault was initially detected through abnormal increases in C2H4 and CH4. Following the shutdown and internal inspection, carbonization of the paper insulation was confirmed.
Gas concentration data were collected over the last 165 h before shutdown (Figure 8). The preprocessed dataset follows the same concentration and rate-of-increase thresholds as in Case Study 1 (Table 4). The analysis window (w) includes 48 observations to maintain consistency and modeling stability, while the forecast window (h) spans 24 observations, representing a one-day horizon for short-term failure risk assessment.
CH4 was the first gas to exceed its detection threshold at time step 0, with fault confirmation occurring 103 h later. This delay is explained by CH4 concentrations already surpassing pre-failure thresholds from the first observed time step and remaining stable until approximately time step 100, when all gases except C2H2 showed a concentration increase.
With the fault confirmed, data forecasting was performed. Unlike the first case study, this analysis considers two reference time steps: 103 and 139, allowing for an additional evaluation 36 h after fault confirmation. This approach highlights the framework’s ability to track fault evolution over time.
Following the same methodology as Case Study 1, Table 6 and Table 7 present the ARIMA model parameters and residual diagnostics for each monitored gas at τ = 103 and τ = 139, respectively.
At time step 103, the ARIMA models applied to H2, C2H6, C2H4, and C2H2 follow a (0,0,0) structure with a non-zero mean, indicating that these gas concentrations remain relatively stable within this time window. In contrast, CH4 follows a (1,1,0) model, incorporating a first-order autoregressive term and differencing, suggesting some temporal dependency.
Variance stabilization, assessed via Box–Cox λ varies significantly across gases. C2H6 and C2H4 have λ = 1.9999, suggesting that the original data required minimal transformation. Conversely, CH4 and H2 have λ = −0.8999, indicating the need for stronger variance adjustments.
In terms of model fit, CH4 achieves the highest log-likelihood (399.41), suggesting a good representation of its concentration dynamics, while C2H4 (–414.05) presents the lowest value, indicating a weaker model fit. RMSE values confirm that CH4 exhibits the highest forecasting error (17.9064), whereas C2H2 achieves the lowest (0.3304).
Residual diagnostics reveal mixed results. ACF1 values indicate strong autocorrelation in H2 (0.8160) and C2H4 (0.8359), suggesting some remaining temporal dependencies. Normality tests confirm that C2H6 is the only gas with normally distributed residuals, while all others deviate from normality.
The Box-Ljung test confirms that all models adequately capture residual dependencies, with p-values above 0.05, indicating no significant autocorrelation. However, the ARCH LM test identifies heteroscedasticity in CH4 (p-value = 0.0092), suggesting variance instability (which may impact long-term forecasting reliability), while other gases maintain stable variance.
At time step 139, the models show increased complexity, incorporating higher-order differencing and autoregressive components, reflecting evolving gas concentration dynamics. All gases require second-order differencing (d = 2), emphasizing a stronger non-stationarity compared to τ = 103.
Box–Cox λ values shift, with H2 (0.7301) and CH4 (0.3797) requiring moderate variance transformations, whereas C2H2 (1.6785) is closer to normality. C2H4 (−0.2405) exhibits a more extreme transformation need, suggesting greater variance instability.
In terms of model fit, C2H4 achieves the highest log-likelihood (231.76), while H2 presents the lowest (–20.48). RMSE values indicate CH4 (5.5989) maintains a higher error rate, although reduced compared to τ = 103, while C2H2 (0.1683) exhibits the lowest error.
Residual diagnostics show a slight reduction in autocorrelation compared to the earlier reference point, with ACF1 values stabilizing. However, normality tests indicate that none of the gases follow a normal distribution, reinforcing the need for further variance stabilization.
The Box-Ljung test suggests adequate residual independence for most gases, except for C2H4 (p-value = 0.02489), indicating possible autocorrelation effects. Additionally, the ARCH LM test detects heteroscedasticity in C2H6 (p-value = 0.02719) and C2H4 (p-value = 9.944 × 10−7), indicating significant variance instability.
Overall, the results indicate that gas concentration trends evolve, requiring more complex ARIMA models and greater differencing at τ = 139. The improvement in RMSE for CH4 suggests a better adaptation in the time-series behavior. However, the increase in heteroscedasticity and residual dependencies at later time steps emphasizes the importance of uncertainty quantification in predictive maintenance strategies.
Following the same methodology as before, data forecasting was performed at two reference time steps (τ = 103 and τ = 139), allowing for a comparative assessment of forecast evolution over time. The ARIMA model predictions for each gas are shown in Figure 9, where dots represent training data, small squares indicate mean forecasts, and shaded areas denote confidence intervals. The original dataset is displayed as a solid line for reference, with (a) corresponding to τ = 103 and (b) to τ = 139.
Next, the analysis proceeded to Fault Prognosis, estimating the probability of transformer failure at both reference time steps. Figure 10 presents the forecasted values and pre-failure thresholds for each gas, along with the computed failure probability over the h-window (24 h). A key observation in this case study is that CH4 concentrations exceeded the pre-failure threshold from the first recorded time step. Consequently, at τ = 103, the failure probability was already close to 100%, primarily driven by CH4. By τ = 139, C2H4 concentrations also increased significantly, triggering a failure condition due to both gases simultaneously.
For Fault Diagnosis, the Duval Triangle and Pentagon analyses were conducted at both reference time steps, identifying the evolution of failure modes over time. The Duval Triangle analysis results are presented in Figure 11 and Figure 12, corresponding to τ = 103 and τ = 139, respectively. Similarly, the Duval Pentagon analysis results are detailed in Figure 13 and Figure 14 for the same time steps.
The probabilistic analysis of Duval’s Triangles and Pentagons in the second case study reveals key insights into the progression of transformer faults over time, demonstrating how failure modes evolve dynamically rather than remaining fixed as in deterministic approaches. By quantifying uncertainty and interdependencies, this framework offers a more comprehensive diagnostic perspective compared to traditional methods.
At τ = 103, Duval Triangle 1 (Figure 11a,b) identifies D2 (high-energy electrical discharge) as the dominant fault mode (~100% probability), with DT (combined electrical and thermal faults) appearing at lower probability levels. Simultaneously, T2 (thermal fault between 300–700 °C) is detected at 100% probability until t = 116, after which T3 (thermal fault > 700 °C) becomes predominant throughout the forecast horizon. This trend indicates a progressive increase in thermal severity, consistent with insulation material degradation under sustained overheating conditions.
Given the strong presence of T3, further evaluation using Duval Triangles 4 and 5 was conducted. Triangle 4 (Figure 11c,d) suggests a high probability of C (carbonization of paper) and S (gas loss from mineral oil), with a minor probability of O (overheating < 250 °C) emerging at t = 116. Triangle 5 (Figure 11e,f) further reinforces this trend, maintaining a 100% probability of C throughout the entire period, while T3 exhibits a clear increasing trend from t = 116 onward. These findings confirm progressive thermal degradation leading to paper carbonization, suggesting long-term structural damage to the insulation system.
The Duval Pentagon analysis at τ = 103 supports these conclusions. Pentagon 1 (Figure 13a–c) confirms a 100% probability for T1 (thermal fault < 300 °C) and T2, with T3 appearing at lower probability levels, indicating thermal stress escalation. Meanwhile, Pentagon 2 (Figure 13a,b,d) shows 100% probability for both C and O, reflecting severe degradation of both insulation materials and oil stability.
By τ = 139, notable shifts in fault probabilities emerge (Figure 12b). D2 probability declines, while DT increases slightly, suggesting a gradual transition in fault characteristics. However, T2 and T3 remain stable, confirming ongoing thermal degradation mechanisms. At this stage, Triangle 4 (Figure 12c,d) no longer indicates significant probabilities for additional failure types, while Triangle 5 (Figure 12e,f) maintains 100% probability for C until t = 152, after which a slight decline occurs, coinciding with a rising probability of T3. This pattern suggests that while carbonization remains the primary failure mode, severe thermal degradation (T3) continues to intensify.
The Duval Pentagon results at τ = 139 reinforce these observations (Figure 14). Pentagon 1 still confirms 100% probability for T1 and T2, but T3 experiences a significant rise, indicating increasing thermal stress and insulation deterioration. Meanwhile, Pentagon 2 maintains 100% probability for C and O, with T3-H (thermal fault affecting oil only) emerging from t = 152 onward, signaling progressive oil degradation as an additional failure mechanism.
These results align with Riedmann (2022) [79], which classified the observed failure as a thermal fault confirmed by post-shutdown inspection, revealing carbonization of paper insulation. The current study corroborates this conclusion, as the probabilistic approach successfully predicts fault progression (T1 → T2 → T3), capturing insulation degradation trends over time.
A key distinction between the proposed method and deterministic approaches lies in its ability to forecast fault evolution and quantify uncertainty. While Riedmann’s analysis relied on post-failure assessments, the probabilistic framework anticipates insulation degradation trends and the increasing probability of severe faults before transformer shutdown. This capability is crucial for early maintenance planning, enabling proactive risk mitigation instead of reactive fault detection.
As seen in the gas concentration forecasts, uncertainty increases over time, leading to greater variability in fault probability estimates. This effect is particularly pronounced in longer forecast horizons, where fault probabilities fluctuate due to cumulative model uncertainties and increased gas concentration dispersion. The residual behavior observed in Table 6 and Table 7, particularly the presence of heteroscedasticity and autocorrelation, reinforces this observation, as indicated by the ARCH LM test results confirming time-dependent variance for certain gases. These findings highlight the importance of carefully interpreting long-term predictions, recognizing that model confidence decreases as the forecast horizon extends.
Despite this, the probabilistic approach remains superior to deterministic methods, as it enables a dynamic assessment of evolving failure risks rather than relying on fixed classification thresholds. By quantifying uncertainty, this framework provides a more realistic representation of transformer conditions, ensuring that maintenance decisions incorporate both fault likelihoods and confidence levels in those assessments.

4.3. Model Performance and Predictive Reliability

The performance of the proposed probabilistic framework was evaluated through two documented case studies drawn from the literature, which serve as validated benchmarks for fault diagnosis and gas behavior in power transformers. These benchmarks, presented by Riedmann et al. [41,79], offer well-characterized scenarios involving thermal degradation and controlled electrical discharges under varying operational conditions.
In both case studies, the framework achieved 100% accuracy in fault type classification when compared to the diagnoses originally reported by Riedmann. Moreover, all observed gas concentrations fell within the 95% confidence intervals generated by the ARIMA-based forecasts, as illustrated in Figure 4 and Figure 10. This confirms that the models provide statistically consistent and reliable predictions, even in the presence of nonstationarity and short time series.
To evaluate forecast performance beyond visual agreement, residual diagnostics were conducted to assess the quality and stability of the ARIMA models. While some gases showed signs of autocorrelation and variance instability, especially at later forecast horizons, the models still captured the overall trends effectively. The presence of residual autocorrelation and variance instability observed in specific cases suggests that some gases exhibit more complex temporal behaviors as faults evolve.
Still, the forecast envelopes proved resilient, preserving predictive reliability even under departures from ideal statistical conditions. This indicates that the proposed framework provides robust forecasts with well-calibrated uncertainty bounds. It remains effective even when statistical assumptions are partially violated, offering a practical and interpretable tool for supporting early fault detection and maintenance planning.
An additional strength of the proposed approach is its capacity to anticipate fault behavior well before the end of the observation period, a critical feature for operational decision-making in asset management. In the first case study, derived from Riedmann (2023) [41], data were available up to time step t = 80, but the full diagnostic and prognostic analysis was performed at t = 48. This implies a 32-step look-ahead window, during which the framework maintained consistent classification of the D1 fault mode and detected early probabilistic signals of D2 escalation.
Notably, Riedmann’s own analysis observed a transition from D1 to D2 only after 57 h, reinforcing the importance of time-dependent DGA interpretation and highlighting the benefit of early detection. The proposed method was thus able to identify the dominant failure mode and anticipate a potential escalation earlier than the transition point reported in the benchmark study.
In the second case study, based on Riedmann (2022) [79], measurements extended up to t = 163, but the model conducted diagnostic assessments at two earlier points: τ = 103 and τ = 139. These points were chosen to evaluate the progression of thermal degradation, and the framework successfully identified the evolution from T1 to T3 faults, along with early signs of carbonization (C) and oil degradation (O), well before the final shutdown and post-failure inspection.
The prediction at τ = 103 revealed early indicators of severe thermal behavior, while the update at τ = 139 confirmed the intensification of degradation mechanisms and the emergence of oil-related fault signatures. This anticipatory capability supports risk-based maintenance strategies and aligns with IEC and IEEE recommendations on continuous monitoring and early intervention.
The diagnostic module integrates probabilistic versions of Duval’s Triangles and Pentagons, which are widely accepted tools in transformer fault analysis. By replacing hard boundaries with probabilistic classification zones, the framework accounts for both measurement uncertainty and overlapping fault signatures.
This is particularly relevant in ambiguous conditions, such as partial discharges co-occurring with thermal effects, or scenarios affected by differential gas loss rates, where deterministic classification may oscillate or fail to produce consistent results. In contrast, the probabilistic model maintains stability by distributing fault likelihood across multiple potential classes and tracking their evolution over time.

5. Conclusions

This study introduced a probabilistic framework for transformer fault detection, diagnosis, and risk-based prognosis, integrating self-adaptive ARIMA modeling with an uncertainty-aware extension of Duval’s methods. By incorporating probabilistic fault classification and time-series forecasting of dissolved gas concentrations, the proposed approach provides a more adaptive, data-driven, and computationally efficient alternative to deterministic and machine learning-based methods for DGA-based transformer analysis.
It should be noted that the proposed forecasting component estimates the probabilistic evolution of measured dissolved gas concentrations, rather than attempting to predict specific fault types deterministically. While fault mechanisms in transformers may change or appear intermittently, the gas dynamics themselves exhibit consistent temporal correlations that can be statistically modeled and used to infer fault-related risk under uncertainty.
To validate the proposed method, two case studies were conducted, demonstrating its ability to forecast the individual behavior of dissolved gases, predict their concentration trends, estimate fault evolution, and provide dynamic risk assessments.
The first case study, based on an experimental oil tank setup, highlighted the model’s capability to forecast gas concentration trends and estimate fault evolution, even under gas loss conditions that often distort traditional DGA-based classifications. The probabilistic adaptation of Duval’s Triangle and Pentagon enabled a refined failure mode interpretation, distinguishing low-energy (D1) and high-energy (D2) discharges while mitigating gas diffusion effects.
The second case study, involving a 345 kV power transformer with a thermal fault, further validated the model’s effectiveness. The proposed framework successfully tracked the progressive thermal degradation (T1 → T2 → T3), forecasting increasing failure probabilities before the transformer was taken offline. Unlike conventional deterministic methods that rely on fixed thresholds, this probabilistic approach enables continuous risk assessment and supports earlier, more proactive maintenance decisions.
A key strength of the proposed framework lies in its ability to quantify uncertainty in both fault classification and prognosis while preserving structured diagnostic inference through established DGA methodologies, such as Duval’s polygons. This addresses a major limitation of traditional DGA techniques, which rely on fixed thresholds and lack adaptability. Besides, the ARIMA-based modeling approach eliminates the need for labeled datasets, offering a scalable and practical solution for real-world transformer monitoring based on continuous DGA sensor data. Its probabilistic design enhances adaptability, enabling dynamic updates of forecasts and failure probabilities in response to evolving gas behavior.
Another important feature of the framework is its ability to accommodate the specific nature of different fault types. Because each gas is modeled individually, the method naturally captures both fast-changing dynamics, such as intermittent partial discharges, and slower progressions, such as thermal degradation. The subsequent probabilistic reformulation of Duval’s polygons ensures that these distinct gas signatures are consistently reflected in the diagnostic stage, further strengthening the robustness and applicability of the approach.
Future research could focus on hybrid models integrating ARIMA with nonlinear techniques, enhancing adaptability while preserving interpretability, thereby further improving transformer fault prognosis. Expanding the model to account for varying operational conditions, such as seasonal fluctuations and sudden load changes, would further strengthen its robustness under real-world conditions. Finally, implementing the framework in real-time continuous monitoring systems, particularly those enabled by online DGA sensors and IoT-based infrastructures, would provide a definitive validation of its industrial applicability, bridging the gap between theoretical advancements and practical reliability in power system maintenance.
Another promising direction is to explicitly account for the influence of operating temperature and load on gas solubility, either by incorporating these variables as exogenous inputs in the forecasting models or by adjusting concentrations according to known solubility curves, thereby reducing uncertainty in the presence of daily fluctuations. Future work may also address the recently introduced fault subtypes, including the new sub-zones in Duval’s Triangles and Pentagons that distinguish arcing in oil from arcing in paper, as well as additional categories related to paper insulation carbonization. Although not yet consolidated in current standards, their eventual integration into the proposed framework could further improve diagnostic resolution and align the methodology with emerging advances in DGA interpretation.
The findings presented demonstrate that the proposed probabilistic approach represents a significant advancement in DGA-based transformer fault analysis, offering improved predictability, robustness, and reliability for risk-based decision-making. By incorporating uncertainty quantification and dynamic forecasting, the framework contributes to a scalable and interpretable PHM solution tailored to critical power assets. Its alignment with industry standards and adaptability to real-world conditions ensure practical relevance for enhancing transformer reliability within modern power systems, sensor-enabled monitoring environments, and smart grids.

Author Contributions

Conceptualization, F.N.K., M.A.d.C.M. and G.F.M.d.S.; methodology, F.N.K. and M.A.d.C.M.; software, F.N.K. and M.A.d.C.M.; validation, F.N.K., M.A.d.C.M., G.F.M.d.S., H.J.B.d.S. and H.M.C.; formal analysis, F.N.K. and M.A.d.C.M.; investigation, F.N.K. and M.A.d.C.M.; resources, G.F.M.d.S., H.J.B.d.S. and H.M.C.; data curation, F.N.K. and M.A.d.C.M.; writing—original draft preparation, F.N.K. and M.A.d.C.M.; writing—review and editing, F.N.K., M.A.d.C.M., G.F.M.d.S., H.J.B.d.S. and H.M.C.; visualization, F.N.K. and M.A.d.C.M.; supervision, G.F.M.d.S.; project administration, G.F.M.d.S.; funding acquisition, G.F.M.d.S. All authors have read and agreed to the published version of the manuscript.

Funding

The authors thank the financial support of Fundação para o Desenvolvimento Tecnológico da Engenharia (FDTE), Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq), and EDP Brasil for the development of the present research as part of an ANEEL R&D Project. Gilberto Souza also wishes to acknowledge the support of CNPq by grant 303986/2022-0.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Some or all the raw data supporting the findings of this study are available from the corresponding author upon reasonable request.

Acknowledgments

The authors would like to thank the Fundação para o Desenvolvimento Tecnológico da Engenharia (FDTE) and EDP Brasil for their support of this research project. During the preparation of this manuscript, the authors used ChatGPT, GPT-5 (OpenAI, 2025) for the purposes of improving language clarity and readability. The authors have reviewed and edited the output and take full responsibility for the content of this publication.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

This appendix provides the complete formulation of each failure mode probability expression considering Duval’s Triangle and Pentagon methods.

Appendix A.1. Duval’s Triangle 1 (T1)

P D 1 , t = P R C 2 H 4 , t < 23 ,     R C 2 H 2 , t > 13
P D 2 , t = P 23 < R C 2 H 4 , t < 40 ,     13 < R C 2 H 2 , t < 29 + P R C 2 H 4 , t > 23 ,     R C 2 H 2 , t > 29
P DT , t = P R C 2 H 4 , t < 50 ,     4 < R C 2 H 2 , t < 13 + P 40 < R C 2 H 4 , t < 50 ,     13 < R C 2 H 2 , t < 29 + P R C 2 H 4 , t > 50 ,     15 < R C 2 H 2 , t < 29
P T 1 , t = P R CH 4 , t < 98 ,     R C 2 H 4 , t < 20 ,   R C 2 H 2 , t < 4
P T 2 , t = P 20 <   R C 2 H 4 , t < 50 ,   R C 2 H 2 , t < 4
P T 3 , t = P R C 2 H 4 , t > 50 ,   R C 2 H 2 , t < 15
P PD , t = P R CH 4 , t > 98

Appendix A.2. Duval’s Triangle 2 (T2)

P O , t = P R H 2 , t < 9 ,     R C 2 H 6 , t > 30
P C , t = P R CH 4 , t > 36 ,     R C 2 H 6 , t > 24 + P R H 2 , t < 15 ,   24 <   R C 2 H 6 , t < 29
P S , t = P R H 2 , t > 9 ,     30 < R C 2 H 6 , t < 46 + P R H 2 , t > 15 ,   24 < R C 2 H 6 , t < 30 + P R CH 4 , t < 36 ,   1 < R C 2 H 6 , t < 24 + P 15 < R CH 4 , t < 36 ,   R C 2 H 6 , t < 1 + P R CH 4 , t < 2 ,   R C 2 H 6 , t < 1
P PD , t = P 2 < R CH 4 , t < 15 ,     R C 2 H 6 , t < 1
P ND , t = P   R H 2 , t > 9 ,   R C 2 H 6 , t > 46

Appendix A.3. Duval’s Triangle 3 (T3)

P O , t = P 1 < R C 2 H 4 , t < 10 ,   2 < R C 2 H 6 , t < 14 + P R C 2 H 4 , t < 1 ,   R C 2 H 6 , t < 2 + P R C 2 H 4 , t < 10 ,   R C 2 H 6 , t > 54
P T 2 , t = P 10 < R C 2 H 4 , t < 35 ,     R C 2 H 6 , t < 12
P T 3 , t = P R C 2 H 4 , t > 35 ,   R C 2 H 6 , t < 12 + P R CH 4 , t > 50 ,   12 < R C 2 H 6 , t < 14 + P R C 2 H 4 , t > 70 ,   R C 2 H 6 , t < 14 + P R C 2 H 4 , t > 35 ,   R C 2 H 6 , t > 30
P PD , t = P R C 2 H 4 , t < 1 ,   2 < R C 2 H 6 , t < 14
P ND , t = P   10 < R C 2 H 4 , t < 35 ,   R C 2 H 6 , t > 30
P S , t = P   R C 2 H 4 , t < 10 ,   14 < R C 2 H 6 , t < 54
P C , t = P   10 < R C 2 H 4 , t < 50 ,   12 < R C 2 H 6 , t < 14 + P   10 < R C 2 H 4 , t < 70 ,   14 < R C 2 H 6 , t < 30

Appendix A.4. Duval’s Pentagon 1 (P1)

P PD , t = P C x , t , C y , t R PD
where
R PD = 0 ,   33 , 1 ,   33 , 1 ,   24.5 , 0 ,   24.5
P D 1 , t = P C x , t , C y , t R D 1
where
R D 1 = 0 ,   40 , 38 ,   12 , 32 ,   6.1 , 4 ,   16 , 0 ,   1.5
P D 2 , t = P C x , t , C y , t R D 2
where
R D 2 = 4 ,   16 , 32 ,   6.1 , 24.3 ,   30 , 0 ,   3 , 0 ,   1.5
P T 1 , t = P C x , t , C y , t R T 1
where
R T 1 = 6 ,   4 , 22.5 ,   32.4 , 23.5 ,   32.4 , 35 ,   3 , 0 ,   1.5 , 0 ,   3
P T 2 , t = P C x , t , C y , t R T 2
where
R T 2 = 6 ,   4 , 1 ,   32.4 , 22.5 ,   32.4
P T 3 , t = P C x , t , C y , t R T 3
where
R T 3 = 0 ,   3 , 24.3 ,   30 , 23.5 ,   32.4 , 1 ,   32 , 6 ,   4
P S , t = P C x , t , C y , t R S
where
R S = 0 ,   1.5 , 35 ,   3.1 , 38 ,   12.4 , 0 ,   40 , 0 ,   33 , 1 ,   33 , 1 ,   24.5 , 0 ,   24.5

Appendix A.5. Duval’s Pentagon 2 (P2)

P PD , t = P C x , t , C y , t R PD
where
R PD = 0 ,   33 , 1 ,   33 , 1 ,   24.5 , 0 ,   24.5
P D 1 , t = P C x , t , C y , t R D 1
where
R D 1 = 0 ,   40 , 38 ,   12 , 32 ,   6.1 , 4 ,   16 , 0 ,   1.5
P D 2 , t = P C x , t , C y , t R D 2
where
R D 2 = 4 ,   16 , 32 ,   6.1 , 24.3 ,   30 , 0 ,   3 , 0 ,   1.5
P S , t = P C x , t , C y , t R S
where
R S = 0 ,   1.5 , 35 ,   3.1 , 38 ,   12.4 , 0 ,   40 , 0 ,   33 , 1 ,   33 , 1 ,   24.5 , 0 ,   24.5
P T 3 - H , t = P C x , t , C y , t R T 3 - H
where
R T 3 - H = 0 ,   3 , 24.3 ,   30 , 23.5 ,   32.4 , 2.5 ,   32.4 , 3.5 ,   3
P C , t = P C x , t , C y , t R C
where
R C = 3.5 ,   3 , 2.5 ,   32.4 , 21.5 ,   32.4 , 11 ,   8
P O , t = P C x , t , C y , t R S
where
R O = 3.5 ,   3 , 11 ,   8 , 21.5 ,   32.4 , 23.5 ,   32.4 , 35 ,   3.1 , 0 ,   1.5 , 0 ,   3

References

  1. Rexhepi, V. An Analysis of Power Transformer Outages and Reliability Monitoring. Energy Procedia 2017, 141, 418–422. [Google Scholar] [CrossRef]
  2. Atwa, O.S.E. Practical Power System and Protective Relays Commissioning; Elsevier: Amsterdam, The Netherlands, 2019; ISBN 9780128168585. [Google Scholar]
  3. Ranga, C.; Kumar, A.; Chandel, R. Influence of Electrical and Thermal Ageing on the Mineral Insulating Oil Performance for Power Transformer Applications. Insight—Non-Destr. Test. Cond. Monit. 2020, 62, 222–231. [Google Scholar] [CrossRef]
  4. Doukas, H.; Xidonas, P.; Angelopoulos, D.; Askounis, D.; Psarras, J. Distribution Transformers Failures: How Does It Cost? Evidence from Greece. Energy Syst. 2016, 7, 601–613. [Google Scholar] [CrossRef]
  5. El-Bassiouny, A.; El-Shimy, M.; Hammouda, R. Impact of Power Transformer Failures on Customer Interruptions Costs Using Customer Damage Functions. In Proceedings of the 2017 Nineteenth International Middle East Power Systems Conference (MEPCON), Cairo, Egypt, 19–21 December 2017; pp. 565–570. [Google Scholar]
  6. Zhao, X.; Gui, F.; Chen, H.; Fan, L.; Pan, P. Life Cycle Cost Estimation and Analysis of Transformers Based on Failure Rate. Appl. Sci. 2024, 14, 1210. [Google Scholar] [CrossRef]
  7. Abdugaffar Ugli, A.A.; Zokirovich, T.Z.; Kamolovich, J.T.; Khamidjon Ugli, K.M.; Zukhriddin Ugli, N.Z.; Ziyodulla Ugli, S.M. A Review on Power Transformer Failures: Analysis of Failure Types and Causative Factors. Indones. J. Electr. Eng. Comput. Sci. 2025, 38, 713–722. [Google Scholar] [CrossRef]
  8. Verma, N.; Chandel, A.K. Incipient Fault Diagnosis of Power Transformer Based on Duval Pentagon Using Medium Neural Network. In Proceedings of the 2022 1st International Conference on Sustainable Technology for Power and Energy Systems (STPES), Srinagar, India, 4 July 2022; pp. 1–5. [Google Scholar]
  9. Duval, M. Dissolved Gas Analysis: It Can Save Your Transformer. IEEE Electr. Insul. Mag. 1989, 5, 22–27. [Google Scholar] [CrossRef]
  10. Bessa, A.R.; Fardin, J.F.; Ciarelli, P.M.; Encarnação, L.F. Conventional Dissolved Gases Analysis in Power Transformers: Review. Energies 2023, 16, 7219. [Google Scholar] [CrossRef]
  11. IEC 60599; Mineral Oil-Filled Electrical Equipment in Service—Guidance on the Interpretation of Dissolved and Free Gases Analysis. IEC: Geneva, Switzerland, 2022.
  12. IEEE SA IEEE Std C57.104; IEEE Guide for the Interpretation of Gases Generated in Mineral Oil-Immersed Transformers. IEEE: New York, NY, USA, 2019.
  13. Duval, M. A Review of Faults Detectable by Gas-in-Oil Analysis in Transformers. IEEE Electr. Insul. Mag. 2002, 18, 8–17. [Google Scholar] [CrossRef]
  14. Duval, M. The Duval Triangle for Load Tap Changers, Non-Mineral Oils and Low Temperature Faults in Transformers. IEEE Electr. Insul. Mag. 2008, 24, 22–29. [Google Scholar] [CrossRef]
  15. Duval, M. Calculation of DGA Limit Values and Sampling Intervals in Transformers in Service. IEEE Electr. Insul. Mag. 2008, 24, 7–13. [Google Scholar] [CrossRef]
  16. Duval, M.; Lamarre, L. The Duval Pentagon-a New Complementary Tool for the Interpretation of Dissolved Gas Analysis in Transformers. IEEE Electr. Insul. Mag. 2014, 30, 9–12. [Google Scholar] [CrossRef]
  17. Duval, M.; Buchacz, J. Identification of Arcing Faults in Paper and Oil in Transformers—Part I: Using the Duval Pentagons. IEEE Electr. Insul. Mag. 2022, 38, 19–23. [Google Scholar] [CrossRef]
  18. Mansour, D.-E.A. Development of a New Graphical Technique for Dissolved Gas Analysis in Power Transformers Based on the Five Combustible Gases. IEEE Trans. Dielectr. Electr. Insul. 2015, 22, 2507–2512. [Google Scholar] [CrossRef]
  19. Suwarno; Sutikno, H.; Prasojo, R.A.; Abu-Siada, A. Machine Learning Based Multi-Method Interpretation to Enhance Dissolved Gas Analysis for Power Transformer Fault Diagnosis. Heliyon 2024, 10, e25975. [Google Scholar] [CrossRef]
  20. Wani, S.A.; Rana, A.S.; Sohail, S.; Rahman, O.; Parveen, S.; Khan, S.A. Advances in DGA Based Condition Monitoring of Transformers: A Review. Renew. Sustain. Energy Rev. 2021, 149, 111347. [Google Scholar] [CrossRef]
  21. Aizpurua, J.I.; Stewart, B.G.; McArthur, S.D.J.; Penalba, M.; Barrenetxea, M.; Muxika, E.; Ringwood, J.V. Probabilistic Forecasting Informed Failure Prognostics Framework for Improved RUL Prediction under Uncertainty: A Transformer Case Study. Reliab. Eng. Syst. Saf. 2022, 226, 108676. [Google Scholar] [CrossRef]
  22. Ali, M.S.; Abu Bakar, A.H.; Omar, A.; Abdul Jaafar, A.S.; Mohamed, S.H. Conventional Methods of Dissolved Gas Analysis Using Oil-Immersed Power Transformer for Fault Diagnosis: A Review. Electr. Power Syst. Res. 2023, 216, 109064. [Google Scholar] [CrossRef]
  23. Miranda, V.; Castro, A.R.G.; Lima, S. Diagnosing Faults in Power Transformers with Autoassociative Neural Networks and Mean Shift. IEEE Trans. Power Deliv. 2012, 27, 1350–1357. [Google Scholar] [CrossRef]
  24. Pereira, F.H.; Bezerra, F.E.; Junior, S.; Santos, J.; Chabu, I.; de Souza, G.F.M.; Micerino, F.; Nabeta, S.I. Nonlinear Autoregressive Neural Network Models for Prediction of Transformer Oil-Dissolved Gas Concentrations. Energies 2018, 11, 1691. [Google Scholar] [CrossRef]
  25. Ghoneim, S.S.M.; Taha, I.B.M.; Elkalashy, N.I. Integrated ANN-Based Proactive Fault Diagnostic Scheme for Power Transformers Using Dissolved Gas Analysis. IEEE Trans. Dielectr. Electr. Insul. 2016, 23, 1838–1845. [Google Scholar] [CrossRef]
  26. Huang, Y.-C.; Sun, H.-C. Dissolved Gas Analysis of Mineral Oil for Power Transformer Fault Diagnosis Using Fuzzy Logic. IEEE Trans. Dielectr. Electr. Insul. 2013, 20, 974–981. [Google Scholar] [CrossRef]
  27. Taha, I.B.M.; Ghoneim, S.S.M.; Zaini, H.G. A Fuzzy Diagnostic System for Incipient Transformer Faults Based on DGA of the Insulating Transformer Oils. Int. Rev. Electr. Eng. (IREE) 2016, 11, 305. [Google Scholar] [CrossRef]
  28. Zeng, B.; Guo, J.; Zhu, W.; Xiao, Z.; Yuan, F.; Huang, S. A Transformer Fault Diagnosis Model Based On Hybrid Grey Wolf Optimizer and LS-SVM. Energies 2019, 12, 4170. [Google Scholar] [CrossRef]
  29. Aciu, A.-M.; Nițu, M.-C.; Nicola, C.-I.; Nicola, M. Determining the Remaining Functional Life of Power Transformers Using Multiple Methods of Diagnosing the Operating Condition Based on SVM Classification Algorithms. Machines 2024, 12, 37. [Google Scholar] [CrossRef]
  30. Yuan, F.; Guo, J.; Xiao, Z.; Zeng, B.; Zhu, W.; Huang, S. A Transformer Fault Diagnosis Model Based on Chemical Reaction Optimization and Twin Support Vector Machine. Energies 2019, 12, 960. [Google Scholar] [CrossRef]
  31. Dai, J.; Song, H.; Sheng, G.; Jiang, X. Dissolved Gas Analysis of Insulating Oil for Power Transformer Fault Diagnosis with Deep Belief Network. IEEE Trans. Dielectr. Electr. Insul. 2017, 24, 2828–2835. [Google Scholar] [CrossRef]
  32. Shang, H.; Xu, J.; Zheng, Z.; Qi, B.; Zhang, L. A Novel Fault Diagnosis Method for Power Transformer Based on Dissolved Gas Analysis Using Hypersphere Multiclass Support Vector Machine and Improved D–S Evidence Theory. Energies 2019, 12, 4017. [Google Scholar] [CrossRef]
  33. Kari, T.; Gao, W.; Zhao, D.; Zhang, Z.; Mo, W.; Wang, Y.; Luan, L. An Integrated Method of ANFIS and Dempster-Shafer Theory for Fault Diagnosis of Power Transformer. IEEE Trans. Dielectr. Electr. Insul. 2018, 25, 360–371. [Google Scholar] [CrossRef]
  34. Hyndman, R.J.; Athanasopoulos, G. Forecasting: Principles and Practice, 2nd ed.; OTexts: Melbourne, Australia, 2018. [Google Scholar]
  35. Hyndman, R.J.; Khandakar, Y. Automatic Time Series Forecasting: The Forecast Package for R. J. Stat. Softw. 2008, 27, 1–22. [Google Scholar] [CrossRef]
  36. Liu, J.; Zhao, Z.; Zhong, Y.; Zhao, C.; Zhang, G. Prediction of the Dissolved Gas Concentration in Power Transformer Oil Based on SARIMA Model. Energy Rep. 2022, 8, 1360–1367. [Google Scholar] [CrossRef]
  37. Wang, T.-L.; Yang, J.-G.; Li, J.-S.; Li, Y.; Wu, P.; Lin, J.-S.; Liang, J.-B. Time Series Prediction of Dissolved Gas Concentrations in Transformer Oil Using ARIMA-NKDE Method. In Proceedings of the 2022 2nd International Conference on Electrical Engineering and Control Science (IC2ECS), Nanjing, China, 16–18 December 2022; pp. 129–133. [Google Scholar]
  38. Zhang, P.; Hu, K.; Yang, Y.; Yi, G.; Zhang, X.; Peng, R.; Liu, J. Research on Prediction of Dissolved Gas Concentration in a Transformer Based on Dempster–Shafer Evidence Theory-Optimized Ensemble Learning. Electronics 2025, 14, 1266. [Google Scholar] [CrossRef]
  39. Lakehal, A.; Tachi, F. Bayesian Duval Triangle Method for Fault Prediction and Assessment of Oil Immersed Transformers. Meas. Control 2017, 50, 103–109. [Google Scholar] [CrossRef]
  40. Paul, D.; Goswami, A.K.; Chetri, R.L.; Roy, R.; Sen, P. Bayesian Optimization-Based Gradient Boosting Method of Fault Detection in Oil-Immersed Transformer and Reactors. IEEE Trans. Ind. Appl. 2022, 58, 1910–1919. [Google Scholar] [CrossRef]
  41. Riedmann, C.; Schichler, U.; Häusler, W.; Neuhold, W. Gas Losses in Transformers—Influences and Consideration. IEEE Access 2023, 11, 58654–58663. [Google Scholar] [CrossRef]
  42. Duval, M.; dePabla, A. Interpretation of Gas-in-Oil Analysis Using New IEC Publication 60599 and IEC TC 10 Databases. IEEE Electr. Insul. Mag. 2001, 17, 31–41. [Google Scholar] [CrossRef]
  43. Rogers, R. IEEE and IEC Codes to Interpret Incipient Faults in Transformers, Using Gas in Oil Analysis. IEEE Trans. Electr. Insul. 1978, EI-13, 349–354. [Google Scholar] [CrossRef]
  44. Duval, M.; Buchacz, J. Detection of Carbonization of Paper in Transformers Using Duval Pentagon 2 and Triangle 5. IEEE Trans. Dielectr. Electr. Insul. 2023, 30, 1534–1539. [Google Scholar] [CrossRef]
  45. Duval, M.; Buchacz, J. Gas Formation from Arcing Faults in Transformers—Part II. IEEE Electr. Insul. Mag. 2022, 38, 12–15. [Google Scholar] [CrossRef]
  46. Duval, M.; Arvidsson, L.; Boman, P.; Carrander, K.; Foschum, H.; Grisaru, M.; Hall, A.C.; Haug, A.M.; Kelly, J.; Knab, H.; et al. Recent Developments in DGA Interpretation—Joint Task Force.01/A2.11; CIGRE: Paris, France, 2006. [Google Scholar]
  47. Nanfak, A.; Samuel, E.; Fofana, I.; Meghnefi, F.; Ngaleu, M.G.; Hubert Kom, C. Traditional Fault Diagnosis Methods for Mineral Oil-immersed Power Transformer Based on Dissolved Gas Analysis: Past, Present and Future. IET Nanodielectrics 2024, 7, 97–130. [Google Scholar] [CrossRef]
  48. Liu, C.; Lin, T.; Yao, L.; Wang, S. Integrated Power Transformer Diagnosis Using Hybrid Fuzzy Dissolved Gas Analysis. IEEJ Trans. Electr. Electron. Eng. 2015, 10, 689–698. [Google Scholar] [CrossRef]
  49. Faiz, J.; Soleimani, M. Dissolved Gas Analysis Evaluation in Electric Power Transformers Using Conventional Methods a Review. IEEE Trans. Dielectr. Electr. Insul. 2017, 24, 1239–1248. [Google Scholar] [CrossRef]
  50. Gupta, A.; Jain, K.; Sood, Y.R.; Sharma, N.K. Comparative Study of Duval Triangle with the New DGA Interpretation Scheme. In Advances in System Optimization and Control—Select Proceedings of ICAEDC 2017; Singh, S.N., Wen, F., Jain, M., Eds.; Springer: Berlin/Heidelberg, Germany, 2019; pp. 261–268. [Google Scholar]
  51. Islam, M.M.; Lee, G.; Hettiwatte, S.N. Application of Parzen Window Estimation for Incipient Fault Diagnosis in Power Transformers. High Volt. 2018, 3, 303–309. [Google Scholar] [CrossRef]
  52. Mharakurwa, E.T.; Nyakoe, G.N.; Akumu, A.O. Power Transformer Fault Severity Estimation Based on Dissolved Gas Analysis and Energy of Fault Formation Technique. J. Electr. Comput. Eng. 2019, 2019, 9674054. [Google Scholar] [CrossRef]
  53. Malik, H.; Khatri, A.; Dohare, R. Probabilistic Neural Network Based Incipient Fault Identification Using DGA Dataset. Procedia Comput. Sci. 2015, 58, 665–672. [Google Scholar] [CrossRef]
  54. Taha, I.B.M.; Ibrahim, S.; Mansour, D.-E.A. Power Transformer Fault Diagnosis Based on DGA Using a Convolutional Neural Network With Noise in Measurements. IEEE Access 2021, 9, 111162–111170. [Google Scholar] [CrossRef]
  55. Taha, I.B.M.; Mansour, D.A.; Ghoneim, S.S.M.; Elkalashy, N.I. Conditional Probability-based Interpretation of Dissolved Gas Analysis for Transformer Incipient Faults. IET Gener. Transm. Distrib. 2017, 11, 943–951. [Google Scholar] [CrossRef]
  56. Zhang, W.; Zeng, Y.; Li, Y.; Zhang, Z. Prediction of Dissolved Gas Concentration in Transformer Oil Considering Data Loss Scenarios in Power System. Energy Rep. 2023, 9, 186–193. [Google Scholar] [CrossRef]
  57. Huang, X.; Zhuang, X.; Tian, F.; Niu, Z.; Chen, Y.; Zhou, Q.; Yuan, C. A Hybrid ARIMA-LSTM-XGBoost Model with Linear Regression Stacking for Transformer Oil Temperature Prediction. Energies 2025, 18, 1432. [Google Scholar] [CrossRef]
  58. de Carvalho Michalski, M.A.; de Andrade Melani, A.H.; da Silva, R.F.; de Souza, G.F.M. Remaining Useful Life Estimation Based on an Adaptive Approach of Autoregressive Integrated Moving Average (ARIMA). In Extended Abstracts, Proceedings of the 32nd European Safety and Reliability Conference, Dublin, Ireland, 28 August–1 September 2022; Research Publishing Services: Singapore; pp. 1227–1234.
  59. Bisht, D.C.S.; Ram, M. Recent Advances in Time Series Forecasting; Parsaei, H.R., Karwowski, W., Eds.; CRC Press: Boca Raton, FL, USA, 2021; ISBN 9781003102281. [Google Scholar]
  60. Brockwell, P.J.; Davis, R.A. Introduction to Time Series and Forecasting; Springer Texts in Statistics; Springer International Publishing: Cham, Switzerland, 2016; ISBN 978-3-319-29852-8. [Google Scholar]
  61. Guerrero, V.M. Time-Series Analysis Supported by Power Transformations. J. Forecast. 1993, 12, 37–48. [Google Scholar] [CrossRef]
  62. Box, G.E.P.; Cox, D.R. An Analysis of Transformations. J. R. Stat. Soc. Series. B Stat. Methodol. 1964, 26, 211–243. [Google Scholar] [CrossRef]
  63. Akaike, H. A New Look at the Statistical Model Identification. IEEE Trans. Autom. Control 1974, 19, 716–723. [Google Scholar] [CrossRef]
  64. Lam, J.-P.; Veall, M.R. Bootstrap Prediction Intervals for Single Period Regression Forecasts. Int. J. Forecast. 2002, 18, 125–130. [Google Scholar] [CrossRef]
  65. Stine, R.A. Bootstrap Prediction Intervals for Regression. J. Am. Stat. Assoc. 1985, 80, 1026. [Google Scholar] [CrossRef]
  66. Wu, B.; Gao, Y.; Feng, S.; Chanwimalueang, T. Sparse Optimistic Based on Lasso-LSQR and Minimum Entropy De-Convolution with FARIMA for the Remaining Useful Life Prediction of Machinery. Entropy 2018, 20, 747. [Google Scholar] [CrossRef] [PubMed]
  67. Zhou, Y.; Huang, M. Lithium-Ion Batteries Remaining Useful Life Prediction Based on a Mixture of Empirical Mode Decomposition and ARIMA Model. Microelectron. Reliab. 2016, 65, 265–273. [Google Scholar] [CrossRef]
  68. Liang, Y.; Zhang, Z.; Li, K.J.; Li, Y.C. New Correlation Features for Dissolved Gas Analysis Based Transformer Fault Diagnosis Based on the Maximal Information Coefficient. High Volt. 2022, 7, 302–313. [Google Scholar] [CrossRef]
  69. Scott, S.L.; Varian, H.R. Predicting the Present with Bayesian Structural Time Series. Int. J. Math. Model. Numer. Optim. 2014, 5, 4–23. [Google Scholar] [CrossRef]
  70. Rasmussen, C.E.; Williams, C.K.I. Gaussian Processes for Machine Learning; Dietterich, T., Ed.; MIT Press: Cambridge, MA, USA, 2006; ISBN 026218253X. [Google Scholar]
  71. Casolaro, A.; Capone, V.; Iannuzzo, G.; Camastra, F. Deep Learning for Time Series Forecasting: Advances and Open Problems. Information 2023, 14, 598. [Google Scholar] [CrossRef]
  72. Box, G.E.P.; Jenkins, G.M.; Reinsel, G.C.; Ljung, G.M. Time Series Analysis: Forecasting and Control, 5th ed.; Wiley: Hoboken, NJ, USA, 2016; ISBN 1118675029. [Google Scholar]
  73. Hocking, R.R. The Analysis and Selection of Variables in Linear Regression. Biometrics 1976, 32, 1–49. [Google Scholar] [CrossRef]
  74. Ralston, A.; Wilf, H.S. Mathematical Methods for Digital Computers; John Wiley & Sons: Hoboken, NJ, USA, 1960. [Google Scholar]
  75. Shapiro, S.S.; Wilk, M.B. An Analysis of Variance Test for Normality (Complete Samples). Biometrika 1965, 52, 591. [Google Scholar] [CrossRef]
  76. Ljung, G.M.; Box, G.E.P. On a Measure of Lack of Fit in Time Series Models. Biometrika 1978, 65, 297–303. [Google Scholar] [CrossRef]
  77. Box, G.E.P.; Pierce, D.A. Distribution of Residual Autocorrelations in Autoregressive-Integrated Moving Average Time Series Models. J. Am. Stat. Assoc. 1970, 65, 1509. [Google Scholar] [CrossRef]
  78. Engle, R.F. Autoregressive Conditional Heteroscedasticity with Estimates of the Variance of United Kingdom Inflation. Econometrica 1982, 50, 987–1007. [Google Scholar] [CrossRef]
  79. Riedmann, C.; Schichler, U.; Häusler, W.; Neuhold, W. Online Dissolved Gas Analysis Used for Transformers—Possibilities, Experiences, and Limitations. e Elektrotech. Inf. 2022, 139, 88–97. [Google Scholar] [CrossRef]
Figure 1. ARIMA (p, d, q) model selection algorithm.
Figure 1. ARIMA (p, d, q) model selection algorithm.
Sensors 25 06520 g001
Figure 2. Illustrative Diagram of the Proposed Framework.
Figure 2. Illustrative Diagram of the Proposed Framework.
Sensors 25 06520 g002
Figure 3. Case Study 1: Gas Concentrations Over Time.
Figure 3. Case Study 1: Gas Concentrations Over Time.
Sensors 25 06520 g003
Figure 4. Case Study 1: ARIMA model data and projections on the original data.
Figure 4. Case Study 1: ARIMA model data and projections on the original data.
Sensors 25 06520 g004
Figure 5. Case Study 1: ARIMA Forecasts and Equipment Failure Probability: (a) ARIMA forecasts with pre-failure thresholds for each gas (dashed lines); (b) Equipment failure probability based on each forecasted gas concentration.
Figure 5. Case Study 1: ARIMA Forecasts and Equipment Failure Probability: (a) ARIMA forecasts with pre-failure thresholds for each gas (dashed lines); (b) Equipment failure probability based on each forecasted gas concentration.
Sensors 25 06520 g005
Figure 6. Case Study 1: Duval Triangle Analysis—Gas Projections and Failure Mode Probabilities: (a,c,e) Forecasted relative gas concentrations for Duval Triangles 1, 4, and 5 over time; (b,d,f) Probability of failure mode evolution for Duval Triangles 1, 4, and 5. The colored lines in (b,d,f) correspond to the respective failure modes indicated by the abbreviations in each panel title.
Figure 6. Case Study 1: Duval Triangle Analysis—Gas Projections and Failure Mode Probabilities: (a,c,e) Forecasted relative gas concentrations for Duval Triangles 1, 4, and 5 over time; (b,d,f) Probability of failure mode evolution for Duval Triangles 1, 4, and 5. The colored lines in (b,d,f) correspond to the respective failure modes indicated by the abbreviations in each panel title.
Sensors 25 06520 g006
Figure 7. Case Study 1: Duval Pentagon Analysis—Gas Projections, Centroid Evolution, and Failure Mode Probabilities: (a) Forecasted relative gas concentrations over time; (b,d) Probability of failure mode evolution for Pentagons 1 and 2; (c) Centroid position evolution with x-y coordinates and uncertainties. The colored lines in (b,d) correspond to the respective failure modes indicated by the abbreviations in each panel title.
Figure 7. Case Study 1: Duval Pentagon Analysis—Gas Projections, Centroid Evolution, and Failure Mode Probabilities: (a) Forecasted relative gas concentrations over time; (b,d) Probability of failure mode evolution for Pentagons 1 and 2; (c) Centroid position evolution with x-y coordinates and uncertainties. The colored lines in (b,d) correspond to the respective failure modes indicated by the abbreviations in each panel title.
Sensors 25 06520 g007
Figure 8. Case Study 2: Gas Concentrations Over Time.
Figure 8. Case Study 2: Gas Concentrations Over Time.
Sensors 25 06520 g008
Figure 9. Case Study 2: ARIMA Model Data and Forecasts: (a) Forecasts at τ = 103; (b) Forecasts at τ = 139.
Figure 9. Case Study 2: ARIMA Model Data and Forecasts: (a) Forecasts at τ = 103; (b) Forecasts at τ = 139.
Sensors 25 06520 g009
Figure 10. Case Study 2: ARIMA Forecasts and Transformer Failure Probability: (a,c) Forecasted gas concentrations and pre-failure thresholds at τ = 103 and τ = 139 (dashed lines); (b,d) Transformer failure probability based on each gas concentration threshold at τ = 103 and τ = 139.
Figure 10. Case Study 2: ARIMA Forecasts and Transformer Failure Probability: (a,c) Forecasted gas concentrations and pre-failure thresholds at τ = 103 and τ = 139 (dashed lines); (b,d) Transformer failure probability based on each gas concentration threshold at τ = 103 and τ = 139.
Sensors 25 06520 g010
Figure 11. Case Study 2, τ = 103: Duval Triangle Analysis—Gas Projections and Failure Mode Probabilities: (a,c,e) Forecasted relative gas concentrations for Duval Triangles 1, 4, and 5 over time; (b,d,f) Probability of failure mode evolution for Duval Triangles 1, 4, and 5. The colored lines in (b,d,f) correspond to the respective failure modes indicated by the abbreviations in each panel title.
Figure 11. Case Study 2, τ = 103: Duval Triangle Analysis—Gas Projections and Failure Mode Probabilities: (a,c,e) Forecasted relative gas concentrations for Duval Triangles 1, 4, and 5 over time; (b,d,f) Probability of failure mode evolution for Duval Triangles 1, 4, and 5. The colored lines in (b,d,f) correspond to the respective failure modes indicated by the abbreviations in each panel title.
Sensors 25 06520 g011
Figure 12. Case Study 2, τ = 139: Duval Triangle Analysis—Gas Projections and Failure Mode Probabilities: (a,c,e) Forecasted relative gas concentrations for Duval Triangles 1, 4, and 5 over time; (b,d,f) Probability of failure mode evolution for Duval Triangles 1, 4, and 5. The colored lines in (b,d,f) correspond to the respective failure modes indicated by the abbreviations in each panel title.
Figure 12. Case Study 2, τ = 139: Duval Triangle Analysis—Gas Projections and Failure Mode Probabilities: (a,c,e) Forecasted relative gas concentrations for Duval Triangles 1, 4, and 5 over time; (b,d,f) Probability of failure mode evolution for Duval Triangles 1, 4, and 5. The colored lines in (b,d,f) correspond to the respective failure modes indicated by the abbreviations in each panel title.
Sensors 25 06520 g012
Figure 13. Case Study 2, τ = 103: Duval Pentagon Analysis—Gas Projections, Centroid Evolution, and Failure Mode Probabilities: (a) Forecasted relative gas concentrations over time; (b,d) Probability of failure mode evolution for Pentagons 1 and 2; (c) Centroid position evolution with x-y coordinates and uncertainties. The colored lines in (b,d) correspond to the respective failure modes indicated by the abbreviations in each panel title.
Figure 13. Case Study 2, τ = 103: Duval Pentagon Analysis—Gas Projections, Centroid Evolution, and Failure Mode Probabilities: (a) Forecasted relative gas concentrations over time; (b,d) Probability of failure mode evolution for Pentagons 1 and 2; (c) Centroid position evolution with x-y coordinates and uncertainties. The colored lines in (b,d) correspond to the respective failure modes indicated by the abbreviations in each panel title.
Sensors 25 06520 g013
Figure 14. Case Study 2, τ = 139: Duval Pentagon Analysis—Gas Projections, Centroid Evolution, and Failure Mode Probabilities: (a) Forecasted relative gas concentrations over time; (b,d) Probability of failure mode evolution for Pentagons 1 and 2; (c) Centroid position evolution with x-y coordinates and uncertainties. The colored lines in (b,d) correspond to the respective failure modes indicated by the abbreviations in each panel title.
Figure 14. Case Study 2, τ = 139: Duval Pentagon Analysis—Gas Projections, Centroid Evolution, and Failure Mode Probabilities: (a) Forecasted relative gas concentrations over time; (b,d) Probability of failure mode evolution for Pentagons 1 and 2; (c) Centroid position evolution with x-y coordinates and uncertainties. The colored lines in (b,d) correspond to the respective failure modes indicated by the abbreviations in each panel title.
Sensors 25 06520 g014
Table 1. Transformer Failure Modes (basic faults and fault subtypes) Detectable via DGA.
Table 1. Transformer Failure Modes (basic faults and fault subtypes) Detectable via DGA.
CodeFault TypeDescription
RCatalytic ReactionsOccur due to moisture and interactions with galvanized steel in transformer oil sampling valves or rust on the tank steel.
PDPartial DischargesFormation of cold plasma discharges, potentially leading to wax-like deposits on the insulating paper.
SStray gassing from
Mineral Oil
Occurs at temperatures below 200 °C due to chemical instability caused by modern refining techniques or incompatibility between materials (metal passivators).
T1Thermal Failure < 300 °COccurs in mineral oil and/or paper due to temperatures below 300 °C, causing paper discoloration (browning).
OOverheating < 250 °CAffects paper or mineral oil at temperatures below 250 °C, without carbonization of the paper or loss of its electrical insulation properties.
CCarbonization of PaperPossible carbonization of paper insulation, indicating severe overheating or prolonged thermal stress, typically associated with T2–T3 thermal ranges (>300 °C).
T2Thermal Failure
300–700 °C
Occurs in paper insulation, leading to burning when exposed to temperatures between 300 °C and 700 °C.
T3Thermal Failure > 700 °CSevere thermal degradation, with strong evidence of mineral oil carbonization, metal discoloration (~800 °C), or metal melting (>1000 °C).
T3-HThermal Failure
(Oil only, >700 °C)
Specific to mineral oil, involving extensive degradation without significant damage to solid insulation.
D1Low-Energy
Discharges
Partial spark-type discharges in mineral oil and/or paper, causing charred perforations in insulation, surface carbonization traces, and free carbon particles in the oil. Often associated with divergent tap-changer operation.
D2High-Energy
Discharges
Occur in mineral oil and/or paper, evidenced by extensive destruction and carbonization of paper, metal melting at discharge ends, and severe carbonization of oil. In some cases, this leads to equipment shutdown, confirming the presence of large current flow.
DTCombined Electrical and Thermal FaultsRepresents intermediate fault conditions, where both electrical discharges and thermal failures coexist within the transformer.
NNot Detected/
Not Identified
Indicates that no fault has been classified based on the available dissolved gas concentrations. However, this does not confirm that the transformer is in a normal operating condition, as early-stage faults or transient issues may not yet be detectable.
Table 2. Transformer Failure Modes Detectable via DGA.
Table 2. Transformer Failure Modes Detectable via DGA.
MethodHandles Autocorrelation?Works with Few Data Points?InterpretabilityComputational CostSuitability for DGA
ARIMASensors 25 06520 i001 YesSensors 25 06520 i001 YesSensors 25 06520 i001 HighSensors 25 06520 i001 LowSensors 25 06520 i001 Best fit
Holt-WintersSensors 25 06520 i002 NoSensors 25 06520 i001 YesSensors 25 06520 i001 HighSensors 25 06520 i001 LowSensors 25 06520 i002 Lacks autocorrelation modeling
BSTSSensors 25 06520 i001 Yes⃟   Moderate⃟   Moderate⃟   Moderate⃟   Possible alternative
GPR⃟   PartialSensors 25 06520 i002 NoSensors 25 06520 i002 LowSensors 25 06520 i002 HighSensors 25 06520 i002 Not feasible
LSTM/GRUSensors 25 06520 i001 YesSensors 25 06520 i002 NoSensors 25 06520 i002 LowSensors 25 06520 i002 HighSensors 25 06520 i002 Not feasible
TransformersSensors 25 06520 i001 YesSensors 25 06520 i002 No Sensors 25 06520 i002 LowSensors 25 06520 i002 Very highSensors 25 06520 i002 Not feasible
Table 3. Angular Positions and Indexes of Gases in Duval’s Pentagon.
Table 3. Angular Positions and Indexes of Gases in Duval’s Pentagon.
GasSymbolθg (Degrees)Index i
HydrogenH290°1
EthaneC2H6162°2
MethaneCH4234°3
EthyleneC2H4306°4
AcetyleneC2H218°5
Table 4. Detection, Confirmation, and Failure Prognosis Thresholds.
Table 4. Detection, Confirmation, and Failure Prognosis Thresholds.
GasSymbolDetection
Threshold
Confirmation ThresholdFailure
Threshold
HydrogenH2100 ppm179 ppm/year725 ppm
EthaneC2H655 ppm175 ppm/year400 ppm
MethaneCH480 ppm176 ppm/year900 ppm
EthyleneC2H4170 ppm218 ppm/year800 ppm
AcetyleneC2H23 ppm7 ppm/year450 ppm
Table 5. Case Study 1: ARIMA Model Parameters and Residual Diagnostic Test Results.
Table 5. Case Study 1: ARIMA Model Parameters and Residual Diagnostic Test Results.
GasH2C2H6CH4C2H4C2H2
Model (p, d, q)(1,1,0) with drift(0,1,1) (1,0,0) with non-zero mean (0,1,1) with drift(2,2,1)
Box–Cox λ0.93080.73320.077540.74790.8410
Log-Likelihood–82.8–79.19–152.4–100.87–31.22
RMSE1.48731.39890.30776.99170.7250
ACF1−0.06660.10070.2443–0.0695–0.0123
NormalityNormalNormalNot NormalNormalNormal
Box-Ljung p-value 0.38320.4510.63310.81440.419
ARCH LM p-value0.67490.049120.95230.045260.8858
Table 6. Case Study 2: ARIMA Model Parameters and Residual Diagnostic Test Results (t = 103).
Table 6. Case Study 2: ARIMA Model Parameters and Residual Diagnostic Test Results (t = 103).
GasH2C2H6CH4C2H4C2H2
Model (p, d, q)(0,0,0) with non-zero mean (0,0,0) with non-zero mean (1,1,0)(0,0,0) with non-zero mean (0,0,1) with non-zero mean
Box–Cox λ–0.89991.9999–0.89991.99990.1886
Log-Likelihood285.79–342.64399.41–414.05–109.07
RMSE1.20291.750517.90643.76580.3304
ACF10.8160–0.14630.00760.83590.2600
NormalityNot NormalNormalNot NormalNot NormalNot Normal
Box-Ljung p-value 0.86810.4810.98490.6620.1898
ARCH LM p-value0.66050.63750.0092390.71160.5765
Table 7. Case Study 2: ARIMA Model Parameters and Residual Diagnostic Test Results (t = 139).
Table 7. Case Study 2: ARIMA Model Parameters and Residual Diagnostic Test Results (t = 139).
GasH2C2H6CH4C2H4C2H2
Model (p, d, q)(1,2,0)(2,2,0)(0,2,1)(2,2,2)(1,2,1)
Box–Cox λ0.73010.17760.3797–0.24051.6785
Log-Likelihood–20.48122.7333.31231.7639.02
RMSE1.06171.05295.59892.12500.1683
ACF10.00170.16540.2322–0.1441–0.3428
NormalityNot NormalNot NormalNot NormalNot NormalNot Normal
Box-Ljung p-value 0.75890.16370.39360.024890.05745
ARCH LM p-value0.037650.027190.49089.944 × 10−79.926 × 10−7
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Kashiwagi, F.N.; Michalski, M.A.d.C.; de Souza, G.F.M.; da Silva, H.J.B.; Côrtes, H.M. Probabilistic Prognostics and Health Management of Power Transformers Using Dissolved Gas Analysis Sensor Data and Duval’s Polygons. Sensors 2025, 25, 6520. https://doi.org/10.3390/s25216520

AMA Style

Kashiwagi FN, Michalski MAdC, de Souza GFM, da Silva HJB, Côrtes HM. Probabilistic Prognostics and Health Management of Power Transformers Using Dissolved Gas Analysis Sensor Data and Duval’s Polygons. Sensors. 2025; 25(21):6520. https://doi.org/10.3390/s25216520

Chicago/Turabian Style

Kashiwagi, Fabio Norikazu, Miguel Angelo de Carvalho Michalski, Gilberto Francisco Martha de Souza, Halley José Braga da Silva, and Hyghor Miranda Côrtes. 2025. "Probabilistic Prognostics and Health Management of Power Transformers Using Dissolved Gas Analysis Sensor Data and Duval’s Polygons" Sensors 25, no. 21: 6520. https://doi.org/10.3390/s25216520

APA Style

Kashiwagi, F. N., Michalski, M. A. d. C., de Souza, G. F. M., da Silva, H. J. B., & Côrtes, H. M. (2025). Probabilistic Prognostics and Health Management of Power Transformers Using Dissolved Gas Analysis Sensor Data and Duval’s Polygons. Sensors, 25(21), 6520. https://doi.org/10.3390/s25216520

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop