Research on Identification Method of Transformer Windings’ Loose Vibration Spectrum Considering a Multi-Load Current Condition

Jin Fang; Xudong Deng; Yuancan Xia; Chen Wu; Yuehua Li; Xin Li; Kaixin Chen; Fan Wang; Zhanlong Zhang

doi:10.3390/app15126949

¹ State Key Laboratory of Power Transmission Equipment Technology, School of Electrical Engineering, Chongqing University, Chongqing 400044, China

² Ultra-High Voltage Branch of State Grid Chongqing Electric Power Company, Chongqing 400050, China

* Author to whom correspondence should be addressed.

Appl. Sci. 2025, 15(12), 6949; https://doi.org/10.3390/app15126949

This article belongs to the Section Electrical, Electronics and Communications Engineering

Abstract

During transformer operation, long-term vibration causes the winding to loosen axially. When hit by a short-circuit, the winding deforms to different extents. Thus, identifying early looseness faults in transformer windings is vital for power systems’ stability. To address issues including scarce vibration data across multiple load conditions for transformer winding looseness faults, inadequate extraction of two-dimensional spectrogram features, and the inability to boost recognition accuracy caused by overfitting during fault recognition model training, this study constructed a 10 kV power transformer vibration test platform. It measured the vibration signals on the box surface under various winding looseness conditions and built a time–frequency-domain vibration spectrum library for different load currents. Then, a fault identification model based on vibration spectra and ConvNeXt was constructed, and model verification and analysis were carried out. The results indicate that after training, the fault recognition accuracy of the spectrum containing three load conditions is comparable to that of a single load condition. The average recognition accuracy at six box-surface measuring points reaches 97.9%. Moreover, the ConvNeXt model outperforms the traditional ResNet50 by 1.2%. This new model effectively addresses overfitting and offers strong technical support for detecting different transformer winding looseness faults.

Keywords:

transformer; winding looseness; vibration spectrum; convolutional neural network; fault identification

1. Introduction

During the long-term operation of power transformers, the leakage magnetic flux inside the windings interacts with the current and generates a leakage electromagnetic force [1,2], which, in turn, causes the winding and its related structural parts to vibrate. Long-term vibration reduces the bolt preload on the winding’s end plate, which leads to axial loosening of the coil. If early loosening of the winding is not found and handled in time, the winding must withstand a huge short-circuit electromagnetic force when the transformer encounters a short-circuit impact, resulting in complex mechanical stress. Because this effect is cumulative, the winding may undergo serious deformation under multiple short-circuit impacts, such as axial or radial deformation, circumferential buckling, and winding tilt tension [3,4,5]. Survey data show that more than 40% of transformer accidents are caused by winding faults [6], and the annual failure rate of transformers is between 0.49% and 9% [7]. Therefore, detecting and eliminating early loosening faults of power transformer windings is of great significance for maintaining the normal operation of transformers.

In recent years, the use of the vibration method to detect the mechanical status of power transformers has drawn the attention of researchers worldwide. Unlike traditional fault detection methods, the vibration method provides electrical isolation and real-time online monitoring, which helps maintain the secure and dependable operation of the power grid [8,9]. At present, the classification of the mechanical fault state of power transformer windings based on vibration signals is mainly achieved by extracting one-dimensional vibration signal eigenvalues or two-dimensional feature map information and then using machine learning or deep learning algorithms for classification and recognition.

Ref. [10] collected the winding vibration signals under different operating conditions of transformers, extracted the signal spectrum entropy value by wavelet transform as the input feature vector, and trained and tested the feature quantity using support vector machine. The identification and diagnosis of transformer windings under different operating conditions were achieved. Ref. [11] collected the short-circuit fault vibration signals of transformer windings in different degrees under transient and steady-state operating conditions. The Short-Time Fourier Transform (STFT) [12] was employed to analyze the transient phase signal, and the Fourier transform was utilized for the analysis of the steady-state phase. The energy index, along with the total harmonic distortion index, was put forward for the training of the neural network, and then the accurate identification of different degrees of transformer winding faults was achieved. Ref. [13] proposes a diagnostic approach for transformer winding faults, relying on statistical time features (STFs) and support vector machine (SVM) [14]. Several indices in the vibration signal of the transformer were calculated as statistical time features. Fisher score analysis was used to analyze the most discriminative features, and linear discriminant analysis was applied to reduce the dimensions of features. In the end, the SVM was employed to accomplish automatic diagnosis. Ref. [15] proposed a residual attention diagnosis model for power transformer winding fault fusion based on vibration signals and designed a Gramian guided filtering module to generate and fuse two-dimensional images at different positions from the original vibration signal. A high-dimensional convolutional attention mechanism module for an improved deep residual network was proposed to conduct the diagnosis of transformer winding faults.

Ref. [16] proposes a fault diagnosis method for the mechanical structure of power transformer windings based on comprehensive feature extraction and the Subtraction-Average-Based Optimizer (SABO) algorithm. Initially, features are extracted from the original vibration signal through a dual-feature approach that uses the wavelet transform and a variational mode decomposition optimized by the SABO algorithm. Then, the vibration feature vectors are weighted with coefficients obtained from the fuzzy analytic hierarchy process, and the combined eigenvalue is calculated from the feature vectors and fuzzy weights. The combined eigenvalue is used as the input vector, and the SABO algorithm is applied to enhance the probabilistic neural network that is trained on the vibration signals for diagnosis. In Ref. [17], the distribution of harmonics and the fundamental wave ratio are used as the feature information of a 110 kV power transformer’s winding looseness fault. The SHapley Additive exPlanations (SHAP) method is introduced to analyze the constructed feature information, and the key feature information combination set is generated. Finally, the high-accuracy identification of transformer winding looseness is achieved.

As deep learning models have advanced rapidly in the area of image recognition [18,19,20], the one-dimensional vibration signal is converted into a two-dimensional image, and the convolutional neural network is applied for feature extraction [21], so as to achieve image classification and recognition, which can effectively improve the accuracy of fault recognition. Common two-dimensional image generation methods mainly include the time–frequency analysis [22], wavelet transform [23], and image coding methods. Image coding technology mainly includes Markov transition fields, Gram-angle field transformation, and recurrence plots. The two-dimensional images generated by these methods can fully express the characteristic information of one-dimensional vibration signals. Ref. [24] proposes a transformer winding loose fault diagnosis method based on Gram-angle field transformation and transfer learning–AlexNet. The two-dimensional image set of the Gram-angle field of transformer vibration signals is generated by the sample construction method [25]. The generated image set is input into AlexNet for transfer learning, and the optimized neural network fault diagnosis model is obtained.

In summary, at this stage, there are still the following limitations in the identification of transformer winding faults: (1) The sample data of winding looseness faults under multi-load transformer current conditions are lacking, and the adaptability of fault diagnosis models is not strong. (2) The fault identification of single measuring points on the outer surface of the transformer box is not universal. (3) The time correlation is lost in the extraction of two-dimensional feature map information of vibration signals. The traditional convolutional neural network model is prone to overfitting in the training of transformer windings’ loose fault feature maps, which poses a significant challenge for improving the recognition accuracy.

In view of the limitations in the above-mentioned research on transformer winding fault identification, this study makes the following improvements and breakthroughs: (1) Vibration signals of winding looseness under different load current conditions of the transformer are measured to enhance the adaptability of the fault diagnosis model. (2) Six vibration acceleration sensors are uniformly arranged on the surface of the transformer tank to eliminate the contingency of fault identification from a single measurement point. (3) A method for generating time-domain vibration signal feature maps using relative position matrices is proposed to preserve the time correlation of vibration signals, and the ConvNeXt model is employed to address the overfitting issue of traditional convolutional neural network models during map training.

The remaining sections of this paper are organized as follows: Section 2 introduces the transformer winding loosening test and analyzes the characteristics of vibration signals under winding loosening faults. Section 3 presents a method for constructing two-dimensional feature maps based on the relative position matrix and Gram-angle field transformation to generate time–frequency-domain vibration feature maps for different winding loosening fault states of the transformer. Section 4 proposes a transformer winding loosening fault identification method based on feature maps and ConvNeXt, and it presents the construction of a fault identification model. Section 5 analyzes the identification accuracy of the transformer winding loosening fault model under multi-load current conditions and at different measurement points, verifying the robustness of ConvNeXt and its superiority over models such as ResNet50, GoogLeNet, and AlexNet.

The research objectives of this paper are to enrich the characteristic information of vibration signals under transformer winding looseness faults, construct a fault identification model for transformer windings based on feature spectrograms and ConvNeXt, address the issue of unimproved recognition accuracy caused by training overfitting in traditional models, achieve high-precision identification of transformer winding looseness faults under integrated multi-load current conditions, and provide technical support for mechanical fault diagnosis of transformer windings.

2. Research on Winding Looseness Tests of Power Transformers

2.1. Construction of Transformer Winding Loosening Test Platform

The experiment measured the vibration signals on the transformer tank surface under different degrees of winding looseness, which required adjusting the bolt pre-tightening force at the upper and lower ends of the winding press plate to simulate various looseness states. Given the large size of 110 kV transformers, setting different winding looseness conditions on them is technically challenging and costly. Based on national standards and theoretical calculations, this experiment adopted a 10 kV oil-immersed power transformer to establish the vibration test platform. The detailed parameters are listed in Table 1.

Table 1. Electrical parameters of 10 kV power transformer.

The test adopted the load short-circuit test; that is, voltage was applied to the high-voltage side and the low-voltage side’s three-phase winding was short-circuited. The test platform consisted of a 10 kV power transformer, vibration acceleration sensors, a signal acquisition instrument, and a host computer, as shown in Figure 1. In the figure, A, B, and C represent the three phases of the transformer. Piezoelectric acceleration sensors of model 1A941E were selected for measuring the vibration signal. They were evenly arranged on the front of the transformer box by magnetic attraction, with one acceleration sensor attached at the upper 1/4 and one at the lower 1/4 of the box surface corresponding to each phase winding; labels 1–6 indicate the six positions where sensors were installed. The signal acquisition instrument was a DH5902N, and the sampling frequency was 100 kHz.

Figure 1. Vibration signal acquisition system.

To ensure accurate fault identification under different load current conditions, a total of three load current conditions were set in the test: 90% I_{N}, 100% I_{N}, and 110% I_{N}, where I_{N} is the rated load current of the high-voltage side’s winding. The vibration data collection duration for each condition was 10 s.

2.2. Transformer Winding Loose Fault Setting

The internal windings of the 10 kV power transformer were tightened at both the top and bottom ends by the pull screw. As shown in Figure 2, the digital torque wrench was used to adjust the torque of the pull bolt to change the looseness of the winding.

Figure 2. Transformer winding loose adjustment diagram.

According to the national standard, the rated torque of a bolt is calculated as follows:

T = k F_{N} d

(1)

where T is the bolt torque; k is the bolt tightening coefficient, with a value range of 0.1~0.3 (0.13 was used in this test, corresponding to a general machined surface); F_{N} is the rated preload of the bolt, which in general is 80% of the yield strength of the bolt material; and d is the nominal diameter of the bolt (unit: mm). The rated preload of the bolt is computed as follows:

F_{N} = (0.5 ~ 0.7) σ_{s} A_{s}

(2)

where σ_{s} is the yield strength of the bolt material (unit: N/mm^{2}) and A_{s} is the stress cross-sectional area (unit: mm^{2}).

The pull bolts of the upper and lower pressure plates of the test transformer’s winding were M12 bolts with a strength grade of 4.6, a nominal diameter of 12 mm, a stress cross-sectional area of 84.3 mm^{2}, and a yield strength of 240 N/mm^{2}. Combining Formula (1) and Formula (2) gives the bolt torque as follows [26]:

T = \frac{k (0.5 ~ 0.7) σ_{s} A_{s} d}{1000}

(3)

The rated torque range of the tension bolt at the end of the winding was calculated to be 16~22 N·m by Formula (3). During the winding loosening test, the maximum torque measured with the digital torque wrench was 18 N·m, so the rated torque of the bolt at the end of the winding was set to 18 N·m; this state, in which the transformer winding is not loose, is denoted as 100% F_{N}.

Using the digital torque wrench, the torque of the bolts on the pull screw at the end of the three-phase winding of the transformer was adjusted in turn to 13.5 N·m, 9 N·m, and 4.5 N·m. These three degrees of looseness of the transformer winding were defined as 75% F_{N}, 50% F_{N}, and 25% F_{N}, respectively. Therefore, when the value measured by the torque wrench is 13.5 N·m, it can be considered that the transformer winding has a loose fault, which is an abnormal state.
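As a quick numerical check of Formulas (1)–(3) and of the looseness levels defined above, the following minimal sketch plugs in the bolt parameters stated in the text. The function name `bolt_torque_range` and the variable names are illustrative only and do not come from the paper.

```python
def bolt_torque_range(k, sigma_s, a_s, d_mm, preload_factors=(0.5, 0.7)):
    """Torque range in N*m from Formula (3): T = k * c * sigma_s * A_s * d / 1000.

    c runs over the lower and upper preload factors of Formula (2).
    """
    return tuple(k * c * sigma_s * a_s * d_mm / 1000.0 for c in preload_factors)


# Values stated in the text for the M12, grade 4.6 pull bolt.
t_min, t_max = bolt_torque_range(k=0.13, sigma_s=240.0, a_s=84.3, d_mm=12.0)
print(f"rated torque range: {t_min:.2f} to {t_max:.2f} N*m")

# Looseness levels expressed as a percentage of the 18 N*m rated torque.
rated_torque = 18.0
levels = {pct: rated_torque * pct / 100.0 for pct in (100, 75, 50, 25)}
for pct, torque in levels.items():
    print(f"{pct:>3d}% F_N -> {torque:.1f} N*m")
```

Rounded to whole newton-metres, the computed bounds agree with the 16~22 N·m range quoted in the text, and the three reduced levels reproduce the 13.5, 9, and 4.5 N·m settings.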

2.3. Vibration Signal Acquisition and Characteristic Analysis

With the transformer operating at its rated load, the vibration signal acquisition instrument was employed to collect the vibration signal waveforms of the transformer windings under five different degrees of looseness. In the time domain, the amplitude of the vibration signal showed no obvious regular change. The time-domain signals were therefore subjected to the discrete Fourier transform to obtain the frequency-domain signals. Figure 3 shows the spectrum waterfall diagram of the three-phase windings of the transformer at Measuring Point 1 under different looseness fault states.

Figure 3. The 3D spectrum waterfall diagram of the three-phase winding under different loose faults.

By analyzing the figure, it becomes clear that the fundamental frequency of the vibration signal for the transformer winding with different degrees of looseness is 100 Hz. The fundamental frequency amplitude increases with the increase in winding looseness.

Ref. [27] indicates that the vibration acceleration of the transformer is essentially positively correlated with the square of the load current. Numerical fitting was performed on the fundamental frequency amplitude of the vibration signal and the square of the load current, both when the transformer winding is in a normal condition and when it is in a loose-fault state; the resulting relationship curve is shown in Figure 4.

From the analysis of Figure 4, it is evident that the amplitude of the fundamental frequency of the vibration signal increases with the square of the load current, whether the transformer winding is operating normally or has a loosening fault. When a winding looseness fault occurs, the growth rate of the fundamental frequency amplitude with respect to the load current is numerically related to the growth rate for the winding that is not loose.

Figure 4. Relationship curve between fundamental frequency amplitude and load current.
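The kind of analysis behind Figure 4 can be sketched as follows: read the 100 Hz amplitude from the DFT of each record and fit it against the square of the load current. The signals here are synthetic, and the proportionality constant `c_true`, the per-unit currents, and the 1 s record length are assumptions made only for illustration; they are not measured values from the paper.

```python
import numpy as np


def fundamental_amplitude(signal, fs, f0=100.0):
    """Single-sided DFT amplitude at the frequency bin closest to f0."""
    n = len(signal)
    spectrum = np.fft.rfft(signal)
    freqs = np.fft.rfftfreq(n, d=1.0 / fs)
    idx = int(np.argmin(np.abs(freqs - f0)))
    return 2.0 * np.abs(spectrum[idx]) / n


fs = 100_000                      # sampling frequency stated in Section 2.1
t = np.arange(fs) / fs            # 1 s of data (shorter than the 10 s records)
currents = np.array([0.9, 1.0, 1.1])   # load current in per-unit of I_N
c_true = 0.5                      # hypothetical constant, not from the paper

# Synthetic vibration: 100 Hz component whose amplitude scales with I^2.
amps = np.array([
    fundamental_amplitude(c_true * i ** 2 * np.sin(2 * np.pi * 100.0 * t), fs)
    for i in currents
])

# Linear fit of amplitude against the square of the load current.
slope, intercept = np.polyfit(currents ** 2, amps, 1)
print(f"slope = {slope:.4f}, intercept = {intercept:.2e}")
```

With real measurements, one fit per looseness state would give the growth rates whose relationship is discussed above.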

To further extract the characteristic information of vibration signals before and after transformer winding loosening more comprehensively and achieve the accurate diagnosis and identification of transformer winding loosening faults, it is essential to encode the original vibration signal time series into two-dimensional images in the time domain and frequency domain. Before and after the transformer winding looseness fault occurs, there is a mapping relationship between the fundamental frequency amplitude of the vibration signal and the load current; that is, the vibration signal containing different load current conditions can also characterize the characteristic information before and after the winding fault. Therefore, this paper constructs the time–frequency-domain two-dimensional characteristic spectrum of the vibration containing different load conditions, so as to enrich the sample data of the vibration signal of the transformer winding and achieve the diagnosis and identification of the winding looseness fault under multiple working conditions.

3. Construction Method of Vibration Signal Characteristic Spectrum Under Transformer Winding Loosening Fault

3.1. Time-Domain Feature Map Construction Method Based on Relative Position Matrix

In terms of two-dimensional image coding technology, the relative position matrix method contains redundant feature information of the original time series, which makes the intra-class and inter-class similarity information of the converted two-dimensional image easier to capture [28]. The fundamental notion is to boost the convolutional neural network model’s ability to understand the sequence structure and extract features. This goal is accomplished through encoding the relative position connections between time-series elements. The specific steps of constructing two-dimensional feature maps by the relative position matrix are as follows:

(1): Data normalization processing:

For the original time series X = \{x_{1}, x_{2}, \dots, x_{N}\}, the standardized series Z [29] is obtained by z-score normalization, as follows:

z_{t} = \frac{x_{t} - μ}{σ}, t = 1, 2, \dots, N

(4)

where μ indicates the average of X, while σ indicates the standard deviation of X.

(2): Data dimension reduction smoothing:

The piecewise aggregate approximation technique is employed to perform dimensionality reduction on the normalized time series. An appropriate value of the reduction factor k is chosen, and the length N is reduced to m, resulting in a new, smoother time series:

{\tilde{x}}_{i} = \begin{cases} \frac{1}{k} \sum_{j = k (i - 1) + 1}^{k i} z_{j}, \quad i = 1, 2, \dots, m, & \text{if } \lceil \frac{N}{k} \rceil - \lfloor \frac{N}{k} \rfloor = 0 \\ \begin{cases} \frac{1}{k} \sum_{j = k (i - 1) + 1}^{k i} z_{j}, & i = 1, 2, \dots, m - 1 \\ \frac{1}{N - k (m - 1)} \sum_{j = k (m - 1) + 1}^{N} z_{j}, & i = m \end{cases} & \text{if } \lceil \frac{N}{k} \rceil - \lfloor \frac{N}{k} \rfloor > 0 \end{cases} \qquad m = \lceil \frac{N}{k} \rceil

(5)

(3): Relative position matrix generation:

The relative position between the two timestamps in the time domain is calculated, and the normalized vibration signal time series is converted into a two-dimensional matrix M, as shown below:

M = [\begin{matrix} {\tilde{x}}_{1} - {\tilde{x}}_{1} & {\tilde{x}}_{2} - {\tilde{x}}_{1} & \dots & {\tilde{x}}_{m} - {\tilde{x}}_{1} \\ {\tilde{x}}_{1} - {\tilde{x}}_{2} & {\tilde{x}}_{2} - {\tilde{x}}_{2} & \dots & {\tilde{x}}_{m} - {\tilde{x}}_{2} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ {\tilde{x}}_{1} - {\tilde{x}}_{m} & {\tilde{x}}_{2} - {\tilde{x}}_{m} & \dots & {\tilde{x}}_{m} - {\tilde{x}}_{m} \end{matrix}]

(6)

The minimum–maximum normalization approach is employed to transform the M matrix into a gray value matrix, and finally, the relative position matrix F [30] is obtained:

F = \frac{M - \min (M)}{\max (M) - \min (M)} \times 255

(7)

(4): Pseudo-color processing:

Through the mapping rules set in advance, each element in the feature matrix is regarded as an index value, and the intensity values of the corresponding R, G, and B are obtained in the Color Look-up Table according to the index value, so as to achieve the colorization of the gray matrix.
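Steps (1)–(3) above can be sketched in a few lines of NumPy. This is a minimal illustration of Formulas (4)–(7), not the authors' implementation; the function names, the use of the population standard deviation, and the example signal are assumptions.

```python
import numpy as np


def z_score(x):
    """Formula (4): subtract the mean and divide by the standard deviation."""
    x = np.asarray(x, dtype=float)
    return (x - x.mean()) / x.std()   # population standard deviation (assumption)


def paa(z, k):
    """Formula (5): average consecutive segments of k samples.

    When N is not a multiple of k, the last segment is shorter and is
    averaged over its own length.
    """
    n = len(z)
    m = int(np.ceil(n / k))
    return np.array([z[i * k:min((i + 1) * k, n)].mean() for i in range(m)])


def relative_position_matrix(x, k):
    """Formulas (6) and (7): pairwise differences scaled to gray values 0..255."""
    xt = paa(z_score(x), k)
    m_mat = xt[np.newaxis, :] - xt[:, np.newaxis]   # M[i, j] = x~_j - x~_i
    return (m_mat - m_mat.min()) / (m_mat.max() - m_mat.min()) * 255.0


# Example: one cycle of a 100 Hz tone sampled at 100 kHz, reduced by k = 10.
t = np.arange(1000) / 100_000
gray = relative_position_matrix(np.sin(2 * np.pi * 100.0 * t), k=10)
print(gray.shape, gray.min(), gray.max())
```

Because M is antisymmetric, its minimum is the negative of its maximum, so the zero-valued diagonal always lands at mid-gray (127.5) after the min–max scaling of Formula (7).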

The flowchart for generating the two-dimensional map from the original vibration signal time series by means of the relative position matrix is shown in Figure 5:

Figure 5. Two-dimensional feature map generation flowchart.

As shown in Figure 6, the two-dimensional feature map of the vibration signal at Measuring Point 1 under the non-loosening state of the transformer winding is generated by the relative position matrix:

Figure 6. Generated two-dimensional color feature map.
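The pseudo-color step (4) that turns a gray matrix into a color map like the one in Figure 6 amounts to indexing a color look-up table. The sketch below assumes a simple blue-to-green-to-red table built by the hypothetical helper `build_lut`; the paper does not state which color table was used.

```python
import numpy as np


def build_lut():
    """Hypothetical 256-entry look-up table running from blue via green to red."""
    idx = np.arange(256, dtype=float) / 255.0
    r = np.clip(2.0 * idx - 1.0, 0.0, 1.0)
    b = np.clip(1.0 - 2.0 * idx, 0.0, 1.0)
    g = 1.0 - r - b
    return (np.stack([r, g, b], axis=1) * 255.0).round().astype(np.uint8)


def pseudo_color(gray, lut):
    """Use each gray value as an index into the table and return an RGB image."""
    index = np.clip(np.rint(gray), 0, 255).astype(np.uint8)
    return lut[index]   # shape (H, W, 3)


lut = build_lut()
rgb = pseudo_color(np.array([[0.0, 255.0]]), lut)
print(rgb.shape, rgb[0, 0], rgb[0, 1])
```

Any other 256-entry table can be substituted without changing `pseudo_color`.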

3.2. Construction of Frequency-Domain Feature Map Based on Gram-Angle Field

The spectrum distribution of transformer windings under different looseness fault states changes obviously, and the change in the fundamental frequency amplitude under each looseness state is positively correlated with the current. Therefore, the construction of frequency-domain feature maps under different load current conditions can more accurately reflect the characteristic information of different looseness faults of windings. The two-dimensional image generated by the Gram-angle field transformation [31] is capable of more effectively reflecting the characteristic information of the frequency-domain signal.

Before constructing the frequency-domain feature map, the original one-dimensional vibration signal needs to be transformed by Fourier transform to obtain the spectrum curve, and then the amplitude information corresponding to the frequency is saved in matrix form by Gram-angle field transform. Finally, the frequency-domain feature map is obtained by pseudo-color processing. The detailed procedures are presented below:

(1): Complex modulation refines the spectrum:

The vibration signal frequency spectrum of the power transformer winding is predominantly concentrated in the range of 0~1000 Hz. In this experiment, the vibration signal sampling rate is 100 kHz, so after the Fourier transform, the characteristic information contained in the low-frequency components cannot be clearly displayed in the spectrum. Therefore, it is necessary to perform complex modulation refinement operations to filter out the high-frequency part, so that the generated spectrum curve is concentrated within 0~1000 Hz [32]. The complex modulation refinement spectrum flow is shown in Figure 7.

Figure 7. Complex modulation refinement spectrum flowchart.

As shown in Figure 7, the time series of the original vibration signal of the transformer, x_{in}(n), is transformed by the Hilbert transform [33] to construct the complex analytic sequence {\hat{x}}_{n} = \{a_{1} + j b_{1}, a_{2} + j b_{2}, \dots, a_{n} + j b_{n}\}. The complex analytic signal is then modulated by a complex exponential signal to shift the spectrum. The modulated signal undergoes low-pass filtering, which removes the high-frequency components and leaves only the frequency content within the 0~1000 Hz range. The filtered signal is resampled at a sampling frequency of f_{s} / D, where f_{s} is the original sampling frequency and D is the resampling ratio. The resampled signal undergoes a discrete Fourier transform, followed by frequency adjustment and refinement before being output.

(2): Data normalization processing:

The spectrum curve data after complex modulation refinement are normalized to the range [−1, 1], so that they can be directly used as the input of the cosine function. The normalization formula is as follows:

{\tilde{x}}_{i} = \frac{2 (x_{i} - x_{\min})}{x_{\max} - x_{\min}} - 1

(8)

(3): Gram matrix generation:

The complex sequence {\hat{x}}_{n} obtained after complex modulation refinement contains the amplitude and angle information of different frequency bands, so it does not need polar coordinate conversion. The obtained Gram-Angle Summation Field (GASF) matrix is as follows:

GASF = [\begin{matrix} \cos (θ_{1} + θ_{1}) & \dots & \cos (θ_{1} + θ_{n}) \\ ⋮ & ⋱ & ⋮ \\ \cos (θ_{n} + θ_{1}) & \dots & \cos (θ_{n} + θ_{n}) \end{matrix}]

(9)

In this formula, θ_{i} is the angle value of the i-th element of the sequence.
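The complex modulation refinement described in step (1) can be sketched with NumPy alone. This is a simplified illustration under stated assumptions: the analytic signal and the low-pass filter are both computed in the frequency domain (an ideal brick-wall filter rather than whatever filter the authors used), and the center frequency of 500 Hz, half-bandwidth of 500 Hz, and resampling ratio D = 100 are chosen here only so that the output covers 0~1000 Hz.

```python
import numpy as np


def analytic_signal(x):
    """Analytic signal via the Hilbert transform, computed with the FFT."""
    n = len(x)
    spectrum = np.fft.fft(x)
    h = np.zeros(n)
    h[0] = 1.0
    if n % 2 == 0:
        h[n // 2] = 1.0
        h[1:n // 2] = 2.0
    else:
        h[1:(n + 1) // 2] = 2.0
    return np.fft.ifft(spectrum * h)   # negative frequencies are zeroed


def zoom_spectrum(x, fs, f_center, half_band, decim):
    """Shift f_center to 0 Hz, low-pass, resample by decim, then take the DFT."""
    n = len(x)
    t = np.arange(n) / fs
    shifted = analytic_signal(x) * np.exp(-2j * np.pi * f_center * t)

    # Ideal low-pass filter applied in the frequency domain (assumption).
    spec = np.fft.fft(shifted)
    freqs = np.fft.fftfreq(n, d=1.0 / fs)
    spec[np.abs(freqs) > half_band] = 0.0
    filtered = np.fft.ifft(spec)

    resampled = filtered[::decim]      # new sampling frequency is fs / decim
    zoom = np.fft.fftshift(np.fft.fft(resampled)) / len(resampled)
    zoom_freqs = np.fft.fftshift(
        np.fft.fftfreq(len(resampled), d=decim / fs)) + f_center
    return zoom_freqs, np.abs(zoom)


fs = 100_000                           # sampling frequency from Section 2.1
t = np.arange(fs) / fs                 # 1 s synthetic record
x = np.cos(2 * np.pi * 100.0 * t) + 0.3 * np.cos(2 * np.pi * 300.0 * t)
freqs_zoom, amp_zoom = zoom_spectrum(x, fs, f_center=500.0,
                                     half_band=500.0, decim=100)
peak = freqs_zoom[np.argmax(amp_zoom)]
print(f"peak at {peak:.1f} Hz, band {freqs_zoom.min():.0f} to {freqs_zoom.max():.0f} Hz")
```

After resampling, all DFT bins fall inside the band of interest instead of being spread over the full 0~50 kHz range of the original record.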

Pseudo-color processing is carried out on the generated Gram matrix to obtain a two-dimensional feature map in the frequency domain, which is then compared with the two-dimensional map generated without complex modulation refinement, as shown in Figure 8.

Figure 8. Comparison flowchart before and after frequency-domain feature map refinement.
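For reference, Formulas (8) and (9) can be sketched as follows. Note one assumption: the text takes the angle information from the complex sequence, whereas this sketch uses the common textbook encoding in which each angle is the inverse cosine of the normalized value. The function names are illustrative.

```python
import numpy as np


def rescale(x):
    """Formula (8): map the sequence linearly onto [-1, 1]."""
    x = np.asarray(x, dtype=float)
    return 2.0 * (x - x.min()) / (x.max() - x.min()) - 1.0


def gasf(x):
    """Formula (9): GASF[i, j] = cos(theta_i + theta_j).

    theta_i = arccos of the rescaled value (assumption, see lead-in).
    """
    xt = np.clip(rescale(x), -1.0, 1.0)   # guard against rounding outside [-1, 1]
    theta = np.arccos(xt)
    return np.cos(theta[:, np.newaxis] + theta[np.newaxis, :])


# Toy spectrum amplitudes; a real input would be the refined spectrum curve.
g = gasf([0.0, 1.0, 2.0])
print(np.round(g, 3))
```

The resulting matrix is symmetric, and its values lie in [−1, 1], so it can be min–max scaled to gray values and colorized in the same way as the time-domain map.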

3.3. Quantitative Analysis of Time–Frequency-Domain Feature Images

In this section, a spectral mapping construction method is employed to generate time–frequency two-dimensional feature images for transformer windings with different degrees of looseness. Taking Measurement Point 1 as an example, quantitative analysis of the generated images is carried out from two dimensions: spectral energy and information entropy values.

Spectral energy analysis, grounded in Fourier transform, serves to quantify the energy distribution of images in the frequency domain and can reflect their high-frequency and low-frequency characteristics. Information entropy is used to measure the information content of an image or the uncertainty of pixel distribution. A higher entropy value indicates more complex image content and richer textures, while a lower entropy value signifies a simpler image. The calculation formula is as follows:

H = - \sum_{i = 0}^{L - 1} p (i) \log_{2} p (i)

(10)

where p(i) represents the probability that the pixel value is i, and L is the number of possible pixel values.
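Formula (10) can be evaluated directly from the gray-level histogram of an image. The sketch below assumes 8-bit gray images (256 levels); the function name `image_entropy` is illustrative.

```python
import numpy as np


def image_entropy(gray, levels=256):
    """Formula (10): Shannon entropy (in bits) of the gray-level histogram."""
    values = np.asarray(gray).astype(np.int64).ravel()
    hist = np.bincount(values, minlength=levels)
    p = hist / hist.sum()
    p = p[p > 0]                       # terms with p(i) = 0 contribute nothing
    return float(-(p * np.log2(p)).sum())


flat = np.full((8, 8), 128)                  # a single gray level
halves = np.array([[0, 0, 255, 255]])        # two equally likely levels
ramp = np.arange(256).reshape(16, 16)        # every level exactly once
print(image_entropy(flat), image_entropy(halves), image_entropy(ramp))
```

The three toy images span the extremes: a constant image carries no information, two equally likely levels give one bit, and a uniform histogram over 256 levels gives the maximum of eight bits.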

This study defines low frequency as below 100 Hz, medium frequency as 100–500 Hz, and high frequency as above 500 Hz. Information entropy values of time–frequency-domain feature maps and energy values of frequency-domain feature maps are computed for four transformer winding looseness states (100% F_{N}, 75% F_{N}, 50% F_{N}, and 25% F_{N}). The resulting feature distributions are illustrated in Figure 9.

Based on the analysis of Figure 9a, the information entropy values of frequency-domain spectra under four different loosening states of transformer windings exhibit no obvious regular changes, while the information entropy values of time-domain spectra increase with the aggravation of the loosening degree. According to the analysis of Figure 9b, as the winding loosening degree increases, the energy in the medium-frequency band rises, and the energies in both the high- and low-frequency bands decrease.

Figure 9. Quantitative analysis of spectral features under different loosening faults of transformer windings: (a) Distribution of time–frequency-domain information entropy values. (b) Frequency-domain spectral energy analysis.
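The band split used in Figure 9b can be illustrated with a short sketch that computes the share of spectral energy in the low, medium, and high bands defined above. It operates on a one-dimensional signal spectrum, which is an assumption about how the energies were obtained; the test signal is synthetic.

```python
import numpy as np


def band_energy_shares(signal, fs, edges=(100.0, 500.0)):
    """Fraction of spectral energy below, between, and above the band edges."""
    spectrum = np.fft.rfft(signal)
    freqs = np.fft.rfftfreq(len(signal), d=1.0 / fs)
    power = np.abs(spectrum) ** 2
    low = power[freqs < edges[0]].sum()
    mid = power[(freqs >= edges[0]) & (freqs <= edges[1])].sum()
    high = power[freqs > edges[1]].sum()
    total = low + mid + high
    return low / total, mid / total, high / total


fs = 100_000
t = np.arange(fs) / fs
# Synthetic signal: 50 Hz and 800 Hz tones of amplitude 1, 200 Hz tone of amplitude 2.
sig = (np.sin(2 * np.pi * 50.0 * t)
       + 2.0 * np.sin(2 * np.pi * 200.0 * t)
       + np.sin(2 * np.pi * 800.0 * t))
low, mid, high = band_energy_shares(sig, fs)
print(f"low {low:.3f}, medium {mid:.3f}, high {high:.3f}")
```

Since energy scales with the square of amplitude, the three tones contribute in the ratio 1:4:1, which is what the printed shares reflect.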

4. Construction of a Different Transformer Winding Loose Fault Diagnosis and Recognition Model Based on ConvNeXt

4.1. Limitations of Traditional Residual Convolutional Neural Networks

In recent years, in ImageNet image classification and recognition, residual convolutional neural networks have introduced a residual block structure. This innovation effectively deals with gradient vanishing and gradient explosion in deep neural networks, thus improving network performance to a certain extent [34]. However, as network depth increases, the residual neural network needs strong GPU computing power, resulting in large resource consumption. When dealing with small-scale datasets, the strong expressive ability of the residual neural network may lead to overfitting during training.

Given the deficiencies of the conventional residual neural network in image recognition, and drawing on the design direction and training strategy of vision transformers in the field of visual recognition, the parameters and structure of the existing ResNet50 model were optimized to obtain the improved ConvNeXt network model [35]. ConvNeXt uses Swin Transformer’s sliding-window strategy to reuse the calculation results between each local area of the image [36], thereby reducing the amount of calculation. The model has achieved excellent results in industrial machinery fault detection [37].

4.2. The Basic Structure of ConvNeXt Model

The ConvNeXt architecture encompasses five distinct variants: ConvNeXt-Tiny, ConvNeXt-Small, ConvNeXt-Base, ConvNeXt-Large, and ConvNeXt-XLarge. As delineated in Table 2, ConvNeXt-Tiny features a streamlined design with fewer layers, while maintaining identical kernel dimensions to its counterparts. This lightweight configuration results in a parameter count just one-third that of ConvNeXt-Base, leading to reduced computational overheads and power consumption—qualities well suited for integration into fault diagnosis hardware. Moreover, its inference speed exceeds 80 FPS, fulfilling the latency requirements of real-time image classification tasks. Through comprehensive evaluation, ConvNeXt-Tiny was selected as the training model for this study.

Table 2. Parameter comparison of ConvNeXt model variants.

The ConvNeXt model draws on the design concept of the vision transformer and adopts a hierarchical structure that divides the network into five modules. The model block diagram is shown in Figure 10. When the two-dimensional feature map enters the network, the Stem layer’s convolution operation conducts an initial extraction of feature information and lowers the resolution, and layer normalization is then applied. The ConvNeXt-Tiny model has four stages; each stage contains a different number of ConvNeXt modules, and downsampling operations are placed between the stages. The ConvNeXt blocks in stages 1 to 4 further extract and refine the feature maps output by the Stem layer, and they improve the expressive ability of the feature maps by increasing the number of channels.

Figure 10. ConvNeXt-Tiny model block diagram.

Finally, the feature map output from stage 4 is globally averaged over the spatial dimensions, so that the feature map of each channel is compressed into a scalar. The resulting feature vector is then mapped to the number of categories through a fully connected layer, and a score for each category is provided as output.
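The hierarchical structure described above can be followed by tracing the feature map shape through the network. The following is a minimal sketch, assuming the stem stride of 4, the stride-2 downsampling between stages, and the channel widths (96, 192, 384, 768) of the original ConvNeXt-Tiny design [35]; the four output classes correspond to the four looseness states considered in this study. The function names are hypothetical.

```python
def trace_shapes(input_size=224, dims=(96, 192, 384, 768), num_classes=4):
    """Return (name, channels, height, width) after each part of the network."""
    shapes = []
    size = input_size // 4                 # stem: 4x4 convolution, stride 4 (assumed)
    shapes.append(("stem", dims[0], size, size))
    for i, channels in enumerate(dims):
        if i > 0:
            size //= 2                     # stride-2 downsampling between stages
        shapes.append((f"stage{i + 1}", channels, size, size))
    shapes.append(("global_avg_pool", dims[-1], 1, 1))
    shapes.append(("fc", num_classes, 1, 1))
    return shapes


def global_average_pool(feature_map):
    """Compress each channel (a 2D list) of a feature map into one scalar."""
    return [sum(map(sum, ch)) / (len(ch) * len(ch[0])) for ch in feature_map]


for name, c, h, w in trace_shapes():
    print(f"{name:>16}: {c} x {h} x {w}")
```

For a 224 × 224 input, the spatial resolution shrinks stage by stage while the channel count grows, and global average pooling leaves one scalar per channel for the fully connected layer.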

As shown in Figure 11, in contrast to the traditional residual module, the ConvNeXt module introduces depthwise convolution. First, a 7 × 7 convolution kernel is applied to each input channel separately to perform a depthwise spatial convolution. Layer normalization then reduces the influence of internal covariate shift and accelerates model convergence. A 1 × 1 convolution is applied to fuse the output channels of the depthwise convolution, which significantly reduces the amount of calculation while maintaining good feature extraction ability.

Figure 11. ConvNeXt module flowchart.
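The saving in calculation from separating the spatial and channel mixing can be estimated by counting multiply-accumulate operations. The sketch below compares a standard 7 × 7 convolution with a 7 × 7 depthwise convolution followed by a 1 × 1 convolution; the 56 × 56 × 96 feature map size is an illustrative assumption, and the helper name is hypothetical.

```python
def conv_macs(h, w, c_in, c_out, k, depthwise=False):
    """Multiply-accumulate count of a stride-1, same-padded k x k convolution."""
    if depthwise:
        # One k x k filter per input channel; channels are not mixed,
        # so c_out does not enter the count.
        return h * w * c_in * k * k
    return h * w * c_in * c_out * k * k


h = w = 56   # assumed feature map size
c = 96       # assumed channel count

standard = conv_macs(h, w, c, c, 7)
separable = conv_macs(h, w, c, c, 7, depthwise=True) + conv_macs(h, w, c, c, 1)

print(f"standard 7x7:        {standard:,} MACs")
print(f"depthwise 7x7 + 1x1: {separable:,} MACs")
print(f"reduction factor:    {standard / separable:.1f}")
```

In the actual ConvNeXt block the first 1 × 1 convolution expands to four times the channel count, so the exact factor differs, but the order of magnitude of the saving is similar.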

Gaussian Error Linear Unit (GELU) [38] was picked as the activation function, and the following is its formula:

\mathrm{GELU}(x) = x \cdot \frac{1}{2} \left(1 + \mathrm{erf}\left(\frac{x}{\sqrt{2}}\right)\right)

(11)

In this formula, erf is the error function, defined as follows:

\mathrm{erf}(x) = \frac{2}{\sqrt{\pi}} \int_{0}^{x} e^{-t^{2}} \, dt

(12)

According to Formula (11), the GELU function is a smooth nonlinear function whose activation intensity is adjusted dynamically according to the distribution of input values. In the ConvNeXt module, the number of activation functions is reduced: only one activation function is placed between the two 1 × 1 convolutions. Finally, the input and output of the module are added; this residual connection helps to alleviate vanishing gradients, which makes the model easier to train and improves its convergence.
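Formula (11) can be evaluated directly with the error function from the Python standard library. The short sketch below is only an illustration of the formula, with ReLU included for comparison; it is not taken from the authors' implementation.

```python
import math


def gelu(x):
    """GELU(x) = x * 0.5 * (1 + erf(x / sqrt(2))), as in Formula (11)."""
    return x * 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def relu(x):
    """ReLU, shown only for comparison with the smooth GELU curve."""
    return max(0.0, x)


for x in (-3.0, -1.0, 0.0, 1.0, 3.0):
    print(f"x = {x:+.1f}   GELU = {gelu(x):+.4f}   ReLU = {relu(x):+.4f}")
```

Unlike ReLU, GELU passes small negative values with a reduced weight instead of cutting them to zero, which is the smooth, input-dependent behavior described above.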

The model training and visualization operations in this study were implemented in Python 3.10. Code editing was performed in PyCharm 2022.3.3 (Professional Edition), the PyTorch framework (version 2.5.1) was employed for deep learning and computer vision processing, and CUDA was used for GPU acceleration. The GPU was an NVIDIA GeForce RTX 3070 Ti.

In terms of neural network training algorithm optimization, the AdamW [39] optimizer was used, and the relevant hyperparameters were set as follows: batch size = 16, max epoch = 100, and learning rate = 0.0001. The parameter settings of each module of the network are shown in Table 3.

Table 3. Network module parameter settings.
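To make the role of the AdamW optimizer [39] concrete, a single scalar update step is sketched below in plain Python. Only the learning rate of 0.0001 comes from the text; the values of beta1, beta2, eps, and weight_decay are assumed to be the PyTorch defaults, since they are not reported here, and the function name is hypothetical.

```python
import math


def adamw_step(theta, grad, m, v, t, lr=1e-4, beta1=0.9, beta2=0.999,
               eps=1e-8, weight_decay=0.01):
    """One AdamW update for a scalar parameter; t is the step index from 1.

    beta1, beta2, eps and weight_decay are assumed defaults, not reported values.
    """
    # Decoupled weight decay: shrink the parameter directly rather than
    # adding an L2 term to the gradient.
    theta = theta * (1.0 - lr * weight_decay)
    # Exponential moving averages of the gradient and the squared gradient.
    m = beta1 * m + (1.0 - beta1) * grad
    v = beta2 * v + (1.0 - beta2) * grad * grad
    # Bias correction for the zero-initialised moment estimates.
    m_hat = m / (1.0 - beta1 ** t)
    v_hat = v / (1.0 - beta2 ** t)
    theta = theta - lr * m_hat / (math.sqrt(v_hat) + eps)
    return theta, m, v


theta, m, v = 1.0, 0.0, 0.0
for step in range(1, 4):
    theta, m, v = adamw_step(theta, grad=1.0, m=m, v=v, t=step)
    print(step, theta)
```

Because the decay acts on the parameter itself, its strength does not depend on the adaptive gradient scaling, which is the distinction between AdamW and Adam with L2 regularization.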

4.3. The Overall Process of Transformer Winding Looseness Fault Identification Based on the ConvNeXt Model

The process of diagnosing transformer winding looseness faults using the ConvNeXt model is illustrated in Figure 12 below.

Figure 12. Transformer winding loose fault diagnosis and classification identification process.

(1) Vibration signal acquisition: The signal acquisition instrument is employed to collect the vibration signals of the box surface under various loose fault conditions of the three-phase winding of the transformer.

(2) Two-dimensional map library construction: For the collected original vibration signal time series, the relative position matrix is applied to produce the time-domain map, and the Gramian angular field is used to generate the frequency-domain feature map. Subsequently, a time-domain and frequency-domain feature map library for transformer winding looseness faults under various load current conditions is established.

(3) Model training: The time–frequency-domain feature map training set is fed into the constructed ConvNeXt model for training. Then, a softmax classifier is applied to yield the classification results, and the model’s training parameters are saved.

(4) Test set verification: The time–frequency characteristic spectrum test set is fed into the trained ConvNeXt model for testing, and the accuracy of transformer winding looseness fault identification is evaluated.
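The softmax classification in step (3) can be sketched as follows. The class labels correspond to the four looseness states considered in this study; the logit values in the usage line are made-up placeholders rather than model outputs, and the function names are hypothetical.

```python
import math

# Four looseness states used as class labels in this study.
CLASSES = ("100% F_N", "75% F_N", "50% F_N", "25% F_N")


def softmax(logits):
    """Convert raw class scores into probabilities that sum to 1."""
    peak = max(logits)                      # subtract the maximum for stability
    exps = [math.exp(z - peak) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def classify(logits):
    """Return the most probable class label and its probability."""
    probs = softmax(logits)
    best = max(range(len(probs)), key=probs.__getitem__)
    return CLASSES[best], probs[best]


# Placeholder logits for illustration only.
label, prob = classify([0.1, 3.0, -1.0, 0.5])
print(label, round(prob, 3))
```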

5. Test Comparison and Analysis

5.1. Construction of Two-Dimensional Feature Map Data Samples

Before the images are passed to the ConvNeXt network for training, a sample library must be constructed from the generated time–frequency-domain feature maps. In the test, vibration signals are collected for the 100% F_{N}, 75% F_{N}, 50% F_{N}, and 25% F_{N} loosening faults of the three-phase winding of the transformer; for each loosening fault, signals are collected under the 90% I_{N}, 100% I_{N}, and 110% I_{N} load conditions; and the time–frequency two-dimensional characteristic spectra are constructed. To ensure that each image contains complete periodic feature information, the original vibration signal time series is processed by continuous slicing, with each slice containing 4000 sampling points. The image sample set generated in this way contains three load current conditions under each loose fault state. A total of 11,964 image samples are generated from the winding loose fault signals at each measuring point, and the size of each image sample is 224 × 224 × 3.
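The continuous slicing step can be illustrated with a short sketch. The slice length of 4000 points follows the text; the assumption that slices do not overlap and that an incomplete tail is discarded is ours, and the function name is hypothetical.

```python
def slice_signal(signal, slice_len=4000):
    """Cut a 1D signal into consecutive, non-overlapping slices.

    Assumption: a trailing remainder shorter than slice_len is discarded.
    """
    n_slices = len(signal) // slice_len
    return [signal[i * slice_len:(i + 1) * slice_len] for i in range(n_slices)]


# Dummy signal standing in for a recorded vibration time series.
dummy = list(range(10_000))
slices = slice_signal(dummy)
print(len(slices), "slices of", len(slices[0]), "points")

# Sample count per measuring point: three load conditions, 3988 images each.
images_per_point = 3 * 3988
print(images_per_point)
```

The last two lines reproduce the sample count arithmetic: 3988 images for each of the three load conditions give 11,964 images per measuring point.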

Figure 13 shows some image sample sets under different loose fault states of transformer windings.

Figure 13. Partial image datasets of transformer windings in different loose states.

5.2. Comparative Analysis of Transformer Winding Loose Fault Identification Under Different Load Current Conditions

To study the accuracy of recognizing transformer winding looseness faults under different load conditions, the time–frequency-domain feature maps generated under the three load conditions of 90% I_{N}, 100% I_{N}, and 110% I_{N} were input into ConvNeXt for training. A total of 3988 images were generated under each load current condition. The samples under each load condition were partitioned into a training set and a test set at a ratio of 5 to 1, and training ran for 100 rounds. Taking Measuring Point 1 as an example, the change curves of the recognition accuracy and loss value of the test set are shown in Figure 14, and the recognition accuracy under different load conditions is shown in Table 4.

Figure 14. The training process of transformer winding looseness identification under different load currents: (a) Test set recognition accuracy. (b) Test set loss value change.

Table 4. Recognition accuracy under different load conditions.
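A 5-to-1 partition of the 3988 images of one load condition might be implemented as in the sketch below. The random shuffling, the fixed seed, and the rounding rule are assumptions, since the text does not state how the split was drawn; the function name is hypothetical.

```python
import random


def split_train_test(samples, train_parts=5, test_parts=1, seed=0):
    """Shuffle the samples and split them at a train:test ratio of 5:1."""
    items = list(samples)
    random.Random(seed).shuffle(items)      # reproducible shuffle (assumed)
    n_train = len(items) * train_parts // (train_parts + test_parts)
    return items[:n_train], items[n_train:]


# Image indices standing in for the 3988 feature maps of one load condition.
train, test = split_train_test(range(3988))
print(len(train), "training images,", len(test), "test images")
```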

The training curves of transformer winding looseness identification under different load current conditions show that the test set identification accuracy for the three load conditions of 90% I_{N}, 100% I_{N}, and 110% I_{N} all exceeded 99% after the 50th round of testing, and the test set accuracy under the mixture of the three load conditions also reached 99.54%. In terms of loss convergence, the test set for the mixed three-load dataset had a significant advantage, with the loss value tending toward 0, whereas the loss values of the three individual load conditions fluctuated to different degrees when each was trained separately.

In summary, the two-dimensional images under the three load conditions of 90% I_{N}, 100% I_{N}, and 110% I_{N} not only enriched the dataset for identifying transformer winding looseness faults but also achieved very high recognition accuracy and good convergence.

t-distributed Stochastic Neighbor Embedding (t-SNE) dimension reduction [40] was performed on the fault classification features at Measuring Point 1, yielding the 3D visualization shown in Figure 15. Each loose fault class shows good separation, with the 75% F_{N} loose fault class being classified particularly well.

Figure 15. Classification and recognition T-SNE effect diagram.

To verify the sensitivity of the recognition accuracy of the ConvNeXt model to the number of image samples, 80%, 60%, and 50% of the spectrum samples from Measurement Point 1 were extracted and input into the model for training and recognition, with the number of training epochs set to 100. The distribution of the recognition accuracy of the test set samples at the 100th epoch is shown in Table 5 below.

Table 5. The distribution of model recognition accuracy under different sample numbers.

Analysis of the table reveals that when the number of spectrum samples drops to 80%, the 100th round of recognition accuracy of Measurement Point 1 decreases by nearly 3 percentage points. When the sample quantity is halved, the 100th round of recognition accuracy falls to 85.33%. Thus, the number of samples is one of the key factors influencing the recognition accuracy of the ConvNeXt model; the more sufficient the samples, the richer the fault feature information extracted by the model.

5.3. Comparative Analysis of Transformer Winding Loose Fault Identification Under Different Measuring Points

In this section, the two-dimensional spectra of different winding loose fault states, corresponding to six measuring points, were trained and identified. Each measuring point contained vibration data in the four states of 100% F_{N}, 75% F_{N}, 50% F_{N}, and 25% F_{N}, giving a total of 71,784 images across the six measuring points. The samples of each measuring point were partitioned into a training set and a test set at a ratio of 5 to 1. The data samples were fed into the ConvNeXt model for training, and the number of training rounds was set to 100. The change curves of the recognition accuracy and the loss value of the test set for each measuring point are shown in Figure 16.

Figure 16. The training process of transformer winding looseness identification under different measuring points: (a) Test set recognition accuracy. (b) Test set loss value change.

A comprehensive analysis of the test set recognition accuracy and loss curves in the figure shows that Measuring Points 1, 2, and 5 converge best during recognition. By the 10th round of testing, the recognition accuracy of Measuring Points 1, 2, and 5 stabilizes at 97.9%, 100%, and 97.0%, respectively. By the 50th round, the loss functions of Measuring Points 3 and 4 have effectively converged, with recognition accuracies of 94.7% and 96.9%, respectively. The loss function at Measuring Point 6 converges poorly, with a loss value of 0.11; after 100 rounds of recognition on the test set, its accuracy reaches 93.7%, which is still a relatively good classification result.

Figure 17 shows the confusion matrices of the test sets of the six measuring points at the 100th round of identification.

Figure 17. Different measuring points’ classification recognition confusion matrix.

Based on the confusion matrix, the classification precision, recall rate, and F1-score for each measurement point were calculated, yielding the evaluation indicators shown in Table 6. A comprehensive analysis revealed that Measurement Points 1, 2, 4, and 5 exhibited remarkably high average recognition accuracy, demonstrating the strong classification capability of the ConvNeXt model. Notably, the test set images at Measurement Point 2 achieved 100% F_{N} correct classification, representing a perfect performance. For Measurement Point 3, the 75% F_{N} fault condition showed significant recognition advantages, with an F1-score of 97.3%, while the recall rate for the 100% F_{N} fault condition was poor. At Measurement Point 6, the datasets under the 75% F_{N} and 25% F_{N} fault conditions suffered severe classification confusion, with F1-scores of 85.8% and 84.2%, respectively, significantly worse than the other measurement points in terms of overall recognition performance.

Table 6. Classification evaluation index of different measuring points.
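The precision, recall rate, and F1-score in Table 6 follow from a confusion matrix in the standard way. The sketch below shows the calculation on a hypothetical 4 × 4 matrix whose values are invented for illustration and are not the results of Figure 17; rows are taken as true classes and columns as predicted classes, which is also an assumption.

```python
def per_class_metrics(cm):
    """Return (precision, recall, F1) per class; rows = true, columns = predicted."""
    n = len(cm)
    metrics = []
    for k in range(n):
        tp = cm[k][k]
        fp = sum(cm[i][k] for i in range(n)) - tp   # predicted k, true class differs
        fn = sum(cm[k]) - tp                        # true k, predicted otherwise
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        f1 = (2 * precision * recall / (precision + recall)
              if precision + recall else 0.0)
        metrics.append((precision, recall, f1))
    return metrics


# Hypothetical confusion matrix (NOT data from this study).
cm = [
    [50, 0, 0, 0],
    [0, 45, 0, 5],
    [0, 0, 50, 0],
    [0, 5, 0, 45],
]
for k, (p, r, f1) in enumerate(per_class_metrics(cm)):
    print(f"class {k}: precision={p:.3f} recall={r:.3f} F1={f1:.3f}")
```

In this made-up example the second and fourth classes are confused with each other, which lowers both their precision and recall, the same kind of pattern described for Measurement Point 6.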

From the above-mentioned analysis, it is evident that the model recognition accuracy of the six measuring points on the surface of the transformer box is generally high, and the average recognition accuracy of all measuring points is 97.9%, which can essentially reflect the recognition effect under the three-phase loosening fault of the transformer winding.

5.4. Model Robustness Verification

To mitigate overfitting caused by fixed training–test splits and ensure the robustness of the ConvNeXt model, this section incorporated a 6-fold cross-validation pipeline during training. The dataset was repeatedly partitioned and trained across six iterations, with each iteration validated on a distinct subset. Two-dimensional spectra corresponding to different winding looseness fault states at six measurement points were trained individually. For instance, taking Measurement Point 1 as a case study, the boxplot of recognition accuracy for each fold after the 50th training epoch was generated.
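The six-fold partitioning can be sketched with index lists only. The sample count of 11,964 per measuring point comes from Section 5.1; the shuffling, seed, and interleaved fold assignment are assumptions, and the function name is hypothetical.

```python
import random


def k_fold_indices(n_samples, k=6, seed=0):
    """Yield (train_indices, test_indices) for each of the k folds."""
    idx = list(range(n_samples))
    random.Random(seed).shuffle(idx)        # reproducible shuffle (assumed)
    folds = [idx[i::k] for i in range(k)]   # k nearly equal, disjoint folds
    for i in range(k):
        test = folds[i]
        train = [j for f, fold in enumerate(folds) if f != i for j in fold]
        yield train, test


for fold, (train_idx, test_idx) in enumerate(k_fold_indices(11_964), start=1):
    print(f"fold {fold}: {len(train_idx)} training / {len(test_idx)} test samples")
```

With six folds, each iteration holds out one-sixth of the samples, so every fold reproduces the 5-to-1 ratio used in the fixed split of Section 5.3.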

Analysis of Figure 18 indicates that the six-fold test sets exhibit generally excellent overall recognition accuracy, with all boxplot boxes remaining narrow—evidence of the model’s robust recognition stability. Subsequently, the average recognition accuracy for each fold across the six measurement points was calculated, alongside the standard deviation of the six-fold training outcomes. The standard deviation formula is as follows:

σ = \sqrt{\frac{\sum_{i = 1}^{n} {(x_{i} - μ)}^{2}}{n}}

(13)

where σ denotes the standard deviation (volatility), x_{i} is the i-th data value, μ signifies the average value of the data, and n indicates the quantity of data.

Figure 18. Test set recognition accuracy boxplot.
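Formula (13) divides by n rather than n − 1, so it corresponds to the population standard deviation. A minimal sketch is given below with generic example numbers that are not results from this study; the comparison with `statistics.pstdev` from the Python standard library is included only as a cross-check.

```python
import math
import statistics


def population_std(values):
    """Standard deviation as in Formula (13): divide by n, not n - 1."""
    n = len(values)
    mu = sum(values) / n
    return math.sqrt(sum((x - mu) ** 2 for x in values) / n)


# Generic example values (NOT accuracies from this study).
data = [2, 4, 4, 4, 5, 5, 7, 9]
print(population_std(data))
print(statistics.pstdev(data))   # library equivalent of Formula (13)
```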

Analysis of Table 7 shows that, after six-fold cross-validation, the average recognition accuracy of each measurement point dataset exhibits no significant change compared to the training results in Section 5.3, and the standard deviation is also minimal. This verifies that the ConvNeXt model is robust, to a certain degree, to how the data are divided into training and test sets.

Table 7. The training recognition accuracy and standard deviation distribution of each measuring point.

To validate the ConvNeXt model’s robustness against noise, the following section classifies noisy vibration signals. Gaussian white noise with specified power was added to the original signals to generate samples at varying signal-to-noise ratios (SNRs). These noisy signals were then converted into time–frequency-domain 2D feature maps using the spectrum construction method. Figure 19 compares the 2D spectrum of the original signal with that of a signal processed at 40 dB SNR.

Figure 19. Comparison of time–frequency spectra of vibration signals containing noise.
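Adding Gaussian white noise at a prescribed SNR amounts to scaling the noise power relative to the measured signal power. The sketch below illustrates this on a synthetic 100 Hz sine wave; the test signal, the 10 kHz sampling rate, the seed, and the function names are all assumptions for illustration and do not describe the measured data.

```python
import math
import random


def add_awgn(signal, snr_db, seed=0):
    """Add zero-mean Gaussian white noise so that the result has the given SNR."""
    rng = random.Random(seed)
    p_signal = sum(x * x for x in signal) / len(signal)
    p_noise = p_signal / (10 ** (snr_db / 10))   # SNR_dB = 10 log10(Ps / Pn)
    sigma = math.sqrt(p_noise)
    return [x + rng.gauss(0.0, sigma) for x in signal]


def measured_snr_db(clean, noisy):
    """Estimate the SNR from the clean signal and its noisy version."""
    p_signal = sum(x * x for x in clean) / len(clean)
    p_noise = sum((n - c) ** 2 for c, n in zip(clean, noisy)) / len(clean)
    return 10 * math.log10(p_signal / p_noise)


# Synthetic 100 Hz tone sampled at an assumed 10 kHz (illustration only).
clean = [math.sin(2 * math.pi * 100 * t / 10_000) for t in range(20_000)]
for target in (40, 30, 20, 10):
    noisy = add_awgn(clean, target)
    print(f"target {target} dB, measured {measured_snr_db(clean, noisy):.2f} dB")
```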

The spectral dataset with an SNR of 40 dB was fed into the ConvNeXt model for training, yielding a boxplot of recognition accuracies for the six measurement points’ test sets after the 50th training epoch.

Analysis of Figure 20 reveals that Measurement Points 2, 4, and 5 exhibit higher box positions, higher medians, and lower dispersion—indicative of consistently excellent and stable test accuracy. In contrast, Points 3 and 6 show lower box positions and greater dispersion, leading to lower overall accuracy with pronounced fluctuations. Notably, after introducing Gaussian noise, the recognition accuracy of each measurement point dataset showed no significant change compared to the noise-free datasets.

Figure 20. Boxplot of recognition accuracy of each measuring point including noise.

The SNR values were sequentially set to 40 dB, 30 dB, 20 dB, and 10 dB. Vibration spectra corresponding to the different SNRs were fed into the ConvNeXt model for training and recognition, after which the average recognition accuracy across the six measurement points was calculated. Table 8 presents the model’s average recognition accuracy and standard deviation distribution under varying SNR conditions.

Analysis of Table 8 demonstrates that, when the SNR exceeds 30 dB, the classification performance of the condition recognition model remains unaffected. Conversely, as the SNR drops below 30 dB, although a slight decrease in accuracy is observed, the recognition rate still maintains a relatively high level. This indicates that the ConvNeXt model exhibits notable robustness against noise interference.

Table 8. Average recognition accuracy and sample difference under different signal-to-noise ratios.

5.5. Comparative Analysis of the Recognition Effect of Different Training Models

To validate the superiority of the ConvNeXt model, this study compared the recognition performance of four models: ConvNeXt, ResNet50, GoogLeNet, and AlexNet. A boxplot of recognition accuracies after the 50th training epoch was plotted, as shown in Figure 21 below:

Figure 21. Comparison of recognition accuracy of different models.

According to the analysis of Figure 21, the ConvNeXt model demonstrates marked superiority in recognition accuracy over the other models, with lower dispersion and a concentrated distribution. By contrast, the ResNet50 model exhibits higher dispersion in accuracy, occasionally showing notably lower values, indicating inferior stability to ConvNeXt. The GoogLeNet model features better accuracy concentration than ResNet50, yet its overall recognition performance remains inferior to that of ConvNeXt. Although AlexNet shows moderate dispersion, both its central tendency and concentration range lag behind those of the other models. The average recognition accuracy and standard deviation of the four models after the 50th test epoch are tabulated in Table 9:

Table 9. Different models’ recognition accuracy and volatility.

From the comparative analysis of Table 9, it is clear that the ConvNeXt model has a distinct advantage in the accuracy of transformer winding looseness recognition, which is 1.2% higher than that of the traditional ResNet50 model. After the 50th test epoch, the standard deviation of recognition accuracy for ConvNeXt is only 0.002. While AlexNet exhibits the smallest standard deviation, its average recognition accuracy remains relatively low. Therefore, the ConvNeXt model has a significant effect in the diagnosis and identification of transformer winding looseness faults.

6. Conclusions

In this study, the time–frequency-domain feature map library of winding looseness faults under different transformer load current conditions was constructed by using the relative position matrix and the Gramian angular field. A fault diagnosis and identification model for transformer winding looseness based on feature maps and ConvNeXt was built, and experimental verification was carried out. The following conclusions were reached:

(1) The time–frequency two-dimensional feature map library under the three load currents of 90% I_{N}, 100% I_{N}, and 110% I_{N} was constructed, and the fault recognition accuracy and convergence of the test set matched the performance under the single load current condition. The recognition rate of the test set of Measuring Point 1 was as high as 99.54%; the library provided sample data for the diagnosis and identification of transformer looseness faults across various load conditions, and it improved the adaptability of the model to the diagnosis of transformer winding looseness faults under various load conditions.

(2) The recognition accuracy of the ConvNeXt looseness fault model corresponding to the six measuring points on the measuring surface of the transformer box was generally high, and the average recognition accuracy of all measuring points was 97.9%.

(3) The ConvNeXt model improved the accuracy of transformer winding looseness fault identification by 1.2% compared with the traditional ResNet50, solving the overfitting problem of traditional model training to a certain extent, and showed significant advantages in fault classification effect.

In summary, the transformer winding looseness fault identification model proposed in this paper, integrating feature spectrograms and ConvNeXt, demonstrates remarkable advantages in both recognition efficiency and robustness. This model not only provides technical support for mechanical fault diagnosis of transformer windings in operational substations but also lays a theoretical foundation for the research on fault diagnosis devices.

This study still has certain limitations. First, only a single type of power transformer was used in the experiment, which cannot fully reflect the general law of transformer winding looseness. Second, only a single sensor arrangement scheme was used in the experiment, which cannot completely capture the characteristic information of transformer winding vibration. Third, image recognition for transformer winding looseness faults generally only identifies the occurrence of such faults, struggling to precisely locate the internal fault positions—such as the specific turn layer of a phase coil. Therefore, this paper proposes the following solutions:

1. Measure the vibration data of different types of transformer windings under faults to enrich the fault spectrum database of transformer winding looseness.

2. Increase the number of sensors, and uniformly arrange sensors on the top and sides of the transformer tank to fully collect the vibration signals of the transformer.

3. Position fiber-optic sensors between the transformer winding pancakes to monitor stress variations, generate spectrograms from tank surface vibration signals to detect faults, and use fiber-optic sensor data to localize faults. Establish a correlation model between the two sensor datasets to enable the precise localization of winding looseness faults in subsequent experiments using only vibration sensor data.

Author Contributions

Conceptualization, J.F., X.D., Y.X. and Z.Z.; Data Curation, J.F. and C.W.; Formal Analysis, J.F., X.D., Y.X., C.W., Y.L., X.L. and K.C.; Investigation, J.F., Y.X., C.W., Y.L. and X.L.; Methodology, J.F., X.D., Y.X., C.W., Y.L., X.L. and F.W.; Project Administration, X.D.; Resources, X.D., Y.X., C.W., Y.L., X.L. and K.C.; Software, J.F., X.D., Y.X., Y.L., K.C. and F.W.; Supervision, Z.Z.; Validation, J.F., X.D. and C.W.; Visualization, J.F., X.L. and K.C.; Writing—Original Draft, J.F., X.D., Y.X. and F.W.; Writing—Review and Editing, J.F. All authors have read and agreed to the published version of the manuscript.

Funding

State Grid Chongqing Electric Power Company Technology Project (2024 Yudian Technology 52#).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

Authors Xudong Deng, Yuancan Xia, Chen Wu, Yuehua Li, Xin Li, and Kaixin Chen were employed by the company Ultra-High Voltage Branch of State Grid Chongqing Electric Power Company. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

1. Cao, C.; Wang, J.; Li, X.; Xu, B. Diagnosis Method of Transformer Winding Mechanical State Based on Current-Frequency-Vibration Parameter. Electr. Mach. Control 2024, 28, 62–72.
2. Liu, Y.; Li, X.; Li, H.; Yin, J.; Wang, J.; Fan, X. Spatially Continuous Transformer Online Temperature Monitoring Based on Distributed Optical Fibre Sensing Technology. High Volt. 2022, 7, 336–345.
3. Chen, C.; Xu, J.; Xin, L.; Li, X. State Diagnosis Method of Transformer Winding Deformation Based on Fusing Vibration and Reactance Parameters. IET Electr. Power Appl. 2020, 14, 818–826.
4. Liu, Y.; Ji, S.; Yang, F.; Cui, Y.; Zhu, L.; Rao, Z.; Ke, C.; Yang, X. A Study of the Sweep Frequency Impedance Method and Its Application in the Detection of Internal Winding Short Circuit Faults in Power Transformers. IEEE Trans. Dielectr. Electr. Insul. 2015, 22, 2046–2056.
5. Abbasi, A.R.; Parkash, C. Innovative Diagnosis of Transformer Winding Defects Using Fuzzy and Neutrosophic Cross Entropy Measures. Adv. Eng. Inform. 2025, 65, 103196.
6. Transformer Reliability Survey; CIGRE Working Group A2.37; CIGRE: Paris, France, 2015.
7. Ribeiro, C.d.J.; Marques, A.P.; Bezerra Azevedo, C.H.; Poli Souza, D.C.; Alvarenga, B.P.; Nogueira, R.G. Faults and Defects in Power Transformers: A Case Study. In Proceedings of the 2009 IEEE Electrical Insulation Conference, Montreal, QC, Canada, 31 May–3 June 2009; IEEE: New York, NY, USA, 2009; pp. 142–145.
8. Cai, W.; Nie, L.; Ying, G.; Ma, S.; Li, W.; Wang, Z. Application of Vibration Analysis in Transformer Fault Diagnosis and Hidden Peril Management. Zhejiang Electr. Power 2022, 41, 53–59.
9. Sun, Y.; Ma, H. Research Progress on Oil-Immersed Transformer Mechanical Condition Identification Based on Vibration Signals. Renew. Sust. Energ. Rev. 2024, 196, 114327.
10. Zhang, B.; Zhao, D.; Wang, F.; Shi, K.; Zhao, Z. Research on Mechanical Fault Diagnosis Method of Power Transformer Winding. J. Eng. 2019, 2019, 2096–2101.
11. Granados-Lieberman, D.; Huerta-Rosales, J.R.; Gonzalez-Cordoba, J.L.; Amezquita-Sanchez, J.P.; Valtierra-Rodriguez, M.; Camarena-Martinez, D.; Darmon, M. Time-Frequency Analysis and Neural Networks for Detecting Short-Circuited Turns in Transformers in Both Transient and Steady-State Regimes Using Vibration Signals. Appl. Sci. 2023, 13, 12218.
12. Durak, L.; Arikan, O. Short-Time Fourier Transform: Two Fundamental Properties and an Optimal Implementation. IEEE Trans. Signal Process. 2003, 51, 1231–1242.
13. Huerta-Rosales, J.R.; Granados-Lieberman, D.; Garcia-Perez, A.; Camarena-Martinez, D.; Amezquita-Sanchez, J.P.; Valtierra-Rodriguez, M. Short-Circuited Turn Fault Diagnosis in Transformers by Using Vibration Signals, Statistical Time Features, and Support Vector Machines on FPGA. Sensors 2021, 21, 3598.
14. Hearst, M.A. Support Vector Machines. IEEE Intell. Syst. Appl. 1998, 13, 18–21.
15. Zhou, Y.; He, Y.; Xing, Z.; Wang, L.; Shao, K.; Lei, L.; Li, Z. Vibration Signal-Based Fusion Residual Attention Model for Power Transformer Fault Diagnosis. IEEE Sens. J. 2024, 24, 17231–17242.
16. Jing, Y.; Liu, Z.; Liu, Y.; Yu, Z.; Li, Y. Research on Digital Technology of Mechanical Structure Fault Diagnosis of Power Transformer Based on Comprehensive Feature Extraction and SABO-PNN. IEEE Trans. Appl. Supercond. 2024, 34, 5501204.
17. Sun, Y.; Ma, H. Interpretable Analysis of Transformer Winding Vibration Characteristics: SHAP and Multi-Classification Feature Optimization. Int. J. Electr. Power Energy Syst. 2025, 166, 110585.
18. Ansari, J.; Homayounzade, M.; Abbasi, A.R. Load Frequency Control in Power Systems by a Robust Backstepping Sliding Mode Controller Design. Energy Rep. 2023, 10, 1287–1298.
19. Suwarno; Sutikno, H.; Prasojo, R.A.; Abu-Siada, A. Machine Learning Based Multi-Method Interpretation to Enhance Dissolved Gas Analysis for Power Transformer Fault Diagnosis. Heliyon 2024, 10, e25975.
20. Zhang, L.; Xu, Z.; Qiao, T.; Lu, C.; Su, H.; Luo, Y. Transformer Fault Diagnosis Based on Adversarial Generative Networks and Deep Stacked Autoencoder. In Proceedings of the 2024 7th International Conference on Energy, Electrical and Power Engineering (CEEPE), Yangzhou, China, 26 April 2024; IEEE: New York, NY, USA, 2024; pp. 496–504.
21. Simonyan, K.; Vedaldi, A.; Zisserman, A. Deep inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps. In Proceedings of the 2nd International Conference on Learning Representations; International Conference on Learning Representations, Banff, AB, Canada, 2–4 May 2013; ICLR: Singapore, 2013; Volume 149800.
22. Parkash, C.; Abbasi, A.R. Transformer’s Frequency Response Analysis Results Interpretation Using a Novel Cross Entropy Based Methodology. Sci. Rep. 2023, 13, 6604.
23. Li, S.; Jiang, Q.; Xu, Y.; Feng, K.; Zhao, Z.; Sun, B.; Huang, G.Q. Digital Twin-Assisted Interpretable Transfer Learning: A Novel Wavelet-Based Framework for Intelligent Fault Diagnostics from Simulated Domain to Real Industrial Domain. Adv. Eng. Inform. 2024, 62, 102681.
24. Xiao, Y.; Ma, H. Transformer Winding Looseness Fault Diagnosis Model Based on GAF and Depth Residual Network. Electr. Mach. Control Appl. 2024, 51, 29–38.
25. Xue, J.; Ma, H.; Yang, H.; Ni, Y.; Wan, K.; Ze, H. A Fault Diagnosis Method for Transformer Winding Looseness Based on Gramian Angular Field and Transfer Learning-AlexNet. Power Syst. Prot. Control 2023, 51, 154–163.
26. Ji, S.; Jia, Y.; Huang, X.; Yang, X.; Zhang, F. Vibration Characteristics and Loosening Fault Detection Method of Transformer Bushing Turret Area. Eng. Mech. 2024, 12.
27. Ji, S.; Zhang, F.; Qian, G.; Zhu, Y.; Dong, H.; Zou, D. Characteristics and Influence Factors of Winding Axial Vibration of Power Transformer in Steady-State Operation Condition. High Volt. Eng. 2016, 42, 3178–3187.
28. Chen, W.; Shi, K. A Deep Learning Framework for Time Series Classification Using Relative Position Matrix and Convolutional Neural Network. Neurocomputing 2019, 359, 384–394.
29. Gu, Y.; Zeng, L.; Qiu, G. Bearing Fault Diagnosis with Varying Conditions Using Angular Domain Resampling Technology, SDP and DCNN. Measurement 2020, 156, 107616.
30. Liu, Z.; Zhang, H.; Lv, Z.; Jia, H.; Liang, X.; Wang, Q. Identification of Composite Power Quality Disturbances Based on Relative Position Matrix. Front. Energy Res. 2024, 11, 1326522.
31. Wang, Z.; Oates, T. Imaging Time-Series to Improve Classification and Imputation. In Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence (IJCAI), Buenos Aires, Argentina, 25–31 July 2015; pp. 3939–3945.
32. Zhang, Z.; Xiao, R.; Wu, Y.; Jiang, P.; Deng, J.; Pan, Z. Research on Multi-Level Feature Extraction Model of Converter Transformer Vibration Signal. Proc. CSEE 2021, 41, 7093–7104.
33. Zhang, X. Analysis of Transformer Fault Characteristics Based on Vibration Signal. Electr. Switchg. 2024, 62, 52–55.
34. He, K.; Zhang, X.; Ren, S.; Sun, J. Deep Residual Learning for Image Recognition. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; IEEE: New York, NY, USA, 2016; pp. 770–778.
35. Liu, Z.; Mao, H.; Wu, C.-Y.; Feichtenhofer, C.; Darrell, T.; Xie, S. A ConvNet for the 2020s. In Proceedings of the 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, LA, USA, 18–24 June 2022; IEEE: New York, NY, USA, 2022; pp. 11966–11976.
36. Lee, G.T.; Kwon, O.-R. A Predictive Model Based on Transformer with Statistical Feature Embedding in Manufacturing Sensor Dataset. arXiv 2024, arXiv:2407.06682.
37. Zhang, C.; Qin, F.; Zhao, W.; Li, J.; Liu, T. Research on Rolling Bearing Fault Diagnosis Based on Digital Twin Data and Improved ConvNext. Sensors 2023, 23, 5334.
38. Hendrycks, D.; Gimpel, K. Gaussian Error Linear Units (GELUs). arXiv 2023, arXiv:1606.08415.
39. Loshchilov, I.; Hutter, F. Decoupled Weight Decay Regularization. arXiv 2019, arXiv:1711.05101.
40. Chatzimparmpas, A.; Martins, R.M.; Kerren, A. T-viSNE: Interactive Assessment and Interpretation of t-SNE Projections. IEEE Trans. Visual. Comput. Graph. 2020, 26, 2696–2714.

Figure 1. Vibration signal acquisition system.

Figure 2. Transformer winding loose adjustment diagram.

Figure 3. The 3D spectrum waterfall diagram of the three-phase winding under different loose faults.

Figure 4. Relationship curve between fundamental frequency amplitude and load current.

Figure 5. Two-dimensional feature map generation flowchart.

Figure 6. Generated two-dimensional color feature map.

Figure 7. Complex modulation refinement spectrum flowchart.

Figure 8. Comparison flowchart before and after frequency-domain feature map refinement.

Figure 9. Quantitative analysis of spectral features under different loosening faults of transformer windings: (a) Distribution of time–frequency-domain information entropy values. (b) Frequency-domain spectral energy analysis.

Figure 10. ConvNeXt-Tiny model block diagram.

Figure 11. ConvNeXt module flowchart.

Figure 12. Transformer winding loose fault diagnosis and classification identification process.

Figure 13. Partial image datasets of transformer windings in different loose states.

Figure 14. The training process of transformer winding looseness identification under different load currents: (a) Test set recognition accuracy. (b) Test set loss value change.

Figure 15. t-SNE visualization of the classification and recognition results.

Figure 16. The training process of transformer winding looseness identification under different measuring points: (a) Test set recognition accuracy. (b) Test set loss value change.

Figure 17. Different measuring points’ classification recognition confusion matrix.

Figure 18. Test set recognition accuracy boxplot.

Figure 19. Comparison of time–frequency spectra of vibration signals containing noise.

Figure 20. Boxplot of recognition accuracy of each measuring point including noise.

Figure 21. Comparison of recognition accuracy of different models.

Table 1. Electrical parameters of 10 kV power transformer.

Rated capacity (kVA)	100
Rated voltage (kV)	10 ± 2 × 2.5%/0.4
Rated current (A)	5.77/144.3
Connection group label	Dyn11
Phase number	3
Impedance voltage	4%
Cooling method	ONAN

Table 2. Parameter comparison of ConvNeXt model variants.

Model Variants	Kernel Size	Layers	Parameter Count	Inference Time	FPS
Tiny	1 × 7 × 7	18	~29 M	~12 ms	~83 FPS
Small	1 × 7 × 7	24	~50 M	~20 ms	~50 FPS
Base	1 × 7 × 7	24	~89 M	~35 ms	~28 FPS
Large	1 × 7 × 7	24	~190 M	~70 ms	~14 FPS
XLarge	1 × 7 × 7	24	~290 M	~110 ms	~9 FPS
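
The FPS column in Table 2 appears to follow directly from the inference-time column: one image per forward pass, with the result rounded down to a whole frame. The short check below is a sketch under that assumption (a batch size of 1 is not stated in the table); the dictionary and function names are illustrative only.

```python
# Inference times in milliseconds, copied from Table 2.
# Assumption: each inference processes a single image (batch size 1).
LATENCY_MS = {"Tiny": 12, "Small": 20, "Base": 35, "Large": 70, "XLarge": 110}


def frames_per_second(latency_ms: int) -> int:
    """Whole frames processed per second at a fixed per-image latency."""
    return 1000 // latency_ms


fps = {name: frames_per_second(ms) for name, ms in LATENCY_MS.items()}
print(fps)
```

Under this reading, the trade-off in the table is explicit: each step up in model size costs throughput in inverse proportion to the added latency.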

Table 3. Network module parameter settings.

Block	Kernel_Size	Stride
Stem layer	4 × 4	4
Stage1	Depthwise: 7 × 7	1
	Expand: 1 × 1	/
	Shrink: 1 × 1	/
Downsample layer	2 × 2	2
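
To make the kernel and stride settings in Table 3 concrete, the following sketch traces the spatial size of a feature map through the stem, the first stage, and the downsample layer. The 224-pixel input size, the padding of 3 on the 7 × 7 depthwise convolution, and a stride of 1 for the 1 × 1 layers are assumptions for illustration; the table itself does not state them.

```python
def conv_out_size(size: int, kernel: int, stride: int, padding: int = 0) -> int:
    """Output size of a convolution: floor((size + 2*padding - kernel) / stride) + 1."""
    return (size + 2 * padding - kernel) // stride + 1


def stage1_sizes(input_size: int = 224) -> dict:
    # Assumption: 224 x 224 input images (not given in Table 3).
    stem = conv_out_size(input_size, kernel=4, stride=4)
    # Assumption: padding of 3 so the 7 x 7 depthwise convolution keeps the size.
    depthwise = conv_out_size(stem, kernel=7, stride=1, padding=3)
    # The 1 x 1 expand and shrink layers act per pixel; stride 1 is assumed.
    pointwise = conv_out_size(depthwise, kernel=1, stride=1)
    downsample = conv_out_size(pointwise, kernel=2, stride=2)
    return {
        "stem": stem,
        "depthwise": depthwise,
        "pointwise": pointwise,
        "downsample": downsample,
    }


print(stage1_sizes())
```

With these assumptions the stem reduces the resolution by a factor of four, the stage itself leaves it unchanged, and the downsample layer halves it again.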

Table 4. Recognition accuracy under different load conditions.

Load Condition	Recognition Accuracy at the 100th Training Round (%)
90% $I_{N}$	99.25
100% $I_{N}$	100
110% $I_{N}$	100
90% $I_{N}$ + 100% $I_{N}$ + 110% $I_{N}$	99.54

Table 5. The distribution of model recognition accuracy under different sample numbers.

Proportion of Image Samples (%)	Recognition Accuracy at the 100th Training Round (%)
100	99.54
80	96.77
60	89.31
50	85.33

Table 6. Classification evaluation index of different measuring points.

Measuring Point	Looseness Grade	Precision (%)	Recall (%)	F1-Score (%)
1	100% $F_{N}$	100	100	100
	75% $F_{N}$	100	99.5	99.7
	50% $F_{N}$	100	100	100
	25% $F_{N}$	99.5	98.7	99.1
2	100% $F_{N}$	100	100	100
	75% $F_{N}$	100	100	100
	50% $F_{N}$	100	100	100
	25% $F_{N}$	100	100	100
3	100% $F_{N}$	98.1	90.2	93.9
	75% $F_{N}$	98.9	95.8	97.3
	50% $F_{N}$	93.3	100	96.5
	25% $F_{N}$	91.3	95.2	93.2
4	100% $F_{N}$	99.8	99	99.4
	75% $F_{N}$	99.1	99.3	99.2
	50% $F_{N}$	100	100	100
	25% $F_{N}$	98.6	99.3	98.9
5	100% $F_{N}$	99.3	99.8	99.5
	75% $F_{N}$	99.6	100	99.8
	50% $F_{N}$	99.8	99.3	99.5
	25% $F_{N}$	100	99.7	99.8
6	100% $F_{N}$	99.8	100	99.9
	75% $F_{N}$	81.8	90.3	85.8
	50% $F_{N}$	100	100	100
	25% $F_{N}$	89.1	79.8	84.2
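
The F1-score column in Table 6 is the harmonic mean of precision and recall. As a minimal sketch, the function below recomputes it for two rows of measuring point 6; the function name is illustrative. Because the tabulated precision and recall are already rounded, a recomputed value can occasionally differ from the table in the last digit.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (both in the same units, here %)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (precision, recall) pairs in percent, copied from Table 6, measuring point 6.
rows = {
    "75% F_N": (81.8, 90.3),
    "25% F_N": (89.1, 79.8),
}

for grade, (precision, recall) in rows.items():
    print(grade, round(f1_score(precision, recall), 1))
# prints: 75% F_N 85.8
# prints: 25% F_N 84.2
```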

Table 7. The training recognition accuracy and standard deviation distribution of each measuring point.

Measuring Point	Average Recognition Accuracy (%)	Standard Deviation
1	99.427	0.00207
2	99.999	0.00021
3	95.439	0.00558
4	98.291	0.00177
5	99.523	0.00059
6	92.529	0.00731

Table 9. Different models’ recognition accuracy and volatility.

Recognition Model	Identification Accuracy (%)	Standard Deviation
ConvNeXt	97.9	0.0020
ResNet50	96.7	0.0110
GoogLeNet	95.3	0.0042
AlexNet	96.1	0.0005

Table 8. Average recognition accuracy and sample difference under different signal-to-noise ratios.

SNR	Average Recognition Accuracy (%)
Raw vibration signal	97.9
40 dB	97.9
30 dB	97.9
20 dB	96.3
10 dB	95.7
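
The signal-to-noise ratios in Table 8 can be reproduced on a test signal by scaling the noise power to the signal power. The sketch below assumes additive white Gaussian noise, which the table does not specify, and uses a synthetic 100 Hz sine sampled at 10 kHz as a stand-in for a measured vibration signal; all names and parameters are illustrative.

```python
import math
import random


def add_noise_at_snr(signal, snr_db, seed=0):
    """Add white Gaussian noise so that signal power / noise power matches snr_db."""
    rng = random.Random(seed)
    signal_power = sum(x * x for x in signal) / len(signal)
    noise_power = signal_power / (10 ** (snr_db / 10))
    sigma = math.sqrt(noise_power)
    return [x + rng.gauss(0.0, sigma) for x in signal]


def measured_snr_db(clean, noisy):
    """SNR in dB estimated from a clean signal and its noisy version."""
    signal_power = sum(x * x for x in clean) / len(clean)
    noise_power = sum((n - c) ** 2 for c, n in zip(clean, noisy)) / len(clean)
    return 10 * math.log10(signal_power / noise_power)


# Hypothetical test signal: 1 s of a 100 Hz sine sampled at 10 kHz.
SAMPLE_RATE = 10_000
clean = [math.sin(2 * math.pi * 100 * n / SAMPLE_RATE) for n in range(SAMPLE_RATE)]

for target in (40, 30, 20, 10):
    noisy = add_noise_at_snr(clean, target)
    print(target, round(measured_snr_db(clean, noisy), 2))
```

The measured value fluctuates slightly around the target because the noise power is estimated from a finite number of samples.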

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
