2.1. Approach to Inspection
An example of non-destructive technologies for structural integrity monitoring of the aerial components considered in this paper is the eddy current inspection performance on the PZL-130 Orlik aircraft structure. Within the conducted Full-Scale Fatigue Test (FSFT) for that aircraft, a program of detailed NDT inspections was scheduled. A part of that program was the Eddy Current inspection delivery, especially for critical parts of the wing structure. Inspection delivered during the FSFT revealed the existence and development of more than 100 failure modes, which were detected and described.
According to the best practice standards, the exemplary failure modes that can be found during the aircraft inspection are as follows [
20]: hole peripheral cracks, through-thickness cracks in radius, hole through-thickness cracks, and hole radial countersink cracks.
Several eddy current techniques were implemented for damage detection, such as high-frequency reflection probes, high- and low-frequency automated scanning, and the magneto-optic imaging MOI™ system for the general and detailed structure visualization.
Figure 1 presents the example of the wing structure automated inspection result using the MAUS
® mobile system and C-scan data presentation. Such presentation mode shows the signal value (impedance module) as a function of the sensor location.
The inspection was delivered with the 20 kHz for surface-breaking cracks and 1 kHz for the subsurface, consecutively. Scan resolution (pixel-to-pixel size) was equal to 1 mm. Based on collected data, the NDT operator provides data analysis using visual recognition of the anomaly, which may exist on the provided image. That analysis is being supported by signal classification and description tools embedded in the system. Moreover, the data export function enables more advanced analysis based on signal processing methods [
21].
Such an inspection method has also benefited from data comparison possibility, a valuable function for the FSFT to monitor damage nucleation and damage progression. A more detailed analysis is possible for specific regions of interest (ROI) with the complex impedance plane analysis, signal evaluation, and size quantification tool.
Figure 2 presents the result of the failure presented as the hole radial countersink crack. The combined presentation of the real and imaginary part of the impedance allows evaluating damage severity and creates an opportunity to evaluate crack orientation and depth of location. However, with the lower frequency inspection, damage presence is not as clear as with higher ones and may cause some difficulties in recognizing deeper located flaws.
Once the FSFT inspection was completed, the need for an increase in the damage detection sensitivity, especially in the second or third layer, has arisen.
The approach to the detection and classification of damage considered in this article is based on using a probe adapted to detect damage in multilayer aluminum aircraft structures, an advanced multifrequency inspection method, and the mutual image similarity assessment.
From the observations and practical inspection, the use of a dedicated probe with the proper signal-to-noise ratio and efficient penetration depth is a required tool for detailed aircraft inspection. In the case of structures with multiple layers, it is also crucial to utilize many frequencies.
Analysis of the various signals achieved from all the frequencies is a complicated and time-consuming procedure. For this reason, the automation of the defect detection and identification process is essential for the proper and effective functioning of the inspection system.
In the classical ECT systems, the use of several frequencies requires the application of several probes with valid central frequencies to obtain the estimated depth of penetration. Additional limitations also come from a decrease in the current density with an increase in depth of penetration. Such phenomena reduce inspection reliability and affect the probability of damage detection. Thus, the inspection of multilayer structures and geometry constraints requires specialized probes and signal processing to keep linearity of damage detection and thickness change. In the proposed multifrequency ECT system, many of these limitations were overcome.
2.2. Data Acquisition
In the eddy current method, the measurement sensitivity depends on several factors, such as the transducer dimensions, an excitation frequency, and properties of a tested structure (permeability and conductivity). Unfortunately, high sensitivity and a high resolution usually cannot be achieved at the same time. Enlargement of the probe dimensions will result in higher sensitivity (for deeply loaded defects) and a lower spatial resolution. Decreasing the excitation frequency will cause a similar effect. In testing multilayer aircraft structures, high and possibly similar sensitivity for detecting defects in different layers and at different depths is required. At the same time, it is necessary to maintain the appropriate spatial resolution. It is crucial to detect even the most minor cracks in the vicinity of holes after riveted connections. In this case, it is particularly advantageous to use a measuring system based on the Massive Multifrequency Excitation and Spectrogram Method (MMFES), proposed in [
22]. This method uses a complex signal containing many sinusoidal components as an excitation signal and a spectrogram to detect and identify defects. A large variety of frequency components creates an opportunity for all defects to be detected using the most appropriate testing frequency. It is for the first time to apply the MMFES method for the inspection of riveted structures.
2.2.1. Measuring System Description
The measurements were carried out using the computer system implementing the Massive Multi-Frequency Excitation and Spectrogram (MMFES) method. In the MMFES system, a complex waveform is used as the excitation, created by adding sinusoidal components with precisely defined frequencies and amplitudes. The formula which describes the function defining the excitation voltage is as follows:
where
sE–waveform of the excitation signal,
n–number of harmonics,
Ui–amplitude of the
i-th component,
fi–amplitude of the
i-th component,
ϕi–phase of the
i-th component.
The frequency range is selected in such a way as to include the values for which the maximum interaction of eddy currents with defects occurs at possible depths of the tested structure. The number of harmonic components
n is selected in such a way as to achieve the highest possible signal to noise ratio at the eddy current transducer output. The relation of individual harmonics amplitude to the expected noise level and the possibility of filtering signals in the frequency domain are considered. The initial phases of individual harmonics do not significantly affect the ability to detect defects, but it is possible to limit the maximum value achieved by the excitation signal by appropriate selection. One of the critical issues is the selection of the excitation harmonic amplitudes. They are experimentally determined so that the harmonic components of the signal measured at the transducer output have the same value when it is located over the tested material that does not contain heterogeneity. The amplitude equalization process usually requires two to three iterations until the differences are satisfactorily reduced (i.e., below 1%). The experimental process allows eliminating the influence of the transducer frequency characteristic, lead wires, and other factors such as a lift-off and the inclination of the transducer concerning the tested material surface. The plot of excitation voltage measured on the output of the D/A converter as a function of time is shown in
Figure 3. The amplitude spectrum of this voltage is shown in
Figure 4.
The excitation driving voltage is generated in the D/A converter (NI PXI 5422-sample rate 200 MS/s, 80 MHz bandwidth, 16-Bit waveform generator) and then fed through the high-frequency power amplifier (NF HAS 4101, frequency range DC to 10 MHz, slew rate 5000 V/μs, max. current 1.4 A, amplification gain 20) to the excitation coils of the eddy current transducer.
The voltage generated in the measuring coil of the eddy current transducer is amplified and filtered with the Kronhite 3988 and then digitized in the NI PXI 5922 A/D converter (maximum sampling rate 15 MS/s, maximum resolution 24 bits). Due to the high resolution and accuracy of the A/D converter, which is achieved while maintaining a relatively high conversion rate, it is possible to obtain precise information about minor changes of amplitudes of the harmonic components in the processed signals. In these measurements, the sampling rate was set to 315 kS/s, which allows a resolution of 24 bits to be achieved. One of the critical issues in the MMFES system is to minimize spectral leakage. Therefore, the sampling frequency is precisely selected, considering the harmonic frequencies present in the excitation signal. Additionally, a clock that controls the A/D converter is used simultaneously to control the D/A converter generating the excitation. The block scheme of the MMFES system is shown in
Figure 5. The digitized signal from the sensing coil is then processed using the Fast Fourier Transform (FFT) to determine the individual harmonic amplitudes. The values of harmonic amplitudes acquired for a single transducer position constitute the frequency characteristic of the transducer and the material within the effective range of the generated electromagnetic field.
Signals obtained in the Ultrasonic Testing (UT) are commonly displayed in three different formats. The formats are known in the NDT terminology as A-scan, B-scan, and C-scan presentations. Each presentation format provides different information about the region of material being inspected. The A-scan presents the amount of received ultrasonic energy as a function of time for a single position of the transducer. Peaks visible in the A-scan presentation allow to estimate the location and size of the discontinuity in the inspected material. Similar information can be achieved from the frequency characteristic obtained in the MMFES ECT system. Therefore, we will use the term A-scan for such kind of data presentation despite some significant differences. An example of the MMFES ECT A-scan in the 300 Hz–2 kHz range is shown in
Figure 6.
In the UT method, the image composed of A-scans captured during the transducer’s linear scanning is called a B-scan. From the B-scan, it is possible to identify the depth of the discontinuity and its linear dimension in the scanning direction. A C-scan is an image obtained during two-dimensional scanning (x, y) of the transducer and showing values for a selected time of acquisition. Finally, D-scan is a three-dimensional representation of the signals measured during (x, y) scanning.
Moving the eddy current probe in a line over the tested material and registering the frequency characteristics for the following positions enables creating a spectrogram. The spectrogram is a two-dimensional plot of the relative amplitude of the signal frequency components from the pick-up coil versus the transducer x position. The spectrogram amplitude can be calculated as a difference between the current amplitude and the amplitude measured over the uniform material to emphasize material structure changes. The spectrogram corresponds to a B-scan.
Successive scanning of the tested element, line by line, allows creating a two-dimensional plot of the signal value against the orthogonal scanning directions (
x and
y). The plot of the signal value for a single frequency as a function of the transducer
x and
y positions (
Figure 7) is the equivalent of a C-scan.
Finally, the signal value three-dimensional plot against frequency f and the transducer position coordinates (x, y) corresponds to a D-scan.
The C-scans taken for the selected frequencies will be the input images to the automatic defect recognition system using image quality assessment based on similarity.
2.2.2. The Eddy Current Transducer
Inspection of the aluminum multilayer structures required a transducer characterized by high sensitivity, deep penetration, and the ability to detect defects in all directions with similar sensitivity. The eddy current differential transducer (
Figure 8a) consists of a ferrite core with five symmetrically placed columns.
A receiving coil containing 100 turns is wound on the central column. Four excitation coils (
EA,
EB,
EC,
ED) with 25 turns each are wound on the remaining columns. The excitation coils (
EA and
EB) are connected in series, as are the
EC and
ED coils (
Figure 8b). Both pairs are generating in the pick-up coil opposite directed magnetic fluxes. The resulting flux in the pick-up coil is close to zero in the equilibrium state. The output signal depends on a difference of fluxes
ϕx and
ϕy. The differential configuration of the transducer enables easy detection of the lack of symmetry in the tested specimen.
Figure 8c shows the distribution of eddy currents induced in the specimen obtained by numerical analysis of the transducer.
Although dedicated for tube inspection, the transducer with a quite similar configuration was presented for the first time [
23].
The transducer size was selected, considering the distribution of eddy currents induced in the tested specimen and the necessity to detect defects located in the deepest layers of the structure. The core dimensions are provided in
Figure 9.
2.2.3. Test Samples and Measurements
Preliminary experiments were carried out on a specimen (
Figure 10) taken from an aircraft operated under various load spectra. High loads initiated crack propagation at the edges of the rivet holes.
The cracks are very well visible in the X-ray image (
Figure 11) of the sample. Most of the observed cracks occurred in the first layer of the examined structure. Only some of the cracks appeared in the second layer (deeper under the surface). All the cracks were propagated from the edges of the rivet holes. The vast majority of the cracks were directed in one of two perpendicular directions due to the load’s directivity.
Most of these cracks were of considerable length (longer than 5 mm). There were also long cracks in the second layer, but it was observed that some of the cracks were less than 2 mm long.
This research aimed to develop a universal identification algorithm that works appropriately with defects of various sizes and orientations. However, as mentioned earlier, aircraft specimens contained a relatively limited set of naturally formed defects. For this reason, it was decided to produce aluminum specimens with a similar structure but with defects of a much more different nature, both in configuration and length.
Each sample consists of three 1 mm thick aluminum alloy plates joined together (
Figure 12). The test plates were sufficiently large (160 mm × 160 mm) that measurements could be made without significant influence of the edges. In all the plates, the holes were made in the center using a CNC machine. The diameter of the holes was 4 mm. In several plates, cuts simulating cracks of various lengths (2 mm, 5 mm, and combined two cuts 2 and 5 mm) were manufactured, as shown in
Figure 12.
In the research, we used 40 different samples, which were created from the assembly of the presented four plates with artificial defects in various configurations (defects in one plate and several plates simultaneously, on different layers, rotated by different angles, etc.). The use of various variants of the arrangement of basic defects creates a chance to test the system in a fairly wide range. However, it is planned to further expand the database of samples with artificial defects to verify responses in less frequent situations. At the same time, work is underway to collect a set of samples taken from real objects, but unfortunately, the number of experimental samples is limited by their availability. The defects found there are not very diverse and most often have larger dimensions than those that we would like to detect during routine inspections. For this reason, it is crucial to operating in two ways, collecting actual samples and using samples with artificially generated defects.
The measurements were performed by scanning the probe over the area surrounding the hole in steps of 0.5 mm in x and y-direction. The measurements were made using the multifrequency excitation signal consisting of sinusoidal components with frequencies from 315 Hz up to 1995 Hz (17 frequency components). The lift-off (the gap between the specimen and the sensor) was measured to be 0.4 mm.
The resulting C-scans for the lowest excitation frequency are presented in
Figure 13,
Figure 14 and
Figure 15. In
Figure 13, the results of measurements made for a hole with defects (5 mm long crack) on different layers are presented. It can be seen that the C-scan measured for the rivet hole without defects (
Figure 13a) is very symmetrical, and the presence of a defect on any layer causes a significant disturbance of the symmetry of the response (
Figure 13 b–d).
A similar phenomenon can be observed that it occurs even more intensively in the case of cuts manufactured at an angle of 90 degrees along the
y axis (
Figure 14).
However, as shown in
Figure 15, in the case of minor cuts or when they are deeper below the surface (
Figure 13d), the symmetry disturbances are not very large, and for effective detection of defects, it is necessary to use appropriate identification algorithms.
It is worth mentioning that the signals (C-scans) achieved for the other frequencies were not significantly different. Therefore, only one frequency component was selected for the presentation.
2.3. Similarity-Based Image Quality Assessment
Examples of the eddy current C-scans for a specified ROI using a single frequency are shown in the previous subsection as color plots. Nevertheless, the type of measurement data obtained as the C-scans is single-channel, similarly as in the case of greyscale images, although the data range is different. Hence, their color representation is unsuitable from the image processing point of view, where various three-channel color spaces are applied to analyze color images. Additionally, most state-of-the-art image quality assessment (IQA) methods, described further, were developed for monochrome images. Therefore, the obtained results were converted into grayscale 2D images, using 8-bit values in the range 0–255, for which the considered IQA methods may be directly applied.
Since the expected features of such obtained images are their symmetry and self-similarity, as it may be observed in
Figure 13a,
Figure 14a and
Figure 15a, the similarity-based IQA metrics are of primary interest in further investigations.
General-purpose image quality assessment methods, including those used in our experiments, may be considered as useful approaches for many applications where the image or video quality plays an important role, e.g., making it possible to skip the degraded frames during the analysis of video sequences or to optimize some newly developed image filtering or lossy compression algorithms.
On the other hand, the universal full-reference IQA methods, based on the comparison of an undistorted reference (“pristine”) image with the degraded one, may also be used for the detection of some types of contamination or image distortions as well as the estimation of their amount or level.
Although in many applications, where the reference image might be unavailable, the most desired approach would be the no-reference (NR) IQA, also referred to as “blind”, such methods are usually quite complex, and their performance is lower than state-of-the-art FR IQA metrics. Usually, such NR metrics are sensitive only to a limited number of distortion types, hence their universality is relatively low.
Nevertheless, the development of new IQA methods during recent years [
17,
18,
19,
24] has been concentrated on the possibly best correlation of the newly proposed metrics with subjective quality evaluations for the publicly available image datasets containing numerous degraded images, together with Mean Opinion Score (MOS) values gathered from human observers.
The most popular FR metric, known as Structural Similarity (SSIM), was proposed in [
25], being the combination of three factors representing the luminance distortions, loss of contrast, and structural distortions. Since the formula of the SSIM utilizes the averaging of the local similarity indexes calculated for each position of the sliding window (by default, 11 × 11 pixels Gaussian window is used), it is possible to build the quality map as well. Nevertheless, in most practical applications, a single scalar index is preferred, possibly well correlated with human perception of various distortions.
The simplified idea of the calculation of the local SSIM value between the blocks
x and
y, representing two fragments of the compared images for the specified location of the sliding window, may be expressed as the product of three components:
where
and
are the mean values whereas
and
denote the variance values in blocks
x and
y, respectively, and
stands for the covariance. Three stability constants:
,
and
, prevent the possible division by zero for very dark or flat areas of the image, assuming that
L is the dynamic range of pixels values. The final SSIM result is obtained as the average of local values calculated for all window positions.
Following the first success of the SSIM, many similar approaches and extensions of this metric were proposed by various researchers during recent years, starting from its multi-scale version, known as MS-SSIM [
26], making its results independent of the sliding window size. Some other modifications of the idea of Structural Similarity utilize the calculations of gradients, leading to Gradient Similarity [
27] as well as Gradient Magnitude Similarity Deviation (GMSD) [
28]. Another direction of development of SSIM-inspired metrics is related to the use of Riesz transform [
29,
30] and phase congruency, leading to well-performing Feature Similarity (FSIM) [
31], where the similarity of gradient magnitude is combined with the similarity of the phase congruency, calculated similarly as in the SSIM formula.
The same general approach is also used in the Edge Strength Similarity metric [
32], as well as in some other metrics utilizing local variance [
33] or spectral residual [
34]. Another interesting approach, originating from the idea of the SSIM, is based on the perceptual combination of multi-scale gradient magnitude maps that concern the micro- and macrostructural similarity that are further combined with color information similarity and high-distortion-based pooling, forming the final Perceptual Similarity (PSIM) metric [
35].
Nevertheless, although the performance of most of these improved metrics is significantly better in comparison to the “original” SSIM for many IQA benchmarking datasets, their application for some other types of images, e.g., containing multiple distortions [
36] or for the surface quality assessment of 3D prints [
37] not always leads to satisfactory results. Some other examples might be the development of quality metrics for audio-visual signals, stitched images [
38], or light field reconstruction, compression, and display [
39], where some specific types of distortions may take place. Therefore, the application of the general-purpose IQA methods in some other areas is not always straightforward and may require their extensions, significant modifications, and an appropriate choice of potentially useful metrics.
A remarkable extension of the SSIM formula was proposed by Ponomarenko et al. [
40], where the fourth multiplicative component was introduced, which is related to the similarity of predictability of image blocks. This component may be determined by calculating the mean square error (MSE) between the specified block and the neighboring blocks. Hence, the minimum MSE value is chosen as the local result and then multiplied by the three “original” SSIM components, namely similarity of the average luminance of image blocks, a correlation factor, and contrast similarity. Nevertheless, the additional necessity of searching for the most similar block increases the computational complexity. The formula of the method denoted as SSIM4 may be expressed as:
where
is assumed,
and
are the measures of predictability of the respective image blocks expressed as mean MSE values between the given block
A and its neighboring blocks
B according to
assuming that neighboring blocks
B are limited by a specified distance (the default parameters are search area 19 × 19 pixels and the block size
M =
N = 9). Nevertheless, the blocks that contain the whole central part consisting of 5 × 5 pixels block are excluded from the calculations of the predictability measure
[
33] as illustrated in
Figure 16, where three excluded red blocks and three green blocks are shown.
Although all SSIM-based metrics utilize the estimation of image similarity, they cannot be applied directly for the analysis of images obtained as the results of measurements considered in this paper. Hence, their modification related to the calculation of the mutual (internal) similarity between the image fragments is postulated.
The concept of similarity calculation between two images without using reference images was also used in blind image quality assessment, although in this case, the use of the pseudo-reference image (PRI) is necessary [
41,
42] is based on the use of multiple PRIs, assuming various degrees of distortion aggravation.
Nevertheless, the approach proposed in this paper is different since it does not require any comparisons with any other images, and the only required data for the mutual similarity calculation is the image that is the result of eddy current measurements.