1. Introduction
Data acquisition by means of ground penetrating radar (GPR) in railway track maintenance is an established, non-destructive measurement method for monitoring ballast conditions and other parameters [
1,
2,
3,
4]. GPR can provide measurements of the layer thickness and composition (lateral and longitudinal changes) of individual subgrade layers, moisture content and contamination condition of the ballast. It can map the lateral and longitudinal changes of ballast and subgrade layers to provide information on failure and deformation conditions with minimal disruption to train operations. In this respect, GPR can be used as an effective tool to uncover the causes of recurrent poor rail track geometries and unstable roadways [
5]. It is also possible to identify and better investigate problematic railway sections by considering, for example, successive inspection campaigns that show a systematic increase in geometric fault parameters over time [
6].
For this purpose, the radar measurement system is usually mounted directly on a rail vehicle, as shown in
Figure 1a. The unit of transmitter and receiver is called a transducer which emits pulses of radar waves (electromagnetic waves, usually with a center frequency of 200 MHz and above) towards the ground and receives their reflections (
Figure 1b). During the radar survey of a railway track, radar pulse echoes (single reflection signals referred to as “traces” or A-scans) are continuously detected and recorded at regular intervals while the vehicle is moving forward (
Figure 1c). In this way, a pictorial representation of the local radar reflection properties of the substructure can be generated by combining the individually received reflection signals column by column to form a so-called radargram (or B-scan,
Figure 2). This allows trained personnel to examine the subsurface structure of the track and to detect any anomalies (e.g., changes in the depth of the track planum, estimation of the degree of contaminations, and the moisture content as well as accumulated water and settlements, etc.). After manual evaluation of the recorded data, specific maintenance work can be derived. Thus, the evaluation (i.e., interpretation and categorization) of this measurement data requires a visual inspection, usually carried out by persons with geophysical training and experience. However, even for experienced persons familiar with railway radargrams, the accurate and reproducible labeling, e.g., of sub-track layer boundaries, still poses a practical challenge. On the one hand, GPR measurements can be carried out at high speeds of more than 100 km/h, so that large distances can be covered in a very short time. On the other hand, the subsequent manual evaluation of the collected image data is very time-consuming. Automatic support of this process could significantly reduce inspection times and increase objectivity in the evaluation.
Different approaches for automated support in the evaluation of GPR data in railway maintenance are reported in the literature. Frequently, these methods are based on time domain analysis or frequency domain analysis of individual GPR signals (traces) e.g., by means of 1D Fourier transform [
7,
8]. Combined time–frequency analysis methods are reported in [
9,
10,
11] but also applied only to single signal traces (1D). Typical applications refer to the quantification of ballast conditions, geometric ballast properties, or the measurement of moisture content [
12,
13,
14,
15,
16]. In [
17,
18], the application of image processing approaches supported by machine learning methods (image classification) is described to characterize mainly ballast conditions and ballast stone size distributions. Recent attempts deal with deep learning approaches for image-based classification, e.g., object detection methods [
19], but with the disadvantage that a large amount of high-quality labeled ground truth data is needed in advance to achieve good results.
The aim of the method presented here is to provide automated support for the exact (pixel-wise) segmentation of relevant image regions in typical railway radargrams. The data basis for our evaluations was provided by Groundcontrol GmbH, a company that has surveyed more than 100,000 km of railway tracks worldwide using GPR. With many years of experience in the interpretation of such measurement data, experts from this company have accompanied the development of the presented method. The novelty of this work consists in the application of 2D Gabor filtering for image texture analysis and the interpretation of the results of a subsequent dimensionality reduction by means of principal component analysis (PCA). It is shown that the first three principal components can be assigned to visually interpretable image structures, which are subsequently used for a pixel-precise segmentation of relevant image areas. Technically, the presented method can be classified as a two-dimensional combined time–frequency analysis approach for segmenting radargram images. As a result, the exact subdivision of radargram image structures becomes possible in a level of detail that far exceeds the purely visual capabilities of a human observer, e.g., for determining the exact course of ground structure boundaries.
The paper is structured as follows:
Section 2 presents in detail the segmentation methodology based on image texture analysis using 2-dimensional Gabor filters and principal component analysis (PCA). In
Section 3, the results of the proposed method are shown using typical radargrams from GPR railway surveys. In
Section 4, a refinement of the proposed method is presented, which allows to additionally consider phase information in the segmentation process. Finally, a summary and outlook are given in
Section 5.
3. Automatic Radargram Segmentation Results
Typical results of the segmentation procedure are presented in this chapter. The examples shown are based on radargrams that were recorded with a 400 MHz GPR system at measurement speeds of approx. 100 km/h with sampling rates of 1024 samples/trace. If, for example, the task is to identify very low-frequency content ranging from specific medium- to high-amplitude values (which typically represents the so-called track planum, the boundary between the ballast and sub-ballast layer), simply the lower right part of the 3D feature space in
Figure 8a must be selected and the corresponding pixels (image areas) can immediately be displayed in the segmented radargram (exemplarily indicated as blue/purple regions in
Figure 9b). These dominant structures can also be recognized quite well visually in the raw radargram (
Figure 9a).
High-frequency image structures with pronounced local scattering, on the other hand, typically indicates ballast fouling (i.e., when the finer materials mix with fresh ballasts due to heavy repeated train loads), which is shown as yellow/red areas in the segmentation result (
Figure 9b). These structures are much less pronounced and more difficult to detect visually in the original radargram.
Another possible representation is shown in
Figure 10, where five automatically extracted image segments of an examined railroad section are visualized separately. Each segment is characterized by specific signal frequency and amplitude ranges. This allows further investigations to be carried out, such as determining the signal energy or measuring the area (or “thickness”) of these specifically characterized texture segments, in order to automatically monitor the exceeding of critical threshold values (e.g., the amount of locally collected water typically represented by low-frequency and very high-amplitude regions without local scattering contributions).
Figure 11a exemplifies another advantage of this special form of automatic radargram segmentation. Due to the complex image structure in the raw images, it is difficult even for the trained human eye to determine the exact course and delineation of different relevant textures. The proposed automatic segmentation offers advantages here since, e.g., the exact course of the layer boundaries is now based on comprehensible criteria and no longer on a purely visual (subjective) assessment, which could possibly lead to different results for different assessors. Various further types of possible automated evaluations are shown in
Figure 11b–d, where, e.g., upper and lower formation boundaries have been extracted automatically. Subsequently, the exact course of these contours is the basis for further evaluations, for example to determine any undulations in the boundary layers or to determine the exact depth and thickness of substructure layers. In this way, potential problem areas can be identified very quickly in an automated way, which provide useful indications for track maintenance.
4. Phase Measurements at Low-Frequency Layer Interfaces
The segmentation process shown above can also be extended to include signal phase evaluation. Generally, the phase of the reflected GPR signal is influenced by the wave impedance. It is negative when the signal is split at an interface where the impedance decreases (i.e., the relative permittivity increases). Conversely, the phase is positive when the signals are split at an interface where the velocities and wave impedances increase (i.e., the relative permittivity decreases). Thus, if the relative permittivity of a target is lower than that of the surrounding medium (e.g., air-filled void or plastic pipe), there is no phase reversal of the backscattered GPR wave—on the contrary, e.g., especially for water, a phase reversal is produced. In this way, additional information about the boundary layer conditions (e.g., water content) or about possibly buried objects (especially of metallic ones) could be derived.
Based on the segmentation process described in this paper, phase shifts in extracted very low-frequency segments of the radargram can be determined. Signal traces
with a specific phase angle are identified by applying a 1D cross-correlation (CC) with Ricker wavelets
of tunable phase from −180° to 170° in steps of 5° (Equation (8),
Figure 12). The Ricker wavelet (also referred to as Mexican Hat wavelet) is used here because of its close similarity to the measured radar reflection signal. In other words, cross-correlation is used to determine that Ricker wavelet that best matches the shape of the current signal trace:
This additional signal phase information can be used for advanced classification and segmentation tasks where the identification of phase angle changes might be a valuable additional feature (e.g., for water content analysis or detection of buried objects).
Figure 13a shows exemplarily an extracted low-frequency segment. Then a further subdivision of this segment is performed by defining a phase angle threshold as a distinguishing feature: those traces with a phase angle equal to approx. 0° are shown in the middle image, (
Figure 13b) and those with a phase angle equal to approx. 180° are shown in the lower image (
Figure 13c). As can also be perceived visually, the middle image consists of signals with a phase angle of approx. 0°, i.e., the signal maximum (bright) is clearly recognizable here in the central area of the image segment. In the lower image, on the other hand, the central peak area is continuously pronounced as a signal minimum (dark) indicating phase angles of 180°.
5. Summary and Conclusions
The presented form of automatic texture segmentation based on 2D Gabor filtering contributes to reducing the complexity of interpreting and quantifying radargrams in rail track condition assessment. Based on the presented approach, it is immediately recognizable whether, for example, anomalous characteristics of radargram structures are present. The algorithm identifies those image areas that appear similar in terms of spatial frequency, amplitude, and “local scattering”—the individual interpretation of these characteristic image structures still must be carried out by experts. However, it can support the evaluation process and significantly increase the reproducibility of results, since the same objective analysis criteria are applied to assess characteristic image structures which can then be analyzed in detail, e.g., to support the difficult process of determining critical track conditions such as ballast fouling or clay fouling. This process is accompanied by automated image measurements, such as the identification of different layer structures and the corresponding measurement of layer thicknesses and layer depths. Even the extraction of signal phase information at layer boundaries becomes feasible. In this way, typical radargram sections covering areas of several kilometers of track can be analyzed on modern computers within a few seconds. The radargram analysis presented is already routinely used in track assessment by Groundcontrol GmbH as a supporting method, and the degree of automation has already been significantly improved in this way.
The possibilities of pixel-precise segmentation will be further explored in future activities. This includes, for example, the detailed comparison with data from bore holes or trial pits. Future work should additionally show whether the segmentation proposed here can also be used for automated labeling in conjunction with modern deep learning methods (i.e., to efficiently generate learning data for supervised learning algorithms in this way), which could then in turn be used to further improve the automated interpretation of radargrams. The presented approach thus represents a further step towards advancing the reproducibility and objectifiability of GPR analysis of track substructures and thus increasing the safety of railways. However, the presented approach can presumably also be transferred to other areas of application in which automated radar signal evaluation is of importance.