A Survey on Data-Driven Predictive Maintenance for the Railway Industry

Davari, Narjes; Veloso, Bruno; Costa, Gustavo de Assis; Pereira, Pedro Mota; Ribeiro, Rita P.; Gama, João

doi:10.3390/s21175739

Open AccessReview

A Survey on Data-Driven Predictive Maintenance for the Railway Industry

by

Narjes Davari

¹

,

Bruno Veloso

^1,2,3

,

Gustavo de Assis Costa

⁴

,

Pedro Mota Pereira

⁵,

Rita P. Ribeiro

^1,6,*

and

João Gama

^1,3,*

¹

Institute for Systems and Computer Engineering, Technology and Science, 4200-465 Porto, Portugal

²

Faculty of Science and Technology, University Portucalense, 4200-072 Porto, Portugal

³

School of Economics, University of Porto, 4099-002 Porto, Portugal

⁴

Federal Institute of Goiás, Campus Jataí, Unity Flamboyant, Jataí 75801-326, Brazil

⁵

Metro of Porto, 4350-158 Porto, Portugal

⁶

Faculty of Sciences, University of Porto, 4169-007 Porto, Portugal

^*

Authors to whom correspondence should be addressed.

Sensors 2021, 21(17), 5739; https://doi.org/10.3390/s21175739

Submission received: 21 July 2021 / Revised: 18 August 2021 / Accepted: 20 August 2021 / Published: 26 August 2021

(This article belongs to the Special Issue Machine Learning from Heterogeneous Condition Monitoring Sensor Data for Predictive Maintenance and Smart Industry)

Download

Browse Figures

Versions Notes

Abstract

:

In the last few years, many works have addressed Predictive Maintenance (PdM) by the use of Machine Learning (ML) and Deep Learning (DL) solutions, especially the latter. The monitoring and logging of industrial equipment events, like temporal behavior and fault events—anomaly detection in time-series—can be obtained from records generated by sensors installed in different parts of an industrial plant. However, such progress is incipient because we still have many challenges, and the performance of applications depends on the appropriate choice of the method. This article presents a survey of existing ML and DL techniques for handling PdM in the railway industry. This survey discusses the main approaches for this specific application within a taxonomy defined by the type of task, employed methods, metrics of evaluation, the specific equipment or process, and datasets. Lastly, we conclude and outline some suggestions for future research.

Keywords:

condition-based maintenance; predictive maintenance; machine learning; deep learning; artificial intelligence; railway industry

1. Introduction

Cyber-physical systems in Industry 4.0 are reforming conventional decision-making processes, mainly through the integration of entities and functionalities via intercommunication systems and intelligent data processing approaches. This reformation brings new challenges and high complexity. Operational decisions are tougher to be made. However, these advancements might provide new solutions for typical problems, as system failures, and thus, for maintenance approaches. Among many existing maintenance approaches, Predictive Maintenance (PdM) is a data-based approach that emerged as a prominent field of research. It uses statistical analysis, Machine Learning (ML) models, and Deep Learning (DL) solutions for modeling system behavior, discovering the trends and predicting failures, which improves a system’s reliability. PdM methods divide into three main categories, namely [1]: model-based prognosis, knowledge-based prognosis, and data-driven prognosis. Data-driven PdM strategies appeared with great prominence and importance both in industry and academia.

Detecting and preventing failures in industries with high operational risk (e.g., the railway industry) is ultimately essential to improve not only the system efficiency (e.g., equipment utilization) but also its effectiveness (e.g., the integrity of the environment and human safety). An effective maintenance management approach is vital, and industries seek to minimize the number of operational failures, minimize their operational costs, and increase their productivity. Consequently, planning and analysis strategies are necessary to assess the equipment’s operating status and useful life. However, due to the complexity involved in an industrial process, several automated solutions were implemented to perform future projections about the state of equipment by signal processing techniques that can support decision making.

This literature survey attempts to present, classify, and analyze the existing data-driven approaches developed for the PdM, specifically in the railway industry. Modern transportation is highly dependent on it to move cargo and passengers. The global increase in production and logistics needs higher use of the railway industry. Thus, common damages will occur in the overall structure and components due to factors such as weather and degradation. These could potentially lead to accidents of different proportions, which can even cause fatalities [2]. Indeed, operational and technical failures have a significant impact on the railway industry.

Recent advances in sensing and computing technology have given rise to PdM which, unlike traditional maintenance management techniques (e.g., corrective maintenance and preventive maintenance), attempts to predict failures and avoid system shut down proactively. Doing so maximizes system utilization, minimizes maintenance costs, and improves the system’s safety, reliability, and efficiency. Precisely, for the railway industry, with recent technology advances in cloud storage, communication, and sensing, we can monitor any part of the system more precisely and in real-time. Thus, it is a natural need for more complex solutions to analyze data with more scalability, precision, and efficiency.

In the past decade, a large number of works addressed PdM by the use of ML/DL approaches, but mainly the latter. The monitoring and logging of industrial equipment events, like temporal behavior and fault events, can be obtained from data and records generated by various sensors installed on the equipment. Specifically, sensors can be implemented to PdM in order to decrease the failure rate and enhance the system reliability [3]. Such sensors can monitor and generate alerts for equipment with the need for attention. Progressive development of industrial (wireless) sensor networks and emerging technologies, e.g., IoT [3,4,5], brings about generating a massive amount of data with scale and higher reliability. In this perspective, ML/DL algorithms are particularly relevant to create advanced mining methods for the PdM.

Research in PdM practices for the railway industry progressively receive more attention by the industry and academia. A recent literature review regarding Big Data Analytics in the railway industry can be found in [6], where the level and the types of big data models are reviewed and summarized for operations, maintenance, and safety applications. Most of the works focus on solutions that assess the infrastructure health state like railway points (switches) and interlocking systems. Although, in the case of trains, there exist many other challenges related both to internal conditions, like the general functioning of wagons (e.g., wheels, air compressed units, brakes) and external conditions, like weather, geographical position, in addition to other variables.

The dynamic context of the railway system is exceptionally challenging and these areas, by themselves, require the study of many combinations of analysis. In this sense, we define a taxonomy specific to the context of the railway industry. Differently, from [6], our taxonomy classifies the related works in three areas: infrastructure, scheduling policies, and vehicles. We also classify the works based on the type of data analysis method used to address PdM practices. We also employed a classification grounded on ML and DL algorithms, following the work in [1]. In practice, PdM needs a timely decision-making process which in turn needs models able to process data and adjust themselves in a timely manner.

In short, in this survey, we try to answer the following questions.

What parts of the overall railway industry are subject to PdM techniques?
What kind of data are being used with PdM?
How the DL methods are employed in the PdM applications?
What solutions are supported by DL methods and which are being used to perform PdM on the railway industry?

The contributions of this paper are threefold: (i) we review the maintenance applications, specifically the PdM practices describing the taxonomy of the solution space in addition to some technical aspects and current trends, (ii) we review recent advancements for data-driven PdM practices, specifically for the railway industry, and (iii) we present some of the main evaluation metrics for the PdM practices.

This paper is organized as follows. Section 2 presents and classify the PdM practices. Section 3 reviews the main ML and DL algorithms implemented for the PdM practices, also, the reader can find some of the most used datasets for Data-driven PdM, serving as a starting point for new projects. Section 4 specifically devoted to data-driven PdM practices in the railway industry, and Section 5 reviews the evaluation metrics for the PdM methods. Finally, in Section 6, we conclude with our final remarks and envision potential future research directions.

2. Predictive Maintenance

Maintenance corresponds to the process that deals with equipment or system components to ensure their normal functioning under any circumstances. Over the years, several different maintenance approaches have been developed, each representing a different generation over time due to technological advances. Three main maintenance approaches can be classified as below [7]:

Corrective maintenance: it means run-to-failure, which is the simplest and the oldest method. The idea is to take action only after a machine or equipment fails. It would almost always lead to high (unexpected) downtime, besides having maintenance staff expenditure. This method usually generates a critical situation that will demand a great cost for companies.
Preventive maintenance: it provides planning of regular replacement of components and/or equipment. Considering historical failure data and/or the data provided by the equipment manufacturer, MTTF is calculated, which in turn is used by the maintenance team to propose a preventive action plan. Although this approach prevents unexpected shutdown, it usually needs additional costs and an increased unexploited lifetime.
PdM: it needs direct monitoring of the mechanical condition and other parameters that can determine the operating conditions over time. Indeed, due to technological advances, existing tools can process real-time data acquired from different equipment parts to predict any sign of failure.

An equipment failure is almost random and unpredictable which is impacted by several (unknown) factors. A well-known technique to decide on the maintenance approach is P-F curve analysis (cf. Figure 1), which allows understanding the condition of equipment over time [8]. During the time between the detection of potential failure and the actual failure, it is crucial to perform a maintenance action to address the problem before a functional failure occurs.

The improvement of computing capacity, communication, and storage infrastructure allowed the triggering of PdM of mechanical equipment as the focus of the next stage of development [5]. In industrial manufacturing, IoT embedded in machines and production lines is now a reality. Large-scale stream processing for real-time data also becomes a reality that needs to be considered by industries, mainly because of competitive issues. PdM became one of the central answers to this challenge [9].

The most common data collected from sensors are vibration, thermography, and tribology [7]. PdM planning usually uses data streams to obtain operational conditions information and predicts equipment failures. Usually, it contributes to cost reduction and the overall improvement of quality in production. Nevertheless, results could still be better if we make use of data from more sensors or even the combination of some of them [8].

Over the years, PdM practices have been developed from several perspectives; namely, ref. [10]: (i) f6+ailure prediction, to predict equipment failure overtime interval; (ii) RUL estimation, to estimate the remaining useful lifetime of equipment. These two perspectives are illustrated in Figure 2 and are detailed next.

2.1. Failure Prediction

Failure Prediction is the most generic and direct perspective for the PdM practices for which the main goal is to predict the approximate moment where some failure could occur.

PdM is generally employed based on the health status of critical elements. In an attempt to avoid possible interruptions or even more severe damage, based on the operational history of different components, this strategy can be used to predict failures over time, minimizing costs and extending the useful life of the components.

2.2. Remaining Useful Life (RUL)

Different maintenance management policies can be employed by the use of anomaly detection, diagnostics, and prognostics [11]. The RUL is strongly related to prognostics, which provides the amount of time equipment will be operational before it requires any repair or replacement. Prognostic is directly related to MTTF estimation and the likelihood of system failure occurrence. It can be regarded as a forecasting process given the current machine conditions and its historical record [12].

Based on the application type, goals may differ, i.e., PdM can be performed to predict the RUL of a specific asset or a set of assets to predict failure within a given time window or even just flagging abnormal behavior in a system. Current works reflect this modeling behavior, as will be seen in the following sections.

A categorization of methods and techniques for RUL can be found in [13]. As a fundamental task for RUL, prediction clearly defines the difference between run-to-failure (corrective maintenance) and time-to-failure (prognostics) strategies.

3. Data-Driven PdM

Unlike the model-based maintenance approaches (e.g., preventive maintenance approaches) that rely on forecasting the performance degradation by the use of stochastic models, data-driven PdM practices are based on data without prior knowledge of degradation conditions. Its performance strictly depends on the analysis of signals and data. While for complex systems, model-based solutions can be expensive and inaccurate, data-driven diagnosis methods are a promising alternative to fault/anomaly detection and isolation [14]. ML and DL algorithms and tools are naturally relevant to the PdM practices, mainly due to a large amount of data (specifically the unlabeled ones). Based on the availability of data and respective labels, learning methods can be classified into three different categories: (i) supervised learning, in which a labeled training data set are used for a mapping from the set of predictor variables values to a specified target variable; (ii) semi-supervised learning, where the goal is to learn from data sets that have the target variable value for only a subset of examples [15]; and (iii) Unsupervised learning, in which machine learns from data sets with no target variable.

In addition, RL and DL are also mainly implemented often under the scope of semi-supervised and/or unsupervised approaches [16]. The former is a technique that looks forward to discovering the actions needed to maximize a numerical reward in a trial-and-error fashion, while the latter is defined by the structure and functions of NNs [17]. DL differs on how features are handled. There is a hierarchy with features at different levels, where the composition of low-level features forms higher-level ones and, complex functions can be learned by mapping the input to the output [18].

Recent reviews on the ML/DL methods for PdM are found in the literature. We highlight some of those next. In [16], authors describe the recent advances in techniques and applications. In [19] the authors provide a review of the recent advancements of ML/DL techniques applied to PdM for smart manufacturing, and the works are classified based on ML/DL algorithms, ML/DL category, machinery and equipment used, device used in data acquisition, and data size and type. Finally, in [20] authors provide an insight into ML/DL used for PdM practices and provides an overview of industrial sensors and future research aspects of sensors in PdM practices.

Regarding the data available for the PdM practices, it is challenging to assign labels to the real-time data stream from sensors in an industrial plant. Firstly because of the limited types of measurements and secondly because of the cost and feasibility of having one or more specialists analyze data. Thus, we can argue that using supervised learning is not a feasible solution way in this context. Another important aspect is the scale. Different types of sensors are massively being adopted for use in a great variety of automation applications. With the IoT paradigm, new challenges are imposed for the storage and retrieval of large amounts of data and their meaningful visualization [21].

The last 6 years have been very productive in PdM research and works with ML/DL methods for industrial applications are becoming the majority of them. The current advances in this area contribute mutually to enhancing methods and the improvement of industrial planning. From this scenario, we can conceive many challenges. Next, we review the main ML and DL tools implemented in PdM practices and on the following public datasets available on the Web for PdM is reviewed.

3.1. Traditional Machine Learning Methods

Several ML algorithms and methods have been used to predict failures and RUL. Some approaches explored the use of classical algorithms as LR [22], SVR [23], SVM [24], RF [25] while osthers explored the combined use of algorithms with step phased approaches: ARIMA and SVM [26], SVR and SVM [27] and TL with RF [28]; and also with a comparative approach: RF, QRF, DT, KNN, SVR and PCR [25]. In here, we briefly review recent works used traditional ML methods in PdM applications.

AE, a network trained to attempt to copy its input to its output, is widely used in PdM practices. It is a method well-suited for unsupervised feature extraction. Based on the AE architecture, many works have adopted a common solution of extracting features from the input in an attempt to reduce concerns of overfitting in the models [29,30,31,32,33,34,35,36], or as in the case of [37], where AE was used as part of the ensemble model.

To make simple AE more robust, a Variational AE (VAR) is also proposed for learning deep latent-variable models and corresponding inference models by the use of stochastic gradient descent. In [38], the Variational AE was used to deal with insufficient labels in an asset failure prediction application.

Baptista et al. [39] proposed a framework based on ARMA to make predictions as an alternative to traditional life usage modeling. The case study involved a critical component of commercial aircraft. Zheng [40] presented a method to predict a bearing RUL based on a health indicator algorithm and a linear degradation model. Ordóñez et al. [26] proposed an algorithm supported by ARIMA and SVM models for RUL prediction of aircraft engines.

Using Empirical Mode Decomposition and Wavelet Transforms as pre-processing techniques to improve input quality, coupled with Particle Swarm Optimized Support Vector Machines (PSO+SVM), Souto Maior et al. [41] has estimated the RUL of bearing from the IEEE PHM Challenge 2012 big dataset.

Zhang et al. [42] proposed to use transfer learning with bi-directional LSTM for RUL estimation. They firstly train the models on different but related datasets and then fine-tuned by the target dataset. The performance of the estimation model is evaluated with two measures that were used: Scoring Function [43] and RMSE.

3.2. Deep Learning Methods

Traditional ML approaches show better performance for lesser amounts of input data. However, advancements in sensing technologies and the emergence of technologies such as IoT produce a vast amount of data, and consequently, the performance of traditional ML techniques could not meet the required scale. In this context, DL becomes a necessary choice [16]. DL techniques process highly non-linear and varying sequential data with minimal human input in several knowledge domains [44].

A recent survey in [45] presents a systematic review specifically DL techniques applied to PdM practices, where the DL benefits and limitations for fault diagnosis and prognostics are discussed. Another recent review for DL techniques applied to PdM practices can be found in [46]. Nevertheless, another recent review can be read in [47] specifically for DL applied to machine health monitoring in which an overview on AE and its variants and RBM and its variants including DBN and DBM, CNN, RNN are presented.

In addition to the review works, some recent works proposed to perform a comparative analysis of their PdM strategy to different classical ML algorithms [48,49,50,51]. Given the steadily increasing use of sensors and the amount of data produced by them, and the fact that these data are often materialized as real-time time series DL methods will undoubtedly be among the future PdM tools. Thus, in the following subsections, we will give focus on DL algorithms and methods.

3.2.1. Deep Neural Network (DNN)

A DNN is an ANN with multiple layers (more than two hidden layers) between the input and output layers without looping back, and the flow of the network goes through the layers, calculating the probability of each output [52,53].

Among the early applications of DL methods, we can refer to a multi-layer feed-forward ANN for engine fault diagnosis is developed in [54], an ANN method to classify diesel engine fault occurrences in [55], a feed-forward ANN prediction model to estimate conditions of laser welding processes in [56], and a two-layer ANN for a fault diagnosis framework which can learn features extracted from mechanical vibration signal.

Several relevant works also employed DNN to develop prediction models. In general, the goals are to diagnose different elements of an industrial plant, e.g., wind turbine gearbox [57], rolling bearings, and planetary gearboxes [58], among others [59,60,61,62,63].

3.2.2. Convolutional Neural Network (CNN)

A CNN is a type of DNN that is trained with the backpropagation algorithm and is common in image processing tasks [64] and is widely used for PdM practices. A diagnosis strategy to detect the fault type in the planet bearing is proposed in [65]. The strategy is based on the SST, where the Hilbert transform processes raw vibration signals to obtain the fault information. The 1D time-series signals are converted into 2D images, from which a DCNN can automatically learn underlying fault features by fault classification. Additionally, DCNN used in [66] to monitor the wear condition of an abrasive belt from grinding sound signals. Another fault recognition method for rotating machinery is proposed in [67] in which a multi-sensor data fusion and bottleneck layer optimized CNN is used to (i) convert vibration signals from multiple sensors to 2D images and (ii) extract features and fuse the multi-sensor data.

Fault diagnosis is also considered in Chen et al. [68], where a CNN and DWT method is used to identify the fault conditions of planetary gearboxes of wind turbines. CNN is used to learn the discriminating features from the coefficients of DWT. Moreover, Ma and Chu [37] proposes a diagnosis method for rotor and rolling bearings faults based on an ensemble DL formulation, which in turn is based on a multi-objective optimization algorithm. The ensemble learning approach is based on ResCNN, DBN and Deep AE.

CNN methods are also used for RUL estimation; e.g., Wang et al. [10] proposes an approach supported by Functional Data Analysis (FDA) for RUL estimation. The method incorporates the correlations within the same equipment and the discrepancy across sensor time series from different equipment. Additionally, Al-Dulaimi et al. [69] propose a Hybrid DNN model for RUL estimation that integrates two parallel paths (one LSTM and one CNN) followed by a fully connected multilayer fusion NN which combines the output of the two paths to form the target RUL.

3.2.3. Recurrent Neural Network (RNN)

In contrast to feed-forward networks, in RNN feedback loops are possible. Additionally, a cascade of neurons get fired in this kind of network, and the output of a neuron only affects its input at some later point in time, i.e., they have some limited duration before becoming inactive.

In [70], a method based on LSTM RNN, is proposed to assess bearing performance degradation. LSTM is an RNN architecture that has feedback connections and, in addition to single data points, it can also process sequences of data. A bearing degradation indicator is constructed to represent the bearing running states, validated with feature verification and selection by a simulation model based on a vibration response mechanism. Another LSTM architecture is proposed in [71] to predict whether a truck compressor failure will happen within a specified time window of 90 days. However, Nguyen and Medjaher [72] design a LSTM classifier to calculate the probabilities that the system will fall into different time intervals.

In [73], authors present two models to capture and encode characteristics of signals, or groups of signals on-board vehicles caused by air compressor faults in city buses. One approach used histograms, and the other is based on echo state networks (ESNs), a specific type of RNN, that exhibits fast training without local optima, and it is used for modeling the signal. Recently, Gugulothu et al. [74] present an approach based on RNN that processes sensor data in a sequence-to-sequence model to generate embeddings for multivariate time series. They generate separate embeddings for normal machines and degraded machines and, after comparison, it is possible to estimate the RUL, even in the presence of noise in sensor readings.

More recently, a RNN classifier has been introduced by Onchis [75] for condition monitoring of cantilever beams. They used the changes in natural frequencies based on time-frequency processing extracted from vibrating beams. Most recently, Lepenioti et al. [76] implements a RNN for predictive analytic and a multi-objective RL method for prescriptive analytic. The proposed method was implemented for a PdM scenario in a steel-making company.

3.2.4. Generative Adversarial Network (GAN)

CAN is an approach to generative modeling using DL, where two NNs compete with each other. It offers an alternative approach to maximum likelihood estimation techniques [16]. Yoon et al. [38] present a semi-supervised learning approach for modeling failures when there is a lack of a high number of labels on historical data. Using a non-linear embedding technique, based on a variational AE, they combined a GAN model parameterized by DNN. Authors have also used turbofan engine degradation data sets from NASA CMAPSS [77].

In a recent work Shao et al. [78] propose the framework based on GAN) to learn from mechanical sensor data. The framework composes of two parts: generator and discriminator. The network makes use of stacking one-dimensional convolution layers to learn local features from the original input. Most recently, two GAN networks were proposed in [79] for failure prediction based on experimental data collected from an Air Pressure System (APS) data set [80] and a turbofan engine degradation data sets from NASA CMAPSS [77].

Finally, we summarize the works on general data-driven solutions for PdM in Table 1. This table is outlined by employed methods and data sources, the equipment or process where the solutions were applied, and the respective references. From Table 1 we can observe that independently of the Goal or the Learning Task, most used techniques rely on different types of neural networks, showing the applicability of these techniques on different data sources (type of sensors/equipment).

3.3. Datasets for PdM

Some public datasets for testing and evaluating PdM techniques in different scenarios are provided in [87]. PdM strategy is distinctive and application-dependent, supported by the environment, available data, hardware, among others. Thus, these data sources give support to the development, testing, and comparisons with different ML techniques.

For failure prediction methods, a dataset proposed by [88] for a robot failure can be used, in which 463 samples and 30 attributes are provided. A second data source, proposed by [89], aimed to detect faults and estimate weights for a gearbox using some data and information about bearing geometry. In the dataset in [90], component failures were detected in the air pressure system of trucks, from where 76,000 samples and 171 attributes were obtained. A fourth data set, proposed by [91] is composed of faults detected from robot swarms.

For the mechanical failures, a well-known dataset, the Commercial Modular Aero-Propulsion System Simulation (C-MAPSS) [77] developed by NASA to simulate the operation of turbofan engines. The Case Western Reserve University Bearing Data Center (CWRU) [92] contains motor bearing data from different operation condition, as normal operating state, single-point drive, and fan defects. The third dataset can be considered as the one proposed in the Numenta Anomaly Benchmark (NAB) [93], where NAB version 1.1 is composed of over 50 labeled real-world and artificial time series data files. Measurements from motor current and vibration signals from the Paderborn University bearing Dataset [94] enable the verification of models and sensors of different signals to increase the accuracy of fail detection from bearings. We also can introduce PRONOSTIA [95], a popular dataset for predicting bearing’s RUL. It is known as the bearing accelerated life test dataset, which serves to investigate new algorithms. It provides real data related to the accelerated degradation of bearings in different operating conditions.

In Table 2, we collected the datasets mentioned above that can support experiments and comparative analysis in PdM studies. For each dataset, we provide the reference and a brief description.

4. Data-Driven PdM for the Railway Industry

PdM practices in the railway industry are not so recent as with many other application areas. However, recent advancements of AI technologies provide new opportunities for its expansion. Although ML/DL methods developed for the PdM practices in a wide range of applications, the literature with specific applications in the railway industry is yet scarce. A recent review regarding the data-driven PdM works in the railway tracks can be found in [96]. The works have been classified based on model types and application types. Their study indicates that in the new research trend ML/DL methods, unsupervised methods, and ensemble methods are the most implemented learning methods. Next, we also provide a review of the works developed between 2000 and 2021, classified in infrastructure, scheduling policies, and vehicles topics.

4.1. Infrastructure

Automated inspections and maintenance prediction of the infrastructure is becoming a major concern for the rail industry practitioners. Examples include but are not limited to the works reported for rail tracks and anchors. Failures on railway tracks can cause many problems related to costs, and consequently, there is great demand imposed to maintain rail tracks in a good state of repair [82].

Among the first works, an SVM based algorithm to predict impending failures and alarms of critical rail car components is proposed in [97], in which they use data from sensors installed along the railway. Recently, a data-driven PdM method has been developed in [98] for the railroad switch which is an arrangement of equipment that enables railway trains to switch from one track to another. Faults in this system can cause traffic delays. The author uses the data available from maintenance bookkeeping and railway controlling system logging. The proposal faced the problem with a supervised learning strategy to make predictions and tests are performed by SVM, RF, naive Bayes generative model, and LR methods. Railway tracks are critical components in the rail industry. Faults and failures will necessarily occur to tracks as with any other mechanical system with time and usage.

Another recent work in [99] proposes tree-based classification techniques (e.g., decision tree, random forest, and gradient boosted trees) for the maintenance need prediction, activity type, and trigger’s status of railway switches. This study criticized the expensiveness of employing additional data collection measures to record the assets’ behavior. The author has utilized historical data of visual inspection, condition state, and maintenance records. From comprehensive maintenance action data, e.g., visual inspections and maintenance records, this classification technique employs multiple models based on a DT, an RF and GBT.

More recently, ref. [100] design a four-layer big data architecture for establishing a data management framework to manage enormous amounts of data produced by railway switch points. A LSTM prediction model is implemented within the framework for detecting failures based on analytical tasks in the Italian railway industry. Additionally, a data-driven risk prediction model to predict and evaluate rail defects and service failures is proposed in [101], in which a framework to predict the risk of rail defects recurrence in different segments of the network is also developed.

Lately, an advanced data mining method based on ML techniques to create strategic decision support and draw up a risk and control plan for trains was proposed in [102]. They used stored-inactive data from a Greek railway company for the random forest classifier and decision tree classifier algorithms trained by the historical data for 6 years. According to the experience extraction from domain experts and the available resources from the system, the approach improves operations efficiency.

4.2. Scheduling Policies

Recent reviews for the railway industry [82,103] reveals that most works address track defects using corrective maintenance. In addition, the scheduling process is mainly planned in cases when defects are already known. Among the few works considered data-driven PdM practices, predictive and risk-based maintenance activities schedule is considered in [104], in which predictions for maintenance of railway infrastructure are performed by predicting the degradation state of certain assets. A two-stage stochastic linear program forecasts the future track conditions.

A data-driven policy for the inspection and maintenance of track geometry to give support on both corrective and preventive maintenance is proposed in [82], where a Markov chain and Bernoulli process were used to modeling data from some observed magnitudes. The results using RF, SVM and LR algorithms are compared and further used to model the relationship between the explanatory and the dependent variables. Moreover, a MCMC simulation is employed to calculate and compare the total cost of different policies.

An integrated method for the prediction of rail and geometry defects and optimal scheduling is proposed in [105]. In railway industry terminology, geometry defects are horizontal and/or vertical misalignment on the track, while rail defects include track wear such as corrosion or impairments such as broken rails or cracks. The solutions provide inspection and maintenance schedules. The authors make use of K-means to perform feature selection, followed by predicting the number of defects by RF and RNN methods. Moreover, a MDP to integrate the stochastic nature of defect occurrence into scheduling is used to find the optimum inspection policies.

4.3. Vehicles

Considering the components for which a data-driven PdM is practisced, vehicle maintenance prevails with a particular emphasis on the maintenance of four components: wheel, bearing, truck, and traction. In an early work, a knowledge discovery solution is presented to extract data from historical behavioral data collected by sensors in [106]. It is based on association rules, more specifically sequential pattern mining, to extract specialized classes. Using anomaly detection, they compare new patterns with sequential patterns describing normal behavior that were extracted before. Later, a RF based methodology was developed in [25] to assess the current health and predict RUL of both trucks (bogies) and wheels of a rail-car by fusing measurements from three types of detector. The MissForest, an RF based non-parametric imputation method, is also used to handle missing data in detector reading. The work in Fumeo et al. [85] deals with data streams coming from onboard sensors to make RUL predictions. They proposed a novel algorithm based on Streaming Data Analysis (SDA), where predictions are performed with online-SVR.

Recently, data extraction from open/close cycles controlling valves of a train door is proposed in [2], where the authors aimed to detect structural failures in the train door controlling system. Firstly, an anomaly detection algorithm is used with the support of different windowing strategies. After that, a low-pass filter is applied to the output in an attempt to improve anomaly detection. In addition, a temporal factor is incorporated in both phases.

DNN and traditional data-driven methods, regarding the extraction of fault features, are compared in [107]. These features should represent, effectively, essential information aiming to perform an intelligent diagnosis. The fault signals of bogies with big data were processed using a DNN, and the corresponding results are compared with those from a multi-hidden layer neural network, a single hidden layer neural network with a shallow structure. The work concludes that DNN can improve identification accuracy and are extremely useful in reducing defects into manually designing the features. A framework to detect air leakage and predict its severity to determine action plans is presented in [22], in which anomalies are detected to find air leakages from the logs of a compressor. The method is based on a LR classifier to model different classes of compressor behavior for the trains from a fleet. It also employs a clustering method to differentiate anomalies from outliers. The author claims that most failures can be detected one to four weeks before the occurrence and that their contextual anomaly detection method can avoid false alarms. They made use of real datasets from Dutch Rail.

Most recently, an online detection model for train speed is proposed in [108], in which an anomaly detection strategy and a Bayesian statistical model that represents train behavior in speed changes are developed. A linear regression model is employed, taking into account the time duration and travel distance from the departure station. In this study, the OpenRails platform is used to simulate the operation of trains and generate data aiming to evaluate the performance of the model. A learning method for the prediction of wheelsets RUL and failure types, combining linear regression loss, LR loss, and L2/L1 regularization, is proposed in [27]. The method is based on SVM for failure type classification and SVR for RUL prediction.

4.4. Overview

Following the literature we reviewed in the previous sections, a summary is presented in Table 3. Generally, it is possible to verify that a significant part of the references was conducted by supervised learning. The exceptions are the works in [2,105], which make use of semi-supervised and unsupervised learning, respectively. Moreover, there is an almost exact division in task employment, i.e., half-used anomaly detection and other half used prediction.

Excepting the works in [22], and ref. [27] that propose to perform both Failure Prediction (FP) and RUL estimation, all the other works aimed to reach distinct goals. As can be observed from Table 3, only two papers addressed RUL estimates for some railway assets while the rest proposed to predict some type of failure.

As we stated before, supervised learning is not a feasible solution in the context of PdM for the railway industry because it makes predictions based on known training examples. In addition, as the operation of this system is dynamic over-functioning time, we can realize one first challenge of having a model that can be updated in real-time (online learning) for the anomaly detection task. There are several challenges in robustly learning the distribution for any time series without any supervision [109].

More than half of these works gave attention to the maintenance need of trains behavior in the sense of cost reduction and accident avoidance. In the current context, this attention will increase due to the new challenges involving new ways of measuring and detecting the different parts of the train system in a multivariate analysis fashion. Another important aspect is the data types used in the experiments. Most of them were real data extracted from sensors/monitors, as stated in [87].

5. Evaluation Metrics in PdM!

In this section, we provide a review of the metrics used for performance evaluation of the PdM practices, specifically in the railway industry. Reviews for the measurement of the performance of anomaly detection methods and prognostic systems can be found in [110,111]. The most common performance evaluation metrics in the context of PdM are reported in Table 4 and described next.

5.1. Failure Prediction

The metrics proposed for the performance evaluation of Failure prediction methods mainly measure the number of failures predicted accurately and/or the number of wrong predicted failures. Accuracy is a natural metric through which the number of true predicted failures and true predicted non-failures over a total number of events is measured. The performance of the DNN developed for fault prediction in bogies in [107] was evaluated through the Accuracy metric. It also has been used to evaluate the performance of the fusion algorithms based on neural networks proposed in [5] for mechanical fault diagnosis. Accuracy, misclassification rate, and f-score were also used in [99] to evaluate the performance of classification technique for maintenance prediction of railway switches.

The other principal evaluation metric is PR score, in which the percentage of truly identified failures over the number of predicted failures (true or false) is calculated (precision) and is compared to the percentage of the failures identified truly overall the failures (recall) [112]. PR score was used to evaluate the sensors data pattern mining approach developed in [106] and to evaluate the performance of fault prediction of railway track geometry developed in [82].

PR score has also been used to evaluate the failure prediction developed in [98] for data of maintenance bookkeeping and system logging. The authors also made use of AUC-ROC [114] to evaluate prediction performance and error analysis.

The performance of the integrated inspection and maintenance scheduling operations proposed in [105] for train geometry defects predictions were evaluated using MAE and RMSE metrics. RMSE was also used in [22] to evaluate a logistic regression classifier and a density-based clustering method proposed for anomaly detection. Moreover, the failure prediction method proposed in [39] based on operational log data was evaluated through Accuracy and precision were the metrics approached, in addition to RMSE, the median absolute deviation, and MTBF, a metric from the reliability domain.

In [2], authors adapted two metrics, namely: rFAR and rIPR, to deal with outlier detection, benefiting from the early failure detection. The rFAR reduces the number of false alarms, appearing just before the correct identification of a failure. In rIPR reduce the number of impostors for appearing after the correct identification of a failure.

5.2. Remaining Useful Life

MAE, MAPE, MSE, and RMSE are among the most common performance metrics used to evaluate RUL prediction methods. MAPE and MSE were used to evaluate the RF based methodology was developed in [25] to predict RUL of both trucks and wheels of a rail-car. The MAPE was also used for performance evaluation of RUL prediction proposed in [27], in which the authors also used PR for the classification result, and RUL estimation of bearings proposed in [85].

The MAE and MAPE were used to evaluate an approach for RUL estimation on two datasets was proposed and evaluated in [74], and an algorithm based on ARIMA and SVM proposed in [26] for RUL estimation. MAPE and RMSE were also used in [28] to evaluate a mapping function using RF regression model for predicting RUL of equipment under the scenario that labeled data are only available for the source domain.

The other performance metric includes confusion probability in [72] for an LSTM classifier proposed to perform prognostics and Accuracy in [113] for an approach for RUL estimation on two datasets was proposed and evaluated in [74].

6. Conclusions and Future Directions

In this survey, we reviewed the main works developed ML/DL algorithms for PdM in the railway industry. Some questions were initially outlined, but during the review, we also got an overview of new trends and challenges that can be faced by academia and industry.

Although the data-driven PdM are gaining more research attention, specifically in the past few years, the number of works specifically designed for the railway industry is quite limited. Initially, we were interested in the works including the vehicles, e.g., the general functioning of wagons. However, the limited number of works led us to consider a broader context.

Considering the research trends reviewed in the previous section, we can observe some significant gaps to be researched in future works. As noted, only a few works have faced the problem of using data as time series. Sensors typically gather data in the time-series format. Thus, we can envision this scenario as a task of anomaly detection in time series. Anomaly detection is the problem characterized by identifying specific patterns or events in data that are pretty different from the rest. Anomalies can arise in the data for many reasons, and one of the most common examples is malicious activities, as in the case of credit card fraud.

In manufacturing systems, reducing downtime is critical, and anomaly detection enables PdM for downtime reduction. Recent works have addressed anomaly detection for PdM supported by learning strategies on sequential data [2,39,106,115,116,117,118]. In the last few years, several papers were published approaching Anomaly Detection with Time-Series data applied to the most different domains, including industry, public water, and energy systems, among many others [1,109,112,114,118,119,120,121,122,123,124,125,126,127,128,129,130,131,132,133,134,135,136,137,138,139,140].

Dealing with models high volume of time-series in real-time to perform anomaly prediction is the major challenge. Moreover, currently used metrics are not feasible in this context, and it will be indispensable to look for new alternatives that can efficiently evaluate models.

The other essential line of action is to look for different DL algorithms and architectures like RNN, GAN, TL and RL. Recent works have proposed approaches based on DL to resolve the problem of anomaly detection in time-series [28,125,127,139,141,142]. Nevertheless, new proposals in this research line will be necessary.

The last challenge would be to achieve the desired synergy between ML/DL methods and RCA by gaining automatic reasoning power to explain causality, which these methods by themselves are unable to perform.

Author Contributions

Conceptualization, N.D., G.d.A.C., R.P.R. and J.G.; methodology, N.D. and G.d.A.C.; software, N.D., B.V. and G.d.A.C.; validation, N.D., B.V., G.d.A.C., P.M.P., R.P.R. and J.G.; formal analysis, N.D. and G.d.A.C.; investigation, N.D., B.V. and G.d.A.C.; resources, N.D. and G.d.A.C.; writing—original draft preparation, N.D. and G.d.A.C.; writing—review and editing, N.D., B.V., G.d.A.C. and R.P.R.; visualization, N.D. and G.d.A.C.; supervision, R.P.R. and J.G.; project administration, R.P.R. and J.G.; funding acquisition, R.P.R. and J.G. All authors have read and agreed to the published version of the manuscript.

Funding

This work was fully supported by FCT—Fundação para a Ciência e a Tecnologia, Portugal, I.P., under Grant DSAIPA/DS/0086/2018.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

This research was carried out in the context of the project FailStopper (DSAIPA/DS/0086/2018). This work was supported by the CHIST-ERA grant CHIST-ERA-19-XAI-012, funded by Fundação para a Ciência e Tecnologia by the project XPM CHIST-ERA/0004/2019.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

AE	Auto-Encoder
AE	Auto-Encoder
AI	Artificial Intelligence
ANN	Artificial Neural Network
ARMA	Auto-regressive Moving Average
ARIMA	Auto-Regressive Integrated Moving Average
AUC-ROC	Area Under Curve/Receiver Operating Characteristic
BN	Bayesian Network
CNN	Convolutional Neural Network
DBN	Deep Belief Network
DBM	Deep Boltzmann Machines
DCNN	Deep Convolutional Neural Network
DL	Deep Learning
DRL	Deep Reinforcement Learning
DNN	Deep Neural Networks
DT	Decision Tree
DWT	Discrete Wavelet Transformation
GAN	Generative Adversarial Network
GBT	Gradient Boosted Tree
GRU	Gated Recurrent Units
IoT	Internet of Things
KNN	K-Nearest Neighbour
LSTM	Long Short-Term Memory Network
LR	Logistic Regression
MAE	Mean Absolute Error
MAPE	Mean Absolute Percentage Error
MCMC	Markov Chain Monte Carlo
MDP	Markov Decision Process
ML	Machine Learning
MLP	Multi-Layer Perceptron
MSE	Mean Squared Error
MTBF	Mean Time Between Failure
MTTF	Mean Time To Failure
NN	Neural Network
PCR	Principal Component Regression
PhM	Prognostic and Health Management
PdM	Predictive Maintenance
PR	Precision-Recall
QRF	Quantile Regression Forests
ResCNN	Residual Convolutional Neural Network
RBM	Restricted Boltzmann Machines
RCA	Root Cause Analysis
RL	Reinforcement Learning
RF	Random Forest
rFAR	reduced False Alarm Rate
rIPR	reduced Impostor Pass Rate
RMSE	Root Mean Squared Error
RNN	Recurrent Neural Network
RUL	Remaining Useful Life
SCM	Structural Causal Models
SST	Synchro-Squeezing Transform
SVM	Support Vector Machine
SVR	Support Vector Regression
TL	Transfer Learning

References

Zhang, W.; Yang, D.; Wang, H. Data-Driven Methods for Predictive Maintenance of Industrial Equipment: A Survey. IEEE Syst. J. 2019, 13, 2213–2227. [Google Scholar] [CrossRef]
Ribeiro, R.P.; Pereira, P.M.; Gama, J. Sequential anomalies: A study in the Railway Industry. Mach. Learn. 2016, 105, 127–153. [Google Scholar] [CrossRef] [Green Version]
Fraga-Lamas, P.; Fernández-Caramés, T.M.; Castedo, L. Towards the Internet of Smart Trains: A Review on Industrial IoT-Connected Railways. Sensors 2017, 17, 1457. [Google Scholar] [CrossRef] [Green Version]
Killeen, P.; Ding, B.; Kiringa, I.; Yeap, T. IoT-based predictive maintenance for fleet management. Procedia Comput. Sci. 2019, 151, 607–613. [Google Scholar] [CrossRef]
Huang, M.; Liu, Z.; Tao, Y. Mechanical fault diagnosis and prediction in IoT based on multi-source sensing data fusion. Simul. Model. Pract. Theory 2019, 102, 101981. [Google Scholar] [CrossRef]
Ghofrani, F.; He, Q.; Goverde, R.M.P.; Liu, X. Recent applications of big data analytics in railway transportation systems: A survey. Transp. Res. Part C Emerg. Technol. 2018, 90, 226–246. [Google Scholar] [CrossRef]
Mobley, R.K. An Introduction to Predictive Maintenance, 2nd ed.; Elsevier: Philadelphia, PA, USA, 2002. [Google Scholar]
Bengtsson, M.; Lundström, G. On the importance of combining “the new” with “the old”–One important prerequisite for maintenance in Industry 4.0. Procedia Manuf. 2018, 25, 118–125. [Google Scholar] [CrossRef]
Wang, J.; Zhang, W.; Shi, Y.; Duan, S.; Liu, J. Industrial Big Data Analytics: Challenges, Methodologies, and Applications. arXiv 2018, arXiv:1807.01016. [Google Scholar]
Wang, Q.; Zheng, S.; Farahat, A.K.; Serita, S.; Gupta, C. Remaining Useful Life Estimation Using Functional Data Analysis. arXiv 2019, arXiv:1904.06442. [Google Scholar]
Susto, G.A.; Wan, J.; Pampuri, S.; Zanon, M.; Johnston, A.B.; O’Hara, P.G.; McLoone, S.F. An adaptive machine learning decision system for flexible predictive maintenance. In Proceedings of the 2014 IEEE International Conference on Automation Science and Engineering (CASE), New Taipei, Taiwan, 18–22 August 2014; pp. 806–811. [Google Scholar]
Galar, D.; Kumar, U.; Lee, J.; Zhao, W. Remaining Useful Life Estimation using Time Trajectory Tracking and Support Vector Machines. J. Phys. Conf. Ser. 2012, 364, 012063. [Google Scholar] [CrossRef] [Green Version]
Okoh, C.; Roy, R.; Mehnen, J.; Redding, L. Overview of Remaining Useful Life Prediction Techniques in Through-life Engineering Services. Procedia CIRP 2014, 16, 158–163. [Google Scholar] [CrossRef] [Green Version]
Khorasgani, H.; Farahat, A.; Ristovski, K.; Gupta, C.; Biswas, G. A Framework for Unifying Model-based and Data-driven Fault Diagnosis. In Proceedings of the Annual Conference of the PHM Society 2018, Philadelphia, PA, USA, 24–27 September 2018; Volume 10. [Google Scholar]
Chapelle, O.; Schlkopf, B.; Zien, A. Semi-Supervised Learning, 1st ed.; The MIT Press: Cambridge, MA, USA, 2010. [Google Scholar]
Alom, Z.; Taha, T.M.; Yakopcic, C.; Westberg, S.; Sidike, P.; Nasrin, M.S.; Hasan, M.; Essen, B.V.; Awwal, A.A.S.; Asari, V. A State-of-the-Art Survey on Deep Learning Theory and Architectures. Electronics 2019, 8, 292. [Google Scholar] [CrossRef] [Green Version]
Shalev-Shwartz, S.; Ben-David, S. Understanding Machine Learning: From Theory to Algorithms; Cambridge University Press: Cambridge, MA, USA, 2014. [Google Scholar]
Bengio, Y. Learning deep architectures for AI. Found. Trends Mach. Learn. 2009, 2, 1–127. [Google Scholar] [CrossRef]
Çınar, Z.M.; Abdussalam Nuhu, A.; Zeeshan, Q.; Korhan, O.; Asmael, M.; Safaei, B. Machine learning in predictive maintenance towards sustainable smart manufacturing in industry 4.0. Sustainability 2020, 12, 8211. [Google Scholar] [CrossRef]
Sohaib, M.; Mushtaq, S.; Uddin, J. Deep Learning for Data-Driven Predictive Maintenance. In Vision, Sensing and Analytics: Integrative Approaches; Springer: Berlin, Germany, 2021; pp. 71–95. [Google Scholar]
Chen, F.; Deng, P.; Wan, J.; Zhang, D.; Vasilakos, A.V.; Rong, X. Data Mining for the Internet of Things: Literature Review and Challenges. Int. J. Distrib. Sens. Netw. 2015, 11, 431047. [Google Scholar] [CrossRef] [Green Version]
Lee, W.J. Anomaly Detection and Severity Prediction of Air Leakage in Train Braking Pipes. Int. J. Progn. Health Manag. 2017, 8, 1–12. [Google Scholar]
Loutas, T.H.; Roulias, D.; Georgoulas, G. Remaining Useful Life Estimation in Rolling Bearings Utilizing Data-Driven Probabilistic E-Support Vectors Regression. IEEE Trans. Reliab. 2013, 62, 821–832. [Google Scholar] [CrossRef]
Chaudhuri, A. Predictive maintenance for industrial iot of vehicle fleets using hierarchical modified fuzzy support vector machine. arXiv 2018, arXiv:1806.09612. [Google Scholar]
Li, Z.; He, Q. Prediction of Railcar Remaining Useful Life by Multiple Data Source Fusion. IEEE Trans. Intell. Transp. Syst. 2015, 16, 2226–2235. [Google Scholar] [CrossRef]
Ordóñez, C.; Lasheras, F.S.; Roca-Pardiñas, J.; de Cos Juez, F.J. A hybrid ARIMA—SVM model for the study of the remaining useful life of aircraft engines. J. Comput. Appl. Math. 2019, 346, 184–191. [Google Scholar] [CrossRef]
Wang, W.; He, Q.; Cui, Y.; Li, Z. Joint Prediction of Remaining Useful Life and Failure Type of Train Wheelsets: Multitask Learning Approach. J. Transp. Eng. Part A Syst. 2018, 144, 04018016. [Google Scholar] [CrossRef]
Fan, Y.; Nowaczyk, S.; Rögnvaldsson, T. Transfer learning for Remaining Useful Life Prediction Based on Consensus Self-Organizing Models. arXiv 2019, arXiv:1909.07053. [Google Scholar] [CrossRef]
Junbo, T.; Weining, L.; Juneng, A.; Xueqian, W. Fault diagnosis method study in roller bearing based on wavelet transform and stacked auto-encoder. In Proceedings of the 27th Chinese Control and Decision Conference (2015 CCDC), Qingdao, China, 23–25 May 2015; pp. 4608–4613. [Google Scholar]
Tao, S.; Zhang, T.; Yang, J.; Wang, X.; Lu, W. Bearing fault diagnosis method based on stacked autoencoder and softmax regression. In Proceedings of the 2015 34th Chinese Control Conference (CCC), Hangzhou, China, 28–30 July 2015; pp. 6331–6335. [Google Scholar]
Lu, W.; Wang, X.; Yang, C.; Zhang, T. A novel feature extraction method using deep neural network for rolling bearing fault diagnosis. In Proceedings of the 27th Chinese Control and Decision Conference (2015 CCDC), Qingdao, China, 23–25 May 2015. [Google Scholar]
Li, K.; Wang, Q. Study on signal recognition and diagnosis for spacecraft based on deep learning method. In Proceedings of the 2015 Prognostics and System Health Management Conference (PHM), Beijing, China, 21–23 October 2015; pp. 1–5. [Google Scholar]
Galloway, G.S.; Catterson, V.M.; Fay, T.; Robb, A.J.; Love, C.P. Diagnosis of tidal turbine vibration data through deep neural networks. In Proceedings of the Third European Conference of the Prognostics and Health Management Society, Bilbao, Spain, 5–8 July 2016. [Google Scholar]
Wang, L.; Zhao, X.; Pei, J.; Tang, G. Transformer fault diagnosis using continuous sparse autoencoder. SpringerPlus 2016, 5, 1–13. [Google Scholar] [CrossRef] [Green Version]
Mao, W.; He, J.; Li, Y.; Yan, Y. Bearing fault diagnosis with auto-encoder extreme learning machine: A comparative study. Proc. Inst. Mech. Eng. Part C J. Mech. Eng. Sci. 2017, 231, 1560–1578. [Google Scholar] [CrossRef]
Chen, Z.; Li, W. Multisensor Feature Fusion for Bearing Fault Diagnosis Using Sparse Autoencoder and Deep Belief Network. IEEE Trans. Instrum. Meas. 2017, 66, 1693–1702. [Google Scholar] [CrossRef]
Ma, S.; Chu, F. Ensemble deep learning-based fault diagnosis of rotor bearing systems. Comput. Ind. 2019, 105, 143–152. [Google Scholar] [CrossRef]
Yoon, A.S.; Lee, T.; Lim, Y.; Jung, D.; Kang, P.; Kim, D.; Park, K.; Choi, Y. Semi-supervised Learning with Deep Generative Models for Asset Failure Prediction. arXiv 2017, arXiv:1709.00845. [Google Scholar]
Baptista, M.; Sankararaman, S.; de Medeiros, I.P.; Nascimento, C.L.; Prendinger, H.; Henriques, E.M.P. Forecasting fault events for predictive maintenance using data-driven techniques and ARMA modeling. Comput. Ind. Eng. 2018, 115, 41–53. [Google Scholar] [CrossRef]
Zheng, Y. Predicting Remaining Useful Life Based on Hilbert-Huang Entropy with Degradation Model. J. Electr. Comput. Eng. 2019, 2019, 3203959:1–3203959:11. [Google Scholar] [CrossRef] [Green Version]
Souto Maior, C.; Moura, M.; Lins, I. Particle swarm-optimized support vector machines and pre-processing techniques for remaining useful life estimation of bearings. Eksploat. Niezawodn. Maint. Reliab. 2019, 21, 610–619. [Google Scholar] [CrossRef]
Zhang, A.; Wang, H.; Li, S.; Cui, Y.; Liu, Z.; Guanci, Y.; Hu, J. Transfer Learning with Deep Recurrent Neural Networks for Remaining Useful Life Estimation. Appl. Sci. 2018, 8, 2416. [Google Scholar] [CrossRef] [Green Version]
Heimes, F.O. Recurrent neural networks for remaining useful life estimation. In Proceedings of the 2008 International Conference on Prognostics and Health Management, Denver, CO, USA, 6–9 October 2008; pp. 1–6. [Google Scholar] [CrossRef]
Ellefsen, A.L.; Bjorlykhaug, E.; Esoy, V.; Ushakov, S.; Zhang, H. Remaining useful life predictions for turbofan engine degradation using semi-supervised deep architecture. Reliab. Eng. Syst. Saf. 2019, 183, 240–251. [Google Scholar] [CrossRef]
Khan, S.; Yairi, T. A review on the application of deep learning in system health management. Mech. Syst. Signal Process. 2018, 107, 241–265. [Google Scholar] [CrossRef]
Zhang, L.; Lin, J.; Liu, B.; Zhang, Z.; Yan, X.; Wei, M. A Review on Deep Learning Applications in Prognostics and Health Management. IEEE Access 2019, 7, 162415–162438. [Google Scholar] [CrossRef]
Zhao, R.; Yan, R.; Chen, Z.; Mao, K.; Wang, P.; Gao, R.X. Deep learning and its applications to machine health monitoring. Mech. Syst. Signal Process. 2019, 115, 213–237. [Google Scholar] [CrossRef]
Mathew, V.; Toby, T.; Singh, V.; Rao, B.M.; Kumar, M.G. Prediction of Remaining Useful Lifetime (RUL) of turbofan engine using machine learning. In Proceedings of the 2017 IEEE International Conference on Circuits and Systems (ICCS), Thiruvananthapuram, India, 20–21 December 2017; pp. 306–311. [Google Scholar] [CrossRef]
Amihai, I.; Gitzel, R.; Kotriwala, A.M.; Pareschi, D.; Subbiah, S.; Sosale, G. An Industrial Case Study Using Vibration Data and Machine Learning to Predict Asset Health. In Proceedings of the 2018 IEEE 20th Conference on Business Informatics (CBI), Vienna, Austria, 11–13 July 2018; Volume 1, pp. 178–185. [Google Scholar] [CrossRef]
Butte, S.; Prashanth, A.R.; Patil, S. Machine Learning Based Predictive Maintenance Strategy: A Super Learning Approach with Deep Neural Networks. In Proceedings of the 2018 IEEE Workshop on Microelectronics and Electron Devices (WMED), Boise, ID, USA, 20 April 2018; pp. 1–5. [Google Scholar] [CrossRef]
Luo, B.; Wang, H.; Liu, H.; Li, B.; Peng, F. Early Fault Detection of Machine Tools Based on Deep Learning and Dynamic Identification. IEEE Trans. Ind. Electron. 2019, 66, 509–518. [Google Scholar] [CrossRef]
Goodfellow, I.; Bengio, Y.; Courville, A. Deep Learning; MIT Press: Cambridge, MA, USA, 2016; Available online: http://www.deeplearningbook.org (accessed on 8 July 2021).
Nielsen, M.A. Neural Networks and Deep Learning. 2018. Available online: http://neuralnetworksanddeeplearning.com/ (accessed on 16 July 2021).
Ahmed, R.; El Sayed, M.; Gadsden, S.A.; Tjong, J.; Habibi, S. Automotive Internal-Combustion-Engine Fault Detection and Classification Using Artificial Neural Network Techniques. IEEE Trans. Veh. Technol. 2015, 64, 21–33. [Google Scholar] [CrossRef]
Jin, C.; Zhao, W.; Liu, Z.; Lee, J.; He, X. A vibration-based approach for diesel engine fault diagnosis. In Proceedings of the 2014 International Conference on Prognostics and Health Management, Cheney, WA, USA, 22–25 June 2014; pp. 1–9. [Google Scholar]
You, D.; Gao, X.; Katayama, S. WPD-PCA-Based Laser Welding Process Monitoring and Defects Diagnosis by Using FNN and SVM. IEEE Trans. Ind. Electron. 2015, 62, 628–636. [Google Scholar] [CrossRef]
Wang, L.; Zhang, Z.; Long, H.; Xu, J.; Liu, R. Wind Turbine Gearbox Failure Identification with Deep Neural Networks. IEEE Trans. Ind. Informatics 2017, 13, 1360–1368. [Google Scholar] [CrossRef]
Jia, F.; Lei, Y.; Lin, J.; Zhou, X.; Lu, N. Deep neural networks: A promising tool for fault characteristic mining and intelligent diagnosis of rotating machinery with massive data. Mech. Syst. Signal Process. 2016, 72–73, 303–315. [Google Scholar] [CrossRef]
Zhou, F.; Gao, Y.; Wen, C. A Novel Multimode Fault Classification Method Based on Deep Learning. J. Control. Sci. Eng. 2017, 2017, 3583610. [Google Scholar] [CrossRef]
Cipollini, F.; Oneto, L.; Coraddu, A.; Murphy, A.J.; Anguita, D. Condition-Based Maintenance of Naval Propulsion Systems with supervised Data Analysis. Ocean. Eng. 2018, 149, 268–278. [Google Scholar] [CrossRef] [Green Version]
Zhang, R.; Peng, Z.; Wu, L.; Yao, B.; Guan, Y. Fault Diagnosis from Raw Sensor Data Using Deep Neural Networks Considering Temporal Coherence. Sensors 2017, 17, 549. [Google Scholar] [CrossRef]
Heydarzadeh, M.; Kia, S.H.; Nourani, M.; Henao, H.; Capolino, G. Gear fault diagnosis using discrete wavelet transform and deep neural networks. In Proceedings of the IECON 2016—42nd Annual Conference of the IEEE Industrial Electronics Society, Florence, Italy, 23–26 October 2016; pp. 1494–1500. [Google Scholar] [CrossRef]
Scalabrini Sampaio, G.; Vallim Filho, A.R.d.A.; Santos da Silva, L.; Augusto da Silva, L. Prediction of Motor Failure Time Using An Artificial Neural Network. Sensors 2019, 19, 4342. [Google Scholar] [CrossRef] [Green Version]
Lecun, Y.; Bottou, L.; Bengio, Y.; Haffner, P. Gradient-based learning applied to document recognition. Proc. IEEE 1998, 86, 2278–2324. [Google Scholar] [CrossRef] [Green Version]
Zhao, D.; Wang, T.; Chu, F. Deep convolutional neural network based planet bearing fault classification. Comput. Ind. 2019, 107, 59–66. [Google Scholar] [CrossRef]
Cheng, C.; Li, J.; Liu, Y.; Nie, M.; Wang, W. Deep convolutional neural network-based in-process tool condition monitoring in abrasive belt grinding. Comput. Ind. 2019, 106, 1–13. [Google Scholar] [CrossRef]
Wang, H.; Li, S.; Song, L.; Cui, L. A novel convolutional neural network based fault recognition method via image fusion of multi-vibration-signals. Comput. Ind. 2019, 105, 182–190. [Google Scholar] [CrossRef]
Chen, R.; Huang, X.; Yang, L.; Xu, X.; Zhang, X.; Zhang, Y. Intelligent fault diagnosis method of planetary gearboxes based on convolution neural network and discrete wavelet transform. Comput. Ind. 2019, 106, 48–59. [Google Scholar] [CrossRef]
Al-Dulaimi, A.; Zabihi, S.; Asif, A.; Mohammadi, A. A multimodal and hybrid deep neural network model for Remaining Useful Life estimation. Comput. Ind. 2019, 108, 186–196. [Google Scholar] [CrossRef]
Zhang, B.; Zhang, S.; Li, W. Bearing performance degradation assessment using long short-term memory recurrent network. Comput. Ind. 2019, 106, 14–29. [Google Scholar] [CrossRef]
Chen, K.; Pashami, S.; Fan, Y.; Nowaczyk, S. Predicting Air Compressor Failures Using Long Short Term Memory Networks. In Proceedings of the Artificial Intelligence—19th EPIA Conference on Artificial Intelligence, EPIA 2019, Vila Real, Portugal, 3–6 September 2019; Proceedings, Part I. pp. 596–609. [Google Scholar]
Nguyen, K.T.; Medjaher, K. A new dynamic predictive maintenance framework using deep learning for failure prognostics. Reliab. Eng. Syst. Saf. 2019, 188, 251–262. [Google Scholar] [CrossRef] [Green Version]
Fan, Y.; Nowaczyk, S.; Rögnvaldsson, T.S.; Antonelo, E.A. Predicting Air Compressor Failures with Echo State Networks. In Proceedings of the Third European Conference of the Prognostics and Health Management Society, Bilbao, Spain, 5–8 July 2016. [Google Scholar]
Gugulothu, N.; Tv, V.; Malhotra, P.; Vig, L.; Agarwal, P.; Shroff, G. Predicting Remaining Useful Life using Time Series Embeddings based on Recurrent Neural Networks. arXiv 2017, arXiv:1709.01073. [Google Scholar]
Onchis, H.D.M. A deep learning approach to condition monitoring of cantilever beams via time-frequency extended signatures. Comput. Ind. 2019, 105, 177–181. [Google Scholar] [CrossRef]
Lepenioti, K.; Pertselakis, M.; Bousdekis, A.; Louca, A.; Lampathaki, F.; Apostolou, D.; Mentzas, G.; Anastasiou, S. Machine Learning for Predictive and Prescriptive Analytics of Operational Data in Smart Manufacturing. In Proceedings of the International Conference on Advanced Information Systems Engineering, Grenoble, France, 8–12 June 2020; Springer: Berlin, Germany, 2020; pp. 5–16. [Google Scholar]
Saxena, A.; Goebel, K. Turbofan Engine Degradation Simulation Data Set. NASA Ames Prognostics Data Repository; NASA Ames Research Center: Moffett Field, CA, USA, 2008.
Shao, S.; Wang, P.; Yan, R. Generative adversarial networks for data augmentation in machine fault diagnosis. Comput. Ind. 2019, 106, 85–93. [Google Scholar] [CrossRef]
Zheng, S.; Farahat, A.; Gupta, C. Generative Adversarial Networks for Failure Prediction. arXiv 2019, arXiv:1910.02034. [Google Scholar]
Dheeru, D.; Taniskidou, E.K. UCI Machine Learning Repository. 2017. Available online: https://archive.ics.uci.edu/ml/index.php (accessed on 17 July 2021).
Hendrickx, K.; Meert, W.; Mollet, Y.; Gyselinck, J.; Cornelis, B.; Gryllias, K.; Davis, J. A general anomaly detection framework for fleet-based condition monitoring of machines. Mech. Syst. Signal Process. 2020, 139, 106585. [Google Scholar] [CrossRef] [Green Version]
Sharma, S.; Cui, Y.; He, Q.; Mohammadi, R.; Li, Z. Data-driven optimization of railway maintenance for track geometry. Transp. Res. Part C Emerg. Technol. 2018, 90, 34–58. [Google Scholar] [CrossRef]
Verma, N.K.; Gupta, V.K.; Sharma, M.; Sevakula, R.K. Intelligent condition based monitoring of rotating machines using sparse auto-encoders. In Proceedings of the 2013 IEEE Conference on Prognostics and Health Management (PHM), Gaithersburg, MD, USA, 24–27 June 2013; pp. 1–7. [Google Scholar]
Lei, Y.; Jia, F.; Lin, J.; Xing, S.; Ding, S.X. An Intelligent Fault Diagnosis Method Using Unsupervised Feature Learning Towards Mechanical Big Data. IEEE Trans. Ind. Electron. 2016, 63, 3137–3147. [Google Scholar] [CrossRef]
Fumeo, E.; Oneto, L.; Anguita, D. Condition Based Maintenance in Railway Transportation Systems Based on Big Data Streaming Analysis. Procedia Comput. Sci. 2015, 53, 437–446. [Google Scholar] [CrossRef] [Green Version]
Wen, L.; Dong, Y.; Gao, L. A new ensemble residual convolutional neural network for remaining useful life estimation. Math. Biosci. Eng. 2019, 16, 862. [Google Scholar] [CrossRef]
Carvalho, T.P.; Soares, F.A.; Vita, R.; da P. Francisco, R.; Basto, J.P.; Alcalá, S.G. A systematic literature review of machine learning methods applied to predictive maintenance. Comput. Ind. Eng. 2019, 137, 106024. [Google Scholar] [CrossRef]
Lopes, L.S.; Camarinha-Matos, L.M. Robot Execution Failures Data Set. 1999. Available online: https://archive.ics.uci.edu/ml/datasets/Robot+Execution+Failures (accessed on 20 July 2021).
Lopes, L.S.; Camarinha-Matos, L.M. Gearbox Fault Detection Dataset. 2009. Available online: https://c3.nasa.gov/dashlink/resources/997/ (accessed on 12 July 2021).
Lindgren, T.; Biteus, J. IDA2016—Challenge Data Set. 2016. Available online: https://archive.ics.uci.edu/ml/datasets/IDA2016Challenge (accessed on 15 July 2021).
Tarapore, D.; Christensen, A.L.; Timmis, J. Generic, scalable and decentralized fault detection for robot swarms. PLoS ONE 2017, 12, e0182058. [Google Scholar] [CrossRef]
Saxena, A.; Goebel, K. Case Western Reserve University Bearing Data Center. 2008. Available online: https://csegroups.case.edu/bearingdatacenter/pages/download-data-file (accessed on 5 July 2021).
Ahmad, S.; Lavin, A.; Purdy, S.; Agha, Z. Unsupervised real-time anomaly detection for streaming data. Neurocomputing 2017, 262, 134–147. [Google Scholar] [CrossRef]
Lessmeier, C.; Kimotho, J.K.; Zimmer, D.; Sextro, W. Condition Monitoring of Bearing Damage in Electromechanical Drive Systems by Using Motor Current Signals of Electric Motors: A Benchmark Data Set for Data-Driven Classification. In Proceedings of the European Conference of the Prognostics and Health Management Society, Bilbao, Spain, 5–8 July 2016. [Google Scholar]
Nectoux, P.; Gouriveau, R.; Medjaher, K.; Ramasso, E.; Chebel-Morello, B.; Zerhouni, N.; Varnier, C. PRONOSTIA: An experimental platform for bearings accelerated degradation tests. In Proceedings of the IEEE International Conference on Prognostics and Health Management, Denver, CO, USA, 18–21 June 2012; pp. 1–8. [Google Scholar]
Xie, J.; Huang, J.; Zeng, C.; Jiang, S.H.; Podlich, N. Systematic Literature Review on Data-Driven Models for Predictive Maintenance of Railway Track: Implications in Geotechnical Engineering. Geosciences 2020, 10, 425. [Google Scholar] [CrossRef]
Li, H.; Parikh, D.; He, Q.; Qian, B.; Li, Z.; Fang, D.; Hampapur, A. Improving rail network velocity: A machine learning approach to predictive maintenance. Transp. Res. Part Emerg. Technol. 2014, 45, 17–26. [Google Scholar] [CrossRef]
Ojala, J. On Analysis of the Predictive Maintenance of Railway Points Processes and Possibilities. Master’s Thesis, School of Science, Aalto University, Espoo, Finland, 14 May 2018. [Google Scholar]
Allah Bukhsh, Z.; Saeed, A.; Stipanovic, I.; Doree, A. Predictive maintenance using tree-based classification techniques: A case of railway switches. Transp. Res. Part Emerg. Technol. 2019, 101, 35–54. [Google Scholar] [CrossRef]
Salierno, G.; Morvillo, S.; Leonardi, L.; Cabri, G. An architecture for predictive maintenance of railway points based on big data analytics. In Proceedings of the International Conference on Advanced Information Systems Engineering, Grenoble, France, 8–12 June 2020; Springer: Berlin, Germany, 2020; pp. 29–40. [Google Scholar]
Ghofrani, F. Data-Driven Railway Track Deterioration Modeling for Predictive Maintenance. Ph.D. Thesis, State University of New York, Buffalo, NY, USA, 2020. [Google Scholar]
Kalathas, I.; Papoutsidakis, M. Predictive Maintenance Using Machine Learning and Data Mining: A Pioneer Method Implemented to Greek Railways. Designs 2021, 5, 5. [Google Scholar] [CrossRef]
Turner, C.; Tiwari, A.; Starr, A.; Blacktop, K. A review of key planning and scheduling in the rail industry in Europe and UK. Proc. Inst. Mech. Eng. Part F J. Rail Rapid Transit 2016, 230, 984–998. [Google Scholar] [CrossRef] [Green Version]
Consilvio, A.; Febbraro, A.D.; Sacco, N. Stochastic scheduling approach for predictive risk-based railway maintenance. In Proceedings of the 2016 IEEE International Conference on Intelligent Rail Transportation (ICIRT), Birmingham, UK, 23–25 August 2016; pp. 197–203. [Google Scholar]
Gerum, P.C.L.; Altay, A.; Baykal-Gürsoy, M. Data-driven predictive maintenance scheduling policies for railways. Transp. Res. Part Emerg. Technol. 2019, 107, 137–154. [Google Scholar] [CrossRef]
Rabatel, J.; Bringay, S.; Poncelet, P. Anomaly detection in monitoring sensor data for preventive maintenance. Expert Syst. Appl. 2011, 38, 7003–7015. [Google Scholar] [CrossRef] [Green Version]
Hu, H.; Tang, B.; Gong, X.; Wei, W.; Wang, H. Intelligent Fault Diagnosis of the High-Speed Train with Big Data Based on Deep Neural Networks. IEEE Trans. Ind. Inform. 2017, 13, 2106–2116. [Google Scholar] [CrossRef]
Kang, S.; Sristi, S.; Karachiwala, J.S.; Hu, Y.C. Detection of Anomaly in Train Speed for Intelligent Railway Systems. In Proceedings of the 2018 International Conference on Control, Automation and Diagnosis (ICCAD), Marrakech, Morocco, 19–21 March 2018; pp. 1–6. [Google Scholar]
Toledano, M.; Cohen, I.; Ben-Simhon, Y.; Tadeski, I. Real-time anomaly detection system for time series at scale. In Proceedings of the KDD 2017: Workshop on Anomaly Detection in Finance; Anandakrishnan, A., Kumar, S., Statnikov, A., Faruquie, T., Xu, D., Eds.; PMLR: Stockholm, Sweden, 2018; Volume 71, pp. 56–65. [Google Scholar]
Xu, X.; Liu, H.; Yao, M. Recent Progress of Anomaly Detection. Complexity 2019, 2019, 2686378:1–2686378:11. [Google Scholar] [CrossRef]
Saxena, A.; Celaya, J.R.; Balaban, E.; Goebel, K.; Saha, B.; Saha, S.; Schwabacher, M. Metrics for evaluating performance of prognostic techniques. In Proceedings of the 2008 International Conference on Prognostics and Health Management, Denver, CO, USA, 6–9 October 2008; pp. 1–17. [Google Scholar]
Lu, Y.; Kumar, J.; Collier, N.; Krishna, B.; Langston, M.A. Detecting Outliers in Streaming Time Series Data from ARM Distributed Sensors. In Proceedings of the 2018 IEEE International Conference on Data Mining Workshops (ICDMW), Singapore, 17–20 November 2018; pp. 779–786. [Google Scholar]
Bektas, O.; Jones, J.A.; Sankararaman, S.; Roychoudhury, I.; Goebel, K. A neural network filtering approach for similarity-based remaining useful life estimation. Int. J. Adv. Manuf. Technol. 2019, 101, 87–103. [Google Scholar] [CrossRef] [Green Version]
Calikus, E.; Nowaczyk, S.; Sant’Anna, A.P.; Dikmen, O. No Free Lunch But A Cheaper Supper: A General Framework for Streaming Anomaly Detection. arXiv 2019, arXiv:1909.06927. [Google Scholar]
Liu, J.; Guo, J.; Orlik, P.V.; Shibata, M.; Nakahara, D.; Mii, S.; Takác, M. Anomaly Detection in Manufacturing Systems Using Structured Neural Networks. In Proceedings of the 2018 13th World Congress on Intelligent Control and Automation (WCICA), Changsha, China, 4–8 July 2018; pp. 175–180. [Google Scholar]
Zare, S. Fault Detection and Diagnosis of Electric Drives Using Intelligent Machine Learning Approaches. Master’s Thesis, University of Windsor, Windsor, ON, Canada, 2018. [Google Scholar]
Yolacan, E.N. Learning from Sequential Data for Anomaly Detection. Master’s Thesis, Northeastern University, Boston, MA, USA, 2014. [Google Scholar]
Andrade, T.; Gama, J.; Ribeiro, R.P.; Sousa, W.; Carvalho, A. Anomaly Detection in Sequential Data: Principles and Case Studies. In Wiley Encyclopedia of Electrical and Electronics Engineering; American Cancer Society: Atlanta, GA, USA, 2019; pp. 1–14. [Google Scholar] [CrossRef]
Malhotra, P.; Vig, L.; Shroff, G.; Agarwal, P. Long Short Term Memory Networks for Anomaly Detection in Time Series. In Proceedings of the European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning (ESANN), Bruges, Belgium, 22–24 April 2015. [Google Scholar]
Thi, N.N.; Cao, V.L.; Le-Khac, N.A. One-Class Collective Anomaly Detection Based on LSTM-RNNs. In Transactions on Large-Scale Data-and Knowledge-Centered Systems XXXVI; Springer: Berlin/Heidelberg, Germany, 2017; Volume 36, pp. 73–85. [Google Scholar]
Gamboa, J.C.B. Deep Learning for Time-Series Analysis. arXiv 2017, arXiv:1701.01887. [Google Scholar]
Shipmon, D.T.; Gurevitch, J.M.; Piselli, P.M.; Edwards, S.T. Time Series Anomaly Detection: Detection of anomalous drops with limited features and sparse examples in noisy highly periodic data. arXiv 2017, arXiv:1708.03665. [Google Scholar]
Giannoni, F.; Mancini, M.; Marinelli, F. Anomaly Detection Models for IoT Time Series Data. arXiv 2018, arXiv:1812.00890. [Google Scholar]
Zhang, C.; Song, D.; Chen, Y.; Feng, X.; Lumezanu, C.; Cheng, W.; Ni, J.; Zong, B.; Chen, H.; Chawla, N.V. A Deep Neural Network for Unsupervised Anomaly Detection and Diagnosis in Multivariate Time Series Data. In Proceedings of the AAAI, New Orleans, LA, USA, 2–7 February 2018. [Google Scholar]
Pereira, J.; Silveira, M. Unsupervised Anomaly Detection in Energy Time Series Data Using Variational Recurrent Autoencoders with Attention. In Proceedings of the 2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA), Orlando, FL, USA, 17–20 December 2018; pp. 1275–1282. [Google Scholar] [CrossRef]
Lamrini, B.; Gjini, A.; Daudin, S.; Armando, F.; Pratmarty, P.; Travé-Massuyès, L. Anomaly Detection Using Similarity-based One-Class SVM for Network Traffic Characterization. In Proceedings of the 29th International Workshop on Principles of Diagnosis, Warsaw, Poland, 27–30 August 2018. [Google Scholar]
Maya, S.; Ueno, K.; Nishikawa, T. dLSTM: A new approach for anomaly detection using deep learning with delayed prediction. Int. J. Data Sci. Anal. 2019, 8, 137–164. [Google Scholar] [CrossRef] [Green Version]
Lindemann, B.; Fesenmayr, F.; Jazdi, N.; Weyrich, M. Anomaly detection in discrete manufacturing using self-learning approaches. Procedia CIRP 2019, 79, 313–318. [Google Scholar] [CrossRef]
Su, Y.; Zhao, Y.; Niu, C.; Liu, R.; Sun, W.; Pei, D. Robust Anomaly Detection for Multivariate Time Series through Stochastic Recurrent Neural Network. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, Anchorage, AK, USA, 4–8 August 2019; KDD ’19. ACM: New York, NY, USA, 2019; pp. 2828–2837. [Google Scholar] [CrossRef]
Nguyen, L.H.; Goulet, J.A. Real-time anomaly detection with Bayesian dynamic linear models. Struct. Control. Health Monit. 2019, 26, e2404. [Google Scholar] [CrossRef]
Feremans, L.; Vercruyssen, V.; Meert, W.; Cule, B.; Goethals, B. A framework for pattern mining and anomaly detection in multi-dimensional time series and event logs. In Proceedings of the International Workshop on New Frontiers in Mining Complex Patterns, held in Conjunction with ECML-PKDD 2019, Würzburg, Germany, 16–20 September 2019. [Google Scholar]
Feremans, L.; Vercruyssen, V.; Cule, B.; Meert, W.; Goethals, B. Pattern-Based Anomaly Detection in Mixed-Type Time Series. In Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases, Würzburg, Germany, 16–20 September 2019. [Google Scholar]
Munir, M.; Siddiqui, S.; Chattha, M.; Dengel, A.; Ahmed, S. FuseAD: Unsupervised Anomaly Detection in Streaming Sensors Data by Fusing Statistical and Deep Learning Models. Sensors 2019, 19, 2451. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Munir, M.; Siddiqui, S.; Dengel, A.; Ahmed, S. DeepAnT: A Deep Learning Approach for Unsupervised Anomaly Detection in Time Series. IEEE Access 2018, 7, 1991–2005. [Google Scholar] [CrossRef]
Zhang, X.; Lin, Q.; Xu, Y.; Qin, S.; Zhang, H.; Qiao, B.; Dang, Y.; Yang, X.; Cheng, Q.; Chintalapati, M.; et al. Cross-dataset Time Series Anomaly Detection for Cloud Systems. In Proceedings of the 2019 USENIX Conference on Usenix Annual Technical Conference, USENIX ATC ’19, Renton, WA, USA, 10–12 July 2019; pp. 1063–1076. [Google Scholar]
Elsner, D.; Khosroshahi, P.A.; MacCormack, A.D.; Lagerström, R. Multivariate Unsupervised Machine Learning for Anomaly Detection in Enterprise Applications. In Proceedings of the 52nd Hawaii International Conference on System Sciences, Maui, HI, USA, 8–11 January 2019. [Google Scholar]
Brandsæter, A.; Vanem, E.; Glad, I.K. Efficient on-line anomaly detection for ship systems in operation. Expert Syst. Appl. 2019, 121, 418–437. [Google Scholar] [CrossRef]
Tran, L.; Fan, L.; Shahabi, C. Outlier Detection in Non-stationary Data Streams. In Proceedings of the 31st International Conference on Scientific and Statistical Database Management, SSDBM ’19, Santa Cruz, CA, USA, 23–25 July 2019; ACM: New York, NY, USA, 2019; pp. 25–36. [Google Scholar] [CrossRef]
Yeh, Y.C.; Hsu, C.Y. Application of Auto-Encoder for Time Series Classification with Class Imbalance. In Proceedings of the Asia Pacific Industrial Engineering & Management Science Conference, APIEMS 2019, Kanazawa, Japan, 2–5 December 2019; pp. 14–17. [Google Scholar]
Graß, A.; Beecks, C.; Soto, J.A.C. Unsupervised Anomaly Detection in Production Lines. Machine Learning for Cyber Physical Systems; Beyerer, J., Kühnert, C., Niggemann, O., Eds.; Springer: Berlin/Heidelberg, Germany, 2019; pp. 18–25. [Google Scholar]
Vercruyssen, V.; Meert, W.; Davis, J. Transfer Learning for Time Series Anomaly Detection. In Proceedings of the Workshop and Tutorial on Interactive Adaptive Learning Co-Located with European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD 2017), Skopje, Macedonia, 18–22 September 2017; pp. 27–36. [Google Scholar]
Oh, M.H.; Iyengar, G. Sequential Anomaly Detection Using Inverse Reinforcement Learning. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, KDD ’19, Anchorage, AK, USA, 4–8 August 2019; ACM: New York, NY, USA, 2019; pp. 1480–1490. [Google Scholar] [CrossRef]

Figure 1. P-F reliability curve in maintenance management [8].

Figure 2. Classification of automatic industrial maintenance approaches.

Table 1. ML/DL methods used for PdM.

Goal	Learning Task	ML/DL Method	Data Source	Equipment/Process	Ref.
Failure Prediction (FP)	Anomaly Detection	Hierarchical Clustering	General faults	Time and Frequency	[81]
	Classification	RF, SVM and LR	Physical faults	Track geometry	[82]
		AE	General faults	Rolling bearing	[35,36]
				Spacecraft	[32]
				Transformers	[34]
				Rotor bearing systems	[37]
			Vibration	Tidal turbine	[33]
			Vibration	Bearings	[29]
			Acoustic signals Sensor data	Motors	[83]
		DNN	Vibration	Bearings	[84]
				Gasoline engines	[54]
				Engines	[63]
			Vibration, pressure and speed	Diesel engines	[55]
			Optical and visual	Laser welding	[56]
		CNN	Vibration	Planetary gearbox	[65,68]
			Grinding faults	Abrasive belt wear	[66]
			General faults	Rotor bearing systems	[37]
			Vibration and images	Rotating machinery	[67]
		RNN	General faults	Rolling bearing	[70]
				Air compressor	[71]
				Air compressor in buses	[73]
			Sensor data	Turbofan engine degradation	[72]
			Time-frequencies	Cantilever beams	[75]
		GAN	Sensor data	Turbofan engine degradation	[79]
		GAN	Vibration	Induction motor	[78]
Remaining Useful Life (RUL)	Regression	Online-SVR	Vibration	Rolling Bearing	[85]
		PSO+SVM	Vibration	Rolling Bearing	[41]
		Bi-directional LSTM	Sensor data	Turbofan engine degradation	[42]
		AE	Acoustic signal Sensor data	Turbofan engine degradation	[38]
		CNN	Sensor data	Turbofan engine degradation	[10,69,86]
		RNN	Sensor data	Turbofan engine degradation	[74]

Table 2. List of datasets publicly available for PdM experiments.

Ref.	Dataset Description
[93]	Numenta Anomaly Benchmark (NAB) dataset: temperature sensors on industrial machines
[88]	Force and torque measurements to detect robot failures
[89]	Failure data of a generic gearbox
[92]	CWRU: ball bearing test data for normal and faulty bearings
[94]	Synchronous measurement of motor current and vibration signals
[90]	Operational data from a pressurizing system in trucks
[95]	PRONOSTIA: bearing accelerated life test dataset
[77]	NASA C-MAPSS tools: simulate realistic large commercial turbofan engines
[91]	Failure data in a simulated swarm of robots

Table 3. Data-driven PdM for the railway industry.

Goal	Learning Task	ML/DL Method	Data Source	Equipment/Process	Ref.
Failure Prediction (FP)	Anomaly Detection	Sequential Pattern Mining	Real Data: sensors on trains	Trains	[106]
		AE, OCC!, OCSVM!, boxplotEns	Real Data: sensors on trains	Pneumatic valves of train doors	[2]
		Linear Regression	Openrails simulation platform	Train speed	[108]
	Classification	SVM	Real Data: detectors on the railway	Railway	[97]
		LR, Bayes Classifier, SVM, RF	Real Data: log files and reports	Railway turnouts	[98]
		RF	Public Real Data	Railway track geometry	[82]
		ANN	Software SIMPACK	Trains	[107]
		RF, RNN, K-means	Real defect database	Rail and geometry defects	[105]
		DT, RF, Gradient Boosting Trees	Real Data: SAP/ERP Maintenance Request Process (MRP)	Railway switches	[99]
Remaining Useful Life (RUL)	Regression	online-SVR	Real Data: detectors on trains	Train axle bearings	[85]
		RF, QRF, DT, KNN, SVR, PCR	Real Data: detectors on trains	Wheels and trucks (bogies)	[25]
		SVR, SVM	Real Data: North America Railroad	Train wheelsets	[27]
		LR	Real Data: Dutch Railways VIRM	Air leakage in braking pipes of trains	[22]

Table 4. Evaluation Metrics used in PdM.

Metric	Ref.
Accuracy	[5,82,98,99,105,107]
PR	[27,82,106,112]
Confusion Probability Matrix	[72,99]
RMSE	[22,25,26,28,39,74,105,113]
MAE	[26,74,113]
MAPE	[25,27,28,74,85,113]
AUC-ROC	[98,114]
rFAR and rIPR	[2]

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Davari, N.; Veloso, B.; Costa, G.d.A.; Pereira, P.M.; Ribeiro, R.P.; Gama, J. A Survey on Data-Driven Predictive Maintenance for the Railway Industry. Sensors 2021, 21, 5739. https://doi.org/10.3390/s21175739

AMA Style

Davari N, Veloso B, Costa GdA, Pereira PM, Ribeiro RP, Gama J. A Survey on Data-Driven Predictive Maintenance for the Railway Industry. Sensors. 2021; 21(17):5739. https://doi.org/10.3390/s21175739

Chicago/Turabian Style

Davari, Narjes, Bruno Veloso, Gustavo de Assis Costa, Pedro Mota Pereira, Rita P. Ribeiro, and João Gama. 2021. "A Survey on Data-Driven Predictive Maintenance for the Railway Industry" Sensors 21, no. 17: 5739. https://doi.org/10.3390/s21175739

APA Style

Davari, N., Veloso, B., Costa, G. d. A., Pereira, P. M., Ribeiro, R. P., & Gama, J. (2021). A Survey on Data-Driven Predictive Maintenance for the Railway Industry. Sensors, 21(17), 5739. https://doi.org/10.3390/s21175739

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Survey on Data-Driven Predictive Maintenance for the Railway Industry

Abstract

1. Introduction

2. Predictive Maintenance

2.1. Failure Prediction

2.2. Remaining Useful Life (RUL)

3. Data-Driven PdM

3.1. Traditional Machine Learning Methods

3.2. Deep Learning Methods

3.2.1. Deep Neural Network (DNN)

3.2.2. Convolutional Neural Network (CNN)

3.2.3. Recurrent Neural Network (RNN)

3.2.4. Generative Adversarial Network (GAN)

3.3. Datasets for PdM

4. Data-Driven PdM for the Railway Industry

4.1. Infrastructure

4.2. Scheduling Policies

4.3. Vehicles

4.4. Overview

5. Evaluation Metrics in PdM!

5.1. Failure Prediction

5.2. Remaining Useful Life

6. Conclusions and Future Directions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI