Article

Assessing Sensor Integrity for Nuclear Waste Monitoring Using Graph Neural Networks

1 PIMM Laboratory, Arts et Métiers Institute of Technology, Centre National de la Recherche Scientifique (CNRS), 151 Boulevard de l’Hôpital, 75013 Paris, France
2 Andra, French National Radioactive Waste Management Agency, 92298 Châtenay-Malabry, France
* Author to whom correspondence should be addressed.
Sensors 2024, 24(5), 1580; https://doi.org/10.3390/s24051580
Submission received: 19 December 2023 / Revised: 23 February 2024 / Accepted: 25 February 2024 / Published: 29 February 2024
(This article belongs to the Section Sensor Networks)

Abstract: A deep geological repository for radioactive waste, such as Andra’s Cigéo project, requires long-term (persistent) monitoring. To achieve this goal, data from a network of sensors are acquired. This network is subject to deterioration over time due to environmental effects (radioactivity, mechanical deterioration of the cell, etc.), and it is paramount to assess each sensor’s integrity and ensure data consistency to enable the precise monitoring of the facilities. Graph neural networks (GNNs) are suitable for detecting faulty sensors in complex networks because they accurately depict physical phenomena that occur in a system and take the sensor network’s local structure into consideration in their predictions. In this work, we leveraged the experimental data acquired in Andra’s Underground Research Laboratory (URL) to train a graph neural network for the assessment of data integrity. The experiment considered in this work emulated the thermal loading of a high-level waste (HLW) demonstrator cell (i.e., the heating of the containment cell by nuclear waste). The use of real experimental data acquired in Andra’s URL in a deep geological layer was one of the novelties of this work. The model used was a GNN that took as input the temperature field from the sensors (at the current and past time steps) and returned the state of each individual sensor, i.e., faulty or healthy. The other novelty of this work lay in the application of the GraphSAGE model, modified with elements of the Graph Net framework, to detect faulty sensors, with up to half of the sensors in the network being faulty at once. This proportion of faulty sensors was explained by the use of distributed sensors (optic fiber) and the environmental effects on the cell. The GNNs trained on the experimental data were ultimately compared against other standard classification methods (thresholding, artificial neural networks, etc.), which demonstrated their effectiveness in the assessment of data integrity.

1. Introduction

This section explores the context of the research, ranging from the industrial situation regarding nuclear waste storage to the scientific background regarding graph neural networks.

1.1. Andra’s Cigéo Project

Radioactive waste storage is a contemporary issue that is being addressed in different countries around the globe. One of the most feasible solutions is deep underground storage. However, such storage needs to be actively monitored for safety reasons. This study was part of Andra’s Cigéo project, which aims to design, develop and monitor a deep geological repository for radioactive waste. This repository will store higher-activity waste (HAW), including high-level waste (HLW) and intermediate-level waste (ILW), as shown in Figure 1. Given the dynamic conditions of the storage and the impossibility of accessing the sensors, there is a need for a method that can ensure the consistency of the data. Indeed, the sensor network will evolve over time, in part because of sensor aging, drift and failures.

1.2. The High-Level Waste (HLW) Demonstrator Cell

The data used were acquired in Andra’s Underground Research Laboratory (URL), which hosts an HLW demonstrator cell. This demonstrator is a prototype of an HLW storage cell and is heavily instrumented with both distributed sensors (such as optic fiber) and point sensors. Figure 2 shows the position of each sensor with respect to the demonstrator cell.
This prototype was used in an experiment that consisted of the thermal loading of the cell using the thermal sources presented in Figure 3. This experiment provided us with the data required for the proposed machine learning algorithms. The use of this experimental data was one of the novelties of this work. To the best knowledge of the authors, no other work dealing with sensor integrity assessment around high-level waste demonstrator cells in a deep geological layer has been published in the literature.

1.3. Graph Neural Networks and Message Passing

The Abbreviations section describes the different notations used in this paper.
The sensor network presented in Figure 2 can be represented as a graph by linking measurement points that are in the vicinity of each other. Moreover, graphs can capture and represent physical phenomena, such as thermal conduction, where energy conservation operates on the nodes and heat flows through the graph’s edges. The case studied in this work was one-dimensional: only the data of an optic fiber sensor, viewed as a unidirectional graph, were considered in order to test the efficiency of a GNN for the assessment (classification) of sensor integrity.
A GNN is a machine learning algorithm that takes a graph as the input and, by modifying the graph’s embeddings (of the nodes and the edges), can perform various prediction tasks (i.e., clustering, classification and regression). GNNs can make predictions at multiple levels [1,2,3,4,5,6,7]:
  • At the graph level, for instance, one could give a molecule (as an input graph) and try to find out whether the molecule is toxic.
  • At the edge level, typical operations are friend recommendations on a social network graph.
  • At the node level, classification tasks can be performed, as was the case in this work, where the state (healthy or faulty) of each sensor (i.e., each node) was derived from the graph of the sensor network.
To perform these predictions, GNNs require the raw data to be transformed into more relevant representations that are easier to process. Thus, the input graph is updated iteratively using a mechanism called message passing, which was described by Gilmer et al., 2017 [8]. The message-passing mechanism can be divided into the following tasks [1,2,3,4,5,6,7,8,9]:
  • Select a node v.
  • Collect information (the messages) from the neighboring nodes N ( v ) (and edges).
  • Aggregate the messages using a node-order invariant function α.
  • Update the embedding $x_v$ of the node using an update function ϕ (which can be learned by a neural network). This update function takes the aggregated messages and the selected node’s embedding as inputs and outputs the updated node’s embedding $x_v'$.
This mechanism, which is presented in Figure 4, leverages the connectivity of the graph by using the information of neighboring nodes (and edges) to update the nodes’ embeddings. Equation (1) describes the updating of a single node’s embedding using the message-passing mechanism [2,8]:
$$x_v' = \phi\left(\alpha\left(\{x_u\}_{u \in N(v)}\right),\ x_v\right). \tag{1}$$
Note that the update function ϕ and the aggregation function α used in the message-passing algorithm are shared across all nodes [9], which means that the same exact functions are used to update all nodes’ embeddings. This notion is central to the concept of a GNN.
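To make the mechanism concrete, the following minimal sketch (our illustration, not the authors’ code) applies Equation (1) to a toy chain graph resembling the fiber, with the mean as α and a shared random linear map followed by ReLU as ϕ; the graph, dimensions and names are assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy chain graph (like the fiber) with 3-dimensional node embeddings.
neighbors = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2]}
x = rng.normal(size=(4, 3))

# Shared update function phi: a single random linear map followed by ReLU.
W = rng.normal(size=(6, 3))  # maps [alpha(messages), x_v] (dim 6) to a new x_v (dim 3)

def message_passing_step(x, neighbors, W):
    """One round of Equation (1), with the mean as the function alpha."""
    x_new = np.empty_like(x)
    for v, nbrs in neighbors.items():
        messages = np.mean(x[nbrs], axis=0)      # alpha: node-order invariant
        x_new[v] = np.maximum(0.0, np.concatenate([messages, x[v]]) @ W)  # phi, shared
    return x_new

x1 = message_passing_step(x, neighbors, W)       # embeddings after one GNN layer
```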
The message-passing mechanism can be understood as a convolution operation on the graph [4,5,6,9,10,11,12,13,14].
A layer of a GNN is created by repeating this mechanism for all the nodes [1,2,4,5,8,9,10]. Indeed, a layer updates all nodes’ embeddings using the message-passing mechanism.
Figure 5 explains how to build a complete GNN: the top part describes the GNN model, and the bottom part showcases its application to a simple graph. This model can be split into two main elements [1,2,4,5,9]:
  • The core of a GNN aims at transforming the input graph’s embeddings into embeddings that are easier to interpret by the second part of the model. This is achieved by stacking multiple GNN layers, each of which is based on message passing. The connectivity of the graph is not modified during this step. At the bottom of Figure 5, the embeddings $x^0$ are transformed into embeddings $x^n$ by the repeated application of message passing.
  • The specific network takes the updated graph as the input and performs the desired prediction task (in the example, node classification). This network’s architecture depends on the task at hand (pooling layers for graph classification, softmax or sigmoid activation functions for classification, etc.). Then, a loss function is applied to the model in order to train it.
Equation (2) is a variation of Equation (1). It describes the updating of the nodes’ embeddings in layer l of a GNN. This equation shows how to obtain the updated embedding of any node, $x_v^{l+1}$ (and, by extension, $X_V^{l+1}$), given the previous nodes’ embeddings $X_V^l = (x_v^l,\ v \in V)$ [2,8,9]. From now on, this kind of equation is used to describe a GNN.
$$x_v^{l+1} = \phi^l\left(\alpha\left(\{x_u^l\}_{u \in N(v)}\right),\ x_v^l\right). \tag{2}$$

1.4. Graph Neural Network Models

There exist various models of GNNs [2,3,4,5,6,7,15]. This study focused on a GNN that used convolution operators [2,3,4,5,6,7,15], contrary to the historical recurrent operators [16,17,18,19,20]. Convolution operators can either be spectral [8,9,10,11,12,21,22,23,24,25,26,27,28,29,30] or spatial [13,14,31,32].
A graph convolutional network (GCN) is one of the most straightforward models; it uses the simple convolution operator presented by Kipf and Welling, 2017 [10]. In this model, the update function is the same for all nodes of the graph. Equation (3) presents the GCN model [8,9,10,12,23,24], which uses an activation function σ (often non-linear, such as ReLU) and two matrices $W_0^l$ and $W_1^l$ that represent the multi-layer perceptron (MLP) shared across all nodes.
$$x_v^{l+1} = \sigma\left(x_v^l W_0^l + \sum_{u \in N(v)} \frac{x_u^l}{|N(u)|^{\frac{1}{2}}\, |N(v)|^{\frac{1}{2}}}\, W_1^l\right). \tag{3}$$
Equation (2) is the general form of Equation (3).
This model was later extended by Schlichtkrull et al., 2017 [29], who introduced different update functions depending on the type of connection between two nodes. The resulting model is called a relational GCN (R-GCN) because each type of relationship between two nodes induces a different convolution operator. This is prompted by the fact that the relationship connecting two nodes is meaningful and should be taken into account by the GNN. For instance, on Facebook, two users might be connected because they are friends or because they blocked each other, which are fundamentally different relationships. Equation (4) presents an R-GCN model, with the type of relationship between two nodes denoted r:
$$x_v^{l+1} = \sigma\left(x_v^l W_0^l + \sum_{r \in R} \sum_{u \in N_r(v)} \frac{x_u^l}{|N_r(v)|}\, W_r^l\right). \tag{4}$$
This model’s main weakness is its increased algorithmic complexity when dealing with a high number of relations. This problem is partially alleviated by using a basis decomposition for $W_r^l$ or by defining $W_r^l$ as a block-diagonal matrix. But even with these improvements, the model is not the most suitable for handling large graphs.
Generalizing even further, graph attention networks (GATs) [31,32,33] use an attention mechanism in the update function. The strength of a GAT is that there is no need for predefined relation sets; the attention mechanism defines the relationship between two nodes from their embeddings. Equation (5) presents a GAT model with the learnable attention function $A^l(x_u, x_v)$:
$$x_v^{l+1} = \sigma\left(\left[A^l(x_v, x_v) \cdot x_v^l + \sum_{u \in N(v)} A^l(x_u, x_v) \cdot x_u^l\right] W^l\right). \tag{5}$$
Similar to the R-GCN, this model is not easy to apply to large graphs, given its computational complexity.
The model used in this study for the sensor integrity assessment was a variant of the GCN called GraphSAGE, developed by Hamilton et al., 2018 [1,9,13]. This model uses a chosen node-order invariant aggregation function (such as the sum, average or maximum) to collect messages from the neighboring nodes, as shown in Figure 6. This model is efficient when operating on large graphs, which warrants its use.
Equation (6) presents the mathematical model of GraphSAGE, which is similar to Equation (2), with α as the chosen aggregation function (e.g., sum) and $\phi^l(a, b) = \sigma([b, a]\, W^l)$:
$$x_v^{l+1} = \sigma\left(\left[x_v^l,\ \mathrm{Agg}_{u \in N(v)}(x_u^l)\right] W^l\right). \tag{6}$$
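As an illustration, the GraphSAGE update of Equation (6) can be sketched as follows, reusing the toy graph format from the previous sketch; ReLU stands in for σ, and the aggregation function is selectable, as in the paper:

```python
import numpy as np

def graphsage_layer(x, neighbors, W, agg=np.max):
    """GraphSAGE update of Equation (6): x_v' = ReLU([x_v, Agg(x_u, u in N(v))] W)."""
    out = np.empty((x.shape[0], W.shape[1]))
    for v, nbrs in neighbors.items():
        message = agg(x[nbrs], axis=0)           # node-order invariant aggregation
        out[v] = np.maximum(0.0, np.concatenate([x[v], message]) @ W)  # ReLU as sigma
    return out
```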

1.5. Related Works

The model presented in this work was derived from the GraphSAGE model presented by Hamilton et al., 2018 [13] and aimed at detecting faulty sensors within a network. Therefore, this study built on previous works using GNNs for the following tasks:
  • Anomaly detection.
  • Node classification.
  • Sensor network monitoring.
There are plenty of graph neural networks specialized in anomaly detection [7], whether for IT security [34,35,36,37,38], time series [39] or the industrial Internet of Things (IoT) [40].
Node classification is one of the main tasks GNNs can perform [3,4,5,6,7,15,41], and many models have been developed toward this end [42,43,44,45].
Finally, with sensor-related improvements (IoT, increased precision and compatibility, etc.), sensor network monitoring is becoming simpler, and GNNs have been used for such applications [46,47,48,49,50,51,52,53].
The novelty of the work presented in this paper lay in the combination of these three elements: using a graph neural network for anomaly detection in a sensor network by employing node classification models. Similar work was achieved by Jiang and Luo, 2023 [54], where a model for sensor self-diagnosis was used. In Deng and Hooi, 2021 [39], a GNN was developed for the detection of anomalies in time series measured using sensors.
However, both these models use a partial GAT [31], whereas the model proposed in this paper was based on GraphSAGE [13]. The GraphSAGE model has reduced computational costs when dealing with large graphs compared with a GAT, which led to its selection for our application. Another novelty of this work lay in the modification of the GraphSAGE model using elements from the Graph Net framework [2].

2. Materials and Methods

This section presents the available data and the modifications applied to generate training datasets for our machine learning algorithms. Then, the machine learning model used and the comparative classification methods are described.

2.1. Generating a Training Dataset

As presented in the Introduction, one of the novelties of this study came from the use of industrial data originating from Andra’s URL. These data represent the responses of a subset of the thermal sensors used during the thermal loading of the HLW demonstrator cell. We collected the responses of one of the distributed optic fiber sensors (in blue in Figure 2), thus inducing a one-dimensional study case. These data can be viewed as a unidirectional graph, as shown in Figure 7.
Figure 8 presents the responses of the distributed sensor over time at various sample points. The lower temperatures were closer to the gallery (i.e., to the cold point) and the higher temperatures were closer to the heat sources.
Figure 9 shows the temperature along the distributed sensor for different time samples. Similar to Figure 8, we can observe the gallery at x = 0 p (where p is the sensor’s position and consecutive sensors were 5 cm apart), which corresponded to the cold source; heating elements were present in the second half of the HLW demonstrator; and the rock at x = 485 p acted as an insulator (given the small heat flow). The considered time step was one day, while the spatial sampling had a step size of 5 cm.
For the rest of this study, the responses of the distributed optic fiber sensors were considered as a series of point sensors. Thus, distributed sensor failures, such as the breaking of the optic fiber, were not modeled but could be easily inferred. The sensor failures considered in this work were partial failures [55,56,57,58,59]:
  • Bias.
  • Drifting.
  • Precision degradation.
  • Gain.
The data from the thermal loading of the HLW demonstrator measured by the optic fiber were assumed to contain no inaccuracies. Therefore, there was a need to introduce inappropriate responses for some of the sensors to simulate partial sensor failures.
The process used to induce inaccuracies is presented below. Before altering the sensor outputs, a time step was chosen. The sensor responses at that time step were the starting point of the process that added synthetic errors to the sensors’ outputs.
First, the number of sensors with inaccuracies needed to be determined. This number followed a discrete uniform distribution between 0 and 250 among the 485 measurement points. Equation (7) presents the distribution of the number of sensors that were degraded:
$$n_{\mathrm{degraded}} \sim \mathcal{U}_D\left([\![0, 250]\!]\right). \tag{7}$$
Next, the sensors to degrade were chosen via an unordered random draw without replacement. On average, this caused about one-quarter of the data to be degraded, as shown in Equation (8):
$$P(v\ \mathrm{faulty}) = \sum_{i=0}^{250} P(n_{\mathrm{degraded}} = i)\, P(v\ \mathrm{faulty} \mid n_{\mathrm{degraded}} = i) = \sum_{i=0}^{250} \frac{1}{251} \cdot \frac{i}{485} = \frac{125}{485} \approx 25.8\%. \tag{8}$$
Once the measurement points to degrade were selected, a mask of sensor integrity was created. This mask stored a Boolean value describing the state of each sensor (i.e., faulty or healthy), as showcased in Equation (9):
$$S_V = \{S_v,\ v \in V\} : \forall v \in V,\ S_v = \begin{cases} 0 & (\text{healthy}) \\ 1 & (\text{faulty}) \end{cases} \tag{9}$$
Inappropriate responses were then induced in the selected sensors. These inaccuracies were modeled by an offset of at least 2 °C and up to 8 °C in magnitude, which could be either positive or negative. These offsets followed a continuous uniform distribution, as described in Equation (10):
$$\Delta T \sim \mathcal{U}_C\left([-8\ ^\circ\mathrm{C}, -2\ ^\circ\mathrm{C}] \cup [2\ ^\circ\mathrm{C}, 8\ ^\circ\mathrm{C}]\right). \tag{10}$$
The complete process of inducing inaccuracies in otherwise accurate data is presented in Figure 10.
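A compact sketch of this degradation process, following Equations (7)–(10), is given below; the function name and random number generator setup are ours, not the authors’ code:

```python
import numpy as np

rng = np.random.default_rng()

def degrade(temperatures, n_points=485, n_max=250):
    """Inject synthetic faults following Equations (7)-(10).

    `temperatures` holds the clean responses of the 485 measurement points at
    the chosen time step; returns the degraded responses and the mask S_V.
    """
    n_degraded = rng.integers(0, n_max + 1)                        # Equation (7)
    faulty = rng.choice(n_points, size=n_degraded, replace=False)  # draw w/o replacement
    mask = np.zeros(n_points, dtype=int)
    mask[faulty] = 1                                               # Equation (9)
    magnitude = rng.uniform(2.0, 8.0, size=n_degraded)             # |offset| in [2, 8] degC
    sign = rng.choice([-1.0, 1.0], size=n_degraded)                # Equation (10)
    degraded = temperatures.copy()
    degraded[faulty] += sign * magnitude
    return degraded, mask
```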
Now that the process of adding inaccuracies to simulate sensor degradation has been presented, the inputs and outputs of our machine learning algorithm can be described. Figure 11 represents the inputs (in black) and outputs (in grey) of our model. The input was a graph that contained three successive sensor responses for all measurement points. The responses at the first two time steps were unaltered, clean sensor responses, as represented by the two dashed curves in Figure 11. However, the third set of responses was degraded following the aforementioned process, as shown by the solid black curve in Figure 11. The expected output was the sensor integrity mask that identified the degraded data, as shown by the grey curve in Figure 11.
The use of three consecutive measurements rested on the assumption that at least two correct values were available, from which the faulty measurements could then be identified incrementally. The two correct initial measurements (the initial condition) gave a baseline to our model.
The measurements at the first time step allowed the model to learn the temperature distribution. The measurements at the second time step enabled the model to learn the evolution of the temperature distribution over time, and therefore, permit a prediction of the temperature distribution at the third time step. Then, the measurements at the third time step introduced sensor degradation.
Using the method presented above, two datasets totaling 5000 inputs and outputs were created: a training dataset and a testing dataset. These datasets were derived from real measurements performed on the HLW demonstrator shown in Figure 2 and Figure 3. The exact same training and testing datasets were used throughout this study for the different kinds of machine learning algorithms presented. Therefore, a comparison between different machine learning models could be performed on the same testing dataset. Later, we compare the efficiency of the different trained models with different combinations of hyperparameters.

2.2. The Graph Neural Network Architecture

The GNN used for the node-level assessment of the integrity of the measurements was divided into three successive tasks:
  • The creation of the input graph.
  • The updating of the graph’s embeddings, i.e., the core of the GNN, using message-passing layers based on the GraphSAGE model.
  • The classification of each independent node.
The first element of the GNN, which is presented in Figure 12, took the unidirectional graph representing the different measurements along the considered optic fiber and added the edges’ embeddings (which could ultimately correspond to the heat flow between each pair of neighboring nodes). The initial graph ($G^0$) was composed of three successive time steps, namely, the first two unaltered steps ($T(t_1)$ and $T(t_2)$) and the third step with added inaccuracies ($T_d(t_3)$). This calculation was performed by a multi-layer perceptron (MLP) $W_{\mathrm{Flow}}$ applied to neighboring nodes; $W_{\mathrm{Flow}}$ was shared across all the edges. The nodes’ embeddings remained unchanged. Equations (11) and (12) model the first part of our GNN. This initialization aimed at introducing physics by creating edges’ embeddings similar to a physical flow of energy.
$$x_v^1 = x_v^0 = \begin{pmatrix} T(v, t_1) \\ T(v, t_2) \\ T_d(v, t_3) \end{pmatrix}. \tag{11}$$
$$e_{u,v}^1 = \mathrm{ReLU}\left([x_u, x_v]\, W_{\mathrm{Flow}}\right) \quad \text{with } u \in N(v), \qquad e_{u,v}^1 = \begin{pmatrix} e_1^1(u,v) \\ e_2^1(u,v) \\ e_3^1(u,v) \end{pmatrix}. \tag{12}$$
The nodes’ and edges’ embeddings of graphs $G^0$ and $G^1$ were three-dimensional vectors. Henceforth, all embeddings of any layer l of the GNN were three-dimensional vectors: $x_v^l = \left(x_1^l(v),\ x_2^l(v),\ x_3^l(v)\right)$ and $e_{u,v}^l = \left(e_1^l(u,v),\ e_2^l(u,v),\ e_3^l(u,v)\right)$.
The second element was the core of the GNN. It was used to update the graph embeddings, making them easier to interpret for the classification network. This element was composed of subsets called GNN layers. Each layer updated the embeddings of the nodes and edges, and by stacking these layers, the core of the GNN was created. Each layer used a variation of the GraphSAGE model, as shown in Figure 13 and Figure 14. The novelty of this model lay in the modification of the GraphSAGE model with elements of the Graph Net (GN) framework developed by Battaglia et al., 2018 [2]. This model aims at inferring physics by deducing the flow (on edges) from the energy levels (on nodes) and, inversely, at enforcing energy conservation (on nodes) through the accumulation of flows (on edges). Each layer updates a node’s embedding by aggregating its neighboring edges’ embeddings, and vice versa. Two update functions, $W_e^l$ and $W_x^l$, are used to update the edges’ and the nodes’ embeddings, respectively. These update functions depend on the layer l and are built using a multi-layer perceptron (MLP). Moreover, they are shared across all nodes and edges. Equations (13) and (14) describe the updating of the nodes’ and the edges’ embeddings:
$$x_v^{l+1} = \mathrm{ReLU}\left(\left[x_v^l,\ \mathrm{Agg}_{u \in N(v)}(e_{u,v}^l)\right] W_x^l\right). \tag{13}$$
$$e_{u,v}^{l+1} = \mathrm{ReLU}\left(\left[e_{u,v}^l,\ \mathrm{Agg}(x_u^l, x_v^l)\right] W_e^l\right) \quad \text{with } u \in N(v). \tag{14}$$
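The following sketch (ours, with assumed names, and with random numpy matrices standing in for the trained MLPs) illustrates one core layer applying Equations (13) and (14) literally, i.e., both updates read the layer-l embeddings:

```python
import numpy as np

def core_layer(x, e, edges, W_x, W_e, agg=np.mean):
    """One core layer following Equations (13) and (14).

    `edges` lists the (u, v) pairs of the chain graph; `x` and `e` hold the
    3-dimensional node and edge embeddings at layer l."""
    # Equation (14): infer each edge's flow from the energy levels at its two ends.
    e_new = np.stack([
        np.maximum(0.0,
                   np.concatenate([e[k], agg(np.stack([x[u], x[v]]), axis=0)]) @ W_e)
        for k, (u, v) in enumerate(edges)
    ])
    # Equation (13): update each node by accumulating the flows on its incident edges.
    x_new = np.empty_like(x)
    for v in range(x.shape[0]):
        incident = np.stack([e[k] for k, (a, b) in enumerate(edges) if v in (a, b)])
        x_new[v] = np.maximum(0.0, np.concatenate([x[v], agg(incident, axis=0)]) @ W_x)
    return x_new, e_new
```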
The third element of the network was the classifier presented in Figure 15. Its role was to classify each node with respect to the sensor integrity. This was a local element that, for each node, took the embedding of the corresponding node and the embeddings of its neighboring edges in order to classify the node’s sensor state as healthy or faulty. This calculation was undertaken by an MLP denoted $W_{\mathrm{CLA}}$, which was shared across all the nodes. Equation (15) describes the network’s Boolean prediction $\hat{S}_v$ of the sensor’s state at node v:
$$\hat{S}_v = \mathrm{Sigmoid}\left(\left[x_v^N,\ \mathrm{Agg}_{u \in N(v)}(e_{u,v}^N)\right] W_{\mathrm{CLA}}\right). \tag{15}$$
The loss function used for this model was a variation of the binary cross-entropy (BCE) [60] described by Equation (16):
$$\mathcal{L}(S_V, \hat{S}_V) = -\frac{1}{n} \sum_{v \in V} \left[ w\, S_v \log(\hat{S}_v) + (1 - S_v) \log(1 - \hat{S}_v) \right]. \tag{16}$$
We used a weight w to alter the BCE, with the aim of improving the detection of faulty sensors. Increasing the weight w reduced the rate of false negatives (FNs) but increased the rate of false positives (FPs). This tradeoff was acceptable because the false negatives (FNs) were the most critical errors in our system, as presented in Figure 16 later in this section.
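A minimal sketch of this weighted BCE (Equation (16)) is given below; the default value of w is arbitrary, since the values actually tested are those of Table 1:

```python
import numpy as np

def weighted_bce(s_true, s_pred, w=3.0, eps=1e-7):
    """Equation (16): BCE with the faulty class re-weighted by w.

    A larger w penalizes missed faults (FNs) more than false alarms (FPs)."""
    s_pred = np.clip(s_pred, eps, 1.0 - eps)     # numerical safety
    terms = w * s_true * np.log(s_pred) + (1.0 - s_true) * np.log(1.0 - s_pred)
    return -terms.mean()
```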
The different hyperparameters used in defining the GNN were as follows:
  • The size of the various MLPs in terms of the number of neurons.
  • The aggregation functions used for the message-passing layers and the classifier.
  • The number of message-passing layers.
  • The weight w used in the loss function.
The main idea was to train multiple GNNs with different hyperparameters and benchmark them against each other. Table 1 presents all the different hyperparameters tested.
Every combination of the hyperparameters presented in Table 1 was used to train a GNN. In total, there were 162 ($3^4 \times 2$) unique sets of hyperparameters. For each set of hyperparameters, 10 similar GNNs (for a total of 1620) were trained on the unique training dataset generated following the process presented in Figure 10.
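The grid can be sketched as follows; the aggregation functions, layer counts and MLP size labels come from the text, while the two values of w are our assumption for the two-valued hyperparameter:

```python
import itertools

grid = {
    "mlp_size": ["small", "medium", "large"],
    "agg_message_passing": ["sum", "mean", "max"],
    "agg_classifier": ["sum", "mean", "max"],
    "n_message_passing_layers": [0, 1, 2],
    "w": [1.0, 3.0],  # assumed values; only the count (two) is known
}
configs = [dict(zip(grid, values)) for values in itertools.product(*grid.values())]
assert len(configs) == 162  # 3**4 * 2; 10 GNNs are trained per configuration
```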
The training parameters were identical for each GNN:
  • The optimizer used was the Adam stochastic gradient descent algorithm [61].
  • A total of 50 training epochs.
  • A batch size of 20 per iteration of the gradient descent algorithm.
  • A validation split of 15% (same split for all the networks).
Each GNN was then tested on the unique testing dataset. The results were then displayed in a confusion matrix [62], as presented in Figure 16.
We note that the sensors’ data were later used in predictive models; if a sensor was detected as faulty, its data were not used in these models. As such, false positives (FPs) slightly decreased the efficiency of these models because they lowered the amount of usable data. However, false negatives (FNs) fed wrong information to the predictive models, making them a major threat.

2.3. The Thresholding Classification Method

To assess the efficiency of the GNN for the classification of the sensors’ states, a comparison with established methods was necessary, the first being the thresholding method described below. This method was based on the same inputs and outputs as the GNN and was decomposed into the following steps:
  • The prediction of the temperature distribution at the third time step, assuming no errors, was derived from the distributions at the first two time steps. For this purpose, a linear extrapolation in time was used. Equation (17) presents how the prediction of the temperature at the third time step was obtained.
  • The prediction of the temperature distribution was compared against the deteriorated response (at the third time step) using a threshold ε, under which the sensor was deemed healthy and over which the sensor was deemed faulty. Equation (18) shows how the threshold value was used to assess the sensor integrity.
  • Multiple thresholds ε were tested and the one with the best results was used.
$$\hat{T}_3 = T_2 + (T_2 - T_1). \tag{17}$$
$$\begin{cases} |\hat{T}_3 - T_3| < \varepsilon & \Rightarrow\ S_v = 0\ (\text{healthy}) \\ |\hat{T}_3 - T_3| \geq \varepsilon & \Rightarrow\ S_v = 1\ (\text{faulty}) \end{cases} \tag{18}$$
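A minimal sketch of this thresholding classifier is given below; selecting ε by maximizing accuracy on the training set is our assumption for "the best results":

```python
import numpy as np

def threshold_classify(T1, T2, T3, eps):
    """Equations (17) and (18): linear extrapolation in time, then thresholding."""
    T3_hat = T2 + (T2 - T1)                            # Equation (17)
    return (np.abs(T3_hat - T3) >= eps).astype(int)    # 1 = faulty, Equation (18)

def best_threshold(T1, T2, T3, mask, candidates):
    """Sweep candidate thresholds and keep the one with the best training accuracy."""
    return max(candidates,
               key=lambda eps: np.mean(threshold_classify(T1, T2, T3, eps) == mask))
```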

2.4. The Multi-Layer-Perceptron-Based Classifier

The second classification method used to evaluate the GNN model was a simple feedforward neural network. This neural network took as input the three consecutive temperature responses at one of the graph’s nodes and output the sensor state at that node. Thus, the input and output dimensions were, respectively, three and one.
This multi-layer perceptron was composed of the following dense layers:
  • Layer 1 was composed of 15 neurons and used ReLU as its activation function.
  • Layer 2 was composed of five neurons and used ReLU as its activation function.
  • Layer 3 was composed of one neuron and used sigmoid as its activation function.
This multi-layer perceptron’s training parameters were as follows:
  • The optimizer used was the Adam stochastic gradient descent algorithm [61].
  • A total of 200 training epochs.
  • A batch size of 200 per iteration of the gradient descent algorithm.
  • A validation split of 15%.
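For illustration, the architecture and training settings above translate into the following Keras sketch; the loss function and metric are our assumptions, and the fit call is commented out because the data are not public:

```python
from tensorflow import keras

# The 3-15-5-1 architecture described above; the loss and metric are our assumptions.
model = keras.Sequential([
    keras.Input(shape=(3,)),                      # three consecutive temperatures at a node
    keras.layers.Dense(15, activation="relu"),    # layer 1
    keras.layers.Dense(5, activation="relu"),     # layer 2
    keras.layers.Dense(1, activation="sigmoid"),  # layer 3: probability of being faulty
])
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
# model.fit(X_train, y_train, epochs=200, batch_size=200, validation_split=0.15)
```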

2.5. The Decision Tree Classification Method

The last model used to evaluate the GNN’s performance was a decision tree. This model shared the same inputs and outputs as the multi-layer perceptron and the thresholding method. The scikit-learn [63] decision tree model was used.
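A minimal scikit-learn sketch is shown below; since the paper does not detail the tree’s settings, the default hyperparameters are assumed:

```python
from sklearn.tree import DecisionTreeClassifier

tree = DecisionTreeClassifier()      # default hyperparameters assumed
# tree.fit(X_train, y_train)         # X: (n_samples, 3) temperatures, y: 0/1 states
# y_pred = tree.predict(X_test)
```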

3. Results

This section provides the results from the different methods used on the testing dataset. Different metrics are then given to evaluate the performances of the different methods. Finally, these results are compared with each other.

3.1. Results of the GNNs

This section shows the results of each of the 1620 GNNs tested on the unique testing dataset. Using the confusion matrix presented in Figure 16, it was possible to use recall metrics [62,64,65,66], such as the true positive rate (TPR) and the true negative rate (TNR), to plot the efficiency of each GNN. Figure 17 presents the recall metrics on the testing dataset for each trained GNN. Equations (19) and (20) describe the recall metrics:
$$\mathrm{TPR} = \frac{TP}{TP + FN} = \frac{TP}{P} \tag{19}$$
$$\mathrm{TNR} = \frac{TN}{TN + FP} = \frac{TN}{N} \tag{20}$$
Then, by using the accuracy metric [62,64,65,66], it was possible to plot the proportion of GNNs that attained certain accuracy thresholds. This is shown in Figure 18. Equation (21) presents the accuracy metric:
$$\mathrm{Accuracy} = \frac{TP + TN}{P + N} = \frac{TP + TN}{n_{\mathrm{sample}}} \tag{21}$$
The same analysis could be performed using precision metrics [62,64,65,66], such as positive predictive value (PPV) and negative predictive value (NPV). Figure 19 shows the distribution of the precision metrics on the testing dataset for each GNN tested. Equations (22) and (23) describe the precision metrics:
$$\mathrm{PPV} = \frac{TP}{TP + FP} = \frac{TP}{PP} \tag{22}$$
$$\mathrm{NPV} = \frac{TN}{TN + FN} = \frac{TN}{PN} \tag{23}$$
Eventually, the same could be achieved using the F1-scores [62,64,65,66], and the results are presented in Figure 20. The F1-score is the harmonic mean of the recall and precision, as showcased by Equation (24):
$$F_1 = \frac{2}{\mathrm{recall}^{-1} + \mathrm{precision}^{-1}} \tag{24}$$
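For reference, the following sketch computes all of the above metrics (Equations (19)–(24)) from the four cells of a confusion matrix:

```python
def classification_metrics(tp, tn, fp, fn):
    """Recall, precision, accuracy and F1-score, Equations (19)-(24)."""
    tpr = tp / (tp + fn)                    # recall on faulty sensors
    tnr = tn / (tn + fp)
    ppv = tp / (tp + fp)                    # precision on faulty sensors
    npv = tn / (tn + fn)
    accuracy = (tp + tn) / (tp + tn + fp + fn)
    f1 = 2.0 / (1.0 / tpr + 1.0 / ppv)      # harmonic mean of recall and precision
    return dict(TPR=tpr, TNR=tnr, PPV=ppv, NPV=npv, accuracy=accuracy, F1=f1)
```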
Then, it was interesting to evaluate which of the hyperparameters presented in the previous section had the most influence on the performance of the GNNs. Figure 21 and Figure 22 present the impacts of the various hyperparameters. Figure 21 shows the proportion of networks that performed over a certain accuracy, similarly to Figure 18, only this time, the population is split into groups with regard to the hyperparameters. Each row of the figure corresponds to a hyperparameter (aggregation function, number of layers, etc.), and the subfigure on the right is a magnified view of the figure on the left. This magnified view is meaningful because it showcases which hyperparameter gave the best results when only the best GNNs were taken into account. Thus, the GNNs that did not learn sufficiently were out of the equation.
Figure 22 presents the distribution of hyperparameters for the 100 top-performing GNNs that were trained. Subfigure (a) presents the aggregation functions used for the classification and the message passing, subfigure (b) shows how many layers of message passing the top-performing GNNs used, subfigure (c) shows the sizes of the MLPs used and subfigure (d) presents the w coefficient used in the BCE loss function.

3.2. Results of the Thresholding

Figure 23 presents the value of ε optimized on the training dataset to provide the best overall performance using the recall metrics.
Later on, the thresholding method was evaluated on the testing dataset using the same threshold ε identified using the training dataset. The confusion matrix of the thresholding method is shown in Table 2.

3.3. Results of the Multi-Layer-Perceptron-Based Classifier

Similar to the GNN, the multi-layer-perceptron-based classifier was optimized on the training dataset and was then applied to the test dataset. The confusion matrix associated with this method is shown in Table 3.

3.4. Results of the Decision Tree Classifier

Table 4 presents the confusion matrix that resulted from the application of the decision tree optimized on the training dataset to the test dataset.

3.5. Trained GNNs Compared against the Other Classification Methods

Table 5 provides the metrics for the five top-performing GNNs and the other standard classification methods that used the test dataset. The hyperparameters used in the top-performing GNNs are presented in Table 6.

4. Discussion

This section explores the results presented in the previous section. It aims to provide an analysis of the hyperparametric study, review the efficiency of the used model and tackle the limitations of the selected GNN architecture.

4.1. The Impacts of Different Hyperparameters

The impacts of the different hyperparameters are discussed using Figure 21 and Figure 22. To better understand Figure 21, subfigures 1 on the left and subfigures 2 on the right must be distinguished. Subfigures 1 represent the whole population of trained GNNs for one hyperparameter, whereas subfigures 2 present only the 30% best networks for this hyperparameter. This means that subfigures 1 showcase which hyperparameter had the better odds of producing a GNN with an accuracy from 70% up to 99%. In contrast, subfigures 2 focus only on networks whose accuracy exceeded 99.5%. The choice between these two views depends on the objective of the analysis.
First of all, one may choose the hyperparameters that performed the best according to subfigures 1 if the aims are as follows:
  • To have a set of hyperparameters that performs well on average.
  • To train only a few GNNs and have decent results.
On the other hand, one may choose the hyperparameters that perform the best according to subfigures 2 if the goals are as follows:
  • To have the set of hyperparameters that produces the best networks, even though many trained GNNs might have relatively poor accuracy.
  • To train a lot of GNNs and pick out the top performers.
Moreover, Figure 22 only presents a distribution of the 100 top-performing GNNs. Thus, it is to be used in the same manner as subfigures 2.
The choice of an aggregation function (both for the classifier and the layers of the GNN) had little impact on the general accuracy of the GNN, as showcased by Figure 21(a1,b1). However, among the top-performing networks, the sum function seemed to perform worse than the max and average functions, as shown in Figure 21(a2,b2) and Figure 22a.
This might have been because the sum depended on the size of the neighborhood, whereas the average and maximum were normalized. Indeed, two elements in the graph had only one neighbor (the extremities, i.e., nodes $v_1$ and $v_n$, as shown in Figure 7), while the rest of the nodes had two. This might be why the networks with the sum as their aggregation function underperformed.
Another impactful parameter was the number of layers of message passing used. Figure 21(c1) shows that the networks with zero layers of message passing performed better overall than the networks with one layer of message passing, which, in turn, performed better overall than the networks with two layers of message passing. However, among the top-performing (15% best) networks, the GNNs with one layer of message passing seemed to perform better, as showcased by Figure 21(c2) and Figure 22b.
This result is counterintuitive. Indeed, it was expected that more layers of message passing would improve the results of the GNN. This can be explained by the limited number of epochs: adding another layer of message passing complicates the optimization of the GNN, and therefore requires more effort and data to learn the correct weights of the network. Moreover, a GNN with a higher number of layers requires optimizing more weights, which increases the risk of falling into a local minimum. Both these phenomena can explain why having fewer layers of message passing increased the overall accuracy but did not transfer to the top-performing GNNs, for which the added complexity permitted better results.
Furthermore, Figure 21(d1) suggests that the medium-sized MLPs performed better overall and that the large-sized MLPs performed worse overall. Figure 21(d2) and Figure 22c seem to show the same trend. This could have been an effect of the same optimization-complexity phenomenon exemplified by the number of GNN layers, but it could also be linked to the inherent dimension of the data. Indeed, using an embedding dimension smaller than the problem’s inherent dimension limits the ability to learn because the GNN cannot take into account all the problem’s variables, but a higher dimension can also hinder learning because the information becomes too diluted.
Moreover, as expected, when the importance of detecting a faulty sensor was increased through the weight w in the BCE formula (Equation (16)), the rate of TPs increased, whereas the rate of TNs decreased.
Finally, this analysis did not take into consideration the correlations between sets of hyperparameters. For instance, small-sized neural networks may perform well with two layers of message passing but not with zero or one. However, such an analysis would be complex, and singling out hyperparameters already showed good empirical results.

4.2. Efficiency of GNNs for Sensors’ Integrity Assessment

As presented in Table 5, the most precise GNNs outperformed the thresholding method for every metric, which demonstrated the effectiveness of the model for assessing the sensors’ integrity. However, the GNN had a longer training time and a slightly longer prediction time. Although the computation time is a relevant comparison criterion, in this application, the measurements are performed daily, and therefore, there is no need for real-time computation.
When compared with the MLP-based classifier, the most precise GNNs had relatively similar results for all metrics. These results may bring into question the appeal of a GNN for the studied problem; however, it is worth pointing out that the data used here contained very few topological components given that they were one-dimensional. Moreover, the temperature profiles of the experiment used, presented in Figure 8 and Figure 9, were rather simple because they involved a relatively uniform heat source. The model for sensor failure was also quite simple (i.e., an offset from 2 °C to 8 °C). The GNN model should outperform the MLP by a larger margin on a more complex graph, a more complex heat source distribution or a more intricate sensor failure model, thanks to its ability to gain insights into the underlying physical information and topology.
The decision tree outperformed the GNN for a majority of the metrics but tended to produce more false negatives than the MLP and GNN, which means it might not be the best method for our problem.
In a nutshell, the performance of the method proposed in this work was similar to that of common classification methods for one-dimensional, relatively homogeneous data. This confirmed that the method was effective for detecting sensor failures.

4.3. Upscaling the Model to Complex Networks

The next logical step of this research is to apply this model to more complex sensor networks, for instance, a network composed of all the sensors presented in Figure 2. Two challenges seem to arise from this upscaling: adapting the model to a complex graph and creating the graph of the sensors.
The first challenge can be almost entirely bypassed because the model is based on node-order invariant aggregation functions, such as the sum, average or maximum. However, as discussed in the previous subsection, the sum function is not normalized by the neighborhood size, which might cause issues during upscaling. But for the most part, applying the same GNN to a different graph is completely feasible. Moreover, because this model is based on the GraphSAGE model [13], it deals efficiently with large graphs and requires fewer computational resources than other GNN models.
The second and main challenge of this upscaling is to create the sensor network’s graph. This task is tricky because we need to define a systematic way to connect various nodes (i.e., sensors) without the graph becoming too dense or too sparse. Indeed, a very dense graph loses topological expressiveness, in addition to being very costly to compute with a GNN. This task is even more complex if we consider that the sensors are not all in the same part of the cell (some are in concrete, others in a cylinder liner, etc.).

4.4. Limitations of the Used Model

The first limitation of the model used was linked to the definition of the sensor failure model, which was considered in this work to be a simple offset restricted to between 2 °C and 8 °C in magnitude. This kind of failure is easier to pinpoint than real errors for two main reasons: first, it is a homogeneous definition, meaning all faults are similar and thus easier to identify; second, it is a simple definition that does not represent all the various ways a sensor can malfunction. This limitation could be lifted by integrating each type of sensor fault separately in the model and by having one class for each type of sensor failure.
The second limitation of the model used was the training complexity: in order to obtain a GNN with decent predictive capabilities, multiple training sessions are required. It is therefore necessary to define a list of objective metrics to identify the top-performing networks without bias. This also means that if a new type of fault is identified, the models will need to be retrained in order to measure their performance on this particular problem.
The third limitation was technical and linked to the computational resources required to run a GNN over large graphs, which requires a lot of RAM (random access memory) and processing power. This limitation could be lifted using subgraph learning [67] and by setting up multithreading. Subgraph learning consists of training the GNN on subsets of the initial graph and then reconstructing the network on the whole graph, hence reducing the amount of memory required to store the model.
Another limitation was the use of the GraphSAGE model, which lacks the generalization capabilities of the attention mechanism used in a GAT [31]. However, when dealing with large graphs, the computation of the attention mechanism is very costly. Therefore, there is a tradeoff between the generalization capabilities and the computational resources required for training and predictions.

5. Conclusions

In this work, we proposed a novel method based on graph neural networks to assess a sensor’s response integrity. The method was applied to real data obtained using Andra’s HLW demonstrator cell. The method was compared to standard models (i.e., thresholding, MLP and decision tree) and showed a similar performance. The method could perform even better when dealing with more complex data (from a topological and thermal standpoint). Moreover, the GNN could adapt more easily to various data topologies, which warrants its use for the assessment of sensor integrity for nuclear waste monitoring. Multiple GNNs were trained and compared to find the optimal neural network hyperparameters. The results show that a single message-passing layer was often enough for the selected application, while multiple message-passing layers were harder to train and could result in overfitting.
Future works will deal with the whole sensor network instead of only the data along one of the optic fibers (see Figure 2). The main challenge of this upgrade will be the creation of the sensors’ graph, which will have to consider the location of each sensor (i.e., the structural element the sensor is mounted on). In contrast, the scaling of the model from a one-dimensional graph to a complex graph is not a concern since the architecture of the GNN remains the same. Eventually, future works may also explore other GNN models and refined sensor failure models.
Further developments will include interpolation under dynamic conditions (evolution of the sensor network, the medium, etc.) using a GNN. Indeed, once a faulty sensor is identified, it is paramount to evaluate the temperature at this spot to ensure that numerical models can continue to monitor the facility.

Author Contributions

Conceptualization, C.G., J.C. and F.C.; methodology, P.H. and C.G.; software, P.H.; validation, C.G., J.C. and F.C.; formal analysis, P.H.; investigation, P.H.; resources, C.G. and J.C.; data curation, J.C.; writing—original draft preparation, P.H.; writing—review and editing, P.H., C.G., J.C. and F.C.; visualization, P.H.; supervision, C.G., J.C. and F.C. All authors read and agreed to the published version of the manuscript.

Funding

This research was funded by Andra. Pierre Hembert is a PhD student under an ANDRA-AMValor doctoral contract.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data used in this paper are the property of Andra, and therefore, are not available.

Acknowledgments

The authors acknowledge the role played by Andra. The following Python libraries were instrumental in developing this model: numpy [68], keras [69], plotly [70] and scikit-learn [63].

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

Notations
Variable — Definition
G — Graph
V — Set of a graph’s nodes
n — Number of nodes in the graph (|V|)
E — Set of a graph’s edges
R — Set of the graph’s relations (edge types)
u, v, v_i — Individual nodes
(u, v) — Individual edge adjacent to nodes u and v
r — Individual relation
N(v) — Neighborhood of node v (subset of V)
N_r(v) — Neighborhood of node v given a relation r (subset of N(v))
l — Individual graph neural network layer
N — Graph neural network’s final layer
ϕ — General update function
W — Weight matrix of a neural network
σ — Non-linear activation function
ReLU — Rectified linear unit function
Sigmoid — Sigmoid function
α — Aggregation function
Agg — Node-order invariant aggregation function
A — Attention function
L — Loss function
x_v^l — Embedding of node v at layer l
e_{u,v}^l — Embedding of edge (u, v) at layer l
X_V^l — Set of the nodes’ embeddings at layer l
X_E^l — Set of the edges’ embeddings at layer l
X_G^l — Set of the graph’s embeddings at layer l
S_v — Real sensor state at node v
Ŝ_v — Predicted sensor state at node v
S_V — Set of the sensor states’ real values
Ŝ_V — Set of the sensor states’ predicted values
T — Temperature in degrees Celsius
T̂ — Predicted temperature in degrees Celsius
t — Time in days
p — Sensor’s position
P — Probability
U_D — Discrete uniform distribution
U_C — Continuous uniform distribution
ε — Threshold value in degrees Celsius
log — Natural logarithm function
Abbreviations
GNN — Graph neural network
RecGNN — Recurrent graph neural network
GCN — Graph convolutional network
R-GCN — Relational graph convolutional network
GAT — Graph attention network
MLP — Multi-layer perceptron
URL — Underground Research Laboratory (of Andra)
HLW — High-level radioactive waste
ILW — Intermediate-level radioactive waste
ILW-LL — Intermediate-level long-lived radioactive waste
LLW — Low-level radioactive waste
HAW — High-activity waste
Agg — Aggregation
BCE — Binary cross-entropy
P — Real positive
N — Real negative
PP — Predicted positive
PN — Predicted negative
TN — True negative
TP — True positive
FN — False negative
FP — False positive
TNR — True negative rate
TPR — True positive rate
NPV — Negative predictive value
PPV — Positive predictive value

References

  1. Sanchez-Lengeling, B.; Reif, E.; Pearce, A.; Wiltschko, A.B. A Gentle Introduction to Graph Neural Networks. Distill 2021, 6, e33. [Google Scholar] [CrossRef]
  2. Battaglia, P.W.; Hamrick, J.B.; Bapst, V.; Sanchez-Gonzalez, A.; Zambaldi, V.; Malinowski, M.; Tacchetti, A.; Raposo, D.; Santoro, A.; Faulkner, R.; et al. Relational inductive biases, deep learning, and graph networks. arXiv 2018, arXiv:1806.01261. [Google Scholar] [CrossRef]
  3. Wu, Z.; Pan, S.; Chen, F.; Long, G.; Zhang, C.; Yu, P.S. A Comprehensive Survey on Graph Neural Networks. IEEE Trans. Neural Netw. Learn. Syst. 2019, 32, 4–24. [Google Scholar] [CrossRef]
  4. Zhang, Z.; Cui, P.; Zhu, W. Deep Learning on Graphs: A Survey. arXiv 2020, arXiv:1812.04202. [Google Scholar] [CrossRef]
  5. Zhou, J.; Cui, G.; Hu, S.; Zhang, Z.; Yang, C.; Liu, Z.; Wang, L.; Li, C.; Sun, M. Graph neural networks: A review of methods and applications. AI Open 2020, 1, 57–81. [Google Scholar] [CrossRef]
  6. Hoang, V.T.; Jeon, H.J.; You, E.S.; Yoon, Y.; Jung, S.; Lee, O.J. Graph Representation Learning and Its Applications: A Survey. Sensors 2023, 23, 4168. [Google Scholar] [CrossRef]
  7. Nguyen, H.X.; Zhu, S.; Liu, M. A Survey on Graph Neural Networks for Microservice-Based Cloud Applications. Sensors 2022, 22, 9492. [Google Scholar] [CrossRef] [PubMed]
  8. Gilmer, J.; Schoenholz, S.S.; Riley, P.F.; Vinyals, O.; Dahl, G.E. Neural Message Passing for Quantum Chemistry. arXiv 2017, arXiv:1704.01212. [Google Scholar] [CrossRef]
  9. Daigavane, A.; Ravindran, B.; Aggarwal, G. Understanding Convolutions on Graphs. Distill 2021, 6, e32. [Google Scholar] [CrossRef]
  10. Kipf, T.N.; Welling, M. Semi-Supervised Classification with Graph Convolutional Networks. arXiv 2017, arXiv:1609.02907. [Google Scholar] [CrossRef]
  11. Defferrard, M.; Bresson, X.; Vandergheynst, P. Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering. arXiv 2017, arXiv:1606.09375. [Google Scholar] [CrossRef]
  12. Wu, F.; Souza, A.; Zhang, T.; Fifty, C.; Yu, T.; Weinberger, K.Q. Simplifying Graph Convolutional Networks. arXiv 2019, arXiv:1902.07153. [Google Scholar] [CrossRef]
  13. Hamilton, W.L.; Ying, R.; Leskovec, J. Inductive Representation Learning on Large Graphs. arXiv 2018, arXiv:1706.02216. [Google Scholar] [CrossRef]
  14. Xu, K.; Hu, W.; Leskovec, J.; Jegelka, S. How Powerful are Graph Neural Networks? arXiv 2019, arXiv:1810.00826. [Google Scholar] [CrossRef]
  15. Chen, Z.; Chen, F.; Zhang, L.; Ji, T.; Fu, K.; Zhao, L.; Chen, F.; Wu, L.; Aggarwal, C.; Lu, C.T. Bridging the Gap between Spatial and Spectral Domains: A Survey on Graph Neural Networks. arXiv 2021, arXiv:2002.11867. [Google Scholar] [CrossRef]
  16. Weisfeiler, B.Y.; Leman, A.A. The Reduction of a Graph to Canonical Form and the Algebra which Appears therein. NTI Ser. 1968, 2, 12–16. [Google Scholar]
  17. Scarselli, F.; Gori, M.; Tsoi, A.C.; Hagenbuchner, M.; Monfardini, G. The Graph Neural Network Model. IEEE Trans. Neural Netw. 2009, 20, 61–80. [Google Scholar] [CrossRef]
  18. Sperduti, A.; Starita, A. Supervised neural networks for the classification of structures. IEEE Trans. Neural Netw. 1997, 8, 714–735. [Google Scholar] [CrossRef]
  19. Zhou, D.; Bousquet, O.; Lal, T.N.; Weston, J.; Schölkopf, B. Learning with Local and Global Consistency. Adv. Neural Inf. Process. Syst. 2003, 16. Available online: https://proceedings.neurips.cc/paper_files/paper/2003/file/87682805257e619d49b8e0dfdc14affa-Paper.pdf (accessed on 18 December 2023).
  20. Micheli, A. Neural network for graphs: A contextual constructive approach. IEEE Trans. Neural Netw. 2009, 20, 498–511. [Google Scholar] [CrossRef] [PubMed]
  21. Bruna, J.; Zaremba, W.; Szlam, A.; LeCun, Y. Spectral Networks and Locally Connected Networks on Graphs. arXiv 2014, arXiv:1312.6203. [Google Scholar] [CrossRef]
  22. He, M.; Wei, Z.; Wen, J.R. Convolutional Neural Networks on Graphs with Chebyshev Approximation, Revisited. arXiv 2022, arXiv:2202.03580. [Google Scholar] [CrossRef]
  23. Niepert, M.; Ahmed, M.; Kutzkov, K. Learning Convolutional Neural Networks for Graphs. arXiv 2016, arXiv:1605.05273. [Google Scholar] [CrossRef]
  24. Dehmamy, N.; Barabási, A.L.; Yu, R. Understanding the Representation Power of Graph Neural Networks in Learning Graph Topology. arXiv 2019, arXiv:1907.05008. [Google Scholar] [CrossRef]
  25. Atwood, J.; Towsley, D. Diffusion-Convolutional Neural Networks. arXiv 2016, arXiv:1511.02136. [Google Scholar] [CrossRef]
  26. Wang, Y.; Sun, Y.; Liu, Z.; Sarma, S.E.; Bronstein, M.M.; Solomon, J.M. Dynamic Graph CNN for Learning on Point Clouds. arXiv 2019, arXiv:1801.07829. [Google Scholar] [CrossRef]
  27. Levie, R.; Isufi, E.; Kutyniok, G. On the Transferability of Spectral Graph Filters. arXiv 2019, arXiv:1901.10524. [Google Scholar] [CrossRef]
  28. Hammond, D.K.; Vandergheynst, P.; Gribonval, R. Wavelets on Graphs via Spectral Graph Theory. arXiv 2009, arXiv:0912.3848. [Google Scholar] [CrossRef]
  29. Schlichtkrull, M.; Kipf, T.N.; Bloem, P.; Van Den Berg, R.; Titov, I.; Welling, M. Modeling Relational Data with Graph Convolutional Networks. arXiv 2017, arXiv:1703.06103. [Google Scholar] [CrossRef]
  30. Beaini, D.; Passaro, S.; Létourneau, V.; Hamilton, W.L.; Corso, G.; Liò, P. Directional Graph Networks. arXiv 2021, arXiv:2010.02863. [Google Scholar] [CrossRef]
  31. Veličković, P.; Cucurull, G.; Casanova, A.; Romero, A.; Liò, P.; Bengio, Y. Graph Attention Networks. arXiv 2018, arXiv:1710.10903. [Google Scholar] [CrossRef]
  32. Knyazev, B.; Taylor, G.W.; Amer, M.R. Understanding Attention and Generalization in Graph Neural Networks. arXiv 2019, arXiv:1905.02850. [Google Scholar] [CrossRef]
  33. Zheng, X.; Liu, Y.; Pan, S.; Zhang, M.; Jin, D.; Yu, P.S. Graph Neural Networks for Graphs with Heterophily: A Survey. arXiv 2022, arXiv:2202.07082. [Google Scholar] [CrossRef]
  34. He, Z.; Chen, P.; Li, X.; Wang, Y.; Yu, G.; Chen, C.; Li, X.; Zheng, Z. A Spatiotemporal Deep Learning Approach for Unsupervised Anomaly Detection in Cloud Systems. IEEE Trans. Neural Netw. Learn. Syst. 2023, 34, 1705–1719. [Google Scholar] [CrossRef] [PubMed]
  35. Jacob, S.; Qiao, Y.; Ye, Y.; Lee, B. Anomalous distributed traffic: Detecting cyber security attacks amongst microservices using graph convolutional networks. Comput. Secur. 2022, 118, 102728. [Google Scholar] [CrossRef]
  36. Somashekar, G.; Dutt, A.; Vaddavalli, R.; Varanasi, S.B.; Gandhi, A. B-MEG: Bottlenecked-Microservices Extraction Using Graph Neural Networks. In Proceedings of the Companion of the 2022 ACM/SPEC International Conference on Performance Engineering, Beijing, China, 9–13 April 2022; ICPE ’22. Association for Computing Machinery: New York, NY, USA, 2022; pp. 7–11. [Google Scholar] [CrossRef]
  37. Chen, J.; Huang, H.; Chen, H. Informer: Irregular traffic detection for containerized microservices RPC in the real world. High-Confid. Comput. 2022, 2, 100050. [Google Scholar] [CrossRef]
  38. Zhang, C.; Peng, X.; Sha, C.; Zhang, K.; Fu, Z.; Wu, X.; Lin, Q.; Zhang, D. DeepTraLog: Trace-log combined microservice anomaly detection through graph-based deep learning. In Proceedings of the 44th International Conference on Software Engineering, Pittsburgh, PA, USA, 21–29 May 2022; ICSE ’22. Association for Computing Machinery: New York, NY, USA, 2022; pp. 623–634. [Google Scholar] [CrossRef]
  39. Deng, A.; Hooi, B. Graph Neural Network-Based Anomaly Detection in Multivariate Time Series. Proc. AAAI Conf. Artif. Intell. 2021, 35, 4027–4035. [Google Scholar] [CrossRef]
  40. Wu, Y.; Dai, H.N.; Tang, H. Graph Neural Networks for Anomaly Detection in Industrial Internet of Things. IEEE Internet Things J. 2022, 9, 9214–9231. [Google Scholar] [CrossRef]
  41. Skarding, J.; Gabrys, B.; Musial, K. Foundations and modelling of dynamic networks using Dynamic Graph Neural Networks: A survey. IEEE Access 2021, 9, 79143–79168. [Google Scholar] [CrossRef]
  42. Maekawa, S.; Noda, K.; Sasaki, Y.; Onizuka, M. Beyond Real-world Benchmark Datasets: An Empirical Study of Node Classification with GNNs. arXiv 2022, arXiv:2206.09144. [Google Scholar] [CrossRef]
  43. Liu, Z.; Jiang, Z.; Zhong, S.; Zhou, K.; Li, L.; Chen, R.; Choi, S.H.; Hu, X. Editable Graph Neural Network for Node Classifications. arXiv 2023, arXiv:2305.15529. [Google Scholar] [CrossRef]
  44. Xiao, S.; Wang, S.; Dai, Y.; Guo, W. Graph neural networks in node classification: Survey and evaluation. Mach. Vis. Appl. 2021, 33, 4. [Google Scholar] [CrossRef]
  45. Maurya, S.K.; Liu, X.; Murata, T. Simplifying approach to Node Classification in Graph Neural Networks. arXiv 2021, arXiv:2111.06748. [Google Scholar] [CrossRef]
  46. Protogerou, A.; Papadopoulos, S.; Drosou, A.; Tzovaras, D.; Refanidis, I. A graph neural network method for distributed anomaly detection in IoT. Evol. Syst. 2021, 12, 19–36. [Google Scholar] [CrossRef]
  47. Wang, Z.; Wu, Z.; Li, X.; Shao, H.; Han, T.; Xie, M. Attention-aware temporal–spatial graph neural network with multi-sensor information fusion for fault diagnosis. Knowl.-Based Syst. 2023, 278, 110891. [Google Scholar] [CrossRef]
  48. Dong, G.; Tang, M.; Wang, Z.; Gao, J.; Guo, S.; Cai, L.; Gutierrez, R.; Campbell, B.; Barnes, L.E.; Boukhechba, M. Graph Neural Networks in IoT: A Survey. ACM Trans. Sens. Netw. 2023, 19, 47:1–47:50. [Google Scholar] [CrossRef]
  49. Li, H.; Zhang, S.; Su, L.; Huang, H.; Jin, D.; Li, X. GraphSANet: A Graph Neural Network and Self Attention Based Approach for Spatial Temporal Prediction in Sensor Network. In Proceedings of the 2020 IEEE International Conference on Big Data (Big Data), Atlanta, GA, USA, 10–13 December 2020; pp. 5756–5758. [Google Scholar] [CrossRef]
  50. Chen, D.; Liu, R.; Hu, Q.; Ding, S.X. Interaction-Aware Graph Neural Networks for Fault Diagnosis of Complex Industrial Processes. IEEE Trans. Neural Netw. Learn. Syst. 2023, 34, 6015–6028. [Google Scholar] [CrossRef]
  51. Zhang, W.; Zhang, Y.; Xu, L.; Zhou, J.; Liu, Y.; Gu, M.; Liu, X.; Yang, S. Modeling IoT Equipment with Graph Neural Networks. IEEE Access 2019, 7, 32754–32764. [Google Scholar] [CrossRef]
  52. Owerko, D.; Gama, F.; Ribeiro, A. Predicting Power Outages Using Graph Neural Networks. In Proceedings of the 2018 IEEE Global Conference on Signal and Information Processing (GlobalSIP), Anaheim, CA, USA, 26–29 November 2018; pp. 743–747. [Google Scholar] [CrossRef]
  53. Casas, S.; Gulino, C.; Liao, R.; Urtasun, R. SpAGNN: Spatially-Aware Graph Neural Networks for Relational Behavior Forecasting from Sensor Data. In Proceedings of the 2020 IEEE International Conference on Robotics and Automation (ICRA), Paris, France, 31 May–31 August 2020; pp. 9491–9497. [Google Scholar] [CrossRef]
  54. Jiang, D.; Luo, X. Sensor self-diagnosis method based on a graph neural network. Meas. Sci. Technol. 2023, 35, 035109. [Google Scholar] [CrossRef]
  55. Kullaa, J. Detection, identification, and quantification of sensor fault in a sensor network. Mech. Syst. Signal Process. 2013, 40, 208–221. [Google Scholar] [CrossRef]
  56. Jäger, G.; Zug, S.; Casimiro, A. Generic Sensor Failure Modeling for Cooperative Systems. Sensors 2018, 18, 925. [Google Scholar] [CrossRef]
  57. Zou, X.; Liu, W.; Huo, Z.; Wang, S.; Chen, Z.; Xin, C.; Bai, Y.; Liang, Z.; Gong, Y.; Qian, Y.; et al. Current Status and Prospects of Research on Sensor Fault Diagnosis of Agricultural Internet of Things. Sensors 2023, 23, 2528. [Google Scholar] [CrossRef] [PubMed]
  58. ElHady, N.E.; Provost, J. A Systematic Survey on Sensor Failure Detection and Fault-Tolerance in Ambient Assisted Living. Sensors 2018, 18, 1991. [Google Scholar] [CrossRef] [PubMed]
  59. Liu, B.; Xu, Q.; Chen, J.; Li, J.; Wang, M. A New Framework for Isolating Sensor Failures and Structural Damage in Noisy Environments Based on Stacked Gated Recurrent Unit Neural Networks. Buildings 2022, 12, 1286. [Google Scholar] [CrossRef]
  60. Shannon, C.E. A Mathematical Theory of Communication. Bell Syst. Tech. J. 1948, 27, 379–423. [Google Scholar] [CrossRef]
  61. Kingma, D.P.; Ba, J. Adam: A Method for Stochastic Optimization. arXiv 2017, arXiv:1412.6980. [Google Scholar] [CrossRef]
  62. van Rijsbergen, C.K. Information Retrieval. 1979. Available online: https://openlib.org/home/krichel/courses/lis618/readings/rijsbergen79_infor_retriev.pdf (accessed on 18 December 2023).
  63. Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; et al. Scikit-learn: Machine Learning in Python. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar]
  64. Powers, D.M.W. Evaluation: From precision, recall and F-measure to ROC, informedness, markedness and correlation. arXiv 2020, arXiv:2010.16061. [Google Scholar] [CrossRef]
  65. Kozak, J.; Probierz, B.; Kania, K.; Juszczuk, P. Preference-Driven Classification Measure. Entropy 2022, 24, 531. [Google Scholar] [CrossRef]
  66. Iwendi, C.; Khan, S.; Anajemba, J.H.; Mittal, M.; Alenezi, M.; Alazab, M. The Use of Ensemble Models for Multiple Class and Binary Class Classification for Improving Intrusion Detection Systems. Sensors 2020, 20, 2559. [Google Scholar] [CrossRef]
  67. Zha, B.; Yilmaz, A. Subgraph Learning for Topological Geolocalization with Graph Neural Networks. Sensors 2023, 23, 5098. [Google Scholar] [CrossRef] [PubMed]
  68. Harris, C.R.; Millman, K.J.; van der Walt, S.J.; Gommers, R.; Virtanen, P.; Cournapeau, D.; Wieser, E.; Taylor, J.; Berg, S.; Smith, N.J.; et al. Array programming with NumPy. Nature 2020, 585, 357–362. [Google Scholar] [CrossRef] [PubMed]
  69. Chollet, F. Keras: Deep Learning for Humans. 2015. Available online: https://scholar.google.com/citations?view_op=view_citation&hl=en&user=VfYhf2wAAAAJ&citation_for_view=VfYhf2wAAAAJ:9pM33mqn1YgC (accessed on 18 December 2023).
  70. Plotly Technologies Inc. Plotly: Collaborative Data Science. 2015. Available online: https://plotly.com/chart-studio-help/citations/ (accessed on 18 December 2023).
Figure 1. Andra’s Cigéo project.
Figure 2. Sensor distribution around the HLW demonstrator cell.
Figure 3. Installation of the thermal source in the HLW demonstrator cell.
Figure 4. Message-passing mechanism and its use in a GNN.
Figure 5. The graph neural network model.
Figure 6. The GraphSAGE model.
Figure 7. Unidirectional graph representing the data.
Figure 8. Thermal responses of the optic fiber sensor at set sample points over time.
Figure 9. Responses of the entire set of distributed sensors at different time samples.
Figure 10. Process of introducing sensor inaccuracies into clean distributed sensor data.
Figure 11. Inputs and outputs of the machine learning algorithm.
Figure 12. First element of the GNN, used to create the input graph.
Figure 13. One GNN layer of message passing: updating the node embeddings.
Figure 14. One GNN layer of message passing: updating the edge embeddings.
Figure 15. The local classifier predicts the sensor state.
Figure 16. Confusion matrix of our problem.
Figure 17. Performance of the GNNs on the testing dataset in terms of recall metrics.
Figure 18. Proportion of GNNs above a certain accuracy.
Figure 19. Performance of the GNNs on the testing dataset in terms of precision metrics.
Figure 20. Performance of the GNNs on the testing dataset in terms of F1-scores.
Figure 21. Proportion of GNNs above a certain accuracy, sorted by hyperparameters.
Figure 22. Distribution of the hyperparameter sets for the 100 top-performing GNNs.
Figure 23. Optimizing the value of ε on the training dataset.
Table 1. Various hyperparameters used in the benchmark.

Agg MP | Agg CLA | No. of Layers | MLP Size (Flow, Update, CLA) in Number of Neurons | BCE Weight
Sum    | Sum     | 0             | Small (2, 15, 15)                                 | 1
Avg    | Avg     | 1             | Medium (4, 30, 30)                                | 2
Max    | Max     | 2             | Large (8, 60, 60)                                 | —
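For readers reproducing the benchmark, the options in Table 1 can be enumerated as a hyperparameter grid. The following is a minimal sketch, assuming one GNN is trained per combination (the variable names are illustrative, not the authors’ code):

```python
# Minimal sketch of the Table 1 hyperparameter grid (assumed full factorial).
from itertools import product

agg_mp     = ["sum", "avg", "max"]                      # message-passing aggregation
agg_cla    = ["sum", "avg", "max"]                      # classifier aggregation
n_layers   = [0, 1, 2]                                  # number of message-passing layers
mlp_size   = [(2, 15, 15), (4, 30, 30), (8, 60, 60)]    # (flow, update, CLA) neurons
bce_weight = [1, 2]                                     # class weight in the BCE loss (interpretation assumed)

grid = list(product(agg_mp, agg_cla, n_layers, mlp_size, bce_weight))
print(len(grid))  # 3 * 3 * 3 * 3 * 2 = 162 candidate configurations
```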
Table 2. Confusion matrix of the thresholding method on the test dataset.

TN = 1,791,832 | FP = 5240
FN = 555       | TP = 627,373

Table 3. Confusion matrix of the multi-layer-perceptron-based classifier on the test dataset.

TN = 1,796,452 | FP = 620
FN = 133       | TP = 627,795

Table 4. Confusion matrix of the decision tree classification method on the test dataset.

TN = 1,796,452 | FP = 11
FN = 327       | TP = 627,795
Table 5. Comparison between the top-performing GNNs and other standard classification methods.

Model         | Recall TNR | Recall TPR | Precision NPV | Precision PPV | F1-N    | F1-P    | Accuracy
Thresholding  | 99.708%    | 99.912%    | 99.969%       | 99.172%       | 99.839% | 99.540% | 99.761%
MLP           | 99.965%    | 99.979%    | 99.993%       | 99.901%       | 99.979% | 99.940% | 99.969%
Decision tree | 99.999%    | 99.948%    | 99.982%       | 99.998%       | 99.991% | 99.973% | 99.986%
GNN-1         | 99.964%    | 99.975%    | 99.991%       | 99.898%       | 99.978% | 99.937% | 99.967%
GNN-2         | 99.975%    | 99.942%    | 99.980%       | 99.927%       | 99.977% | 99.935% | 99.966%
GNN-3         | 99.974%    | 99.905%    | 99.967%       | 99.924%       | 99.970% | 99.914% | 99.956%
GNN-4         | 99.956%    | 99.950%    | 99.983%       | 99.874%       | 99.969% | 99.912% | 99.954%
GNN-5         | 99.954%    | 99.936%    | 99.978%       | 99.869%       | 99.966% | 99.902% | 99.949%
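The per-class metrics in Table 5 follow directly from the confusion matrices of Tables 2–4. The minimal Python sketch below (not the authors’ code) recomputes the thresholding row of Table 5 from the counts in Table 2:

```python
# Recompute the Table 5 metrics from a confusion matrix; the counts below are
# the thresholding results of Table 2.
tn, fp, fn, tp = 1_791_832, 5_240, 555, 627_373

tnr = tn / (tn + fp)                 # recall on the negative class
tpr = tp / (tp + fn)                 # recall on the positive class
npv = tn / (tn + fn)                 # precision on the negative class
ppv = tp / (tp + fp)                 # precision on the positive class
f1_n = 2 * npv * tnr / (npv + tnr)   # F1-score, negative class
f1_p = 2 * ppv * tpr / (ppv + tpr)   # F1-score, positive class
accuracy = (tn + tp) / (tn + fp + fn + tp)

for name, value in [("TNR", tnr), ("TPR", tpr), ("NPV", npv), ("PPV", ppv),
                    ("F1-N", f1_n), ("F1-P", f1_p), ("Accuracy", accuracy)]:
    print(f"{name}: {value:.3%}")    # e.g., TNR: 99.708%, matching Table 5
```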
Table 6. Hyperparameters of the top-performing GNNs.

Model | Agg MP | Agg CLA | N Layers | MLPs’ Size | BCE Weight
GNN-1 | Max    | Max     | 1        | Medium     | 2
GNN-2 | Avg    | Sum     | 1        | Large      | 1
GNN-3 | Sum    | Sum     | 1        | Small      | 2
GNN-4 | Sum    | Avg     | 1        | Small      | 2
GNN-5 | Max    | Avg     | 1        | Large      | 2
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.