Integrating Cluster Analysis into Multi-Criteria Decision Making for Maintenance Management of Aging Culverts

Negligence in relation to aging infrastructure systems could have unintended consequences and is therefore associated with a risk. The assessment of the risk of neglecting maintenance provides valuable information for decision making in maintenance management. However, infrastructure systems are interdependent and interconnected systems of systems characterized by hierarchical levels and a multiplicity of failure scenarios. Assessment methodologies are needed that can capture the multidimensional aspect of risk and simplify the risk assessment, while also improving the understanding and interpretation of the results. This paper proposes to integrate the multi-criteria decision analysis with data mining techniques to perform the risk assessment of aging infrastructures. The analysis is characterized by two phases. First, an intra failure scenario risk assessment is performed. Then, the results are aggregated to carry out an inter failure scenario risk assessment. A cluster analysis based on the k-medoids algorithm is applied to reduce the number of alternatives and identify those which dominate the decision problem. The proposed approach is applied to a system of aging culverts of the German waterways network. Results show that the procedure allows to simplify the analysis and improve communication with infrastructure stakeholders.


Challenges in Infrastructure Management
Infrastructure management is a complex discipline which plays an essential role in the functioning and the well-being of modern societies. Among the activities that infrastructure management involves, maintenance management is becoming increasingly prevalent. In spite of the acknowledged fact that age affects the performance of civil infrastructures and their robustness against environmental and natural threats [1], investments in maintenance are still insufficient [2]. As a result, the maintenance backlog [3], which represents the number of unfulfilled maintenance demands concerning predefined security standards, is growing. The German waterways system also has to deal with a backlog accumulation of maintenance actions. The asset of waterways infrastructures, managed by the Federal Waterways and Shipping Administration of Germany (Wasserstraßen-und Schifffahrtsverwaltung des Bundes (WSV)) with the support of the Federal Waterways Engineering and Research Institute (Bundesanstalt für Wasserbau (BAW)), includes different types of infrastructures, such as locks, weirs, culverts, canal bridges, and lighthouses, for an estimated asset value of forty billion Euros. Inspections have pointed out that many infra-structures are or will soon be in bad condition. However, urgent maintenance interventions are systematically disregarded due to logistic and economic constraints. There are currently concerns about the culvert system, as it has often been neglected and the failure of one culvert could lead to flooding.
As in the case of extreme natural events or terrorism, disregarded maintenance might lead to the occurrence of unwanted consequences, and it is thus linked to a risk [4]. Riskbased maintenance strategies have been developed to identify those system elements linked to higher probabilities of failure and the magnitude of failure consequences [5]. However, the evaluation of risk is not trivial for infrastructure systems.
The status of risk as a mental construction, and the fact that the concept of risk spans different disciplines, from medicine to economics and engineering, has major implications that should be considered before starting any risk assessment. One such implication is the fact that, although the concept of risk has been the object of several studies and investigation, there is still not a single agreement about the definition of risk, as it clearly emerges from [6,7], where ten definitions of risk and five different examples of metrics are proposed, respectively. Nonetheless, most of those definitions and interpretations stem from the one given by [8], who stated clearly that risk is a measure of the probability and severity of adverse effects. This definition also represents the basis of the one provided by [9] and largely used nowadays, according to which risk can be quantified through a set of triplets: scenario, probability of failure, and consequence. This definition is as simple as it is ambiguous, since it can be interpreted in two different ways, which are both valid but imply different types of analysis [10]: 1) in terms of the probability of occurrence of adverse effects; 2) in terms of the probability of the severity of adverse effects, given their occurrence. Regardless of the definition, risk assessment represents a true challenge in the case of complex systems. The real world is in effect constituted by interdependent and interconnected (I-I) complex systems of systems (SoS) [11]. Each SoS is composed of many subsystems, each of which is characterized by a hierarchy of shared or interacting components having multiple states, functions, operations, databases, costs, and stakeholders. All SoS are subject to multiple adverse initiating events that could originate outside or inside the SoS. In contrast to a single system, no single model can capture the essence of SoS. Furthermore, the I-I subsystems share most of the previously mentioned building blocks and especially their states. In addition to these challenges, it should be considered that a clear separation between the probability and consequences of failure is difficult to achieve in practice. This is due to the fact that, despite the attempt to describe the world through simple, linear models, the causal chain describing failure scenarios is often non-linear, with chain links playing both the role of causes and effects at the same time and activating reinforcement loops.
As the assessment of the risk of neglecting maintenance supports decision making in infrastructure management and the planning of maintenance actions, strategies are needed that tackle these challenges and clarify the ambiguities inherent in the risk analysis of infrastructure systems.

The Multi-Criteria Decision Analysis for the Maintenance Management of Physical Infrastructures
Maintenance activities involve different types of decision problems which span from single choice to portfolio problematics and are often deeply combined and connected to each other. These decision problems are characterized by different decision criteria and involve short and long-term consequences, about which a certain degree of information is available. Considering that human beings can simultaneously process a limited amount of information, methods are needed which support decision making in maintenance management, which also capture the multidimensional nature of risk and increase the rationality of the decision process. A tool which can provide valuable help in many phases of the decision process concerning maintenance activities is the multi-criteria decision analysis (MCDA).
The MCDA represents a collection of formal approaches which seek to take explicit account of multiple criteria in helping individuals or groups explore decisions that matter. The MCDA provides an aid to decision making by pursuing the following objectives: (1) to integrate objective measurement with value judgment; (2) to make explicit and to manage the subjectivity inherent in decision making, especially in the choice of the criteria and their relative importance; (3) to illuminate controversy and overcome difficult trade-offs, e.g., by promoting the generation of new alternatives; (4) to increase decision makers' learning and understanding of the decision problem and thus to promote the exploration of new capabilities; (5) to provide a structure for the decision problem which serves as a focus and a language for discussion.
The MCDA has been already applied to the maintenance of infrastructure systems [12,13]. MCDA methods differ in many aspects, such as the input information, the modelling effort required, as well as the outcome and its granularity. Depending on the outcome, they can be more suited for making choices, sorting and classification, ranking, elimination, design, or the description of problems. In general, all the MCDA approaches incorporate subjective information in the form of preferences. However, they follow different models, which can be grouped into three main categories: the value measurement models; the goal, aspiration or reference level models; the outranking models [14]. In the context of maintenance management, the following applications have been recently developed: value measurement models are implemented in the maintenance management of railways, bridges, and roads [13,[15][16][17]; goal, aspiration, or reference level models such as multi-objective programming methods are implemented in the maintenance management of bridges and highways [18,19]; outranking models are implemented in the maintenance management of bridges and roads [20,21]. The review of the literature shows a trend towards using value measurement models to prioritize maintenance actions on single objects, also considering the relevance of the object at network level.
However, the MCDA is suited to deal with small sets of simple and static qualitative and/or quantitative data. It is thus inadequate to process large amounts of complex data which nowadays are collected during the several stages of maintenance and infrastructure management [22]. In addition, MCDA approaches have found limited implementation by government agencies, organizations, and institutions [23]. Currently, decisions are taken quickly, instinctively, and emotionally, considering only a few of all the possible alternatives, and without collecting information or using the information available for properly describing the decision consequences. As decision problems are inherently complex and multidimensional, strong arguments have been made in favor of a methodological pluralism in which the MCDA is used in conjunction with other methods in order to expand their influence and their acceptance among practitioners [24].

Artificial Intelligence and Data Mining Techniques in Maintenance Management
Contrary to MCDA, a group of techniques that are especially suited for dealing with big sets of complex data is artificial intelligence and data mining algorithms. Because maintenance management involves complex problems involving many variables and large data sets, the application of these algorithms for maintenance management has already been developed. A good example is represented by [25,26], in which unsupervised learning algorithms such as cluster algorithms are used in order to identify groups of similar maintenance activities. The potential of such techniques is evident and indisputable since they allow for the automation of the maintenance management process, taking full advantage of the available data. However, artificial intelligence and data mining techniques are ill-equipped to tackle novel problems in uncertain environments for which an abstract view is required, which are frequently encountered in the work of government institutions and organizations like the WSV and the BAW. Therefore, such algorithms prove helpful to organizational decisions when integrated with complementary techniques which support a holistic approach.

Integration of Data Mining Techniques in the Multi-Criteria Decision Analysis
Currently, there is a need to integrate the MCDA with other methods in order to enhance its practical value in operational research. Hybrid approaches in which MCDA is combined with artificial intelligence and data mining techniques have already been developed [22,27,28]. A comprehensive collection of recent contributions in which the MCDA is supported by artificial and computational intelligence is especially given by [28], in which several references to older approaches have also been provided. A review of the literature reveals that the integration has the practical purposes of improving the structure of the decision problem, performing data pre-processing, modeling the preferences. However, to date, hybrid MCDA has found almost no application in infrastructure management. Exceptions are represented by [29], in which artificial neural networks support the MCDA to evaluate municipal water mains performance, and [30], in which the MCDA and genetic algorithms are implemented together with other techniques in order to develop a multi-year maintenance planning for bridges. Nonetheless, the proposed methodologies lack integration and generality. The development of hybrid approaches in which the MCDA is integrated with artificial intelligence and data mining techniques could increase the objectivity of the decision process and thus enhance the practical value of the MCDA in the maintenance management of waterway infrastructures and in many other disciplines.
It should be mentioned that the social relevance of such integrated approaches has increased since the onset of the crisis due to the COVID-19 pandemic, as governments have been forced to make difficult choices that profoundly affect the health, wealth, and freedom of their populations. These high-stakes decisions have often been made quickly, without using sound information, and with little involvement of stakeholders in the deliberation about which policies to pursue. A more inclusive, objective, and data-driven decision-making process could contribute to more trustworthy and legitimate decisions on difficult ethical questions and political trade-offs during the pandemic and beyond [31].
The aim of this paper is to increase the rationality of decision making in infrastructure management by developing a hybrid approach for risk-based maintenance planning in which the MCDA is integrated with data mining techniques. The integration of the two methodologies is aimed at reducing their weaknesses and exploiting their full potential. The great advantages of a combined approach are to make explicit the subjective component of decision making and to combine it with advanced data analysis.
The rest of this paper is organized as follows. Section 2 presents the basic methodologies which have been integrated and combined to develop the hybrid approach for the risk assessment of aging infrastructures. Section 3 describes the main features of the hybrid approach and its application to the maintenance prioritization of a system of aging culverts. Section 4 summarizes the main findings of this work as well as its limitations and defines future research directions.

A Systematic Framework for the Integration of Data Mining Techniques in the Multi-Criteria Decision Analysis
The integration of data mining techniques in the MCDA should follow a systematic process in which the main dimensions of the design of the integrated method are clarified. In general, the objectives pursued by integrating the two methodologies and applying them to infrastructure and maintenance management are the following: • Increase the objectivity of the decision process by integrating analytic and intuitive thinking and thus reducing the cognitive bias.

•
Increase the transparency and understanding of the decision process, and by doing so, promote inclusive, accountable, and trustworthy decisions.
• Increase the practical value of these approaches by overcoming the weaknesses of individual methods and gaining particular benefits from their integration.

•
Promote data-driven decisions by exploiting primary and secondary databases.
Tackle complex decision problems characterized by several phases requiring different approaches.
Relevant dimensions of the design of the integrated method are as follows: 1. Rationales of the integration, such as: • Expansion/development/enhancement: when the scope is to extend the range of application of the MCDA, increase their credibility, foster a multiplicity of perspectives within their application, and improve the precision of the results. 2. Timing of the integration: the integration could be concurrent or sequential. In the first case, the integration includes components which are integrated in parallel, while in the second case the components are linked in series. Sequential and concurrent designs are determined by the dependency between the components. Two components are dependent when the implementation of one depends on the results of the other.
3. Typological approach: according to the timing of the components, several typologies or taxonomies are identified, which are determined by the peculiar juxtaposition of the data mining with MCDA components. Examples of such typologies are: • Convergent parallel design: the MCDA and data mining techniques are applied independently. The results are then brought together in an overall interpretation. • Exploratory or explanatory sequential design: data mining techniques are used for data pre-or post-processing.

•
Embedded design: a strand of data mining techniques is added to enhance the MCDA.

•
Multiphase design: the above-mentioned designs are combined in a unique approach.
The complexity of the integrated approach, which is driven by the multiplicity of integration points and data sources, should also be considered. In general, a multiphase design is more complex than the other types of design. Figure 1 shows the overall research framework and identifies the main scopes, methods, and data used in this research. In the following subsections, attention is placed on the methods and their intended use in this research.

The Simple Multi-Attribute Rating Technique Swing
A MCDA method that is of particular interest due to its simplicity is the simple multiattribute rating technique (SMART) [32]. SMART belongs to the group known as value measurement models, which are characterized by the following relevant steps: 1. Definition of the context of the decision problem: in this phase, the problematic characterizing the decision problem is identified, such as sorting, classification, ranking, description, learning, and design. This step allows the definition of the desired results and the identification of the model which can provide such outputs. Then, a first formulation for the alternatives, criteria, and goal of the decision problem can be developed. 2. Construction of the model: in this step, the value tree, which links alternatives, attributes, criteria, and goals of the decision problem, is built. Approaches to the elicitation and description of the preferences are also identified.
3. Validation of the model: this step is an interactive process during which the analyst cooperates with the decision maker in order to assess the validity of the results. In this step, sensitivity analyses could be carried out. The SMART method represents a simplified version of the multi-attribute utility theory (MAUT) approach, and it uses the strategy of heroic approximation to justify the simplification of single-attribute utility function and use of an additive aggregation model. In MAUT, elicitation of the details of the utility functions could be tedious and demanding, and the contribution of those details to wiser and more valuable choices is often negligible. In SMART, simple linear single-attribute utility functions are considered. At the same time, whether this simplification might lead to possible suboptimal choices in the problem at hand is assessed. If not, time has been saved and an elicitation error has been avoided. A further improvement of SMART is represented by the SMARTS method, which has been obtained by including an invention called swing weighting (the additional S denotes exactly Swing). Swing weighting is related to the techniques used in order to define the weights of the attributes. The word "swing" especially refers to the operation of changing the score of some objects of evaluation from the worst to the best. By assessing the attractiveness of each swing, it is possible to define the weights which consider not only the importance of the attribute, but also the associated range of values.
Suppose that a function u relates the utility or value or desirability of some physical or judged quantity, u(x), to its magnitude, x. The function u is thus a utility function. Suppose that u(x) is known for each relevant attribute. The easiest to use and most familiar model to aggregate the vector of u(x) values into a scalar is the additive model. If h (h = 1, 2, … , H) is an index identifying the alternatives (objects of the evaluation) and k where the values u (X ) are the single-attribute utilities and W k are the weights, one for each attribute; by convention, they sum to 1.

Assessment of the Risk Linked with One Failure Scenario
The MCDA could support the decision process in infrastructure management in many ways. It is especially possible to see prioritization tasks as decision problems characterized by the ranking problematic. Given a backlog of objects which are in a critical condition and for which maintenance should be executed, value measurement models could be effectively used in order to evaluate the risk linked to each object with respect to an identified failure scenario, and prioritize maintenance actions.
This approach requires first to identify and describe the failure scenarios, which consists in understanding how failure propagates through system levels and in describing the process with effective tools. Several tools and methods could be used in order to describe failure scenarios, such as failure trees, event trees, and causal chains, such as those implemented in the failure modes and effects analysis, the anticipatory failure determination, influence diagrams, cognitive maps, and directed acyclic graphs [33]. Once the dynamics of failure are investigated, recognized, understood, and described, a relationship has to be established between the failure scenario and the required information for assessing the related risk, according to which the value tree is built (Figure 2). This information could be already available. It could be collected through further investigation, or it could be difficult or even impossible to obtain. The intention is to perform a risk assessment based on the information already available, the results of which can also drive the collection of further information. In addition to primary databases, which are databases explicitly devoted to maintenance planning, the information extracted from secondary databases, which are data collected by someone else for other purposes, also plays a significant role. The information available shapes the structure of the value tree, especially the definition of the attributes characterizing the alternatives, which will be linked to the two main criteria defining risk, namely the probability of failure and the magnitude of the consequences. Once failure scenarios are identified and the value tree is developed, the performance of each alternative could be assessed with the SMARTS method, which in this case corresponds to the risk linked to the object with respect to the considered failure scenario.

Aggregation of Many Failure Scenarios with the Weighted Sum Model
However, many failure scenarios exist, so as many value trees should be built as there are failure scenarios, and the risk linked to each one should be aggregated in a total risk. It is desirable that the aggregation could be obtained applying a simple additive model such as the weighted sum model (WSM). Suppose that j (j = 1, 2, … , J) is an index identifying the failure scenarios, the total performance of the alternative h can be calculated applying Equation (2) [34]: where R are the scenarios' performances and W j are the weights, one for each failure scenario, which sum to 1. Depending on how risk is approached and modelled, the weights can have different meanings and could represent different aspects of risk. Therefore, the use of the MCDA goes in parallel with specification about the mental construction of risk.

The k-Medoids Algorithm
Unsupervised learning is the field of practice that supports finding patterns in cluttered data and it is based on techniques such as clustering and dimensionality reduction.
Clustering especially refers to the overarching process that involves finding groups of similar data in a dataset. A popular clustering approach is the k-medoids or partitioning around medoids algorithm [35], which partitions a data set into k groups or clusters. Each cluster is represented by one of the data points in the cluster which is named a medoid. The medoid is especially that point for which the average dissimilarity between it and all the other cluster members is minimal, and thus it corresponds to the most centrally located point in the cluster. Compared to the k-means algorithm, the k-medoids algorithm is more robust since it is less sensitive to noise and outliers.
A requirement of this algorithm is to specify the number of clusters to be generated, which is often unknown a priori. However, the silhouette method could be used for identifying the optimal number of clusters [36]. The idea is to implement the k-medoids algorithm using different values of the clusters k. Next, the average cluster silhouette is drawn according to the number of clusters, which measures the quality of a clustering. The higher the value of the silhouette width, the better the clustering. The optimal number of clusters k is the one that maximizes the average silhouette over a range of possible value of k.

Description of the Hybrid Multi-Criteria Decision Analysis
An infrastructure which is often neglected despite the magnitude of the possible failure consequences is the culvert. A culvert allows water or other fluids to flow under an obstruction from one side to the other. The obstruction is often represented by a transportation system, such as roads, railways, and waterways. In the case of waterways, the culvert may encroach on a levee. Culverts are usually parts of a broader infrastructure systems which can be decomposed in hierarchical levels. As historical failures have shown, neglecting inspections and maintenance of culverts could initiate failure events which usually start at component levels and propagate to higher levels, showing a cascade behavior and leading to disastrous effects [37][38][39].
The flowchart in Figure 3 describes the integrated MCDA which has been applied in order to perform the risk assessment of a system of culverts. The first step is the clarification of the scope of the decision problem, which is the prioritization of maintenance activities on a system of culverts, and the identification of the alternatives, which are the aging culverts requiring maintenance interventions. After that, relevant failure scenarios of the objects are identified. At the same time, the available information about the object which could be relevant for the risk assessment is considered. Then, the risk attributes are identified. These are the main criteria according to which risk is assessed, for which information is already collected in the database. The identification of alternatives, attributes, criteria, and goals of the decision problem allows the definition of the value trees, one for each considered failure scenario. At this point, a decision has to be met. If the structure characterizing the value trees is simple, it is possible to directly proceed with the risk assessment. Otherwise, attempts are made in order to simplify the decision problem. A cluster analysis applying the k-medoids algorithm is especially executed, whose scope is to group alternatives showing similar values of the risk attributes. If patterns in the data are found, a restructuring of the decision problem could be considered, which is finalized at reducing the number of alternatives or the number of attributes. Finally, the risk of each alternative with respect to the identified failure scenario is assessed applying the SMARTS method and the total risk is obtained by aggregating the risk linked to each failure scenario with the WSM. In this way, the objects could be ranked according to their total risk. In the next stages of the research, the ranking will support further decisions about the planning of maintenance activities.
The integration of the cluster analysis based on the k-medoids algorithm in the SMARTS model has especially the following purposes: 1. To simplify the decision problem. The cluster analysis based on the k-medoids algorithm is executed after the definition of the value tree and before the risk linked to each failure scenario is assessed. Thus, the timing and the taxonomy of the integration of the data mining technique in the SMARTS method is respectively sequential and embedded. Table 1 shows the hierarchical decomposition which characterizes the culverts of the German waterways system. The culvert itself shows a hierarchical structure since it can be decomposed into components and subcomponents. The WSV has collected in a database called WADABA information about the structural and typological characteristics of 48 culverts intersecting the West German network of canals, a system of artificial navigable canals built at the turn of the 20th century in the Ruhr region in order to transport bulk goods to the industries located in the area. Although the data have not been explicitly collected in order to plan maintenance activities on the objects, the WADABA database could be exploited with this aim, and therefore it represents a source of secondary data for maintenance management. In order to understand the data relevant for the risk assessment, failure scenarios for culverts are first identified ( Table 2). The four main failure scenarios are: collapse of the barrel and collapse of the headwalls (which, both involving the structural collapse of the culvert, have been considered as a unique failure scenario), leakage out of the culvert, piping inside the culvert, and afflux. The failure scenarios are formulated with respect to the object level by identifying possible causes, and short and long terms consequences linked to main events characterizing the failure scenario. Given the identified failure scenarios, the metadata of WADABA which could be relevant for the risk assessment are identified, together with the values which they could assume, determining the type of variable (Table 3). The metadata represent risk attributes since they are linked with the probability and the consequences of the failure scenarios listed in Table 2. The study presents the following challenges: 1. In addition to the identified risk attributes, other factors play a role in the assessment of risk, and especially affect the probability of unwanted consequences, such as the length of the chain of cause-effect according to which the failure scenario can be described and the detectability of the failure scenario before long term consequences are obtained. 2. The decision problem is characterized by many alternatives, to which multiple risk attributes are associated. In addition to that, the decision problem is characterized by a complex structure that implies a complex process of preference elicitation. In this circumstance, it is difficult to anticipate, explain, and understand the results of the risk assessment. Recalling that one alternative dominates another when it is at least as good with respect to all criteria and it is strictly preferred on at least one criterium, it is especially difficult to define the dominating or dominated alternatives, whose identification could not only improve the understanding and the communication of the results of the analysis, but also simplify the structure of the decision problem.

Structuring of the Decision Problem
The integrated approach that has been developed in this study aims at properly addressing the above-mentioned challenges.

Construction of the Value Tree
The value tree explicates the links between the alternatives, the risk attributes, the criteria, and the scope of the decision problem. In Figure 4, it is possible to see the general structure of the value tree characterizing the decision problem under assessment. It should be noted that the contribution of the attributes to the risk linked with a certain failure scenario changes with the considered failure scenario. The relationship between the risk attribute and the risk linked to a certain failure scenario is in some cases direct, while in other cases inverse. An example is represented by the risk attributes "Presence of a rake" and "Presence of a sand trap". Rakes and sand traps are culvert components which protect the barrel from damage by debris and particles, or avoid obstructions within the conduit. However, if they are not properly cleaned, or in the case of flash flooding or ponding water, they could provoke the obstruction of the inlet structure, triggering the failure scenario "Afflux".
An additional complication is represented by the fact that some risk attributes could affect both the probability and the consequence of failure. This is the case of the drainage basin in the failure scenario "Afflux": the larger a drainage basin is, the higher the probability that debris and particles will be transported and eventually obstruct the culvert, but also the greater the resulting cross catchment flooding.

Cluster Analysis Applying the k-Medoids Algorithm
The k-medoids algorithm is applied in order to identify clusters in the set of considered culverts characterized by the attributes listed in Table 3.
First of all, the silhouette width for several clusters is computed, according to which the optimal number of clusters can be identified. As Figure 5 shows, the value of the silhouette width increases with increasing values of the number of clusters. The maximum value is obtained when 15 clusters are considered. Then, it decreases progressively for an increasing number of clusters. The maximum value of the silhouette width is above 0.8, which indicates that the data are characterized by a strong structure. This result is not surprising, since most of the attributes are discrete. The number of clusters which corresponds to the maximum value of the silhouette width is considered ( Table 4). The number of objects in each cluster varies between 1 and 7. Three clusters are composed by only one object, and therefore they do not actually represent a group of objects. The median value of the attribute "Drainage basin" within each cluster has been considered since the variation of the values of the attribute within the cluster is limited. Although the identification of dominating/dominated alternatives is hampered by the complex elicitation process of the preferences required by this decision problem, which is executed on two different levels (intra and inter failure scenarios), the results suggest that clearly dominating alternatives cannot be identified. In effect, there are no culverts which encroach on a levee and also have significant values of the drainage basin. On the other side, some culverts do not encroach on a levee and have negligible values of the drainage basin (Cluster 3, 5, 8, and 12). These clusters, which altogether form a group of 14 objects, can be identified as dominated alternatives.

Intra Failure Scenario Risk Assessment
The crucial steps of the SMARTS model are applied, which are the definition of single-attribute utility functions and the swing weighting. The definition of the single-attribute utility functions is trivial for discrete attributes, since they can assume only two values, which correspond to the maximum and minimum utility. For the only numerical attribute "Drainage basin", a linear function can be assumed, because it does not matter if small improvements in the value of the attributes fell near the minimum, in the middle, or near the maximum. The single-attribute linear utility function u can be obtained by creating a function which normalizes the values x of the attribute: where min(x) , max (x) are the minimum and maximum values of the attribute, respectively. The single-attribute utility function for the attribute "Drainage basin" is defined according to Equation (3), and the swing weighting method is applied. The elicitation of the swing weights proceeds in two steps. The first step yields the rank order of the weights, which can be obtained by determining the sequence of attributes which contribute the most to risk by swinging their values. The second step yields the weights themselves. Several methods exist to execute this approach and one of these is via direct magnitude estimates. Assuming that the swing of the attribute ranked as first is worth a full 100 points, the swing on the other attributes is assessed with respect to it. Then, the weights are determined by dividing the assigned points with their total sum. Tables 5-8 summarize the results of the swing weighting for the failure scenarios identified in Table  2. Now that single attribute functions and the attribute weights have been defined, the preference for each alternative can be expressed through a scalar number by applying the Equation (1). This model assumes conditional monotonicity. This assumption can be verified by considering when, at one level of the value of a certain attribute, more of another attribute is better than less, while at another level of the same attribute, less of another attribute is better than more. Since conditional monotonicity is verified, the additive model can be a good approximation in order to express the preference for each alternative.

Inter Failure Scenario Risk Assessment
In order to compute the total risk of each alternative, the WSM expressed by Equation (2) is applied, previous standardization of the utilities related to each failure scenario. The crucial part of the application of this method is the definition of the weights, which represent the importance of the scenario. It is assumed here that the importance depends on three factors: 1) the length of the cause-effect chain according to which the failure scenario could be described; 2) the detectability of ongoing failure scenarios; 3) the magnitude of the long-term failure consequences.
In Table 9, the failure scenarios are evaluated according to the above-mentioned dimensions. The weights are than developed applying the direct rating method. The highest rate is assigned to the failure scenario "Collapse of the barrel/headwalls". Although this scenario is characterized by a long causal chain, and a medium detectability, it could lead to disastrous consequences. In effect, this failure scenario could not only initiate instability and erosion in levees, but could also trigger other failure scenarios. Therefore, it is considered that this failure scenario is associated to critical cascade effects, and for this reason it earns 100 points. The lowest rate of 50 is given to the scenario "Afflux". Although this scenario is usually characterized by a short cause-effect chain, it could be easily detected since the accumulation of water is in general visible and it grows seamless. The resulting flooding, although of considerable size, is not comparable to those resulting from the breach of a levee. The remaining failure scenarios have obtained an equal rate of 75. These scenarios are characterized by long cause-effect chains but a low detectability, as signs of their development are visible only when significant consequences have already developed. They are both associated with soil instability and erosion, which could eventually result in the breach of a levee.
The results shown in Table 10 reveal that the groups of culverts encroaching on a levee obtain the highest scores of aggregated utilities. This result is not surprising since the breach of a levee is the most undesired consequence. The procedure which has been applied in this paper allows to consider the contribution to the total risk given by the other risk attributes, and by doing so to prioritize within these groups. The groups of culverts encroaching on a levee are followed by two groups of culverts characterized by very high values of the drainage basin. The clusters 3, 5, 8, and 12, which have been previously recognized as dominated alternatives, earn the ranking positions 10, 13, 11, and 12 respectively, confirming the prediction. The last ranking positions are obtained by clusters 11 and 13, which also do not encroach on a levee and present small values of the drainage basin. These clusters are also characterized by the presence of a rake and the presence of a sand trap. It is possible to conclude that the presence of these culvert components is linked with lower aggregated risk.
These results represent valuable information for maintenance management at operational, tactical, and strategic levels. This approach provides a concise view of the items that need urgent maintenance, and consequently supports discussion at the strategic level, the identification of long-term goals, and the definition of tactics to achieve them. In addition to that, risk quantification combined with optimization algorithms allows for the optimal allocation of maintenance resources and detailed scheduling of repair interventions.
1. Risk has many dimensions, since both the probability and the consequence of failure depend on many factors which could be qualitative or quantitative. A quantitative definition of risk could be not only difficult and time consuming, but also unnecessary when the analysis is driven by prioritization tasks. 2. Same risk factors could play different roles depending on the considered scenario. In order to compute the total risk of neglecting maintenance of a certain object, the risk linked to single failure scenarios should be properly aggregated. 3. Aspects such as the length of the causal chain characterizing the failure scenario and the detectability of the failure scenario significantly affect risk, and therefore they should also be considered in the analysis.
In order to overcome these obstacles, this paper proposes to approach the assessment of risk by applying multi-criteria decision analysis and integrating it with data mining techniques. Multi-criteria decision analysis supports decision making, providing a rigorous structure for the decision problem whereby the subjectivity inherent in the decision process can be better managed and assessed. The models predominantly considered are the simple multi-attribute rating technique improved with swing weighting and the weighted sum model. The scopes of their application are, respectively:

•
To perform an intra-failure scenario risk assessment, and especially to evaluate the contribution of each risk factor to the risk linked with a certain failure scenario.

•
To perform an inter-failure scenario risk assessment, and especially to aggregate the risk linked with single failure scenario.
The integration of the multi-criteria decision analysis with data mining techniques is aimed at promoting data-driven decisions, simplifying the structure of the decision problem, and improving the interpretation of the results. The cluster analysis based on the kmedoids algorithm is applied in order to group alternatives presenting similar risk attributes. In this way, the alternatives of the decision problem are reduced to the prototypes representing each group. The application of the k-medoids algorithm allows to reach the following objectives in succession: 1. Evaluate the profiles which characterize the alternatives. 2. Facilitate the identification of dominating/dominated alternatives. 3. Catalyze the learning and understanding of the problem situation, as well as the explanation of the results to infrastructure stakeholders.
Results show that 15 clusters are clearly identified within the set of 48 culverts. Each cluster is characterized by a prototype representing the prevalent profile of the alternatives within that cluster. The protypes can be considered as the alternatives of the decision problem. The visual assessment of the profiles characterizing each cluster allows us to recognize that four clusters containing 14 culverts are dominated alternatives.
However, a limitation of this study is that information regarding only one level of the infrastructure system, namely the object level, has been considered. In order to obtain more refined results, information regarding other system levels should be included, such as the typology of degradation processes affecting the object components, state of degradation, and potential of the failure consequences to damage social, economic, and infrastructural systems. In the next steps of this research, this approach will be further improved and extended in order to process information from different system levels. The final scope is to carry out a multilevel and multi-scenario risk assessment of aging infrastructure systems.