Article

Ensembles of Random SHAPs

Higher School of Artificial Intelligence, Institute of Computer Science and Technology, Peter the Great St. Petersburg Polytechnic University, Polytechnicheskaya, 29, 195251 St. Petersburg, Russia
* Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Algorithms 2022, 15(11), 431; https://doi.org/10.3390/a15110431
Submission received: 6 October 2022 / Revised: 2 November 2022 / Accepted: 11 November 2022 / Published: 17 November 2022
(This article belongs to the Special Issue Ensemble Algorithms and/or Explainability)

Abstract

The ensemble-based modifications of the well-known SHapley Additive exPlanations (SHAP) method for the local explanation of a black-box model are proposed. The modifications aim to simplify the SHAP, which is computationally expensive when there is a large number of features. The main idea behind the proposed modifications is to approximate the SHAP by an ensemble of SHAPs with a smaller number of features. According to the first modification, called the ER-SHAP, several features are randomly selected many times from the feature set, and the Shapley values for these features are computed by means of “small” SHAPs. The explanation results are averaged to obtain the final Shapley values. According to the second modification, called the ERW-SHAP, several points are generated around the explained instance for diversity purposes, and the results of their explanation are combined with weights depending on the distances between the points and the explained instance. The third modification, called the ER-SHAP-RF, uses the random forest for a preliminary explanation of the instances and determines a feature probability distribution which is applied to the selection of the features in the ensemble-based procedure of the ER-SHAP. Many numerical experiments illustrating the proposed modifications demonstrate their efficiency and properties for a local explanation.

1. Introduction

Machine learning models and algorithms have shown increasing importance and success in many domains. Despite this success, there are obstacles to applying machine learning algorithms, especially in areas of risk, for example, in medicine, reliability maintenance, autonomous vehicle systems, and security applications. One of the obstacles is that many machine learning models have sophisticated architectures and, therefore, are viewed as black boxes. As a result, such models have limited interpretability, and a user of the corresponding model cannot understand and explain the predictions and decisions it provides. Another obstacle is that, in many cases, a single testing instance has to be explained, i.e., a user needs to understand only a single prediction, for example, a diagnosis of a patient stated by a model. In order to overcome these obstacles, additional interpretable models should be developed that help to answer the question of which features of an analyzed instance lead to the black-box model prediction. In other words, these models should select the most important features which impact the black-box model prediction. It should be noted that some models, including linear regression, logistic regression, and decision trees, are intrinsically explainable due to their peculiarities. At the same time, most machine learning models, especially deep learning models, are black boxes and cannot be directly explained. The need to explain these models and their predictions has motivated the development of many methods and models which try to explain the predictions of deep classification and regression algorithms. There are several detailed survey papers providing a deep dive into the variety of interpretation methods and models [1,2,3,4,5,6,7,8], which show the increasing importance of the interpretation methods and a growing interest in them.
The interpretation of a local prediction of a black-box model aims to select the features which significantly impact this prediction, i.e., by using the interpretation model, we try to determine which features of an analyzed instance lead to the obtained black-box model prediction. There are two groups of interpretation methods. The first one consists of the so-called local methods. They try to interpret a black-box model locally around a test instance. The second group contains global methods which derive interpretations on the whole dataset or a part of it. The present paper focuses on the first group of local interpretation methods, though the proposed approach can easily be extended to the global interpretation.
Two very popular post hoc approaches to interpretation can be singled out among many others. The first one is LIME (Local Interpretable Model-Agnostic Explanation) [9], which is based on building an approximating linear model around the instance to be explained. This follows from the intuition that the explanation may be derived locally from many instances generated in the neighborhood of the explained instance with weights defined by their distances from the explained instance. The coefficients of the linear model are interpreted as the feature importances. Linear regression (for the regression problem) or logistic regression (for the classification problem) allows us to construct the corresponding linear models. LIME has many advantages. It successfully interprets models dealing with tabular data, text, and images. However, LIME also has some shortcomings. The first one is that LIME is not robust. This means that it may provide very different explanations for two nearby data points. The definition of neighborhoods is also very vague. Moreover, LIME may provide an incorrect explanation when there is a small difference between the training and testing data. LIME is also sensitive to the parameters of the explanation model, for example, to the weights of the generated instances, to the number of the generated instances, etc.
The second approach consists of the well-known SHAP (SHapley Additive exPlanations) method [10,11] and its modifications. The method is inspired by game-theoretic Shapley values [12] which can be interpreted as average expected marginal contributions over all possible subsets (coalitions) of features to the black-box model prediction. The SHAP has many advantages, for example, it can be used for local and global explanations in contrast to LIME, but there are also two important shortcomings. The first one is a question as to how to add or remove features in order to implement their subsets as inputs for the black-box model. There are many approaches to removing features, exhaustively described by [13], but the SHAP may be too sensitive to each of them, and there are no strong justifications for their use. Nevertheless, the SHAP can be regarded as the most promising and efficient explanation method.
The second shortcoming is that the SHAP is computationally expensive when there is a large number of features due to considering all possible coalitions, whose number is $2^m$, where $m$ is the number of features. Therefore, the computational time grows exponentially. Several simplifications and approximations have been proposed in order to overcome this difficulty. Some of them are presented by [11,14,15]. One of the simplifications is based on using the ordered permutations of the feature indices and the probability distributions of the features [14]. Another approximation is the quasi-random and adaptive sampling which includes two improvements [15]. The first one is based on exploiting the Monte Carlo integration. The second improvement is based on the optimal number of the perturbations of every feature in accordance with its variance to minimize the overall approximation error. Ref. [15] also proposed to average the local contributions of the values of each feature across all instances. Another interesting approach to simplify the SHAP is the Kernel SHAP [10] which can be regarded as a computationally efficient approximation to the Shapley values in higher dimensions. In order to relax the assumption of the feature independence accepted in the Kernel SHAP, ref. [16] extended the Kernel SHAP method to handle dependent features. Ref. [17] proposed the polynomial-time approximation of the Shapley values, called the Deep Approximate Shapley Propagation method.
In spite of the many approaches to simplify the SHAP, it is difficult to expect a significant simplification from the above modifications of the SHAP. Therefore, a new approach is proposed for simplifying the SHAP method and for reducing the computational expenses for calculating the Shapley values. A key idea behind the proposed approach is to apply a modification of the random subspace method [18] and to consider an ensemble of random SHAPs, called the ensemble of random SHAPs (ER-SHAP). The approach is very similar to the random forests, when an ensemble of randomly built decision trees is used to obtain some average classification or regression measures. Random SHAPs are constructed by a random selection of $t$ features with indices $J_k = (i_1, \ldots, i_t)$ from the instance for an explanation, and the obtained subset of the instance features is analyzed by the SHAP as a separate instance. Repeating this procedure $N$ times, we obtain a set $S = \{S_1, \ldots, S_N\}$ of the Shapley values corresponding to the input subsets of the features, where the $k$-th subset is $S_k = \{\phi_i : i \in J_k\}$. By applying some combination rule for combining the subsets $S_k$ from $S$, we obtain the final Shapley values.
The above general approach considering an ensemble of random SHAPs has several extensions which form the corresponding methods and algorithms. First of all, we can generate points around the analyzed instance and construct $S_k$ for the $k$-th generated point. In this case, every point is assigned a weight depending on the distance from the analyzed point. As a result, we can combine the subsets $S_k$ of the Shapley values with weights which are defined as a function of the distance from the analyzed point. This modification is called the ensemble of random weighted SHAPs (ERW-SHAP).
Another extension or modification is to select features in accordance with a probability distribution to obtain instances consisting of features with indices from the set $J_k$. Let us define the discrete probability distribution over the set of all indices. It can be produced, for example, by using the random forest [19] which plays the role of a feature selection model. Here, the random forest is constructed by using a set of points (instances) locally generated around the explained point. Every decision tree is built by using a single point from the set of the generated points. This modification is called the ensemble of random SHAPs generated by the random forest (ER-SHAP-RF).
In sum, the contribution of this paper can be formulated as follows:
  • A new approach to implementing an ensemble-based SHAP with random subsets of features of the explained instance is proposed.
  • Several combination schemes are studied for aggregating the subsets of the important features obtained by using random SHAPs.
  • The approach is extended by generating random points in the local area around a test instance and computing the subsets of the important features separately for every point. Some kind of diversity is implemented with this extension.
  • Another extension is to use a probability distribution for the random selection of the features, defined by means of the random forest constructed by using the generated points in the local area around a test instance. The preliminary feature selection can be regarded as a pre-training procedure.
Numerous numerical experiments with algorithms implementing the proposed methods on synthetic and real datasets demonstrate their efficiency and properties for the local and global interpretation.
This paper is organized as follows. The related work is in Section 2. The Shapley values and the SHAP method as a powerful tool for local and global explanations are introduced in Section 3. A detailed description of the proposed modifications of the SHAP, including the ER-SHAP, ERW-SHAP, and ER-SHAP-RF, is provided in Section 4. The numerical experiments with synthetic data and real data using the local interpretation by means of the proposed models and their comparison with the standard SHAP method are given in Section 5. The concluding remarks can be found in Section 6.

2. Related Work

The increasing importance of machine learning models and algorithms leads to the development of new explanation methods taking into account the various peculiarities of applied problems. Among the various approaches, we consider the local interpretation models which aim to explain a specific decision or a prediction obtained for a single instance. The local interpretation is especially important in medicine where a diagnosis of a patient has to be confirmed. As a result, many models of the local interpretation have been proposed. The success and simplicity of the LIME interpretation method resulted in the development of several of its modifications, for example, ALIME [20], Anchor LIME [21], LIME-Aleph [22], GraphLIME [23], SurvLIME [24], etc. A comprehensive analysis of LIME, including the study of its applicability to different data types, for example, text and image data, was provided by [25]. The same analysis for tabular data was proposed by the same authors [26]. An image version of LIME with its deep theoretical study was presented by [27]. An interesting information-theoretic justification of interpretation methods on the basis of the concept of the explainable empirical risk minimization was proposed by [28].
In order to relax the linearity condition for the local interpretation models like LIME and to adequately approximate a black-box model, several interpretation methods based on using Generalized Additive Models [29] were proposed [30,31,32,33]. Another interesting class of models based on using a linear combination of neural networks, such that a single feature is fed to each network, was proposed by [34]. The impact of every feature on the prediction in these models is determined by its corresponding shape function obtained by each neural network. Following the ideas behind these interpretation models, [35] proposed a similar model. In contrast to the method proposed by [34], an ensemble of gradient boosting machines was used in [35] instead of neural networks in order to simplify the explanation model training process.
Another explanation method was the SHAP [10,11], which takes a game-theoretic approach for optimizing a regression loss function based on the Shapley values. General questions of the computational efficiency of the SHAP were investigated by [36]. Ref. [37] proposed the generalized SHAP method which allows us to compute the feature importance of any function of a model’s output. Ref. [38] presented an approach to applying the SHAP to ensemble models. The problem of explaining the predictions of graph neural networks by using the SHAP was considered by [39]. Ref. [40] introduced the so-called off- and on-manifold Shapley values for high-dimensional multi-type data. The application of the SHAP to the explanation of recurrent neural networks was studied in [41]. Ref. [42] presented a new approach to explaining fairness in machine learning, based on the Shapley value paradigm. Ref. [43] studied how to explain the anomalies detected by autoencoders using the SHAP. The problem of explaining the anomalies detected by a PCA was also considered by [44]. Ref. [45] proposed the X-SHAP which extends one of the approximations of the SHAP called the Kernel SHAP [10]. The SHAP was also applied to the problems of explaining individual predictions when features are dependent [16] or when features are mixed [46]. The SHAP was used in real applications to explain the predictions of the black-box models, for example, it was used to rank the failure modes of reinforced concrete columns and to explain why a machine learning model predicts a specific failure mode for a given sample [47]. It was also used in chemoinformatics and medicinal chemistry [48]. An interesting application of the SHAP in the desirable interpretation of the machine learning-based model results for identifying m7G sites in the gene expression analysis was proposed by [49]. The basic problems of the SHAP were also analyzed by [50].
Many other interpretation methods, their analyses, and critical reviews can also be found in survey papers [1,2,3,6,51,52,53,54,55].

3. Shapley Values and the Explanation Model

One of the most powerful approaches to explaining predictions of black-box machine learning models is the approach based on using the Shapley values [12], a key concept in coalitional games. According to this concept, the total gain of a game is distributed to players such that desirable properties, including efficiency, symmetry, and linearity, are fulfilled. In the framework of machine learning, the gain can be viewed as the machine learning model prediction or the model output, and a player is a feature of the input data. Hence, the contributions of features to the model prediction can be estimated by the Shapley values. The $i$-th feature importance is defined by the Shapley value
\[ \phi_i(f) = \phi_i = \sum_{S \subseteq N \setminus \{i\}} B(S, N) \left[ f(S \cup \{i\}) - f(S) \right], \]
where $f(S)$ is the characteristic function in terms of coalitional games, or the black-box model prediction under the condition that a subset $S$ of features is used as the corresponding input in terms of machine learning; $N$ is the set of all features; $B(S, N)$ is defined as
\[ B(S, N) = \frac{|S|! \, \left( |N| - |S| - 1 \right)!}{|N|!}. \]
It can be seen from the above expression that the Shapley value $\phi_i$ can be regarded as the average contribution of the $i$-th feature across all possible permutations of the feature set.
The Shapley value has the following important properties:
Efficiency. The total gain is distributed as $\sum_{k=0}^{m} \phi_k = f(x)$.
Symmetry. If two players with numbers $i$ and $j$ make equal contributions, i.e., $f(S \cup \{i\}) = f(S \cup \{j\})$ for all subsets $S$ which contain neither $i$ nor $j$, then $\phi_i = \phi_j$.
Dummy. If a player makes zero contributions, i.e., $f(S \cup \{j\}) = f(S)$ for a player $j$ and all $S \subseteq N \setminus \{j\}$, then $\phi_j = 0$.
Linearity. A linear combination of multiple games $f_1, \ldots, f_n$, represented as $f(S) = \sum_{k=1}^{n} c_k f_k(S)$, has gains derived from $f$: $\phi_i(f) = \sum_{k=1}^{n} c_k \phi_i(f_k)$ for every $i$.
Let us consider a machine learning problem. Suppose that there is a dataset $\{(x_1, y_1), \ldots, (x_n, y_n)\}$ of $n$ points $(x_i, y_i)$, where $x_i \in \mathcal{X} \subseteq \mathbb{R}^m$ is a feature vector consisting of $m$ features, and $y_i$ is the observed output for the feature vector $x_i$ such that $y_i \in \mathbb{R}$ in the regression problem and $y_i \in \{1, 2, \ldots, T\}$ in the classification problem with $T$ classes. If the task is to interpret or to explain the prediction $f(x^*)$ of the model at a local feature vector $x^*$, then the prediction $f(x^*)$ can be represented by using the Shapley values as follows [10,11]:
\[ f(x^*) = \phi_0 + \sum_{j=1}^{m} \phi_j^*, \]
where $\phi_0 = E[f(x)]$ and $\phi_j^*$ is the value $\phi_j$ for the prediction at $x = x^*$.
The above implies that the Shapley values explain the difference between the prediction f ( x * ) and the global average prediction.
One of the crucial questions for implementing the SHAP method is how to remove the features from the subset $N \setminus S$, i.e., how to fill the corresponding input features in order to obtain the predictions $f(S)$ of the black-box model. A detailed description of the various ways of removing features is presented by [13]. One of the ways is simply setting the removed features to zero [56,57] or setting them to user-defined default values [9]. Following this way, features are often replaced with their mean values. Another way removes features by replacing them with samples from a conditional generative model [58]. In the LIME method for tabular data, features are replaced with independent draws from specific distributions [13] such that each distribution depends on the original feature values. These are only a part of all the ways of removing features.
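As an illustration of the above definitions, the following sketch computes the exact Shapley values for a black-box model with a small number of features, using mean imputation of the removed features (one of the removal strategies mentioned above). The function names and the choice of mean imputation are illustrative assumptions rather than part of the original method; predict is assumed to map a 2-D array of instances to a 1-D array of scalar outputs (a regression prediction or a class probability).

import numpy as np
from itertools import combinations
from math import factorial

def shapley_values(predict, x, X_background):
    # Exact Shapley values phi_1, ..., phi_m for the prediction predict(x).
    m = x.shape[0]
    baseline = X_background.mean(axis=0)  # fill values for removed features
    features = list(range(m))

    def value(subset):
        # f(S): prediction on x with the features outside S replaced by their means
        z = baseline.copy()
        z[list(subset)] = x[list(subset)]
        return predict(z.reshape(1, -1))[0]

    phi = np.zeros(m)
    for i in features:
        rest = [j for j in features if j != i]
        for size in range(len(rest) + 1):
            for S in combinations(rest, size):
                # B(S, N) = |S|! (|N| - |S| - 1)! / |N|!
                w = factorial(size) * factorial(m - size - 1) / factorial(m)
                phi[i] += w * (value(S + (i,)) - value(S))
    return phi

Since all $2^m$ subsets are enumerated, such a direct computation is feasible only for a small number of features, which motivates the ensemble-based approximations proposed in the next section.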

4. Modifications of SHAP

4.1. Ensemble of Random SHAPs

In spite of the many approaches to simplify SHAP, it is difficult to expect a significant simplification from the above modifications of SHAP. Therefore, a new approach is proposed for simplifying the SHAP method and for reducing computational expenses for calculating the Shapley values. A key idea behind the proposed approach is to apply a modification of the random subspace method [18] and to consider an ensemble of random SHAPs. The approach is very similar to the random forests when an ensemble of randomly built decision trees is used to obtain some average classification or regression measures.
Suppose that an instance $x \in \mathbb{R}^m$ has to be interpreted under the condition that the black-box model has been trained on the dataset $D = \{(x_1, y_1), \ldots, (x_n, y_n)\}$. A general scheme of the first approach, called the ensemble of random SHAPs (ER-SHAP), is illustrated in Figure 1 for the case $N = 3$. The ER-SHAP is iteratively constructed by randomly selecting $t$ different features $N$ times. The value $t$ is a training parameter. If we refer to random forests, then one of the heuristics is $t \approx \sqrt{m}$. However, the optimal $t$ is obtained by considering many of its values. Suppose that the indices of the features selected at the $k$-th iteration form the set $J_k = (i_1, \ldots, i_t)$. The corresponding vector of $t$ features is regarded as an instance $z_k = (x_{i_1}, \ldots, x_{i_t}) \in \mathbb{R}^t$. The subsets of selected features with indices $J_k$ are shown in Figure 1 as successive features. However, this is only a schematic illustration. Features are randomly selected in accordance with the uniform distribution and can be located at arbitrary places of the vector $x$.
As a result, we have a set of $N$ instances $z_1, \ldots, z_N$. The next step is to use the black-box model and the SHAP to compute the Shapley values for every instance such that the subset $S_k = \{\phi_i^{(k)} : i \in J_k\}$ of the Shapley values $\phi_i^{(k)}$ is produced for the instance $z_k$. Repeating this procedure $N$ times, we obtain a set $S = \{S_1, \ldots, S_N\}$ of the Shapley values corresponding to all $z_k$, $k = 1, \ldots, N$, or all input subsets of features. Having the set $S$, we can apply several combination rules to combining the subsets $S_k$ from $S$. One of the simplest rules is based on averaging the Shapley values over all subsets $S_k$:
\[ \phi_i = \frac{1}{N_i} \sum_{k: i \in J_k} \phi_i^{(k)}, \quad i = 1, \ldots, m, \]
where $N_i$ is the number of times the $i$-th feature was selected over all iterations, i.e., $N_i = \sum_{k: i \in J_k} 1$.
It should be noted that the input of the black-box model has to have $m$ features. Therefore, when performing the SHAP on every $z_k$, the average values of the features over the whole dataset $D$ are used to fill the $m - t$ remaining features, though other methods [13] can also be used to fill these features.
Algorithm 1 can be viewed as a formal scheme implementing ER-SHAP. It is supposed that the black-box model has been already trained.
Algorithm 1 ER-SHAP
Require: Training set $D$; point of interest $x$; the number of iterations $N$; the number of selected features $t$; the black-box model for explaining $f(x)$
Ensure: The Shapley values $S = \{\phi_1, \ldots, \phi_m\}$
1: for $k = 1, \ldots, N$ do
2:  Randomly select $t$ features from $x$ and form the set $J_k$ of indices of the randomly selected features $x_i$, $i \in J_k$
3:  Use the SHAP to compute $\phi_i^{(k)}$, $i \in J_k$, and form the set $S_k = \{\phi_i^{(k)} : i \in J_k\}$
4: end for
5: Combine the sets $S_k$, $k = 1, \ldots, N$, to compute $S$, for example, by simple averaging: $\phi_i = N_i^{-1} \sum_{k: i \in J_k} \phi_i^{(k)}$, where $N_i = \sum_{k: i \in J_k} 1$
If the number of subsets $S$ in the standard SHAP, or the number of differences $f(S \cup \{i\}) - f(S)$ which have to be computed, is $2^m$, then the number of the same differences in the ER-SHAP is $N \cdot 2^t$. For comparison purposes, if we consider a dataset with $m = 25$ and $t = \sqrt{m} = 5$, then $N$ can be taken as $2^{25}/2^5 = 2^{20}$ in order to make the computational complexity of the SHAP and ER-SHAP equal.
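A minimal sketch of Algorithm 1 is given below. It reuses the brute-force shapley_values() helper sketched in Section 3 as the “small” SHAP on $t$-feature sub-instances; the wrapper that pads the remaining $m - t$ features with dataset means, as well as the parameter defaults, are illustrative assumptions rather than the exact implementation used in the paper.

import numpy as np

def er_shap(predict, x, X_train, t=3, N=100, rng=None):
    # ER-SHAP: average the Shapley values of N randomly selected t-feature subsets.
    rng = np.random.default_rng(rng)
    m = x.shape[0]
    means = X_train.mean(axis=0)
    phi_sum = np.zeros(m)
    counts = np.zeros(m)

    for _ in range(N):
        J = rng.choice(m, size=t, replace=False)   # random feature subset J_k

        def predict_sub(Z_sub):
            # Embed t-feature sub-instances into full m-feature inputs,
            # filling the removed features with their dataset means.
            Z = np.tile(means, (Z_sub.shape[0], 1))
            Z[:, J] = Z_sub
            return predict(Z)

        phi_k = shapley_values(predict_sub, x[J], X_train[:, J])  # "small" SHAP
        phi_sum[J] += phi_k
        counts[J] += 1

    counts[counts == 0] = 1          # features that were never drawn keep phi_i = 0
    return phi_sum / counts          # simple averaging combination rule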

4.2. Ensemble of Random Weighted SHAPs

The next algorithm is called the ensemble of random weighted SHAPs (ERW-SHAP) and differs from the ER-SHAP in the following parts. A general scheme is shown in Figure 2. First of all, $N$ points $h_1, \ldots, h_N$ are generated in the neighborhood of the explained instance $x$. These points do not need to belong to the dataset $D$. Then, $t$ features are randomly selected from every $h_k$, and they produce the instances $z_1, \ldots, z_N$. Moreover, the weight $w_k$ of each instance $h_k$ is defined as a function of the distance $d_k$ between the explained instance $x$ and the generated neighbor $h_k$. The weights are used to implement the weighted average of the Shapley values. The final Shapley values are now calculated as follows:
\[ \phi_i = \frac{1}{W_i} \sum_{k: i \in J_k} w_k \phi_i^{(k)}, \quad i = 1, \ldots, m, \]
where $W_i = \sum_{k: i \in J_k} w_k$.
On the one hand, these changes to the ER-SHAP implement a kind of diversity of SHAPs, making the randomly selected feature vectors more independent. On the other hand, the approach is similar to the LIME method, where the analyzed instance is perturbed in order to build an approximating linear model around the instance to be explained. The diversity of SHAPs is a very important peculiarity of the proposed method. It prevents the situation in which the rule for filling the removed features produces feature values coinciding with those of the explained instance, in which case the Shapley values are computed incorrectly. The use of generated neighbors allows us to avoid this case and to obtain more accurate results.
The algorithm implementing the ERW-SHAP differs from Algorithm 1 implementing the ER-SHAP in only two lines. First, after line 1 or before line 2, a line indicating how to generate the neighbors has to be inserted. Second, line 5 (the combination of the Shapley values) is replaced with expression (5).
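The following sketch of the ERW-SHAP follows the same assumptions as the er_shap() sketch above; the Gaussian neighbor generation with scale sigma and the exponential distance-based weight are illustrative choices for the perturbation and weighting functions, not a prescribed implementation.

import numpy as np

def erw_shap(predict, x, X_train, t=3, N=100, sigma=0.1, rng=None):
    # ERW-SHAP: explain perturbed neighbors of x and combine with distance weights.
    rng = np.random.default_rng(rng)
    m = x.shape[0]
    means = X_train.mean(axis=0)
    phi_sum = np.zeros(m)
    weight_sum = np.zeros(m)

    for _ in range(N):
        h = x + rng.normal(scale=sigma, size=m)   # neighbor h_k of the explained point
        w = np.exp(-np.sum((h - x) ** 2))         # weight decreasing with the distance
        J = rng.choice(m, size=t, replace=False)  # random feature subset J_k

        def predict_sub(Z_sub):
            Z = np.tile(means, (Z_sub.shape[0], 1))
            Z[:, J] = Z_sub
            return predict(Z)

        phi_k = shapley_values(predict_sub, h[J], X_train[:, J])
        phi_sum[J] += w * phi_k
        weight_sum[J] += w

    weight_sum[weight_sum == 0] = 1
    return phi_sum / weight_sum                   # weighted averaging of the Shapley values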

4.3. Ensemble of Random SHAPs Generated by the Random Forest

In order to control the process of the random feature selection, it is reasonable to choose the features for producing $z_1, \ldots, z_N$ in accordance with some probability distribution different from the uniform distribution, which would take into account the preliminary importance of the features. The intuition behind this modification is to reduce the selection of unimportant features which a priori do not impact the black-box prediction corresponding to $x$.
One of the ways to implement this control is to compute the preliminary feature importance by means of the random forest. Although it is known that the random forest does not always give acceptable results for the feature selection problem, the proposed approach does not suffer from this drawback because we propose to train the random forest on instances generated in the neighborhood of the explained instance $x$. The next algorithm is called the ensemble of random SHAPs generated by the random forest (ER-SHAP-RF). The random forest plays the role of the important feature selection model. It can also be viewed as some kind of pre-training for the important features. The idea of training the random forest on the generated neighbors allows us to implement a preliminary explanation method. It should be noted that the random forest is not the only model for selecting important features. There are many methods [59] which could be used for solving this task. We use the random forest as one of the popular and simple methods having few parameters. In the same way, the linear regression model could be used instead of the random forest. The random forest can itself be used as an explanation model by applying the approach proposed by [60], based on a scalable method for transforming a decision forest into a single interpretable decision tree.
The LIME method can also be applied to obtain the probability distribution of the features. In this case, the normalized absolute values of the linear regression coefficients can be regarded as the probability distribution of the features.
For solving the feature selection task by random forests, we use the well-known simple method [19]. According to this method, for every tree in the random forest, we compute how much the impurity is decreased by a feature. The more a feature decreases the impurity, the more important the feature is. The impurity decrease is averaged across all trees in the random forest, and the obtained value corresponds to the final importance of the feature.
The proposed approach may lead to small probabilities for unimportant features. However, this does not mean that these features will not be selected for use in an explanation by means of the SHAP. They simply have a smaller chance of being selected, provided that their probabilities are not equal to zero. This implies that the classification or regression models used for constructing the probability distribution $P$ should not provide sparse solutions, such as the Lasso, because only a small part of the features would then take part in the explanation.
A general scheme of the ER-SHAP-RF is shown in Figure 3, where a number, say $M$, of neighbors $h_1, \ldots, h_M$ are generated around the instance $x$ to be explained. Every generated neighbor $h_j$ is fed into the black-box model to obtain its class label $y_j^*$. It should be noted that training instances can also be taken as neighbors. However, they should be classified by using the black-box model in order to take this model into account in the explanation.
Having the points $(h_j, y_j^*)$, we train the random forest, which provides a feature importance measure in the form of a probability distribution $P = (p_1, \ldots, p_m)$. The distribution $P$ is used to select features from the instance $x$ for constructing the vectors $z_1, \ldots, z_N$; namely, $t$ features are selected from $x$ with replacement, $N$ times, in accordance with the distribution $P$. SHAPs are used to find the Shapley values of the vectors $z_1, \ldots, z_N$. They are combined similarly to the ERW-SHAP by means of averaging as follows:
\[ \phi_i = \frac{1}{N_i} \sum_{k: i \in J_k} \phi_i^{(k)}, \quad i = 1, \ldots, m. \]
It is important that the number $N_i$ of the $i$-th feature selections among all $N$ iterations is used instead of $N$.
The whole algorithm can be divided into two stages which are separated in time. In the first stage, the neighbors $h_1, \ldots, h_M$ are generated, the predictions $y_1^*, \ldots, y_M^*$ are obtained from the black-box model, and the random forest providing the probabilities of the features is trained. This stage is depicted by dashed lines in Figure 3. The second stage uses these probabilities for running the SHAPs. This stage is depicted by solid lines in Figure 3.
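A minimal sketch of the first (pre-training) stage is given below: the neighbors are generated, labeled by the black box, a random forest is fitted, and its impurity-based importances are turned into the feature probability distribution $P$. The Gaussian noise scale, the forest parameters, and the function names are assumed values for illustration only.

import numpy as np
from sklearn.ensemble import RandomForestClassifier

def feature_distribution(predict_label, x, M=200, sigma=0.1, rng=None):
    # Stage 1 of ER-SHAP-RF: build the feature probability distribution P.
    rng = np.random.default_rng(rng)
    m = x.shape[0]
    H = x + rng.normal(scale=sigma, size=(M, m))   # neighbors h_1, ..., h_M
    y = predict_label(H)                           # class labels from the black box
    rf = RandomForestClassifier(n_estimators=10, max_depth=None, random_state=0)
    rf.fit(H, y)
    p = rf.feature_importances_                    # impurity-based importances
    if p.sum() == 0:                               # degenerate case: fall back to uniform
        return np.full(m, 1.0 / m)
    return p / p.sum()                             # probability distribution P

In the second stage, the feature indices $J_k$ can be drawn in accordance with $P$, for example via numpy's rng.choice(m, size=t, p=P), and plugged into the sampling loop of the er_shap() sketch above.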
The random forest should be built with a large depth of trees and with a small number of trees in order to avoid a rather sparse probability distribution of the features, in which a large part of the probabilities would be equal or close to zero. Another way of avoiding small probabilities of the features is to apply calibration methods and to recalculate the obtained probabilities, for example, by using temperature scaling as the simplest extension of Platt scaling [61]:
\[ p_k^* = \frac{\exp(p_k / T)}{\sum_{i=1}^{m} \exp(p_i / T)}, \quad k = 1, \ldots, m, \]
where $T$ is the temperature, which controls the smoothness of the probability distribution but does not change the ordering of the probabilities $p_k$, $k = 1, \ldots, m$.
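A small sketch of this temperature-scaling step is shown below; the default temperature value is an assumption, with larger $T$ producing a more uniform distribution and smaller $T$ a sharper one.

import numpy as np

def temperature_scale(p, T=2.0):
    # Recalculate the feature probabilities according to Equation (7).
    z = np.exp(np.asarray(p) / T)
    return z / z.sum()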
Algorithm 2 implementing ER-SHAP-RF can be viewed as an extension of ER-SHAP.
Algorithm 2 ER-SHAP-RF
Require: Training set $D$; point of interest $x$; the number of iterations $N$; the number of selected features $t$; the black-box model for explaining $f(x)$; parameters of the random forest (the number and depth of trees, the number of instances for building trees)
Ensure: The Shapley values $S = \{\phi_1, \ldots, \phi_m\}$
1: Generate $M$ instances $h_1, \ldots, h_M$ from the neighborhood of $x$ or from the whole training set
2: Compute the class label $y_j^* = f(h_j)$ for every generated instance by using the black-box model
3: Train the random forest on $(h_j, y_j^*)$, $j = 1, \ldots, M$
4: Compute the probability distribution $P$ of the features by using the random forest
5: for $k = 1, \ldots, N$ do
6:  Randomly select $t$ features from $x$ in accordance with the probability distribution $P$ and form the index set $J_k$ of features
7:  Use the SHAP to compute $\phi_i^{(k)}$, $i \in J_k$, and form the set $S_k = \{\phi_i^{(k)} : i \in J_k\}$
8: end for
9: Combine the sets $S_k$, $k = 1, \ldots, N$, to compute $S$, for example, by simple averaging: $\phi_i = N_i^{-1} \sum_{k: i \in J_k} \phi_i^{(k)}$, where $N_i = \sum_{k: i \in J_k} 1$
It is interesting to point out that a fourth algorithm could also be proposed as a combination of the ERW-SHAP and ER-SHAP-RF: $N$ points are generated for implementing diversity in accordance with the ERW-SHAP, and $M$ points are generated for training the random forest in accordance with the ER-SHAP-RF and for computing the prior probability distribution $P$ of the features. Then, the random features are selected not from the vector $x$, as is done in the ER-SHAP-RF, but from every vector $h_k$, $k = 1, \ldots, N$, with the probability distribution $P$. However, this algorithm is not studied here because it can be regarded as the combination of the ERW-SHAP and ER-SHAP-RF, which are analyzed in detail.
Let us consider the complexity of the models. If we assume that the complexity of the black-box model is $B(m, n)$, the random forest tree depth is $d$, and the number of trees is $T$, then the complexity of the random forest training is $O(T \cdot m \cdot M \cdot \log(M))$, and the complexity of the random forest prediction is $O(T \cdot d \cdot M)$. The complexity of the SHAP is $O(2^m \cdot B(m, n))$. The complexity of the ER-SHAP is $O(2^t \cdot N \cdot B(m, n))$. It follows from the above that the ER-SHAP is more efficient than the SHAP when $2^m > 2^t \cdot N$ or $m > t + \log_2(N)$. The complexity of the ER-SHAP-RF is
\[ O\left( 2^t \cdot N \cdot B(m, n) + T \cdot m \cdot M \cdot \log(M) \cdot B(m, n) + T \cdot d \cdot M \right). \]
It can be seen from the above that the complexity of the random forest training and prediction does not significantly impact the complexity of the ER-SHAP-RF in comparison with the complexity of the ER-SHAP. The same can be said about the ERW-SHAP.
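As a quick numerical check of the condition $m > t + \log_2(N)$ above, the following snippet compares the number of prediction differences required by the SHAP and the ER-SHAP; the value of $N$ is an arbitrary illustrative choice.

import math

m, t, N = 25, 5, 1000                 # N is an illustrative value
shap_cost = 2 ** m                    # differences f(S ∪ {i}) - f(S) in the SHAP
er_shap_cost = N * 2 ** t             # the same differences in the ER-SHAP
print(shap_cost, er_shap_cost, m > t + math.log2(N))   # 33554432, 32000, True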

5. Numerical Experiments

First, we consider several numerical examples for which the training instances are randomly generated. Each generated synthetic instance consists of 5 features. Two features are generated as shown in Figure 4, and the other features are uniformly generated in the interval $[-1, 1]$. Each picture in Figure 4 corresponds to a certain location of the instances of two classes such that the instances of classes 0 and 1 are depicted by small triangles and crosses, respectively. This generation corresponds to the case when the first two features may be important. These features allow us to analyze the feature importance in accordance with the data location and with the separating function. The other features are not important, and they are used to generalize the numerical experiments with the synthetic data.
The separating functions in Figure 4 are obtained by means of the SVM, which can be regarded as the black-box model. It uses the RBF kernel, whose parameter depends on the dataset it is trained on. The SVM allows us to obtain different separating functions by changing the kernel parameter. Figure 4a illustrates the linearly separable case. A specific class area in the form of a stripe is shown in Figure 4b. A saw-like separating function is used in Figure 4c. A class area in the form of a wedge is given in Figure 4d. A checkerboard, with an attempt of the SVM to separate the checkerboard cells, can be found in Figure 4e. For every generated dataset from Figure 4, we compare the SHAP with the proposed modifications.
Measures for comparison: In order to compare the proposed modifications with the original SHAP method, we use the concordance index $C$ of pairs, which is defined as the number of concordant pairs of the Shapley values divided by the total number of possible evaluation pairs. Let $\phi_i^*$ and $\phi_i$ be the Shapley values obtained by means of the original SHAP method and one of its modifications (ER-SHAP, ERW-SHAP, ER-SHAP-RF), respectively. Two pairs of the Shapley values $(\phi_i, \phi_j)$ and $(\phi_i^*, \phi_j^*)$ are concordant if $\phi_i > \phi_j$ and $\phi_i^* > \phi_j^*$, or $\phi_i < \phi_j$ and $\phi_i^* < \phi_j^*$. In contrast to the well-known C-index in survival analysis, the introduced concordance index compares the predictions provided by two methods. If the index is close to 1, then the models provide the same results. The motivation for introducing the concordance index is that the Shapley values computed by the original SHAP and the proposed modifications may differ in magnitude. However, we are interested in their relationship. If the original SHAP method gives the inequality $\phi_i^* > \phi_j^*$ for some $i$ and $j$, then we expect to have $\phi_i > \phi_j$ for the proposed method, but not the equalities $\phi_i^* = \phi_i$ and $\phi_j^* = \phi_j$. It should be noted that the original SHAP may provide incorrect results. Therefore, the introduced concordance index should be viewed as a desirable measure under the condition of correct SHAP results.
We use the Kernel SHAP [10] in the numerical experiments and compare the obtained results with it.
In spite of the importance of the concordance index, we also use the normalized Euclidean distance $E$ between the vectors $(\phi_1^*, \ldots, \phi_m^*)$ and $(\phi_1, \ldots, \phi_m)$. The distance shows how close the absolute Shapley values of the two methods are to each other. It is important to take into account that the Shapley values in the original SHAP method satisfy the efficiency property, i.e., $\phi_1^* + \cdots + \phi_m^* = f(x) - f(\emptyset)$. This property is not fulfilled for the modifications because they do not enumerate all subsets of features. Therefore, in order to consider the Shapley values on the same scale, all values $\phi_i$ and $\phi_i^*$ are normalized to lie in the interval $[0, 1]$.
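A minimal sketch of the two comparison measures is given below; the min–max scaling used for the normalized distance is an assumed choice of normalization to $[0, 1]$, and the function names are illustrative.

import numpy as np
from itertools import combinations

def concordance_index(phi_ref, phi):
    # Proportion of feature pairs ordered identically by the two explanation methods.
    pairs = list(combinations(range(len(phi_ref)), 2))
    concordant = sum(
        (phi_ref[i] - phi_ref[j]) * (phi[i] - phi[j]) > 0 for i, j in pairs
    )
    return concordant / len(pairs)

def normalized_distance(phi_ref, phi):
    # Euclidean distance between the [0, 1]-normalized Shapley value vectors.
    def scale(v):
        v = np.asarray(v, dtype=float)
        return (v - v.min()) / (v.max() - v.min())
    return np.linalg.norm(scale(phi_ref) - scale(phi))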

5.1. ER-SHAP

First, we consider the results of the numerical experiments obtained by means of the ER-SHAP with the SVM as a black-box model trained on the datasets shown in Figure 4. The explained instance for the experiments has all features equal to 0.25. The concordance indices of the ER-SHAP as functions of the number of iterations $N$ for the numbers of selected features $t = 2$ (the solid line) and $t = 3$ (the dashed line) are illustrated in Figure 5, where pictures (a–e) correspond to pictures (a–e) shown in Figure 4. It can be seen from Figure 5 that the concordance index increases with $N$ on average. This implies that the ER-SHAP provides results comparable with the SHAP. It can also be seen from the pictures that the concordance index is significantly larger for $t = 3$ in comparison to the case of $t = 2$. This observation is expected because a larger number of selected features in each iteration brings the modification closer to the original SHAP method. Nevertheless, one can see from Figure 5b that the case $t = 2$ provides a better concordance index at $N = 7$.
Figure 6 illustrates how the Euclidean distances between the ER-SHAP and SHAP as functions of the number of iterations N for t = 2 (the solid line) and 3 (the dashed line) decrease with N. We again consider five training sets, shown in Figure 4.
In order to explicitly illustrate how close the Shapley values $\phi_i^*$ and $\phi_i$ obtained by the SHAP and ER-SHAP, respectively, are to each other, we show the Shapley values for all five cases in Figure 7. It can be seen that, despite the difference in the absolute values, the Shapley values indicate the same important features.

5.2. ERW-SHAP

To study the ERW-SHAP, the features of the explained instance are perturbed with normally distributed noise with zero mean and standard deviations of 0.01 and 0.1. The weights of the generated instances $h_i$ are defined by
\[ w_i = \exp\left( -\left\| h_i - x \right\|^2 \right). \]
We consider similar results of the numerical experiments obtained by means of the ERW-SHAP with the SVM as a black-box model trained on the datasets shown in Figure 4 with the same explained instance. The concordance indices of the ERW-SHAP as functions of $N$ for $t = 2$ (the solid line) and $t = 3$ (the dashed line) are illustrated in Figure 8, where pictures (a–e) correspond to pictures (a–e) shown in Figure 4. The standard deviation of the normal distribution generating the noise is 0.01. If we compare the concordance indices for the ERW-SHAP (Figure 8) and for the ER-SHAP (Figure 5), then it is obvious that the ERW-SHAP provides better results in comparison to the ER-SHAP for most of the datasets.
At the same time, the Euclidean distances between the SHAP and ERW-SHAP slightly differ from the same distances between the SHAP and ER-SHAP. This follows from Figure 9 where the Euclidean distances between the ERW-SHAP and SHAP as functions of N for t = 2 (the solid line) and 3 (the dashed line) are presented for the above datasets.
To illustrate how the Shapley values ϕ i * and ϕ i obtained by the SHAP and ERW-SHAP, respectively, are close to each other, we show the Shapley values for the five cases in Figure 10 and Figure 11. Figure 10 and Figure 11 provide results under the condition that the normal distribution of the generated noise has the standard deviations 0.1 and 0.01 , respectively. We again observe that the ERW-SHAP can be regarded as a good approximation of the SHAP because the Shapley values of the ERW-SHAP and SHAP are very close to each other.

5.3. ER-SHAP-RF

We again study the modification by using the datasets shown in Figure 4. The result shows that the ER-SHAP-RF outperforms the ER-SHAP as well as the ERW-SHAP for most of the datasets. Indeed, if we compare the concordance indices for the ER-SHAP-RF (Figure 12) with the ERW-SHAP (Figure 8) and ER-SHAP (Figure 5), then we see that all the examples provide better results. In contrast to the concordance indices, the Euclidean distances shown in Figure 13 demonstrate worse results. At the same time, the Shapley values given in Figure 14 almost coincide with the corresponding values obtained by means of the ERW-SHAP (Figure 11). It should be noted that a more accurate tuning of the random forest might provide outperforming results.
Let us summarize the numerical results obtained on the synthetic data for the models ER-SHAP, ERW-SHAP, and ER-SHAP-RF. First, we compare the C-indices corresponding to the models, which are depicted in Figure 5, Figure 8, and Figure 12. It can clearly be seen from the results that the ER-SHAP-RF outperforms the ER-SHAP as well as the ERW-SHAP for all five datasets. The ERW-SHAP outperforms the ER-SHAP for the datasets depicted in Figure 4b,c,e. However, the ERW-SHAP is inferior to the ER-SHAP for the datasets depicted in Figure 4a,d. Moreover, Figure 8 shows that the C-index of the ERW-SHAP behaves unstably. However, if we compare the ER-SHAP and ERW-SHAP using the Euclidean distances between the results provided by these models and the SHAP, then we can conclude that the ERW-SHAP outperforms the ER-SHAP for all the datasets. It is interesting to point out that the ER-SHAP-RF outperforms the ERW-SHAP only for the first dataset (see Figure 4a) if we consider the Euclidean distances. However, we have mentioned that the Euclidean distance cannot be viewed as the best measure for a comparison of the explainable models. Therefore, we can conclude that the ER-SHAP-RF provides the best results, though this model requires generating neighbors and training the random forest.

5.4. Boston Housing Dataset

Let us consider the real data called the Boston Housing dataset. It can be obtained from the StatLib archive (http://lib.stat.cmu.edu/datasets/boston, accessed on 2 November 2022). The Boston Housing dataset consists of 506 instances such that each instance is described by 13 features.
The heatmap reflecting the concordance index of the ER-SHAP for the Boston Housing dataset is shown in Figure 15. Each element at position $(i, j)$, where $i$ and $j$ are the numbers of the row and column, respectively, indicates the value of the concordance index. Each row corresponds to the number of iterations $N$, and each column corresponds to the number of selected features $t$. It can be seen from Figure 15 that the concordance index increases with $N$ and $t$. This implies that the ER-SHAP provides results coinciding with the SHAP for rather large numbers of iterations $N$. Figure 16 illustrates how much the computation time $\tau_{\text{SHAP}}$ of the SHAP exceeds the computation time $\tau_{\text{ER-SHAP}}$ of the ER-SHAP. The heatmap shows the ratio $\tau_{\text{ER-SHAP}} / \tau_{\text{SHAP}}$. One can see from Figure 16 a clear advantage of using the ER-SHAP from the computational point of view.
Figure 17 shows the heatmap of the concordance index of the ERW-SHAP for the Boston Housing dataset. It is clearly seen from Figure 17 that the introduction of weights and generated instances significantly improves the approximation.
The Shapley values obtained by means of the ER-SHAP and SHAP as well as the ERW-SHAP and SHAP are shown in Figure 18 and Figure 19, respectively. One can see from Figure 18 and Figure 19 that the ERW-SHAP can be viewed as a better approximation of the SHAP because the corresponding bars almost coincide, as shown in Figure 19. It should be noted that the Shapley values provided by the ER-SHAP also behave like values of the SHAP (see Figure 18), but they do not coincide for the most important features.
Figure 20 and Figure 21 illustrate the heatmaps of the concordance index of the ER-SHAP-RF for the Boston Housing dataset. They are obtained without using the temperature scaling in accordance with (7) and with this calibration method, respectively. It is interesting to observe from Figure 20 and Figure 21 that the use of the calibration leads to a more contrasting heatmap and to an obvious improvement in the approximation quality.

5.5. Breast Cancer Dataset

The next real dataset is the Breast Cancer Wisconsin (Diagnostic) dataset. It can be found in the well-known UCI Machine Learning Repository (https://archive.ics.uci.edu, accessed on 2 November 2022). The Breast Cancer dataset contains 569 instances such that each instance is described by 30 features. For the breast cancer diagnosis, the malignant and benign classes are assigned the labels 0 and 1, respectively. We consider the corresponding model in the framework of regression with outcomes in the form of probabilities from 0 (malignant) to 1 (benign).
The heatmaps given in Figure 22 and Figure 23 are similar to the same heatmaps obtained for the Boston Housing dataset (Figure 15 and Figure 16). It is interesting to observe from Figure 23 that there are $N$ and $t$ such that the ratio $\tau_{\text{ER-SHAP}} / \tau_{\text{SHAP}}$ is larger than 1. This implies that the SHAP is computationally simpler in comparison to the ER-SHAP in these cases. However, these cases take place only for large values of $N$ and $t$.
At first glance, it is difficult to evaluate from Figure 24 whether the ERW-SHAP provides better results than the ER-SHAP. Figure 24 shows the heatmap of the concordance index of the ERW-SHAP for the Breast Cancer dataset. However, we can see that the color scale in Figure 24 spans the interval $[0.4, 0.95]$, whereas the color scale in Figure 22 spans the interval $[0.4, 0.9]$. This implies that the ERW-SHAP outperforms the ER-SHAP in this numerical example.
The Shapley values for all the features of the Breast Cancer dataset, which are obtained by means of the ER-SHAP and SHAP, are shown in Figure 25. The similar values obtained by means of the ERW-SHAP and SHAP are shown in Figure 26. One can see from Figure 25 and Figure 26 that the Shapley values obtained by means of the ERW-SHAP better approximate the SHAP Shapley values. For example, if we look at the feature “worst radius”, which is important according to the original SHAP method, then the ER-SHAP provides an incorrect result, whereas the ERW-SHAP is totally consistent with the SHAP.
Figure 27 and Figure 28 illustrate the heatmaps of the concordance index of the ER-SHAP-RF for the Breast Cancer dataset. They show results similar to the results obtained for the Boston Housing dataset demonstrated in Figure 20 and Figure 21, respectively. This implies that the use of “pre-training” in the form of the random forest combined with the calibration method leads to a better approximation.
We also compare the proposed models with the SHAP and Kernel SHAP by using the following datasets. The California Housing dataset, obtained from the StatLib repository, consists of 20,640 instances such that each instance is described by eight features. It can be found at https://www.dcc.fc.up.pt/~ltorgo/Regression/cal_housing, accessed on 2 November 2022. The KDD Coil 7 dataset consists of 282 instances such that each instance is described by 11 features. The PBC dataset has 276 instances with 18 features. The Plasma Retinol dataset has 315 instances with 13 features. The Cholesterol dataset consists of 297 instances such that each instance is described by 13 features. The datasets KDD Coil 7, PBC, Plasma Retinol, and Cholesterol can be found at https://www.openml.org/search?type=data, accessed on 2 November 2022. We compare the proposed models with the Kernel SHAP and with the SHAP on these datasets by using the C-index. The number of iterations $N$ and the number of selected features $t$ are taken as 20 and 3, respectively, for all the datasets. The corresponding results are shown in Table 1 and Table 2. It follows from Table 1 and Table 2 that the ER-SHAP, ERW-SHAP, and ER-SHAP-RF provide almost the same results as the Kernel SHAP and SHAP because the values of the C-index are close to 1.

6. Conclusions

It is important to note that only three modifications of the ensemble-based SHAP have been presented. At the same time, many additional modifications of the general approach based on constructing the ensemble of SHAPs can be developed following the proposed modifications and the idea of the ensemble-based approximation.
First of all, the model of the feature selection used in the ER-SHAP-RF for “pre-training” can be changed. There are many methods solving the feature selection problem. Moreover, simple explanation methods can also be applied to the preliminary selection of the important features and to computing their probability distribution.
Second, various rules different from averaging can be applied to combining the results of the SHAPs, for example, the largest (smallest) Shapley values can be computed for providing pessimistic (optimistic) decisions.
The ensemble-based approach can be applied to an explanation of the classification as well as regression black-box models. It gives many opportunities for developing new methods which can be viewed as directions for further research. The proposed approach can be applied to local and global explanations. However, its main advantage is that it significantly reduces the computation time for solving the explanation problem.

Author Contributions

Conceptualization, L.U. and A.K.; methodology, L.U.; software, A.K.; validation, L.U. and A.K.; formal analysis, L.U.; investigation, A.K.; resources, L.U.; data curation, A.K.; writing—original draft preparation, L.U.; writing—review and editing, A.K.; visualization, A.K.; supervision, L.U.; project administration, L.U.; funding acquisition, L.U. All authors have read and agreed to the published version of the manuscript.

Funding

The research is partially funded by the Ministry of Science and Higher Education of the Russian Federation as part of the World-class Research Center program: Advanced Digital Technologies (contract No. 075-15-2020-934 dated 17 November 2020).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

The authors would like to express their appreciation to the anonymous referees whose very valuable comments have improved the paper.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
LIME	Local Interpretable Model-Agnostic Explanation
SHAP	SHapley Additive exPlanations
ER-SHAP	Ensemble of Random SHAPs
ERW-SHAP	Ensemble of Random Weighted SHAPs
ER-SHAP-RF	Ensemble of Random SHAPs generated by the Random Forest
SVM	Support Vector Machine
RBF	Radial Basis Function
C-index	Concordance index

References

  1. Belle, V.; Papantonis, I. Principles and Practice of Explainable Machine Learning. arXiv 2020, arXiv:2009.11698.
  2. Guidotti, R.; Monreale, A.; Ruggieri, S.; Turini, F.; Giannotti, F.; Pedreschi, D. A Survey of Methods for Explaining Black Box Models. ACM Comput. Surv. 2019, 51, 93.
  3. Liang, Y.; Li, S.; Yan, C.; Li, M.; Jiang, C. Explaining the black-box model: A survey of local interpretation methods for deep neural networks. Neurocomputing 2021, 419, 168–182.
  4. Marcinkevics, R.; Vogt, J. Interpretability and Explainability: A Machine Learning Zoo Mini-tour. arXiv 2020, arXiv:2012.01805.
  5. Molnar, C. Interpretable Machine Learning: A Guide for Making Black Box Models Explainable. 2019. Available online: https://christophm.github.io/interpretable-ml-book/ (accessed on 2 November 2022).
  6. Xie, N.; Ras, G.; van Gerven, M.; Doran, D. Explainable Deep Learning: A Field Guide for the Uninitiated. arXiv 2020, arXiv:2004.14545.
  7. Zablocki, E.; Ben-Younes, H.; Perez, P.; Cord, M. Explainability of deep vision-based autonomous driving systems: Review and challenges. arXiv 2021, arXiv:2101.05307.
  8. Zhang, Y.; Tino, P.; Leonardis, A.; Tang, K. A Survey on Neural Network Interpretability. arXiv 2020, arXiv:2012.14261.
  9. Ribeiro, M.; Singh, S.; Guestrin, C. “Why Should I Trust You?” Explaining the Predictions of Any Classifier. arXiv 2016, arXiv:1602.04938v3.
  10. Lundberg, S.; Lee, S.I. A unified approach to interpreting model predictions. In Proceedings of the Advances in Neural Information Processing Systems, Long Beach, CA, USA, 4–9 December 2017; pp. 4765–4774.
  11. Strumbelj, E.; Kononenko, I. An Efficient Explanation of Individual Classifications using Game Theory. J. Mach. Learn. Res. 2010, 11, 1–18.
  12. Shapley, L. A value for n-person games. In Contributions to the Theory of Games; Annals of Mathematics Studies 28; Princeton University Press: Princeton, NJ, USA, 1953; Volume II, pp. 307–317.
  13. Covert, I.; Lundberg, S.; Lee, S.I. Explaining by Removing: A Unified Framework for Model Explanation. arXiv 2020, arXiv:2011.14878.
  14. Strumbelj, E.; Kononenko, I. A General Method for Visualizing and Explaining Black-Box Regression Models. In Proceedings of the Adaptive and Natural Computing Algorithms, ICANNGA 2011, Ljubljana, Slovenia, 14–16 April 2011; Lecture Notes in Computer Science; Springer: Berlin, Germany, 2011; Volume 6594, pp. 21–30.
  15. Strumbelj, E.; Kononenko, I. Explaining prediction models and individual predictions with feature contributions. Knowl. Inf. Syst. 2014, 41, 647–665.
  16. Aas, K.; Jullum, M.; Loland, A. Explaining individual predictions when features are dependent: More accurate approximations to Shapley values. arXiv 2019, arXiv:1903.10464.
  17. Ancona, M.; Oztireli, C.; Gros, M. Explaining Deep Neural Networks with a Polynomial Time Algorithm for Shapley Values Approximation. arXiv 2019, arXiv:1903.10992.
  18. Ho, T. The Random Subspace Method for Constructing Decision Forests. IEEE Trans. Pattern Anal. Mach. Intell. 1998, 20, 832–844.
  19. Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32.
  20. Shankaranarayana, S.; Runje, D. ALIME: Autoencoder Based Approach for Local Interpretability. arXiv 2019, arXiv:1909.02437.
  21. Ribeiro, M.; Singh, S.; Guestrin, C. Anchors: High-precision model-agnostic explanations. In Proceedings of the AAAI Conference on Artificial Intelligence, New Orleans, LA, USA, 2–7 February 2018; pp. 1527–1535.
  22. Rabold, J.; Deininger, H.; Siebers, M.; Schmid, U. Enriching Visual with Verbal Explanations for Relational Concepts: Combining LIME with Aleph. arXiv 2019, arXiv:1910.01837v1.
  23. Huang, Q.; Yamada, M.; Tian, Y.; Singh, D.; Yin, D.; Chang, Y. GraphLIME: Local Interpretable Model Explanations for Graph Neural Networks. arXiv 2020, arXiv:2001.06216.
  24. Kovalev, M.; Utkin, L.; Kasimov, E. SurvLIME: A method for explaining machine learning survival models. Knowl.-Based Syst. 2020, 203, 106164.
  25. Garreau, D.; von Luxburg, U. Explaining the Explainer: A First Theoretical Analysis of LIME. arXiv 2020, arXiv:2001.03447.
  26. Garreau, D.; von Luxburg, U. Looking Deeper into Tabular LIME. arXiv 2020, arXiv:2008.11092.
  27. Garreau, D.; Mardaoui, D. What does LIME really see in images? arXiv 2021, arXiv:2102.06307.
  28. Jung, A. Explainable Empirical Risk Minimization. arXiv 2020, arXiv:2009.01492.
  29. Hastie, T.; Tibshirani, R. Generalized Additive Models; CRC Press: Boca Raton, FL, USA, 1990; Volume 43.
  30. Chang, C.H.; Tan, S.; Lengerich, B.; Goldenberg, A.; Caruana, R. How Interpretable and Trustworthy are GAMs? arXiv 2020, arXiv:2006.06466.
  31. Lou, Y.; Caruana, R.; Gehrke, J. Intelligible Models for Classification and Regression. In Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Beijing, China, 12–16 August 2012; pp. 150–158.
  32. Nori, H.; Jenkins, S.; Koch, P.; Caruana, R. InterpretML: A Unified Framework for Machine Learning Interpretability. arXiv 2019, arXiv:1909.09223.
  33. Zhang, X.; Tan, S.; Koch, P.; Lou, Y.; Chajewska, U.; Caruana, R. Axiomatic Interpretability for Multiclass Additive Models. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, Anchorage, AK, USA, 4–8 August 2019; pp. 226–234.
  34. Agarwal, R.; Frosst, N.; Zhang, X.; Caruana, R.; Hinton, G. Neural Additive Models: Interpretable Machine Learning with Neural Nets. arXiv 2020, arXiv:2004.13912.
  35. Konstantinov, A.; Utkin, L. Interpretable machine learning with an ensemble of gradient boosting machines. arXiv 2020, arXiv:2010.07388.
  36. den Broeck, G.; Lykov, A.; Schleich, M.; Suciu, D. On the Tractability of SHAP Explanations. arXiv 2020, arXiv:2009.08634v2.
  37. Bowen, D.; Ungar, L. Generalized SHAP: Generating multiple types of explanations in machine learning. arXiv 2020, arXiv:2006.07155v2.
  38. Rozemberczki, B.; Sarkar, R. The Shapley Value of Classifiers in Ensemble Games. arXiv 2021, arXiv:2101.02153.
  39. Yuan, H.; Yu, H.; Wang, J.; Li, K.; Ji, S. On Explainability of Graph Neural Networks via Subgraph Explorations. arXiv 2021, arXiv:2102.05152.
  40. Frye, C.; de Mijolla, D.; Cowton, L.; Stanley, M.; Feige, I. Shapley-based explainability on the data manifold. arXiv 2020, arXiv:2006.01272.
  41. Bento, J.; Saleiro, P.; Cruz, A.; Figueiredo, M.; Bizarro, P. TimeSHAP: Explaining Recurrent Models through Sequence Perturbations. arXiv 2020, arXiv:2012.00073.
  42. Begley, T.; Schwedes, T.; Frye, C.; Feige, I. Explainability for fair machine learning. arXiv 2020, arXiv:2010.07389.
  43. Antwarg, L.; Miller, R.; Shapira, B.; Rokach, L. Explaining Anomalies Detected by Autoencoders Using SHAP. arXiv 2020, arXiv:1903.02407v2.
  44. Takeishi, N. Shapley Values of Reconstruction Errors of PCA for Explaining Anomaly Detection. arXiv 2019, arXiv:1909.03495.
  45. Bouneder, L.; Leo, Y.; Lachapelle, A. X-SHAP: Towards multiplicative explainability of Machine Learning. arXiv 2020, arXiv:2006.04574. [Google Scholar]
  46. Redelmeier, A.; Jullum, M.; Aas, K. Explaining Predictive Models with Mixed Features Using Shapley Values and Conditional Inference Trees. In Proceedings of the Machine Learning and Knowledge Extraction. CD-MAKE 2020, Dublin, Ireland, 25–28 August 2020; Lecture Notes in Computer Science. Springer: Cham, Switzerland, 2020; Volume 12279, pp. 117–137. [Google Scholar]
  47. Mangalathu, S.; Hwang, S.H.; Jeon, J.S. Failure mode and effects analysis of RC members based on machinelearning-based SHapley Additive exPlanations (SHAP) approach. Eng. Struct. 2020, 219, 110927. [Google Scholar] [CrossRef]
  48. Rodriguez-Perez, R.; Bajorath, J. Interpretation of machine learning models using Shapley values: Application to compound potency and multi-target activity predictions. J. Comput. Aided Mol. Des. 2020, 34, 1013–1026. [Google Scholar] [CrossRef]
  49. Bi, Y.; Xiang, D.; Ge, Z.; Li, F.; Jia, C.; Song, J. An Interpretable Prediction Model for Identifying N7-Methylguanosine Sites Based on XGBoost and SHAP. Mol. Ther. Nucleic Acids 2020, 22, 362–372. [Google Scholar] [CrossRef]
  50. Kumar, I.; Venkatasubramanian, S.; Scheidegger, C.; Friedler, S. Problems with Shapley-value-based explanations as feature importance measures. In Proceedings of the International Conference on Machine Learning, PMLR, Virtual, 13–18 July 2020; pp. 5491–5500. [Google Scholar]
  51. Adadi, A.; Berrada, M. Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI). IEEE Access 2018, 6, 52138–52160. [Google Scholar] [CrossRef]
  52. Arrieta, A.; Diaz-Rodriguez, N.; Ser, J.D.; Bennetot, A.; Tabik, S.; Barbado, A.; Garcia, S.; Gil-Lopez, S.; Molina, D.; Benjamins, R.; et al. Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI. Inf. Fusion 2020, 58, 82–115. [Google Scholar] [CrossRef] [Green Version]
  53. Carvalho, D.; Pereira, E.; Cardoso, J. Machine Learning Interpretability: A Survey on Methods and Metrics. Electronics 2019, 8, 832. [Google Scholar] [CrossRef] [Green Version]
  54. Das, A.; Rad, P. Opportunities and Challenges in ExplainableArtificial Intelligence (XAI): A Survey. arXiv 2020, arXiv:2006.11371v2. [Google Scholar]
  55. Rudin, C. Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead. Nat. Mach. Intell. 2019, 1, 206–215. [Google Scholar] [CrossRef] [Green Version]
  56. Petsiuk, V.; Das, A.; Saenko, K. RISE: Randomized input sampling for explanation of black-box models. arXiv 2018, arXiv:1806.07421. [Google Scholar]
  57. Zeiler, M.; Fergus, R. Visualizing and understanding convolutional networks. In Proceedings of the ECCV 2014, Zurich, Switzerland, 6–12 September 2014; Springer: Cham, Switzerland, 2014; Volume 8689, pp. 818–833. [Google Scholar]
  58. Yu, J.; Lin, Z.; Yang, J.; Shen, X.; Lu, X.; Huang, T. Generative image inpainting with contextual attention. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–23 June 2018; pp. 5505–5514. [Google Scholar]
  59. Speiser, J.; Miller, M.; Tooze, J.; Ip, E. A comparison of random forest variable selection methods for classification prediction modeling. Expert Syst. Appl. 2019, 134, 93–101. [Google Scholar] [CrossRef]
  60. Sagi, O.; Rokach, L. Explainable decision forest: Transforming a decision forest into an interpretable tree. Inf. Fusion 2020, 61, 124–138. [Google Scholar] [CrossRef]
  61. Chuan, G.; Pleiss, G.; Sun, Y.; Weinberger, K. On calibration of modern neural networks. In Proceedings of the International Conference on Machine Learning, Sydney, Australia, 6–11 August 2017; pp. 1321–1330. [Google Scholar]
Figure 1. A scheme of the ER-SHAP.
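Figure 1 shows the overall scheme of the ER-SHAP. For illustration only, the following sketch (not the authors' implementation) repeats the basic loop of the scheme: at every iteration a subset of t features is sampled at random, a "small" Kernel SHAP is computed over this subset (the remaining features are frozen at the values of the explained instance, which is an assumption of this sketch), and the resulting Shapley values are averaged over the iterations. The `shap` package's KernelExplainer is used, and `predict` is assumed to return a single output per sample (a regression value or the probability of the explained class).

```python
import numpy as np
import shap  # the shap package is assumed to be installed


def er_shap(predict, X_background, x, t=2, n_iter=50, random_state=None):
    """Rough sketch of an ensemble of random "small" SHAPs (not the paper's exact code).

    predict:       black-box prediction function taking a 2D array, one output per sample.
    X_background:  2D background data used by Kernel SHAP.
    x:             the explained instance (1D array).
    t:             number of features sampled at each iteration.
    n_iter:        number of ensemble iterations N.
    """
    rng = np.random.default_rng(random_state)
    m = x.shape[0]
    phi_sum = np.zeros(m)
    counts = np.zeros(m)

    for _ in range(n_iter):
        idx = rng.choice(m, size=t, replace=False)  # random feature subset

        # Restricted model: only the selected features vary, the rest are
        # frozen at the values of the explained instance (an assumption of this sketch).
        def predict_restricted(Z, idx=idx):
            X_full = np.tile(x.astype(float), (Z.shape[0], 1))
            X_full[:, idx] = Z
            return predict(X_full)

        explainer = shap.KernelExplainer(predict_restricted, X_background[:, idx])
        phi_small = explainer.shap_values(x[idx])  # "small" SHAP on t features

        phi_sum[idx] += np.ravel(phi_small)
        counts[idx] += 1

    counts[counts == 0] = 1  # features never sampled keep zero importance
    return phi_sum / counts
```

Averaging only over the iterations in which a feature was actually selected is likewise an implementation choice of this sketch, not a statement of the paper's exact aggregation rule.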
Figure 2. A scheme of the ERW-SHAP.
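Figure 2 shows the scheme of the ERW-SHAP, in which points generated around the explained instance are explained separately and their Shapley values are combined with weights that depend on the distance to the explained instance. A minimal sketch of the neighbor generation and weighting steps is given below; the Gaussian kernel and the bandwidth value are assumptions of this sketch, while the normal perturbations with a given standard deviation correspond to the settings mentioned in the captions of Figures 10, 11, 19 and 26.

```python
import numpy as np


def generate_neighbors(x, n_points=10, std=0.1, random_state=0):
    """Generate points around the explained instance x using normal perturbations."""
    rng = np.random.default_rng(random_state)
    return x + rng.normal(scale=std, size=(n_points, x.shape[0]))


def distance_weights(x, neighbors, bandwidth=0.5):
    """Distance-dependent weights for combining per-neighbor explanations.
    The Gaussian kernel and the bandwidth value are assumptions of this sketch."""
    d = np.linalg.norm(neighbors - x, axis=1)
    w = np.exp(-d ** 2 / (2 * bandwidth ** 2))
    return w / w.sum()


# Usage sketch: explain each neighbor (e.g., with er_shap above) and average the
# Shapley values with the distance-dependent weights:
# phi_k = np.stack([er_shap(predict, X_background, z) for z in neighbors])
# phi = (distance_weights(x, neighbors)[:, None] * phi_k).sum(axis=0)
```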
Figure 3. A scheme of the ER-SHAP-RF.
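Figure 3 shows the scheme of the ER-SHAP-RF, where a random forest provides a preliminary feature probability distribution from which the feature subsets are sampled. The sketch below illustrates how such a distribution could be built from scikit-learn's impurity-based feature importances; the softmax-with-temperature form of the distribution is an assumption of this sketch and only mimics the temperature scaling mentioned for Figures 20, 21, 27 and 28.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier


def feature_sampling_distribution(X_train, y_train, temperature=1.0, random_state=0):
    """Sketch: turn random-forest feature importances into a probability distribution
    over features, from which ER-SHAP-RF samples its feature subsets.
    The softmax-with-temperature form is an assumption, not the paper's exact rule."""
    rf = RandomForestClassifier(n_estimators=100, random_state=random_state)
    rf.fit(X_train, y_train)
    importances = rf.feature_importances_  # non-negative, sum to 1

    # Temperature scaling (cf. Figures 20, 21, 27, 28 and reference [61]): a larger
    # temperature flattens the distribution, so that less important features still
    # get a chance to appear in the sampled subsets.
    logits = np.log(importances + 1e-12) / temperature
    probs = np.exp(logits - logits.max())
    return probs / probs.sum()


# Usage sketch:
# p = feature_sampling_distribution(X_train, y_train, temperature=2.0)
# rng = np.random.default_rng(0)
# subset = rng.choice(len(p), size=2, replace=False, p=p)  # features for one "small" SHAP
```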
Figure 4. Five synthetic datasets and the boundaries between classes provided by the SVM in the form of: (a) a linear separation; (b) a stripe; (c) a saw-shaped separating function; (d) a wedge; (e) the checkerboard cells.
Figure 5. Concordance indices of ER-SHAP as functions of the number of iterations N for t = 2 (solid line) and t = 3 (dashed line), for the trained SVMs and the five datasets depicted in Figure 4a–e.
Figure 6. Euclidean distances between ER-SHAP and SHAP as functions of the number of iterations N for t = 2 (solid line) and t = 3 (dashed line), for the trained SVMs and the five datasets depicted in Figure 4a–e.
Figure 7. Shapley values obtained by means of SHAP and ER-SHAP for all features of the five datasets depicted in Figure 4a–e, with the trained SVMs as black boxes.
Figure 8. Concordance indices of ERW-SHAP as functions of N for t = 2 (solid line) and t = 3 (dashed line), for the five datasets depicted in Figure 4a–e and the trained SVMs.
Figure 9. Euclidean distances between ERW-SHAP and SHAP as functions of N for t = 2 (solid line) and t = 3 (dashed line), for the five datasets depicted in Figure 4a–e and the trained SVMs as black boxes.
Figure 10. Shapley values obtained by means of SHAP and ERW-SHAP for all features of the five datasets depicted in Figure 4a–e and the trained SVMs, when the feature perturbations follow the normal distribution with standard deviation 0.1.
Figure 11. Shapley values obtained by means of SHAP and ERW-SHAP for all features of the five datasets depicted in Figure 4a–e and the trained SVMs, when the feature perturbations follow the normal distribution with standard deviation 0.01.
Figure 12. Concordance indices of ER-SHAP-RF as functions of N for t = 2 (solid line) and t = 3 (dashed line), for the five datasets depicted in Figure 4a–e and the trained SVMs.
Figure 13. Euclidean distances between ER-SHAP-RF and SHAP as functions of N for t = 2 (solid line) and t = 3 (dashed line), for the five datasets depicted in Figure 4a–e and the trained SVMs.
Figure 14. Shapley values obtained by means of SHAP and ER-SHAP-RF for all features of the five datasets depicted in Figure 4a–e and the trained SVMs.
Figure 15. The heatmap of the concordance index C obtained by ER-SHAP for the Boston Housing dataset.
Figure 16. The heatmap illustrating the relationship between the computation times of SHAP and ER-SHAP for the Boston Housing dataset.
Figure 17. The heatmap of the concordance index C obtained by ERW-SHAP for the Boston Housing dataset.
Figure 18. Shapley values obtained by means of SHAP and ER-SHAP for the Boston Housing dataset.
Figure 19. Shapley values obtained by means of SHAP and ERW-SHAP for features of the Boston Housing dataset, when the feature perturbations follow the normal distribution with standard deviation 0.1.
Figure 20. The heatmap of the concordance index C obtained by ER-SHAP-RF for the Boston Housing dataset without temperature scaling.
Figure 21. The heatmap of the concordance index C obtained by ER-SHAP-RF for the Boston Housing dataset with temperature scaling.
Figure 22. The heatmap of the concordance index C obtained by ER-SHAP for the Breast Cancer dataset.
Figure 23. The heatmap illustrating the relationship between the computation times of SHAP and ER-SHAP for the Breast Cancer dataset.
Figure 24. The heatmap of the concordance index C obtained by ERW-SHAP for the Breast Cancer dataset.
Figure 25. Shapley values obtained by means of SHAP and ER-SHAP for features of the Breast Cancer dataset.
Figure 26. Shapley values obtained by means of SHAP and ERW-SHAP for features of the Breast Cancer dataset, when the feature perturbations follow the normal distribution with standard deviation 0.1.
Figure 27. The heatmap of the concordance index C obtained by ER-SHAP-RF for the Breast Cancer dataset without temperature scaling.
Figure 28. The heatmap of the concordance index C obtained by ER-SHAP-RF for the Breast Cancer dataset with temperature scaling.
Table 1. Comparison of the proposed models with Kernel SHAP for several datasets.

                      C-index with Kernel SHAP
Dataset               ER-SHAP   ERW-SHAP   ER-SHAP-RF
California Housing    0.990     0.996      1.000
KDD Coil 7            0.876     0.980      0.982
PBC                   0.926     0.908      0.948
Plasma Retinol        0.954     0.954      0.948
Cholesterol           0.926     0.944      0.974
Table 2. Comparison of the proposed models with SHAP for several datasets.

                      C-index with SHAP
Dataset               ER-SHAP   ERW-SHAP   ER-SHAP-RF
California Housing    0.974     0.984      0.994
KDD Coil 7            0.872     0.978      0.978
PBC                   0.908     0.886      0.934
Plasma Retinol        0.927     0.932      0.944
Cholesterol           0.922     0.938      0.970
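Tables 1 and 2 report how closely the feature orderings produced by ER-SHAP, ERW-SHAP, and ER-SHAP-RF agree with those of Kernel SHAP and SHAP, respectively. Purely for illustration, assuming the C-index is the usual fraction of concordantly ordered feature pairs (the paper's exact definition is given in the main text), it could be computed as follows.

```python
import numpy as np
from itertools import combinations


def concordance_index(phi_a, phi_b):
    """Fraction of feature pairs ordered identically by two importance vectors.
    Illustrative definition only; the paper's exact C-index is defined in the main text."""
    pairs = list(combinations(range(len(phi_a)), 2))
    agree = sum(
        1 for i, j in pairs
        if np.sign(phi_a[i] - phi_a[j]) == np.sign(phi_b[i] - phi_b[j])
    )
    return agree / len(pairs)


# Example: a value close to 1.0, as in the tables, means that the ensemble-based
# modification ranks the features almost exactly as (Kernel) SHAP does.
```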
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
