Distributions of Outputs Given Subsets of Inputs and Dependent Generalized Sensitivity Indices

Matieyendou Lamboni

doi:10.3390/math13050766

¹

Department DFR-ST, University of Guyane, 97346 Cayenne, France

²

228-UMR Espace-Dev, University of Guyane, University of Réunion, IRD, University of Montpellier, 34090 Montpellier, France

Mathematics2025, 13(5), 766;https://doi.org/10.3390/math13050766

This article belongs to the Section D1: Probability and Statistics

Version Notes

Order Reprints

Abstract

Better understanding mathematical and numerical models often requires investigating the impacts of inputs on the model outputs, as well as interactions. Quantifying such effects for models with non-independent input variables (NIVs) relies on conditional distributions of the outputs given every subset of inputs. In this paper, by firstly providing additional dependency models of NIVs, functional outputs are composed by dependency models (yielding equivalent representations of outputs) to derive distributions of outputs conditional on inputs. We then provide an algorithm for selecting the necessary and sufficient equivalent representations that allow for obtaining all the conditional distributions of outputs given every subset of inputs, and for assessing the main, total, and interaction effects (i.e., indices) of every subset of NIVs. Unbiased estimators of covariances of sensitivity functionals and consistent estimators of such indices are derived by distinguishing the case of the multivariate and/or functional outputs, including dynamic models. Finally, analytical results and numerical results are provided, including an illustration based on a dynamic model.

Keywords:

computer models; copulas; dependency models; multivariate sensitivity analysis; non-independent and discrete variables

MSC:

49Q12; 60E05; 62F12

1. Introduction

Mathematical and numerical models and/or simulators are being increasingly developed and used for supporting decision-making in different scientific fields, such as engineering, environment, agronomy, and biology. To better understand such complex models or to design emulators, it is worth investigating the impacts of inputs on the model outputs, including the interactions among inputs. Uncertainty quantification and global sensitivity analysis are dedicated to addressing such issues. While variance-based methods (see, e.g., Sobol’ indices [] for real-valued functions and generalized sensitivity indices [,,,,,] for dynamic and multivariate outputs) are well established for independent inputs, three are cases where the input variables are correlated or dependent.

Non-independent variables (NIVs) arise when two or more variables do not vary freely and are widely encountered in different scientific fields, such as in data analysis, quantitative risk analysis, and inverse problems. Performing uncertainty quantification and variance-based sensitivity analysis of computer and/or mathematical models in the presence of dependent and/or correlated input variables (i.e., NIVs) still remains a challenge when one is interested in assessing the contributions of any subset of input variables and their interactions over all the outputs. Indeed, the dependency structures inferred by the constraints imposed on model outputs and/or inputs, and the dependency structures of the initial distributions of inputs may have significant impacts on the results of Sobol’ indices and generalized sensitivity indices. A number of papers report inconsistent ranking of inputs using the above indices in the presence of NIVs (e.g., [,,,]).

In the presence of NIVs, most existing variance-based methodologies provide the same first-order index of one input or one group of dependent inputs (e.g., [,,,,,]). Some of them provide the total indices of inputs, and there can be cases in which the first-order index is greater than the total index. To be able to rank input variables, the works in [,] introduced a methodology that ensures that the first-order index is always less than the total index for every single input and for real-valued functions. In the same sense, the recent works in [,,] provided in-depth approaches for quantifying the effects of each single input and some subsets of inputs (but not all the subsets) by making use of dependency models of NIVs. Note that dependency models provide functions that model the dependency structures of NIVs ([,,]), and such recent approaches rely on different DMs of the same NIVs for defining and computing dependent generalized sensitivity indices (dGSIs) of some subsets of inputs.

In this paper, we propose a new methodology for (i) deriving all the conditional distributions of outputs given every subset of NIVs and (ii) assessing the impacts of every subset of uncertain NIVs and their interactions on the model outputs by making use of the necessary and sufficient equivalent representations of the model of interest. Equivalent representations are obtained by composing the model outputs by the necessary and sufficient DMs, leading to reducing the number of model runs for computing such quantities of interest.

This paper is organized as follows: in Section 2, we provide additional and generic DMs of NIVs so as to cover more distribution functions, and we investigate the DM transforms, which avoids searching dependency functions for some distributions by making use of known DMs already derived in [,,]. We also provide computational DMs when analytical distributions of the input variables are not available, such as the resultant distribution of inputs associated with complex mathematical models under constraints. Section 3 provides the famous algorithm for selecting the necessary and sufficient DMs. It also deals with different representations of the model outputs associated with subsets of NIVs that are equivalent in distributions using the necessary and sufficient DMs. Such representations are useful for assessing the main, total, and interaction effects of any subset of NIVs in Section 4 by providing dGSIs of every subset of inputs and for the multivariate and/or functional outputs, including spatiotemporal models and dynamic models. Section 5 aims at constructing unbiased estimators of the cross-covariances of sensitivity functionals and the consistent estimators of dGSIs. We also provide the asymptotic distributions of dGSIs. Analytical results and numerical results are provided, including an illustration based on a dynamic model (see Section 6), and we conclude this work in Section 7.

General Notation

For an integer

d > 0

, we use

X

: =

(X_{1}, \dots, X_{d})

for a random vector of NIVs. Given

u \subseteq {1, \dots, d}

, we use

X_{u}

: =

(X_{j}, \forall j \in u)

,

X_{\sim u}

: =

(X_{j}, \forall j \in {1, \dots, d} ∖ u)

and

| u |

for its cardinality, leading to the partition

X = (X_{u}, X_{\sim u})

. We also use

Z \overset{d}{=} X

to say that

Z

and

X

have the same cumulative distribution function (CDF). For

a \in R^{n}

, we use

{||a||}_{L^{2}}

for the Euclidean norm. For a matrix

Σ \in R^{n \times n}

, we use

Tr (Σ)

for the trace of

Σ

, and

{||Σ||}_{F} : = \sqrt{Tr (Σ Σ^{T})}

for the Frobenius norm of

Σ

. We use

E [\cdot]

for the expectation operator and

V [\cdot]

for the variance–covariance operator.

2. Dependency Functions of Non-Independent Random Variables

This section provides generic DMs of NIVs and some transformations of such DMs. Formally, the inputs

X

have F as the CDF and C as the copula, that is,

F (x) = C (F_{1} (x_{1}), \dots, F_{d} (x_{d}))

with

F_{x_{j}}

or

F_{j}

the marginal CDF of

X_{j}

,

j = 1, \dots, d

and

x

a sample value of

X

. We use

F_{j}^{\leftarrow}

for the generalized inverse of

F_{j}

, and

F_{j | k}

for the distribution of

X_{j}

conditional on

X_{k}

for all

j, k \in {1, \dots, d}

and

j \neq k

. For a given discrete variable

X_{i}

, let us consider the distribution transform of

X_{i}

given by

τ_{F_{i}} (x_{i}, λ_{i}) = P (X_{i} < x_{i}) + λ_{i} P (X_{i} = x_{i})

with

λ_{i} \in [0, 1]

. Such a distribution transform ensures that for all

U_{i} \sim U (0, 1)

[,,,]

V_{i} : = τ_{F_{i}} (X_{i}, U_{i}) \sim U (0, 1), X_{i} \overset{d}{=} F_{i}^{\leftarrow} (V_{i}) a . s . .

(1)

For continuous distributions, the former term of Equation (1) comes down to the Rosenblatt transform [], and the latter term is equivalent to the inverse of the Rosenblatt transform. Denote with

(w_{1}, \dots, w_{d - 1})

an arbitrary permutation of

{1, \dots, d} ∖ {j}

,

X_{\sim j} : = (X_{w_{1}}, \dots, X_{w_{d - 1}})

and

Z \sim U {(0, 1)}^{d - 1}

. A generic DM of

X

is given by [,]

\begin{matrix} X_{\sim j} \overset{d}{=} r_{j} (X_{j}, Z) = (r_{w_{1}} (X_{j}, Z_{w_{1}}), r_{w_{2}} (X_{j}, Z_{w_{1}}, Z_{w_{2}}), \dots, r_{w_{d - 1}} (X_{j}, Z)), \end{matrix}

(2)

where

Z

is a random vector of independent variables, and it is independent of

X_{j}

as well. It is worth noting that the dependency function

r_{j} : = (r_{w_{1}}, \dots, r_{w_{d - 1}}) : R^{d} \to R^{d - 1}

is not unique. Indeed, one may replace, in Equation, (2)

Z_{w_{i}}

with

F_{w_{i}} (V_{w_{i}})

for any continuous variable

V_{w_{i}} \sim F_{w_{i}}

or with

τ_{G_{w_{i}}} (W_{w_{i}}, U_{w_{i}})

for any discrete variable

W_{w_{i}} \sim G_{w_{i}}

, provided that such variables are independent of

X_{j}

. But, DMs are uniquely defined once the marginal CDFs of

Z

are prescribed.

Proposition 1

([]). Let

r_{j}

be a dependency function;

0 < p \leq d - 1

;

v : = (j, w_{1}, \dots, w_{p})

be a vector and

Z_{\sim v} \sim U {(0, 1)}^{d - p - 1}

. Then,

X_{v} : = (X_{j}, X_{w_{1}}, \dots, X_{w_{p}}) \overset{d}{=} (X_{j}, r_{w_{1}} (X_{j}, Z_{w_{1}}), \dots, r_{w_{p}} (X_{j}, Z_{w_{1}}, \dots, Z_{w_{p}})) .

(3)

Moreover, there exists

r_{v} : R^{d} \to R^{d - p - 1}

such that

X_{\sim v} : = (X_{w_{p + 1}}, \dots, X_{w_{d - 1}}) \overset{d}{=} r_{u} (X_{v}, Z_{\sim u}) .

(4)

2.1. Distribution-Based and Copula-Based Expressions of Dependency Models

The distribution-based dependency model of

X

is given by [,]

\begin{matrix} \begin{matrix} X_{w_{1}} & = & r_{w_{1}} (X_{j}, Z_{w_{1}}) = F_{w_{1} | j}^{\leftarrow} (Z_{w_{1}} | X_{j}) \\ X_{w_{d - 1}} & = & F_{w_{d - 1} | j, w_{1}, \dots, w_{d - 2}}^{\leftarrow} (Z_{w_{d - 1}} | X_{j}, r_{w_{1}} (X_{j}, Z_{w_{1}}), \dots, r_{w_{d - 2}} (X_{j}, Z_{w_{1}}, \dots, Z_{w_{d - 2}})) \end{matrix} . \end{matrix}

(5)

Likewise, the copula-based dependency function is of interest to master all joint distributions having the same copula C, regardless of their marginal CDFs ([,]). It allows one to use the same DMs for the class of distributions sharing the same copula. For independent variables

U \sim U {(0, 1)}^{d}

,

Z

, and

X_{j}

, the copula-based DM is given by

\begin{matrix} \begin{matrix} X_{w_{1}} & = & r_{w_{1}} (X_{j}, Z_{w_{1}}, U_{j}) = F_{w_{1}}^{\leftarrow} (C_{w_{1} | j}^{\leftarrow} (Z_{w_{1}} | τ_{F_{j}} (X_{j}, U_{j}))) \\ X_{w_{d - 1}} & = & F_{w_{d - 1}}^{\leftarrow} (C_{w_{d - 1} | (\sim w_{d - 1})}^{\leftarrow} (Z_{w_{d - 1}} | τ_{F_{j}} (X_{j}, U_{j}), τ_{F_{w_{1}}} (r_{w_{1}} (X_{j}, Z_{w_{1}}, U_{j}), U_{w_{1}}), \dots)) \end{matrix}, \end{matrix}

(6)

by making use of the conditional sampling algorithm based on copulas [,]. Note that for Gaussian copulas and continuous marginal CDFs of inputs, DMs in (6) have a particular form provided in []. Lemma 1 extends such DMs to cope with discrete CDFs as well. To that end, denote with

C^{G a u s s} (U_{1}, \dots, U_{d}, R)

the Gauss copula having

R

as the correlation matrix;

L

the Cholesky factor of

R

(i.e.,

R = L L^{T}

);

Φ

the CDF of the standard Gaussian variable; and

I

the identity matrix.

Lemma 1.

Let

X_{j}

,

Z \sim N_{d - 1} (0, I)

and

U_{j} \sim U (0, 1)

be independent variables.

If

(X_{j}, X_{\sim j})

has the copula

C^{G a u s s} (U_{j}, U_{w_{1}}, \dots, U_{w_{d - 1}}, R)

, then

X_{\sim j} \overset{d}{=} r_{j} (X_{j}, Z, U_{j}) = [\begin{matrix} X_{w_{1}} & = & F_{w_{1}}^{\leftarrow} [Φ (Y_{w_{1}})] \\ ⋮ \\ X_{w_{d - 1}} & = & F_{w_{d - 1}}^{\leftarrow} [Φ (Y_{w_{d - 1}})] \end{matrix}],

(7)

with

[\begin{matrix} Y_{j} \\ Y_{\sim j} \end{matrix}] : = L [\begin{matrix} Φ^{- 1} (τ_{F_{j}} (X_{j}, U_{j})) \\ Z \end{matrix}] .

Proof.

See Appendix A. □

Likewise, to provide DMs for the Student copulas (Lemma 2), we use

t (ν, 0, 1)

for the standard t-distribution with

ν

degrees of freedom and

T_{ν}

for its CDF.

Lemma 2.

Let

X_{j}

,

{\{Z_{w_{i}} \sim t (ν + i, 0, 1)\}}_{i = 1}^{d - 1}

and

U_{j}

be independent variables.

If

(X_{j}, X_{\sim j})

has the Student copula

C^{S t} (U_{j}, U_{w_{1}}, \dots, U_{w_{d - 1}}, ν, R)

, then

X_{\sim j} \overset{d}{=} r_{j} (X_{j}, Z, U_{j}) = [\begin{matrix} X_{w_{1}} & = & F_{w_{1}}^{\leftarrow} [T_{ν} (Y_{w_{1}})] \\ ⋮ \\ X_{w_{d - 1}} & = & F_{w_{d - 1}}^{\leftarrow} [T_{ν} (Y_{w_{d - 1}})] \end{matrix}],

(8)

with

[\begin{matrix} Y_{j} \\ Y_{\sim j} \end{matrix}] : = L [\begin{matrix} T_{ν}^{- 1} (τ_{F_{j}} (X_{j}, U_{j})) \\ \sqrt{\frac{ν + {(T_{ν}^{- 1} (τ_{F_{j}} (X_{j}, U_{j})))}^{2}}{ν + 1}} Z_{w_{1}} \\ ⋮ \\ \sqrt{\frac{(ν + {(T_{ν}^{- 1} (τ_{F_{j}} (X_{j}, U_{j})))}^{2}) \prod_{k = 1}^{d - 2} (ν + k + Z_{w_{k}}^{2})}{\prod_{k = 1}^{d - 1} (ν + k)}} Z_{w_{d - 1}} \end{matrix}] .

Proof.

See Appendix B. □

Note that for every continuous variable

X_{j}

, we have to replace

τ_{F_{j}} (X_{j}, U_{j})

with

F_{j} (X_{j})

in Lemmas 1 and 2. For discrete variables, we have to include additional and independent uniformly distributed variables

U

in copula-based DMs (see Equation (6)).

2.2. Empirical and Computational Dependency Models

This section deals with the derivation of DMs for unknown distributions of inputs, such as distributions obtained by imposing constraints on the initial inputs or outputs. Formally, given a function

c : R^{d} \to R^{n}

and a domain of interest D, we are interested in deriving a dependency function of a random vector defined by

X^{c} \overset{d}{=} \{X \sim F : c (X) \in D\} .

(9)

While we are able to derive the analytical distribution of

X^{c}

and its dependency function for some distributions and constraints (see [,]), we have to estimate such dependency functions in general. Using Equation (9), we can generate a sample of

X^{c}

, that is,

X_{1}^{c}, \dots, X_{m}^{c}

and a pseudo-sample from the copula C of

X^{c}

, that is,

{\hat{F}}_{1} (X_{i, 1}^{c}), \dots, {\hat{F}}_{d} (X_{i, d}^{c})

with

i = 1, \dots, m

for continuous variables, where

{\hat{F}}_{j}

is an estimator of

F_{j}

. In general, we use

\hat{τ_{F_{1}}} (X_{i, 1}^{c}, U_{1}), \dots, \hat{τ_{F_{d}}} (X_{i, d}^{c}, U_{d})

.

With such samples, we consider two main ways to derive the empirical dependency functions. Firstly, we fit a distribution to such observations and then derive dependency functions using results from Section 2. To that end, there are numerous papers about fitting a distribution to data. For instance, direct methods for estimating densities and distributions can be found in [,,,], and the copula-based methods for modeling distributions are provided in [,,,,,].

Secondly, we derive empirical dependency functions using the estimators of the conditional quantile functions [,,,,,,,]. Formally, consider the loss function of Koenker and Bassett [] given by

L (x, u) = x (u - 1 I_{{x < 0}})

with

u \in [0, 1]

and

1 I_{{x < 0}}

the indicator function. A dependency function can be written as follows:

r_{j} (X_{j}^{c}, Z_{w_{1}}) : = arg min_{f \in F} E [L (X_{w_{1}}^{c} - f (X_{j}^{c}), Z_{w_{1}}) | X_{j}^{c}, Z_{w_{1}}],

where

F

is a class of smooth functions, and

Z_{w_{1}} \sim U (0, 1)

is independent of

(X_{j}^{c}, X_{w_{1}}^{c})

. Using the sample of

(X_{j}^{c}, X_{w_{1}}^{c})

, the M-estimator of a dependency function is given by ([], Lemma 3)

\hat{r_{j}} (X_{j}^{c}, Z_{w_{1}}) : = arg min_{f \in H} \sum_{i = 1}^{m} L (X_{i, w_{1}}^{c} - f (X_{i, j}^{c}), Z_{w_{1}}) + \frac{λ}{2} {||f (X_{j}^{c}) - b||}_{H}^{2},

(10)

where

λ \in R_{+}

is a bandwidth,

{||\cdot||}_{H}

is a given norm, and

b \in R

.

3. Equivalent Representations of Functional Outputs

This section formalizes different representations of complex models with NIVs

X

. Formally, given

Θ \subseteq R

,

n \in N^{*}

, consider a function

f : R^{d} \times Θ \to R^{n}

with random evaluations, that is,

f (X, θ) \in R^{n}

with

θ \in Θ

. It may represent any multivariate and functional outputs. When

Θ = {θ_{0}}

with

θ_{0} \in R

, we obtain a class of vector-valued functions. In what follows,

X

is organized as follows:

(O):

X : = (X_{1}, \dots, X_{d})

consists of K independent random vector(s), that is,

X = (X_{π_{1}}, \dots, X_{π_{K}})

where

π_{1}, \dots, π_{K}

are the sets that form a partition of

{1, \dots, d}

;

X_{π_{k_{1}}}

: =

(X_{j}, \forall j \in π_{k_{1}})

is independent of

X_{π_{k_{2}}}

: =

(X_{j}, \forall j \in π_{k_{2}})

for all

k_{1}, k_{2} \in {1, \dots, K}

and

k_{1} \neq k_{2}

. Without loss of generality, we use

X_{π_{1}}

for a random vector of

d_{1} \geq 0

independent variable(s);

X_{π_{k}}

with

k \geq 2

for a random vector of

d_{k} \geq 2

NIVs. Note that

| π_{k} | = d_{k}

.

Denote with

w_{k}

: =

(w_{1, k}, \dots, w_{d_{k}, k})

an arbitrary permutation of

π_{k}

and

w_{\sim 1, k}

: =

(w_{2, k}, \dots, w_{d_{k}, k})

. Thus,

w_{i, k} \in π_{k}, \forall i \in {1, \dots, d_{k}}

. If we use

s

: =

{w_{1, 2}, \dots, w_{1, K}}

, then

X_{s}

: =

(X_{w_{1}, 2}, \dots, X_{w_{1}, K})

contains

K - 1

independent variables and

X_{\sim s}

: =

(X_{w_{\sim 1, 2}}, \dots, X_{w_{\sim 1, K}})

with

X_{w_{\sim 1, k}}

: =

(X_{w_{2}, k}, \dots, X_{w_{d_{k}}, k})

,

k = 2, \dots, K

.

Using the DMs of Section 2, we can write

X_{\sim s} = (X_{w_{\sim 1, 2}} \overset{d}{=} r_{w_{1, 2}} (X_{w_{1, 2}}, Z_{w_{\sim 1, 2}}), \dots, X_{w_{\sim 1, K}} \overset{d}{=} r_{w_{1, K}} (X_{w_{1, K}}, Z_{w_{\sim 1, K}}));

(11)

where

Z_{w_{\sim 1, k}}

: =

(Z_{w_{2, k}}, \dots, Z_{w_{d_{k}}, k})

is a vector of independent variables,

X_{w_{1, k}}

is independent of

Z_{w_{\sim 1, k}}, k = 2, \dots, K

. By using

Z_{w_{\sim 1}}

: =

(Z_{w_{\sim 1, 2}}, \dots, Z_{w_{\sim 1, K}})

, we can see that

(X_{π_{1}}, X_{s}, Z_{w_{\sim 1}})

contains only independent variables. Compiling the

K - 1

DMs given by (11) in one function, that is,

r_{s} : R^{d - d_{1}} \to R^{d - d_{1} - K + 1}

yields

X_{\sim s} \overset{d}{=} r_{s} (X_{s}, Z_{w_{\sim 1}}) = (r_{w_{1, 2}} (X_{w_{1, 2}}, Z_{w_{\sim 1, 2}}), \dots, r_{w_{1, K}} (X_{w_{1, K}}, Z_{w_{\sim 1, K}})),

and we have the following partition:

(X_{2}, \dots, X_{K}) = (X_{s}, X_{\sim s}), X \overset{d}{=} (X_{π_{1}}, X_{s}, r_{s} (X_{s}, Z_{w_{\sim 1}})) .

(12)

Composing

f (\cdot, θ)

by (12) yields

g (X_{π_{1}}, X_{s}, Z_{w_{\sim 1}}, θ) : = f (X_{π_{1}}, X_{s}, r_{s} (X_{s}, Z_{w_{\sim 1}}), θ),

(13)

and Lemma 3 provides useful properties of g linked to f. For a given integer

0 \leq p_{k} \leq d_{k}

, consider the vectors

v_{k}

: =

(w_{2, k}, \dots, w_{p_{k}, k})

and

u_{k}

: =

(w_{1, k}, v_{k})

with

k = 2, \dots, K

. By definition, when

p_{k} = 0

, we have

v_{k} = u_{k} = \emptyset

, and when

p_{k} = 1

,

v_{k} = \emptyset

and

u_{k} = w_{1, k}

.

Lemma 3.

Let

u_{0} \subseteq π_{1}

,

{k_{1}, \dots, k_{m}} \subseteq {2, \dots, K}

,

{w_{1, k_{1}}, \dots, w_{1, k_{m}}} \subseteq s

. Then,

f (X, θ) | X_{u_{0}}, X_{u_{k_{1}}}, \dots, X_{u_{k_{m}}} \overset{d}{=} g (X_{π_{1}}, X_{s}, Z_{w_{\sim 1}}, θ) | X_{u_{0}}, X_{w_{1, k_{1}}}, Z_{v_{k_{1}}}, \dots, X_{w_{1, k_{m}}}, Z_{v_{k_{m}}} .

Proof.

See Appendix C. □

It comes out from Lemma 3 that the distribution of

f (X, θ)

given the inputs

X_{u}

: =

(X_{u_{0}}, X_{u_{k_{1}}}, \dots, X_{u_{k_{m}}})

is equal (in distribution) to the distribution of

g (X_{π_{1}}, X_{s}, Z_{w_{\sim 1}}, θ)

conditional on

(X_{u_{0}}, X_{w_{1, k_{1}}}, Z_{v_{k_{1}}}, \dots, X_{w_{1, k_{m}}}, Z_{v_{k_{m}}})

. Thus, we are able to assess the effect of

X_{u}

on

f (X, θ)

using

g (X_{π_{1}}, X_{s}, Z_{w_{\sim 1}}, θ)

and

(X_{u_{0}}, X_{w_{1, k_{1}}}, Z_{v_{k_{1}}}, \dots, X_{w_{1, k_{m}}}, Z_{v_{k_{m}}})

, leading to the following definition.

Definition 1.

Consider

u \subseteq {1, \dots, d}

, and f and g are given by Equation (13).

A function g is said to be an equivalent representation of f regarding the input(s)

X_{u}

if the distribution of

f (X, θ) | X_{u}

can be determined using g and some of its inputs.

Different equivalent representations (ERs) of f (i.e., g) are necessary for assessing the effects of

X_{u}

for all

u \subseteq {1, \dots, d}

. For instance, a representation in Lemma 3 for a given set

s

can be used to assess the effects of some subsets of inputs given by

\begin{matrix} \{(X_{u_{0}}, X_{u_{k_{1}}}, \dots, X_{u_{k_{m}}}) : \begin{matrix} \forall u_{0} \subseteq π_{1}, \forall {k_{1}, \dots, k_{m}} \subseteq {2, \dots, K}, \\ \forall p_{k_{i}} \in {0, \dots, d_{k_{i}}}, i = 1, \dots, m \end{matrix}\} . \end{matrix}

Modifying

s

and

v_{k}, k = 2, \dots, K

leads to another representation of f, which allows for assessing other inputs’ effects, such as the effects of

X_{ι}

s with

ι \notin s

. Permutations of

π_{k}

s give such modifications. Obviously, we have

\prod_{\begin{matrix} i = 2 \end{matrix}}^{K} d_{i}!

ERs of f that allow for deriving the effects of all the subsets of inputs with the possibility to have some replications.

Definition 2.

Let

u \subseteq {1, \dots, d}

and

g_{1} \neq g_{2}

be two ERs of f.

The representations

g_{1}, g_{2}

are said to be replicated representations of f regarding

X_{u}

if

g_{1}, g_{2}

allow for determining the distribution of

f (X, θ) | X_{u}

.

To avoid unnecessary replicated representations and to be able to recover all the subsets of

{1, \dots, d}

using permutations, we use Algorithm 1 for selecting the necessary and sufficient permutations (see Lemma 4). Formally, consider integers

j_{0, k} : = \{\begin{matrix} \frac{d_{k}}{2} & if d_{k} is even \\ \frac{d_{k} + 1}{2} & otherwise \end{matrix}, k = 2, \dots, K,

(14)

and the super-sets

A_{j_{0, k}}

given by

A_{j_{0, k}} = {u \subseteq π_{k} : | u | = j_{0, k}}, k = 2, \dots, K .

The set

A_{j_{0, k}}

consists of all the subsets of

π_{k}

that contain exactly

j_{0, k}

elements, and its cardinality is

|A_{j_{0, k}}| = (\binom{d_{k}}{j_{0, k}})

. Basically, the algorithm takes

A_{j_{0, k}}

as the input and provides the super-sets

B_{k}, P_{k}, E_{k}

, which are initially empty. The first step of Algorithm 1, corresponding to

e_{0} = 1

, focuses on selecting

d_{k} = (\binom{d_{k}}{e_{0} = 1})

permutations, and on bringing different sets of the form

{w_{1, k}}, \dots, {w_{1, k}, \dots, w_{j_{0, k}, k}}

in one hand and

{w_{d_{k} - J_{0, k} + 1, k}, \dots, w_{d_{k}, k}}, \dots, {w_{d_{k}, k}}

in the other hand. We repeat that process by increasing

e_{0}

and bringing the sets of the forms:

{w_{1, k}, \dots, w_{e_{0}, k}}, \dots, {w_{1, k}, \dots, w_{j_{0, k}, k}}; {w_{d_{k} - J_{0, k} + 1, k}, \dots, w_{d_{k}, k}}, \dots, {w_{d_{k} - e_{0} + 1, k}, \dots, w_{d_{k}, k}};

until we are able to derive all the subsets of

π_{k}

, that is, until

|A_{j_{0, k}}| = 0

. The formal algorithm is given as follows:

Algorithm 1: Construction of the sets

B_{k}

and

P_{k}

for all

k \in {2, \dots, K}

.

Algorithm 1 provides the permutations selected (i.e.,

P_{k}

) that are used for deriving different DMs. For instance, if

d_{2} = 3, π_{2} = {1, 2, 3}

, the permutations

(1, 2, 3)

and

(3, 2, 1)

lead to the following DMs:

(X_{2}, X_{3}) \overset{d}{=} r_{1} (X_{1}, Z_{2}, Z_{3}); (X_{2}, X_{1}) \overset{d}{=} r_{3} (X_{3}, Z_{2}, Z_{1}),

and the associated ERs, that is,

f (X) \overset{d}{=} f (X_{1}, r_{1} (X_{1}, Z_{2}, Z_{3})); f (X) \overset{d}{=} f (X_{3}, r_{3} (X_{3}, Z_{2}, Z_{1})) .

The set

P_{k}

from Algorithm 1 contains

(\binom{d_{k}}{j_{0, k}})

permutations. The set

B_{k}

is built using

P_{k}

, and it consists of sets containing the first ı elements of

w_{k}

for any

ι \in {1, \dots, d_{k}}

and

w_{k} \in P_{k}

. Lemma 4 shows the properties of such sets.

Lemma 4.

Consider

j_{0, k}

given by (14) and

B_{k}, P_{k}

given by Algorithm 1. Then,

B_{k} = \{u \subseteq π_{k} : \forall | u | > 0\};

(15)

B_{k} = \{{w_{1, k}, \dots, w_{ι, k}}, ι = 1, \dots, d_{k} : \forall w_{k} \in P_{k}\} .

(16)

Proof.

See Appendix D. □

Based on Lemma 4, we quantify the necessary and sufficient number of ERs of f given

X_{u}

for all

u \subseteq {1, \dots, d}

in Theorem 1. For

w_{k} \in P_{k}

, recall that

w_{\sim 1, k}

: =

(w_{2, k}, \dots, w_{d_{k}, k})

and

|P_{k}| = (\binom{d_{k}}{j_{0, k}})

with

k = 2, \dots, K

.

Theorem 1.

Consider integers

p_{2} \leq d_{2}, \dots, p_{K} \leq d_{K}

. Then, the minimum number of ERs of f given

X_{u}

for all

u \subseteq {1, \dots, d}

is

R_{min} : = \prod_{k = 2}^{K} (\binom{d_{k}}{j_{0, k}}) .

(17)

Such necessary and sufficient ERs are given by

\begin{matrix} f (X, θ) & \overset{d}{=} & g_{l} (X_{π_{1}}, X_{w_{1, 2}}, Z_{w_{\sim 1, 2}}, \dots, X_{w_{1, K}}, Z_{w_{\sim 1, K}}, θ), \end{matrix}

(18)

where

l : = (w_{2}, \dots, w_{K})

for all

w_{k} \in P_{k}

and

k = 2, \dots, K

.

Proof.

See Appendix E. □

It is worth noting that each ER of Equation (18) shares the same distribution of

f (X, θ)

, and two different ERs should be considered independent to avoid misleading dependencies. When a function includes only independent variables, we see that

R_{min} = 1

. Reducing

R_{min}

will depend on the analysis of interest one is going to perform. For instance, one ER is sufficient to determine the distribution of

f (X, θ)

conditional on

X_{u}

for all

u \in \{{u_{0}, w_{1, k}, \dots, w_{ι, k}} : \forall u_{0} \subseteq π_{1}, \forall ι \in {0, \dots, d_{k}}, k = 2, \dots, K\} .

Likewise,

R_{1} = max (d_{2}, \dots, d_{K})

ERs are used for assessing the main and total effects of

X_{j}

s for any

j \in {1, \dots, d}

in []. It is also worth noting that the above ERs can lead to assessing the effects of other inputs or groups of inputs.

Discussions About High-Dimensional Cases

Implementation of Algorithm 1 can be made fast for moderate values of each

d_{k}

, that is

d_{k} \leq 10

with

k = 2, \dots, K

. Since Algorithm 1 must be run separately for each

π_{k}

with

k = 2, \dots, K

and

d = \sum_{k = 1}^{K} d_{k}

, implementation of Algorithm 1 remains fast in high-dimensional settings, provided that each

d_{k} \leq 10

with

k = 2, \dots, K

. When there is

k_{0} \in {2, \dots, K}

such that

d_{k_{0}} > 10

, Algorithm 1 can still be used with a substantial requirement of time to obtain all the necessary and sufficient permutations of

π_{k_{0}}

. But, as the selected permutations are not going to change for a given

d_{k}

with

k \geq 2

, a table of selected permutations for different values of

d_{k}

will avoid tuning this algorithm all the time.

Regarding the

R_{min}

ERs, we can check that

2^{K} = 2^{\frac{d - d_{1}}{2}} \leq R_{min} \leq 2^{d - d_{1}}

because

(\binom{d_{k}}{j_{0, k}}) = (\binom{d_{k}}{[\frac{d_{k}}{2}]}) \leq 2^{d_{k}}

with

[\frac{d_{k}}{2}]

the largest integer that is less than

\frac{d_{k}}{2}

. Thus, computing all the effects of inputs will require at least

2^{\frac{d - d_{1}}{2}}

ERs and at most

2^{d - d_{1}}

ERs, which grow exponentially with respect to

d (1 - α)

with

α : = \frac{d_{1}}{d} \in [0, 1]

. If the effects of

X_{π_{k_{0}}}

are not significant, then

α = \frac{d_{1} + d_{k_{0}}}{d}

. Based on the values of

α

, different conclusions can be drawn.

4. Dependent Multivariate Sensitivity Analysis

This section extends dependent generalized sensitivity indices (dGSIs) for models with NIVVs introduced in [] by defining the dGSIs for every subset of inputs and for functional outputs. Since the ER given by Equation (18) includes only independent variables, we define dGSIs by relying on the multivariate sensitivity analysis ([,,,,]). To ensure that the proposed dGSIs in this section are well defined, assume that

(A1):

0 < \int_{Θ} E [{||f (X, θ)||}_{L^{2}}^{2}] d θ < + \infty

.

Definitions of GSIs and dGSIs are based on sensitivity functionals (SFs), which contain information about the single and overall contributions of input variables over the whole functional outputs [,,,]. To define SFs, recall that

0 \leq p_{k} \leq d_{k}

,

v_{k} = (w_{2, k}, \dots, w_{p_{k}, k})

and

u_{k} = (w_{1, k}, v_{k})

with

k = 2, \dots, K

, and define

{\bar{u}}_{k} : = (w_{p_{k} + 1, k}, \dots, w_{d_{k}, k})

. According to Lemmas 3 and 4, for all

u \subseteq {1, \dots, d}

, there exists

u_{0} \subseteq π_{1}

, a vector

w_{k} \in P_{k}

and

p_{k}

with

k = 2, \dots, K

such that

X_{u} = (X_{u_{0}}, X_{u_{2}}, \dots, X_{u_{K}})

and

f (X, θ) | X_{u} \overset{d}{=} g_{l} (X_{π_{1}}, X_{w_{1, 2}}, Z_{w_{\sim 1, 2}}, \dots, X_{w_{1, K}}, Z_{w_{\sim 1, K}}, θ) | X_{u_{0}}, R_{u_{2}}, \dots, R_{u_{K}},

where

R_{u_{k}} : = (X_{w_{1}, k}, Z_{v_{k}})

,

k = 2, \dots, K

. Thus, the effects of inputs

X_{u}

are equal to the effects of

X_{u_{0}}, R_{u_{2}}, \dots, R_{u_{K}}

using

g_{l}

. For concise notations, we use

X_{{π_{1}, s}} : = (X_{j}, \forall j \in {π_{1}, s}); Z_{w_{\sim 1}} = (Z_{w_{\sim 1, 2}}, \dots, Z_{w_{\sim 1, K}}),

R_{u} : = (X_{u_{0}}, R_{u_{2}}, \dots, R_{u_{K}}); R_{\sim u} : = (X_{π_{1} ∖ u_{0}}, Z_{{\bar{u}}_{2}}, \dots, Z_{{\bar{u}}_{K}}) .

Note that

R : = (R_{u}, R_{\sim u})

is a partition of

(X_{{π_{1}, s}}, Z_{w_{\sim 1}})

.

The first-order SF of

X_{u}

with

u = \{u_{0}, u_{2}, \dots, u_{K}\}

is given by

f_{u}^{f o} (R_{u}, θ) : = E [g_{l} (X_{{π_{1}, s}}, Z_{w_{\sim 1}}, θ) | R_{u}] - E [g_{l} (X_{{π_{1}, s}}, Z_{w_{\sim 1}}, θ)],

(19)

and the total SF, which contains the overall information about

X_{u}

, is given by

f_{u}^{t o t} (X_{{π_{1}, s}}, Z_{w_{\sim 1}}, θ) : = g_{l} (X_{{π_{1}, s}}, Z_{w_{\sim 1}}, θ) - E_{R_{u}} [g_{l} (X_{{π_{1}, s}}, Z_{w_{\sim 1}}, θ)],

(20)

where

E_{R_{u}}

means that the expectation is taken with respect to

R_{u}

. The SFs given by (19) and (20) are random processes, and their components may be correlated and/or dependent. Using the variance–covariance as an importance measure, the definitions of dGSIs rely on the cross-covariances of SFs. For

θ_{1}, θ_{2} \in Θ

, the first-order cross-covariance or the cross-covariance of

f_{u}^{f o}

is given by

Σ_{u} (θ_{1}, θ_{2}) : = E [f_{u}^{f o} (R_{u}, θ_{1}) f_{u}^{f o} {(R_{u}, θ_{2})}^{T}] .

Furthermore, the cross-covariances of

f_{u}^{t o t}

and f are given as follows:

Σ_{u}^{t o t} (θ_{1}, θ_{2}) : = E [f_{u}^{t o t} (X_{{π_{1}, s}}, Z_{w_{\sim 1}}, θ_{1}) f_{u}^{t o t} {(X_{{π_{1}, s}}, Z_{w_{\sim 1}}, θ_{2})}^{T}],

\begin{matrix} Σ (θ_{1}, θ_{2}) & : = & E [g_{l} (X_{{π_{1}, s}}, Z_{w_{\sim 1}}, θ_{1}) g_{l} {(X_{{π_{1}, s}}, Z_{w_{\sim 1}}, θ_{2})}^{T}] \\ - E [g_{l} (X_{{π_{1}, s}}, Z_{w_{\sim 1}}, θ_{1})] E [g_{l} {(X_{{π_{1}, s}}, Z_{w_{\sim 1}}, θ_{2})}^{T}] . \end{matrix}

To account for the different properties of the aforementioned SFs, we distinguish three different types of dGSIs.

Definition 3.

Consider the cross-covariances of SFs, and assume that (A1) holds.

(i): For the first-type dGSIs, the first-order and total indices are given by

$d G S I_{u, 1} : = \frac{\int_{Θ} T r (Σ_{u} (θ, θ)) d θ}{\int_{Θ} T r (Σ (θ, θ)) d θ}, d G S I_{T_{u}, 1} : = \frac{\int_{Θ} T r (Σ_{u}^{t o t} (θ, θ)) d θ}{\int_{Θ} T r (Σ (θ, θ)) d θ} .$

(21)
(ii): The second-type dGSIs are defined as follows:

$d G S I_{u, 2} : = {(\frac{\int_{Θ} {||Σ_{u} (θ, θ)||}_{F}^{2} d θ}{\int_{Θ} {||Σ (θ, θ)||}_{F}^{2} d θ})}^{1 / 2}, d G S I_{T_{u}, 2} : = {(\frac{\int_{Θ} {||Σ_{u}^{t o t} (θ, θ)||}_{F}^{2} d θ}{\int_{Θ} {||Σ (θ, θ)||}_{F}^{2} d θ})}^{1 / 2} .$

(22)
(iii): The third-type dGSIs are given as follows:

$d G S I_{u, 3} : = {(\frac{\int_{Θ^{2}} {||Σ_{u} (θ_{1}, θ_{2})||}_{F}^{2} d θ_{1} d θ_{2}}{\int_{Θ^{2}} {||Σ (θ_{1}, θ_{2})||}_{F}^{2} d θ_{1} d θ_{2}})}^{1 / 2}, d G S I_{T_{u}, 3} : = {(\frac{\int_{Θ^{2}} {||Σ_{u}^{t o t} (θ_{1}, θ_{2})||}_{F}^{2} d θ_{1} d θ_{2}}{\int_{Θ^{2}} {||Σ (θ_{1}, θ_{2})||}_{F}^{2} d θ_{1} d θ_{2}})}^{1 / 2} .$

(23)

The first-type and the second-type dGSIs treat independently the outputs

f (X, θ_{1})

and

f (X, θ_{2})

when

θ_{1} \neq θ_{2}

, but the second-type dGSIs account for the correlations among the components of SFs from the same output

f (X, θ)

. Furthermore, the third-type dGSIs account for the correlations among the cross-components of SFs.

4.1. Properties of Dependent Generalized Sensitivity Indices

The two types of dGSIs share the same properties as those proposed in [] for

0 \leq p_{k} \leq 1

, only. Proposition 2 extends such properties for all

p_{k} \in {0, \dots, d_{k}}

.

Proposition 2.

Under (A1), consider the dGSIs of Definition 3. Then,

0 \leq d G S I_{u, 1} \leq d G S I_{T_{u}, 1} \leq 1; 0 \leq d G S I_{u, 2} \leq d G S I_{T_{u}, 2} \leq 1 .

Moreover, if the cross-covariances are positive semi-definite, then we have

0 \leq d G S I_{u, 3} \leq d G S I_{T_{u}, 3} \leq 1 .

Proof.

See Appendix F. □

When the total dGSI of

X_{u} = (X_{u_{0}}, X_{u_{1}}, \dots, X_{u_{K}})

is zero or almost zero (i.e.,

d G S I_{T_{u}, •} \approx 0

), we have to fix

X_{u}

using the DM associated with each

X_{π_{k}}

,

k = 2, \dots, K

. Indeed, for the DM

X_{u_{k}} = r_{\sim u_{k}} (X_{\sim u_{k}}, Z_{u_{k}}))

, fixing

X_{u_{K}}

comes down to fix

Z_{u_{k}}

to its nominal values. Since we can compute the total

d G S I

of each block of NIVs, that is,

X_{π_{k}}

using any ER of f, it becomes possible to quickly identify the non-influential block of NIVs, and then put our computational efforts on the most important groups of NIVs.

Remark 1.

Given

j_{1}, j_{2} \in {1, \dots, d}

, the same rankings of the inputs

X_{j_{1}}

and

X_{j_{2}}

using either

d G S I_{T_{j}, 1}

or

d G S I_{T_{j}, 2}

with

j \in {j_{1}, j_{2}}

are obtained under the assumptions

Σ_{j_{1}}^{t o t} (θ, θ) ⪯ Σ_{j_{2}}^{t o t} (θ, θ)

or

Σ_{j_{2}}^{t o t} (θ, θ) ⪯ Σ_{j_{1}}^{t o t} (θ, θ)

. Note that

A_{1} ⪯ A_{2}

means that

A_{2} - A_{1}

is a positive semi-definite matrix, a.k.a. the Loewner partial ordering between matrices (see also Section 6.3 in []).

Remark 2.

Case of the multivariate dynamic function

Consider a model that is evaluated at

X

and provides n dynamic(s), such as a spatiotemporal model. Such a model is a particular case of multivariate and functional outputs. Indeed, the multivariate dynamic model given by

f : R^{d} \times [0, T] \to R^{n}

and

f (X, t) \in R^{n}

with

t \in [0, T]

is mathematically identical to

f (X, θ)

using

Θ = [0, T]

and

θ = t

.

4.2. Case of the Multivariate Response Models

When

Θ = {θ_{0}}

, the multivariate and functional outputs come down to

f (X, θ_{0}) = : h (X)

with h:

R^{d} \to R^{n}

. Thus, the dGSIs of Definition 3 can be adapted for quantifying the effect of inputs. It is worth noting that the third-type dGSIs are equal to the second-type dGSIs, and both types of dGSIs boil down to the second-type dGSIs provided in Definition 4. Moreover, the cross-covariances become the covariances, and we use

Σ_{u}

: =

Σ_{u} (θ_{0}, θ_{0})

,

Σ_{u}^{t o t}

: =

Σ_{u}^{t o t} (θ_{0}, θ_{0})

and

Σ

: =

Σ (θ_{0}, θ_{0})

.

Definition 4.

Consider the above covariances of SFs and assume (A1) holds.

(i): The first-type dGSIs for a given multivariate response function are

$d G S I_{u, 1}^{M} : = \frac{T r (Σ_{u})}{T r (Σ)}, d G S I_{T_{u}, 1}^{M} : = \frac{T r (Σ_{u}^{t o t})}{T r (Σ)} .$

(24)
(ii): The second-type dGSIs are defined as follows:

$d G S I_{u, 2}^{M} : = \frac{{||Σ_{u}||}_{F}}{{||Σ||}_{F}}, d G S I_{T_{u}, 2}^{M} : = \frac{{||Σ_{u}^{t o t}||}_{F}}{{||Σ||}_{F}} .$

(25)

Also, in the case where

n = 1

, the two types of dGSIs of Definition 4 are equal and boil down to dependent sensitivity indices (dSIs) for single-response models (see Definition 5).

Definition 5.

For a function

f : R^{d} \to R

(

n = 1

), the dSIs of

X_{u}

are given by

d S_{u} : = \frac{Σ_{u}}{Σ}, d S_{T_{u}} : = \frac{Σ_{u}^{t o t}}{Σ} .

(26)

5. Estimators of Dependent Generalized Sensitivity Indices

In this section, we provide unbiased estimators of the cross-covariances and covariances of SFs, consistent estimators of dGSIs, and their asymptotic distributions. For the sake of simplicity, we provide estimators of covariances and dGSIs using the functions

g_{l}

that include only independent variables, that is,

X_{{π_{1}, s}}, Z_{w_{\sim 1}}

Note that the Supplementary Materials provides such estimators using f directly thanks to the relation

\begin{matrix} g_{l} (X_{{π_{1}, s}}, Z_{w_{\sim 1}}, θ) & = & f (X_{{π_{1}, s}}, r_{s} (X_{s}, Z_{w_{\sim 1}}), θ) \\ = & f (X_{{π_{1}, s}}, r_{w_{1, 2}} (X_{w_{1, 2}}, Z_{w_{\sim 1, 2}}), \dots, r_{w_{1, K}} (X_{w_{1, K}}, Z_{w_{\sim 1, K}}), θ) . \end{matrix}

To derive the estimators of cross-covariances that are useful for computing dGSIs, we are given two i.i.d. samples from

(X_{{π_{1}, s}}, Z_{w_{\sim 1}})

, that is,

{\{(X_{i, {π_{1}, s}}^{(1)}, Z_{i, w_{\sim 1}}^{(1)})\}}_{i = 1}^{m}

and

{\{(X_{i, {π_{1}, s}}^{(2)}, Z_{i, w_{\sim 1}}^{(2)})\}}_{i = 1}^{m}

, and we use

X_{i}^{(k)} : = (X_{i, {π_{1}, s}}^{(k)}, r_{s} (X_{i, s}^{(k)}, Z_{i, w_{\sim 1}}^{(k)}))

,

k = 1, 2

,

i = 1, \dots, m

. Since

R : = (R_{u}, R_{\sim u})

is a partition of

(X_{{π_{1}, s}}, Z_{w_{\sim 1}})

, we can deduce the following two i.i.d. samples:

{\{R_{i}^{(1)} : = (R_{i, u}^{(1)}, R_{i, \sim u}^{(1)})\}}_{i = 1}^{m}

and

{\{R_{i}^{(2)} : = (R_{i, u}^{(2)}, R_{i, \sim u}^{(2)})\}}_{i = 1}^{m}

.

Theorem 2.

Assume (A1) holds. Then, unbiased and consistent estimators of

Σ_{u} (θ_{1}, θ_{2})

,

Σ_{u}^{t o t} (θ_{1}, θ_{2})

and

Σ (θ_{1}, θ_{2})

are, respectively, given by

\begin{matrix} \hat{Σ_{u} (θ_{1}, θ_{2})} : = \frac{1}{2 m} \sum_{i = 1}^{m} \\ [g_{l} (R_{i}^{(1)}, θ_{1}) - g_{l} (R_{i, u}^{(2)}, R_{i, \sim u}^{(1)}, θ_{1})] {[g_{l} (R_{i, u}^{(1)}, R_{i, \sim u}^{(2)}, θ_{2}) - g_{l} (R_{i}^{(2)}, θ_{2})]}^{T}; \end{matrix}

\begin{matrix} \hat{Σ_{u}^{t o t} (θ_{1}, θ_{2})} : = \frac{1}{2 m} \sum_{i = 1}^{m} \\ [g_{l} (R_{i}^{(1)}, θ_{1}) - g_{l} (R_{i, u}^{(2)}, R_{i, \sim u}^{(1)}, θ_{1})] {[g_{l} (R_{i, u}^{(1)}, R_{i, \sim u}^{(1)}, θ_{2}) - g_{l} (R_{i, u}^{(2)}, R_{i, \sim u}^{(1)}, θ_{2})]}^{T}; \end{matrix}

\begin{matrix} \hat{Σ (θ_{1}, θ_{2})} & : = & \frac{1}{2 m} \sum_{i = 1}^{m} [g_{l} (R_{i}^{(1)}, θ_{1}) - g_{l} (R_{i}^{(2)}, θ_{1})] {[g_{l} (R_{i}^{(1)}, θ_{2}) - g_{l} (R_{i}^{(2)}, θ_{2})]}^{T} \\ = \frac{1}{2 m} \sum_{i = 1}^{m} [f (X_{i}^{(1)}, θ_{1}) - f (X_{i}^{(2)}, θ_{1})] {[f (X_{i}^{(1)}, θ_{2}) - f (X_{i}^{(2)}, θ_{2})]}^{T} . \end{matrix}

Proof.

See Appendix G. □

Note that when

θ_{1} = θ_{2} = θ

, the estimators provided in Theorem 2 are minimum-variance unbiased estimators (MVUEs). Using Theorem 2, we deduce the estimators of the variance–covariances of SFs for the multivariate response models and single-response models. To provide such results for the vector-valued function

f : R^{d} \to R^{n}

in Corollary 1, let us consider the following symmetric kernels:

\begin{matrix} K_{u}^{f o} (R_{i}^{(1)}, R_{i}^{(2)}) & : = & [g_{l} (R_{i}^{(1)}) - g_{l} (R_{i, u}^{(2)}, R_{i, \sim u}^{(1)})] {[g_{l} (R_{i, u}^{(1)}, R_{i, \sim u}^{(2)}) - g_{l} (R_{i}^{(2)})]}^{T} \\ + [g_{l} (R_{i, u}^{(1)}, R_{i, \sim u}^{(2)}) - g_{l} (R_{i}^{(2)})] {[g_{l} (R_{i}^{(1)}) - g_{l} (R_{i, u}^{(2)}, R_{i, \sim u}^{(1)})]}^{T}; \end{matrix}

\begin{matrix} K_{u}^{t o t} (R_{i}^{(1)}, R_{i}^{(2)}) & : = & [g_{l} (R_{i}^{(1)}) - g_{l} (R_{i, u}^{(2)}, R_{i, \sim u}^{(1)})] {[g_{l} (R_{i}^{(1)}) - g_{l} (R_{i, u}^{(2)}, R_{i, \sim u}^{(1)})]}^{T} \\ + [g_{l} (R_{i, u}^{(1)}, R_{i, \sim u}^{(2)}) - g_{l} (R_{i}^{(2)})] {[g_{l} (R_{i, u}^{(1)}, R_{i, \sim u}^{(2)}) - g_{l} (R_{i}^{(2)})]}^{T}; \end{matrix}

\begin{matrix} K (R_{i}^{(1)}, R_{i}^{(2)}) & : = & [g_{l} (R_{i, u}^{(1)}, R_{i, \sim u}^{(2)}) - g_{l} (R_{i, u}^{(2)}, R_{i, \sim u}^{(1)})] {[g_{l} (R_{i, u}^{(1)}, R_{i, \sim u}^{(2)}) - g_{l} (R_{i, u}^{(2)}, R_{i, \sim u}^{(1)})]}^{T} \\ + [g_{l} (R_{i}^{(1)}) - g_{l} (R_{i}^{(2)})] {[g_{l} (R_{i}^{(1)}) - g_{l} (R_{i}^{(2)})]}^{T} . \end{matrix}

Corollary 1.

Assume that f has finite fourth moments (i.e., (A2)) and (A1) hold. Then, the MVUEs and consistent estimators of the covariance matrices

Σ_{u}

,

Σ_{u}^{t o t}

, and Σ are, respectively, given by

\hat{Σ_{u}} : = \frac{1}{4 m} \sum_{i = 1}^{m} K_{u}^{f o} (R_{i}^{(1)}, R_{i}^{(2)}), \hat{Σ_{u}^{t o t}} : = \frac{1}{4 m} \sum_{i = 1}^{m} K_{u}^{t o t} (R_{i}^{(1)}, R_{i}^{(2)});

\hat{Σ} : = \frac{1}{4 m} \sum_{i = 1}^{m} K (R_{i}^{(1)}, R_{i}^{(2)}) .

Proof.

See Appendix H. □

For single-response models (i.e.,

n = 1

), the MVUEs of the covariances of SFs in Corollary 1 have simple expressions, given below.

Corollary 2.

Under the assumptions (A1)-(A2) and

n = 1

, the MVUEs

\hat{Σ_{u}}

,

\hat{Σ_{u}^{t o t}}

, and

\hat{Σ}

become, respectively,

\hat{σ_{u}} : = \frac{1}{2 m} \sum_{i = 1}^{m} [g_{l} (R_{i}^{(1)}) - g_{l} (R_{i, u}^{(2)}, R_{i, \sim u}^{(1)})] [g_{l} (R_{i, u}^{(1)}, R_{i, \sim u}^{(2)}) - g_{l} (R_{i}^{(2)})];

\hat{σ_{u}^{t o t}} : = \frac{1}{4 m} \sum_{i = 1}^{m} \{{[g_{l} (R_{i}^{(1)}) - g_{l} (R_{i, u}^{(2)}, R_{i, \sim u}^{(1)})]}^{2} + {[g_{l} (R_{i, u}^{(1)}, R_{i, \sim u}^{(2)}) - g_{l} (R_{i}^{(2)})]}^{2}\};

\hat{σ} : = \frac{1}{4 m} \sum_{i = 1}^{m} \{{[g_{l} (R_{i}^{(1)}) - g_{l} (R_{i}^{(2)})]}^{2} + {[g_{l} (R_{i, u}^{(1)}, R_{i, \sim u}^{(2)}) - g_{l} (R_{i, u}^{(2)}, R_{i, \sim u}^{(1)})]}^{2}\} .

If we are only interested in the total effects, we should use the expressions of

\hat{Σ_{u}^{t o t}}

and

\hat{σ_{u}^{t o t}}

given, respectively, by

\frac{1}{2 m} \sum_{i = 1}^{m} [g_{l} (R_{i}^{(1)}) - g_{l} (R_{i, u}^{(2)}, R_{i, \sim u}^{(1)})] {[g_{l} (R_{i}^{(1)}) - g_{l} (R_{i, u}^{(2)}, R_{i, \sim u}^{(1)})]}^{T};

\frac{1}{2 m} \sum_{i = 1}^{m} {[g_{l} (R_{i}^{(1)}) - g_{l} (R_{i, u}^{(2)}, R_{i, \sim u}^{(1)})]}^{2} .

Using the results from Theorem 2, Corollary 1, and Corollary 2, we derive the estimators of dGSIs and dSIs in Corollary 3, Theorem 3, and Corollary 4, respectively.

Corollary 3.

Assume that (A1) and (A2) hold. If we observe the model outputs at

θ_{l} \in Θ

with

l = 1, \dots, L

, then

(i): the consistent estimators of the first-type dGSIs are given as follows:

$\hat{d G S I_{u, 1}} : = \frac{\sum_{l = 1}^{L} T r (\hat{Σ_{u} (θ_{l}, θ_{l})})}{\sum_{l = 1}^{L} T r (\hat{Σ (θ_{l}, θ_{l})})} \overset{P}{\to} d G S I_{u, 1},$

when $L \to \infty, m \to \infty$ , where $\overset{P}{\to}$ denotes the convergence in probability.

$\hat{d G S I_{T_{u}, 1}} : = \frac{\sum_{l = 1}^{L} T r (\hat{Σ_{u}^{t o t} (θ_{l}, θ_{l})})}{\sum_{l = 1}^{L} T r (\hat{Σ (θ_{l}, θ_{l})})} \overset{P}{\to} d G S I_{T_{u}, 1} .$
(ii): The estimators of the second-type dGSIs are given as follows:

$\hat{d G S I_{u, 2}} : = {(\frac{\sum_{l = 1}^{L} {||\hat{Σ_{u} (θ_{l}, θ_{l})}||}_{F}^{2}}{\sum_{l = 1}^{L} {||\hat{Σ (θ_{l}, θ_{l})}||}_{F}^{2}})}^{1 / 2} \overset{P}{\to} d G S I_{u, 2},$

$\hat{d G S I_{T_{u}, 2}} : = {(\frac{\sum_{l = 1}^{L} {||\hat{Σ_{u}^{t o t} (θ_{l}, θ_{l})}||}_{F}^{2}}{\sum_{l = 1}^{L} {||\hat{Σ (θ_{l}, θ_{l})}||}_{F}^{2}})}^{1 / 2} \overset{P}{\to} d G S I_{T_{u}, 2},$
(iii): The estimators of the third-type dGSIs are given as follows:

$\hat{d G S I_{u, 3}} : = {(\frac{\sum_{l_{1} = 1}^{L} \sum_{l_{2} = 1}^{L} {||\hat{Σ_{u} (θ_{l_{1}}, θ_{l_{2}})}||}_{F}^{2}}{\sum_{l_{1} = 1}^{L} \sum_{l_{2} = 1}^{L} {||\hat{Σ (θ_{l_{1}}, θ_{l_{2}})}||}_{F}^{2}})}^{1 / 2} \overset{P}{\to} d G S I_{u, 3},$

$\hat{d G S I_{T_{u}, 3}} : = {(\frac{\sum_{l_{1} = 1}^{L} \sum_{l_{2} = 1}^{L} {||\hat{Σ_{u}^{t o t} (θ_{l_{1}}, θ_{l_{2}})}||}_{F}^{2}}{\sum_{l_{1} = 1}^{L} \sum_{l_{2} = 1}^{L} {||\hat{Σ (θ_{l_{1}}, θ_{l_{2}})}||}_{F}^{2}})}^{1 / 2} \overset{P}{\to} d G S I_{T_{u}, 3} .$

Proof.

Using Theorem 2, the results hold by applying the Slutsky theorem. □

For computing the first-order and total-effect covariances of each

X_{j}

,

2 m (d + 1)

model runs are needed. Some of such runs can be combined for computing the model outputs’ covariance. From now on, M model runs are used for computing the model outputs’ covariance. The operator

Vec (\cdot)

transforms a matrix

Σ \in R^{n \times n}

into a vector, that is,

Vec (Σ) \in R^{n^{2}}

and

O \in R^{n \times n}

denotes the null matrix.

Theorem 3.

Assume that (A1) and (A2) hold,

m \to + \infty

,

M \to + \infty

and

m / M \to 0

.

(i) The estimators of the first-type dGSIs are given as follows:

\hat{d G S I_{u, 1}^{M}} : = \frac{T r (\hat{Σ_{u}})}{T r (\hat{Σ})} \overset{P}{\to} d G S I_{u, 1}^{M}; \hat{d G S I_{T_{u}, 1}^{M}} : = \frac{T r (\hat{Σ_{u}^{t o t}})}{T r (\hat{Σ})} \overset{P}{\to} d G S I_{T_{u}, 1}^{M},

(27)

with the following asymptotic distributions:

\sqrt{m} (\hat{d G S I_{u, 1}^{M}} - d G S I_{u, 1}^{M}) \overset{D}{\to} N (0, \frac{V [T r \{K_{u}^{f o} (R_{1}^{(1)}, R_{1}^{(2)})\}]}{{(T r (Σ))}^{2}});

\sqrt{m} (\hat{d G S I_{T_{u}, 1}^{M}} - d G S I_{T_{u}, 1}^{M}) \overset{D}{\to} N (0, \frac{V [T r \{K_{u}^{t o t} (R_{1}^{(1)}, R_{1}^{(2)})\}]}{{(T r (Σ))}^{2}}) .

(ii) For the second-type dGSIs, we have

\hat{d G S I_{u, 2}^{M}} : = \frac{{||\hat{Σ_{u}}||}_{F}}{{||\hat{Σ}||}_{F}} \overset{P}{\to} d G S I_{u, 2}^{M}; \hat{d G S I_{T_{u}, 2}^{M}} : = \frac{{||\hat{Σ_{u}^{t o t}}||}_{F}}{{||\hat{Σ}||}_{F}} \overset{P}{\to} d G S I_{T_{u}, 2}^{M},

(28)

\sqrt{m} (\hat{d G S I_{u, 2}^{M}} - d G S I_{u, 2}^{M}) \overset{D}{\to} N (0, \frac{V e c {(Σ_{u})}^{T} V [V e c \{K_{u}^{f o} (R_{1}^{(1)}, R_{1}^{(2)})\}] V e c (Σ_{u})}{{||Σ_{u}||}_{F}^{2} {||Σ||}_{F}^{2}}) .

\sqrt{m} (\hat{d G S I_{T_{u}, 2}^{M}} - d G S I_{T_{u}, 2}^{M}) \overset{D}{\to} N (0, \frac{V e c {(Σ_{u}^{t o t})}^{T} V [V e c \{K_{u}^{t o t} (R_{1}^{(1)}, R_{1}^{(2)})\}] V e c (Σ_{u}^{t o t})}{{||Σ_{u}^{t o t}||}_{F}^{2} {||Σ||}_{F}^{2}}),

provided that

Σ_{u} \neq O

and

Σ_{u}^{t o t} \neq O

.

Proof.

See Appendix I. □

Using Theorem 3, we give the estimators of dSIs (see Definition 5) for real-valued functions in Corollary 4.

Corollary 4.

Let

n = 1

and

σ^{2} : = V [f (X)]

. Assume that (A1)-(A2) hold,

m \to + \infty

,

M \to + \infty

and

m / M \to 0

. The estimators of dSIs are given as follows:

\hat{d S_{u}} : = \frac{\hat{σ_{u}}}{\hat{σ}} \overset{P}{\to} d S_{u}; \sqrt{m} (\hat{d S_{u}} - d S_{u}) \overset{D}{\to} N (0, \frac{V [K_{u}^{f o} (R_{1}^{(1)}, R_{1}^{(2)})]}{σ^{4}});

(29)

\hat{d S_{T_{u}}} : = \frac{\hat{σ_{u}^{t o t}}}{\hat{σ}} \overset{P}{\to} d S_{T_{u}}; \sqrt{m} (\hat{d S_{T_{u}}} - d S_{T_{u}}) \overset{D}{\to} N (0, \frac{V [K_{u}^{t o t} (R_{1}^{(1)}, R_{1}^{(2)})]}{σ^{4}}) .

(30)

The computation of the dGSIs or dSIs of

X_{u}

for all

u \subseteq {1, \dots, d}

using the above estimators will require

R_{min}

ERs of f. When we are only interested in

u \subseteq {1, \dots, d}

with

| u | = 1

,

R_{min, 1} : = max (d_{2}, \dots, d_{K})

, the ERs of f are necessary and sufficient, and we need

2 \times m (d + 1)

model evaluations to estimate the first-order and total dGSIs or dSIs of

X_{j}

for all

j \in {1, \dots, d}

. Note that with the same

R_{min, 1}

ERs and additional model runs, we are able to compute the effects of other subsets of inputs and interactions.

6. Analytical and Numerical Results

In this section, we illustrate our approach by means of analytical test cases, including a dynamic model so as to highlight some theoretical properties of the new indices.

6.1. Linear Function Without Explicit Interaction ( $d = 3$ , $n = 1$ )

We consider

f (X) = X_{1} + X_{2} + X_{3}

with

X \sim N (0, [\begin{matrix} σ_{1}^{2} & ρ_{12} σ_{1} σ_{2} & ρ_{13} σ_{1} σ_{3} \\ ρ_{12} σ_{1} σ_{2} & σ_{2}^{2} & ρ_{23} σ_{2} σ_{3} \\ ρ_{13} σ_{1} σ_{3} & ρ_{23} σ_{2} σ_{3} & σ_{3}^{2} \end{matrix}])

. A dependency function of

X

is given by

(X_{2}, X_{3}) = r_{1} (X_{1}, Z_{2}, Z_{3})

, where

\begin{matrix} \{\begin{matrix} X_{2} & = & \frac{ρ_{12} σ_{2}}{σ_{1}} X_{1} + \sqrt{1 - ρ_{12}^{2}} Z_{2} \\ X_{3} & = & \frac{ρ_{13} σ_{3}}{σ_{1}} X_{1} + \frac{σ_{3} (ρ_{23} - ρ_{12} ρ_{13})}{σ_{2} \sqrt{1 - ρ_{12}^{2}}} Z_{2} + \sqrt{\frac{1 - ρ_{12}^{2} - ρ_{13}^{2} - ρ_{23}^{2} + 2 ρ_{12} ρ_{13} ρ_{23}}{1 - ρ_{12}^{2}}} Z_{3} \end{matrix}, \end{matrix}

Z_{i} \sim N (0, σ_{i}^{2}), i = 2, 3

, and an ER of f is given by

\begin{matrix} g_{1} (X_{1}, Z_{2}, Z_{3}) & = & (1 + \frac{ρ_{12} σ_{2}}{σ_{1}} + \frac{ρ_{13} σ_{3}}{σ_{1}}) X_{1} + (\sqrt{1 - ρ_{12}^{2}} + \frac{σ_{3} (ρ_{23} - ρ_{12} ρ_{13})}{σ_{2} \sqrt{1 - ρ_{12}^{2}}}) Z_{2} \\ + \sqrt{\frac{1 - ρ_{12}^{2} - ρ_{13}^{2} - ρ_{23}^{2} + 2 ρ_{12} ρ_{13} ρ_{23}}{1 - ρ_{12}^{2}}} Z_{3} . \end{matrix}

Using such ERs of f, we have the following dSIs of

X_{1}

and

(X_{1}, X_{2})

:

d S_{1} = d S_{T_{1}} = \frac{{(σ_{1} + ρ_{12} σ_{2} + ρ_{13} σ_{3})}^{2}}{\sum_{j = 1}^{3} σ_{j}^{2} + 2 ρ_{12} σ_{1} σ_{2} + 2 ρ_{13} σ_{1} σ_{3} + 2 ρ_{23} σ_{2} σ_{3}},

d S_{12} = d S_{T_{12}} = \frac{(1 - ρ_{12}^{2}) {(σ_{1} + ρ_{12} σ_{2} + ρ_{13} σ_{3})}^{2} + {(σ_{2} (1 - ρ_{12}^{2}) + σ_{3} (ρ_{23} - ρ_{12} ρ_{13}))}^{2}}{(1 - ρ_{12}^{2}) (\sum_{j = 1}^{3} σ_{j}^{2} + 2 ρ_{12} σ_{1} σ_{2} + 2 ρ_{13} σ_{1} σ_{3} + 2 ρ_{23} σ_{2} σ_{3})},

respectively. Using the same reasoning and knowing that

R_{min} = 3

, the two extra ERs of f lead to the remaining results. When using

g_{2} (X_{2}, Z_{3}, Z_{1})

, we have

d S_{2} = d S_{T_{2}} = \frac{{(σ_{2} + ρ_{12} σ_{1} + ρ_{23} σ_{3})}^{2}}{\sum_{j = 1}^{3} σ_{j}^{2} + 2 ρ_{12} σ_{1} σ_{2} + 2 ρ_{13} σ_{1} σ_{3} + 2 ρ_{23} σ_{2} σ_{3}},

d S_{23} = d S_{T_{23}} = \frac{(1 - ρ_{23}^{2}) {(σ_{2} + ρ_{12} σ_{1} + ρ_{23} σ_{3})}^{2} + {(σ_{3} (1 - ρ_{23}^{2}) + σ_{1} (ρ_{13} - ρ_{12} ρ_{23}))}^{2}}{(1 - ρ_{23}^{2}) (\sum_{j = 1}^{3} σ_{j}^{2} + 2 ρ_{12} σ_{1} σ_{2} + 2 ρ_{13} σ_{1} σ_{3} + 2 ρ_{23} σ_{2} σ_{3})} .

Likewise, using

g_{3} (X_{3}, Z_{1}, Z_{2})

, we obtain

d S_{3} = d S_{T_{3}} = \frac{{(σ_{3} + ρ_{13} σ_{1} + ρ_{23} σ_{2})}^{2}}{\sum_{j = 1}^{3} σ_{j}^{2} + 2 ρ_{12} σ_{1} σ_{2} + 2 ρ_{13} σ_{1} σ_{3} + 2 ρ_{23} σ_{2} σ_{3}},

d S_{13} = d S_{T_{13}} = \frac{(1 - ρ_{13}^{2}) {(σ_{3} + ρ_{13} σ_{1} + ρ_{23} σ_{2})}^{2} + {(σ_{1} (1 - ρ_{13}^{2}) + σ_{2} (ρ_{12} - ρ_{13} ρ_{23}))}^{2}}{(1 - ρ_{13}^{2}) (\sum_{j = 1}^{3} σ_{j}^{2} + 2 ρ_{12} σ_{1} σ_{2} + 2 ρ_{13} σ_{1} σ_{3} + 2 ρ_{23} σ_{2} σ_{3})} .

6.2. Functional Outputs: Dynamic Model ( $d = 2$ , $n = 1$ )

The following dynamic model includes two inputs

X \sim N (0, [\begin{matrix} 1 & ρ \\ ρ & 1 \end{matrix}])

:

f (X, t) = (X_{1} + X_{2} + a X_{1} X_{2}) (2 - α (t)) + (X_{1}^{2} + \sqrt{2} X_{2}) (- 1 + α (t)),

where

a \in R

,

t \in [0, 365]

and

α (t) \in R

. When

a = 0

, there is no explicit interaction between

X_{1}

and

X_{2}

. To illustrate the difference between both types of dGSIs, we suppose that we have observed the model outputs at

t \in {t_{1}, t_{2}}

with

α (t_{1}) = 1

and

α (t_{2}) = 2

. Using the dependency functions

X_{2} = r_{1} (X_{1}, Z_{2}) = ρ X_{1} + \sqrt{1 - ρ^{2}} Z_{2}

and

X_{1} = r_{2} (X_{2}, Z_{1}) = ρ X_{2} + \sqrt{1 - ρ^{2}} Z_{1}

with

Z_{i} \sim N (0, 1)

and

i = 1, 2

, Figure 1 shows the two types of dGSIs.

Figure 1. First-type, prime second-type, and second-type dGSIs for different values of the correlation between the two inputs and for

a = - 2, 0

.

The first figure (top-left panel) depicts the first-type and second-type dGSIs, and we see that both types of dGSIs give the same ranking of inputs for negative values of the correlation. In the absence of correlation (i.e.,

ρ = 0

),

X_{1}

and

X_{2}

have the same effect according to the first-type dGSIs, while the second-type dGSIs identify

X_{2}

as the most influential input. When

ρ > 0.5

, the first-type dGSIs show that

X_{2}

is the most important input, while the second-type dGSIs suggest that both inputs have the same total effects. The second figure (top-right panel) compares the prime second-type and the second-type total dGSIs, and the results are similar to those of the first figure. The third figure (bottom-left panel) compares the first-order dGSIs of the first and second types, and it comes out that such dGSIs give different main effects of inputs for positive correlations. In the last figure, the first-type and second-type dGSIs give the same ranking of inputs except for

ρ = 0

. As different rankings of inputs can happen using both types of dGSIs, and knowing that the second-type dGSIs account for the correlations among SFs, we should prefer such indices.

6.3. Sobol’s g-Function ( $d = 10$ , $K = 3$ )

Here, we consider

f (x) : = \prod_{j = 1}^{d = 10} \frac{| 4 x_{j} - 2 | + a_{j}}{1 + a_{j}}

with

a : = (35, 35, \dots, 35) \in R^{d}

. The

d = 10

inputs are organized into

K = 3

blocks as follows:

$(X_{j} \sim U (0, 1), j = 4, \dots, 8)$ are independent variables, that is, $π_{1} = {4, \dots, 8}$ ;
$π_{2} = {1, \dots, 3}$ and $(X_{1}, X_{2}, X_{3})$ have a Gaussian copula with $ρ_{12} = 0, ρ_{13} = 0.01, ρ_{23} = 0.85$ as the correlation values, and $X_{j} \sim U (0, 1), j = 1, 2, 3$ ;
$π_{3} = {9, 10}$ , where $X_{9} \sim U (0, 1), X_{10} \sim U (0, 1)$ with $X_{9} + X_{10} \leq 1$ .

The dependency functions of

X_{π_{3}}

are given by

X_{10}^{c} = U_{10} (1 - X_{9}^{c})

and

X_{9}^{c} = U_{9} (1 - X_{10}^{c})

where

U_{10} \sim U (0, 1)

,

U_{9} \sim U (0, 1)

; and

U_{9}, U_{10}

are independent of

X_{9}^{c} \sim B e t a (1, 2)

,

X_{10}^{c} \sim B e t a (1, 2)

[].

Based on the

R_{min} = 6

ERs of

f (X)

(see Appendix J), we computed the dSIs using Sobol’s sequences and the sample size m = 10,000 (see Table 1). We can see that

X_{2}, X_{3}, X_{9}, X_{10}

are the most influential inputs. For fixing

X_{1}

, we need the copula-based dependency model of

X_{π_{2}}

, that is,

X_{1} = r_{1} (X_{2}, Z_{3}, Z_{1})

. Since the block of inputs

X_{π_{1}}

is not important, we have also computed the dSIs of the pairs of variables selected out of

(X_{π_{2}}, X_{π_{3}})

(see Table 1). As expected, the total indices are always greater than the first-order indices.

Table 1. Estimates of dSIs for Sobol’s g-function with NIVs.

7. Conclusions

In this paper, we have proposed a new way of deriving the distributions of the model outputs conditional on every subset of inputs using dependency models of random vectors of NIVs. We have provided additional generic dependency models, including empirical or computational dependency functions, of d-dimensional random vectors following many distributions, such as copula-based distributions with discrete variables, as well as distributions of inputs of complex mathematical models subjected to constraints. It came out that

\prod_{k = 2}^{K} (\binom{d_{k}}{j_{0, k}})

different equivalent representations of the model output of interest are necessary and sufficient for recovering all the conditional distributions and for assessing the effects of every subset of NIVs, including their interactions. An algorithm is then provided for selecting such representations or equivalently such dependency models.

Based on such conditional distributions and using variance–covariance as an important measure, we have extended i) dGSIs for the multivariate and/or functional outputs (including spatiotemporal models and dynamic models), and ii) dependent sensitivity indices for single-response models so as to cope with every subset of NIVs. Such indices are also well suited for models with both discrete and continuous variables. Consistent estimators of such indices and their asymptotic distributions are provided. It is worth noting that such conditional distributions are also relevant for commuting the variance-based Shapley effects [].

Analytical test cases confirmed that the first-order index of any subset of inputs is less than its total index, as expected. In the case of the dynamic model considered, it came out that the second-type dGSIs, which account for the correlations among the components of sensitivity functionals, and the first-type dGSIs can give different rankings of input variables. Therefore, we should prefer the second-type dGSIs in practice. Moreover, it came out that the sum of the main and interaction indices can be greater than one. In the next works, it will be interesting to investigate a new approach for which the main and interactions indices sum up to one. Also, the derivation of MSEs of the estimators of covariances of sensitivity functionals is quite interesting.

Since the computations of the effects of all the subsets of inputs rely on

2^{\frac{d - d_{1}}{2}} \leq R_{min} 2^{d - d_{1}}

ERs, which grow exponentially with respect to

d (1 - α)

, it becomes difficult to deploy such efficient approaches in higher dimensions under the following conditions:

α = d_{1} / d ≪ 1

and all the K groups of NIVs are important, which is not often the case.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/math13050766/s1, Derivation of estimators of covariances and dGSIs.

Funding

This research received no external funding.

Data Availability Statement

No data sets are used in this paper. This paper provides theoretical results with analytical test cases. Simulated data are in the paper.

Acknowledgments

We would like to thank the two referees and the associate editor for their comments and suggestions that have helped improving this paper.

Conflicts of Interest

The author has no conflicts of interest to declare regarding the content of this paper.

Appendix A. Proof of Lemma 1

Consider the variable

Y_{k} = Φ^{- 1} (F_{k} (X_{k}))

if

X_{k}

is continuous and

Y_{k} = Φ^{- 1} (τ_{F_{k}} (X_{k}, U_{k}))

otherwise. It is well known that

Y_{k}

follows the standard normal distribution with

k = 1, \dots, d

, and

Y = (Y_{j}, Y_{\sim j})

has the same copula as

X

[], as

Φ^{- 1} \circ F_{k}

(resp.

Φ^{- 1} \circ τ_{F_{k}}

) is a strictly increasing transformation on the range of

X_{k}

. Therefore,

Y \sim N_{d} (0, R)

. Knowing that the dependency function of

Y

is given by

(Y_{j}, Y_{\sim j}) = L {[Y_{j}, Z^{T}]}^{T}

(see []), the result follows using the inverse transformation of the form

X_{k} = F_{k}^{\leftarrow} \circ Φ (Y_{k})

.

Appendix B. Proof of Lemma 2

Using the same reasoning as in Appendix A, we can see that

Y \sim t_{d} (ν, 0, R)

with

Y_{k} = T_{v}^{- 1} (F_{k} (X_{k}))

for continuous variable

X_{k}

and

Y_{k} = T_{v}^{- 1} (τ_{F_{k}} (X_{k}, U_{k}))

otherwise. We then have to derive the dependency function of

Y

to obtain the result, and it is performed below. As

Y \sim t_{d} (ν, 0, I) ⟺ L Y \sim t_{d} (ν, 0, R)

, with

L

being the Cholesky factor of

R

, the result holds knowing that []

[\begin{matrix} Y_{j} \\ Y_{w_{1}} = \sqrt{\frac{ν + Y_{j}^{2}}{ν + 1}} Z_{w_{1}} \\ ⋮ \\ Y_{w_{d - 1}} = \sqrt{\frac{(ν + Y_{j}^{2}) \prod_{k = 1}^{d - 2} (ν + k + {(Z_{w_{k}})}^{2})}{\prod_{k = 1}^{d - 1} (ν + k)}} Z_{w_{d - 1}} \end{matrix}] \sim t_{d} (ν, 0, I) .

Appendix C. Proof of Lemma 3

Consider any measurable and integrable function

h : R^{n} \to R

. It is known from Proposition 1 that

X_{w_{\sim 1, k}} \overset{d}{=} r_{w_{1, k}} (X_{w_{1, k}}, Z_{v_{k}}, Z_{w_{\sim 1, k} ∖ v_{k}}) ⟹ X_{\sim u_{k}} \overset{d}{=} r_{u_{k}} (X_{u_{k}}, Z_{w_{\sim 1, k} ∖ v_{k}}) .

Using

Y : = (X_{π_{1} ∖ u_{0}}, X_{s ∖ {w_{1, k_{1}}, \dots, w_{1, k_{m}}}}, Z_{w_{\sim 1, k_{1}} ∖ v_{k_{1}}}, \dots, Z_{w_{\sim 1, k_{m}} ∖ v_{k_{m}}}, Z_{w_{\sim 1} ∖ {w_{\sim 1, k_{1}}, \dots, w_{\sim 1, k_{m}}}})

and the fact that the components of

(X_{π_{1}}, X_{s}, Z_{w_{\sim 1}})

are independent, we can write

\begin{matrix} E [h (g (X_{π_{1}}, X_{s}, Z_{w_{\sim 1}}, θ)) | X_{u_{0}}, X_{w_{1, k_{1}}}, Z_{v_{k_{1}}}, \dots, X_{w_{1, k_{m}}}, Z_{v_{k_{m}}}] \\ = & E_{Y} [h (f (X_{π_{1}}, X_{w_{1, 2}}, r_{w_{1, 2}} (X_{w_{1, 2}}, Z_{w_{\sim 1, 2}}), \dots, X_{w_{1, K}}, r_{w_{1, K}} (X_{w_{1, K}}, Z_{w_{\sim 1, K}}), θ))] \\ = & E_{Y} [h (f (X_{π_{1}}, X_{w_{1, k}}, r_{w_{1, k}} (X_{w_{1, k}}, Z_{v_{k}}, Z_{w_{\sim 1, k} ∖ v_{k}}), k = 2, \dots, K, θ))] \\ \overset{d}{=} & E_{Y} [h (f (X_{π_{1}}, X_{u_{k}}, r_{u_{k}} (X_{u_{k}}, Z_{w_{\sim 1, k} ∖ v_{k}}), k = 2, \dots, K, θ))] \\ = & E [h (f (X, θ)) | X_{u_{0}}, X_{u_{k_{1}}}, \dots, X_{u_{k_{m}}}] . \end{matrix}

Appendix D. Proof of Lemma 4

For Equation (15), at the end of the first step (i.e.,

e_{0} = 1

),

B_{k}

contains super-sets of

{j_{k}}

for all

j_{k} \in π_{k}

of the form

(j_{k}, t_{k})

with

t_{k} \subseteq π_{k} ∖ {j_{k}}

. Indeed, for two super-sets

(j_{k_{1}}, t_{k_{1}})

and

(j_{k_{2}}, t_{k_{2}})

of

B_{k}

, we must have

{j_{k_{1}}, v_{1, k_{1}}, \dots, v_{j, k_{1}}} \neq {j_{k_{2}}, v_{1, k_{2}}, \dots, v_{j, k_{2}}} for all j \in {0 \dots, j_{0, k}}, and

{v_{j, k_{1}}, \dots, v_{d_{k}, k_{1}}} \neq {v_{j, k_{2}}, \dots, v_{d_{k}, k_{2}}} for all j \in {j_{0, k} + 1 \dots, d_{k}},

leading to

\{u \subseteq π_{k} : | u | = 1\} \subseteq B_{k}

.

Secondly, when

e_{0} = 2

(from iteration

d_{k} + 1

to

\frac{d_{k} (d_{k} - 1)}{2}

), we add the super-sets of

{j_{k_{1}}, j_{k_{2}}}

of the form

(j_{k_{1}}, j_{k_{2}}, t_{k_{1}, k_{2}})

, which were not in

B_{k}

at the end of the first step

e_{0} = 1

. As for the two new super-sets, that is,

{j_{k_{1}}, j_{k_{2}}, t_{k_{1}, k_{2}}}

,

{j_{k_{3}}, j_{k_{4}}, t_{k_{3}, k_{4}}}

, we must have

{j_{k_{1}}, j_{k_{2}}} \neq {j_{k_{3}}, j_{k_{4}}}

,

{j_{k_{1}}, j_{k_{2}}} \neq {j_{k_{1}}, v_{1, k_{1}}}

, and

{j_{k_{1}}, v_{1, k_{1}}} \neq {j_{k_{3}}, j_{k_{4}}}

; the first two steps allow for obtaining

\{u \subseteq π_{k} : 1 \leq | u | \leq 2\} \subseteq B_{k}

.

Thirdly, we repeat that procedure up to

e_{0} = j_{0, k} - 1

to obtain the super-sets of

{j_{k_{1}}, \dots, j_{j_{0, k} - 1}}

and

\{u \subseteq π_{k} : 1 \leq | u | \leq j_{0, k} - 1\} \subseteq B_{k}

. These operations are possible because

(\binom{d_{k}}{e_{0}}) \leq (\binom{d_{k}}{j_{0, k}})

for all

e_{0} = 1, \dots, j_{0, k} - 1

, and we avoid permutations (

w_{k}

) that bring replicated sets in both

B_{k}

and

E_{k}

. Fourthly, the iterations

(\binom{d_{k}}{j_{0, k} - 1}) < i \leq (\binom{d_{k}}{j_{0, k}})

(when possible) aim to add the remaining subsets of

j_{0, k}

elements.

Fifthly, we have

\{u \subseteq π_{k} : | u | = j_{0, k} + 1\} \subseteq B_{k}

because for any

v_{1} \subseteq π_{k}

with

| v_{1} | = j_{0, k} + 1

, there exists

w_{k}^{*} \in P_{k}

such that

{w_{j_{0, k} + 2, k}^{*}, \dots, w_{d_{k}, k}^{*}} ⋂ v_{1} = \emptyset

. Indeed,

w_{k}^{*}

was added in

P_{k}

when constructing all the subsets

u \subseteq π_{k}

with

| u | = d_{k} - j_{0, k} - 1 < j_{0, k}

thanks to

E_{k}

and the fact that

(\binom{d_{k}}{j_{0, k} + 1}) = (\binom{d_{k}}{d_{k} - j_{0, k} - 1})

. Thus,

v_{1} = {w_{1, k}^{*}, \dots, w_{j_{0, k} + 1, k}^{*}}

. Finally, we use the same reasoning to obtain the results.

Equation (16) is obvious by construction (see Algorithm 1).

Appendix E. Proof of Theorem 1

First, for

u_{k} \subseteq π_{k}

with

| u_{k} | > 0

, there exists

w_{k}^{*} \in P_{k}

such that

u_{k} = {w_{1, k}^{*}, \dots, w_{| u_{k} |, k}^{*}}

according to Lemma 4. Lemma 3 ensures the determination of the distribution of f given

X_{u_{k}}

using g associated with

w_{k}^{*}

.

Second, for

u : = (u_{0}, u_{2}, \dots, u_{K})

, where

u_{0} \subseteq π_{1}, u_{k} \subseteq π_{k}

, and

| u_{k} | > 0

with

k = 2, \dots, K

, there exists only one permutation

w_{k}^{*} \in P_{k}

such that

u_{k} = {w_{1, k}^{*}, \dots, w_{| u_{k} |, k}^{*}},

\forall k = 2, \dots, K

. As only one representation of f associated with

w_{k}^{*}, k = 2, \dots, K

allows for determining the distribution of f given

X_{u}

, and

| P_{k} | = (\binom{d_{k}}{j_{0, k}})

, then

R_{min} : = \prod_{k = 2}^{K} (\binom{d_{k}}{j_{0, k}})

different representations of f are needed to obtain the distribution of f given

X_{u}

for all

u \subseteq {1, \dots, d}

. The result follows because

R_{min}

is the highest number of possibilities of

\{u_{k} \subseteq π_{k}, k = 2, \dots, K : | u_{k} | = j_{0, k}\}

, and other possibilities are in the

R_{min}

representations (Lemma 4).

Appendix F. Proof of Proposition 2

The proofs are straightforward. The results rely on the Hoeffding decomposition of an equivalent representation of f and the fact that for two positive semi-definite matrices

A_{1}

,

A_{2}

, the Loewner partial ordering, that is,

A_{1} ⪯ A_{2}

, implies

Tr (A_{1}) \leq Tr (A_{2})

and

{||A_{1}||}_{F} \leq {||A_{2}||}_{F}

. See [] for more details.

Appendix G. Proof of Theorem 2

Since

θ_{1}, θ_{2}

are not random quantities, the proofs of Points (i)–(iii) are straightforward. Indeed, by expanding the above expressions of the estimators, we obtain unbiased estimators, and by applying the law of large numbers, we obtain consistent estimators. Detailed similar proofs can be found in [,,].

Appendix H. Proof of Corollary 1

Firstly, the proposed estimators are unbiased and consistent by applying Theorem 2 where

θ_{1} = θ_{2} = θ_{0}

is a constant. Secondly, the MVU properties are due to the symmetric properties of the kernels used []. Indeed, each kernel remains unchanged when one permutes its arguments. More details can be found in [] (Theorems 2 and 3).

Appendix I. Proof of Theorem 3

The results for the consistency are deduced from Corollary 3, as

Θ = {θ_{0}}

.

For the asymptotic distribution of Points (i)–(ii), the derivation of the results is similar to the one provided in [] (Theorem 6), under the condition

m / M \to 0

.

Appendix J. Equivalent Representations of the Function Used in Section 6.3

The

R_{min} = 6

equivalent representations of

f (X)

are given as follows:

\begin{matrix} g_{1} : = f (X_{1}, r_{1} (X_{1}, Z_{2}, Z_{3}), X_{π_{1}}, X_{9}, r_{9} (X_{9}, Z_{10})) & f o r & X_{1}, X_{9}, (X_{1}, X_{2}), (X_{1}, X_{9}), \\ X_{k}, (X_{1}, X_{k}), \forall k \in π_{1}, \dots; \\ g_{2} : = f (X_{2}, r_{2} (X_{2}, Z_{3}, Z_{1}), X_{π_{1}}, X_{9}, r_{9} (X_{9}, Z_{10})) & f o r & X_{2}, (X_{2}, X_{3}), (X_{2}, X_{9}), (X_{9}, X_{10}), \dots; \\ g_{3} : = f (X_{3}, r_{3} (X_{3}, Z_{1}, Z_{2}), X_{π_{1}}, X_{9}, r_{9} (X_{9}, Z_{10})) & f o r & X_{3}, (X_{3}, X_{1}), (X_{3}, X_{9}), \dots; \\ g_{4} : = f (X_{1}, r_{1} (X_{1}, Z_{2}, Z_{3}), X_{π_{1}}, X_{10}, r_{10} (X_{10}, Z_{9})) & f o r & X_{10}, (X_{1}, X_{10}), \dots; \\ g_{5} : = f (X_{2}, r_{2} (X_{2}, Z_{3}, Z_{1}), X_{π_{1}}, X_{10}, r_{10} (X_{10}, Z_{9})) & f o r & (X_{2}, X_{10}), \dots; \\ g_{6} : = f (X_{3}, r_{3} (X_{3}, Z_{1}, Z_{2}), X_{π_{1}}, X_{10}, r_{10} (X_{10}, Z_{9})) & f o r & (X_{3}, X_{10}), \dots, \end{matrix}

where

r_{1}, r_{2}, r_{3}

are particular cases of DMs provided in Lemma 1.

References

Sobol, I.M. Sensitivity analysis for non-linear mathematical models. Math. Model. Comput. Exp. 1993, 1, 407–414. [Google Scholar]
Lamboni, M. Multivariate sensitivity analysis: Minimum variance unbiased estimators of the first-order and total-effect covariance matrices. Reliab. Eng. Syst. Saf. 2019, 187, 67–92. [Google Scholar] [CrossRef]
Lamboni, M.; Monod, H.; Makowski, D. Multivariate sensitivity analysis to measure global contribution of input factors in dynamic models. Reliab. Eng. Syst. Saf. 2011, 96, 450–459. [Google Scholar] [CrossRef]
Gamboa, F.; Janon, A.; Klein, T.; Lagnoux, A. Sensitivity indices for multivariate outputs. Comptes Rendus Math. 2013, 351, 307–310. [Google Scholar] [CrossRef]
Xiao, S.; Lu, Z.; Xu, L. Multivariate sensitivity analysis based on the direction of eigen space through principal component analysis. Reliab. Eng. Syst. Saf. 2017, 165, 1–10. [Google Scholar] [CrossRef]
Lamboni, M. Derivative-based generalized sensitivity indices and Sobol’ indices. Math. Comput. Simul. 2020, 170, 236–256. [Google Scholar] [CrossRef]
Perrin, T.; Roustant, O.; Rohmer, J.; Alata, O.; Naulin, J.; Idier, D.; Pedreros, R.; Moncoulon, D.; Tinard, P. Functional principal component analysis for global sensitivity analysis of model with spatial output. Reliab. Eng. Syst. Saf. 2021, 211, 107522. [Google Scholar] [CrossRef]
Veiga, S.D.; Wahl, F.; Gamboa, F. Local Polynomial Estimation for Sensitivity Analysis on Models With Correlated Inputs. Technometrics 2009, 51, 452–463. [Google Scholar] [CrossRef]
Mara, T.A.; Tarantola, S. Variance-based sensitivity indices for models with dependent inputs. Reliab. Eng. Syst. Saf. 2012, 107, 115–121. [Google Scholar] [CrossRef]
Kucherenko, S.; Tarantola, S.; Annoni, P. Estimation of global sensitivity indices for models with dependent variables. Comput. Phys. Commun. 2012, 183, 937–946. [Google Scholar] [CrossRef]
Hao, W.; Zhenzhou, L.; Wei, P. Uncertainty importance measure for models with correlated normal variables. Reliab. Eng. Syst. Saf. 2013, 112, 48–58. [Google Scholar] [CrossRef]
Chastaing, G.; Gamboa, F.; Prieur, C. Generalized Hoeffding-Sobol’ decomposition for dependent variables—Applications to sensitivity analysis. Electron. J. Stat. 2012, 6, 2420–2448. [Google Scholar] [CrossRef]
Kucherenko, S.; Klymenko, O.; Shah, N. Sobol’ indices for problems defined in non-rectangular domains. Reliab. Eng. Syst. Saf. 2017, 167, 218–231. [Google Scholar] [CrossRef]
Mara, T.A.; Tarantola, S.; Annoni, P. Non-parametric methods for global sensitivity analysis of model output with dependent inputs. Environ. Model. Softw. 2015, 72, 173–183. [Google Scholar] [CrossRef]
Tarantola, S.; Mara, T.A. Variance-based sensitivity indices of computer models with dependent inputs: The fourier amplitude sensitivity test. Int. J. Uncertain. Quantif. 2017, 7, 511–523. [Google Scholar] [CrossRef]
Lamboni, M.; Kucherenko, S. Multivariate sensitivity analysis and derivative-based global sensitivity measures with dependent variables. Reliab. Eng. Syst. Saf. 2021, 212, 107519. [Google Scholar] [CrossRef]
Lamboni, M. On exact distribution for multivariate weighted distributions and classification. Methodol. Comput. Appl. Probab. 2023, 25, 1–41. [Google Scholar] [CrossRef]
Lamboni, M. Kernel-based Measures of Association Between Inputs and Outputs Using ANOVA. Sankhya A 2024, 86, 790–826. [Google Scholar] [CrossRef]
Skorohod, A.V. On a representation of random variables. Theory Probab. Appl. 1976, 21, 645–648. [Google Scholar]
Lamboni, M. Efficient dependency models: Simulating dependent random variables. Math. Comput. Simul. 2021, 200, 199–217. [Google Scholar] [CrossRef]
Ferguson, T. Mathematical Statistics: A Decision Theoretic Approach; Academic Press: New York, NY, USA, 1967. [Google Scholar]
Rschendorf, L. Stochastically ordered distributions and monotonicity of the oc-function of sequential probability ratio tests. Ser. Stat. 1981, 12, 327–338. [Google Scholar] [CrossRef]
Rschendorf, L. Stochastic ordering of risks, influence of dependence and a.s. constructions. In Advances on Models, Characterizations and Applications; Balakrishnan, N., Bairamov, I.G., Gebizlioglu, O.L., Eds.; CRC Press: Boca Raton, FL, USA, 2005. [Google Scholar]
Rschendorf, L. On the distributional transform, Sklar’s theorem, and the empirical copula process. J. Stat. Plan. Inference 2009, 139, 3921–3927. [Google Scholar] [CrossRef]
Rosenblatt, M. Remarks on a Multivariate Transformation. Ann. Math. Statist. 1952, 23, 470–472. [Google Scholar] [CrossRef]
Nelsen, R. An Introduction to Copulas; Springer: New York, NY, USA, 2006. [Google Scholar]
McNeil, A.J.; Frey, R.; Embrechts, P. Quantitative Risk Management; Princeton University Press: Princeton, NJ, USA; Oxford, UK, 2015. [Google Scholar]
Rosenblatt, M. Remarks on some nonparametric estimates of a density function. Ann. Math. Stat. 1956, 27, 832–837. [Google Scholar] [CrossRef]
Parzen, E. On estimation of a probability density function and mode. Ann. Math. Stat. 1962, 33, 1065–1076. [Google Scholar] [CrossRef]
Epanechnikov, V. Nonparametric estimation of a multidimensional probability density. Theory Probab. Appl. 1969, 14, 153–158. [Google Scholar] [CrossRef]
Silverman, B. Density Estimation for Statistics and Data Analysis; Chapman & Hall: New York, NY, USA, 1986. [Google Scholar]
Clayton, D.G. A Model for Association in Bivariate Life Tables and Its Application in Epidemiological Studies of Familial Tendency in Chronic Disease Incidence. Biometrika 1978, 65, 141–151. [Google Scholar] [CrossRef]
Joe, H. Multivariate Models and Dependence Concepts; Chapman & Hall/CRC: Boca Raton, FL, USA; London, UK; New York, NY, USA, 1997. [Google Scholar]
Smith, M.; Min, A.; Almeida, C.; Czado, C. Modeling Longitudinal Data Using a Pair-Copula Decomposition of Serial Dependence. J. Am. Stat. Assoc. 2010, 105, 1467–1479. [Google Scholar] [CrossRef]
Durante, F.; Sempi, C. Principles of Copula Theory; CRC/Chapman & Hall: London, UK, 2015. [Google Scholar]
Koenker, R.; Bassett, G. Regression quantiles. Econometrica 1978, 46, 33–50. [Google Scholar] [CrossRef]
Truong, Y.K. Asymptotic Properties of Kernel Estimators Based on Local Medians. Ann. Stat. 1989, 17, 606–617. [Google Scholar] [CrossRef]
Hendricks, W.; Koenker, R. Hierarchical Spline Models for Conditional Quantiles and the Demand for Electricity. J. Am. Stat. Assoc. 1992, 87, 58–68. [Google Scholar] [CrossRef]
Koenker, R.; Ng, P.; Portnoy, S. Quantile smoothing splines. Biometrika 1994, 81, 673–680. [Google Scholar] [CrossRef]
Koenker, R.; Hallock, K.F. Quantile Regression. J. Econ. Perspect. 2001, 15, 143–156. [Google Scholar] [CrossRef]
Bassett, R., Jr.; Koenker, R. An Empirical Quantile Function for Linear Models with iid Errors. J. Am. Stat. Assoc. 1982, 77, 407–415. [Google Scholar]
Koenker, R. Quantile Regression; Cambridge University Press: Cambridge, UK, 2005. [Google Scholar]
Takeuchi, I.; Le, Q.V.; Sears, T.D.; Smola, A.J. Nonparametric Quantile Estimation. J. Mach. Learn. Res. 2006, 7, 1231–1264. [Google Scholar]
Lamboni, M. Uncertainty quantification: A minimum variance unbiased (joint) estimator of the non-normalized Sobol’ indices. Stat. Pap. 2018, 61, 1939–1970. [Google Scholar] [CrossRef]
Lamboni, M. Weak derivative-based expansion of functions: ANOVA and some inequalities. Math. Comput. Simul. 2022, 194, 691–718. [Google Scholar] [CrossRef]
Lamboni, M. Global sensitivity analysis: An efficient numerical method for approximating the total sensitivity index. Int. J. Uncertain. Quantif. 2016, 6, 1–17. [Google Scholar] [CrossRef]
Owen, A.B. Sobol’ Indices and Shapley Value. Siam/Asa J. Uncertain. Quantif. 2014, 2, 245–251. [Google Scholar] [CrossRef]
Sugiura, N. Multisample and multivariate nonparametric tests based on U-statistics and their asymptotic efficiencies. Osaka J. Math. 1965, 2, 385–426. [Google Scholar]

Figure 1. First-type, prime second-type, and second-type dGSIs for different values of the correlation between the two inputs and for

a = - 2, 0

.

Table 1. Estimates of dSIs for Sobol’s g-function with NIVs.

	$\hat{dSI}$		$\hat{dSI}$
	Main	Total		First-Order	Total
X1	0.089	0.090	X1:X2	0.321	0.372
X2	0.241	0.294	X1:X3	0.321	0.372
X3	0.231	0.282	X1:X9	0.192	0.216
X4	0.093	0.093	X1:X10	0.193	0.217
X5	0.093	0.093	X2:X3	0.298	0.299
X6	0.093	0.093	X2:X9	0.349	0.426
X7	0.093	0.093	X2:X10	0.348	0.425
X8	0.093	0.093	X3:X9	0.334	0.407
X9	0.107	0.131	X3:X10	0.335	0.409
X10	0.108	0.132	X9:X10	0.185	0.185

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Distributions of Outputs Given Subsets of Inputs and Dependent Generalized Sensitivity Indices

Abstract

1. Introduction

General Notation

2. Dependency Functions of Non-Independent Random Variables

2.1. Distribution-Based and Copula-Based Expressions of Dependency Models

2.2. Empirical and Computational Dependency Models

3. Equivalent Representations of Functional Outputs

Discussions About High-Dimensional Cases

4. Dependent Multivariate Sensitivity Analysis

4.1. Properties of Dependent Generalized Sensitivity Indices

4.2. Case of the Multivariate Response Models

5. Estimators of Dependent Generalized Sensitivity Indices

6. Analytical and Numerical Results

6.1. Linear Function Without Explicit Interaction ( $d = 3$ , $n = 1$ )

6.2. Functional Outputs: Dynamic Model ( $d = 2$ , $n = 1$ )

6.3. Sobol’s g-Function ( $d = 10$ , $K = 3$ )

7. Conclusions

Supplementary Materials

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A. Proof of Lemma 1

Appendix B. Proof of Lemma 2

Appendix C. Proof of Lemma 3

Appendix D. Proof of Lemma 4

Appendix E. Proof of Theorem 1

Appendix F. Proof of Proposition 2

Appendix G. Proof of Theorem 2

Appendix H. Proof of Corollary 1

Appendix I. Proof of Theorem 3

Appendix J. Equivalent Representations of the Function Used in Section 6.3

References

Article Metrics

Citations

Article Access Statistics

Distributions of Outputs Given Subsets of Inputs and Dependent Generalized Sensitivity Indices

Abstract

1. Introduction

General Notation

2. Dependency Functions of Non-Independent Random Variables

2.1. Distribution-Based and Copula-Based Expressions of Dependency Models

2.2. Empirical and Computational Dependency Models

3. Equivalent Representations of Functional Outputs

Discussions About High-Dimensional Cases

4. Dependent Multivariate Sensitivity Analysis

4.1. Properties of Dependent Generalized Sensitivity Indices

4.2. Case of the Multivariate Response Models

5. Estimators of Dependent Generalized Sensitivity Indices

6. Analytical and Numerical Results

6.1. Linear Function Without Explicit Interaction ( d = 3 , n = 1 )

6.2. Functional Outputs: Dynamic Model ( d = 2 , n = 1 )

6.3. Sobol’s g-Function ( d = 10 , K = 3 )

7. Conclusions

Supplementary Materials

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A. Proof of Lemma 1

Appendix B. Proof of Lemma 2

Appendix C. Proof of Lemma 3

Appendix D. Proof of Lemma 4

Appendix E. Proof of Theorem 1

Appendix F. Proof of Proposition 2

Appendix G. Proof of Theorem 2

Appendix H. Proof of Corollary 1

Appendix I. Proof of Theorem 3

Appendix J. Equivalent Representations of the Function Used in Section 6.3

References

Article Metrics

Citations

Article Access Statistics

6.1. Linear Function Without Explicit Interaction ( $d = 3$ , $n = 1$ )

6.2. Functional Outputs: Dynamic Model ( $d = 2$ , $n = 1$ )

6.3. Sobol’s g-Function ( $d = 10$ , $K = 3$ )