Bayesian Uncertainty Quantification with Multi-Fidelity Data and Gaussian Processes for Impedance Cardiography of Aortic Dissection

Ranftl, Sascha; Melito, Gian Marco; Badeli, Vahid; Reinbacher-Köstinger, Alice; Ellermann, Katrin; von der Linden, Wolfgang

doi:10.3390/e22010058


by Sascha Ranftl ¹,*, Gian Marco Melito ², Vahid Badeli ³, Alice Reinbacher-Köstinger ³, Katrin Ellermann ² and Wolfgang von der Linden ¹,*

¹ Institute of Theoretical Physics-Computational Physics, Graz University of Technology, 8010 Graz, Austria

² Institute of Mechanics, Graz University of Technology, 8010 Graz, Austria

³ Institute of Fundamentals and Theory in Electrical Engineering, Graz University of Technology, 8010 Graz, Austria

* Authors to whom correspondence should be addressed.

Entropy 2020, 22(1), 58; https://doi.org/10.3390/e22010058

Submission received: 26 November 2019 / Revised: 26 December 2019 / Accepted: 27 December 2019 / Published: 31 December 2019

(This article belongs to the Special Issue MaxEnt 2019—The 39th International Workshop on Bayesian Inference and Maximum Entropy Methods in Science and Engineering)


Abstract:

In 2000, Kennedy and O’Hagan proposed a model for uncertainty quantification that combines data of several levels of sophistication, fidelity, quality, or accuracy, e.g., a coarse and a fine mesh in finite-element simulations. They assumed each level to be describable by a Gaussian process, and used low-fidelity simulations to improve inference on costly high-fidelity simulations. Departing from there, we move away from the common non-Bayesian practice of optimization and marginalize the parameters instead. Thus, we avoid the awkward logical dilemma of having to choose parameters and of neglecting that choice’s uncertainty. We propagate the parameter uncertainties by averaging the predictions and the prediction uncertainties over all the possible parameters. This is done analytically for all but the nonlinear or inseparable kernel function parameters. What is left is a low-dimensional and feasible numerical integral depending on the choice of kernels, thus allowing for a fully Bayesian treatment. By quantifying the uncertainties of the parameters themselves too, we show that “learning” or optimising those parameters has little meaning when data is little and, thus, justify all our mathematical efforts. The recent hype about machine learning has long spilled over to computational engineering but fails to acknowledge that machine learning is a big data problem and that, in computational engineering, we usually face a little data problem. We devise the fully Bayesian uncertainty quantification method in a notation following the tradition of E.T. Jaynes and find that generalization to an arbitrary number of levels of fidelity and parallelisation becomes rather easy. We scrutinize the method with mock data and demonstrate its advantages in its natural application where high-fidelity data is little but low-fidelity data is not. We then apply the method to quantify the uncertainties in finite element simulations of impedance cardiography of aortic dissection. 
Aortic dissection is a cardiovascular disease that frequently requires immediate surgical treatment and, thus, a fast diagnosis beforehand. While traditional medical imaging techniques such as computed tomography, magnetic resonance tomography, or echocardiography certainly do the job, impedance cardiography is a clinical standard tool as well and promises to allow earlier diagnoses and to detect patients who would otherwise go unnoticed for too long.

Keywords: uncertainty quantification; multi fidelity; Gaussian processes; probability theory; Bayes; impedance cardiography; aortic dissection

1. Introduction

While Uncertainty Quantification (UQ) has become a term on its own in the computational engineering community, Bayesian Probability Theory is not widely spread yet. A comprehensive collection of reviews on the various methods and aspects of UQ from the point of view of the computational engineering and applied mathematics community can be found in Reference [1]. In References [2,3,4], a statistician’s perspective is discussed. The computational effort for performing UQ with brute force is typically prohibitively large; thus, surrogate models such as Polynomial Chaos Expansion (PCE) [5,6,7,8] or Gaussian Process (GP) regression [9,10,11,12] are used, the latter of which has had its renaissance recently from within the machine learning community.

This work is inspired by the article of Kennedy and O’Hagan in 2000 [13]. They performed UQ by making use of a computer simulation with different levels of “fidelity”, “sophistication”, “accuracy”, or “quality”. In other words, a cheap, simplified simulation serves as a surrogate. We will refer to this approach as the Multi-Fidelity scheme (MuFi). The idea of MuFi [14], and MuFi with GPs specifically [15], has recently attracted increasing attention again. In contrast to previously reported MuFi-GPs, we do not learn the parameters and subsequently neglect the parameter uncertainties but explicitly incorporate them in a rigorous manner. We find that this is analytically tractable for all parameters except those that occur nonlinearly or inseparably in the GP's covariance.

While UQ in general has arrived fully in the biomedical engineering community [16], the Bayesian approach has not. Biehler et al. [17] were, to the best knowledge of the authors, the first to apply a Bayesian MuFi Scheme in the context of computational biomechanical UQ. We apply our method to quantify the uncertainties in finite element simulations [18] of Impedance Cardiography (ICG) [19] of Aortic Dissection (AD) [20]. The aorta is the largest blood vessel in the human body. In aortic dissection, blood fluid dynamics force open a tear in a weakened aortic wall, dilate it, and fill the wall itself with blood. This deforms the geometry of the aorta and, obviously, affects blood circulation unfavourably (p. 459, [21]). Aortic Dissection is highly dangerous and likely lethal if untreated. Thus, a fast response and, hence, a fast diagnosis are key to the treatment of patients. For diagnosis, physicians use a variety of imaging techniques such as Magnetic Resonance Tomography (MRT), Computed Tomography (CT), and Echocardiography [22]. Echocardiography performed by a trained cardiologist is comparably cheap and fast, yet sound wave propagation might be hindered, e.g., by the rib cage or body fat. In CT and MRT, the radiation fully penetrates the body. Still, they require a trained radiologist and long measurement times and pose radiation risks and high costs. Most importantly, these examinations are not performed without a specific reason.

Alternatively, impedance cardiography is rather cheap and simple and, more importantly, available in any clinic and many medical practices. In ICG, one places a pair of electrodes on the thorax (upper body), injects a defined low-amplitude alternating electric current into the body, and measures the voltage drop. The specific resistance of blood is much lower than that of muscle, fat, or bone [23]. Since electric current seeks the path of least resistance, the current propagates through the aorta rather than through, e.g., the spine. Thus, if the local blood volume changes due to aortic dissection, the impedance signal changes as well. Impedance cardiography could therefore complement existing clinical procedures and could detect aortic dissection when medical imaging is not performed, be it due to the absence of suspicion or to the unavailability of the device itself. We find a number of parameters which are well defined but usually neither known precisely nor accessible in the clinical setting, e.g., the size of the aortic dissection. A clinical trial is extremely difficult, and we resort to a theoretical investigation instead, in which we account for the uncertainties as well.

In Section 2, we develop a Bayesian uncertainty quantification model based on Gaussian processes using multi-fidelity data. We scrutinize the method with mock data in Section 3 and show that learning regression parameters has little meaning when data is little. In Section 4, we apply our method to finite element simulations of impedance cardiography of aortic dissection and show that low-fidelity data can indeed decrease high-fidelity uncertainties.

2. Bayesian Multi-Fidelity Scheme

2.1. Statistical Model

Let $C$ be the conditional complex. Let $t = 1, \ldots, N_t$ denote the ranked levels of fidelity of a simulation, with level $N_t$ being the highest fidelity. $z_t(x_t)$ is a vector of simulation results of fidelity level $t$ given the input vector $x_t$. We assume that $z_t(x_t)$ is a realisation of a Gaussian Process (GP) $z_t(x)$ with a Markov property of order 1, meaning that level $t$ depends on level $t - 1$ only, via the following recursive relationship:

\begin{align}
z_1(x) &= \delta_1(x) \tag{1a} \\
z_t(x) &= \rho_{t-1} z_{t-1}(x) + \delta_t(x) \quad \forall t \geq 2 \tag{1b} \\
\Rightarrow z_{N_t}(x) &= \sum_{t=1}^{N_t} \delta_t(x) \prod_{l=t}^{N_t-1} \rho_l \tag{1c}
\end{align}

with a “difference-GP” $\delta_t(x)$ and a proportionality constant $\rho_{t-1}$. Further, we assume that all information about a level is contained in the data corresponding to the same pivot point at that level and its previous level. Formally, that is $\mathrm{Cov}(z_t(x), z_{t-1}(x') \mid z_{t-1}(x)) = 0$. The difference-GP $\delta_t(x)$ shall be defined by the covariance matrix $\sigma_t^2 K_t(x, x')$ and the mean function $h_t(x) \beta_t$. $h_t(x)$ is a matrix of regression functions $h_t^{(k)}$ evaluated at $x = (x^{(1)}, \ldots, x^{(j)}, \ldots, x^{(N_x)})^T$ with size $N_x \times N_{\beta_t}$, where $N_x$ is the length of the input vector $x$ and $N_{\beta_t}$ is the expansion power, i.e., the number of regression functions, at level $t$. $\beta_t = (\beta_t^{(1)}, \ldots, \beta_t^{(N_{\beta_t})})^T$ are the coefficients of level $t$’s regression functions, and $\alpha_t$ is the set of hyperparameters parametrizing the kernel function $k_t$. Formally, this is

\begin{align}
p[\delta_t(x) \mid C] &= \mathcal{N}\big(h_t(x) \beta_t,\ \sigma_t^2 K_t(x, x)\big) \tag{2a} \\
[h_t(x)]_{jk} &= h_t^{(k)}(x^{(j)}) \tag{2b} \\
[h_t(x) \beta_t]_j &= \sum_{k=1}^{N_{\beta_t}} h_t^{(k)}(x^{(j)})\, \beta_t^{(k)} \tag{2c} \\
[K_t(x, x)]_{ij} &= k_t(x^{(i)}, x^{(j)}; \alpha_t) \tag{2d}
\end{align}
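The recursion of Equation (1) can be illustrated with a minimal Python sketch. The function names and the list-based data layout are assumptions made for illustration only and are not taken from the authors' Matlab code. The sketch evaluates the highest-fidelity output once by stepping through Equations (1a) and (1b) and once by the telescoped sum of Equation (1c), so that the two can be compared.

```python
def highest_fidelity(deltas, rhos):
    """Step through Eqs. (1a)/(1b).

    deltas: list of N_t lists, delta_t evaluated at the same inputs (hypothetical layout).
    rhos:   list of N_t - 1 scalars [rho_1, ..., rho_{N_t - 1}].
    """
    z = list(deltas[0])                                   # Eq. (1a): z_1 = delta_1
    for rho, delta in zip(rhos, deltas[1:]):
        z = [rho * zi + di for zi, di in zip(z, delta)]   # Eq. (1b)
    return z


def highest_fidelity_closed_form(deltas, rhos):
    """Same quantity via the telescoped sum of Eq. (1c)."""
    z = [0.0] * len(deltas[0])
    for t, delta in enumerate(deltas):                    # t is 0-based here
        weight = 1.0
        for rho in rhos[t:]:                              # product of rho_l, l = t+1 .. N_t - 1 (1-based)
            weight *= rho
        z = [zi + weight * di for zi, di in zip(z, delta)]
    return z
```

Both functions should agree for any number of levels; the empty product for the highest level evaluates to one, as in Equation (1c).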

At this point, we have chosen neither the basis functions $h_t^{(k)}$ building the mean function nor the kernel functions $k_t$ building the covariance matrices. Let us subsume the parameters as $\theta = \{\theta_t\}$, $\theta_t = \{\beta_t, \sigma_t, \rho_{t-1}, \alpha_t\}$, $\beta = \{\beta_t\}$, $\beta_t = \{\beta_t^{(k)}\}$, $\sigma = \{\sigma_t\}$, $\rho = \{\rho_t\}$, and $\alpha = \{\alpha_t\}$ with $t = 1, 2, \ldots, N_t$. The data shall be $D = \{D_t\}$ with $D_t = \{(x_t, z_t(x_t), z_{t-1}(x_t))\}$, which comprises the input vector at level $t$, namely $x_t$, and its corresponding computer code outputs at level $t$, namely $z_t(x_t)$, and at the previous level $t - 1$, namely $z_{t-1}(x_t)$. Further, we require a nested design of input vectors, i.e., $x_{N_t} \subseteq x_{N_t - 1} \subseteq \ldots \subseteq x_t \subseteq x_{t-1} \subseteq \ldots \subseteq x_1$. We want to draw conclusions from the predictive posterior probability of $z_{N_t}(x)$ at a set of points $x$,

\begin{equation*}
p[z_{N_t}(x) \mid D] = \int \prod_{t=1}^{N_t} d\delta_t(x) \int d\theta\ p\big[z_{N_t}(x) \mid \{\delta_t(x)\}_{t=1}^{N_t}, \theta, D\big]\ p\big[\{\delta_t(x)\}_{t=1}^{N_t} \mid \theta, D\big]\ p[\theta \mid D]
\end{equation*}

This expression is rather unwieldy. We will therefore deal with its moments only, namely the posterior mean $\langle z_{N_t}(x) \rangle$ and the posterior covariance $\mathrm{Cov}(z_{N_t}(x)) = \langle z_{N_t}(x)\, z_{N_t}(x)^T \rangle - \langle z_{N_t}(x) \rangle \langle z_{N_t}(x) \rangle^T$, where the diagonal of the posterior covariance gives the uncertainty band of the prediction. The moments of $z_{N_t}(x)$ follow from

\begin{equation}
\begin{aligned}
\langle f(z_{N_t}(x)) \rangle &= \int dz_{N_t}(x)\ f(z_{N_t}(x))\ p[z_{N_t}(x) \mid D] \\
&= \int \prod_{t=1}^{N_t} d\delta_t(x) \int \prod_{t=1}^{N_t} d\theta_t\ f\Big(\sum_{t=1}^{N_t} \delta_t(x) \prod_{l=t}^{N_t-1} \rho_l\Big) \prod_{t=1}^{N_t} p[\delta_t(x) \mid \theta_t, D_t]\ p[\theta_t \mid D_t]
\end{aligned}
\tag{3}
\end{equation}

We have used the fact that $p[z_{N_t}(x) \mid \{\delta_t(x)\}_{t=1}^{N_t}, \theta, D]$ reduces to Dirac’s delta-distribution since $z_{N_t}(x)$ is uniquely determined by Equation (1) and the knowledge of all difference-GPs, $\{\delta_t(x)\}_{t=1}^{N_t}$. Thus, integration with respect to $z_{N_t}(x)$ is merely a replacement of $z_{N_t}(x) \to \sum_{t=1}^{N_t} \delta_t(x) \prod_{l=t}^{N_t-1} \rho_l$. Per construction, we can factor $p[\{\delta_t(x)\}_{t=1}^{N_t} \mid \theta, D]\ p[\theta \mid D] = \prod_t p[\delta_t(x) \mid \theta_t, D_t]\ p[\theta_t \mid D_t]$. Since $\delta_t(x)$ is assumed to obey a GP, the prior probability of $\delta_t(x_t)$ is multivariate normal. If the likelihood $p[D_t \mid \delta_t(x), \theta_t]$ is Gaussian, then the posterior probability of $\delta_t(x_t)$, $p[\delta_t(x_t) \mid \theta_t, D_t]$, is multivariate normal as well. Integration with respect to $\delta_t(x)$ thus yields the standard result of the posterior mean value (see, e.g., Reference [10]) and results in a replacement of the GP with its posterior mean.

2.2. Prediction and Its Uncertainty

To compute the posterior mean and covariance, we are thus left with integration with respect to the hyperparameters. This is done analytically for all parameters ($\beta$, $\sigma$, $\rho$) except the parameters of the kernel function, $\alpha$, since those usually occur nonlinearly in the kernel function. We are then left with numerical integration of expectations and covariances, both conditioned on $\alpha$. We assume flat priors for $\beta$ and $\rho$ and Jeffreys’ prior for $\sigma$, i.e., $p[\sigma_t \mid C] = \frac{1}{\sigma_t}$. The prior of $\alpha$ needs to be chosen only after the covariance kernel has been chosen. Let $\langle \cdot \mid \alpha_t \rangle$ denote the expectation value conditioned on $\alpha$. Here, for ease of notation, we will write $\langle \cdot \rangle$ only. The technicalities are detailed in Appendix A, and the result is as follows:

\begin{equation}
\begin{aligned}
\langle z_{N_t}(x) \rangle &= \int \sum_{t=1}^{N_t} \langle \delta_t(x) \rangle \prod_{l=t}^{N_t-1} \langle \rho_l \rangle\ p[\alpha \mid D]\ d\alpha \\
\langle z_{N_t}(x)\, (z_{N_t}(x))^T \rangle &= \int \sum_{t=1}^{N_t} \big( \langle \sigma_t^2 \rangle\, \Sigma_t(x, x) + \langle \delta_t(x) \rangle \langle \delta_t(x) \rangle^T \big) \prod_{l=t}^{N_t-1} \langle \rho_l^2 \rangle\ p[\alpha \mid D]\ d\alpha \\
\langle \delta_t(x) \rangle &= h_t(x) \langle \beta_t \rangle + K_t(x, x_t)\, K_t(x_t, x_t)^{-1} \big( \delta_t(x_t) - h_t(x_t) \langle \beta_t \rangle \big) \\
\Sigma_t &= K_t(x, x) - K_t(x, x_t)\, K_t(x_t, x_t)^{-1} K_t(x_t, x) \\
p[\alpha \mid D] &= \frac{2^{-N_t}}{\sqrt{\Phi_1}} \frac{\Gamma(\gamma_1)}{\Gamma(\gamma_1 - \frac{1}{2})} \prod_{t=1}^{N_t} a_t^{-\frac{1}{2}}\, (\pi \Phi_t)^{-\gamma_t + \frac{1}{2}}\, \Gamma(\gamma_t - \tfrac{1}{2}) \left[ \frac{|K_t(x_t, x_t)|}{|A_t|} \right]^{-\frac{1}{2}} p[\alpha \mid C]
\end{aligned}
\tag{4a}
\end{equation}

where $\delta_t(x_t)$ is determined from the data and Equation (2), $\Gamma(\cdot)$ is the complete Gamma-function, $|\cdot|$ is the matrix determinant, and the conditional expectations of the hyperparameters are

\begin{equation}
\begin{aligned}
\langle \beta_t \rangle &= A_t B_t \big( z_t(x_t) - \langle \rho_{t-1} \rangle z_{t-1}(x_t) \big) & \langle \rho_{t-1} \rangle &= \frac{b_t}{a_t} \\
\langle \sigma_t^2 \rangle &= \frac{\Phi_t}{2 \gamma_t - 3 + 3 \delta_{t1}} & \langle \rho_{t-1}^2 \rangle &= \frac{\langle \sigma_t^2 \rangle}{a_t} + \left( \frac{b_t}{a_t} \right)^2
\end{aligned}
\tag{4b}
\end{equation}

where $\delta_{t1} = 1$ for $t = 1$ and 0 otherwise and the abbreviations are

\begin{equation}
\begin{aligned}
\gamma_t &= \frac{N_{x_t} - N_{\beta_t}}{2} & \Phi_t &= c_t - \frac{b_t^2}{a_t} \\
C_t &= K_t(x_t, x_t)^{-1} - B_t^T A_t B_t & c_t &= (z_t(x_t))^T C_t\, z_t(x_t) \\
B_t &= h_t(x_t)^T K_t(x_t, x_t)^{-1} & b_t &= (z_t(x_t))^T C_t\, z_{t-1}(x_t) \\
A_t &= \big( (h_t(x_t))^T K_t(x_t, x_t)^{-1} h_t(x_t) \big)^{-1} & a_t &= (z_{t-1}(x_t))^T C_t\, z_{t-1}(x_t)
\end{aligned}
\tag{4c}
\end{equation}

$N_{x_t}$ is the number of pivot points in input vector $x_t$, and $N_{\beta_t}$ is the expansion order of level $t$’s mean function. For $t = 1$, we need to define $\rho_0 = 0$, $a_1 = 1$, and $b_1 = 0$. Quite importantly, we find the following requirement:

\begin{equation}
\begin{aligned}
N_{\beta_1} &< N_{x_1} - 2 \\
N_{\beta_t} &< N_{x_t} - 3 \quad \forall t \geq 2
\end{aligned}
\tag{4d}
\end{equation}

since otherwise the second moments of $\sigma_t$ are not defined. The numerical evaluation of this result merely involves a couple of matrix operations. The only inputs are the data, the regression functions, and the covariance matrices. No parameters need to be tuned.
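To illustrate that the evaluation indeed reduces to a few matrix operations, the following is a minimal sketch of Equations (4b) and (4c) for one level $t \geq 2$ at a fixed value of $\alpha_t$. It is written in plain Python with a small Gauss-Jordan inverse standing in for a proper linear algebra library; all function names are hypothetical and the sketch is not the authors' Matlab implementation. The level $t = 1$ case, with its modified definitions, is deliberately left out.

```python
def transpose(m):
    return [list(row) for row in zip(*m)]

def matmul(a, b):
    bt = transpose(b)
    return [[sum(x * y for x, y in zip(row, col)) for col in bt] for row in a]

def inverse(m):
    """Gauss-Jordan inverse with partial pivoting (adequate for small, well-conditioned matrices)."""
    n = len(m)
    aug = [list(map(float, row)) + [float(i == j) for j in range(n)] for i, row in enumerate(m)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [v / p for v in aug[col]]
        for r in range(n):
            if r != col:
                f = aug[r][col]
                aug[r] = [v - f * w for v, w in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]

def quad(u, m, v):
    """u^T m v for vectors u and v given as flat lists."""
    return sum(ui * sum(mij * vj for mij, vj in zip(row, v)) for ui, row in zip(u, m))

def level_expectations(h, k, z_t, z_prev):
    """Conditional expectations of Eq. (4b) for a level t >= 2, at fixed alpha_t.

    h: N_x x N_beta design matrix h_t(x_t); k: N_x x N_x kernel matrix K_t(x_t, x_t);
    z_t, z_prev: outputs z_t(x_t) and z_{t-1}(x_t) as flat lists.
    """
    n_x, n_beta = len(h), len(h[0])
    if not n_beta < n_x - 3:
        raise ValueError("Eq. (4d) violated: need N_beta < N_x - 3 for t >= 2")
    k_inv = inverse(k)
    b_mat = matmul(transpose(h), k_inv)                    # B_t
    a_mat = inverse(matmul(b_mat, h))                      # A_t
    bab = matmul(transpose(b_mat), matmul(a_mat, b_mat))   # B_t^T A_t B_t
    c_mat = [[ki - bi for ki, bi in zip(kr, br)] for kr, br in zip(k_inv, bab)]  # C_t
    a = quad(z_prev, c_mat, z_prev)
    b = quad(z_t, c_mat, z_prev)
    c = quad(z_t, c_mat, z_t)
    phi = c - b * b / a
    gamma = (n_x - n_beta) / 2
    rho = b / a                                            # <rho_{t-1}>
    sigma2 = phi / (2 * gamma - 3)                         # <sigma_t^2> for t >= 2
    rho2 = sigma2 / a + rho ** 2                           # <rho_{t-1}^2>
    resid = [[zt - rho * zp] for zt, zp in zip(z_t, z_prev)]
    beta = [row[0] for row in matmul(a_mat, matmul(b_mat, resid))]  # <beta_t>
    return {"rho": rho, "rho2": rho2, "sigma2": sigma2, "beta": beta, "phi": phi}
```

A useful sanity check is noise-free data of the form $z_t = \rho\, z_{t-1} + h_t \beta_t$: since $C_t h_t = 0$, the sketch should then recover $\rho$ and $\beta_t$ and return $\Phi_t \approx 0$.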

3. Algorithm and Mock Data Scrutiny

We test the method for the special case of two levels. Then, the posterior mean and covariance are simply

\begin{equation*}
\begin{aligned}
\langle z_2(x) \rangle &= \int \langle \delta_1(x) \rangle\ p[\alpha_1 \mid D_1]\ d\alpha_1 + \int \langle \rho_1 \rangle \langle \delta_2(x) \rangle\ p[\alpha_2 \mid D_2]\ d\alpha_2 \\
\langle z_2(x)\, (z_2(x))^T \rangle &= \int \big( \langle \sigma_1^2 \rangle\, \Sigma_1(x, x) + \langle \delta_1(x) \rangle \langle \delta_1(x) \rangle^T \big)\ p[\alpha_1 \mid D_1]\ d\alpha_1 \\
&\quad + \int \langle \rho_1^2 \rangle \big( \langle \sigma_2^2 \rangle\, \Sigma_2(x, x) + \langle \delta_2(x) \rangle \langle \delta_2(x) \rangle^T \big)\ p[\alpha_2 \mid D_2]\ d\alpha_2
\end{aligned}
\end{equation*}

We generate mock data according to Equations (1) and (2) and compare the data analysis results to the underlying truth. We have chosen as mean function bases the Legendre polynomials up to orders 10 and 4 for Levels 1 and 2, respectively. This is convenient since this basis is already both orthogonal and normalized on $[-1, 1]$ and the map onto the desired domain is trivial. The covariance kernel was chosen to be the squared exponential kernel, where $\alpha_1$ and $\alpha_2$ were defined as the inverse of the correlation length squared. This choice was inspired by the typical form of signals encountered in impedance cardiography, about which we will talk more in Section 4. The data set was one sample drawn per level; see Figure 2. We chose Jeffreys’ prior for $\alpha_t$. The integration bounds can be read from Figure 1, and an integration grid of 100 × 100 equally sized volumes turned out to be well converged. As an intermediate result, we compare the multi-fidelity estimates to the true parameter values in Table 1. The posterior probabilities of $\beta_t$, $\rho_t$, $\sigma_t$ are Gaussian. Since those of $\alpha_1$ and $\alpha_2$ are not and thus cannot be reasonably well described with mean and variance only, we additionally show their posterior probabilities in Figure 1. The predictions and prediction uncertainties are compared to the true mean in Figure 2. We find that both the parameter estimates and the predictions statistically match the truth within their uncertainties. The mean function parameter uncertainties ($\beta$) clearly illustrate that learning parameters by optimization has little meaning if there is little data. As the data set grows big, the posterior will contract to the maximum likelihood solution. Still, and luckily, the prediction uncertainties are kept low because the mean function parameter uncertainties do not appear in the prediction uncertainties directly. We emphasize that our proposed method is naturally suited to little-data problems.
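The two modelling choices of this section, a Legendre basis for the mean function and a squared exponential kernel, can be sketched as follows. This is an illustration only: the function names are hypothetical, the standard Legendre polynomials $P_n$ (with $P_n(1) = 1$) are assumed, and the kernel is assumed to have the form $\exp(-\alpha (x - x')^2)$, whereas the exact convention (e.g., a factor of one half in the exponent) is not stated in the text.

```python
import math

def legendre_design_matrix(xs, n_beta):
    """Rows: inputs x^(j) in [-1, 1]; columns: P_0 .. P_{n_beta - 1}, cf. Eq. (2b)."""
    rows = []
    for x in xs:
        p = [1.0, x][:n_beta]
        for n in range(1, n_beta - 1):
            # Bonnet recurrence: (n + 1) P_{n+1} = (2n + 1) x P_n - n P_{n-1}
            p.append(((2 * n + 1) * x * p[n] - n * p[n - 1]) / (n + 1))
        rows.append(p)
    return rows

def squared_exponential_kernel(xs, ys, alpha):
    """K_ij = exp(-alpha (x_i - y_j)^2), with alpha = 1 / (correlation length)^2, cf. Eq. (2d)."""
    return [[math.exp(-alpha * (x - y) ** 2) for y in ys] for x in xs]
```

Inputs on another interval would first be mapped linearly onto $[-1, 1]$ before the design matrix is built.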

Since on level 2 we only have 11 data points, the expansion order of the mean function is limited to a maximum of 7 according to Equation (4d). Unsurprisingly, the solution rapidly worsens as we approach this constraint and entirely breaks down as we reach it because the posteriors become inconclusive. This is exactly where we find the strength of our multi-fidelity approach. We can choose a high-order mean function on a level where data is abundant and a low-order mean function on a level which we are actually interested in but where data is scarce. The trick is thus that the difference between the levels can be modeled by a low-order mean function.
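The bound quoted above follows directly from Equation (4d). A small helper, with a hypothetical name and written here only to make the arithmetic explicit, could look like this:

```python
def max_basis_functions(n_points, level):
    """Largest N_beta allowed by Eq. (4d).

    Level 1 requires N_beta < N_x - 2; every level t >= 2 requires N_beta < N_x - 3.
    """
    bound = n_points - (2 if level == 1 else 3)
    return max(bound - 1, 0)
```

For 11 pivot points on level 2, the strict inequality $N_{\beta_2} < 11 - 3 = 8$ leaves at most 7, in agreement with the text.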

For the sake of completeness, we report the converged log-evidence to be $430 \pm 10$. In real-life applications of the method, one could and should compare different choices of mean function expansions and covariance kernel functions by Bayesian model comparison [24], i.e., compute each choice’s evidence. Let $N_{\alpha_t}$ be the number of hyperparameters in kernel function $k_t$. Since $p[\alpha \mid D]$ factorizes, the integrals are $N_{\alpha_t}$-dimensional each rather than one single integral of dimension $\sum_t N_{\alpha_t}$, making the computation of the evidence relatively easy. In our case, numerical Riemann integration was good enough. When choosing more sophisticated kernel functions with more hyperparameters, one might need to use statistical integration methods such as nested sampling [25], which conveniently and automatically yields the evidence as well.
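For a kernel with a single hyperparameter per level, the numerical integration over $\alpha_t$ can be sketched as a plain Riemann sum on an equally spaced grid. The sketch below is an assumption-laden illustration (hypothetical function name, one-dimensional grid, unnormalised log-posterior supplied by the caller); it works with logarithms and shifts by the maximum before exponentiating, which is advisable when the log-evidence is as large as the value quoted above.

```python
import math

def riemann_posterior_average(alphas, log_post, values):
    """Average `values` over alpha with an unnormalised log-posterior on an equally spaced grid.

    Returns (log_evidence, posterior_average), where the evidence is the Riemann sum
    of exp(log_post) over the grid.
    """
    step = alphas[1] - alphas[0]
    shift = max(log_post)                          # log-sum-exp shift against under/overflow
    weights = [math.exp(lp - shift) for lp in log_post]
    total = sum(weights)
    log_evidence = shift + math.log(total * step)
    average = sum(w * v for w, v in zip(weights, values)) / total
    return log_evidence, average
```

Here `values` would hold a conditional expectation such as $\langle \delta_t(x) \rangle$ at one prediction point, evaluated for each grid value of $\alpha_t$.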

For the sake of numerical stability, it is advisable to rescale the data. Further, one might want to improve the condition numbers of the prior covariance matrices, $\sigma_t^2 K_t(x, x)$, by adding a small term proportional to the identity matrix, where the proportionality constant should be several orders of magnitude smaller than $\sigma_t^2$.
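The regularisation described above amounts to a one-line operation. The following sketch uses a hypothetical function name and a default relative size of $10^{-8}$, which is an illustrative choice rather than a value from the text; the jitter is scaled by the mean diagonal entry of the covariance matrix, which equals $\sigma_t^2$ for a kernel with unit diagonal such as the squared exponential.

```python
def add_jitter(cov, relative_jitter=1e-8):
    """Return cov + eps * I with eps = relative_jitter * mean(diag(cov)); the input is not modified."""
    n = len(cov)
    eps = relative_jitter * sum(cov[i][i] for i in range(n)) / n
    return [[v + (eps if i == j else 0.0) for j, v in enumerate(row)]
            for i, row in enumerate(cov)]
```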

Finally, we would like to point out that Equation (4) suggests trivial parallelisation of the code levels. This is not easily recognizable in the presentation of Kennedy and O’Hagan [3] but was found by Le Gratiet and Garnier [26] already.

The algorithm, implemented in Matlab (R2019a), is available at https://github.com/Sranf/Bayesian-MuFi-GP.git.

4. Application to Finite Element Simulations of Impedance Cardiography of Aortic Dissection

In this section, we quantify the uncertainties of real simulation data. We compare the uncertainties of our Bayesian MuFi-GP with those of an ordinary Bayesian GP that neglects the additional LoFi data, that is, the special case of $N_t = 1$ in Equation (4). We have described the physical and physiological model in our previous work [20,27] but restate a brief summary here for the reader’s convenience.

We solve Laplace’s equation:

\begin{equation*}
\nabla \cdot \big( (\sigma + i \omega \varepsilon) \nabla V \big) = 0,
\end{equation*}

with finite elements on a geometry depicted in Figure 3, where $V$ is the electric potential, $\sigma$ is the electrical conductivity (not to be confused with the regression parameter in the GP kernel), $\omega$ is the angular current frequency, $\varepsilon$ is the permittivity, and $i$ is the imaginary unit. We had von-Neumann boundary conditions, where $V = \mathrm{const.}$ on the top electrode and $V = 0$ on the bottom electrode, where the surface integral of the current was held constant at 4 mA and air was assumed to be perfectly insulating. The current had a frequency of 100 kHz. We considered one cardiac cycle that spans one second. The dynamics were modelled via a time-dependent radius of the aorta and its dissection, which arises from pressure waves in a pulsatile flow. Further, the blood conductivity is parametrized in time via its dependence on flow velocity. In the dissected aorta, we assume flow to be stagnant [20,28]. The voltage drop is then measured from just below the top electrode to just above the bottom electrode. The impedance is then the ratio of voltage over current, and the admittance is the inverse of the impedance. We used Comsol Multiphysics for the modelling [29].
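The last post-processing step, from voltage drop to impedance and admittance, is simple enough to state as code. The sketch below uses a hypothetical function name and takes the 4 mA injected current from the text as its default; inputs may be complex phasors, since the potential in the equation above is complex-valued.

```python
def impedance_and_admittance(voltage_drop, current=4e-3):
    """Return (Z, Y) with Z = V / I in ohms and Y = 1 / Z in siemens.

    voltage_drop is in volts and current in amperes; both may be complex phasors.
    """
    z = voltage_drop / current
    return z, 1 / z
```

Applied to each of the pivot points in time, this turns a simulated voltage time series into the impedance signal that enters the uncertainty quantification.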

For the uncertainty quantification, we chose to expand the mean functions in Legendre polynomials up to order 8 and 2 for HiFi and LoFi, respectively. We chose the squared exponential kernel for both covariance matrices. In principle, one should compute the evidence for a number of plausible choices and choose the one with the largest evidence. For the mean function, that would most simply be different expansion orders, while the covariance kernel could be tailored to the PDE at hand to enforce physical behaviour, as suggested in References [30,31].

For uncertain parameters, we consider the radius of the dissected aorta and perform a number of simulations with sensible values within the physiological and physical range, i.e., 1.0–24.0 mm. The LoFi data set consisted of 24 time series, each with 21 pivot points in time. The HiFi data set consisted of 3 time series (5 mm, 11 mm, and 18 mm), each with 11 pivot points in time.

In Figure 4, we show the posterior of the kernel parameters, which turns out to be quite conclusive. In Figure 5, we plot the HiFi predictions and uncertainties which are enhanced by LoFi data and compare them to HiFi predictions and uncertainties which are not enhanced by LoFi data as well as to the test data set.

5. Conclusions

We devised a fully Bayesian multi-level Gaussian process model to improve uncertainty quantification of expensive and scarce high-fidelity simulation data by augmenting the data set with low-fidelity simulations. Our proposed method is rigorous and logically consistent, no ad hoc assumptions have been made, and the user is spared the embarrassment of having to tune any parameters. The method was scrutinized with mock data and shown to work with so little data that simple Bayesian Gaussian process regression is not conclusive at all. We applied the method to finite element simulations of impedance cardiography of aortic dissection and quantified the uncertainty due to the unknown size of the aortic dissection. By using meshes of both high fidelity (defined by mesh convergence) and low fidelity, we reduced the uncertainty significantly. We have thus further shown that uncertainties due to geometrical parameters can be described with Gaussian processes on each level of fidelity. With a coarsened mesh, the result is qualitatively but not quantitatively similar. Usually, that is not good enough and the low-fidelity data is entirely useless to the engineer. Here we show that this is not necessarily true in the context of uncertainty quantification. Ultimately, we want to diagnose aortic dissection from impedance cardiography signals, i.e., in the parlance of probability theory, we need to compare the evidences of healthy and diseased aortae. Unambiguous judgement will most likely, if at all, be possible only with several electrodes at once.

Author Contributions

Conceptualization, S.R.; methodology, S.R. and W.v.d.L.; software, S.R., W.v.d.L., G.M.M., V.B., and A.R.-K.; formal analysis, S.R.; investigation, S.R.; resources, S.R. and G.M.M; data curation, S.R., G.M.M., and V.B.; writing—original draft preparation, S.R.; writing—review and editing, W.v.d.L.; visualization, S.R.; supervision, W.v.d.L., K.E., and A.R.-K; project administration, S.R.; funding acquisition, W.v.d.L. and K.E. All authors contributed to the discussions. All authors have read and agreed to the published version of the manuscript.

Funding

This work was funded by the Lead project “Mechanics, Modeling, and Simulation of Aortic Dissection” of the TU Graz (biomechaorta.tugraz.at).

Acknowledgments

The authors would like to acknowledge the use of HPC resources provided by the ZID of Graz University of Technology.

Conflicts of Interest

The authors declare no conflict of interest.

Sample Availability

The two-level code implemented in Matlab is available on https://github.com/Sranf/Bayesian-MuFi-GP.git. The ICG data is available from the authors upon reasonable request.

Appendix A. Mathematical Proofs

We start from Equation (3) and want to compute Equation (4) from it. We reintroduce the notation of conditional expectations, where $\langle Q \mid P \rangle$ is the expectation of $Q$ given some $P$, i.e., the integration is done with respect to all parameters but $P$.

\begin{equation*}
\begin{aligned}
p[\delta_t(x_t) \mid x_t, \theta_t] &= \mathcal{N}\big[\delta_t(x_t) \mid \bar{\delta}_t(x_t),\ K_t(x_t, x_t)\big] \\
\bar{\delta}_t(x) &= h_t(x) \beta_t + K_t(x, x_t)\, K_t(x_t, x_t)^{-1} \big( \delta_t(x_t) - h_t(x_t) \beta_t \big) \\
\Rightarrow p[\theta \mid D] &= \prod_t p[\theta_t \mid D_t] \\
p[D_t \mid \theta_t] &= \mathcal{N}\big[\delta_t(x_t) \mid \bar{\delta}_t(x_t),\ K_t(x_t, x_t)\big]
\end{aligned}
\end{equation*}

Thus, in the moments of $z_{N_t}(x)$, by integration with respect to $\delta_t(x)$, we can replace $\delta_t(x)$ by its posterior mean:

\begin{equation*}
\begin{aligned}
\langle z_{N_t}(x) \rangle &= \int \Big( \sum_{t=1}^{N_t} \bar{\delta}_t(x) \prod_{l=t}^{N_t-1} \rho_l \Big)\ p[\theta \mid D]\ d\theta \\
\langle z_{N_t}(x)\, z_{N_t}(x)^T \rangle &= \int \Big( \sum_{t=1}^{N_t} \bar{\delta}_t(x) \prod_{l=t}^{N_t-1} \rho_l \Big) \Big( \sum_{t=1}^{N_t} \bar{\delta}_t(x) \prod_{l=t}^{N_t-1} \rho_l \Big)^T p[\theta \mid D]\ d\theta
\end{aligned}
\end{equation*}

Appendix A.1. Parameter Posterior and Parameter Estimates

We need the posterior of the parameters:

\begin{equation}
\begin{aligned}
\mathcal{N}\big[\delta_t(x_t) \mid \bar{\delta}_t(x_t),\ K_t(x_t, x_t)\big] &= \frac{1}{Z_t} e^{-\frac{\psi_t}{2 \sigma_t^2}} \\
\psi_t &= \big( \delta_t(x_t) - h_t(x_t) \beta_t \big)^T K_t(x_t, x_t)^{-1} \big( \delta_t(x_t) - h_t(x_t) \beta_t \big) \\
Z_t &= (2 \pi \sigma_t^2)^{N_{x_t}/2}\, |K_t(x_t, x_t)|^{1/2}
\end{aligned}
\tag{A1a}
\end{equation}

The exponent of this Gaussian is a quadratic form in $\beta_t$, and we rewrite it as

\begin{equation*}
\begin{aligned}
-\frac{\psi_t}{2 \sigma_t^2} &= -\frac{1}{2 \sigma_t^2} \big( \beta_t - \langle \beta_t \mid \alpha_t, \rho_{t-1} \rangle \big)^T A_t^{-1} \big( \beta_t - \langle \beta_t \mid \alpha_t, \rho_{t-1} \rangle \big) - \frac{1}{2 \sigma_t^2}\, \delta_t(x_t)^T C_t\, \delta_t(x_t) \\
\langle \beta_t \mid \alpha_t, \rho_{t-1} \rangle &= A_t B_t \big( z_t(x_t) - \rho_{t-1} z_{t-1}(x_t) \big) \\
C_t &= K_t(x_t, x_t)^{-1} - B_t^T A_t B_t \\
B_t &= (h_t(x_t))^T K_t(x_t, x_t)^{-1} \\
A_t &= \big( (h_t(x_t))^T K_t(x_t, x_t)^{-1} h_t(x_t) \big)^{-1}
\end{aligned}
\end{equation*}

From this form, we can read the (conditional) expectation and covariance of $\beta_t$. We can now integrate with respect to $\beta_t$. We assume a flat prior for $\beta_t$:

\begin{equation*}
\begin{aligned}
p[\alpha_t, \sigma_t, \rho_{t-1} \mid D_t] &= \int p[\alpha_t, \sigma_t, \rho_{t-1}, \beta_t \mid D_t]\ d\beta_t \\
&= \exp\Big( -\frac{1}{2 \sigma_t^2}\, \delta_t(x_t)^T C_t\, \delta_t(x_t) \Big) \left[ \frac{|K_t(x_t, x_t)|}{|A_t|} \right]^{-1/2} (2 \pi \sigma_t^2)^{-\gamma_t}
\end{aligned}
\end{equation*}

with $\gamma_t = \frac{N_{x_t} - N_{\beta_t}}{2}$, $N_{x_t}$ as the number of pivot points in input vector $x_t$, and $N_{\beta_t}$ as the number of basis functions of level $t$. Apparently, the expectation of $\beta$ is independent of $\sigma$ and the covariance of $\beta$ is independent of $\rho$. We next turn to integrating with respect to $\rho_{t-1}$. We find $\rho_{t-1}$ in $\delta_t(x_t)$ only and thus as a quadratic form again:

\begin{equation*}
\begin{aligned}
-\frac{1}{2 \sigma_t^2}\, \delta_t(x_t)^T C_t\, \delta_t(x_t) &= -\frac{1}{2 \sigma_t^2} \Big( c_t - \frac{b_t^2}{a_t} \Big) - \frac{1}{2 \sigma_t^2}\, a_t \big( \rho_{t-1} - \langle \rho_{t-1} \mid \alpha_t, \sigma_t \rangle \big)^2 \\
\langle \rho_{t-1} \mid \alpha_t, \sigma_t \rangle &= \frac{b_t}{a_t} \\
\mathrm{var}(\rho_{t-1} \mid \alpha_t, \sigma_t) &= \frac{\sigma_t^2}{a_t} \\
c_t &= (z_t(x_t))^T C_t\, z_t(x_t) \\
b_t &= (z_t(x_t))^T C_t\, z_{t-1}(x_t) \\
a_t &= (z_{t-1}(x_t))^T C_t\, z_{t-1}(x_t)
\end{aligned}
\end{equation*}

We can now integrate with respect to $\rho_{t-1}$. We assume a flat prior:

\begin{equation*}
\begin{aligned}
p[\alpha_t, \sigma_t \mid D_t] &= \int p[\alpha_t, \sigma_t, \rho_{t-1} \mid D_t]\ d\rho_{t-1} \\
&= \exp\Big( -\frac{1}{2 \sigma_t^2} \Big( c_t - \frac{b_t^2}{a_t} \Big) \Big) \left[ \frac{|K_t(x_t, x_t)|}{|A_t|} \right]^{-1/2} (2 \pi \sigma_t^2)^{-\gamma_t} \sqrt{\frac{2 \pi \sigma_t^2}{a_t}}
\end{aligned}
\end{equation*}

The last random variable we can treat analytically is $σ_{t}$. With Jeffreys' prior $p [σ_{t} ∣ C] = \frac{1}{σ_{t}}$, we have

\begin{matrix} p [α_{t} ∣ D_{t}] & = \int p [α_{t}, σ_{t} ∣ D_{t}] d σ_{t} \\ & = \int Λ_{t} \exp (- \frac{Φ_{t}}{2 σ_{t}^{2}}) {(σ_{t}^{2})}^{- γ_{t} + \frac{1}{2}} \frac{d σ_{t}}{σ_{t}} \\ Φ_{t} & = c_{t} - \frac{b_{t}^{2}}{a_{t}} \\ Λ_{t} & = {[\frac{| K_{t} (x_{t}, x_{t}) |}{| A_{t} |}]}^{- 1 / 2} {(2 π)}^{- γ_{t}} \sqrt{\frac{2 π}{a_{t}}} \end{matrix}

With the substitution $v = \frac{Φ_{t}}{2 σ_{t}^{2}}$, we find this to be a $Γ$-integral:

\begin{matrix} p [α_{t} ∣ D_{t}] & = \frac{Λ_{t}}{2} {(\frac{Φ_{t}}{2})}^{- γ_{t} + \frac{1}{2}} \underbrace{\int_{0}^{\infty} e^{- v} v^{(γ_{t} - \frac{1}{2}) - 1} d v}_{= Γ (γ_{t} - \frac{1}{2})} \end{matrix}

The moments of $σ_{t}$ are $Γ$-integrals as well, and we find

\begin{matrix} \frac{〈σ_{t}^{ν} ∣ α_{t}〉}{〈σ_{t}^{0} ∣ α_{t}〉} & = {(\frac{Φ_{t}}{2})}^{ν / 2} \frac{Γ (γ_{t} - \frac{1}{2} - \frac{ν}{2})}{Γ (γ_{t} - \frac{1}{2})} \end{matrix}
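Carrying the substitution $v = Φ_{t} / (2 σ_{t}^{2})$ through the moment integral gives the ratio ${(Φ_{t} / 2)}^{ν / 2} \, Γ (γ_{t} - \frac{1}{2} - \frac{ν}{2}) / Γ (γ_{t} - \frac{1}{2})$. As a hedged sanity check, the sketch below compares this closed form with a plain midpoint quadrature of the $σ_{t}$-integrand. The function names `sigma_integral` and `sigma_moment` and the values of `phi` and `gamma_t` are illustrative assumptions.

```python
import math

def sigma_integral(phi, gamma_t, nu=0, n=20000, u_min=-12.0, u_max=12.0):
    """Midpoint estimate of
    int_0^inf sigma^nu exp(-phi / (2 sigma^2)) (sigma^2)^(-gamma_t + 1/2) dsigma / sigma,
    computed in u = log(sigma), where dsigma / sigma = du."""
    step = (u_max - u_min) / n
    total = 0.0
    for i in range(n):
        u = u_min + (i + 0.5) * step
        log_f = nu * u - phi / (2.0 * math.exp(2.0 * u)) - (2.0 * gamma_t - 1.0) * u
        total += math.exp(log_f)
    return total * step

def sigma_moment(phi, gamma_t, nu):
    """Closed-form normalized moment <sigma^nu> / <sigma^0> for levels t >= 2."""
    shape = gamma_t - 0.5
    if shape - nu / 2.0 <= 0.0:
        raise ValueError("moment does not exist: need gamma_t - 1/2 - nu/2 > 0")
    return (phi / 2.0) ** (nu / 2.0) * math.gamma(shape - nu / 2.0) / math.gamma(shape)

phi, gamma_t = 1.3, 3.0   # hypothetical values of Phi_t and gamma_t

# Normalization: (1/2) (Phi/2)^(-gamma + 1/2) Gamma(gamma - 1/2)
norm_closed = 0.5 * (phi / 2.0) ** (-(gamma_t - 0.5)) * math.gamma(gamma_t - 0.5)
norm_numeric = sigma_integral(phi, gamma_t)

# Second moment of sigma_t, numerically and in closed form
ratio_numeric = sigma_integral(phi, gamma_t, nu=2) / norm_numeric
ratio_closed = sigma_moment(phi, gamma_t, nu=2)
```

The `ValueError` branch mirrors the existence condition on the $Γ$-function argument: with too few pivot points relative to the number of basis functions, the requested moment is not finite.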

For the above $Γ$-integral and the $σ$-moments to exist, we require $γ_{t} - \frac{1}{2} - \frac{ν}{2} > 0$, i.e., Equation (4d). Note that, in the case of $t = 1$, there is no integration with respect to $ρ_{0}$, and we thus find one fewer power of $\sqrt{σ_{1}^{2}}$ in the above $Γ$-integral, i.e., we need to substitute $γ_{1} - \frac{1}{2} \to γ_{1}$. The constraint is accordingly weakened for $t = 1$, which is due to the absence of the single parameter $ρ_{0}$. For most choices of the covariance kernel, we cannot go further analytically.

Now we have everything we need to marginalize $σ_{t}$ in the conditional expectations of $ρ_{t - 1}$:

\begin{matrix} 〈ρ_{t - 1}^{ν} ∣ α_{t}〉 & = \int 〈ρ_{t - 1}^{ν} ∣ α_{t}, σ_{t}〉 p [α_{t}, σ_{t} ∣ D_{t}] d σ_{t} \end{matrix}

which is now relatively easy.

\begin{matrix} 〈ρ_{t - 1}^{1} ∣ α_{t}〉 & = \frac{b_{t}}{a_{t}} \\ 〈ρ_{t - 1}^{2} ∣ α_{t}〉 & = \frac{〈σ_{t}^{2} ∣ α_{t}〉}{a_{t}} + {(\frac{b_{t}}{a_{t}})}^{2} \end{matrix}
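As an illustration of how these two moments combine into a point estimate with an error bar, the following sketch evaluates the mean and standard deviation of $ρ_{t - 1}$ from $a_{t}$, $b_{t}$, $c_{t}$ and the counts $N_{x_{t}}$, $N_{β_{t}}$. It assumes $t \geq 2$ and uses the $Γ$-function expression for the normalized second moment of $σ_{t}$ that follows from the substitution $v = Φ_{t} / (2 σ_{t}^{2})$. The function name `rho_moments` and the input numbers are hypothetical.

```python
import math

def rho_moments(a, b, c, n_x, n_beta):
    """Posterior mean and standard deviation of rho_{t-1} given alpha_t (t >= 2)."""
    gamma_t = (n_x - n_beta) / 2.0
    shape = gamma_t - 0.5
    if shape - 1.0 <= 0.0:
        raise ValueError("second sigma moment needs gamma_t - 1/2 - 1 > 0")
    phi = c - b * b / a
    # Normalized second moment: (Phi/2) * Gamma(shape - 1) / Gamma(shape)
    sigma2_mean = (phi / 2.0) * math.gamma(shape - 1.0) / math.gamma(shape)
    first = b / a                           # first moment of rho
    second = sigma2_mean / a + first ** 2   # second moment of rho
    return first, math.sqrt(second - first ** 2)

# Hypothetical inputs: 12 pivot points, 4 basis functions.
mean, std = rho_moments(a=10.1, b=29.97, c=89.063, n_x=12, n_beta=4)
```

The standard deviation reduces to $\sqrt{〈σ_{t}^{2} ∣ α_{t}〉 / a_{t}}$, so the uncertainty of $ρ_{t - 1}$ shrinks as the low-fidelity data at the high-fidelity pivot points carry more weight in $a_{t}$.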

The expected mean function parameters are simply

\begin{matrix} 〈β_{t} ∣ α_{t}〉 & = \int 〈β_{t} ∣ α_{t}, ρ_{t - 1}, σ_{t}〉 p [α_{t}, ρ_{t - 1}, σ_{t} ∣ D_{t}] d ρ_{t - 1} d σ_{t} \\ & = A_{t} B_{t} (z_{t} (x_{t}) - 〈ρ_{t - 1} ∣ α_{t}〉 z_{t - 1} (x_{t})) \end{matrix}

The variance we read from the quadratic form in $β_{t}$ is as follows:

\begin{matrix} \operatorname{var} (β_{t}^{(k)} ∣ α_{t}) & = 〈σ_{t}^{2} ∣ α_{t}〉 \cdot {(A_{t})}_{k k} \end{matrix}

with ${(A_{t})}_{k k}$ as the $k$th diagonal element of $A_{t}$.

Appendix A.2. Predictive Mean

We rewrite ${\bar{δ}}_{t}$ such that

\begin{matrix} {\bar{δ}}_{t} (x) & = \underbrace{K_{t} (x, x_{t}) K_{t} {(x_{t}, x_{t})}^{- 1} (z_{t} (x_{t}) - ρ_{t - 1} z_{t - 1} (x_{t}))}_{u_{t}} + \underbrace{(h_{t} (x) - K_{t} (x, x_{t}) K_{t} {(x_{t}, x_{t})}^{- 1} h_{t} (x_{t}))}_{g_{t}} β_{t} \end{matrix}
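The split into $u_{t}$ and $g_{t}$ is a regrouping of terms. As a sketch, assuming ${\bar{δ}}_{t}$ is the standard Gaussian process conditional mean with linear mean function $h_{t} (x) β_{t}$ (this starting form is an assumption here, since it is not restated in this appendix), and writing $δ_{t} (x_{t})$ for the residual data vector that appears inside $u_{t}$:

```latex
\begin{aligned}
\bar{δ}_{t}(x)
  &= h_{t}(x) β_{t}
     + K_{t}(x, x_{t}) K_{t}(x_{t}, x_{t})^{-1}
       \bigl( δ_{t}(x_{t}) - h_{t}(x_{t}) β_{t} \bigr) \\
  &= \underbrace{K_{t}(x, x_{t}) K_{t}(x_{t}, x_{t})^{-1} δ_{t}(x_{t})}_{u_{t}}
     + \underbrace{\bigl( h_{t}(x)
       - K_{t}(x, x_{t}) K_{t}(x_{t}, x_{t})^{-1} h_{t}(x_{t}) \bigr)}_{g_{t}} β_{t}
\end{aligned}
```

In this form, $u_{t}$ carries all of the dependence on $ρ_{t - 1}$, while $g_{t}$ depends only on the kernel parameters and multiplies $β_{t}$ linearly.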

The first term, $u_{t}$, assumes a rather simple form and only depends on $ρ_{t - 1}$ and $α$. Since we previously found $p [ρ, α ∣ D] = \prod_{k} p [ρ_{k}, α_{t} ∣ D_{k}]$, and each $ρ_{k}$ occurs linearly, we find

\begin{matrix} 〈\prod_{k} ρ_{k} ∣ α〉 & = \prod_{k} 〈ρ_{k} ∣ α_{t}〉 \\ \Rightarrow 〈u_{t} \prod_{l = t}^{N_{t} - 1} ρ_{l} ∣ α〉 & = K_{t} (x, x_{t}) K_{t} {(x_{t}, x_{t})}^{- 1} (z_{t} (x_{t}) - 〈ρ_{t - 1} ∣ α_{t}〉 z_{t - 1} (x_{t})) \prod_{l = t}^{N_{t} - 1} 〈ρ_{l} ∣ α_{t}〉 \end{matrix}

The second term is rather trivial.

\begin{matrix} 〈g_{t} β_{t} \prod_{l = t}^{N_{t} - 1} ρ_{l} ∣ α_{t}〉 = g_{t} 〈β_{t} ∣ α_{t}〉 \prod_{l = t}^{N_{t} - 1} 〈ρ_{l} ∣ α_{t}〉 \end{matrix}

We have thus shown the predictive mean in Equation (4).

Appendix A.3. Predictive Covariance

We easily find

\begin{matrix} (\sum_{t = 1}^{N_{t}} δ_{t} (x) \prod_{l = t}^{N_{t} - 1} ρ_{l}) {(\sum_{t = 1}^{N_{t}} δ_{t} (x) \prod_{l = t}^{N_{t} - 1} ρ_{l})}^{T} & = \sum_{t, t^{'} = 1}^{N_{t}} δ_{t} (x) δ_{t^{'}} {(x)}^{T} \prod_{l = t}^{N_{t} - 1} ρ_{l} \prod_{l^{'} = t^{'}}^{N_{t} - 1} ρ_{l^{'}} \end{matrix}

Due to the assumed independence of the levels, or of each level's parameters, formally $δ_{t} (x) ⊥ z_{t - 1} (x)$, the cross-terms mixing different levels vanish in the expectation value, and the double sum reduces to a single sum. Further, we can factor the expectation into the expectation of the product of the $ρ_{l}$ and the expectation of the sum.

\begin{matrix} 〈\sum_{t, t^{'} = 1}^{N_{t}} δ_{t} (x) δ_{t^{'}} {(x)}^{T} \prod_{l = t}^{N_{t} - 1} ρ_{l} \prod_{l^{'} = t^{'}}^{N_{t} - 1} ρ_{l^{'}}〉 & = 〈\sum_{t = 1}^{N_{t}} δ_{t} (x) δ_{t} {(x)}^{T}〉 \prod_{l = t}^{N_{t} - 1} 〈ρ_{l}^{2}〉 \end{matrix}

The first factor on the right-hand side can be expressed via the GP’s posterior covariance and mean:

\begin{matrix} \sum_{t = 1}^{N_{t}} 〈δ_{t} (x) δ_{t} {(x)}^{T}〉 & = \sum_{t = 1}^{N_{t}} (Σ_{t} (x, x) + 〈δ_{t} (x)〉 {〈δ_{t} (x)〉}^{T}) \end{matrix}

where

\begin{matrix} Σ_{t} (x, x) & = σ_{t}^{2} (K_{t} (x, x) - K_{t} (x, x_{t}) K_{t} {(x_{t}, x_{t})}^{- 1} K_{t} (x_{t}, x)) \end{matrix}

is the GP's posterior covariance, and the predictive mean is already known:

\begin{matrix} 〈δ_{t} (x) ∣ α_{t}〉 = 〈u_{t} ∣ α_{t}〉 + g_{t} 〈β_{t} ∣ α_{t}〉 \end{matrix}

We have thus shown everything we claimed.
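To make the posterior covariance $Σ_{t} (x, x)$ concrete, the following minimal sketch evaluates it at a single prediction point for noise-free data. The squared-exponential kernel, the parameter name `alpha` for its length scale, and the pivot points in `x_train` are assumptions made for illustration only; they are not taken from this study.

```python
import math

def sq_exp_kernel(xa, xb, alpha):
    """Squared-exponential correlation (the kernel choice is an assumption)."""
    return math.exp(-0.5 * ((xa - xb) / alpha) ** 2)

def solve(M, rhs):
    """Solve M y = rhs by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    aug = [list(row) + [r] for row, r in zip(M, rhs)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for k in range(col, n + 1):
                aug[r][k] -= factor * aug[col][k]
    y = [0.0] * n
    for r in reversed(range(n)):
        tail = sum(aug[r][k] * y[k] for k in range(r + 1, n))
        y[r] = (aug[r][n] - tail) / aug[r][r]
    return y

def posterior_variance(x, x_train, sigma, alpha):
    """sigma^2 (K(x, x) - K(x, x_t) K(x_t, x_t)^{-1} K(x_t, x)) at one point x."""
    K = [[sq_exp_kernel(xi, xj, alpha) for xj in x_train] for xi in x_train]
    k_star = [sq_exp_kernel(x, xi, alpha) for xi in x_train]
    weights = solve(K, k_star)                  # K(x_t, x_t)^{-1} K(x_t, x)
    reduction = sum(k * w for k, w in zip(k_star, weights))
    return sigma ** 2 * (sq_exp_kernel(x, x, alpha) - reduction)

x_train = [0.0, 1.0, 2.5, 4.0]                  # hypothetical pivot points x_t
var_at_pivot = posterior_variance(1.0, x_train, sigma=0.5, alpha=1.0)
var_between = posterior_variance(1.75, x_train, sigma=0.5, alpha=1.0)
var_far_away = posterior_variance(50.0, x_train, sigma=0.5, alpha=1.0)
```

The variance vanishes at a pivot point, stays below $σ_{t}^{2}$ between pivot points, and returns to the prior value $σ_{t}^{2}$ far away from all data, which is the behaviour the predictive covariance inherits level by level.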

References

1. Ghanem, R.G.; Owhadi, H.; Higdon, D. Handbook of Uncertainty Quantification; Springer: Basel, Switzerland, 2017.
2. Haylock, R. Bayesian Inference about Outputs of Computationally Expensive Algorithms with Uncertainty on the Inputs. Ph.D. Thesis, University of Nottingham, Nottingham, UK, May 1997.
3. O’Hagan, A.; Kennedy, M.C.; Oakley, J.E. Uncertainty Analysis and Other Inference Tools for Complex Computer Codes. 1999. Available online: https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.51.446&rep=rep1&type=pdf (accessed on 26 December 2019).
4. O’Hagan, A. Bayesian analysis of computer code outputs: A tutorial. Reliab. Eng. Syst. Saf. 2006, 91, 1290–1300.
5. Wiener, N. The Homogeneous Chaos. Am. J. Math. 1938, 60, 897–936.
6. Ghanem, R.G.; Spanos, P.D. Stochastic Finite Elements: A Spectral Approach; Springer: Basel, Switzerland, 1991.
7. Xiu, D.; Karniadakis, G.E. The Wiener-Askey polynomial chaos for stochastic differential equations. SIAM J. Sci. Comput. 2005, 27, 1118–1139.
8. O’Hagan, A. Polynomial Chaos: A Tutorial and Critique from a Statistician’s Perspective. Available online: http://tonyohagan.co.uk/academic/pdf/Polynomial-chaos.pdf (accessed on 25 June 2019).
9. O’Hagan, A. Curve Fitting and Optimal Design for Prediction. J. R. Stat. Soc. Ser. B Methodol. 1978, 40, 1–42.
10. Rasmussen, C.E.; Williams, C.K. Gaussian Processes for Machine Learning; The MIT Press: Cambridge, MA, USA, 2006.
11. Bishop, C. Neural Networks for Pattern Recognition; Oxford University Press: Oxford, UK, 1996.
12. MacKay, D.J. Information Theory, Inference, and Learning Algorithms; Cambridge University Press: Cambridge, UK, 2003.
13. Kennedy, M.C.; O’Hagan, A. Predicting the output from a complex computer code when fast approximations are available. Biometrika 2000, 87, 1–13.
14. Koutsourelakis, P.S. Accurate Uncertainty Quantification using inaccurate Computational Models. SIAM J. Sci. Comput. 2009, 31, 3274–3300.
15. Perdikaris, P.; Raissi, M.; Damianou, A.; Lawrence, N.D.; Karniadakis, G.E. Nonlinear information fusion algorithms for data-efficient multi-fidelity modelling. Proc. R. Soc. A Math. Phys. Eng. Sci. 2017, 473, 20160751.
16. Eck, V.G.; Donders, W.P.; Sturdy, J.; Feinberg, J.; Delhaas, T.; Hellevik, L.R.; Huberts, W. A guide to uncertainty quantification and sensitivity analysis for cardiovascular applications. Int. J. Numer. Methods Biomed. Eng. 2016, 32, e02755.
17. Biehler, J.; Gee, M.W.; Wall, W.A. Towards efficient uncertainty quantification in complex and large-scale biomechanical problems based on a Bayesian multi-fidelity scheme. Biomech. Model. Mechanobiol. 2015, 14, 489–513.
18. Zienkiewicz, O.C.; Taylor, R.L.; Zhu, J.Z. The Finite Element Method: Its Basis and Fundamentals; Elsevier: Amsterdam, The Netherlands, 1967.
19. Miller, J.C.; Horvath, S.M. Impedance Cardiography. Psychophysiology 1978, 15, 80–91.
20. Reinbacher-Köstinger, A.; Badeli, V.; Biro, O.; Magele, C. Numerical Simulation of Conductivity Changes in the Human Thorax Caused by Aortic Dissection. IEEE Trans. Magn. 2019, 55, 1–4.
21. Humphrey, J.D. Cardiovascular Solid Mechanics; Springer: Basel, Switzerland, 2002; p. 758.
22. Khan, I.A.; Nair, C.K. Clinical, diagnostic, and management perspectives of aortic dissection. Chest 2002, 122, 311–328.
23. Gabriel, C.; Gabriel, S.; Corthout, E.C. The dielectric properties of biological tissues: I. Literature survey. Phys. Med. Biol. 1996, 41, 2231–2249.
24. Sivia, D.; Skilling, J. Data Analysis: A Bayesian Tutorial; Oxford University Press: Oxford, UK, 2006.
25. Skilling, J. Nested sampling for general Bayesian computation. Bayesian Anal. 2006, 1, 833–859.
26. Le Gratiet, L.; Garnier, J. Recursive Co-Kriging Model for Design of Computer Experiments With Multiple Levels of Fidelity. Int. J. Uncertain. Quantif. 2014, 4, 365–386.
27. Ranftl, S.; Melito, G.M.; Badeli, V.; Reinbacher-Köstinger, A.; Ellermann, K.; von der Linden, W. On the Diagnosis of Aortic Dissection with Impedance Cardiography: A Bayesian Feasibility Study Framework with Multi-Fidelity Simulation Data. Proceedings 2019, 33, 24.
28. Alastruey, J.; Xiao, N.; Fok, H.; Schaeffter, T.; Figueroa, C.A. On the impact of modelling assumptions in multi-scale, subject-specific models of aortic haemodynamics. J. R. Soc. Interface 2016, 13.
29. Comsol, A.B.; Stockholm, S. Comsol Multiphysics Version 5.4. Available online: http://www.comsol.com (accessed on 4 June 2019).
30. Raissi, M.; Perdikaris, P.; Karniadakis, G.E. Inferring solutions of differential equations using noisy multi-fidelity data. J. Comput. Phys. 2017, 335, 736–746.
31. Albert, C.G. Gaussian processes for data fulfilling linear differential equations. Proceedings 2019, 33, 5.

Figure 1. Mock data analysis: Posterior probability density functions of the nonlinear kernel parameters $α_{1}$ and $α_{2}$. Black dashed line: True value.

Figure 2. Mock data analysis: Prediction. Note that the uncertainties have been multiplied by a factor of 10 for illustrative purposes.

Figure 3. Right: Mesh-converged HiFi model with 100,000–550,000 degrees of freedom. Left: LoFi model with 9000–15,000 degrees of freedom, with labels of the geometrical objects. Adapted from Reference [27].

Figure 4. Posterior probability of the nonlinear kernel parameters.

Figure 5. Data, prediction, and prediction uncertainty of the absolute value of the admittance, i.e., the inverse impedance in units of inverse Ohm. $z_{1} (x_{2})$ denotes LoFi data at the same pivot points as HiFi data.

Table 1. Mock data analysis: Comparison of the hyperparameter estimates with their true values.

Hyperparameter	Estimate (Multi-Fidelity)	Truth
$β_{1}^{(1)}$	$0.30 \pm 0.07$	$0.32$
$β_{1}^{(2)}$	$- 0.30 \pm 0.09$	$- 0.40$
$β_{1}^{(3)}$	$0.02 \pm 0.08$	$0.1$
$β_{1}^{(4)}$	$0.34 \pm 0.06$	$0.35$
$β_{1}^{(5)}$	$- 0.50 \pm 0.04$	$- 0.51$
$β_{1}^{(6)}$	$0.35 \pm 0.02$	$0.33$
$β_{1}^{(7)}$	$- 0.033 \pm 0.008$	$- 0.034$
$β_{1}^{(8)}$	$- 0.146 \pm 0.005$	$- 0.142$
$β_{1}^{(9)}$	$0.1745 \pm 0.0008$	$0.1750$
$β_{1}^{(10)}$	$- 0.1031 \pm 0.0004$	$- 0.1034$
$β_{2}^{(1)}$	$0.42 \pm 0.59$	$0.\dot{3}$
$β_{2}^{(2)}$	$- 0.17 \pm 0.55$	$0.15$
$β_{2}^{(3)}$	$0.11 \pm 0.17$	$0.01\dot{6}$
$β_{2}^{(4)}$	$0.03 \pm 0.10$	0
$σ_{1}$	$0.133 \pm 0.096$	$0.1$
$σ_{2}$	$0.47 \pm 0.58$	$0.01$
$ρ_{1}$	$2.97 \pm 0.02$	3
$α_{1}$	$8.6 \pm 0.9$	$10.1$
$α_{2}$	$10 \pm 3$	$20.1$

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
