2. Bayesian Uncertainty Quantification
Based on the Bayesian framework we employ a spectral expansion to quantify the propagation of uncertainty through the model. First introduced by Wiener [
2] in the context of Hermite basis functions it was termed ‘polynomial chaos expansion’ at his time. Nowadays the notion of ‘chaos’ has shifted and the use of the term ‘spectral expansion’ is more appropriate. Once successfully achieved, the spectral representation is capable of quantifying the uncertainty for any point in model space or to serve as a surrogate model.
Since we calculate the sought-for spectral coefficients from a discrete set of collocation points in the space of the random variable, our approach is non-intrusive, but approximate. The emerging integrals in the calculation of the coefficients are evaluated by Gaussian quadrature which identifies the collocation points with those of the quadrature. Moreover, we assume mutually independent normally distributed random variables. The adjunctive set of orthonormal basis functions in such a case are Hermite polynomials.
To quantify the uncertainty of a result
R we seek the appropriate function
, such that
R will have the required distribution of the model response,
. As for all random variables with finite variance it is possible to find an infinite expansion
which we limit to polynomial order
P since the contributions of higher orders become numerically insignificant. The coefficients are given by
We assume Gaussian character for the random variable, so the density
is distributed according to the normal (probability) distribution
The adjunctive set of orthonormal basis functions is given by the so-called
probabilist Hermite functions, which read up to fourth order
It turns out that for the model simulations under consideration this polynomial order is sufficient since contributions from higher orders become numerically insignificant for the result. With these definitions the normalization constants in Equation (
2) are readily
Due to the Gaussian nature of the probability function omnipresent in the integrals above, it is beneficial to use Gauss-Hermite quadrature for the evaluation
where the weights
and the abscissas
are for instance provided by Numerical Recipes [
3]. Eventually, by exploiting the properties of the orthogonal Hermite polynomials the expectation value of the model outcome and its variance can be assigned to the spectral coefficients in Equation (
2)
In order to provide a measure for the influence of the uncertainty of input variables on the above variance we employ Sobol coefficients [
4]. They are defined by
where the evaluation of the integrals
results in combinations of the coefficients of Equation (
2) (the index of the function
relates to the specific variable(s)
which are omitted in the integral
). The higher the value of a Sobol coefficient with respect to the others is, the more it is advantageous to reduce the uncertainty of its associated variable in order reduce the uncertainty of the quantity of interest.
4. Results and Discussion
The above program is applied to simulate ion-solid interactions for the case of incident deuterium ions with an energy of eV at an impact angle of degrees to a surface consisting of iron with a commonly used surface binding energy of eV. We assume the parameters to be normally distributed within a standard deviation of roughly 10%, i.e., eV, degrees and eV.
First, in order to have a calibration standard to compare with we employ random sampling of the model response. For each realization of the random variable
there exists a model response
constituting the sample solution set
from which moments can be computed. The expected mean is and its variance read
In
Figure 1 the results of
= 8000 samples are shown for the above parameter settings. The mean value for the sputter yield is
with a standard deviation of
= 0.013. Even more, the full uncertainty distribution may be established with help of a histogram if the sample solution set is sufficiently large (
1000). But, although this procedure is straightforward and automatically contains the full model answer with all correlations, it has the vital drawback of a comparatively low convergence rate. If the computation time of a single model output is not in the order of seconds or becomes more sophisticated with a higher number of variables (curse of dimension), the mere accumulation of sample point densities to infer the complete distribution is futile. Much more promising in this respect is the spectral approach of
Section 2 which results will be discussed next.
Extending the formulas of
Section 2 to three random variables
with
,
and
, the summation of the terms in Equation (
6) runs over three indices
,
and
with an upper boundary of
in the present setup of fourth order polynomials (for numerical accuracy of the Gaussian quadrature it is expedient to be one order higher than the polynomial order of the spectral expansion). This results in a total of 216 terms (three nested summations, each running from
= 0 to 5 with
i = 1, 2, 3) over the collocation points composed of 6 Gaussian quadrature abscissas assigned to
and 6 weights
. The value for the function
is obtained from a SDTrimSP run, which takes roughly 3 min on a modern CPU. However, the complete run for the 216 terms can be speeded up enormously since the calculations are independent and can be done in parallel. Once calculated, the 35 coefficients of Equation (
6) establish a fast surrogate model, which is simply the evaluation of a polynomial. This is shown in
Figure 2 as the red mesh. The respective sputter yield, for which the uncertainty quantification was performed, is depicted in the center as the green sphere with
at/ion and its standard deviation of
as the green perpendicular line. The comparison with the result of the sampling approach above (
) shows excellent agreement.
Without the need to do any further simulations, various quantities may be inferred from the coefficients, e.g., the variance as in Equation (
2), or the Sobol coefficients, which allow to investigate the sensitivity of the result on the uncertainty of the input variables. For the above variables
,
and
we get a relationship of 10:20:70 in the Sobol coefficients (only first order is numerically significant) for
, indicating that the improvement of the knowledge of
is most rewarding if one wants to reduce the uncertainty of the sputter yield.
Following this trail, we performed a series of experimental measurements of the sputter yield for different impact angles with
= 0, 45, 60 and 75 degrees at
= 2 keV. Then we applied the uncertainty quantification method discussed above in order to provide quantitative estimates of the sputter yields at a variety of settings for the surface binding energy
. It turned out that the most probable value for the surface binding energy is
eV, one and a half standard deviations larger than the value commonly used up to now [
1], i.e.,
eV. With the revised setting of
we compared (see
Figure 3) simulations of the sputter yield for different incident energies of deuterium with results from Rutherford backscattering (RBS) and weight loss (WL) experiments and got an improved agreement (except for
= 1 keV). With these results the Bayes factor rules out another competitor to SDTrimSP (i.e., Monte Carlo decision for the occurrences of collisions of incident ions with atoms in the target) being the SRIM-model, which employs a quantum mechanical treatment of ion-atom collisions and seems not to comprise all important effects present..