Article

Computation of High-Order Sensitivities of Model Responses to Model Parameters—I: Underlying Motivation and Current Methods

by
Dan Gabriel Cacuci
Center for Nuclear Science and Energy, University of South Carolina, Columbia, SC 29208, USA
Energies 2023, 16(17), 6355; https://doi.org/10.3390/en16176355
Submission received: 7 July 2023 / Revised: 1 August 2023 / Accepted: 25 August 2023 / Published: 1 September 2023
(This article belongs to the Section B4: Nuclear Energy)

Abstract:
The mathematical/computational model of a physical system comprises parameters and independent and dependent variables. Since the physical system is seldom known precisely and since the model’s parameters stem from experimental procedures that are also subject to uncertainties, the results predicted by a computational model are imperfect. Quantifying the reliability and accuracy of results produced by a model (called “model responses”) requires the availability of sensitivities (i.e., functional partial derivatives) of model responses with respect to model parameters. This work reviews the basic motivations for computing high-order sensitivities and illustrates their importance by means of an OECD/NEA reactor physics benchmark, which is representative of a “large-scale system” involving many (21,976) uncertain parameters. The computation of higher-order sensitivities by conventional methods (finite differences and/or statistical procedures) is subject to the “curse of dimensionality”. Furthermore, as will be illustrated in this work, the accuracy of high-order sensitivities computed using such conventional methods cannot be a priori guaranteed. High-order sensitivities can be computed accurately and efficiently solely by applying the high-order adjoint sensitivity analysis methodology. The principles underlying this adjoint methodology are also reviewed in preparation for introducing, in the accompanying Part II, the “High-Order Function/Feature Adjoint Sensitivity Analysis Methodology” (nth-FASAM), which aims at most efficiently computing exact expressions of high-order sensitivities of model responses to functions (“features”) of model parameters.

1. Introduction

The mathematical/computational models of physical systems comprise parameters and independent and dependent variables. The system's independent variables and parameters are related to the system's state (i.e., dependent) variables through a well-posed system of equations, which is usually nonlinear in all of its components. The minimum amount of information needed to use a model is the availability of nominal or mean values for the system parameters. With the known nominal parameter values, a model can be used to compute results of interest, which are called model/system "responses", "objective functions", or "indices of performance". Since the physical processes themselves are seldom known precisely and since most of the model's parameters stem from experimental procedures that are also subject to imprecisions and/or uncertainties, the results predicted by these models are also imprecise, being affected by the uncertainties underlying the respective model. Quantifying the reliability and accuracy of results (responses) computed using models is achieved by performing activities known as "sensitivity analysis", "uncertainty quantification", "model verification", "model validation", "data assimilation", "model calibration", and "predictive modeling". Sensitivity analysis aims at quantifying the changes in the computed response that would be induced by changes in the model parameters, along with understanding the model by ranking the importance of the various parameters in contributing to the response. The availability of such rankings makes it possible to prioritize improvements in the model and also to streamline it by eliminating unimportant parameters and/or processes to develop "reduced-order models". Uncertainty quantification aims at quantitatively determining the uncertainties induced in a model response by the model parameter uncertainties.
Model verification deals with the question, “Are the equations underlying the model solved correctly?” Model validation compares the model’s computational results to experimental information in order to address the question, “Does the model represent reality?” Data assimilation combines experimental and computational information to improve model responses while model calibration uses experimental information to adjust/calibrate the parameters to improve the accuracy of future computations. “Predictive modeling” aims at obtaining “best-estimate” optimally predicted results with reduced predicted uncertainties, subsuming data assimilation and model calibration. All of these activities, as well as optimizing the system and using models for “inverse problems”, require the availability of the functional derivatives—called “sensitivities”—of model responses with respect to the imprecisely known model parameters.
For large-scale models involving many parameters, even the first-order sensitivities are very expensive to compute by conventional methods. It is known that the "adjoint method" of sensitivity analysis is the most efficient method for computing first-order sensitivities exactly, since it requires a single large-scale (adjoint) computation, independently of the number of model parameters. Cacuci [1,2] conceived the rigorous first-order adjoint sensitivity analysis methodology for generic large-scale nonlinear (as opposed to linearized) systems involving generic operator responses and introduced these principles to the earth, atmospheric, and other sciences, as credited, e.g., by Práger and Kelemen [3] and by Luo, Wang, and Liu [4].
The computation of higher-order sensitivities by conventional methods is subject to the "curse of dimensionality", a term coined by Bellman [5] to describe phenomena in which the number of computations increases exponentially in the respective phase space. In the particular case of sensitivity analysis using conventional methods, the number of large-scale computations increases exponentially in the phase space of the model parameters as the order of sensitivities increases. The general mathematical frameworks of the second-order adjoint sensitivity analysis methodology for generic linear and nonlinear systems, respectively, were conceived by Cacuci [6,7,8]. The unparalleled efficiency of the second-order adjoint sensitivity analysis methodology for linear systems was demonstrated [9,10,11,12,13,14] by applying this methodology to compute exactly the 21,976 first-order sensitivities and 482,944,576 second-order sensitivities (of which 241,483,276 are distinct from each other) for an OECD/NEA reactor physics benchmark [15], which is representative of a large-scale system that involves many (21,976, in this illustrative example) parameters. Such a large-scale system cannot be analyzed exactly and comprehensively by any other method. Contrary to the widely held belief that second- and higher-order sensitivities are negligible for reactor physics systems, it was found [9,10,11,12,13,14] that many second-order sensitivities of the OECD benchmark's response to the benchmark's uncertain parameters were much larger than the largest first-order ones. This finding motivated the investigation of the largest third-order sensitivities, many of which were found [16,17] to be even larger than the second-order ones. This finding, in turn, motivated the development of the mathematical framework for determining and computing the fourth-order sensitivities [18], many of which were found to be larger than the third-order ones.
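The benchmark figures quoted above can be verified by simple bookkeeping; the short Python check below (an illustration, not part of the cited software) confirms that 21,976 parameters yield 21,976² = 482,944,576 second-order sensitivities, of which 21,976 × 21,977/2 are distinct because mixed partial derivatives are symmetric:

```python
# Sanity check of the second-order sensitivity counts for a model with
# TP uncertain parameters (TP = 21,976 for the OECD/NEA PERP benchmark).
TP = 21976

total_second_order = TP ** 2               # all (j1, j2) index pairs
distinct_second_order = TP * (TP + 1) // 2 # symmetry: d2R/da_i da_j = d2R/da_j da_i

print(total_second_order)     # 482944576
print(distinct_second_order)  # 241483276
```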
This sequence of findings has motivated the conception by Cacuci [19] of the nth-Order Comprehensive Adjoint Sensitivity Analysis Methodology for Response-Coupled Forward/Adjoint Linear Systems (abbreviated as “nth-CASAM-L”). The nth-CASAM-L overcomes the curse of dimensionality in sensitivity analysis, enabling the efficient computation of exactly determined expressions of arbitrarily high-order sensitivities of a generic system response—which can depend on both the forward and adjoint state functions for linear systems—with respect to all of the parameters that characterize the physical system. The “nth-CASAM-L” mathematical framework was developed specifically for linear systems because the most important model responses produced by such systems are various Lagrangian functionals, which depend simultaneously on both the forward and adjoint state functions governing the respective linear system. Such responses can occur only for linear systems because responses in nonlinear systems can only depend on the system’s forward state functions (since nonlinear operators do not admit adjoint operators).
In parallel with the aforementioned developments, Cacuci [20] has extended his original work [1,2] on nonlinear systems to include the treatment of uncertain boundaries and interfaces, encompassing the mathematical framework for deriving and computing efficiently the exact expressions of sensitivities up to and including the fifth-order. This general methodology is called the nth-Order Comprehensive Adjoint Sensitivity Analysis Methodology for Nonlinear Systems (abbreviated as nth-CASAM-N). Similar to the nth-CASAM-L, the nth-CASAM-N [19] is formulated in linearly increasing higher-dimensional Hilbert spaces (as opposed to exponentially increasing parameter-dimensional spaces), thus overcoming the curse of dimensionality in sensitivity analysis of nonlinear systems, enabling the most efficient computation of exactly determined expressions of arbitrarily high-order sensitivities of generic nonlinear system responses with respect to model parameters, uncertain boundaries, and internal interfaces in the model’s phase space. Additional details regarding applications of the nth-CASAM-L and the nth-CASAM-N methodologies are provided in [21,22,23].
This work is structured as follows: Section 2 presents the mathematical modeling of physical systems, which comprises imprecisely known (uncertain) model parameters and physical boundaries that could arise from manufacturing tolerances. Using the OECD/NEA reactor physics benchmark [15], Section 3 illustrates the essential role and impact of high-order sensitivities in uncertainty quantification and predictive modeling. Section 4 illustrates the difficulties involved in computing high-order sensitivities using finite difference formulas, which, along with the high-order adjoint sensitivity analysis methodologies nth-CASAM-L and nth-CASAM-N, are the only procedures available for computing high-order sensitivities. Section 4 also reviews the principles underlying the nth-CASAM-N, which will be used in the accompanying Part II [24] to conceive the "High-Order Function/Feature Adjoint Sensitivity Analysis Methodology" (nth-FASAM), which aims at computing exact expressions of high-order sensitivities of model responses to functions ("features") of model parameters; such features are found in many models encountered in practice. By means of an illustrative paradigm example of particle transport (i.e., a simplified version of the OECD/NEA reactor physics benchmark [14]), the accompanying Part II [24] highlights the unparalleled gain in efficiency of the nth-FASAM (to be developed in [24]) by comparison to the nth-CASAM, which itself is currently peerless in terms of accuracy and efficiency for computing high-order sensitivities of responses with respect to model parameters.

2. Modeling of a Nonlinear Physical System Comprising Imprecisely Known Parameters and Boundaries

The mathematical model that underlies the numerical evaluation of a process and/or state of a physical system comprises equations that relate the system's independent variables and parameters to the system's state (dependent) variables. Such a model can generically be represented by means of coupled operator equations, which are in general nonlinear, written as follows:
$$\mathbf{N}[\mathbf{u}(\mathbf{x});\mathbf{x};\boldsymbol{\alpha}] = \mathbf{Q}(\mathbf{x};\boldsymbol{\alpha}),\quad \mathbf{x}\in\Omega_x(\boldsymbol{\alpha});\qquad(1)$$
$$\mathbf{B}[\mathbf{u}(\mathbf{x});\boldsymbol{\alpha}] = \mathbf{C}(\boldsymbol{\alpha}),\quad \mathbf{x}\in\partial\Omega_x(\boldsymbol{\alpha}).\qquad(2)$$
Matrices are denoted using capital bold letters, while vectors are denoted using either capital or lower-case bold letters. The symbol "≜" is used to denote "is defined as" or "is by definition equal to". Transposition is indicated by a dagger (†) superscript. The equalities in this work are considered to hold in the weak ("distributional") sense. The right sides of Equations (1) and (2), as well as of other equations to be derived in this work, may contain "generalized functions/functionals", particularly Dirac distributions and derivatives thereof.
The results computed using a mathematical model are customarily called “model responses” (or “system responses”, “objective functions”, or “indices of performance”). As has been discussed in [21,23], all responses can be fundamentally analyzed in terms of the following generic integral representation:
$$R[\mathbf{u}(\mathbf{x});\boldsymbol{\alpha}] \triangleq \int_{\lambda_1(\boldsymbol{\alpha})}^{\omega_1(\boldsymbol{\alpha})}\cdots\int_{\lambda_{TI}(\boldsymbol{\alpha})}^{\omega_{TI}(\boldsymbol{\alpha})} S[\mathbf{u}(\mathbf{x});\mathbf{x};\boldsymbol{\alpha}]\, dx_1\cdots dx_{TI},\qquad(3)$$
where $S[\mathbf{u}(\mathbf{x});\mathbf{x};\boldsymbol{\alpha}]$ is a suitably differentiable nonlinear function of $\mathbf{u}(\mathbf{x})$ and of $\boldsymbol{\alpha}$.
Without loss of generality, the quantities that appear in Equations (1)–(3) can be considered to be real-valued and have the following meanings:
1. The column vector $\boldsymbol{\alpha} \triangleq (\alpha_1,\dots,\alpha_{TP}) \in \mathbb{R}^{TP}$ represents the "vector of primary model parameters" and has components denoted as $\alpha_1,\dots,\alpha_{TP}$, where $TP$ denotes the "total number of parameters" underlying the model under consideration. These parameters are "input" for the model and are subject to uncertainties stemming from processes that are external to the system under consideration. The nominal (expected/mean) values of the model parameters are considered to be known and will be denoted as $\boldsymbol{\alpha}^0 \triangleq (\alpha_1^0,\dots,\alpha_i^0,\dots,\alpha_{TP}^0)$; the superscript "0" will be used throughout this work to denote "nominal values". The components of the vector $\boldsymbol{\alpha}$ include not only parameters that appear in Equations (1) and (2), which define the computational model, but also parameters that may occur only in the definition of the model's response under consideration, cf. Equation (3). Without loss of generality, the model parameters $\alpha_1,\dots,\alpha_{TP}$ can be considered to be real-valued scalars;
2. The column vector $\mathbf{x} \triangleq (x_1,\dots,x_{TI}) \in \mathbb{R}^{TI}$ comprises the model's independent variables, denoted as $x_i,\ i=1,\dots,TI$, where $TI$ denotes the "total number of independent variables". The vector $\mathbf{x}$ is considered to be defined on a phase-space domain denoted as $\Omega_x(\boldsymbol{\alpha}) \triangleq \{\lambda_i(\boldsymbol{\alpha}) \le x_i \le \omega_i(\boldsymbol{\alpha});\ i=1,\dots,TI\}$; the particular cases when $\lambda_i(\boldsymbol{\alpha}) = -\infty$ and/or $\omega_i(\boldsymbol{\alpha}) = \infty$ for some independent variables $x_i$ are included. The domain boundary $\partial\Omega_x[\boldsymbol{\lambda}(\boldsymbol{\alpha});\boldsymbol{\omega}(\boldsymbol{\alpha})]$ of $\Omega_x(\boldsymbol{\alpha})$ is defined to comprise the set of all of the endpoints $\lambda_i(\boldsymbol{\alpha}),\ \omega_i(\boldsymbol{\alpha}),\ i=1,\dots,TI$. These endpoints depend on the physical system's geometrical dimensions, which may be imprecisely known because of manufacturing tolerances, and are therefore considered to be components of the vector $\boldsymbol{\alpha}$ of primary model parameters. Furthermore, the boundary endpoints may also depend on the parameters that define the material properties of the respective medium. For example, in models based on diffusion theory, the boundary conditions for materials facing air/vacuum are imposed on the "extrapolated boundary" of the respective spatial domain. The "extrapolated boundary" depends both on the imprecisely known physical dimensions of the system's materials and on the materials' properties, such as atomic number densities and microscopic transport cross-sections;
3. The column vector $\mathbf{u}(\mathbf{x}) \triangleq [u_1(\mathbf{x}),\dots,u_{TD}(\mathbf{x})]^\dagger$ comprises the model's dependent variables $u_i(\mathbf{x}),\ i=1,\dots,TD$, where $TD$ denotes the "total number of dependent variables";
4. The column vector $\mathbf{N}[\mathbf{u}(\mathbf{x});\mathbf{x};\boldsymbol{\alpha}] \triangleq [N_1,\dots,N_{TD}]^\dagger$ comprises components $N_i[\mathbf{u}(\mathbf{x});\mathbf{x};\boldsymbol{\alpha}],\ i=1,\dots,TD$, which are operators that act linearly and/or nonlinearly on $\mathbf{u}(\mathbf{x})$, $\mathbf{x}$, and $\boldsymbol{\alpha}$;
5. The components $q_i[\mathbf{u}(\mathbf{x});\mathbf{x};\boldsymbol{\alpha}],\ i=1,\dots,TD$ of the $TD$-dimensional column vector $\mathbf{Q}[\mathbf{u}(\mathbf{x});\mathbf{x};\boldsymbol{\alpha}] \triangleq [q_1,\dots,q_{TD}]^\dagger$ denote inhomogeneous source terms;
6. The components of $\mathbf{B}[\mathbf{u}(\mathbf{x});\boldsymbol{\alpha};\mathbf{x}]$ and $\mathbf{C}(\boldsymbol{\alpha})$ are operators, in general nonlinear, that represent boundary and/or initial conditions imposed on $\partial\Omega_x(\boldsymbol{\alpha})$.
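To make the generic response of Equation (3) concrete, the following Python sketch (a hypothetical one-dimensional example with an assumed integrand and an assumed parameter-dependent upper boundary, not any model from this work) evaluates such an integral response by midpoint-rule quadrature and estimates its sensitivity to the boundary parameter by a central finite difference; both are checked against the closed form $R = \alpha_1 \alpha_2^3/3$:

```python
# Toy response of the form of Eq. (3): R(alpha) = integral_0^{omega(alpha)} S dx,
# with assumed integrand S = alpha1 * x**2 and assumed boundary omega(alpha) = alpha2,
# so that R = alpha1 * alpha2**3 / 3 exactly.
def response(a1, a2, n=100000):
    # midpoint-rule quadrature over [0, a2]
    h = a2 / n
    return sum(a1 * ((i + 0.5) * h) ** 2 * h for i in range(n))

a1, a2 = 2.0, 1.5
R_numeric = response(a1, a2)
R_exact = a1 * a2 ** 3 / 3.0

# Sensitivity of R to the boundary parameter a2 (exact value: a1 * a2**2)
eps = 1e-4
dR_da2 = (response(a1, a2 + eps) - response(a1, a2 - eps)) / (2 * eps)

print(abs(R_numeric - R_exact) < 1e-6)    # True
print(abs(dR_da2 - a1 * a2 ** 2) < 1e-3)  # True
```

This toy case also shows why uncertain boundaries matter: the response is sensitive to the endpoint $\omega(\boldsymbol{\alpha})$ itself, not only to parameters inside the integrand.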

3. Motivation for Computing Exact Expressions of High-Order Response Sensitivities to Parameters

Historically, the concept of "sensitivity analysis" arose from the need to predict and quantify the changes induced in a model's responses by changes in the model parameters and to rank the importance of the model's parameters in affecting the model's responses. Initially, such predictions were performed by simply recomputing the result of interest using the parameter variations of interest. Evidently, the results provided by such re-computations are specific to the respective parameter variation. Furthermore, such re-computations become unfeasible for large-scale models with many parameters. Nevertheless, such re-computations became the basis for computing approximately, using finite differences, the various partial derivatives of the response with respect to the model parameters, which played the role of "sensitivities of responses with respect to parameters". Subsequently, statisticians introduced various quantities (e.g., "measures of sensitivities" and/or "sensitivity indicators") to play the role of "sensitivities", based on statistical quantities related to the variance of the response that might be induced by the variances (assumed to be known) of the model parameters. These statistical measures, and the methodologies used to compute them, will also be mentioned briefly in this Section.
Mathematically, the only unambiguous concept for defining the “sensitivity of a function to its underlying parameters” is in terms of the partial derivatives of the respective function with respect to its underlying parameters. Thus, the “first-order sensitivities” of a model response are the first-order partial functional derivatives of the response with respect to the parameters; the “second-order sensitivities” are the second-order partial functional derivatives of the response with respect to the parameters; and so on; the “nth-order sensitivities” are the nth-order partial functional derivatives of the model response with respect to the model parameters.
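As a concrete illustration of these definitions (using an assumed two-parameter toy response, not any model from this work), the Python sketch below approximates a first-order and a mixed second-order sensitivity by central finite differences and checks them against the exact partial derivatives; note that each finite-difference estimate requires several model evaluations, which is the root of the cost discussed later:

```python
# Toy response (assumed for illustration only): R(a1, a2) = exp(a1) * a2**2.
import math

def R(a1, a2):
    return math.exp(a1) * a2 ** 2

a1, a2, h = 0.5, 2.0, 1e-5

# First-order sensitivity dR/da1 (exact: exp(a1) * a2**2), 2 model evaluations
S1 = (R(a1 + h, a2) - R(a1 - h, a2)) / (2 * h)

# Mixed second-order sensitivity d2R/(da1 da2) (exact: 2 * exp(a1) * a2),
# 4 model evaluations
S2 = (R(a1 + h, a2 + h) - R(a1 + h, a2 - h)
      - R(a1 - h, a2 + h) + R(a1 - h, a2 - h)) / (4 * h ** 2)

print(abs(S1 - math.exp(a1) * a2 ** 2) < 1e-6)  # True
print(abs(S2 - 2 * math.exp(a1) * a2) < 1e-4)   # True
```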
There are three major areas of scientific activities that are built on the availability of response sensitivities to model parameters as follows: (i) sensitivity analysis; (ii) uncertainty quantification; and (iii) predictive modeling, which combines measured and computational information to obtain best-estimate predictions of model responses and parameters, with reduced predicted uncertainties. The current state-of-the-art in these three areas of scientific activities will be reviewed briefly in this Section.

3.1. High-Order Sensitivity Analysis

The scope of "sensitivity analysis" is to predict and quantify the changes induced in the model's response by changes in the model parameters and to rank the importance of the model's parameters in affecting the model's response. Mathematically, the soundest concept for predicting unambiguously the changes induced in a function by changes in its underlying parameters is provided by the multivariate Taylor series of the function with respect to its component parameters; the sensitivities defined above (i.e., the nth-order partial functional derivatives of the response with respect to the parameters) are precisely the coefficients of this series. Since the expected (or nominal) parameter values $\boldsymbol{\alpha}^0$ are known, the multivariate Taylor series expansion of a model response, denoted as $R_k(\boldsymbol{\alpha})$, as a function of the model parameters $\boldsymbol{\alpha} \triangleq (\alpha_1,\dots,\alpha_{TP}) \in \mathbb{R}^{TP}$ around $\boldsymbol{\alpha}^0$ has the following well-known formal expression:
$$R_k(\boldsymbol{\alpha}) = R_k(\boldsymbol{\alpha}^0) + \sum_{j_1=1}^{TP}\left\{\frac{\partial R_k(\boldsymbol{\alpha})}{\partial\alpha_{j_1}}\right\}_{\boldsymbol{\alpha}^0}\delta\alpha_{j_1} + \frac{1}{2}\sum_{j_1=1}^{TP}\sum_{j_2=1}^{TP}\left\{\frac{\partial^2 R_k(\boldsymbol{\alpha})}{\partial\alpha_{j_1}\partial\alpha_{j_2}}\right\}_{\boldsymbol{\alpha}^0}\delta\alpha_{j_1}\delta\alpha_{j_2} + \frac{1}{3!}\sum_{j_1=1}^{TP}\sum_{j_2=1}^{TP}\sum_{j_3=1}^{TP}\left\{\frac{\partial^3 R_k(\boldsymbol{\alpha})}{\partial\alpha_{j_1}\partial\alpha_{j_2}\partial\alpha_{j_3}}\right\}_{\boldsymbol{\alpha}^0}\delta\alpha_{j_1}\delta\alpha_{j_2}\delta\alpha_{j_3} + \frac{1}{4!}\sum_{j_1=1}^{TP}\sum_{j_2=1}^{TP}\sum_{j_3=1}^{TP}\sum_{j_4=1}^{TP}\left\{\frac{\partial^4 R_k(\boldsymbol{\alpha})}{\partial\alpha_{j_1}\partial\alpha_{j_2}\partial\alpha_{j_3}\partial\alpha_{j_4}}\right\}_{\boldsymbol{\alpha}^0}\delta\alpha_{j_1}\delta\alpha_{j_2}\delta\alpha_{j_3}\delta\alpha_{j_4} + \cdots,\qquad(4)$$
where $\delta\alpha_j \triangleq (\alpha_j - \alpha_j^0),\ j=1,\dots,TP$. Since the explicit expression of $R_k(\boldsymbol{\alpha})$ as a function of the model parameters $\boldsymbol{\alpha}$ is not available, the explicit expressions of the sensitivities of $R_k(\boldsymbol{\alpha})$ are not available for numerical evaluation. It is evident from Equation (4) that, for a model comprising a total of $TP$ parameters, each model response has $TP$ first-order sensitivities, $TP(TP+1)/2$ distinct second-order sensitivities, $TP(TP+1)(TP+2)/6$ distinct third-order sensitivities, and so on. The number of sensitivities increases exponentially with the order of the respective sensitivities; hence, their computation by conventional methods (e.g., finite difference schemes, statistical assessments), which require re-computations using the original model (with suitably altered parameter values), is hampered by the curse of dimensionality [5], while providing approximate (often inaccurate), rather than exact, values for the sensitivities. The only extant method that overcomes, to the largest possible extent, the curse of dimensionality while providing exact expressions for efficiently computing sensitivities of all orders is the "nth-Order Comprehensive Adjoint Sensitivity Analysis Methodology (nth-CASAM)" developed by Cacuci [21,23] by extending the original adjoint sensitivity analysis methodology conceived by Cacuci [1,2]. The nth-CASAM requires computations in linearly increasing higher-dimensional Hilbert spaces, as opposed to exponentially increasing parameter-dimensional spaces. In particular, for a scalar-valued response of a model comprising $TP$ parameters, the nth-CASAM requires one adjoint computation for computing exactly all of the first-order response sensitivities, as opposed to at least $TP$ forward computations, as required by other methods to obtain approximate values for these sensitivities.
All of the (mixed) second-order sensitivities are computed exactly by the nth-CASAM in at most $TP$ computations, as opposed to at least $TP(TP+1)/2$ computations, as required by all other methods, and so on. For every lower-order sensitivity of interest, the nth-CASAM computes the $TP$ next-higher-order sensitivities in one adjoint computation performed in a linearly increasing higher-dimensional Hilbert space. The nth-CASAM applies to any model (deterministic, statistical, etc.) and is also applicable to the computation of sensitivities of operator-valued responses, which cannot be obtained by statistical methods when the underlying problem comprises many parameters. The larger the number of model parameters, the more efficient the nth-CASAM becomes for computing arbitrarily high-order response sensitivities.
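The counts of distinct sensitivities quoted above follow from combinations with repetition: since mixed partial derivatives are symmetric in their indices, a model with $TP$ parameters has $\binom{TP+n-1}{n}$ distinct nth-order sensitivities. A minimal Python check (illustrative only):

```python
# Number of distinct nth-order sensitivities of one response for a model
# with TP parameters: combinations with repetition, C(TP + n - 1, n).
from math import comb

def distinct_sensitivities(TP, n):
    return comb(TP + n - 1, n)

TP = 100
print(distinct_sensitivities(TP, 1))  # 100    = TP
print(distinct_sensitivities(TP, 2))  # 5050   = TP*(TP+1)/2
print(distinct_sensitivities(TP, 3))  # 171700 = TP*(TP+1)*(TP+2)/6
```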
The practitioners of statistical methods cavalierly downplay the importance of the Taylor series by labeling it as being "local" (as opposed to "global", as they usually characterize the statistical methods). In reality, the range of validity of the Taylor series is provided by its radius of convergence. The accuracy—as opposed to the "validity"—of the Taylor series in predicting the value of the response at an arbitrary point in the phase space of model parameters depends on the order of sensitivities retained in the Taylor expansion: the higher the respective order, the more accurate the respective response value predicted by the Taylor series. In the particular cases when the response happens to be a polynomial function of the model parameters, the Taylor series is actually exact (not only "non-local"). In contradistinction, no statistical method is ever "exact".
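The exactness claim for polynomial responses is easy to verify numerically; in the sketch below (a one-parameter cubic response, chosen only for illustration), the third-order Taylor expansion about $\alpha^0 = 2$ reproduces the response exactly even at the distant point $\alpha = 10$:

```python
# Third-order Taylor expansion of the polynomial response R(a) = a**3 about a0.
# Derivatives at a0: R = a0**3, R' = 3*a0**2, R'' = 6*a0, R''' = 6.
a0, a = 2.0, 10.0
d = a - a0

taylor3 = a0**3 + 3*a0**2*d + 0.5*6*a0*d**2 + (1/6)*6*d**3

print(taylor3)  # 1000.0
print(a**3)     # 1000.0
```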
The need for the computation of higher-order sensitivities has been amply illustrated by Cacuci and Fang [22], using as a paradigm model the polyethylene-reflected plutonium (acronym "PERP") reactor physics benchmark [15], which is included in the Nuclear Energy Agency (NEA) International Criticality Safety Benchmark Evaluation Project (ICSBEP). The composition and dimensions of this benchmark are summarized in Appendix A. The distribution of neutrons within the benchmark is modeled by the linear time-independent inhomogeneous neutron transport (Boltzmann) equation (see Appendix A), which is solved numerically (after discretization in the energy, spatial, and angular independent variables) using the software packages PARTISN [25] and SOURCES4C [26]. The numerical model of the PERP benchmark includes 21,976 parameters, of which the following 7477 parameters have non-zero (but imprecisely known) nominal values: 180 group-averaged total microscopic cross-sections; 120 fission process parameters; 60 fission spectrum parameters; 10 parameters describing the experiment's nuclear sources; 6 isotopic number densities; and 7101 non-zero group-averaged scattering microscopic cross-sections (the remaining scattering cross-sections, out of a total of 21,600, have zero nominal values). Solving the Boltzmann equation with this many parameters is representative of a "large-scale computation".
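As a simple bookkeeping check (illustrative only, not part of PARTISN or SOURCES4C), the six groups of non-zero parameters enumerated above indeed sum to 7477:

```python
# Inventory of the PERP benchmark's non-zero uncertain parameters, as
# enumerated in the text above.
groups = {
    "group-averaged total microscopic cross-sections": 180,
    "fission process parameters": 120,
    "fission spectrum parameters": 60,
    "nuclear source parameters": 10,
    "isotopic number densities": 6,
    "non-zero scattering microscopic cross-sections": 7101,
}
print(sum(groups.values()))  # 7477
```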
The response of interest for the PERP benchmark is the total neutron leakage from the PERP sphere (numerical value: $1.7648\times 10^{6}$ neutrons/s), which is plotted in histogram form as a function of energy in Figure 1 [22].
All of the first-order sensitivities of the leakage response with respect to the 7477 non-zero parameters were computed [22] using the First-Order Comprehensive Adjoint Sensitivity Analysis Methodology (1st-CASAM). The vast majority of these sensitivities had relative values below 10%, but 16 of them exceeded unity in absolute value; the largest of all of the first-order relative sensitivities was the sensitivity of the leakage response with respect to the total group cross-section of hydrogen (labeled isotope #6) in group 30, denoted as $\sigma_{t,6}^{30}$, which had the relative value $S^{(1)}(\sigma_{t,6}^{30}) = 9.366$. Among the benchmark's parameters, the total microscopic cross-sections had the largest impact on the leakage response; the sensitivities of this response with respect to the microscopic total cross-sections of hydrogen also had large values for energy groups 16–20 ($S^{(1)}(\sigma_{t,i=6}^{16}) = 1.164$; $S^{(1)}(\sigma_{t,i=6}^{17}) = 1.173$; $S^{(1)}(\sigma_{t,i=6}^{18}) = 1.141$; $S^{(1)}(\sigma_{t,i=6}^{19}) = 1.094$; $S^{(1)}(\sigma_{t,i=6}^{20}) = 1.033$). Also included in the top 16 were the (relative) sensitivities of the leakage response with respect to the total microscopic cross-sections of 239Pu (labeled isotope #1) in energy groups 12 and 13, which had the values $S^{(1)}(\sigma_{t,i=1}^{12}) = 1.320$ and $S^{(1)}(\sigma_{t,i=1}^{13}) = 1.154$, respectively.
The second-order sensitivities also must be computed in order to compare them with the values of the largest first-order sensitivities and consequently decide which, if any, of the second-order sensitivities must be retained for subsequent uses (e.g., for uncertainty quantification and/or data assimilation and predictive modeling). Therefore, all of the distinct (7477 × 7478/2) second-order sensitivities of the PERP benchmark's leakage response with respect to the benchmark's parameters were computed [9,10,11,12,13,14,22] by applying the Second-Order Comprehensive Adjoint Sensitivity Analysis Methodology (2nd-CASAM). It was thus established that ca. 1000 relative second-order sensitivities had absolute values larger than unity, with over 50 of them having absolute values between 10.0 and 100.0. The overall largest second-order relative sensitivity was the unmixed second-order sensitivity of the leakage response with respect to the total microscopic cross-section of hydrogen in energy group 30, which had the value $S^{(2)}(\sigma_{t,6}^{30},\sigma_{t,6}^{30}) = 429.6$.
These findings have motivated the computation [16,17,22] of the third-order sensitivities of the leakage response with respect to the benchmark's total cross-sections. It was found that 45,970 of these third-order sensitivities had absolute values between 1.0 and 10.0; 11,861 had absolute values between 10.0 and 100.0; and 1199 had absolute values larger than 100.0. All of the largest first-, second-, and third-order sensitivities involved the microscopic total cross-section for the lowest (30th) energy group of isotope 1H (i.e., $\sigma_{t,6}^{30}$). The largest overall third-order sensitivity is the mixed third-order sensitivity $S^{(3)}(\sigma_{t,1}^{g=30},\sigma_{t,6}^{g=30},\sigma_{t,6}^{g=30}) = 1.88\times 10^{5}$, which also involves the microscopic total cross-section for the 30th energy group of isotope 239Pu (i.e., $\sigma_{t,1}^{g=30}$). The total microscopic cross-sections of isotopes 1H and 239Pu are thus the two most important parameters affecting the PERP benchmark's leakage response, since they are involved in all of the large second- and third-order sensitivities.
These results subsequently motivated the computation [18,19,22] of the fourth-order sensitivities of the leakage response with respect to the total microscopic cross-section of hydrogen. Depending on the specific energy group, the values of the fourth-order relative sensitivities were found to be ca. 2 to 7 times larger than the corresponding third-order ones, ca. 5 to 52 times larger than the values of the corresponding second-order sensitivities, and ca. 8 to 220 times larger than the values of the corresponding first-order sensitivities. For illustrative purposes, the energy dependence of the first- through fourth-order sensitivities of the leakage response with respect to the total microscopic cross-sections of hydrogen is depicted in Figure 2 [22], which underscores the overwhelming importance of the second- and higher-order sensitivities.

3.2. High-Order Uncertainty Quantification

In practice, the model parameters are not known exactly, even though they are not bona fide random quantities. For practical purposes, however, these model parameters are considered to be variates that obey a multivariate probability distribution function, denoted as $p_{\boldsymbol{\alpha}}(\boldsymbol{\alpha})$. Although this distribution is seldom known, the various moments of $p_{\boldsymbol{\alpha}}(\boldsymbol{\alpha})$ can be defined in the standard manner by using the following notation for the "expected (or mean) value" of a function $u(\boldsymbol{\alpha})$ of the parameters, defined over the domain of definition of $p_{\boldsymbol{\alpha}}(\boldsymbol{\alpha})$:
$$E[u(\boldsymbol{\alpha})] \triangleq \int u(\boldsymbol{\alpha})\, p_{\boldsymbol{\alpha}}(\boldsymbol{\alpha})\, d\boldsymbol{\alpha}.\qquad(5)$$
In particular, the vector $\boldsymbol{\alpha}^0$ of expected values of the model parameters $\alpha_j$ is defined as follows:
$$\boldsymbol{\alpha}^0 \triangleq (\alpha_1^0,\dots,\alpha_{TP}^0),\quad \alpha_j^0 \triangleq E(\alpha_j),\quad j=1,\dots,TP.\qquad(6)$$
In practice, the expected values—which are taken as the nominal values for carrying out the computation of responses using the mathematical/numerical model—are known (or assumed to be known). In order to perform “uncertainty quantification/analysis” of the uncertainties induced in the responses by uncertainties in the parameters, it is necessary to also know the covariances, cov ( α i , α j ) , between two parameters, α i and α j , which are defined as follows:
$$\mathrm{cov}(\alpha_i, \alpha_j) \equiv E(\delta\alpha_i\, \delta\alpha_j) \equiv \rho_{ij}\, \sigma_i \sigma_j, \quad i, j = 1, \ldots, TP. \qquad (7)$$
In Equation (7), the quantities σ i and σ j denote the standard deviations of α i and α j , respectively, while ρ i j denotes the correlation between the respective parameters. Occasionally, the higher-order correlations between parameters can also be obtained. Formally, the third-order correlation, t j 1 j 2 j 3 , among three parameters is defined as follows:
$$t_{j_1 j_2 j_3}\, \sigma_{j_1} \sigma_{j_2} \sigma_{j_3} \equiv E(\delta\alpha_{j_1}\, \delta\alpha_{j_2}\, \delta\alpha_{j_3}); \quad j_1, j_2, j_3 = 1, \ldots, TP. \qquad (8)$$
The fourth-order correlation q j 1 j 2 j 3 j 4 among four parameters is defined as follows:
$$q_{j_1 j_2 j_3 j_4}\, \sigma_{j_1} \sigma_{j_2} \sigma_{j_3} \sigma_{j_4} \equiv E(\delta\alpha_{j_1}\, \delta\alpha_{j_2}\, \delta\alpha_{j_3}\, \delta\alpha_{j_4}); \quad j_1, j_2, j_3, j_4 = 1, \ldots, TP. \qquad (9)$$
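As a concrete illustration of these definitions, the moments in Equations (6)-(8) can be estimated from samples. The following minimal sketch uses synthetic, normally distributed data with assumed means, standard deviations, and correlation (not taken from the paper) to compute the sample mean, covariance/correlation, and the triple correlation t_112, which vanishes for a multivariate normal distribution:

```python
import random

# Sketch (synthetic data, assumed values, not from the paper): sample-based
# estimates of the parameter moments defined in Eqs. (6)-(8) for two
# correlated parameters with sigma_1 = 0.2, sigma_2 = 0.3, rho = 0.5.
random.seed(0)
N = 100_000
samples = []
for _ in range(N):
    z1, z2 = random.gauss(0.0, 1.0), random.gauss(0.0, 1.0)
    a1 = 1.0 + 0.2 * z1
    a2 = 2.0 + 0.3 * (0.5 * z1 + (1.0 - 0.25) ** 0.5 * z2)
    samples.append((a1, a2))

m1 = sum(a for a, _ in samples) / N                        # alpha_1^0, Eq. (6)
m2 = sum(b for _, b in samples) / N                        # alpha_2^0
cov12 = sum((a - m1) * (b - m2) for a, b in samples) / N   # Eq. (7)
s1 = (sum((a - m1) ** 2 for a, _ in samples) / N) ** 0.5
s2 = (sum((b - m2) ** 2 for _, b in samples) / N) ** 0.5
rho12 = cov12 / (s1 * s2)

# Third-order correlation t_{112} of Eq. (8); it vanishes for a
# multivariate normal distribution:
t112 = sum((a - m1) ** 2 * (b - m2) for a, b in samples) / (N * s1 ** 2 * s2)
```

The sample estimates recover the assumed standard deviations and correlation to within sampling noise, while the triple correlation is statistically indistinguishable from zero, as expected for normal variates.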
The Taylor series shown in Equation (4) can be used in conjunction with the definitions provided in Equations (5)-(9), as was first done by Tukey [27], to obtain expressions for the moments of the response distribution. Using the Taylor series shown in Equation (4), Cacuci [21] has presented expressions for the first six moments of the joint distribution of responses and parameters, up to and including sixth-order terms in the parameters' standard deviations.
The effects of high-order sensitivities on the moments (i.e., mean, variance, skewness) of the response distribution will be illustrated in this Section by considering the leakage response of the PERP reactor physics benchmark discussed in Section 3.1 above in the phase space of total microscopic cross-sections since the largest sensitivities of the leakage response are with respect to these parameters. A complete uncertainty analysis of the PERP leakage response can be found in the book by Cacuci and Fang [22].
The microscopic cross-sections will be considered to be uncorrelated and normally distributed in order to simplify the algebraic complexities. Under these conditions, the first three moments of the distribution of the leakage response are provided by the following expressions [22]:
(i)
The expected value of the leakage response has the following particular expression:
$$[E(L)]_t^{(U,N)} = L(\boldsymbol{\alpha}^0) + [E(L)]_t^{(2,U,N)} + [E(L)]_t^{(4,U,N)}, \qquad (10)$$
where the superscript “U,N” indicates contributions from uncorrelated and normally distributed parameters; the subscript t indicates group-averaged microscopic “total” cross-section, and the letter “L” denotes “leakage response”. In Equation (10), the contributions from the third-order (and all odd-order) sensitivities vanish because the parameters are uncorrelated. Furthermore, the quantity L ( α 0 ) represents the leakage response computed using the nominal cross-section values, and the quantities [ E ( L ) ] t ( 2 , U , N ) and [ E ( L ) ] t ( 4 , U , N ) denote the contributions from the second-order and fourth-order response sensitivities, respectively, which are provided by the following expressions:
$$[E(L)]_t^{(2,U,N)} = \frac{1}{2} \sum_{j_1=1}^{J_{\sigma t}} \frac{\partial^2 L(\boldsymbol{\alpha})}{\partial t_{j_1}^2}\, \sigma_{j_1}^2, \qquad (11)$$
$$[E(L)]_t^{(4,U,N)} = \frac{1}{8} \sum_{j_1=1}^{J_{\sigma t}} \frac{\partial^4 L(\boldsymbol{\alpha})}{\partial t_{j_1}^4}\, \sigma_{j_1}^4. \qquad (12)$$
In Equations (11) and (12), the quantity $t_{j_1}$ represents the $j_1$-th microscopic total cross-section, while the quantity $J_{\sigma t} = G \times I = 180$ denotes the total number of microscopic total cross-sections for the $G = 30$ energy groups and $I = 6$ isotopes contained in the PERP benchmark. The traditional notation "$\sigma$" for microscopic cross-sections is not used in the formulas presented in this section, since this notation is used here to denote standard deviations.
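To make Equations (10)-(12) concrete, the following minimal sketch evaluates them for an assumed toy response L(t) = exp[-(t1 + t2 + t3)], all of whose derivatives equal plus or minus L (this is an illustrative stand-in, not the PERP benchmark), and checks the truncated expansion against brute-force Monte Carlo sampling:

```python
import math
import random

# Sketch (toy response, assumed nominal values; not the PERP model): for
# L(t) = exp(-(t1 + t2 + t3)), every second and fourth derivative equals
# +L, so the corrections of Eqs. (11) and (12) are easy to evaluate.
t0 = [1.0, 0.5, 0.25]                 # assumed nominal parameter values
sig = [0.05 * t for t in t0]          # 5% relative standard deviations

L0 = math.exp(-sum(t0))
E2 = 0.5 * sum(L0 * s ** 2 for s in sig)            # Eq. (11): d2L/dt^2 = +L
E4 = (1.0 / 8.0) * sum(L0 * s ** 4 for s in sig)    # Eq. (12): d4L/dt^4 = +L
E_L = L0 + E2 + E4                                  # Eq. (10)

# Brute-force Monte Carlo estimate of the expected response:
random.seed(1)
N = 200_000
mc = sum(math.exp(-sum(t + random.gauss(0.0, s) for t, s in zip(t0, sig)))
         for _ in range(N)) / N
```

For 5% standard deviations, the truncated expansion and the sampled mean agree to well within the Monte Carlo noise, and the fourth-order correction is much smaller than the second-order one, consistent with the small-uncertainty behavior discussed below.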
(ii)
The variance of the leakage response for the PERP benchmark takes on the following particular form:
$$[\mathrm{var}(L)]_t^{(U,N)} = \sum_{i=1}^{4} [\mathrm{var}(L)]_t^{(i,U,N)}, \qquad (13)$$
where [ var   ( L ) ] t ( 1 , U , N ) , [ var   ( L ) ] t ( 2 , U , N ) , [ var   ( L ) ] t ( 3 , U , N ) and [ var   ( L ) ] t ( 4 , U , N ) denote the contributions of the terms involving the first-order through the fourth-order sensitivities, respectively, to the variance [ var   ( L ) ] t ( U , N ) and are defined by the following expressions:
$$[\mathrm{var}(L)]_t^{(1,U,N)} \equiv \sum_{j_1=1}^{J_{\sigma t}} \left[ \frac{\partial L(\boldsymbol{\alpha})}{\partial t_{j_1}} \right]^2 \sigma_{j_1}^2, \qquad (14)$$
$$[\mathrm{var}(L)]_t^{(2,U,N)} \equiv \frac{1}{2} \sum_{j_1=1}^{J_{\sigma t}} \sum_{j_2=1}^{J_{\sigma t}} \left[ \frac{\partial^2 L(\boldsymbol{\alpha})}{\partial t_{j_1} \partial t_{j_2}}\, \sigma_{j_1} \sigma_{j_2} \right]^2, \qquad (15)$$
$$[\mathrm{var}(L)]_t^{(3,U,N)} = \sum_{j_1=1}^{J_{\sigma t}} \sum_{j_2=1}^{J_{\sigma t}} \left[ \frac{\partial^3 L(\boldsymbol{\alpha})}{\partial t_{j_1} \partial t_{j_1} \partial t_{j_2}} \frac{\partial L(\boldsymbol{\alpha})}{\partial t_{j_2}} \right] \sigma_{j_1}^2 \sigma_{j_2}^2 + \frac{15}{36} \sum_{j_1=1}^{J_{\sigma t}} \left[ \frac{\partial^3 L(\boldsymbol{\alpha})}{\partial t_{j_1}^3} \right]^2 \sigma_{j_1}^6, \qquad (16)$$
$$[\mathrm{var}(L)]_t^{(4,U,N)} = \frac{1}{2} \sum_{j_1=1}^{J_{\sigma t}} \left[ \frac{\partial^4 L(\boldsymbol{\alpha})}{\partial t_{j_1}^4} \frac{\partial^2 L(\boldsymbol{\alpha})}{\partial t_{j_1}^2} \right] \sigma_{j_1}^6. \qquad (17)$$
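The variance contributions of Equations (14)-(17) can be checked on a one-parameter toy response L(t) = exp(-t) (an assumption made for illustration only, not the benchmark), whose exact lognormal variance is known in closed form:

```python
import math

# Sketch (one-parameter toy response L(t) = exp(-t), an assumption, not
# the PERP benchmark): evaluating the variance contributions of
# Eqs. (14)-(17) and comparing their sum with the exact variance of the
# resulting lognormal response.
t0, sig = 1.0, 0.1
L0 = math.exp(-t0)
d1, d2, d3, d4 = -L0, L0, -L0, L0     # derivatives of exp(-t) at t0

var1 = d1 ** 2 * sig ** 2                                        # Eq. (14)
var2 = 0.5 * (d2 * sig ** 2) ** 2                                # Eq. (15)
var3 = d3 * d1 * sig ** 4 + (15.0 / 36.0) * d3 ** 2 * sig ** 6   # Eq. (16)
var4 = 0.5 * d4 * d2 * sig ** 6                                  # Eq. (17)
var_total = var1 + var2 + var3 + var4

# Exact variance of L0 * exp(-eps) with eps ~ N(0, sig^2):
w = math.exp(sig ** 2)
var_exact = L0 ** 2 * w * (w - 1.0)
```

For a 10% standard deviation, the four truncated contributions reproduce the exact variance to better than 0.1%, with the first-order term dominating, as the discussion of Table 1 below also illustrates.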
(iii)
The third-order moment of the leakage response for the PERP benchmark takes on the following particular form:
$$[\mu_3(L)]_t^{(U,N)} = \sum_{i=1}^{4} [\mu_3(L)]_t^{(i,U,N)}, \qquad (18)$$
where $[\mu_3(L)]_t^{(1,U,N)}$, $[\mu_3(L)]_t^{(2,U,N)}$, $[\mu_3(L)]_t^{(3,U,N)}$, and $[\mu_3(L)]_t^{(4,U,N)}$ denote the contributions to $[\mu_3(L)]_t^{(U,N)}$ of the terms involving the first-order through the fourth-order sensitivities, respectively. The purely first-order contribution, $[\mu_3(L)]_t^{(1,U,N)}$, vanishes for uncorrelated, normally distributed parameters (since the odd-order parameter moments vanish); the remaining contributions have the following expressions:
$$[\mu_3(L)]_t^{(2,U,N)} = 3 \sum_{j_1=1}^{J_{\sigma t}} \sum_{j_2=1}^{J_{\sigma t}} \frac{\partial L(\boldsymbol{\alpha})}{\partial t_{j_1}} \frac{\partial L(\boldsymbol{\alpha})}{\partial t_{j_2}} \frac{\partial^2 L(\boldsymbol{\alpha})}{\partial t_{j_1} \partial t_{j_2}} (\sigma_{j_1} \sigma_{j_2})^2 + \sum_{j_1=1}^{J_{\sigma t}} \left[ \frac{\partial^2 L(\boldsymbol{\alpha})}{\partial t_{j_1}^2} \right]^3 \sigma_{j_1}^6, \qquad (19)$$
$$[\mu_3(L)]_t^{(3,U,N)} = 6 \sum_{j_1=1}^{J_{\sigma t}} \frac{\partial L(\boldsymbol{\alpha})}{\partial t_{j_1}} \frac{\partial^2 L(\boldsymbol{\alpha})}{\partial t_{j_1}^2} \frac{\partial^3 L(\boldsymbol{\alpha})}{\partial t_{j_1}^3}\, \sigma_{j_1}^6, \qquad (20)$$
$$[\mu_3(L)]_t^{(4,U,N)} = \frac{3}{2} \sum_{j_1=1}^{J_{\sigma t}} \left[ \frac{\partial L(\boldsymbol{\alpha})}{\partial t_{j_1}} \right]^2 \frac{\partial^4 L(\boldsymbol{\alpha})}{\partial t_{j_1}^4}\, \sigma_{j_1}^6. \qquad (21)$$
The skewness, denoted as γ 1 ( L ) , of the response L ( α ) indicates the degree of the distribution’s asymmetry with respect to its mean and is defined as follows:
$$[\gamma_1(L)]_t^{(U,N)} = [\mu_3(L)]_t^{(U,N)} \Big/ \left\{ [\mathrm{var}(L)]_t^{(U,N)} \right\}^{3/2}. \qquad (22)$$
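Similarly, the third-moment contributions of Equations (19)-(21) and the skewness of Equation (22) can be verified on the same one-parameter toy response L(t) = exp(-t) (again an illustrative assumption, not the benchmark); the truncated third moment agrees closely with the exact lognormal value:

```python
import math

# Sketch (one-parameter toy response L(t) = exp(-t), an illustrative
# assumption): third-moment contributions per Eqs. (19)-(21) and the
# skewness of Eq. (22), checked against the exact third central moment of
# the lognormal response.
t0, sig = 1.0, 0.1
L0 = math.exp(-t0)
d1, d2, d3, d4 = -L0, L0, -L0, L0     # derivatives of exp(-t) at t0

mu3_2 = 3 * d1 * d1 * d2 * sig ** 4 + d2 ** 3 * sig ** 6     # Eq. (19)
mu3_3 = 6 * d1 * d2 * d3 * sig ** 6                          # Eq. (20)
mu3_4 = 1.5 * d1 ** 2 * d4 * sig ** 6                        # Eq. (21)
mu3 = mu3_2 + mu3_3 + mu3_4    # first-order contribution vanishes (normal)

var = (d1 ** 2 * sig ** 2 + 0.5 * (d2 * sig ** 2) ** 2
       + d3 * d1 * sig ** 4 + (15.0 / 36.0) * d3 ** 2 * sig ** 6
       + 0.5 * d4 * d2 * sig ** 6)                           # Eqs. (14)-(17)
skew = mu3 / var ** 1.5                                      # Eq. (22)

# Exact third central moment of L0 * exp(-eps), eps ~ N(0, sig^2):
w = math.exp(sig ** 2)
mu3_exact = L0 ** 3 * w ** 1.5 * (w - 1.0) ** 2 * (w + 2.0)
```

The positive skewness of this toy response mirrors the positive skewness of the PERP leakage response noted below.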
Using Equations (10)-(22), the effects of the sensitivities of various orders on the leakage response's expectation, variance, and skewness have been quantified by considering uniform relative standard deviations of 1% (small) and 5% (moderate), respectively, for the microscopic total cross-sections. These results are presented in Table 1 and Table 2, respectively.
The results presented in Table 1 were obtained by considering a small relative standard deviation of 1% for each of the uncorrelated microscopic total cross-sections of the isotopes included in the PERP benchmark. The effects of the second- and fourth-order sensitivities on the expected response value $[E(L)]_t^{(U,N)}$ are both negligibly small, since $[E(L)]_t^{(2,U,N)} \approx 2.5\% \times [E(L)]_t^{(U,N)}$ and $[E(L)]_t^{(4,U,N)} \approx 0.3\% \times [E(L)]_t^{(U,N)}$. The results presented in the second column of Table 1 imply that the first- through fourth-order sensitivities contribute ca. 70%, 6%, 20%, and 4%, respectively, to the response variance $[\mathrm{var}(L)]_t^{(U,N)}$, indicating that the contributions stemming from the first-order sensitivities clearly dominate those stemming from the higher-order sensitivities. The results presented in Table 1 also indicate that $[\mu_3(L)]_t^{(2,U,N)} \approx 53\% \times [\mu_3(L)]_t^{(U,N)}$, $[\mu_3(L)]_t^{(3,U,N)} \approx 31\% \times [\mu_3(L)]_t^{(U,N)}$, and $[\mu_3(L)]_t^{(4,U,N)} \approx 16\% \times [\mu_3(L)]_t^{(U,N)}$; thus, the contributions to the third-order response moment $[\mu_3(L)]_t^{(U,N)}$ stemming from the second-order sensitivities are the largest, followed by the contributions stemming from the third-order sensitivities, while the contributions stemming from the fourth-order sensitivities are the smallest.
The response skewness, [ γ 1 ( L ) ] t ( U , N ) , is positive, causing the leakage response distribution to be skewed toward the positive direction from its expected value.
Table 2 presents results obtained by considering a moderate relative standard deviation of 5% for each of the uncorrelated microscopic total cross-sections. These results show that $[E(L)]_t^{(2,U,N)} \approx 65\% \times L(\boldsymbol{\alpha}^0) \approx 17\% \times [E(L)]_t^{(U,N)}$, indicating that the contributions from the second-order sensitivities to the expected response amount to around 65% of the computed leakage value $L(\boldsymbol{\alpha}^0)$ and to around 17% of the expected value $[E(L)]_t^{(U,N)}$ of the leakage response. Furthermore, the results presented in Table 2 also imply that $[E(L)]_t^{(4,U,N)} \approx 213\% \times L(\boldsymbol{\alpha}^0) \approx 56\% \times [E(L)]_t^{(U,N)}$, indicating that the contributions from the fourth-order sensitivities to the expected response are about 2.1 times the computed leakage value $L(\boldsymbol{\alpha}^0)$ and amount to around 56% of the expected value $[E(L)]_t^{(U,N)}$. Therefore, if the computed value $L(\boldsymbol{\alpha}^0)$ were taken to be the actual expected value of the leakage response, neglecting the fourth-order sensitivities would produce an error of ca. 210%.
For a typical relative standard deviation of 5% for the uncorrelated microscopic total cross-sections, the results presented in Table 2 indicate that [ var   ( L ) ] t ( 1 , U , N ) 2 % × [ var   ( L ) ] t ( U , N ) ,   [ var   ( L ) ] t ( 2 , U , N ) 3 % × [ var   ( L ) ] t ( U , N ) , [ var   ( L ) ] t ( 3 , U , N ) 43 % × [ var   ( L ) ] t ( U , N ) , and [ var   ( L ) ] t ( 4 , U , N ) 52 % × [ var   ( L ) ] t ( U , N ) , which means that the contributions from the third- and fourth-order sensitivities to the response variance are remarkably larger than those from the first- and second-order ones. The results in Table 2 also show that [ μ 3   ( L ) ] t ( 2 , U , N ) 10 % × [ μ 3   ( L ) ] t ( U , N ) , [ μ 3   ( L ) ] t ( 3 , U , N ) 60 % × [ μ 3   ( L ) ] t ( U , N ) , and [ μ 3   ( L ) ] t ( 4 , U , N ) 30 % × [ μ 3   ( L ) ] t ( U , N ) ; thus, the contributions from the third-order sensitivities are the largest (e.g., around 60%), followed by the contributions from the fourth-order sensitivities, which contribute about 30%; the smallest contributions stem from the second-order sensitivities.
The results shown in Table 1 and Table 2 highlight the fact that the successively higher-order terms become increasingly less important for small (1%) parameter uncertainties but become increasingly more important for larger (e.g., 5%) parameter uncertainties. Applying the ratio test for the convergence of infinite series to the third- and fourth-order sensitivities indicates that a uniform relative parameter variation of ca. 5% (which would correspond to a 5% uniform relative standard deviation for the model parameters) would yield a ratio of about unity, which is near the boundary of the domain of convergence of the Taylor series expansion shown in Equation (4). Consequently, if the relative standard deviations of parameters that have large sensitivities (e.g., the cross-sections of hydrogen 1H and/or plutonium 239Pu in the case of the PERP benchmark) were larger than 5%, then the respective standard deviations would need to be reduced by re-calibrating the respective parameters. Such a re-calibration could be performed by using measurements of parameters and/or responses in conjunction with the high-order predictive modeling methodology developed by Cacuci [28,29], which will be briefly reviewed in the next subsection.
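The ratio test for the Taylor series in Equation (4) can be sketched as follows, using hypothetical relative sensitivities whose orders of magnitude are chosen purely for illustration (they are not the benchmark's actual values): the n-th term behaves like S_n * delta^n / n!, so convergence requires the ratio of successive terms to stay below unity.

```python
from math import factorial

# Sketch of the ratio test (hypothetical |S_n| values, assumed for
# illustration only; not the benchmark's actual sensitivities).
S = {1: 1.0e1, 2: 2.0e2, 3: 6.0e3, 4: 4.8e5}   # assumed relative sensitivities
delta = 0.05                                   # 5% relative parameter variation

term = {n: S[n] * delta ** n / factorial(n) for n in S}
ratio_43 = term[4] / term[3]    # equals (S_4 / S_3) * delta / 4
```

With these assumed values, a 5% relative parameter variation makes the ratio of the fourth- to the third-order Taylor term exactly unity, i.e., the expansion sits on the boundary of its convergence domain; larger standard deviations would cause the successive terms to grow.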

3.3. High-Order Predictive Modeling

Consider that the total number of computed responses and, correspondingly, the total number of experimentally measured responses is $TR$. The information usually available regarding the distribution of such measured responses comprises the first-order moments (mean values), denoted as $r_i^e$, $i = 1, \ldots, TR$, and the second-order moments (variances/covariances), denoted as $\mathrm{cov}(r_i, r_j)^e$, $i, j = 1, \ldots, TR$, of the measured responses. The letter "e" will be used either as a subscript or a superscript to indicate experimentally measured quantities. The expected values of the experimentally measured responses will be considered to constitute the components of a vector denoted as $\mathbf{R}^e \equiv (R_1^e, \ldots, R_{TR}^e)$. The covariances of the measured responses are considered to be the components of the $TR \times TR$-dimensional covariance matrix of measured responses, denoted as $\mathbf{C}_{rr}^e \equiv [\mathrm{cov}(r_i, r_j)^e]_{TR \times TR}$. In principle, it is also possible to obtain correlations between some measured responses and some model parameters. When such correlations between measured responses and measured model parameters are available, they will be denoted as $\mathrm{cor}(\alpha_i, r_j)^e$, $i = 1, \ldots, TP$; $j = 1, \ldots, TR$, and they can formally be considered the elements of a rectangular correlation matrix denoted as $\mathbf{C}_{\alpha r}^e \equiv [\mathrm{cor}(\alpha_i, r_j)^e]_{TP \times TR}$. As discussed in Section 3, cf. Equations (6) and (7), the model parameters are characterized by the vector of mean values $\boldsymbol{\alpha}^0 \equiv [\alpha_1^0, \ldots, \alpha_{TP}^0]$ and the covariance matrix $\mathbf{C}_{\alpha\alpha} \equiv [\mathrm{cov}(\alpha_i, \alpha_j)]_{TP \times TP}$.
Cacuci [28,29] has applied the MaxEnt principle [30] to construct the least informative (and hence, most conservative) distribution that includes all of the available computational and experimental information up to fourth-order sensitivities, thus obtaining the following expressions for the “best-estimate” (indicated by the superscript “be”) predicted responses and calibrated model parameters:
(i)
Optimally predicted “best-estimate” values for the predicted model responses R b e ( R 1 b e , , R T R b e ) :
$$\mathbf{R}^{be} = \mathbf{R}^e + \mathbf{C}_{rr}^e (\mathbf{C}_{rr}^e + \mathbf{C}_{rr}^c)^{-1} [\mathbf{E}^c(\mathbf{R}) - \mathbf{R}^e]. \qquad (23)$$
The vector $\mathbf{E}^c(\mathbf{R}) \equiv [E(R_1), \ldots, E(R_{TR})]$ in Equation (23) has components $E(R_k)$, $k = 1, \ldots, TR$, each of which denotes the expected value of a generic computed response $R_k(\boldsymbol{\alpha})$ and is obtained from Equation (4) in the following form, up to and including fourth-order terms in the parameters' standard deviations:
$$E(R_k) = R_k(\boldsymbol{\alpha}^0) + \frac{1}{2} \sum_{j_1=1}^{TP} \sum_{j_2=1}^{TP} \left\{ \frac{\partial^2 R_k(\boldsymbol{\alpha})}{\partial \alpha_{j_1} \partial \alpha_{j_2}} \right\}_{\boldsymbol{\alpha}^0} \rho_{j_1 j_2}\, \sigma_{j_1} \sigma_{j_2} + \frac{1}{3!} \sum_{j_1=1}^{TP} \sum_{j_2=1}^{TP} \sum_{j_3=1}^{TP} \left\{ \frac{\partial^3 R_k(\boldsymbol{\alpha})}{\partial \alpha_{j_1} \partial \alpha_{j_2} \partial \alpha_{j_3}} \right\}_{\boldsymbol{\alpha}^0} t_{j_1 j_2 j_3}\, \sigma_{j_1} \sigma_{j_2} \sigma_{j_3} + \frac{1}{4!} \sum_{j_1=1}^{TP} \sum_{j_2=1}^{TP} \sum_{j_3=1}^{TP} \sum_{j_4=1}^{TP} \left\{ \frac{\partial^4 R_k(\boldsymbol{\alpha})}{\partial \alpha_{j_1} \partial \alpha_{j_2} \partial \alpha_{j_3} \partial \alpha_{j_4}} \right\}_{\boldsymbol{\alpha}^0} q_{j_1 j_2 j_3 j_4}\, \sigma_{j_1} \sigma_{j_2} \sigma_{j_3} \sigma_{j_4} + \cdots \qquad (24)$$
The covariance matrix $\mathbf{C}_{rr}^c \equiv [\mathrm{cov}(R_k, R_\ell)]_{TR \times TR}$ of the computed responses in Equation (23) comprises as elements the covariances, $\mathrm{cov}(R_k, R_\ell)$, of two computed responses, $R_k(\boldsymbol{\alpha})$ and $R_\ell(\boldsymbol{\alpha})$, respectively. Using Equations (4) and (24), one obtains the following expression:
$$\mathrm{cov}(R_k, R_\ell) \equiv E\{[R_k(\boldsymbol{\alpha}) - E(R_k)][R_\ell(\boldsymbol{\alpha}) - E(R_\ell)]\} = \sum_{j_1=1}^{TP} \sum_{j_2=1}^{TP} \left\{ \frac{\partial R_k(\boldsymbol{\alpha})}{\partial \alpha_{j_1}} \frac{\partial R_\ell(\boldsymbol{\alpha})}{\partial \alpha_{j_2}} \right\}_{\boldsymbol{\alpha}^0} \rho_{j_1 j_2}\, \sigma_{j_1} \sigma_{j_2} + \frac{1}{2} \sum_{j_1=1}^{TP} \sum_{j_2=1}^{TP} \sum_{j_3=1}^{TP} \left\{ \frac{\partial^2 R_k(\boldsymbol{\alpha})}{\partial \alpha_{j_1} \partial \alpha_{j_2}} \frac{\partial R_\ell(\boldsymbol{\alpha})}{\partial \alpha_{j_3}} + \frac{\partial R_k(\boldsymbol{\alpha})}{\partial \alpha_{j_1}} \frac{\partial^2 R_\ell(\boldsymbol{\alpha})}{\partial \alpha_{j_2} \partial \alpha_{j_3}} \right\}_{\boldsymbol{\alpha}^0} t_{j_1 j_2 j_3}\, \sigma_{j_1} \sigma_{j_2} \sigma_{j_3} + \frac{1}{4} \sum_{j_1=1}^{TP} \sum_{j_2=1}^{TP} \sum_{j_3=1}^{TP} \sum_{j_4=1}^{TP} \left\{ \frac{\partial^2 R_k(\boldsymbol{\alpha})}{\partial \alpha_{j_1} \partial \alpha_{j_2}} \frac{\partial^2 R_\ell(\boldsymbol{\alpha})}{\partial \alpha_{j_3} \partial \alpha_{j_4}} \right\}_{\boldsymbol{\alpha}^0} (q_{j_1 j_2 j_3 j_4} - \rho_{j_1 j_2} \rho_{j_3 j_4})\, \sigma_{j_1} \sigma_{j_2} \sigma_{j_3} \sigma_{j_4} + \frac{1}{6} \sum_{j_1=1}^{TP} \sum_{j_2=1}^{TP} \sum_{j_3=1}^{TP} \sum_{j_4=1}^{TP} \left\{ \frac{\partial^3 R_k(\boldsymbol{\alpha})}{\partial \alpha_{j_1} \partial \alpha_{j_2} \partial \alpha_{j_3}} \frac{\partial R_\ell(\boldsymbol{\alpha})}{\partial \alpha_{j_4}} + \frac{\partial R_k(\boldsymbol{\alpha})}{\partial \alpha_{j_1}} \frac{\partial^3 R_\ell(\boldsymbol{\alpha})}{\partial \alpha_{j_2} \partial \alpha_{j_3} \partial \alpha_{j_4}} \right\}_{\boldsymbol{\alpha}^0} q_{j_1 j_2 j_3 j_4}\, \sigma_{j_1} \sigma_{j_2} \sigma_{j_3} \sigma_{j_4} + \cdots \qquad (25)$$
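The leading (first-order) term of Equation (25) is the familiar "sandwich rule" C_rr^c = S C_aa S-transpose, where S is the TR x TP matrix of first-order sensitivities. A minimal sketch with illustrative (assumed) numbers:

```python
# Sketch (illustrative sensitivities and covariances, assumed values):
# the first-order term of Eq. (25), i.e., the "sandwich rule"
# C_rr^c = S * C_aa * S^T for TR = 2 responses and TP = 3 parameters.
S = [[2.0, -1.0, 0.5],      # first-order sensitivities of response 1
     [0.0,  3.0, 1.0]]      # first-order sensitivities of response 2
C_aa = [[0.01, 0.0, 0.0],   # uncorrelated parameter covariance matrix
        [0.0, 0.04, 0.0],
        [0.0, 0.0, 0.09]]

TR, TP = len(S), len(S[0])
C_rr_c = [[sum(S[k][i] * C_aa[i][j] * S[l][j]
               for i in range(TP) for j in range(TP))
           for l in range(TR)] for k in range(TR)]
```

The resulting matrix is symmetric, and its off-diagonal element shows how parameter uncertainties induce correlations between responses even when the parameters themselves are uncorrelated.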
(ii)
Optimally predicted "best-estimate" values for the calibrated model parameters $\boldsymbol{\alpha}^{be}$:
$$\boldsymbol{\alpha}^{be} = \boldsymbol{\alpha}^0 - \mathbf{C}_{\alpha r}^c (\mathbf{C}_{rr}^e + \mathbf{C}_{rr}^c)^{-1} [\mathbf{E}^c(\mathbf{R}) - \mathbf{R}^e], \qquad (26)$$
where the components of the covariance matrix $\mathbf{C}_{\alpha r}^c \equiv [\mathrm{cov}(\alpha_i, R_k)]_{TP \times TR}$ of computed responses and parameters have the following expressions, up to and including fourth-order terms in the parameters' standard deviations, obtained by using Equations (4) and (24):
$$\mathrm{cov}(\alpha_i, R_k) \equiv E\{(\alpha_i - \alpha_i^0)[R_k(\boldsymbol{\alpha}) - E(R_k)]\} = \sum_{j_1=1}^{TP} \frac{\partial R_k(\boldsymbol{\alpha}^0)}{\partial \alpha_{j_1}}\, \rho_{i j_1}\, \sigma_i \sigma_{j_1} + \frac{1}{2} \sum_{j_1=1}^{TP} \sum_{j_2=1}^{TP} \frac{\partial^2 R_k(\boldsymbol{\alpha}^0)}{\partial \alpha_{j_1} \partial \alpha_{j_2}}\, t_{i j_1 j_2}\, \sigma_i \sigma_{j_1} \sigma_{j_2} + \frac{1}{6} \sum_{j_1=1}^{TP} \sum_{j_2=1}^{TP} \sum_{j_3=1}^{TP} \frac{\partial^3 R_k(\boldsymbol{\alpha}^0)}{\partial \alpha_{j_1} \partial \alpha_{j_2} \partial \alpha_{j_3}}\, q_{i j_1 j_2 j_3}\, \sigma_i \sigma_{j_1} \sigma_{j_2} \sigma_{j_3} + \cdots \qquad (27)$$
Since the components of the vector $\mathbf{E}^c(\mathbf{R})$ and of the matrices $\mathbf{C}_{rr}^c$ and $\mathbf{C}_{\alpha r}^c$ can contain arbitrarily high-order response sensitivities to the model parameters, the expressions presented in Equations (23) and (26) generalize the previous formulas of this type found in the data adjustment/assimilation procedures published to date (which contain at most second-order sensitivities). The best-estimate parameter values are the "calibrated model parameters", which can be used for subsequent computations with the "calibrated model";
(iii)
Optimally predicted "best-estimate" values for the covariance matrix, $\mathbf{C}_{rr}^{be}$, of the best-estimate responses $\mathbf{R}^{be} \equiv (R_1^{be}, \ldots, R_{TR}^{be})$:
$$\mathbf{C}_{rr}^{be} = \mathbf{C}_{rr}^e - \mathbf{C}_{rr}^e (\mathbf{C}_{rr}^e + \mathbf{C}_{rr}^c)^{-1} \mathbf{C}_{rr}^e. \qquad (28)$$
As indicated in Equation (28), the initial covariance matrix $\mathbf{C}_{rr}^e$ of the experimentally measured responses is multiplied by the matrix $[\mathbf{I} - (\mathbf{C}_{rr}^e + \mathbf{C}_{rr}^c)^{-1} \mathbf{C}_{rr}^e]$, which means that the variances contained on the diagonal of the best-estimate matrix $\mathbf{C}_{rr}^{be}$ will be smaller than the experimentally measured variances contained in $\mathbf{C}_{rr}^e$. Hence, the incorporation of experimental information reduces the predicted best-estimate response variances in $\mathbf{C}_{rr}^{be}$ by comparison to the measured variances contained a priori in $\mathbf{C}_{rr}^e$. Since the components of the matrix $\mathbf{C}_{rr}^{be}$ contain high-order sensitivities, the formula presented in Equation (28) generalizes the previous formulas of this type found in the data adjustment/assimilation procedures published to date;
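A minimal numerical sketch of the update in Equation (28), with illustrative 2 x 2 covariance matrices (assumed values), confirms that the best-estimate variances on the diagonal are always reduced relative to the measured ones:

```python
# Sketch (illustrative 2x2 covariances, assumed values): the update of
# Eq. (28), C_rr^be = C_rr^e - C_rr^e (C_rr^e + C_rr^c)^-1 C_rr^e,
# implemented with small hand-rolled 2x2 matrix helpers.
C_e = [[0.49, 0.10],
       [0.10, 0.25]]        # measured response covariance C_rr^e
C_c = [[0.81, 0.20],
       [0.20, 0.36]]        # computed response covariance C_rr^c

def madd(A, B):
    return [[a + b for a, b in zip(ra, rb)] for ra, rb in zip(A, B)]

def msub(A, B):
    return [[a - b for a, b in zip(ra, rb)] for ra, rb in zip(A, B)]

def mmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def inv2(A):
    det = A[0][0] * A[1][1] - A[0][1] * A[1][0]
    return [[A[1][1] / det, -A[0][1] / det],
            [-A[1][0] / det, A[0][0] / det]]

C_be = msub(C_e, mmul(mmul(C_e, inv2(madd(C_e, C_c))), C_e))   # Eq. (28)
```

Only the TR x TR matrix (C_rr^e + C_rr^c) is inverted, which is the single inversion noted later in this subsection; the result remains symmetric with strictly smaller, but still positive, diagonal entries.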
(iv)
Optimally predicted "best-estimate" values for the posterior parameter covariance matrix $\mathbf{C}_{\alpha\alpha}^{be}$ of the best-estimate parameters $\boldsymbol{\alpha}^{be}$:
$$\mathbf{C}_{\alpha\alpha}^{be} = \mathbf{C}_{\alpha\alpha} - \mathbf{C}_{\alpha r}^c (\mathbf{C}_{rr}^e + \mathbf{C}_{rr}^c)^{-1} (\mathbf{C}_{\alpha r}^c)^\dagger. \qquad (29)$$
The matrices $\mathbf{C}_{\alpha\alpha}$ and $\mathbf{C}_{\alpha r}^c (\mathbf{C}_{rr}^e + \mathbf{C}_{rr}^c)^{-1} (\mathbf{C}_{\alpha r}^c)^\dagger$ are symmetric and positive definite. Therefore, the subtraction indicated in Equation (29) implies that the elements on the main diagonal of $\mathbf{C}_{\alpha\alpha}^{be}$ must have smaller values than the corresponding elements on the main diagonal of $\mathbf{C}_{\alpha\alpha}$. In this sense, the combination of computational and experimental information has reduced the best-estimate parameter variances on the diagonal of $\mathbf{C}_{\alpha\alpha}^{be}$. Since the components of the matrices $\mathbf{C}_{\alpha\alpha}$, $\mathbf{C}_{\alpha r}^c$, and $\mathbf{C}_{rr}^c$ contain high-order response sensitivities, the formula presented in Equation (29) generalizes the previous formulas of this type found in the data adjustment/assimilation procedures published to date;
(v)
Optimally predicted “best-estimate” values for the posterior parameter correlation matrix C α r b e and/or its transpose C r α b e , for the best-estimate parameters and best-estimate responses:
$$\mathbf{C}_{\alpha r}^{be} = \mathbf{C}_{\alpha r}^e - \mathbf{C}_{\alpha r}^c (\mathbf{C}_{rr}^e + \mathbf{C}_{rr}^c)^{-1} \mathbf{C}_{rr}^e; \qquad (30)$$
$$\mathbf{C}_{r\alpha}^{be} = \mathbf{C}_{r\alpha}^e - \mathbf{C}_{rr}^e (\mathbf{C}_{rr}^e + \mathbf{C}_{rr}^c)^{-1} \mathbf{C}_{r\alpha}^c = (\mathbf{C}_{\alpha r}^{be})^\dagger. \qquad (31)$$
Since the components of the matrices C α r c and C r r c contain high-order sensitivities, the formulas presented in Equations (30) and (31) generalize the previous formulas of this type found in data adjustment/assimilation procedures published to date.
It is important to note from the results shown in Equations (23)-(31) that the computation of the best-estimate parameter and response values, together with their corresponding best-estimate covariance matrices, involves only a single matrix inversion, namely the computation of $(\mathbf{C}_{rr}^e + \mathbf{C}_{rr}^c)^{-1}$, which entails inverting a matrix of size $TR \times TR$. This is computationally very advantageous since $TR \ll TP$, i.e., the number of responses is much smaller than the number of model parameters in the overwhelming majority of practical situations. The guaranteed reduction in the uncertainties of the predicted responses (and model parameters), as well as the essential impact of the higher-order sensitivities, has recently been illustrated by Fang and Cacuci [31] by applying the second-BERRU-PM methodology [28,29] to the polyethylene-reflected plutonium OECD/NEA reactor physics benchmark, which was already described briefly in Section 3.1. The results predicted by using Equations (23) and (28) in conjunction with typical measurement uncertainties (relative standard deviation of 10%) and typical parameter uncertainties (relative standard deviation of 5%) for the total cross-sections (which are the model parameters considered in this illustrative example) are presented in Figure 3 and Figure 4, below. The leakage of neutrons through the outer sphere of the plutonium benchmark is denoted by the letter "L" in Figure 3 and Figure 4, in which the superscript "e" denotes experimentally measured quantities, while the superscript "be" denotes best-estimate (optimally predicted) quantities. The standard deviation of the experimentally measured response is denoted as $SD^{(e)}$; the standard deviation of the computed response is denoted as $SD^{(i)}$, and the "best-estimate" standard deviation of the best-estimate response is denoted as $SD^{(be,i)}$.
The values 1, 2, and 4 taken on by the index "i" in the superscripts of these standard deviations correspond, respectively, to the inclusion of only the first-order sensitivities, of the first- and second-order sensitivities, and of the first- through fourth-order sensitivities. The parameters (microscopic group-averaged total cross-sections) considered for obtaining the results depicted in Figure 3 and Figure 4 were taken to be uncorrelated and normally distributed, having uniform relative standard deviations of 5%.
The numerical results (in units of neutrons/s) that correspond to the graphs in Figure 3 and Figure 4 are as follows: $E(L^c) \pm SD^{(1)} = 1.765 \times 10^6 \pm 9.246 \times 10^5$; $L^{be} \pm SD^{(be,1)} = 5.093 \times 10^6 \pm 5.581 \times 10^5$; $E(L^c) \pm SD^{(2)} = 2.914 \times 10^6 \pm 1.629 \times 10^6$; $L^{be} \pm SD^{(be,2)} = 6.363 \times 10^6 \pm 6.431 \times 10^5$; $E(L^c) \pm SD^{(4)} = 6.681 \times 10^6 \pm 7.386 \times 10^6$; $L^{be} \pm SD^{(be,4)} = 6.997 \times 10^6 \pm 6.969 \times 10^5$; $L^e \pm SD^{(e)} = 7.000 \times 10^6 \pm 7.000 \times 10^5$.
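As a consistency check, the scalar (single-response) form of Equations (23) and (28) can be applied directly to the fourth-order values quoted above:

```python
import math

# Consistency check of the scalar form of Eqs. (23) and (28), using the
# fourth-order values quoted above (units: neutrons/s). For a single
# response, the best estimate is a variance-weighted combination of the
# measured and computed values.
E_Lc, SD_c = 6.681e6, 7.386e6      # E(L^c) and SD(4)
L_e,  SD_e = 7.000e6, 7.000e5      # L^e and SD(e)

var_e, var_c = SD_e ** 2, SD_c ** 2
L_be = L_e + var_e / (var_e + var_c) * (E_Lc - L_e)          # scalar Eq. (23)
SD_be = math.sqrt(var_e - var_e ** 2 / (var_e + var_c))      # scalar Eq. (28)
```

This reproduces the reported best-estimate values of ca. 6.997 x 10^6 +/- 6.969 x 10^5 neutrons/s, with the best-estimate response lying between E(L^c) and L^e and the predicted standard deviation smaller than both SD(e) and SD(4), as discussed in the conclusions below.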
The results depicted in Figure 3 and Figure 4 imply the following conclusions:
(i) The nominal value of the computed response, $L^c$, coincides with the expected value of the computed response, $E(L^c)$, only if all sensitivities higher than first-order are ignored. Otherwise, the effects of the second- and higher-order sensitivities cause the expected value of the computed response, $E(L^c)$, to be increasingly larger than the nominal computed value $L^c$, i.e., $E(L^c) > L^c$;
(ii) As indicated in Figure 3, the inclusion of only the first- and second-order sensitivities appears to indicate that the "measurements are inconsistent with the computations" since, in these cases, the standard deviation of the measured response does not overlap with the standard deviation of the computed response. However, this indication is proven to be false by the result depicted in Figure 4, which shows that the inclusion of the third- and fourth-order sensitivities renders the computation "consistent" with the measurement. This situation underscores the importance of including not only the second-order but also the higher-order (third- and fourth-order) sensitivities;
(iii) As predicted by the second-BERRU-PM methodology, the best-estimate response value, $L^{be}$, always falls between the expected value of the computed response, $E(L^c)$, and the experimentally measured value $L^e$, i.e., $E(L^c) < L^{be} < L^e$. As higher-order sensitivities are included, the predicted response value, $L^{be}$, approaches the experimentally measured value, $L^e$. Remarkably, all three of these quantities become clustered together, with $L^{be} \cong L^e$, when all sensitivities, from first to fourth order, are included;
(iv) It is also apparent that the predicted standard deviation of the predicted response is smaller than either the originally computed response standard deviation or the measured response standard deviation, i.e., S D ( b e , 1 ) < S D ( e ) and S D ( b e , 1 ) < S D ( 1 ) . This reduction in the magnitude of the predicted response standard deviation is guaranteed by the application of the second-BERRU-PM methodology; this reduction is even more accentuated when the higher (second-, third-, fourth-) order sensitivities are also included;
(v) The results depicted in Figure 3 and Figure 4 also indicate that the second-order response sensitivities must always be included, even if only to quantify the need for including (or not) the third- and/or fourth-order sensitivities;
(vi) As indicated in Figure 3 and Figure 4, the standard deviation of the computed response increases as sensitivities of increasingly higher order are incorporated, as would logically be expected. However, this fact has no negative consequences after the second-BERRU-PM methodology is applied to combine the computational results with the experimental results since, as shown in Figure 3 and Figure 4, the second-BERRU-PM methodology reduces the predicted best-estimate standard deviations to values that are smaller than both the computed and the experimentally measured values of the initial standard deviations. The results obtained confirm the fact that the second-BERRU-PM methodology predicts best-estimate results that fall in between the corresponding computed and measured values while reducing the predicted standard deviations of the predicted results to values smaller than either the experimentally measured or the computed values of the respective standard deviations.
The second-BERRU-PM methodology also yields best-estimate optimal values for the (posterior) calibrated parameters, as indicated in Equation (26), along with reduced predicted uncertainties (standard deviations) for the predicted parameters, as shown in Equation (29). The best-estimate calibrated parameters thus obtained are subsequently used in the respective calibrated computational models to compute best-estimate responses, which will have smaller uncertainties since the calibrated parameters have smaller uncertainties than the original uncalibrated parameters. Such subsequent computations yield results practically equivalent to the predicted response value L b e and the predicted best-estimate reduced standard deviation S D ( b e , 4 ) ; because of the page limitation, however, these results cannot be reproduced here.

Fundamental Advantages of the Second-BERRU-PM Methodology over the Second-Order Data Assimilation

This section describes the decisive advantages of the second-BERRU-PM methodology over the Second-Order Data Assimilation methodology, which relies on a second-order procedure to minimize a user-defined functional that is meant to represent, in a chosen norm (usually the energy norm), the discrepancies between computed and experimental results ("responses"). The "Second-Order Data Assimilation" [32,33] considers that the vector of measured responses ("observations"), denoted as $\mathbf{z} \equiv (z_1, \ldots, z_{TR})$, is a known function of the vector of state variables $\mathbf{u}(x) \equiv [u_1(x), \ldots, u_{TD}(x)]$ and the vector of errors $\mathbf{w} \equiv (w_1, \ldots, w_{TR})$, having the expression $\mathbf{z} = \mathbf{h}(\mathbf{u}) + \mathbf{w}$, where $\mathbf{u}$ denotes the vector of dependent variables and $\mathbf{h}(\mathbf{u}) \equiv [h_1(\mathbf{u}), \ldots, h_{TR}(\mathbf{u})]$ is a known vector-valued function of $\mathbf{u}$. The error term, $\mathbf{w}$, is considered here to include "representative errors" stemming from sampling and grid interpolation; the mean value of $\mathbf{w}$ corresponds to $\mathbf{r}^e \equiv (r_1^e, \ldots, r_{TR}^e)$, and the covariance matrix of $\mathbf{w}$ corresponds to $\mathbf{C}_{err}$. As described in [32,33], $\mathbf{w}$ is often considered to have the characteristics of "white noise", in which case $\mathbf{z} \sim N[\mathbf{h}(\mathbf{u}), \mathbf{C}_{err}]$ is normally distributed with mean $\mathbf{h}(\mathbf{u})$ and covariance $\mathbf{C}_{err}$. In addition, it is assumed that the prior "background" information is also known, being represented by a multivariate normal distribution with a known mean, denoted as $\mathbf{u}^b$, and a known covariance matrix, denoted as $\mathbf{B}$, i.e., $\mathbf{u} \sim N[\mathbf{u}^b, \mathbf{B}]$. The posterior distribution, $p(\mathbf{u}|\mathbf{z})$, is obtained by applying Bayes' theorem to the above information, which yields the result $p(\mathbf{u}|\mathbf{z}) \propto \exp[-J(\mathbf{u})]$, where $J(\mathbf{u}) \equiv \frac{1}{2} \{ [\mathbf{z} - \mathbf{h}(\mathbf{u})]^\dagger (\mathbf{C}_{err})^{-1} [\mathbf{z} - \mathbf{h}(\mathbf{u})] + (\mathbf{u} - \mathbf{u}^b)^\dagger \mathbf{B}^{-1} (\mathbf{u} - \mathbf{u}^b) \}$. The maximum a posteriori estimate is obtained by determining the minimum of the functional $J(\mathbf{u})$, which occurs at the root(s) of the following equation:
$$\mathbf{0} = \nabla J(\mathbf{u}) = \mathbf{B}^{-1}(\mathbf{u} - \mathbf{u}^b) - [\mathbf{D}\mathbf{h}(\mathbf{u})]^\dagger (\mathbf{C}_{err})^{-1} [\mathbf{z} - \mathbf{h}(\mathbf{u})], \qquad (32)$$
where D h ( u ) denotes the Jacobian matrix of h ( u ) with respect to the components of u .
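For a linear observation operator (an illustrative assumption; here h(u) = u1 + u2 with a single observation and a diagonal background covariance B), Equation (32) can be solved in closed form, which the following sketch verifies by evaluating the gradient at the analysis point:

```python
# Sketch (linear observation operator h(u) = u1 + u2, one observation,
# diagonal B; illustrative assumed values): closed-form solution of
# Eq. (32) and verification that the gradient vanishes at the analysis.
ub = [1.0, 2.0]            # background mean u^b
b = [0.25, 0.25]           # diagonal of background covariance B
r = 0.1                    # observation-error (co)variance C_err
z = 3.5                    # measured value of h(u) = u1 + u2

innov = z - (ub[0] + ub[1])            # innovation z - h(u^b)
g = innov / (r + b[0] + b[1])          # scalar gain factor
u_da = [ub[0] + b[0] * g, ub[1] + b[1] * g]

# Components of the gradient in Eq. (32), evaluated at u_da:
resid = z - (u_da[0] + u_da[1])
grad = [(u_da[0] - ub[0]) / b[0] - resid / r,
        (u_da[1] - ub[1]) / b[1] - resid / r]
```

Note that the gradient contains the inverse of C_err (the division by r), which is precisely the term that ceases to exist when C_err = 0, the failure mode discussed in point 2 below.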
The "first-order data assimilation" procedure solves Equation (32) by using a "partial quadratic approximation" to $J(\mathbf{u})$, while the "second-order data assimilation" procedure solves Equation (32) by using a "full quadratic approximation" to $J(\mathbf{u})$, as detailed in [32,33]; the resulting "optimal data assimilation solution" of Equation (32) is denoted here as $\mathbf{u}^{DA}$. The following fundamental differences become apparent by comparing the "Data Assimilation" result represented by Equation (32) with the Hi-BERRU-PM results.
1. Data assimilation (DA) is formulated conceptually [32,33] either only in the phase space of measured responses (“observation space formulation”) or only in the phase space of the model’s dependent variables (“state space formulation”). Hence, DA can calibrate initial conditions as “direct results” but cannot directly calibrate any other model parameters. In contradistinction, the second-BERRU-PM methodology is formulated conceptually in the most inclusive “joint phase space of parameters, computed and measured responses”. Consequently, the second-BERRU-PM methodology simultaneously calibrates responses and parameters, thus simultaneously providing results for forward and inverse problems;
2. If the experiments were perfectly well known, i.e., if $\mathbf{C}_{err} = \mathbf{0}$, Equation (32) indicates that the DA methodology fails fundamentally, since the inverse $(\mathbf{C}_{err})^{-1}$ would cease to exist. In contradistinction, Equations (23)-(31) indicate that the second-BERRU-PM methodology does not fail when $\mathbf{C}_{rr}^e = \mathbf{0}$ because, in any situation, $\mathbf{C}_{rr}^c \neq \mathbf{0}$;
3. The DA methodology also fails fundamentally when the response measurements happen to coincide with the computed values of the responses, i.e., when $\mathbf{z} = \mathbf{h}(\mathbf{u})$ at some point in the state space. In such a case, DA's Equation (32) yields the trivial result $\mathbf{u}^{DA} = \mathbf{u}^b$. In contradistinction, the Hi-BERRU-PM methodology does not yield such a trivial result when the response measurements happen to coincide with the computed values of the responses, i.e., when $r_k^e = r_k(\boldsymbol{\alpha}^0)$, because the difference $\mathbf{E}^c(\mathbf{r}) - \mathbf{r}^e$, which appears on the right sides of Equations (23), (26) and (28)-(31), remains non-zero due to the contributions of the second- and higher-order sensitivities of the responses with respect to the model parameters, since $\{E^c(r_k) - r_k^e\}_{r_k^e = r_k(\boldsymbol{\alpha}^0)} = \frac{1}{2} \sum_{i=1}^{TP} \sum_{j=1}^{TP} \{\partial^2 r_k(\boldsymbol{\alpha}) / \partial\alpha_i \partial\alpha_j\}_{\boldsymbol{\alpha}^0}\, \mathrm{cov}(\alpha_i, \alpha_j) + \cdots \neq 0$ for $k = 1, \ldots, TR$. This situation clearly underscores the need for computing and retaining (at least) the second-order response sensitivities to the model parameters. Although a situation in which $r_k^e = r_k(\boldsymbol{\alpha}^0)$ is not expected to occur frequently in practice, there are no negative consequences (should such a situation occur) if the second-BERRU-PM methodology is used, in contradistinction to using the DA methodology;
4. The second-BERRU-PM methodology requires only the inversion of the matrix $(\mathbf{C}_{rr}^e + \mathbf{C}_{rr}^c)$, of size $TR \times TR$. In contradistinction, the solution of the "first-order DA" requires the inversion of the Jacobian $\mathbf{D}\mathbf{h}(\mathbf{u})$ of $\mathbf{h}(\mathbf{u})$, while the solution of the "second-order DA" also requires the inversion of a matrix-vector product involving the Hessian matrix of $\mathbf{h}(\mathbf{u})$; these matrices are significantly larger [32,33] than the matrix $(\mathbf{C}_{rr}^e + \mathbf{C}_{rr}^c)$. Hence, the second-BERRU-PM methodology is significantly more efficient computationally than DA;
5. The DA methodology [32,33] is practically non-extendable beyond “second-order”. A “third-order DA” would be computationally impractical because of the massive sizes of the matrices that would need to be inverted. In contradistinction, the second-BERRU-PM methodology presented herein already comprises the fourth-order sensitivities of responses to parameters and can be readily extended/generalized to include even higher-order sensitivities and parameter correlations.
All of the above advantages of the second-BERRU-PM methodology over the DA methodology stem from the fact that the second-BERRU-PM methodology is fundamentally anchored in physics-based principles (thermodynamics and information theory) formulated in the most inclusive possible phase space (namely, the combined phase space of computed and measured parameters and responses), whereas the DA methodology [32,33] is fundamentally based on the minimization of a subjective user-chosen functional.

4. Critical Review of Methods for Computing High-Order Sensitivities

There are only two methodologies for directly computing high-order functional derivatives (i.e., sensitivities) of a model response with respect to the model's underlying parameters, namely: (i) the well-known finite difference (FD) approximations of derivatives, used in conjunction with re-computations with altered parameter values; and (ii) the “nth-Order Comprehensive Adjoint Sensitivity Analysis Methodology” (nth-CASAM) developed by Cacuci [21,23]. The salient features of the nth-CASAM methodology will be reviewed in Section 4.1, while the most important issues encountered when using FD approximations will be highlighted in Section 4.2 below. A number of quasi-statistical methods also exist for computing so-called “sensitivity indicators”, which are not functional derivatives of the response with respect to the model parameters but rather attempts at extracting sensitivity-like information from the approximate response-covariance-like information obtained from many re-computations (occasionally using Monte Carlo methods). Conceptually, these statistical methods are equivalent to attempting to disentangle “sensitivities” within the components of the right side of Equation (25) from a statistical approximation of the left side of Equation (25). The salient features of these statistical methods will be briefly discussed in Section 4.3 below.

4.1. The Only Method for Efficiently Computing Exact Expressions of High-Order Sensitivities: The nth-CASAM

The only extant method that overcomes, to the largest possible extent, the curse of dimensionality, while providing exact expressions that enable the efficient computation of sensitivities of all orders, is the “nth-Order Comprehensive Adjoint Sensitivity Analysis Methodology” (nth-CASAM) developed by Cacuci [21,23] by extending the original adjoint sensitivity analysis methodology conceived by Cacuci [1,2]. The nth-CASAM requires computations in Hilbert spaces whose dimensions increase linearly with the order of the sensitivities, as opposed to the exponentially increasing number of computations required in the parameter space by conventional methods. In particular, for a scalar-valued response of a model comprising $TP$ model parameters, the nth-CASAM requires a single adjoint computation for obtaining exactly all of the first-order response sensitivities, as opposed to at least $TP$ forward computations, which are required by other methods merely to obtain approximate values of these sensitivities.
The mathematical framework of the nth-CASAM commences by noting that the known nominal (or mean) parameter values, $\boldsymbol{\alpha}^{0}$, will differ from the true values $\boldsymbol{\alpha}$, which are unknown, by variations $\delta\boldsymbol{\alpha} \triangleq (\delta\alpha_{1},\ldots,\delta\alpha_{TP})$, where $\delta\alpha_{i} \triangleq \alpha_{i} - \alpha_{i}^{0}$. The parameter variations $\delta\boldsymbol{\alpha}$ will induce variations $\mathbf{v}^{(1)}(\mathbf{x}) \triangleq [\delta u_{1}(\mathbf{x}),\ldots,\delta u_{TD}(\mathbf{x})]$ around the nominal solution $\mathbf{u}^{0}(\mathbf{x})$ in the forward state functions via Equations (1) and (2). In turn, these variations will induce variations in the system's response $R[\mathbf{u}(\mathbf{x});\boldsymbol{\alpha}]$. Mathematically, the response variation is represented by the first-order G-differential $\delta R[\mathbf{u}^{0}(\mathbf{x});\boldsymbol{\alpha}^{0};\mathbf{v}^{(1)}(\mathbf{x});\delta\boldsymbol{\alpha}]$ of the response $R[\mathbf{u}(\mathbf{x});\boldsymbol{\alpha}]$ induced by arbitrary variations $[\mathbf{v}^{(1)}(\mathbf{x});\delta\boldsymbol{\alpha}]$ in a neighborhood around the nominal functions and parameter values $[\mathbf{u}^{0}(\mathbf{x});\boldsymbol{\alpha}^{0}]$, and is defined as follows:
$$\delta R[\mathbf{u}^{0}(\mathbf{x});\boldsymbol{\alpha}^{0};\mathbf{v}^{(1)}(\mathbf{x});\delta\boldsymbol{\alpha}] \triangleq \left\{ \frac{d}{d\varepsilon}\, R\big[\mathbf{u}^{0}+\varepsilon\mathbf{v}^{(1)};\,\boldsymbol{\alpha}^{0}+\varepsilon(\delta\boldsymbol{\alpha})\big] \right\}_{\varepsilon=0}, \tag{33}$$
where $\varepsilon$ is a scalar quantity. The G-variation $\delta R[\mathbf{u}^{0}(\mathbf{x});\boldsymbol{\alpha}^{0};\mathbf{v}^{(1)}(\mathbf{x});\delta\boldsymbol{\alpha}]$ is an operator defined on the same domain as $R[\mathbf{u}(\mathbf{x});\boldsymbol{\alpha}]$ and has the same range as $R[\mathbf{u}(\mathbf{x});\boldsymbol{\alpha}]$. The existence of this G-variation does not guarantee its numerical computability. Numerical methods most often require that $\delta R[\mathbf{u}^{0}(\mathbf{x});\boldsymbol{\alpha}^{0};\mathbf{v}^{(1)}(\mathbf{x});\delta\boldsymbol{\alpha}]$ be linear in $[\mathbf{v}^{(1)}(\mathbf{x});\delta\boldsymbol{\alpha}]$ in a neighborhood $[\mathbf{u}^{0}+\varepsilon\mathbf{v}^{(1)};\boldsymbol{\alpha}^{0}+\varepsilon(\delta\boldsymbol{\alpha})]$ around $[\mathbf{u}^{0}(\mathbf{x});\boldsymbol{\alpha}^{0}]$, thereby admitting a total first-order G-derivative, which will be denoted as $dR[\mathbf{u}^{0}(\mathbf{x});\boldsymbol{\alpha}^{0};\mathbf{v}^{(1)}(\mathbf{x});\delta\boldsymbol{\alpha}]$. In practice, the computation of the expression on the right side of Equation (33) reveals immediately whether the respective expression is linear (or not) in the vectors $\mathbf{v}^{(1)}(\mathbf{x})$ and/or $\delta\boldsymbol{\alpha}$. Numerical methods (e.g., Newton's method and variants thereof) for solving Equations (1) and (2) also require the existence of the first-order G-derivatives of the original model equations. Therefore, it will henceforth be assumed that the model responses and the operators underlying the physical system modeled by Equations (1) and (2) admit G-derivatives.
The first-order G-derivative d R [ u 0 ( x ) ; α 0 ; v ( 1 ) ( x ) ; δ α ] of R [ u ( x ) ; α ] is obtained, by definition, as follows:
$$dR[\mathbf{u}^{0}(\mathbf{x});\boldsymbol{\alpha}^{0};\mathbf{v}^{(1)}(\mathbf{x});\delta\boldsymbol{\alpha}] \triangleq \{dR[\mathbf{u}(\mathbf{x});\mathbf{x};\delta\boldsymbol{\alpha}]\}_{dir} + \{dR[\mathbf{u}(\mathbf{x});\boldsymbol{\alpha};\mathbf{v}^{(1)}(\mathbf{x})]\}_{ind} = \left\{ \frac{d}{d\varepsilon} \int_{\lambda_{1}(\boldsymbol{\alpha}^{0})+\varepsilon(\delta\lambda_{1})}^{\omega_{1}(\boldsymbol{\alpha}^{0})+\varepsilon(\delta\omega_{1})} \cdots \int_{\lambda_{TI}(\boldsymbol{\alpha}^{0})+\varepsilon(\delta\lambda_{TI})}^{\omega_{TI}(\boldsymbol{\alpha}^{0})+\varepsilon(\delta\omega_{TI})} S\big[\mathbf{u}^{0}+\varepsilon\mathbf{v}^{(1)},\boldsymbol{\alpha}^{0}+\varepsilon(\delta\boldsymbol{\alpha});\mathbf{x}\big]\, dx_{1}\cdots dx_{TI} \right\}_{\varepsilon=0}. \tag{34}$$
In Equation (34), the “direct effect” term $\{\delta R[\mathbf{u}(\mathbf{x});\mathbf{x};\delta\boldsymbol{\alpha}]\}_{dir}$ depends only on the parameter variations $\delta\boldsymbol{\alpha}$ and is defined as follows:
$$\{\delta R[\mathbf{u}(\mathbf{x});\mathbf{x};\delta\boldsymbol{\alpha}]\}_{dir} \triangleq \left\{\frac{\partial R(\mathbf{u};\boldsymbol{\alpha})}{\partial\boldsymbol{\alpha}}\right\}_{\boldsymbol{\alpha}^{0}} \delta\boldsymbol{\alpha} \triangleq \sum_{j_{1}=1}^{TP} \{R^{(1)}[j_{1};\mathbf{u}(\mathbf{x});\boldsymbol{\alpha}]\}_{dir}\,\delta\alpha_{j_{1}}, \tag{35}$$
where
$$
\begin{aligned}
\{R^{(1)}[j_{1};\mathbf{u}(\mathbf{x});\boldsymbol{\alpha}]\}_{dir}
&\triangleq \left\{ \int_{\lambda_{1}(\boldsymbol{\alpha})}^{\omega_{1}(\boldsymbol{\alpha})}\cdots\int_{\lambda_{TI}(\boldsymbol{\alpha})}^{\omega_{TI}(\boldsymbol{\alpha})}
\frac{\partial S(\mathbf{u};\boldsymbol{\alpha};\mathbf{x})}{\partial\alpha_{j_{1}}}\, dx_{1}\cdots dx_{TI}\right\}_{\boldsymbol{\alpha}^{0}} \\
&\quad + \sum_{j=1}^{TI}\left\{ \frac{\partial\omega_{j}(\boldsymbol{\alpha})}{\partial\alpha_{j_{1}}}
\int_{\lambda_{1}}^{\omega_{1}} dx_{1}\cdots\int_{\lambda_{j-1}}^{\omega_{j-1}} dx_{j-1}\int_{\lambda_{j+1}}^{\omega_{j+1}} dx_{j+1}\cdots\int_{\lambda_{TI}}^{\omega_{TI}} dx_{TI}\;
S\big[\mathbf{u}(x_{1},\ldots,\omega_{j}(\boldsymbol{\alpha}),\ldots,x_{TI});\boldsymbol{\alpha}\big]\right\}_{\boldsymbol{\alpha}^{0}} \\
&\quad - \sum_{j=1}^{TI}\left\{ \frac{\partial\lambda_{j}(\boldsymbol{\alpha})}{\partial\alpha_{j_{1}}}
\int_{\lambda_{1}}^{\omega_{1}} dx_{1}\cdots\int_{\lambda_{j-1}}^{\omega_{j-1}} dx_{j-1}\int_{\lambda_{j+1}}^{\omega_{j+1}} dx_{j+1}\cdots\int_{\lambda_{TI}}^{\omega_{TI}} dx_{TI}\;
S\big[\mathbf{u}(x_{1},\ldots,\lambda_{j}(\boldsymbol{\alpha}),\ldots,x_{TI});\boldsymbol{\alpha}\big]\right\}_{\boldsymbol{\alpha}^{0}}.
\end{aligned} \tag{36}
$$
The “indirect effect” term depends only on the variations $\mathbf{v}^{(1)}(\mathbf{x})$ in the state functions and is defined as follows:
$$\{\delta R[\mathbf{u}(\mathbf{x});\boldsymbol{\alpha};\mathbf{v}^{(1)}(\mathbf{x})]\}_{ind} \triangleq \left\{ \int_{\lambda_{1}(\boldsymbol{\alpha})}^{\omega_{1}(\boldsymbol{\alpha})} dx_{1}\cdots\int_{\lambda_{TI}(\boldsymbol{\alpha})}^{\omega_{TI}(\boldsymbol{\alpha})} dx_{TI}\; \frac{\partial S(\mathbf{u};\boldsymbol{\alpha})}{\partial\mathbf{u}}\,\mathbf{v}^{(1)}(\mathbf{x}) \right\}_{\boldsymbol{\alpha}^{0}}. \tag{37}$$
The direct effect term can be computed directly by using Equation (36). In contradistinction, the indirect effect term can be quantified only after having determined the variations v ( 1 ) ( x ) in terms of the variations δ α . The first-order relationship between the vectors v ( 1 ) ( x ) and the parameter variations δ α is determined by solving the equations obtained by applying the definition of the G-differential to Equations (1) and (2), which yields the following equations:
$$\left\{ \frac{d}{d\varepsilon}\, \mathbf{N}\big[\mathbf{u}^{0}+\varepsilon\mathbf{v}^{(1)}(\mathbf{x});\,\boldsymbol{\alpha}^{0}+\varepsilon(\delta\boldsymbol{\alpha})\big] \right\}_{\varepsilon=0} = \left\{ \frac{d}{d\varepsilon}\, \mathbf{Q}\big[\mathbf{x};\,\boldsymbol{\alpha}^{0}+\varepsilon(\delta\boldsymbol{\alpha})\big] \right\}_{\varepsilon=0}, \tag{38}$$
$$\left\{ \frac{d}{d\varepsilon}\, \mathbf{B}\big[\mathbf{u}^{0}+\varepsilon\mathbf{v}^{(1)}(\mathbf{x});\,\boldsymbol{\alpha}^{0}+\varepsilon(\delta\boldsymbol{\alpha})\big] \right\}_{\varepsilon=0} - \left\{ \frac{d}{d\varepsilon}\, \mathbf{C}\big[\boldsymbol{\alpha}^{0}+\varepsilon(\delta\boldsymbol{\alpha})\big] \right\}_{\varepsilon=0} = \mathbf{0}. \tag{39}$$
Carrying out the differentiations with respect to ε in Equations (38) and (39) and setting ε = 0 in the resulting expressions yields the following equations:
$$\left\{ \mathbf{N}^{(1)}(\mathbf{u};\boldsymbol{\alpha})\, \mathbf{v}^{(1)}(j_{1};\mathbf{x}) \right\}_{\boldsymbol{\alpha}^{0}} = \left\{ \mathbf{s}_{V}^{(1)}(j_{1};\mathbf{u};\boldsymbol{\alpha}) \right\}_{\boldsymbol{\alpha}^{0}}, \quad j_{1}=1,\ldots,TP;\; \mathbf{x}\in\Omega_{x}, \tag{40}$$
$$\left\{ \mathbf{b}_{V}^{(1)}\big[\mathbf{u};\boldsymbol{\alpha};\mathbf{v}^{(1)}(j_{1};\mathbf{x})\big] \right\}_{\boldsymbol{\alpha}^{0}} = \mathbf{0}; \quad j_{1}=1,\ldots,TP;\; \mathbf{x}\in\partial\Omega_{x}(\boldsymbol{\alpha}^{0}). \tag{41}$$
In Equations (40) and (41), the superscript “(1)” indicates the “first-level” and the various quantities which appear in these equations are defined as follows:
$$\mathbf{N}^{(1)}(\mathbf{u};\boldsymbol{\alpha}) \triangleq \frac{\partial\mathbf{N}(\mathbf{u};\boldsymbol{\alpha})}{\partial\mathbf{u}} \triangleq \left[ \frac{\partial N_{i}}{\partial u_{j}} \right]_{TD\times TD}; \tag{42}$$
$$\mathbf{s}_{V}^{(1)}(j_{1};\mathbf{u};\boldsymbol{\alpha}) \triangleq \frac{\partial\big[\mathbf{Q}(\boldsymbol{\alpha})-\mathbf{N}(\mathbf{u};\boldsymbol{\alpha})\big]}{\partial\alpha_{j_{1}}}; \tag{43}$$
$$\left\{ \mathbf{b}_{V}^{(1)}\big[\mathbf{u};\boldsymbol{\alpha};\mathbf{v}^{(1)}(j_{1};\mathbf{x})\big] \right\}_{\boldsymbol{\alpha}^{0}} \triangleq \left\{ \frac{\partial\mathbf{B}(\mathbf{u};\boldsymbol{\alpha})}{\partial\mathbf{u}}\, \mathbf{v}^{(1)}(j_{1};\mathbf{x}) + \frac{\partial\big[\mathbf{B}(\mathbf{u};\boldsymbol{\alpha})-\mathbf{C}(\boldsymbol{\alpha})\big]}{\partial\alpha_{j_{1}}} \right\}_{\boldsymbol{\alpha}^{0}}. \tag{44}$$
The system of equations comprising Equations (40) and (41) is called the “First-Level Variational Sensitivity System” (first-LVSS). As Equations (40) and (41) indicate, every parameter variation δ α j 1 , j 1 = 1 , , T P , will induce, directly or indirectly, variations in the model’s state variables. In principle, therefore, for every parameter variation δ α j 1 , j 1 = 1 , , T P , there would correspond a solution v ( 1 ) ( j 1 ; x ) , j 1 = 1 , , T P , of the first-LVSS. Thus, if the effect of every parameter variation were of interest, then the first-LVSS would need to be solved T P times, with distinct right-side and boundary conditions for each parameter variation δ α j 1 , which would require at least T P large-scale computations.
However, solving the first-LVSS can be avoided altogether by using the ideas underlying the “adjoint sensitivity analysis methodology” conceived by Cacuci [1,2] for the most efficient computation of exact expressions of first-order sensitivities, and subsequently, generalized by Cacuci [21,23] to enable the computation of arbitrarily high-order response sensitivities to model parameters. Thus, the need for computing the vectors v ( 1 ) ( j 1 ; x ) and j 1 = 1 , , T P is eliminated by expressing the indirect effect term defined in Equation (37) in terms of the solutions of the “First-Level Adjoint Sensitivity System” (First-LASS), the construction of which requires the introduction of adjoint operators. Constructing the First-LASS is accomplished by introducing a (real) Hilbert space denoted as H 1 ( Ω x ) , endowed with an inner product of two vectors w ( 1 ) ( x ) H 1 and w ( 2 ) ( x ) H 1 , which is denoted as w ( 1 ) , w ( 2 ) 1 and defined as follows:
$$\left\langle \mathbf{w}^{(1)}, \mathbf{w}^{(2)} \right\rangle_{1} \triangleq \left\{ \int_{\lambda_{1}(\boldsymbol{\alpha})}^{\omega_{1}(\boldsymbol{\alpha})}\cdots\int_{\lambda_{TI}(\boldsymbol{\alpha})}^{\omega_{TI}(\boldsymbol{\alpha})} \left[ \mathbf{w}^{(1)}(\mathbf{x})\cdot\mathbf{w}^{(2)}(\mathbf{x}) \right] dx_{1}\cdots dx_{TI} \right\}_{\boldsymbol{\alpha}^{0}}, \tag{45}$$
where $\mathbf{w}^{(1)}(\mathbf{x})\cdot\mathbf{w}^{(2)}(\mathbf{x}) \triangleq \sum_{i=1}^{TD} w_{i}^{(1)}(\mathbf{x})\, w_{i}^{(2)}(\mathbf{x})$, and where the “dagger” ($\dagger$), which indicates “transposition”, has been omitted (to simplify the notation) in the representation of this scalar product.
Consider a vector $\mathbf{a}^{(1)}(\mathbf{x}) \in \mathsf{H}_{1}$, which is an element of $\mathsf{H}_{1}(\Omega_{x})$ but is otherwise arbitrary at this stage, and use Equation (45) to form the inner product of $\mathbf{a}^{(1)}(\mathbf{x})$ with the relation provided in Equation (40) to obtain
$$\left\{ \left\langle \mathbf{a}^{(1)}, \mathbf{N}^{(1)}(\mathbf{u};\boldsymbol{\alpha})\,\mathbf{v}^{(1)} \right\rangle_{1} \right\}_{\boldsymbol{\alpha}^{0}} = \left\{ \left\langle \mathbf{a}^{(1)}, \mathbf{q}_{V}^{(1)}(\mathbf{u};\boldsymbol{\alpha};\delta\boldsymbol{\alpha}) \right\rangle_{1} \right\}_{\boldsymbol{\alpha}^{0}}, \quad \mathbf{x}\in\Omega_{x}. \tag{46}$$
The left side of Equation (46) is transformed by using the definition of the adjoint operator in H 1 ( Ω x ) as follows:
$$\left\{ \left\langle \mathbf{a}^{(1)}, \mathbf{N}^{(1)}(\mathbf{u};\boldsymbol{\alpha})\,\mathbf{v}^{(1)} \right\rangle_{1} \right\}_{\boldsymbol{\alpha}^{0}} = \left\{ \left\langle \mathbf{A}^{(1)}(\mathbf{u};\boldsymbol{\alpha})\,\mathbf{a}^{(1)}, \mathbf{v}^{(1)} \right\rangle_{1} \right\}_{\boldsymbol{\alpha}^{0}} + \left\{ \left[ P^{(1)}(\mathbf{u};\boldsymbol{\alpha};\mathbf{v}^{(1)};\mathbf{a}^{(1)}) \right]_{\partial\Omega_{x}} \right\}_{\boldsymbol{\alpha}^{0}}, \tag{47}$$
where $[P^{(1)}(\mathbf{u};\boldsymbol{\alpha};\mathbf{v}^{(1)};\mathbf{a}^{(1)})]_{\partial\Omega_{x}}$ denotes the associated bilinear concomitant evaluated on the domain's boundary $\partial\Omega_{x}$, and $\mathbf{A}^{(1)}(\mathbf{u};\boldsymbol{\alpha}) \triangleq [\mathbf{N}^{(1)}(\mathbf{u};\boldsymbol{\alpha})]^{*}$ denotes the operator formally adjoint to $\mathbf{N}^{(1)}(\mathbf{u};\boldsymbol{\alpha})$; the symbol $[\,\cdot\,]^{*}$ indicates the “formal adjoint” operator.
The first term on the right side of Equation (47) is required to represent the indirect effect term defined in Equation (37) by imposing the following relationship:
$$\left\{ \mathbf{A}^{(1)}(\mathbf{u};\boldsymbol{\alpha})\, \mathbf{a}^{(1)}(\mathbf{x}) \right\}_{\boldsymbol{\alpha}^{0}} = \left\{ \partial S(\mathbf{u};\boldsymbol{\alpha})/\partial\mathbf{u} \right\}_{\boldsymbol{\alpha}^{0}} \triangleq \mathbf{q}_{A}^{(1)}[\mathbf{u}(\mathbf{x});\boldsymbol{\alpha}], \quad \mathbf{x}\in\Omega_{x}. \tag{48}$$
The domain of A ( 1 ) ( u ; α ) is determined by selecting appropriate adjoint boundary and/or initial conditions, which will be denoted in operator form as:
$$\left\{ \mathbf{b}_{A}^{(1)}(\mathbf{u};\mathbf{a}^{(1)};\boldsymbol{\alpha}) \right\}_{\boldsymbol{\alpha}^{0}} = \mathbf{0}, \quad \mathbf{x}\in\partial\Omega_{x}. \tag{49}$$
The above boundary conditions for the adjoint operator $\mathbf{A}^{(1)}(\mathbf{u};\boldsymbol{\alpha})$ are obtained by imposing the following requirements: (i) they must be independent of the unknown values of $\mathbf{v}^{(1)}(\mathbf{x})$ and $\delta\boldsymbol{\alpha}$; (ii) the substitution of the boundary and initial conditions represented by Equations (41) and (49) into the expression of $\{[P^{(1)}(\mathbf{u};\boldsymbol{\alpha};\mathbf{v}^{(1)};\mathbf{a}^{(1)})]_{\partial\Omega_{x}}\}_{\boldsymbol{\alpha}^{0}}$ must cause all terms containing unknown values of $\mathbf{v}^{(1)}(\mathbf{x})$ to vanish.
Implementing the forward variational boundary conditions of Equation (41) and the adjoint boundary conditions of Equation (49) in Equation (47) reduces the bilinear concomitant $\{[P^{(1)}(\mathbf{u};\boldsymbol{\alpha};\mathbf{v}^{(1)};\mathbf{a}^{(1)})]_{\partial\Omega_{x}}\}_{\boldsymbol{\alpha}^{0}}$ to a residual bilinear concomitant, denoted as $\{[\hat{P}^{(1)}(\mathbf{u};\boldsymbol{\alpha};\mathbf{a}^{(1)};\delta\boldsymbol{\alpha})]_{\partial\Omega_{x}}\}_{\boldsymbol{\alpha}^{0}}$, which contains boundary terms involving only known values of $\delta\boldsymbol{\alpha}$, $\mathbf{u}$, and $\mathbf{a}^{(1)}$; this residual bilinear concomitant is linear in $\delta\boldsymbol{\alpha}$. The results obtained in Equations (47) and (48) are now used together with Equation (37) to obtain the following expression of the indirect-effect term as a function of $\mathbf{a}^{(1)}(\mathbf{x})$:
$$\left\{ \delta R[\mathbf{u}(\mathbf{x});\boldsymbol{\alpha};\mathbf{v}^{(1)}(\mathbf{x});\delta\boldsymbol{\alpha}] \right\}_{\boldsymbol{\alpha}^{0}} = \left\{ \delta R[\mathbf{u}(\mathbf{x});\boldsymbol{\alpha};\delta\boldsymbol{\alpha}] \right\}_{dir} + \left\{ \left\langle \mathbf{a}^{(1)}, \mathbf{q}_{V}^{(1)}(\mathbf{u};\boldsymbol{\alpha};\delta\boldsymbol{\alpha}) \right\rangle_{1} \right\}_{\boldsymbol{\alpha}^{0}} - \left\{ \left[ \hat{P}^{(1)}(\mathbf{u};\boldsymbol{\alpha};\mathbf{a}^{(1)};\delta\boldsymbol{\alpha}) \right]_{\partial\Omega_{x}} \right\}_{\boldsymbol{\alpha}^{0}} \triangleq \sum_{j_{1}=1}^{TP} \left\{ R^{(1)}[j_{1};\mathbf{u}(\mathbf{x});\mathbf{a}^{(1)}(\mathbf{x});\boldsymbol{\alpha}] \right\}_{\boldsymbol{\alpha}^{0}} \delta\alpha_{j_{1}}, \tag{50}$$
where, for each $j_{1}=1,\ldots,TP$, the quantity $R^{(1)}[j_{1};\mathbf{u}(\mathbf{x});\mathbf{a}^{(1)}(\mathbf{x});\boldsymbol{\alpha}] \triangleq \partial R[\mathbf{u}(\mathbf{x});\boldsymbol{\alpha}]/\partial\alpha_{j_{1}}$ denotes the first-order sensitivity of the response $R[\mathbf{u}(\mathbf{x});\boldsymbol{\alpha}]$ with respect to the model parameter $\alpha_{j_{1}}$ and has the following expression:
$$R^{(1)}[j_{1};\mathbf{u}(\mathbf{x});\mathbf{a}^{(1)}(\mathbf{x});\boldsymbol{\alpha}] = \left\{ R^{(1)}[j_{1};\mathbf{u}(\mathbf{x});\boldsymbol{\alpha}] \right\}_{dir} - \frac{\partial\left[ \hat{P}^{(1)}(\mathbf{u};\mathbf{a}^{(1)};\boldsymbol{\alpha}) \right]_{\partial\Omega_{x}}}{\partial\alpha_{j_{1}}} + \int_{\lambda_{1}(\boldsymbol{\alpha})}^{\omega_{1}(\boldsymbol{\alpha})} dx_{1}\cdots\int_{\lambda_{TI}(\boldsymbol{\alpha})}^{\omega_{TI}(\boldsymbol{\alpha})} dx_{TI}\; \mathbf{a}^{(1)}(\mathbf{x})\cdot\frac{\partial\big[\mathbf{Q}(\boldsymbol{\alpha})-\mathbf{N}(\mathbf{u};\boldsymbol{\alpha})\big]}{\partial\alpha_{j_{1}}} \triangleq \frac{\partial R[\mathbf{u}(\mathbf{x});\boldsymbol{\alpha}]}{\partial\alpha_{j_{1}}}. \tag{51}$$
Each of the first-order sensitivities $R^{(1)}[j_{1};\mathbf{u}(\mathbf{x});\mathbf{a}^{(1)}(\mathbf{x});\boldsymbol{\alpha}]$ of the response $R[\mathbf{u}(\mathbf{x});\boldsymbol{\alpha}]$ with respect to the model parameters can be computed inexpensively after having obtained the first-level adjoint function $\mathbf{a}^{(1)}(\mathbf{x}) \in \mathsf{H}_{1}$, using only quadrature formulas to evaluate the various inner products involving $\mathbf{a}^{(1)}(\mathbf{x})$ in Equation (51). The function $\mathbf{a}^{(1)}(\mathbf{x})$ is obtained by solving numerically Equations (48) and (49), which is the only large-scale computation needed for obtaining all of the first-order sensitivities. Equations (48) and (49) are called the First-Level Adjoint Sensitivity System (first-LASS), and their solution, $\mathbf{a}^{(1)}(\mathbf{x}) \in \mathsf{H}_{1}(\Omega_{x})$, is called the first-level adjoint function. It is very important to note that the first-LASS is independent of all parameter variations $\delta\alpha_{j_{1}}$, $j_{1}=1,\ldots,TP$, and, therefore, needs to be solved only once, regardless of the number of model parameters under consideration. Furthermore, since the first-LASS is linear in $\mathbf{a}^{(1)}(\mathbf{x})$, solving it requires less computational effort than solving the original model, which is nonlinear in $\mathbf{u}(\mathbf{x})$.
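The economy of the adjoint approach can be illustrated with a deliberately small linear analogue (a hypothetical 3 × 3 system, not the PERP model); all names below (`forward`, `alpha0`, `s`) are illustrative assumptions. A single adjoint solve yields all first-order sensitivities, whereas finite differences require one pair of forward solves per parameter:

```python
import numpy as np

# Hypothetical 3x3 linear model N(alpha) u = Q, with response R = s . u.
# One adjoint solve N^T a = s yields ALL first-order sensitivities via
# dR/dalpha_j = a . (dQ/dalpha_j - (dN/dalpha_j) u), whereas finite
# differences need (at least) one pair of forward solves per parameter.

def forward(alpha):
    """Forward model: here N(alpha) = diag(alpha) and Q = (1, 1, 1)."""
    N = np.diag(alpha)
    Q = np.ones(3)
    return np.linalg.solve(N, Q), N, Q

alpha0 = np.array([2.0, 3.0, 4.0])   # nominal parameter values
s = np.array([1.0, 1.0, 1.0])        # response weights: R = s . u

u0, N0, _ = forward(alpha0)
a = np.linalg.solve(N0.T, s)         # the single adjoint solve

# Exact first-order sensitivities from the adjoint function: here
# dQ/dalpha_j = 0 and (dN/dalpha_j) u has the single entry u_j,
# so dR/dalpha_j = -a_j u_j.
sens_adjoint = -a * u0

# Verification by central differences: one pair of forward solves per parameter.
h = 1.0e-6
sens_fd = np.empty(3)
for j in range(3):
    ap, am = alpha0.copy(), alpha0.copy()
    ap[j] += h
    am[j] -= h
    sens_fd[j] = (s @ forward(ap)[0] - s @ forward(am)[0]) / (2.0 * h)

assert np.allclose(sens_adjoint, sens_fd, rtol=1.0e-5)
print(sens_adjoint)   # equals -1/alpha_j**2 for this diagonal toy model
```

For this toy model, the adjoint result reproduces the analytic values $-1/\alpha_{j}^{2}$ exactly; the essential point is the count of large-scale solves: one (adjoint) versus $2\,TP$ (finite differences).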
The second-order sensitivities of the response $R[\mathbf{u}(\mathbf{x});\boldsymbol{\alpha}]$ with respect to the model parameters are computed by applying the principles underlying the first-CASAM to each of the first-order sensitivities $R^{(1)}[j_{1};\mathbf{u}(\mathbf{x});\mathbf{a}^{(1)}(\mathbf{x});\boldsymbol{\alpha}]$, considered, in turn, as a “model response”. In this way, the second-order sensitivities are computed in an order of priority chosen by the user (e.g., the largest first-order sensitivities would be considered with the highest priority). If all first-order sensitivities are treated as responses, then each mixed second-order sensitivity is computed twice, using distinct second-level adjoint functions. Thus, the symmetry of the mixed higher-order sensitivities provides an intrinsic mutual “verification” of the respective numerical accuracies. The third-order sensitivities are computed by applying the principles of the first-CASAM to the second-order sensitivities, and so on. The computations of the second- and higher-order sensitivities will be presented in detail in the accompanying Part II [24], when presenting the newly developed “High-Order Function/Feature Adjoint Sensitivity Analysis Methodology” (nth-FASAM), which greatly increases the efficiency of this methodology in special cases.

4.2. Using Finite Differences: A Conceptually Simple but Computationally Inefficient and Inaccurate Procedure for Computing High-Order Sensitivities

Historically, the oldest method for computing approximate values of the derivative of a function with respect to an underlying parameter, when the actual functional form is not explicitly available, is the use of finite difference formulas in conjunction with re-computations of the respective function (model response) using altered parameter values. Thus, the first-order partial derivative (sensitivity) of a response $R(\boldsymbol{\alpha})$, $\boldsymbol{\alpha}\triangleq(\alpha_{1},\ldots,\alpha_{TP})$, with respect to a parameter $\alpha_{j}$ around the nominal parameter values $\boldsymbol{\alpha}^{0}\triangleq(\alpha_{1}^{0},\ldots,\alpha_{TP}^{0})$ can be computed approximately by using backward, forward, or central differences; a formula that is accurate to second order in the parameter variation is the following:
$$\left\{ \frac{\partial R(\boldsymbol{\alpha})}{\partial\alpha_{j}} \right\}_{\boldsymbol{\alpha}^{0}} \simeq \frac{1}{2h_{j}}\left( R_{j+1} - R_{j-1} \right) + O(h_{j}^{2}), \quad j=1,\ldots,TP, \tag{52}$$
where $R_{j+1} \triangleq R(\alpha_{j}^{0} + h_{j})$, $R_{j-1} \triangleq R(\alpha_{j}^{0} - h_{j})$, and where $h_{j}$ denotes a “judiciously chosen” variation in the parameter $\alpha_{j}$ around its nominal value $\alpha_{j}^{0}$. The values $R_{j+1}$ and $R_{j-1}$ are obtained by re-computing the response $R(\boldsymbol{\alpha})$ using the changed parameter values $\alpha_{j}^{0} \pm h_{j}$. The value of the variation $h_{j}$ must be chosen by “trial and error” for each parameter $\alpha_{j}$: if $h_{j}$ is too large or too small, the result produced by Equation (52) will deviate substantially from the exact value of the derivative $\partial R(\boldsymbol{\alpha})/\partial\alpha_{j}$. It is important to note that finite difference formulas introduce their own intrinsic “methodological errors”, such as the error $O(h_{j}^{2})$ indicated in Equation (52), which are in addition to, and independent of, the errors that might be incurred in the computation of $R(\boldsymbol{\alpha})$. In other words, even if the computation of $R(\boldsymbol{\alpha})$ were perfect (error-free), the finite difference formulas would nevertheless introduce their own intrinsic numerical errors into the computation of the sensitivity $\partial R(\boldsymbol{\alpha})/\partial\alpha_{j}$.
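The step-size dilemma underlying Equation (52) can be illustrated on a response whose derivative is known analytically; `central_diff` below is a hypothetical helper, and `exp(x)` merely stands in for an actual model response:

```python
import math

# Hypothetical helper implementing the central-difference formula of Eq. (52)
# for a scalar response; exp(x) stands in for a model response whose exact
# derivative at x = 1 is e, so the error of each step size is known.
def central_diff(f, x, h):
    return (f(x + h) - f(x - h)) / (2.0 * h)

exact = math.e
for h in (0.5, 1.0e-1, 1.0e-3, 1.0e-6, 1.0e-12):
    approx = central_diff(math.exp, 1.0, h)
    print(f"h = {h:7.0e}   relative error = {abs(approx - exact) / exact:.2e}")

# Large h: the O(h^2) truncation error of Eq. (52) dominates.
# Very small h: floating-point cancellation in f(x+h) - f(x-h) dominates.
# Hence the accuracy is NOT a monotone function of the step size h.
```

Since both error sources depend on the (unknown) response, the "sweet spot" for $h_{j}$ cannot be predicted a priori, which is precisely why trial-and-error tuning is needed for every parameter.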
For models with many parameters, the computation of approximate sensitivities by finite differences is expensive, even at first order; for example, for the PERP benchmark model discussed in Section 3.1, a forward (re)computation requires about 2 min of CPU time. The evaluation of the two-point finite difference expression provided in Equation (52) requires two forward transport computations: one using the altered parameter value $\alpha_{j}^{0} + h_{j}$ and a second using the altered parameter value $\alpha_{j}^{0} - h_{j}$. Hence, the total CPU time needed for computing the 7477 non-zero sensitivities using forward computations in conjunction with the two-point finite difference formula provided in Equation (52) is about 1388 h. In contradistinction, the computation of the first-level adjoint sensitivity function, which suffices for computing all of the first-order sensitivities, takes about the same CPU time as a single forward computation, i.e., ca. 2 min. It is evident that finite difference computations can be afforded, if at all, only for a few specifically targeted sensitivities.
The size of the parameter variation is critically important for the computational accuracy (or lack thereof) of finite difference (FD) formulas, and this importance increases with the order of the sensitivities. The influence of the size of the parameter variation will be illustrated below by considering the largest fourth-order unmixed and mixed sensitivities, namely: (i) the largest fourth-order unmixed relative sensitivity, $S^{(4)}(\sigma_{t,6}^{g=30},\sigma_{t,6}^{g=30},\sigma_{t,6}^{g=30},\sigma_{t,6}^{g=30}) = 2.720\times 10^{6}$, for isotope #6 (1H) of the PERP benchmark, and (ii) the largest fourth-order mixed relative sensitivity, $S^{(4)}(\sigma_{t,6}^{g=30},\sigma_{t,6}^{g=30},\sigma_{t,6}^{g=30},\sigma_{t,5}^{g=30}) = 2.279\times 10^{5}$, of the leakage response with respect to the microscopic total cross-sections for group 30 of isotopes #5 (12C) and #6 (1H). The exact values of these sensitivities were computed [22] using the software package “fourth-order SENS” [34], which computes the first-order through fourth-order sensitivities of the PERP leakage response with respect to the model parameters using the “Fourth-Order Comprehensive Adjoint Sensitivity Analysis Methodology” (fourth-CASAM). The finite difference formula used for computing these sensitivities is reproduced below:
$$
\begin{aligned}
\frac{\partial^{4} R(\boldsymbol{\alpha})}{\partial\alpha_{j_{1}}\,\partial\alpha_{j_{2}}\,\partial\alpha_{j_{3}}\,\partial\alpha_{j_{4}}}
&\simeq \frac{1}{16\, h_{j_{1}} h_{j_{2}} h_{j_{3}} h_{j_{4}}}
\big( R_{j_{1}+1,j_{2}+1,j_{3}+1,j_{4}+1} - R_{j_{1}+1,j_{2}+1,j_{3}+1,j_{4}-1} - R_{j_{1}+1,j_{2}+1,j_{3}-1,j_{4}+1} + R_{j_{1}+1,j_{2}+1,j_{3}-1,j_{4}-1} \\
&\quad - R_{j_{1}+1,j_{2}-1,j_{3}+1,j_{4}+1} + R_{j_{1}+1,j_{2}-1,j_{3}+1,j_{4}-1} + R_{j_{1}+1,j_{2}-1,j_{3}-1,j_{4}+1} - R_{j_{1}+1,j_{2}-1,j_{3}-1,j_{4}-1} \\
&\quad - R_{j_{1}-1,j_{2}+1,j_{3}+1,j_{4}+1} + R_{j_{1}-1,j_{2}+1,j_{3}+1,j_{4}-1} + R_{j_{1}-1,j_{2}+1,j_{3}-1,j_{4}+1} - R_{j_{1}-1,j_{2}+1,j_{3}-1,j_{4}-1} \\
&\quad + R_{j_{1}-1,j_{2}-1,j_{3}+1,j_{4}+1} - R_{j_{1}-1,j_{2}-1,j_{3}+1,j_{4}-1} - R_{j_{1}-1,j_{2}-1,j_{3}-1,j_{4}+1} + R_{j_{1}-1,j_{2}-1,j_{3}-1,j_{4}-1} \big) \\
&\quad + O(h_{j_{1}}^{2},h_{j_{2}}^{2},h_{j_{3}}^{2},h_{j_{4}}^{2}),
\end{aligned} \tag{53}
$$
where R j 1 + 1 , j 2 + 1 , j 3 + 1 , j 4 + 1 R ( α j 1 + h j 1 , α j 2 + h j 2 , α j 3 + h j 3 , α j 4 + h j 4 ) , etc. The finite difference formulas introduce their intrinsic “methodological errors” of order O ( h j 1 2 , h j 2 2 , h j 3 2 , h j 4 2 ) , which are in addition to and independent of the errors that might be incurred in the computation of 4 R ( α ) / α j 1 α j 2 α j 3 α j 4 .
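As a sketch (not the production software used in [22,34]), the 16-point formula of Equation (53) can be implemented generically by summing over all sign combinations, with the sign of each term equal to the product of the four individual signs; the test response below is an assumed multilinear function, for which the formula is exact up to round-off:

```python
import itertools
import numpy as np

# Generic sketch of the 16-point tensor-product formula of Eq. (53): each
# of the 16 response evaluations carries the product of its four +/- signs,
# and the signed sum is divided by 16 h_j1 h_j2 h_j3 h_j4.
def fd4_mixed(R, alpha, idx, h):
    """idx: the four parameter indices (j1..j4); h: the four step sizes."""
    total = 0.0
    for signs in itertools.product((+1, -1), repeat=4):
        a = np.array(alpha, dtype=float)
        for k in range(4):
            a[idx[k]] += signs[k] * h[k]
        total += np.prod(signs) * R(a)
    return total / (16.0 * np.prod(h))

# Assumed multilinear test response: d4R/(da0 da1 da2 da3) = 1 exactly,
# so the 16-point formula is exact for it, up to round-off.
R = lambda a: a[0] * a[1] * a[2] * a[3]
alpha0 = [1.0, 2.0, 3.0, 4.0]
approx = fd4_mixed(R, alpha0, idx=(0, 1, 2, 3), h=np.array([0.1] * 4))
assert abs(approx - 1.0) < 1.0e-6
```

Note that each fourth-order value costs 16 response evaluations for a single index combination; for a realistic response, the $O(h^{2})$ truncation and cancellation errors reappear, which is what the PERP results below illustrate.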
Finite difference computations of $S^{(4)}(\sigma_{t,6}^{g=30},\sigma_{t,6}^{g=30},\sigma_{t,6}^{g=30},\sigma_{t,6}^{g=30})$ using Equation (53) were performed with step sizes $h_{j}$ ranging from 0.125% to 2% of $\sigma_{t,6}^{g=30}$. The results of these computations are summarized in Table 3 below, which indicates that using a very small step size (e.g., a 0.125% change in the microscopic total cross-section $\sigma_{t,6}^{g=30}$) in conjunction with the finite difference formula provided in Equation (53) causes a very large error, of ca. 6010%, by comparison to the exact value $S^{(4)}(\sigma_{t,6}^{g=30},\sigma_{t,6}^{g=30},\sigma_{t,6}^{g=30},\sigma_{t,6}^{g=30}) = 2.720\times 10^{6}$. Increasing the step size to a 0.5% change in $\sigma_{t,6}^{g=30}$ dramatically reduces the error (to −17.7%). The smallest error (of only −0.45%) between the approximate result produced by the FD formula given in Equation (53) and the exact result produced by the fourth-CASAM was attained for a 0.60% change in $\sigma_{t,6}^{g=30}$. Increasing the step size further, however, worsened the results produced by the FD formula; thus, a 1.0% change in $\sigma_{t,6}^{g=30}$ increased the error to 31.4%, while a 2.0% change in $\sigma_{t,6}^{g=30}$ further increased the error of the FD formula (by comparison to the exact result produced by the fourth-CASAM) to 698%. For $h_{j} > 2.5\%\times\sigma_{t,6}^{g=30}$, the forward neutron transport (re-)computations using PARTISN failed to converge.
Finite difference computations using Equation (53), in conjunction with various step sizes $h_{1}\times\sigma_{t,5}^{g=30}$ and $h_{2}\times\sigma_{t,6}^{g=30}$, where $h_{1}$ and $h_{2}$ denote the percentage changes in the parameters $\sigma_{t,5}^{g=30}$ and $\sigma_{t,6}^{g=30}$, respectively, were also performed for the largest fourth-order mixed relative sensitivity, $S^{(4)}(\sigma_{t,6}^{g=30},\sigma_{t,6}^{g=30},\sigma_{t,6}^{g=30},\sigma_{t,5}^{g=30}) = 2.279\times 10^{5}$, of the leakage response with respect to the microscopic total cross-sections for group 30 of isotopes #5 (12C) and #6 (1H). The results thus obtained are depicted in Figure 5, which shows that: (i) using a 0.125% change in both parameters $\sigma_{t,5}^{g=30}$ and $\sigma_{t,6}^{g=30}$, the finite difference method causes an error of 34.6% from the exact value $S^{(4)} = 2.279\times 10^{5}$; (ii) the combination of a 0.5% change in $\sigma_{t,5}^{g=30}$ and a 0.25% change in $\sigma_{t,6}^{g=30}$ reduces this error to 6.34%; (iii) a 1.0% change in both parameters $\sigma_{t,5}^{g=30}$ and $\sigma_{t,6}^{g=30}$ increases the error to 197%.
The most important tool for verifying the accuracy of the mixed sensitivities is their symmetry property, which is built into the mathematical framework of the high-order nth-CASAM. Finite difference schemes can provide direct, albeit approximate, verifications for both mixed and unmixed sensitivities. However, caution should be exercised when using the finite difference method, since the accuracy of the various finite difference approximations becomes increasingly sensitive to the chosen step size as the order of the sensitivities increases.
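A minimal numerical analogue of this symmetry check (using nested finite differences on an assumed two-parameter test response, not the adjoint functions themselves) computes a mixed second-order sensitivity with the two differentiation orders interchanged and with different step sizes, and verifies that the two results agree:

```python
import numpy as np

# Nested central differences on an assumed two-parameter test response:
# the mixed second-order sensitivity is computed "first a1, then a0" and
# "first a0, then a1", with different step sizes, mimicking the mutual
# verification provided by the symmetry of mixed sensitivities.
def d1(R, alpha, i, h):
    ap, am = np.array(alpha), np.array(alpha)
    ap[i] += h
    am[i] -= h
    return (R(ap) - R(am)) / (2.0 * h)

def d2(R, alpha, i, j, h):
    return d1(lambda a: d1(R, a, j, h), alpha, i, h)

R = lambda a: np.exp(a[0]) * np.sin(a[1])     # assumed test response
alpha0 = np.array([0.3, 0.7])

s_01 = d2(R, alpha0, 0, 1, 1.0e-4)   # d2R/(da0 da1), step 1e-4
s_10 = d2(R, alpha0, 1, 0, 2.0e-4)   # d2R/(da1 da0), step 2e-4

# The two estimates must agree (symmetry of mixed derivatives); a large
# discrepancy would flag an ill-chosen step size or a coding error.
assert abs(s_01 - s_10) < 1.0e-5
```

Within the nth-CASAM, the analogous check is exact rather than approximate, because the two computations of a mixed sensitivity use distinct (second-level) adjoint functions.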

4.3. Computing “Sensitivity Indicators” Using Statistical Methods

A plethora of statistical procedures have been developed, especially in the 1990s, to compute various “sensitivity indicators” used in lieu of a “sensitivity”, in the sense of indicating changes in the output of a model that could be attributed to some change in a parameter or process within the model, or in an input to the model. These indicators invariably involve components of the uncertainties (most often, variances) in the parameters and are usually deduced from repeated re-computations (e.g., Monte Carlo) using the model itself with altered parameter values. For example, the procedures based on “variance decomposition” attempt to obtain an estimate of the variance of a response, as represented by the left side of Equation (25), by using re-computations, and subsequently attempt to apportion the right side of Equation (25) among the contributing parameters; the contribution thus assigned is then deemed to be the “sensitivity indicator” for the respective parameter. Such a “sensitivity indicator” amalgamates parameter uncertainties and sensitivities, which cannot be disentangled, so the actual sensitivity of the response to a parameter cannot be accurately determined. In particular, such “sensitivity indicators” cannot be used within predictive modeling and/or “data assimilation” procedures. There are over a dozen variants of such statistical procedures, most of which also claim to be “global in nature”. All of these statistical procedures share the following common characteristics: they are conceptually simple to implement; they are computationally inefficient (at best) or useless (at worst) for large systems with thousands of parameters; they cannot consider correlations between parameters and/or responses; and the accuracy of a “sensitivity indicator” as a bona fide “response sensitivity to a model parameter” cannot be guaranteed.
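A minimal Monte Carlo sketch (an assumed two-parameter linear toy model, not any specific published procedure) of such a variance-decomposition indicator illustrates the point: the indicator mixes the parameter variances with the sensitivities, and the derivative of the response cannot be recovered from it:

```python
import numpy as np

# Assumed toy model R = a0 + 0.1*a1 with independent standard-normal
# parameters: the first-order variance-based indicator for a0 is
# Var[E(R | a0)] / Var[R], estimated here by brute-force Monte Carlo.
rng = np.random.default_rng(0)
R = lambda a: a[..., 0] + 0.1 * a[..., 1]

n = 20000
var_total = R(rng.normal(size=(n, 2))).var()       # estimate of Var[R]

# Conditional means E(R | a0) on a sample of a0 values, each estimated
# from 200 re-computations of the model with a1 re-sampled.
a0_grid = rng.normal(size=2000)
cond_mean = np.array([
    R(np.column_stack([np.full(200, a0), rng.normal(size=200)])).mean()
    for a0 in a0_grid
])
indicator = cond_mean.var() / var_total
print(indicator)

# The indicator comes out near Var(a0)/Var(R) = 1/1.01: it reflects the
# parameter variance as much as the sensitivity. Rescaling the units of
# a0 changes the indicator while the physics is unchanged, and the
# derivative dR/da0 = 1 cannot be recovered from the indicator alone.
```

Note also the cost: this single indicator, for a two-parameter toy model, already consumed several hundred thousand model evaluations, which foreshadows the impracticality of such procedures for models with thousands of parameters.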
Since none of these procedures can explicitly compute high-order sensitivities of responses to parameters, a detailed review of these procedures would be outside the scope of this work.

5. Discussion and Conclusions

This work has reviewed the motivation and methods for computing higher-order sensitivities of model responses with respect to model parameters. “Sensitivity analysis”, i.e., quantifying the variations in model responses caused by variations in the model parameters, cannot be performed without the availability of at least the first-order sensitivities. Quantification of the second-order sensitivities is essential for deciding whether the first-order sensitivities suffice for the purpose of the respective “sensitivity analysis”. Second-order sensitivities would be computed in the order of priority indicated by the magnitudes of the uncertainties associated with the first-order sensitivities. In particular, second-order sensitivities should always be computed for parameters for which the first-order sensitivities of the response under consideration happen to vanish, in order to assess whether the respective response is truly independent of the respective parameter or whether the vanishing first-order sensitivity indicates an extremum of the respective response distribution (in the phase space of parameters) with respect to the respective parameter. The magnitudes of, and the uncertainties associated with, the second-order sensitivities will prioritize the need for computing third-order sensitivities, and so on. The convergence of the multivariate Taylor series expansion of the response under consideration in the phase space of the parameters also needs to be investigated, to ensure that this Taylor series is not inadvertently used beyond its domain of convergence. If the model possesses parameters whose standard deviations are sufficiently large to exceed the radius of convergence of this Taylor series, then the respective parameters would need to be recalibrated by using, e.g., the predictive modeling methodology reviewed in Section 3.3.
As has been discussed in Section 4, the only methodology that can systematically and efficiently compute exact expressions of the response sensitivities of any order with respect to the model parameters is the nth-CASAM [21,23]. The principles underlying the first-CASAM (n = 1) for computing the first-order sensitivities have been reviewed in detail in Section 4 while also indicating the path for applying these principles to compute the higher-order sensitivities. These principles also provide the basis for the development, to be presented in the accompanying Part II [24], of the high-order adjoint sensitivity analysis methodology for computing sensitivities of responses with respect to functions (“features”) of model parameters. This new methodology (to be designated as the “nth-FASAM”) will be shown, in Part II [24], to be even more efficient than the nth-CASAM for those specific models that possess such functions/features of model parameters.

Funding

This research received no external funding.

Conflicts of Interest

The author declares no conflicts of interest.

Appendix A. Brief Description of the Polyethylene-Reflected Plutonium (Acronym PERP) Reactor Physics Benchmark

The physical system considered in this work is a subcritical polyethylene-reflected plutonium (acronym PERP) metal sphere benchmark [15], which is used for neutron and gamma measurements and which is included in the Nuclear Energy Agency (NEA) International Criticality Safety Benchmark Evaluation Project (ICSBEP). The inner sphere has a radius r 1 = 3.794 cm and contains α-phase plutonium. It is surrounded by a spherical shell reflector made of polyethylene of thickness 3.81 cm; the radius of the outer shell containing polyethylene is r 2 = 7.604 cm. The constitutive materials of this benchmark are specified in Table A1.
Table A1. Dimensions and material composition of the PERP benchmark.
| Materials | Isotopes | Weight Fraction | Density (g/cm³) | Zones |
|---|---|---|---|---|
| Material 1 (plutonium metal) | Isotope 1 (239Pu) | 9.3804 × 10−1 | 19.6 | Material 1 is assigned to zone 1, having a radius of 3.794 cm. |
|  | Isotope 2 (240Pu) | 5.9411 × 10−2 |  |  |
|  | Isotope 3 (69Ga) | 1.5152 × 10−3 |  |  |
|  | Isotope 4 (71Ga) | 1.0346 × 10−3 |  |  |
| Material 2 (polyethylene) | Isotope 5 (C) | 8.5630 × 10−1 | 0.95 | Material 2 is assigned to zone 2, which has an inner radius of 3.794 cm and an outer radius of 7.604 cm. |
|  | Isotope 6 (1H) | 1.4370 × 10−1 |  |  |
The neutron flux distribution within the PERP benchmark is computed using the PARTISN [25] multigroup discrete ordinates transport code, which solves the following multigroup approximation of the neutron transport equation, with a spontaneous fission source provided by the code SOURCES4C [26]:
$$B^{g}(\boldsymbol{\alpha})\,\varphi^{g}(r,\boldsymbol{\Omega}) = Q^{g}(r), \quad g=1,\ldots,G,$$
$$\varphi^{g}(r_{d},\boldsymbol{\Omega}) = 0, \quad r_{d}\in S_{b};\; \boldsymbol{\Omega}\cdot\mathbf{n}<0;\; g=1,\ldots,G,$$
where r d denotes the radius of the spherical benchmark, and
$$B^{g}(\boldsymbol{\alpha})\,\varphi^{g}(r,\boldsymbol{\Omega}) \triangleq \boldsymbol{\Omega}\cdot\boldsymbol{\nabla}\varphi^{g}(r,\boldsymbol{\Omega}) + \Sigma_{t}^{g}(r)\,\varphi^{g}(r,\boldsymbol{\Omega}) - \sum_{g'=1}^{G}\int_{4\pi}\Sigma_{s}^{g'\rightarrow g}(r,\boldsymbol{\Omega}'\rightarrow\boldsymbol{\Omega})\,\varphi^{g'}(r,\boldsymbol{\Omega}')\,d\boldsymbol{\Omega}' - \chi^{g}(r)\sum_{g'=1}^{G}\int_{4\pi}(\nu\Sigma)_{f}^{g'}(r)\,\varphi^{g'}(r,\boldsymbol{\Omega}')\,d\boldsymbol{\Omega}',$$
Q g ( α ; r ) m = 1 M i = 1 N f λ i N i , m F i S F ν i S F 2 π a i 3 b i e a i b i 4 E g + 1 E g d E   e E / a i sinh b i E .
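The Watt-spectrum factor in the source term above can be checked numerically. The sketch below (with illustrative parameter values a and b, not the evaluated benchmark parameters $a_i$, $b_i$) verifies that the normalization constant $2e^{-ab/4}/\sqrt{\pi a^3 b}$ makes the spectrum integrate to unity, so that the integral between two group boundaries is the fraction of the spontaneous-fission source emitted into that group.

```python
import math

def watt_pdf(E, a, b):
    """Watt fission-spectrum density C * exp(-E/a) * sinh(sqrt(b*E)),
    with C = 2*exp(-a*b/4)/sqrt(pi*a**3*b) so the density integrates to 1."""
    C = 2.0 * math.exp(-a * b / 4.0) / math.sqrt(math.pi * a**3 * b)
    return C * math.exp(-E / a) * math.sinh(math.sqrt(b * E))

def group_fraction(E_lo, E_hi, a, b, n=2000):
    """Trapezoidal integral of the Watt density over one energy interval (MeV)."""
    h = (E_hi - E_lo) / n
    s = 0.5 * (watt_pdf(E_lo, a, b) + watt_pdf(E_hi, a, b))
    s += sum(watt_pdf(E_lo + k * h, a, b) for k in range(1, n))
    return s * h

# Illustrative (hypothetical) Watt parameters: a = 0.8 MeV, b = 4.0 MeV^-1.
a, b = 0.8, 4.0
total = group_fraction(0.0, 30.0, a, b)  # integrating over essentially all energies
print(total)  # close to 1: the Watt density is normalized
```

Replacing the integration limits with actual group boundaries $E^{g+1}$, $E^g$ yields the per-group spectral fractions that multiply $\lambda_i N_{i,m} F_i^{SF} \nu_i^{SF}$ in the source definition.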
The total leakage from the system is considered to be the paradigm response of interest for sensitivity analysis. Sensitivity analyses of counting rates and other responses can be performed in an analogous manner. Mathematically, the total neutron leakage from a one-dimensional sphere or slab nuclear system, which is denoted as L ( α ) , is defined as follows:
$$L(\boldsymbol{\alpha}) \equiv \int_V dV \int_0^\infty dE \int_{\boldsymbol{\Omega}\cdot\mathbf{n}>0} d\Omega\;\boldsymbol{\Omega}\cdot\mathbf{n}\;\delta(\mathbf{r}-\mathbf{r}_s)\,\varphi(\mathbf{r},E,\boldsymbol{\Omega}) = \int_{S_b} dS \sum_{g=1}^{G}\int_{\boldsymbol{\Omega}\cdot\mathbf{n}>0} d\Omega\;\boldsymbol{\Omega}\cdot\mathbf{n}\;\varphi^g(\mathbf{r},\boldsymbol{\Omega}).$$
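In a discrete-ordinates setting, the outgoing-hemisphere integral above becomes a quadrature sum of weighted partial currents over the bounding sphere. A minimal sketch, using a hypothetical isotropic unit boundary flux in place of the PARTISN-computed group fluxes:

```python
import math

# Quadrature estimate of the leakage for a sphere of radius r2:
#   L = (4*pi*r2**2) * sum_g sum_{mu_n > 0} w_n * mu_n * phi_g(r2, mu_n).
r2 = 7.604  # outer radius of the PERP benchmark (cm)
G = 3       # a toy number of energy groups for illustration

# Simple midpoint quadrature on the outgoing-direction cosine mu in (0, 1]
N_mu = 64
nodes = [(k + 0.5) / N_mu for k in range(N_mu)]
weights = [1.0 / N_mu] * N_mu

def phi(g, mu):
    # hypothetical isotropic boundary flux, one unit per group (not PARTISN output)
    return 1.0

area = 4.0 * math.pi * r2**2
leakage = area * sum(
    w * mu * phi(g, mu) for g in range(G) for mu, w in zip(nodes, weights)
)
# For an isotropic unit flux, sum_n w_n * mu_n equals the integral of mu
# over (0, 1), i.e. 1/2, so leakage = area * G * 0.5 here.
print(leakage)
```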
The quantities appearing in the above equations are defined as follows:
1. The quantity r denotes a spatial point within the one-dimensional domain; r s denotes a point on the outer boundary (denoted as S b ); g is an energy group index, and G is the total number of energy groups; Ω denotes the solid-angle independent variable while n denotes the outward-pointing normal to the domain’s outside surface;
2. The quantity φ g ( r , Ω ) denotes the “group-flux” for group   g and is the unknown state-function obtained by solving the transport equation;
3. The neutron source Q g ( α ; r ) is an energy- and material-dependent source of spontaneous fission neutrons occurring in a homogeneous material, in which, the quantity N f denotes the total number of spontaneous-fission isotopes. The spontaneous-fission neutron spectrum for actinide nuclide i is approximated by a Watt’s fission spectrum while using two evaluated parameters denoted as ai and b i , respectively. The quantities λi, F i S F and ν i S F denote the decay constant, the spontaneous-fission branching fraction, and the average number of neutrons per spontaneous fission, respectively;
4. The quantity N i , m denotes the atom density of isotope i in material m; i = 1 , , I , m = 1 , , M , where I denotes the total number of isotopes, and M denotes the total number of materials;
5. The quantity $\Sigma_s^{g'\rightarrow g}(\mathbf{r},\boldsymbol{\Omega}'\cdot\boldsymbol{\Omega})$ represents the macroscopic scattering transfer cross-section from energy group $g'$, $g'=1,\dots,G$, into energy group $g$, $g=1,\dots,G$. The transfer cross-section is computed in terms of the $l$-th-order Legendre coefficients $\sigma_{s,l,i}^{g'\rightarrow g}$ (of the Legendre-expanded microscopic scattering cross-section from energy group $g'$ into energy group $g$ for isotope $i$), which are tabulated parameters, using the following finite-order expansion:
$$\Sigma_s^{g'\rightarrow g}(\mathbf{r},\boldsymbol{\Omega}'\cdot\boldsymbol{\Omega}) = \sum_{m=1}^{M}\Sigma_{s,m}^{g'\rightarrow g}(\mathbf{r},\boldsymbol{\Omega}'\cdot\boldsymbol{\Omega}), \quad \Sigma_{s,m}^{g'\rightarrow g}(\mathbf{r},\boldsymbol{\Omega}'\cdot\boldsymbol{\Omega}) \equiv \sum_{i=1}^{I} N_{i,m}\sum_{l=0}^{ISCT}(2l+1)\,\sigma_{s,l,i}^{g'\rightarrow g}(\mathbf{r})\,P_l(\boldsymbol{\Omega}'\cdot\boldsymbol{\Omega}), \quad m=1,\dots,M,$$
where $ISCT$ denotes the order of the finite expansion in Legendre polynomials;
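A minimal sketch of evaluating such a finite Legendre expansion at a given scattering cosine, using Bonnet's recurrence for the polynomials (the moment values below are illustrative placeholders, not evaluated nuclear data):

```python
def legendre_values(mu0, lmax):
    """P_0..P_lmax at mu0 via Bonnet's recurrence:
    (l+1) P_{l+1} = (2l+1) mu0 P_l - l P_{l-1}."""
    P = [1.0, mu0]
    for l in range(1, lmax):
        P.append(((2 * l + 1) * mu0 * P[l] - l * P[l - 1]) / (l + 1))
    return P[: lmax + 1]

def sigma_s(mu0, moments):
    """Sigma_s(mu0) = sum_l (2l+1) * sigma_l * P_l(mu0); ISCT = len(moments) - 1."""
    P = legendre_values(mu0, len(moments) - 1)
    return sum((2 * l + 1) * m * p for l, (m, p) in enumerate(zip(moments, P)))

iso = sigma_s(0.3, [0.2])                        # isotropic: only the l = 0 moment
aniso = sigma_s(0.3, [0.2, 0.05, 0.01, 0.002])   # an ISCT = 3 expansion
print(iso, aniso)  # the isotropic value does not depend on mu0
```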
6. The total cross-section Σ t g for energy group g ,   g = 1 , , G , is defined as:
$$\Sigma_t^g = \sum_{m=1}^{M}\Sigma_{t,m}^g;\qquad \Sigma_{t,m}^g = \sum_{i=1}^{I} N_{i,m}\,\sigma_{t,i}^g = \sum_{i=1}^{I} N_{i,m}\Big[\sigma_{f,i}^g + \sigma_{c,i}^g + \sum_{g'=1}^{G}\sigma_{s,l=0,i}^{g\rightarrow g'}\Big],$$
where $\sigma_{f,i}^g$ and $\sigma_{c,i}^g$ denote, respectively, the tabulated group microscopic fission and neutron capture cross-sections for group $g$. The weight fraction of each isotope is accounted for in the respective number densities, which enter the corresponding macroscopic cross-sections. The above expression for $\Sigma_t^g$ indicates that the zeroth-order (i.e., $l=0$) scattering cross-sections must be considered separately from the higher-order (i.e., $l \geq 1$) scattering cross-sections, since the former contribute to the total cross-sections while the latter do not;
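The assembly of $\Sigma_t^g$ from isotopic data can be sketched as follows; the isotope names, cross-sections, and number densities are illustrative placeholders, and, as noted above, only the $l=0$ scattering moments enter the sum:

```python
# Assembling Sigma_t^g = sum_m sum_i N_{i,m} (sigma_f + sigma_c + sum_g' sigma_{s,l=0}^{g->g'})
# from per-isotope data. All numbers are illustrative placeholders, not PERP data.
G = 2  # toy number of energy groups

isotopes = {
    "iso_A": {"N": 0.04, "sf": [1.8, 1.9], "sc": [0.1, 0.3],
              "s0": [[4.0, 0.5], [0.0, 4.5]]},   # s0[g][g']: l = 0 scattering matrix
    "iso_B": {"N": 0.08, "sf": [0.0, 0.0], "sc": [0.02, 0.1],
              "s0": [[2.0, 1.0], [0.0, 3.0]]},
}

def sigma_t(g):
    """Macroscopic total cross-section for group g (0-based)."""
    total = 0.0
    for d in isotopes.values():
        scat = sum(d["s0"][g])  # only the l = 0 moments contribute to Sigma_t
        total += d["N"] * (d["sf"][g] + d["sc"][g] + scat)
    return total

print([sigma_t(g) for g in range(G)])
```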
7. PARTISN computes the quantity ( ν Σ f ) g using the quantities ( ν σ f ) i g , which are provided in data files for each isotope i and energy group g as follows:
$$(\nu\Sigma_f)^g = \sum_{m=1}^{M}(\nu\Sigma_f)_m^g;\qquad (\nu\Sigma_f)_m^g = \sum_{i=1}^{I} N_{i,m}\,(\nu\sigma_f)_i^g.$$
For the purposes of sensitivity analysis, the quantity $\nu_i^g$, which denotes the number of neutrons produced per fission by isotope $i$ in energy group $g$, can be obtained by using the relation $\nu_i^g = (\nu\sigma_f)_i^g / \sigma_{f,i}^g$, where the isotopic fission cross-sections $\sigma_{f,i}^g$ are available in data files for computing reaction rates;
8. The quantity χ g denotes the fission spectrum in the energy group   g ; it is defined in PARTISN as a space-independent quantity, as follows:
$$\chi^g \equiv \frac{\sum_{i=1}^{N_f}\chi_i^g\,N_{i,m}\sum_{g'=1}^{G}(\nu\sigma_f)_i^{g'} f_i^{g'}}{\sum_{i=1}^{N_f} N_{i,m}\sum_{g'=1}^{G}(\nu\sigma_f)_i^{g'} f_i^{g'}},\qquad \text{with}\quad \sum_{g=1}^{G}\chi_i^g = 1,$$
where χ i g denotes the isotopic fission spectrum in group   g , while f i g denotes the corresponding spectrum weighting function;
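A small sketch of this weighted average, with hypothetical isotopic spectra $\chi_i^g$ (each normalized to unity) and hypothetical weighting data, illustrates that $\chi^g$ is a convex combination of the isotopic spectra and therefore also sums to unity over the groups:

```python
# Space-independent fission spectrum chi^g as the (nu*sigma_f, f)-weighted
# average of isotopic spectra. All data below are illustrative placeholders.
G = 3
chi_iso = {                      # isotopic fission spectra; each sums to 1
    "Pu239": [0.6, 0.3, 0.1],
    "Pu240": [0.5, 0.35, 0.15],
}
N = {"Pu239": 0.037, "Pu240": 0.002}                           # number densities
nu_sf = {"Pu239": [5.2, 3.1, 2.9], "Pu240": [5.0, 3.0, 2.8]}   # (nu*sigma_f)_i^g
f = {"Pu239": [1.0, 1.0, 1.0], "Pu240": [1.0, 1.0, 1.0]}       # weighting spectra

def chi(g):
    num = sum(chi_iso[i][g] * N[i] * sum(nu_sf[i][gp] * f[i][gp] for gp in range(G))
              for i in chi_iso)
    den = sum(N[i] * sum(nu_sf[i][gp] * f[i][gp] for gp in range(G))
              for i in chi_iso)
    return num / den

spectrum = [chi(g) for g in range(G)]
print(spectrum)  # a convex combination of the isotopic spectra; sums to 1
```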
9. The vector α , which appears in the expression of the Boltzmann operator B g ( α ) , represents the “vector of imprecisely known model parameters”, comprising the following components:
(i)
The total cross-section Σ t g Σ t g ( t ) is characterized by the vector of parameters t , which is defined as follows:
$$\mathbf{t} \equiv [t_1,\dots,t_{J_t}]^\dagger \equiv [t_1,\dots,t_{J_{\sigma t}};\,n_1,\dots,n_{J_n}]^\dagger \equiv [\boldsymbol{\sigma}_t;\,\mathbf{N}]^\dagger,\qquad J_t \equiv J_{\sigma t} + J_n,$$
$$\boldsymbol{\sigma}_t \equiv [t_1,\dots,t_{J_{\sigma t}}]^\dagger \equiv [\sigma_{t,i=1}^1,\,\sigma_{t,i=1}^2,\dots,\sigma_{t,i=1}^G,\dots,\sigma_{t,i}^g,\dots,\sigma_{t,i=I}^1,\dots,\sigma_{t,i=I}^G]^\dagger,\quad i=1,\dots,I;\ g=1,\dots,G;\ J_{\sigma t} = I\times G,$$
$$\mathbf{N} \equiv [n_1,\dots,n_{J_n}]^\dagger \equiv [N_{i=1,m=1},\,N_{i=2,m=1},\dots,N_{i,m},\dots,N_{i=I,m=M}]^\dagger,\qquad J_n = I\times M,$$
where the dagger denotes “transposition”; σ t , i g denotes the microscopic total cross-section for isotope i and energy group g ; N i , m denotes the respective isotopic number density; J σ t denotes the total number of microscopic total cross-sections, and J n denotes the total number of isotopic number densities in the model;
(ii)
The scattering cross-section Σ s g g ( Ω · Ω ) Σ s g g ( s ; Ω · Ω ) is characterized by the vector of parameters s , which is defined as follows:
$$\mathbf{s} \equiv [s_1,\dots,s_{J_s}]^\dagger \equiv [s_1,\dots,s_{J_{\sigma s}};\,n_1,\dots,n_{J_n}]^\dagger \equiv [\boldsymbol{\sigma}_s;\,\mathbf{N}]^\dagger,\qquad J_s \equiv J_{\sigma s} + J_n,$$
where
$$\boldsymbol{\sigma}_s \equiv [s_1,\dots,s_{J_{\sigma s}}]^\dagger \equiv [\sigma_{s,l=0,i=1}^{g'=1\rightarrow g=1},\,\sigma_{s,l=0,i=1}^{g'=2\rightarrow g=1},\dots,\sigma_{s,l,i}^{g'\rightarrow g},\dots,\sigma_{s,l=ISCT,i=I}^{G\rightarrow G}]^\dagger,\quad i=1,\dots,I;\ g,g'=1,\dots,G;\ l=0,\dots,ISCT;\ J_{\sigma s} = (G\times G)\times I\times(ISCT+1).$$
(iii)
The quantity ( ν Σ f ) g [ ν Σ f ( f ) ] g is characterized by the vector of parameters f , which is defined as follows:
$$\mathbf{f} \equiv [f_1,\dots,f_{J_{\sigma f}};\,f_{J_{\sigma f}+1},\dots,f_{J_{\sigma f}+J_\nu};\,f_{J_{\sigma f}+J_\nu+1},\dots,f_{J_f}]^\dagger \equiv [\boldsymbol{\sigma}_f;\,\boldsymbol{\nu};\,\mathbf{N}]^\dagger,\qquad J_f = J_{\sigma f} + J_\nu + J_n,$$
with
$$\boldsymbol{\sigma}_f \equiv [\sigma_{f,i=1}^1,\,\sigma_{f,i=1}^2,\dots,\sigma_{f,i=1}^G,\dots,\sigma_{f,i}^g,\dots,\sigma_{f,i=N_f}^1,\dots,\sigma_{f,i=N_f}^G]^\dagger \equiv [f_1,\dots,f_{J_{\sigma f}}]^\dagger,\quad i=1,\dots,N_f;\ g=1,\dots,G;\ J_{\sigma f} = G\times N_f,$$
$$\boldsymbol{\nu} \equiv [\nu_{i=1}^1,\,\nu_{i=1}^2,\dots,\nu_{i=1}^G,\dots,\nu_i^g,\dots,\nu_{i=N_f}^1,\dots,\nu_{i=N_f}^G]^\dagger \equiv [f_{J_{\sigma f}+1},\dots,f_{J_{\sigma f}+J_\nu}]^\dagger,\quad i=1,\dots,N_f;\ g=1,\dots,G;\ J_\nu = G\times N_f.$$
(iv)
The fission spectrum is considered to depend on the vector of parameters p , being defined as follows:
$$\mathbf{p} \equiv [p_1,\dots,p_{J_p}]^\dagger \equiv [\chi_{i=1}^{g=1},\,\chi_{i=1}^{g=2},\dots,\chi_{i=1}^{G},\dots,\chi_i^g,\dots,\chi_{N_f}^{G}]^\dagger,\quad i=1,\dots,N_f;\ g=1,\dots,G;\ J_p = G\times N_f.$$
(v)
The source Q g ( r ) Q g ( q ; N ) depends on the vector of model parameters q , which is defined as follows:
$$\mathbf{q} \equiv [q_1,\dots,q_{J_q}]^\dagger \equiv [\lambda_1,\dots,\lambda_{N_f};\,F_1^{SF},\dots,F_{N_f}^{SF};\,a_1,\dots,a_{N_f};\,b_1,\dots,b_{N_f};\,\nu_1^{SF},\dots,\nu_{N_f}^{SF}]^\dagger,\qquad J_q = 5\times N_f.$$
In summary, the model parameters can all be considered to be the components of the “vector of model parameters” α , which is defined below:
$$\boldsymbol{\alpha} \equiv [\alpha_1,\dots,\alpha_{J_\alpha}]^\dagger \equiv [\boldsymbol{\sigma}_t;\,\boldsymbol{\sigma}_s;\,\boldsymbol{\sigma}_f;\,\boldsymbol{\nu};\,\mathbf{p};\,\mathbf{q};\,\mathbf{N}]^\dagger,\qquad J_\alpha = J_{\sigma t} + J_{\sigma s} + J_{\sigma f} + J_\nu + J_p + J_q + J_n.$$
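Assuming the dimensions used for this benchmark in the cited works (G = 30 energy groups, I = 6 isotopes, a scattering expansion of order ISCT = 3, $N_f$ = 2 spontaneously fissioning actinides, and six nonzero isotopic number densities, since each isotope occurs in a single material), the component counts add up to the 21,976 imprecisely known parameters quoted for the PERP benchmark:

```python
# Counting the components of alpha for the PERP benchmark under the assumed
# dimensions G = 30, I = 6, ISCT = 3, N_f = 2, and 6 nonzero number densities.
G, I, ISCT, Nf, Jn = 30, 6, 3, 2, 6

J_sigma_t = I * G                      # microscopic total cross-sections
J_sigma_s = G * G * I * (ISCT + 1)     # Legendre scattering moments
J_sigma_f = G * Nf                     # microscopic fission cross-sections
J_nu      = G * Nf                     # average neutrons per fission
J_p       = G * Nf                     # fission-spectrum parameters
J_q       = 5 * Nf                     # spontaneous-fission source parameters

J_alpha = J_sigma_t + J_sigma_s + J_sigma_f + J_nu + J_p + J_q + Jn
print(J_alpha)  # 21976, the parameter count quoted for the benchmark
```

Note that the scattering moments alone (21,600) account for the overwhelming majority of the parameters, which is why conventional high-order sensitivity methods become impractical for this benchmark.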

References

1. Cacuci, D.G. Sensitivity theory for nonlinear systems: I. Nonlinear functional analysis approach. J. Math. Phys. 1981, 22, 2794–2802.
2. Cacuci, D.G. Sensitivity theory for nonlinear systems: II. Extensions to additional classes of responses. J. Math. Phys. 1981, 22, 2803–2812.
3. Práger, T.; Kelemen, F.D. Adjoint methods and their application in earth sciences. In Advanced Numerical Methods for Complex Environmental Models: Needs and Availability; Faragó, I., Havasi, Á., Zlatev, Z., Eds.; Bentham Science Publishers: Oak Park, IL, USA, 2014; Chapter 4A; pp. 203–275.
4. Luo, Z.; Wang, X.; Liu, D. Prediction on the static response of structures with large-scale uncertain-but-bounded parameters based on the adjoint sensitivity analysis. Struct. Multidiscip. Optim. 2020, 61, 123–139.
5. Bellman, R.E. Dynamic Programming; Rand Corporation; Princeton University Press: Princeton, NJ, USA, 1957; ISBN 978-0-691-07951-6.
6. Cacuci, D.G. Second-order adjoint sensitivity analysis methodology for computing exactly and efficiently first- and second-order sensitivities in large-scale linear systems: I. Computational methodology. J. Comp. Phys. 2015, 284, 687–699.
7. Cacuci, D.G. Second-Order Adjoint Sensitivity Analysis Methodology (2nd-ASAM) for Large-Scale Nonlinear Systems: I. Theory. Nucl. Sci. Eng. 2016, 184, 16–30.
8. Cacuci, D.G. The Second-Order Adjoint Sensitivity Analysis Methodology; CRC Press, Taylor & Francis Group: Boca Raton, FL, USA, 2018.
9. Cacuci, D.G.; Fang, R.; Favorite, J.A. Comprehensive Second-Order Adjoint Sensitivity Analysis Methodology (2nd-ASAM) Applied to a Subcritical Experimental Reactor Physics Benchmark: I. Effects of Imprecisely Known Microscopic Total and Capture Cross Sections. Energies 2019, 12, 4219.
10. Fang, R.; Cacuci, D.G. Comprehensive Second-Order Adjoint Sensitivity Analysis Methodology (2nd-ASAM) Applied to a Subcritical Experimental Reactor Physics Benchmark: II. Effects of Imprecisely Known Microscopic Scattering Cross Sections. Energies 2019, 12, 4114.
11. Cacuci, D.G.; Fang, R.; Favorite, J.A.; Badea, M.C.; Di Rocco, F. Comprehensive Second-Order Adjoint Sensitivity Analysis Methodology (2nd-ASAM) Applied to a Subcritical Experimental Reactor Physics Benchmark: III. Effects of Imprecisely Known Microscopic Fission Cross Sections and Average Number of Neutrons per Fission. Energies 2019, 12, 4100.
12. Fang, R.; Cacuci, D.G. Comprehensive Second-Order Adjoint Sensitivity Analysis Methodology (2nd-ASAM) Applied to a Subcritical Experimental Reactor Physics Benchmark. IV: Effects of Imprecisely Known Source Parameters. Energies 2020, 13, 1431.
13. Fang, R.; Cacuci, D.G. Comprehensive Second-Order Adjoint Sensitivity Analysis Methodology (2nd-ASAM) Applied to a Subcritical Experimental Reactor Physics Benchmark: V. Computation of 2nd-Order Sensitivities Involving Isotopic Number Densities. Energies 2020, 13, 2580.
14. Cacuci, D.G.; Fang, R.; Favorite, J.A. Comprehensive Second-Order Adjoint Sensitivity Analysis Methodology (2nd-ASAM) Applied to a Subcritical Experimental Reactor Physics Benchmark: VI. Overall Impact of 1st- and 2nd-Order Sensitivities. Energies 2020, 13, 1674.
15. Valentine, T.E. Polyethylene-reflected plutonium metal sphere subcritical noise measurements, SUB-PU-METMIXED-001. In International Handbook of Evaluated Criticality Safety Benchmark Experiments, NEA/NSC/DOC(95)03/I-IX; Organization for Economic Co-Operation and Development (OECD), Nuclear Energy Agency (NEA): Paris, France, 2006.
16. Fang, R.; Cacuci, D.G. Third-Order Adjoint Sensitivity Analysis of an OECD/NEA Reactor Physics Benchmark: II. Computed Sensitivities. Am. J. Comput. Math. 2020, 10, 529–558.
17. Fang, R.; Cacuci, D.G. Third-Order Adjoint Sensitivity Analysis of an OECD/NEA Reactor Physics Benchmark: III. Response Moments. Am. J. Comput. Math. 2020, 10, 559–570.
18. Fang, R.; Cacuci, D.G. Fourth-Order Adjoint Sensitivity and Uncertainty Analysis of an OECD/NEA Reactor Physics Benchmark: I. Computed Sensitivities. J. Nucl. Eng. 2021, 2, 281–308.
19. Cacuci, D.G. The nth-Order Comprehensive Adjoint Sensitivity Analysis Methodology for Response-Coupled Forward/Adjoint Linear Systems (nth-CASAM-L): I. Mathematical Framework. Energies 2021, 14, 8314.
20. Cacuci, D.G. The nth-Order Comprehensive Adjoint Sensitivity Analysis Methodology for Nonlinear Systems (nth-CASAM-N): Mathematical Framework. J. Nucl. Eng. 2022, 3, 163–190.
21. Cacuci, D.G. The nth-Order Comprehensive Adjoint Sensitivity Analysis Methodology (nth-CASAM): Overcoming the Curse of Dimensionality in Sensitivity and Uncertainty Analysis, Volume I: Linear Systems; Springer: Cham, Switzerland, 2022; 362p.
22. Cacuci, D.G.; Fang, R. The nth-Order Comprehensive Adjoint Sensitivity Analysis Methodology (nth-CASAM): Overcoming the Curse of Dimensionality in Sensitivity and Uncertainty Analysis, Volume II: Application to a Large-Scale System; Springer: Cham, Switzerland, 2023; 463p.
23. Cacuci, D.G. The nth-Order Comprehensive Adjoint Sensitivity Analysis Methodology (nth-CASAM): Overcoming the Curse of Dimensionality in Sensitivity and Uncertainty Analysis, Volume III: Nonlinear Systems; Springer: Cham, Switzerland, 2023; 369p.
24. Cacuci, D.G. Computation of High-Order Sensitivities of Model Responses to Model Parameters. II: Introducing the High-Order Adjoint Sensitivity Analysis Methodology for Computing Response Sensitivities to Functions (“Features”) of Parameters. Energies 2023, submitted.
25. Alcouffe, R.E.; Baker, R.S.; Dahl, J.A.; Turner, S.A.; Ward, R. PARTISN: A Time-Dependent, Parallel Neutral Particle Transport Code System; LA-UR-08-07258; Los Alamos National Laboratory: Los Alamos, NM, USA, 2008.
26. Wilson, W.B.; Perry, R.T.; Shores, E.F.; Charlton, W.S.; Parish, T.A.; Estes, G.P.; Brown, T.H.; Arthur, E.D.; Bozoian, M.; England, T.R.; et al. SOURCES4C: A Code for Calculating (α,n), Spontaneous Fission, and Delayed Neutron Sources and Spectra. In Proceedings of the American Nuclear Society/Radiation Protection and Shielding Division 12th Biennial Topical Meeting, Santa Fe, NM, USA, 14–18 April 2002; LA-UR-02-1839.
27. Tukey, J.W. The Propagation of Errors, Fluctuations and Tolerances; Technical Reports No. 10-12; Princeton University: Princeton, NJ, USA, 1957.
28. Cacuci, D.G. Second-Order MaxEnt Predictive Modelling Methodology. I: Deterministically Incorporated Computational Model (2nd-BERRU-PMD). Am. J. Comp. Math. 2023, 16, 5552.
29. Cacuci, D.G. Second-Order MaxEnt Predictive Modelling Methodology. II: Probabilistically Incorporated Computational Model (2nd-BERRU-PMP). Am. J. Comp. Math. 2023, 16, 5614.
30. Jaynes, E.T. Information theory and statistical mechanics. Phys. Rev. 1957, 106, 620.
31. Fang, R.; Cacuci, D.G. Second-Order MaxEnt Predictive Modelling Methodology. III: Illustrative Application to a Reactor Physics Benchmark. Am. J. Comp. Math. 2023, accepted.
32. Lewis, J.M.; Lakshmivarahan, S.; Dhall, S.K. Dynamic Data Assimilation: A Least Square Approach; Cambridge University Press: Cambridge, UK, 2006.
33. Cacuci, D.G.; Navon, M.I.; Ionescu-Bujor, M. Computational Methods for Data Evaluation and Assimilation; Chapman & Hall/CRC: Boca Raton, FL, USA, 2014.
34. Fang, R.; Cacuci, D.G. 4th-Order-SENS: A Software Module for Efficient and Exact 4th-Order Sensitivity Analysis of Neutron Particle Transport. Nucl. Sci. Eng. 2023, under review.
Figure 1. Histogram plot of the PERP benchmark leakage response for each energy group.
Figure 2. Illustration of the absolute values of the first-order through fourth-order unmixed relative sensitivities for isotope 6 (1H) of the PERP benchmark.
Figure 3. Results for (i) $E(L^c) \pm SD^{(1)}$ (in green), $L^{be} \pm SD^{(be,1)}$ (in red), $L^e \pm SD^{(e)}$ (in blue), when only the 1st-order sensitivities are considered; (ii) $E(L^c) \pm SD^{(2)}$ (in green), $L^{be} \pm SD^{(be,2)}$ (in red), $L^e \pm SD^{(e)}$ (in blue), when the 1st- + 2nd-order sensitivities are included.
Figure 4. Results for $E(L^c) \pm SD^{(4)}$ (in green), $L^{be} \pm SD^{(be,4)}$ (in red), $L^e \pm SD^{(e)}$ (in blue), when the 1st- + 2nd- + 3rd- + 4th-order sensitivities are included.
Figure 5. Results for $S^{(4)}(\sigma_{t,6}^{30}, \sigma_{t,6}^{30}, \sigma_{t,6}^{30}, \sigma_{t,5}^{30})$ obtained using finite differences (FD) with various step sizes.
Table 1. Expected response value, variance, and skewness for 1% relative standard deviations (RSD) of the normally distributed and uncorrelated microscopic total cross-sections.

| Expected Value | Variance | 3rd-Order Moment and Skewness |
|---|---|---|
| $L(\alpha^0) = 1.765 \times 10^6$ | $[\mathrm{var}(L)]_t^{(1,U,N)} = 3.419 \times 10^{10}$ | $[\mu_3(L)]_t^{(2,U,N)} = 6.663 \times 10^{15}$ |
| $[E(L)]_t^{(2,U,N)} = 4.598 \times 10^4$ | $[\mathrm{var}(L)]_t^{(2,U,N)} = 2.879 \times 10^9$ | $[\mu_3(L)]_t^{(3,U,N)} = 3.948 \times 10^{15}$ |
| | $[\mathrm{var}(L)]_t^{(3,U,N)} = 9.841 \times 10^9$ | $[\mu_3(L)]_t^{(4,U,N)} = 1.973 \times 10^{15}$ |
| $[E(L)]_t^{(4,U,N)} = 6.026 \times 10^3$ | $[\mathrm{var}(L)]_t^{(4,U,N)} = 1.825 \times 10^9$ | $[\mu_3(L)]_t^{(U,N)} = 1.258 \times 10^{16}$ |
| $[E(L)]_t^{(U,N)} = 1.817 \times 10^6$ | $[\mathrm{var}(L)]_t^{(U,N)} = 4.874 \times 10^{10}$ | $[\gamma_1(L)]_t^{(U,N)} = 1.169$ |
Table 2. Expected response value, variance, and skewness for 5% relative standard deviations (RSD) of the normally distributed and uncorrelated microscopic total cross-sections.

| Expected Value | Variance | 3rd-Order Moment and Skewness |
|---|---|---|
| $L(\alpha^0) = 1.765 \times 10^6$ | $[\mathrm{var}(L)]_t^{(1,U,N)} = 8.549 \times 10^{11}$ | $[\mu_3(L)]_t^{(2,U,N)} = 1.070 \times 10^{19}$ |
| $[E(L)]_t^{(2,U,N)} = 1.149 \times 10^6$ | $[\mathrm{var}(L)]_t^{(2,U,N)} = 1.799 \times 10^{12}$ | $[\mu_3(L)]_t^{(3,U,N)} = 6.169 \times 10^{19}$ |
| | $[\mathrm{var}(L)]_t^{(3,U,N)} = 2.338 \times 10^{13}$ | $[\mu_3(L)]_t^{(4,U,N)} = 3.083 \times 10^{19}$ |
| $[E(L)]_t^{(4,U,N)} = 3.766 \times 10^6$ | $[\mathrm{var}(L)]_t^{(4,U,N)} = 2.852 \times 10^{13}$ | $[\mu_3(L)]_t^{(U,N)} = 1.032 \times 10^{20}$ |
| $[E(L)]_t^{(U,N)} = 6.681 \times 10^6$ | $[\mathrm{var}(L)]_t^{(U,N)} = 5.456 \times 10^{13}$ | $[\gamma_1(L)]_t^{(U,N)} = 0.256$ |
Table 3. Computations of $S^{(4)}(\sigma_{t,6}^{g=30}, \sigma_{t,6}^{g=30}, \sigma_{t,6}^{g=30}, \sigma_{t,6}^{g=30})$ using various step-sizes.

| Step-Size $h_j$ | FD-Method | $(FD - 4^{th}\text{-}CASAM)/(4^{th}\text{-}CASAM)$ |
|---|---|---|
| 0.125% × $\sigma_{t,6}^{g=30}$ | −1.607 × 10^8 | −6010% |
| 0.25% × $\sigma_{t,6}^{g=30}$ | −7.488 × 10^6 | −375% |
| 0.50% × $\sigma_{t,6}^{g=30}$ | 2.239 × 10^6 | −17.7% |
| 0.60% × $\sigma_{t,6}^{g=30}$ | 2.708 × 10^6 | −0.45% |
| 0.65% × $\sigma_{t,6}^{g=30}$ | 2.838 × 10^6 | 4.3% |
| 0.75% × $\sigma_{t,6}^{g=30}$ | 3.063 × 10^6 | 12.6% |
| 1.00% × $\sigma_{t,6}^{g=30}$ | 3.574 × 10^6 | 31.4% |
| 2.00% × $\sigma_{t,6}^{g=30}$ | 2.171 × 10^7 | 698% |
| >2.5% × $\sigma_{t,6}^{g=30}$ | divergent | N/A |
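The behavior displayed in Table 3 is generic for high-order finite differences. The sketch below reproduces the step-size dilemma on a function whose fourth derivative is known exactly (f = exp), rather than on the benchmark response: truncation error grows with the step size, while roundoff is amplified by the $1/h^4$ division, leaving only a narrow window of usable step sizes.

```python
import math

def fd4(f, x, h):
    """Central finite-difference estimate of the 4th derivative:
    [f(x+2h) - 4f(x+h) + 6f(x) - 4f(x-h) + f(x-2h)] / h**4."""
    return (f(x + 2*h) - 4*f(x + h) + 6*f(x) - 4*f(x - h) + f(x - 2*h)) / h**4

# For f = exp, every derivative of f at x equals exp(x), so the error is known.
exact = math.exp(1.0)
errors = {h: abs(fd4(math.exp, 1.0, h) - exact) / exact for h in (1e-1, 1e-2, 1e-4)}
for h, err in errors.items():
    print(f"h = {h:8.0e}   relative error = {err:.2e}")
# A moderate step (h ~ 1e-2) is far more accurate than either extreme:
# h = 1e-1 suffers visible truncation error, h = 1e-4 is ruined by roundoff.
```

Since neither the optimal step size nor the achievable accuracy is known a priori for a computational model whose derivatives are unknown, this simple experiment illustrates why the accuracy of finite-difference high-order sensitivities cannot be guaranteed in advance.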