2. Mathematical Framework of the 1st-CASAM for Operator-Valued Responses of Coupled Systems Comprising Imprecisely Known Parameters, Interfaces and Boundaries
The system considered in this work comprises two nonlinear sub-systems, called “Subsystem I” and “Subsystem II”, respectively, which are coupled to one another across a common internal interface (boundary) in phase–space. The first subsystem is represented mathematically as follows:
Bold letters will be used in this work to denote matrices and vectors. Unless explicitly stated otherwise, the vectors in this work are considered to be column vectors. The second subsystem is represented mathematically as follows:
If differential operators appear in Equations (1) and (2), a corresponding set of boundary and/or initial/final conditions must also be given; these conditions can be represented in operator form as follows:
The quantities appearing in Equations (1)–(3) are defined as follows:
denotes a column vector having scalar-valued components representing all of the imprecisely known internal and boundary parameters of the physical systems, including imprecisely known parameters that characterize the interface and boundary conditions. Some of these parameters are common to both physical systems, e.g., the parameters that characterize common interfaces. These scalar parameters are considered to be subject to both random and systematic uncertainties, as is usually the case in practical applications. In order to use such parameters in practical computations, which is the scope of the methodology presented in this work, they are considered to be either “uncertain” or “imprecisely known”. “Uncertain” parameters are usually considered to follow a probability distribution having a known “mean value” and a known “standard deviation”. On the other hand, the actual values of “imprecisely known” parameters are unknown. To enable the use of such parameters in computations, “expert opinion” is invoked to assign each such imprecisely known parameter a “nominal value” (which plays the role of a “mean value”) and a “range of variation” (which plays the role of a standard deviation). For practical computations, the actual origin of the parameter’s nominal (or mean) value and of its assigned standard deviation is immaterial, which is why the qualifiers “uncertain” and “imprecisely known” are often used interchangeably. In this work, the superscript “zero” will be used to denote the known nominal or mean values of various quantities. In particular, the vector of nominal and/or mean parameter values will be denoted as . The symbol “” will be used to denote “is defined as” or “is by definition equal to”, and transposition will be indicated by a dagger superscript.
denotes the phase–space position vector, of dimension , of independent variables for the system defined in Equation (1). The vector of independent variables is defined on a phase–space domain denoted as , , and is therefore considered to depend on the uncertain parameters . The lower-valued imprecisely known boundary-point of the independent variable is denoted as , while the upper-valued imprecisely known boundary-point of the independent variable is denoted as . For physical systems modeled by diffusion theory, for example, the “vacuum boundary condition” requires that the particle flux vanish at the “extrapolated boundary” of the spatial domain facing the vacuum; the “extrapolated boundary” depends on the imprecisely known geometrical dimensions of the system’s domain in space and also on the system’s microscopic transport cross sections and atomic number densities. The boundary of the domain comprises all of the endpoints and of the intervals on which the respective components of are defined. It may happen that some components and/or are infinite, in which case they would not depend on any imprecisely known parameters.
denotes a -dimensional column vector whose components represent the system’s dependent variables (also called “state functions”). The vector-valued function is considered the unique nontrivial solution of the physical problem described by Equations (1) and (3).
denotes a column vector of dimensions whose components are operators that act nonlinearly on and .
denotes a -dimensional column vector whose elements represent inhomogeneous source terms that depend either linearly or nonlinearly on . The components of may involve operators (rather than just finite-dimensional functions) and distributions acting on and .
denotes the -dimensional phase–space position vector of independent variables for the physical system defined in Equation (2). The vector of independent variables is defined on a phase–space domain denoted as , which is defined as follows: . The lower-valued imprecisely known boundary-point of the independent variable is denoted as , while the upper-valued imprecisely known boundary-point of the independent variable is denoted as . Some or all of the points may coincide with the points . Additionally, some components of may coincide with some components of , in which case the respective lower and upper boundary points for the respective coinciding independent variables would also coincide correspondingly. The boundary of the domain comprises all of the endpoints and of the intervals on which the respective components of are defined.
denotes a -dimensional column vector whose components represent the system’s dependent variables (also called “state functions”). The vector-valued function is considered the unique nontrivial solution of the physical problem described by Equations (2) and (3).
denotes a column vector of dimensions whose components are operators acting nonlinearly on and .
denotes a -dimensional column vector whose elements represent inhomogeneous source terms that depend either linearly or nonlinearly on . The components of may involve operators and distributions acting on and .
The vector-valued operator comprises all of the boundary, interface, and initial/final conditions for the coupled physical systems. If the boundary, interface and/or initial/final conditions are inhomogeneous, which is most often the case, then .
Since and may involve operators and distributions acting on and , all of the equalities in this work, including Equations (1)–(3), are considered to hold in the weak (“distributional”) sense.
The nominal (or “base-case”) solutions of Equations (1)–(3), indicated by the superscript “zero”, are obtained by solving these equations at the nominal parameter values, i.e.,
The response considered in this work is a generic nonlinear function-valued operator, denoted as follows:
The nominal value of the response, denoted as , is determined by computing the response at the nominal values , and . The true values of the imprecisely known model, interface and boundary parameters may differ from their nominal (average, or “base-case”) values by variations denoted as , where , . In turn, the parameter variations will cause variations and in the state functions, and all of these variations will cause variations in the response around the nominal response value . Sensitivity analysis aims at computing the functional derivatives (called “sensitivities”) of the response to the imprecisely known parameters . Subsequently, these sensitivities can be used for a variety of purposes, including quantifying the uncertainties induced in responses by the uncertainties in the model and boundary parameters, and combining the uncertainties in computed responses with uncertainties in measured responses (“data assimilation”) to obtain more accurate predictions of responses and/or parameters (“model calibration”, “predictive modeling”, etc.).
As has been shown by Cacuci [1], the most general definition of the 1st-order total sensitivity of an operator-valued model response to parameter variations is provided by the first-order “Gateaux-variation” (G-variation) of the response under consideration. To determine the first G-variation of the response, it is convenient to denote the functions appearing in the argument of the response as being the components of a vector which represents an arbitrary “point” in the combined phase–space of the state functions and parameters; the point which corresponds to the nominal values of the state functions and parameters in this phase–space is indicated by the superscript “zero”. Analogously, it is convenient to consider the variations in the model’s state functions and parameters to be the components of a “vector of variations”. The 1st-order Gateaux- (G-) variation of the response, for arbitrary variations in the model parameters and state functions in a neighborhood around the nominal point, is obtained, by definition, as follows:
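The defining limit for the G-variation can be mimicked numerically. The following minimal Python sketch is illustrative only: the response R and the nominal point are assumptions introduced here, not quantities from the model above. It approximates the 1st-order G-variation by differentiating the response along the direction of an arbitrary vector of variations:

```python
import numpy as np

def gateaux_variation(R, e0, h, eps=1e-6):
    """Central-difference approximation of the 1st-order G-variation
    dR(e0; h) = d/d(eps) R(e0 + eps*h) evaluated at eps = 0."""
    return (R(e0 + eps * h) - R(e0 - eps * h)) / (2.0 * eps)

# Illustrative nonlinear response of one "state" value and one "parameter":
# R(e) = e1^2 * e2  (an assumption for this sketch).
R = lambda e: e[0] ** 2 * e[1]
e0 = np.array([2.0, 3.0])   # nominal "point" (state, parameter)
h  = np.array([0.1, 0.2])   # arbitrary vector of variations

dR = gateaux_variation(R, e0, h)
# Analytically: dR = 2*e1*e2*h1 + e1^2*h2 = 1.2 + 0.8 = 2.0
```

Because R is smooth here, the computed G-variation is linear in the vector of variations h, which anticipates the linearity conditions discussed below.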
The unknown variations in the state functions are related to the parameter variations through the equations obtained by applying the definition of the G-differential to the equations underlying the coupled nonlinear systems, i.e., Equations (1)–(3), which yields the following relations:
Performing in Equations (9)–(11) the differentiations with respect to the scalar that multiplies the vector of variations in the definition of the G-variation, and setting this scalar to zero in the resulting expression, yields the following system of equations:
The system of equations comprising Equations (12)–(14) is called the “First-Level Forward Sensitivity System” (1st-LFSS) and could be solved to obtain the variations and in the state functions in terms of the parameter variations which appear as sources in the 1st-LFSS equations. Subsequently, the variations and thus obtained could be used to compute the total sensitivity defined in Equation (8).
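The structure of the 1st-LFSS can be illustrated on a deliberately simple scalar analogue (an assumed decay equation du/dt = −αu, which is not one of the coupled systems above): differentiating the model with respect to its parameter yields a linear forward sensitivity equation in which the forward solution acts as a source, mirroring the role of the parameter variations as sources in Equations (12)–(14):

```python
import numpy as np

# Toy model (analogue of Eq. (1)):  du/dt = -alpha*u,  u(0) = u0.
# Forward sensitivity (1st-LFSS analogue) for s = du/d(alpha):
#   ds/dt = -alpha*s - u,  s(0) = 0,
# i.e., a LINEAR equation for s with the known solution u as a source.
def solve_forward_and_lfss(alpha, u0, T=1.0, n=20000):
    dt = T / n
    u, s = u0, 0.0
    for _ in range(n):                     # explicit Euler, illustration only
        u, s = u + dt * (-alpha * u), s + dt * (-alpha * s - u)
    return u, s

alpha, u0 = 0.7, 2.0
u_T, s_T = solve_forward_and_lfss(alpha, u0)
# Exact solutions: u(T) = u0*exp(-alpha*T) and s(T) = -T*u0*exp(-alpha*T).
```

Note that this forward route must be repeated for every parameter of interest, which motivates the adjoint construction developed below.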
The existence of the G-variations of the operators underlying the 1st-LFSS and of the total sensitivity does not guarantee their numerical computability. Numerical methods most often require that the total sensitivity and the operators underlying the 1st-LFSS be linear in the variations in a neighborhood around the nominal point. The necessary and sufficient conditions for the G-differential of a nonlinear operator to be linear in the variations in such a neighborhood, and thus admit partial and total G-derivatives, are as follows [6]:
- (i) the operator satisfies a weak Lipschitz condition at the nominal point;
- (ii) for two arbitrary vectors of variations, the operator satisfies the following relation:
It will henceforth be assumed that the operators appearing in Equations (1)–(3) satisfy the conditions indicated in Equations (15) and (16). Hence, Equations (12)–(14) can be written in the following form:
where
The partial G-derivatives which appear in Equations (17)–(21) are matrices of corresponding dimensions. When the G-variation is linear in the vector of variations, it is called the G-differential of the respective operator. Furthermore, the result of the differentiations indicated on the right-side of the definition provided in Equation (8) can be written as follows:
where the so-called “direct-effect” term is defined as follows:
while the so-called “indirect-effect” term is defined as follows:
In Equations (23) and (24), the vectors , and comprise, as components, the first-order partial G-derivatives computed at the phase–space point . The G-differential is an operator defined on the same domain as and has the same range as . The G-differential satisfies the relation with .
The “direct effect” term depends only on the parameter variations, so it can be computed immediately, since it does not depend on the variations in the state functions. On the other hand, the “indirect effect” term depends indirectly on the parameter variations through the yet unknown variations in the state functions, which could be determined only by solving the 1st-LFSS repeatedly, for every possible parameter variation. The need for these prohibitively expensive computations can be circumvented by extending the concepts underlying the “Adjoint Sensitivity Analysis Methodology” (ASAM) conceived by Cacuci [1] to construct a “First-Level Adjoint Sensitivity System” (1st-LASS), the solution of which will be independent of the parameter and state-function variations. Subsequently, the solution of the 1st-LASS will be used to compute the indirect-effect term by constructing an equivalent expression (for this indirect-effect term) which does not involve the unknown variations in the state functions.
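The forward-versus-adjoint trade-off described above can be sketched in a finite-dimensional analogue. In the toy problem below, the matrix A, the source b, and their assumed parameter derivatives are illustrative placeholders, not the operators of Equations (1)–(3). The “forward” route requires one linear solve per parameter variation, whereas the “adjoint” route requires a single solve with the transposed (adjoint) matrix, independent of the parameter variations:

```python
import numpy as np

# Forward model: A u = b; scalar response R = c^T u.
rng = np.random.default_rng(1)
n, n_par = 5, 3
A0 = rng.standard_normal((n, n)) + n * np.eye(n)    # well-conditioned
b0 = rng.standard_normal(n)
c  = rng.standard_normal(n)
# Hypothetical parameter dependence: dA/da_i and db/da_i for i = 1..n_par.
dA = [rng.standard_normal((n, n)) for _ in range(n_par)]
db = [rng.standard_normal(n) for _ in range(n_par)]

u = np.linalg.solve(A0, b0)                          # nominal solution

# "1st-LFSS" route: one linear solve PER parameter variation:
#   A0 du_i = db_i - dA_i u   =>   dR_i = c^T du_i
sens_forward = [c @ np.linalg.solve(A0, db[i] - dA[i] @ u)
                for i in range(n_par)]

# "1st-LASS" route: ONE adjoint solve, independent of the variations:
#   A0^T psi = c              =>   dR_i = psi^T (db_i - dA_i u)
psi = np.linalg.solve(A0.T, c)
sens_adjoint = [psi @ (db[i] - dA[i] @ u) for i in range(n_par)]
```

Both routes yield identical sensitivities; the adjoint route replaces the n_par forward solves by a single transposed solve, which is the computational advantage exploited throughout this work.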
2.1. Spectral Representation of the System Response’s Indirect-Effect Term
Since the indirect-effect term is defined on the same domain as the response, and has the same range as the response, it follows that it can be represented in the following form:
where
and
The following designations have been used in Equations (26) and (27): (i) the spectral basis functions (e.g., orthogonal polynomials, Fourier exponential/trigonometric functions) corresponding to the phase–space domain of the first subsystem; (ii) the spectral basis functions corresponding to the phase–space domain of the second subsystem; and (iii) the corresponding generalized spectral (Fourier) coefficients.
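As a concrete illustration of such generalized spectral (Fourier) coefficients, the following sketch assumes a one-dimensional domain [−1, 1] and the Legendre basis (both assumptions made only for this example); the coefficients are computed by Gaussian quadrature:

```python
import numpy as np
from numpy.polynomial import legendre

# Generalized Fourier coefficients of f on [-1, 1] w.r.t. Legendre P_n:
#   c_n = (2n+1)/2 * integral_{-1}^{1} f(x) P_n(x) dx,
# so that f(x) ~ sum_n c_n P_n(x).
def legendre_coefficients(f, n_max, n_quad=64):
    x, w = legendre.leggauss(n_quad)       # Gauss-Legendre nodes/weights
    fx = f(x)
    coeffs = []
    for n in range(n_max + 1):
        Pn = legendre.Legendre.basis(n)(x)
        coeffs.append((2 * n + 1) / 2.0 * np.sum(w * fx * Pn))
    return np.array(coeffs)

f = lambda x: x ** 3                       # simple test function
c = legendre_coefficients(f, 5)
# x^3 = (3/5) P_1(x) + (2/5) P_3(x), so only c_1 and c_3 are nonzero.
```

In the methodology above, one adjoint system must be solved per retained coefficient, so truncating such an expansion as early as accuracy permits directly reduces the number of adjoint computations.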
The appearance of the “difficult to compute” variations and in the functionals defined in Equations (26) and (27), respectively, can be eliminated by expressing the right-sides of Equations (26) and (27) in terms of adjoint functions that will be obtained by implementing the following sequence of steps:
Introduce a Hilbert space pertaining to the domain , denoted as , comprising square-integrable vector-valued elements of the form and , where , , , .
Define the inner product between two elements of this Hilbert space as follows:
where
and
Recast Equations (17) and (18) in the following matrix form:
Use the definition provided in Equation (28) to form the inner product of Equation (31) with a square-integrable vector to obtain the following relation:
Using the definition of the adjoint operator in the Hilbert space, recast the left-side of Equation (32) as follows:
where the operators appearing on the left-side of Equation (33) denote the formal adjoints of the corresponding forward operators, and where the remaining term denotes the bilinear concomitant evaluated on the boundary. The superscript “1” which appears in the notation of the bilinear concomitant indicates that this quantity arises in conjunction with the construction of the “First-Level Adjoint Sensitivity System (1st-LASS)”.
Replace the left-side of Equation (33) with the right-side of Equation (32) to obtain the following relation:
Require the left-side of Equation (34) to represent the indirect-effect term defined in Equation (25), which can be fulfilled by requiring the yet undetermined (adjoint) functions to satisfy the following equations:
Since the source terms on the right-sides of Equations (35) and (36) depend on the indices of the spectral basis functions, it follows that the adjoint functions and also depend on the respective indices, which will henceforth be explicitly displayed by writing and , respectively.
The boundary, interface, and initial/final conditions for the functions and are now determined by imposing the following requirements:
- (a) Implement the boundary, interface and initial/final conditions given in Equation (19) into the bilinear concomitant in Equation (34).
- (b) Eliminate the remaining unknown boundary, interface and initial/final conditions involving the variations in the state functions from the expression of the bilinear concomitant in Equation (34) by selecting boundary, interface and initial/final conditions for the adjoint functions such that: (i) the selected conditions are independent of the unknown values of these variations; and (ii) Equations (35) and (36) remain well posed. The boundary conditions thus chosen for the adjoint functions can be represented in operator form as follows:
where the subscript “A” indicates “adjoint” and the superscript “1” indicates that these boundary conditions arise in conjunction with the construction of the 1st-LASS. The selection of the boundary conditions for the adjoint functions represented by Equation (37) eliminates the appearance of any unknown values of the variations in the state functions in the bilinear concomitant in Equation (34), reducing it to a residual quantity that contains boundary terms involving only known values of the parameters, the nominal state functions and the adjoint functions. This residual quantity will be called the residual bilinear concomitant.
In general, this residual bilinear concomitant does not automatically vanish, although it may do so in particular instances. In principle, this residual bilinear concomitant could be forced to vanish, if necessary, by considering extensions, in the operator sense, of the linear operators and/or , but such extensions seldom need to be used in practice.
Using Equations (34)–(36) in conjunction with Equations (26) and (27) in Equation (25) yields the following expression for the indirect-effect term:
As the expression in Equation (38) indicates, the desired elimination from of the unknown variations and has been accomplished by having replaced them by the adjoint functions and , which do not depend on any parameter variations; this fact has been underscored by explicitly indicating that the indirect-effect term can now be written in the form .
When first introduced in Equation (32), it was not known that the adjoint functions would ultimately depend on the indices of the spectral basis functions; this fact became apparent only after having constructed the right-sides (i.e., sources) of Equations (35) and (36). To emphasize this fact, these equations are re-written below:
The system of Equations (39) and (40), together with the adjoint boundary/initial conditions represented by Equation (37) will be called the “First-Level Adjoint Sensitivity System (1st-LASS).” The 1st-LASS is independent of the parameter variations but depends on the indices and . In principle, therefore, the 1st-LASS needs to be solved as many times as there are nonzero spectral basis functions, which act as sources on the right side of the equations underlying the 1st-LASS. It is therefore very important to represent the indirect-effect term defined in Equation (25) using as few basis-functions as possible, within a criterion of accuracy that is set by the user, a priori. Once the adjoint functions and are available, they can be used in Equation (38) to compute the indirect-effect term exactly and efficiently, using quadrature formulas, which are many orders of magnitude faster to compute than solving the operator (differential, integral) equations that underlie the 1st-LFSS.
In practice, orthogonal polynomials will often be selected to serve as basis-functions for the spectral Fourier representations of the responses of interest. As is well-known, orthogonal polynomials possess many recurrence relations which can be advantageously used to reduce massively the number of computations that would actually require solving the 1st-LASS.
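As an example of such a recurrence (a generic numerical sketch, not tied to any specific response considered above), the three-term Legendre recurrence evaluates the entire basis from just the two lowest-order members, so higher-order basis functions come essentially for free once the lower-order ones are available:

```python
import numpy as np

def legendre_basis(x, n_max):
    """Evaluate P_0 .. P_{n_max} at the points x via the recurrence
    (n+1) P_{n+1}(x) = (2n+1) x P_n(x) - n P_{n-1}(x)."""
    x = np.asarray(x, dtype=float)
    P = np.empty((n_max + 1,) + x.shape)
    P[0] = 1.0
    if n_max >= 1:
        P[1] = x
    for n in range(1, n_max):
        P[n + 1] = ((2 * n + 1) * x * P[n] - n * P[n - 1]) / (n + 1)
    return P

x = np.linspace(-1.0, 1.0, 5)
P = legendre_basis(x, 4)
# Sanity properties: P_2(x) = (3x^2 - 1)/2, P_3(x) = (5x^3 - 3x)/2,
# and P_n(1) = 1 for every n.
```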
In the particular case when the response is a scalar-valued functional of the system’s dependent variables, the expansion in Equation (25) reduces to a single term, so that the summations in the expression of the indirect-effect term in Equation (38) also reduce to a single term.
2.2. Pseudo-Spectral Representation of the System Response’s Indirect-Effect Term
Alternatively, Lagrange interpolation (see, e.g., [7]) can be used to express the indirect-effect term defined in Equation (24) approximately as follows:
where the quantities represent the “cardinal functions”, where the quantities denote the collocation (or interpolation) points in the two phase–space domains, and where
The cardinal functions are also called [3] the “fundamental polynomials for pointwise interpolation”, the “elements of the cardinal basis”, the “Lagrange basis”, or the “shape functions”. Depending on the domains of definition and on the choices of weight functions, particularly important cardinal functions are those associated with the Chebyshev, Legendre, Gegenbauer, Hermite and Laguerre polynomials, and with Whittaker’s “sinc” function. In several dimensions, it is most efficient to use a tensor-product basis, i.e., basis functions that are products of one-dimensional basis functions. Particularly efficient computational procedures can be constructed when both the basis functions and the grid are tensor products of one-dimensional functions and grids, respectively. Using trigonometric functions, Chebyshev polynomials, or rational Chebyshev functions as basis functions enables the use of the Fast Fourier Transform, which further enhances computational efficiency.
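A minimal one-dimensional sketch of cardinal functions (the nodes and the test function below are illustrative assumptions): at Chebyshev–Gauss–Lobatto points, each Lagrange cardinal function equals one at its own node and zero at all others, so the interpolant is simply a sum of nodal values weighted by the cardinal functions:

```python
import numpy as np

def chebyshev_points(n):
    """Chebyshev-Gauss-Lobatto collocation points on [-1, 1]."""
    return np.cos(np.pi * np.arange(n + 1) / n)

def cardinal(j, x_nodes, x):
    """Lagrange cardinal function l_j, satisfying l_j(x_k) = delta_{jk}."""
    x = np.asarray(x, dtype=float)
    num = np.ones_like(x)
    for k, xk in enumerate(x_nodes):
        if k != j:
            num *= (x - xk) / (x_nodes[j] - xk)
    return num

nodes = chebyshev_points(6)
# Interpolate f through its nodal values: p(x) = sum_j f(x_j) l_j(x).
f = lambda x: np.sin(np.pi * x)
xs = np.linspace(-1.0, 1.0, 201)
p = sum(f(xj) * cardinal(j, nodes, xs) for j, xj in enumerate(nodes))
```

The cardinal property l_j(x_k) = delta_{jk} is what allows the indirect-effect term to be assembled from adjoint functions developed at each collocation point separately.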
Following established practice [3], “collocation points” and “interpolation points” will be used as synonyms in this work, as will the terms “collocation” and “pseudospectral”, when referring to the fact that interpolatory methods will be used to determine the yet unknown indirect-effect term by expressing it in terms of adjoint functions specifically developed for each of the collocation/interpolation points. The reason that “collocation” methods are alternatively labeled “pseudospectral” is that the optimal choice of interpolation points makes collocation methods identical with the Galerkin method if the inner products are evaluated by Gaussian integration. It is important to note that neither the cardinal functions nor the collocation points are subject to model parameter uncertainties.
The functionals defined in Equation (42) can be evaluated by using adjoint functions that are the solutions of a 1st-LASS constructed by following the same conceptual steps as those leading to Equations (39) and (40), and the adjoint boundary conditions defined by Equation (37). Omitting these intermediate steps, the final result is as follows:
where the adjoint functions are the solutions of the following 1st-LASS:
It is evident from Equations (44)–(46) that the 1st-LASS must be solved anew for each of the collocation/interpolation points considered in the expansion of the indirect-effect term shown in Equation (41). The choice between using the spectral expansion shown in Equation (25) or using the collocation/interpolation pseudo-spectral expansion shown in Equation (41) depends on the specific problem under consideration, but for comparable accuracy in the computation of the response sensitivities, using the collocation/interpolation pseudo-spectral expansion shown in Equation (41) is often more efficient computationally than using the full spectral expansion.
The practical implementation of the mathematical methodology underlying the 1st-CASAM is illustrated in Figure 1 and Figure 2. The derivation of the 1st-LFSS is illustrated in Figure 1. The path on the left side of Figure 1 depicts the derivation of the (non-discretized) 1st-LFSS starting from the differential equations underlying the original nonlinear system. On the other hand, the path on the right side of Figure 1 depicts the derivation of the discretized 1st-LFSS starting from the discretized form of the original nonlinear equations. If this path is followed, it must be ensured that the discretized 1st-LFSS is consistent with the differential form of the 1st-LFSS in the limit of vanishing size of the discretization intervals considered for the independent variables.
The derivation of the 1st-LASS is illustrated in Figure 2. The path on the left side of Figure 2 depicts the derivation of the (non-discretized) 1st-LASS starting from the differential form of the 1st-LFSS. On the other hand, the path on the right side of Figure 2 depicts the derivation of the discretized 1st-LASS starting from the discretized 1st-LFSS. If this path is chosen, the consistency of the discretized 1st-LASS with the differential form of the 1st-LASS must again be ensured.
3. Concluding Remarks
This work has presented the First-Order Comprehensive Adjoint Sensitivity Analysis Methodology (1st-CASAM) for computing efficiently the exact first-order sensitivities (i.e., functional derivatives) of operator-valued responses (i.e., model results) of general models of coupled nonlinear physical systems characterized by imprecisely known parameters, internal interfaces between the coupled systems and external boundaries. When the model response is a (scalar-valued) functional of the system’s dependent variables (i.e., state functions), the total sensitivity of a scalar-valued functional response to all of the model’s state functions is (also) a functional of the variations in the model’s state variables. By being a functional of the variations in the model’s state variables, the total response sensitivity naturally defines an inner product in terms of which it can be expressed uniquely by virtue of the well-known Riesz Representation Theorem (which ensures that every functional defined in a Hilbert space can be expressed uniquely as an inner product). The existence of such a natural inner-product induced by a functional response enables the construction of an appropriate adjoint sensitivity system, the solution of which (i.e., the respective adjoint sensitivity functions) can always be used to compute, exactly and most efficiently, the sensitivities of a functional response to the model’s scalar parameters. When the response is a functional of the state variables, a single adjoint computation (i.e., solution of the adjoint sensitivity system) suffices for subsequently computing exactly all of the model’s response sensitivities to all of the model’s scalar parameters. The adjoint sensitivity system has the same dimensions as the original system, but it is always linear in the adjoint state functions. This is in contradistinction to the original system, which is usually nonlinear in its state functions. 
Solving the original forward system and the adjoint sensitivity system involves large-scale computations, since these systems invariably involve the inversion of large matrices stemming from differential, difference, integral, and/or algebraic equations. Since the adjoint sensitivity analysis methodology requires solving the adjoint sensitivity system just once, this methodology is computationally the most advantageous to use in practice for large-scale systems involving many parameters.
On the other hand, the total sensitivity (to model parameters and state functions) of a model response which is a function-valued (as opposed to a scalar-valued) operator of the model’s state functions does not provide a natural inner product for the model/system under consideration. Without an inner product, it is not possible to construct an adjoint sensitivity system, the solution of which would subsequently be used for computing the response sensitivities to the model’s parameters. Therefore, an inner product must first be constructed to enable expressing the operator-valued total response sensitivity to the variations in the state functions in terms of functionals of the system’s dependent variables (state functions). The requisite inner product can be constructed by representing the total sensitivity of the operator-valued response to the system’s state functions in terms of scalar-valued responses (functionals) by using: (i) spectral expansions; (ii) collocation/pseudo-spectral expansions; or (iii) combined spectral/collocation expansions. The coefficients in any of these expansions are functionals that can be represented in terms of an inner product. In turn, this inner product enables the construction of an adjoint sensitivity system, the solution of which can subsequently be used to compute exactly and efficiently the sensitivities of these coefficients to the model’s parameters. A different source for the adjoint sensitivity system is developed for each spectral coefficient or for each collocation point. Altogether, therefore, as many adjoint computations would be needed as there are spectral coefficients and/or collocation points in the phase–space of independent variables.
Thus, for operator-valued responses, the fundamental issue is to establish the number of collocation points in the phase–space of independent variables and/or the number of Fourier coefficients which would be needed for representing the response within an a priori established accuracy in the phase–space of independent variables. Subsequently, for each Fourier coefficient and/or at each collocation point, the 1st-CASAM provides the exact sensitivities in the parameter space, in the computationally most efficient manner. By enabling the exact computations of operator-valued response sensitivities to internal interfaces and external boundary parameters and conditions, the 1st-CASAM presented in this work makes it possible, inter alia, to quantify the effects of manufacturing tolerances on the responses of physical and engineering systems.
An accompanying work [7] will present the application of the 1st-CASAM developed in this work to a benchmark problem [8] that models coupled heat conduction and convection in a physical system comprising an electrically heated rod surrounded by a coolant, which simulates the geometry of a nuclear reactor. In particular, this benchmark [8] was used to verify [8,9] the numerical results produced by the FLUENT Adjoint Solver [10].