Introducing the Second-Order Features Adjoint Sensitivity Analysis Methodology for Neural Integral Equations of the Volterra Type: Mathematical Methodology and Illustrative Application to Nuclear Engineering

Cacuci, Dan Gabriel

doi:10.3390/jne6020008

Open AccessFeature PaperArticle

Introducing the Second-Order Features Adjoint Sensitivity Analysis Methodology for Neural Integral Equations of the Volterra Type: Mathematical Methodology and Illustrative Application to Nuclear Engineering

by

Dan Gabriel Cacuci

Department of Mechanical Engineering, University of South Carolina, Columbia, SC 29201, USA

J. Nucl. Eng. 2025, 6(2), 8; https://doi.org/10.3390/jne6020008

Submission received: 24 January 2025 / Revised: 12 March 2025 / Accepted: 27 March 2025 / Published: 29 March 2025

Download Versions Notes

Abstract

This work presents the general mathematical frameworks of the “First and Second-Order Features Adjoint Sensitivity Analysis Methodology for Neural Integral Equations of Volterra Type” designated as the 1st-FASAM-NIE-V and the 2nd-FASAM-NIE-V methodologies, respectively. Using a single large-scale (adjoint) computation, the 1st-FASAM-NIE-V enables the most efficient computation of the exact expressions of all first-order sensitivities of the decoder response to the feature functions and also with respect to the optimal values of the NIE-net’s parameters/weights after the respective NIE-Volterra-net was optimized to represent the underlying physical system. The computation of all second-order sensitivities with respect to the feature functions using the 2nd-FASAM-NIE-V requires as many large-scale computations as there are first-order sensitivities of the decoder response with respect to the feature functions. Subsequently, the second-order sensitivities of the decoder response with respect to the primary model parameters are obtained trivially by applying the “chain-rule of differentiation” to the second-order sensitivities with respect to the feature functions. The application of the 1st-FASAM-NIE-V and the 2nd-FASAM-NIE-V methodologies is illustrated by using a well-known model for neutron slowing down in a homogeneous hydrogenous medium, which yields tractable closed-form exact explicit expressions for all quantities of interest, including the various adjoint sensitivity functions and first- and second-order sensitivities of the decoder response with respect to all feature functions and also primary model parameters.

Keywords:

first-order features adjoint sensitivity analysis; second-order features adjoint sensitivity analysis; Volterra Neural Integral Equation; neutron slowing down

1. Introduction

It is well-known that Neural Ordinary Differential Equations (NODEs) have enabled [1,2,3] the use of deep learning for modeling discretely sampled dynamical systems. NODEs provide a flexible trade-off between efficiency, memory costs, and accuracy while bridging traditional numerical modeling with modern deep learning, as demonstrated by various applications, including time-series, dynamics, and control [1,2,3,4,5,6,7,8,9]. However, because each time-step is determined locally in time, NODEs are limited to describing systems that are instantaneous. On the other hand, integral equations (IEs) model global “long-distance” spatio-temporal relations, and IE solvers often possess stability properties that are superior to solvers for ordinary and/or partial differential equations. Therefore, differential equations are occasionally recast in integral equation forms that can be solved more efficiently using IE solvers, as exemplified by the applications described in [10,11,12].

Due to their non-local behavior, IE solvers are suitable for modeling complex dynamics and learning the operator underlying the system under consideration by using data sampled from the respective system. As discussed in [13], the operator learning problem is formulated on finite grids using finite-difference methods that approximate the domain of the functions under investigation; the learning is performed by using an IE solver, which samples the domain of integration continuously. As shown in [14], Neural Integral Equations (NIEs) and the Attentional Neural Integral Equations (ANIEs) can be used to generate dynamics and infer the spatio-temporal relations that initially generated the data, thus enabling the continuous learning of non-local dynamics with arbitrary time resolution. The ANIE interprets the self-attention mechanism as the Nystrom method for approximating integrals [15], which enables efficient integration over higher dimensions, as discussed in [10,11,12,13,14,15] and references therein.

Neural nets are trained by minimizing a “loss functional” chosen by the user to represent the discrepancy between the output produced by the neural net’s decoder and some user-chosen “reference solution”. However, the physical system modeled by a neural net inevitably comprises imperfectly known parameters that stem from measurements and/or computations and are therefore afflicted by uncertainties that stem from the respective experiments and/or computations. Hence, even if the neural net perfectly reproduces a given state of a physical system, the neural net’s “optimized weights” are subject to the uncertainties inherent in the parameters that characterize the underlying physical system, and these uncertainties inevitably propagate to the decoder’s output response. It is hence important to quantify the impact of parameter/weight uncertainties on the uncertainties induced in the decoder’s output response. This impact is quantified by the sensitivities of the decoder’s response with respect to the optimized weights/parameters comprised within the neural net.

Neural nets comprise not only scalar-valued weights/parameters but also functions (e.g., correlations) of such scalar model parameters, which can be conveniently called “features of primary model parameters”. Cacuci [16] has developed the “nth-Order Features Adjoint Sensitivity Analysis Methodology for Nonlinear Systems (nth-FASAM-N)”, which enables the most efficient computation of the exact expressions of arbitrarily high-order sensitivities of model responses with respect to the model’s “features”. In turn, the sensitivities of the responses with respect to the primary model parameters are determined, analytically and trivially, by applying the “chain rule” to the expressions obtained for the response sensitivities with respect to the model’s “features”. The nth-FASAM-N [16] has been applied to develop general first- and second-order sensitivity analysis methodologies for NODEs [17] and for Neural Integral Equations of the Fredholm type [18], which enable the computation, with unsurpassed efficacy, of the exact expressions of first- and second-order sensitivities of decoder responses with respect to the underlying neural net’s optimized weights.

This work continues the application of the nth-FASAM-N [16] methodology to develop the “First- and Second-Order Methodologies for Neural Integral Equations of Volterra Type” (acronyms “1st-FASAM-NIE-V” and, respectively, “2nd-FASAM-NIE-V”). The 1st-FASAM-NIE-V methodology, which is presented in Section 2, enables the most efficient computation of exact expressions of all of the first-order sensitivities of NIE decoder responses with respect to all of the optimal values of the NIE-net’s parameters/weights after the respective NIE-Volterra-net was optimized to represent the underlying physical system. The efficiency of the 1st-FASAM-NIE-V is illustrated in Section 3 by applying it to perform a comprehensive first-order sensitivity analysis of the well-known model [19,20,21] of neutron slowing down in a homogeneous medium containing fissionable material.

The general mathematical framework of the 2nd-FASAM-NIE-Volterra methodology, which is presented in Section 4, enables the most efficient computation of the exact expressions of the second-order sensitivities of NIE decoder responses with respect to all of the optimal values of the NIE-net’s parameters/weights. The efficiency of the 2nd-FASAM-NIE-V is illustrated in Section 5 by applying it to perform a comprehensive second-order sensitivity analysis of the neutron slowing down model [19,20,21] considered in Section 3. Section 6 concludes this work by presenting a discussion that highlights the unparalleled efficiency of the 2nd-FASAM-NIE-V methodology for performing a sensitivity analysis of Volterra-type Neural Integral Equations.

2. First-Order Features Adjoint Sensitivity Analysis Methodology for Neural Integral Equations of the Volterra Type (1st-FASAM-NIE-V)

Following [14], a network of nonlinear “Neural Integral Equations of Volterra-type (NIE-Volterra)” can be represented by the system of coupled equations shown below:

h_{i} (t) = g_{i} [F (θ); t] + φ_{i} [F (θ); t] \int_{t_{0}}^{t} ψ_{i} [h (τ); F (θ); τ] d τ; t_{0} \leq t \leq t_{f}; 1 = 1, \dots T H .

(1)

The quantities appearing in Equation (1) are defined as follows:

(i): The real-valued scalar quantities $t$ , $t_{0} \leq t \leq t_{f}$ and $τ$ , $t_{0} \leq τ \leq t_{f}$ are time-like independent variables that parameterize the dynamics of the hidden/latent neuron units. Customarily, the variable $t$ is called the “global time”, while the variable $τ$ is called the “local time”. The initial time-value is denoted as $t_{0}$ , while the stopping time-value is denoted as $t_{f}$ .
(ii): The components of the vector $θ ≜ {[θ_{1}, \dots, θ_{T W}]}^{†}$ represent scalar learnable adjustable weights, where $T W$ denotes the total number of adjustable weights in all of the latent neural nets. The components of the column vector $θ ≜ {[θ_{1}, \dots, θ_{T W}]}^{†}$ are considered to be “primary parameters”, while the components of the vector-valued function $F (θ) ≜ {[F_{1} (θ), \dots, F_{T F} (θ)]}^{†}$ represent the ”feature” functions of the respective weights. The quantity $T F$ denotes the “total number of feature/functions of the primary model parameters” comprised in the NIE-Volterra. In general, $F (θ)$ is a nonlinear function of $θ$ . The total number of feature functions must necessarily be smaller than the total number of primary parameters (weights), i.e., $T F < T W$ . In the extreme case, when there are no feature functions, it follows that $F_{i} (θ) \equiv θ_{i}$ , for all $i = 1, \dots, T W \equiv T F$ . In this work, all vectors are considered to be column vectors, and the dagger “ $†$ ” symbol will be used to denote “transposition”. The symbol “ $≜$ ” will be used to denote “is defined as”, or, equivalently, “is by definition equal to”.
(iii): The $T H$ -dimensional vector-valued function $h (t) ≜ {[h_{1} (t), \dots, h_{T H} (t)]}^{†}$ represents the hidden/latent neural networks. The quantity $T H$ denotes the total number of components of $h (t)$ . At the initial time-value $t_{0}$ , the functions $h_{i} (t_{0})$ take on the known values $h_{i} (t_{0}) = g_{i} [F (θ); t_{0}]$ .
(iv): The functions $g_{i} [F (θ); t] = h_{i} (t_{0})$ , $i = 1, \dots, T H$ model the initial state (“encoder”) of the network. The functions $φ_{i} [F (θ); t]$ and $ψ_{i} [h (τ); F (θ); τ]$ , $i = 1, \dots, T H$ depend nonlinearly on $h (t)$ and $F (θ)$ , respectively, and model the dynamics of the latent neurons.

The “training” of NIE-Volterra net is accomplished by using the “adjoint” or other methods to minimize the user-chosen “loss functional” intended to represent the discrepancy between the output produced by the NIE decoder and a “reference solution” chosen by the user. After the training is completed, the primary parameters (“weights”)

θ ≜ {[θ_{1}, \dots, θ_{T W}]}^{†}

will have been assigned “optimal” values, which are obtained as a result of having minimized the chosen loss functional. These optimal values for the primary parameters (“weights”) will be denoted using a superscript “zero”, as follows:

θ^{0} ≜ {[θ_{1}^{0}, \dots, θ_{T W}^{0}]}^{†}

. Using these optimal/nominal parameter values to solve the NIE system will yield the optimal/nominal solution

h^{0} (t) ≜ {[h_{1}^{0} (t), \dots, h_{T H}^{0} (t)]}^{†}

, which will satisfy the following form of Equation (1):

h_{i}^{0} (t) = g_{i}^{0} [F (θ^{0}); t] + φ_{i}^{0} [F (θ^{0}); t] \int_{t_{0}}^{t} ψ_{i}^{0} [h^{0} (τ); F (θ^{0}); τ] d τ; t_{0} \leq t \leq t_{f}; 1 = 1, \dots T H .

(2)

After the NIE-net is optimized to reproduce the underlying physical system as closely as possible, the subsequent responses of interest are no longer “loss functions” but become specific functionals of NIE’s “decoder” response/output. Such a decoder response, which will be denoted as

R [h; F (θ)]

, can generally be represented by a scalar-valued functional of

h (t)

and

F (θ)

, defined as follows:

R [h; F (θ)] = \int_{t_{0}}^{t_{f}} D [h (t); F (θ); t] d t

(3)

The function

D [h (t); F (θ); t]

models the decoder and may contain distributions (e.g., Dirac-delta and/or Heaviside functionals, etc.) if the decoder response is to be evaluated at some particular point in time or over a subinterval within the interval

[t_{0}, t_{f}]

.

The optimal value of the decoder response, denoted as

R [h^{0}; F (θ^{0})]

, is represented by evaluating Equation (3) at the optimal/nominal parameter values

θ^{0} ≜ {[θ_{1}^{0}, \dots, θ_{T W}^{0}]}^{†}

and the optimal/nominal solution

h^{0} (t)

, as follows:

R [h^{0}; F (θ^{0})] = \int_{t_{0}}^{t_{f}} D [h^{0} (t); F (θ^{0}); t] d t

(4)

The true values

θ

of the primary parameters (“weights”) that characterize the physical system modeled by the NIE-V net are afflicted by uncertainties inherent to the experimental and/or computational methodologies employed to model the original physical system. Therefore, the true values

θ

of the primary parameters (“weights”) will differ from the known nominal values

θ^{0}

(which are obtained after training the NIE-net to represent the model of the physical system) by variations denoted as

δ θ ≜ θ - θ^{0}

. The variations

δ θ ≜ θ - θ^{0}

will induce corresponding variations

δ F ≜ F (θ) - F^{0}

and

F^{0} ≜ F (θ^{0})

in the feature functions, which in turn will induce variations

v^{(1)} (t) ≜ {[v_{1}^{(1)} (t), \dots, v_{T H}^{(1)} (t)]}^{†}

,

v_{i}^{(1)} (t) ≜ h_{i} (t) - h_{1}^{0} (t)

, and

i = 1, \dots, T H

around the nominal/optimal functions

h^{0} (t)

. Subsequently, the variations

δ F

and

v^{(1)} (t; x)

will induce variations in the NIE decoder’s response.

The 1st-FASAM-IDE-V methodology for computing the first-order sensitivities of the decoder’s response with respect to the NIE’s weights will be established by applying the same principles as those underlying the 1st-FASAM-N [16] methodology. These first-order sensitivities are embodied in the first-order G-variation

δ R (h^{0}; F^{0}; v^{(1)}; δ F)

of the response

R [h; F (θ)]

for variations

v^{(1)} (t)

and

δ F

around the nominal values

h^{0} (t)

and

F^{0}

, which is, by definition, obtained as follows:

\begin{array}{l} δ R (h^{0}; F^{0}; v^{(1)}; δ F) = {\{\frac{d}{d ε} \int_{t_{0}}^{t_{f}} D [h^{0} (t) + ε v^{(1)} (t); F^{0} + ε δ F; t] d t\}}_{ε = 0} \\ = {\{δ R (h^{0}; F^{0}; δ F)\}}_{d i r} + {\{δ R (h^{0}; F^{0}; v^{(1)})\}}_{i n d} . \end{array}

(5)

In Equation (5), the “direct-effect term”

{\{δ R (h^{0}; F^{0}; δ F)\}}_{d i r}

arises directly from variations

δ F

(which, in turn, stem from parameter variations

δ θ

) and is defined as follows:

\begin{array}{l} {\{δ R (h^{0}; F^{0}; δ F)\}}_{d i r} ≜ \int_{t_{0}}^{t_{f}} {\{\frac{\partial D [h (t); F (θ); t]}{\partial F} δ F\}}_{(h^{0}; F^{0})} d t \\ \equiv \sum_{j = 1}^{T F} \int_{t_{0}}^{t_{f}} {\{\frac{\partial D [h (t); F (θ); t]}{\partial F_{j}} δ F_{j}\}}_{(h^{0}; F^{0})} d t, \end{array}

(6)

Meanwhile, the “indirect-effect term”

{\{δ R (h^{0}; F^{0}; v^{(1)})\}}_{i n d}

arises through the variations

v^{(1)} (t)

in the hidden state functions

h (t)

and is defined as follows:

\begin{array}{l} {\{δ R (h^{0}; F^{0}; v^{(1)})\}}_{i n d} ≜ \int_{t_{0}}^{t_{f}} {\{\frac{\partial D [h (t); F (θ); t]}{\partial h} v^{(1)} (t)\}}_{(h^{0}; F^{0})} d t \\ \equiv \sum_{j = 1}^{T H} \int_{t_{0}}^{t_{f}} {\{\frac{\partial D [h (t); F (θ); t]}{\partial h_{j}} δ h_{j}\}}_{(h^{0}; F^{0})} d t . \end{array}

(7)

The first-order relationship between the variations

v^{(1)} (t)

and

δ F

is obtained from the first-order G-variation of Equation (1) for

i = 1, \dots T H

, as follows:

\begin{array}{l} {\{\frac{d}{d ε} [h_{i}^{0} (t) + ε v_{i}^{(1)} (t)]\}}_{ε = 0} = {\{\frac{d}{d ε} g_{i} (F^{0} + ε δ F; t)\}}_{ε = 0} + {\{\frac{d}{d ε} φ_{i} (F^{0} + ε δ F; t)\}}_{ε = 0} \\ \times {\{\frac{d}{d ε} \int_{t_{0}}^{t} ψ_{i} [h_{1}^{0} (τ) + ε v_{1}^{(1)} (τ), \dots, h_{T H}^{0} (τ) + ε v_{T H}^{(1)} (τ); F^{0} + ε δ F; τ] d τ\}}_{ε = 0}; t_{0} \leq t \leq t_{f} . \end{array}

(8)

Performing the operations indicated in Equation (8) yields the following NIE-V net, which will be called the “1st-Level Variational Sensitivity System” (1st-LVSS), for the components

v_{i}^{(1)} (t)

,

i = 1, \dots T H

of the “1st-level variational function”

v^{(1)} (t)

:

v_{i}^{(1)} (t) = {\{φ_{i} (F; t) \sum_{j = 1}^{T H} \int_{t_{0}}^{t} \frac{\partial ψ_{i} [h (τ); F; τ]}{\partial h_{j} (τ)} v_{j}^{(1)} (τ) d τ\}}_{(h^{0}, F^{0})} + \sum_{k = 1}^{T F} {\{q_{i k} (F; t) δ F_{k}\}}_{(h^{0}; F^{0})},

(9)

where

q_{i k} (F; t) ≜ \frac{\partial g_{i} (F; t)}{\partial F_{k}} + \frac{\partial φ_{i} (F; t)}{\partial F_{k}} \int_{t_{0}}^{t} ψ_{i} [h (τ); F; τ] d τ + φ_{i} (F; t) \int_{t_{0}}^{t} \frac{\partial ψ_{i} [h (τ); F; τ]}{\partial F_{k}} d τ .

(10)

As indicated in Equation (9), the 1st-LVSS is to be computed at the nominal/optimal values for the respective model parameters. It is important to note that the 1st-LVSS is linear in the variational function

v^{(1)} (t)

, although it generally remains nonlinear in

h (t)

.

The 1st-LVSS would need to be solved anew to obtain the function

v^{(1)} (t)

that would correspond to each variation

δ F_{j}

,

j = 1, \dots, T F

; this procedure would become prohibitively expensive computationally if

T F

is a large number. The need to repeatedly solve the 1st-LVSS can be avoided by recasting the indirect-effect term

{\{δ R (h^{0}; F^{0}; v^{(1)})\}}_{i n d}

in terms of an expression that does not involve the function

v^{(1)} (t)

. This goal can be achieved by expressing

{\{δ R (h^{0}; F^{0}; v^{(1)})\}}_{i n d}

in terms of another function, which will be called the “1st-level adjoint function” and will be the solution of the “1st-Level Adjoint Sensitivity System (1st-LASS)” to be constructed next.

The 1st-LASS will be constructed in a Hilbert space, denoted as

H_{1} (Ω_{t})

, where

Ω_{t} ≜ [t_{0}, t_{f}]

, comprising elements of the same form as

v^{(1)} (t) \in H_{1} (Ω_{t})

. The inner product of two elements

χ^{(1)} (t) ≜ {[χ_{1}^{(1)} (t), \dots, χ_{T H}^{(1)} (t)]}^{†} \in H_{1} (Ω_{t})

and

η^{(1)} (t) ≜ {[η_{1}^{(1)} (t), \dots, η_{T H}^{(1)} (t)]}^{†} \in H_{1} (Ω_{t})

will be denoted as

{〈χ^{(1)} (t), η^{(1)} (t)〉}_{1}

and is defined as follows:

{〈χ^{(1)} (t), η^{(1)} (t)〉}_{1} ≜ \int_{t_{0}}^{t_{f}} {[χ^{(1)} (t)]}^{†} η^{(1)} (t) d t = \sum_{j = 1}^{T H} \int_{t_{0}}^{t_{f}} χ_{i}^{(1)} (t) η_{i}^{(1)} (t) d t

(11)

The inner product

{〈χ^{(1)} (t), η^{(1)} (t)〉}_{1}

is required to hold in a neighborhood of the nominal values

(h^{0}; F^{0})

.

The next step is to form the inner product of Equation (9) with a vector

a^{(1)} (t) ≜ {[a_{1}^{(1)} (t), \dots, a_{T H}^{(1)} (t)]}^{†} \in H_{1} (Ω_{t})

, where the superscript “(1)” indicates “1st-level”, to obtain the following relationship:

\begin{array}{l} \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} a_{i}^{(1)} (t) v_{i}^{(1)} (t) d t - {\{\sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} a_{i}^{(1)} (t) φ_{i} (F; t) d t \sum_{j = 1}^{T H} \int_{t_{0}}^{t} \frac{\partial ψ_{i} [h (τ); F; τ]}{\partial h_{j} (τ)} v_{j}^{(1)} (τ) d τ\}}_{(h^{0}, F^{0})} \\ = \sum_{i = 1}^{T H} \sum_{k = 1}^{T F} [\int_{t_{0}}^{t_{f}} a_{i}^{(1)} (t) q_{i k} (F; t) d t] δ F_{k} . \end{array}

(12)

The second term on the left side of Equation (12) is transformed using “integration by parts”, as follows:

\begin{array}{l} \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} a_{i}^{(1)} (t) φ_{i} (F; t) d t \sum_{j = 1}^{T H} \int_{t_{0}}^{t} \frac{\partial ψ_{i} [h (τ); F; τ]}{\partial h_{j} (τ)} v_{j}^{(1)} (τ) d τ \\ = {\sum_{i = 1}^{T H} \sum_{j = 1}^{T H} \{[\int_{t_{0}}^{t} \frac{\partial ψ_{i} [h (τ); F; τ]}{\partial h_{j} (τ)} v_{j}^{(1)} (τ) d τ] [\int_{t_{0}}^{t} a_{i}^{(1)} (τ) φ_{i} (F; τ) d τ]\}}_{t = t_{f}} \\ - {\sum_{i = 1}^{T H} \sum_{j = 1}^{T H} \{[\int_{t_{0}}^{t} \frac{\partial ψ_{i} [h (τ); F; τ]}{\partial h_{j} (τ)} v_{j}^{(1)} (τ) d τ] [\int_{t_{0}}^{t} a_{i}^{(1)} (τ) φ_{i} (F; τ) d τ]\}}_{t = t_{0}} \\ - \sum_{i = 1}^{T H} \sum_{j = 1}^{T H} \int_{t_{0}}^{t_{f}} [\frac{\partial ψ_{i} [h (t); F; t]}{\partial h_{j} (t)} v_{j}^{(1)} (t) d t] [\int_{t_{0}}^{t} a_{i}^{(1)} (τ) φ_{i} (F; τ) d τ] \\ = \sum_{i = 1}^{T H} \sum_{j = 1}^{T H} \int_{t_{0}}^{t_{f}} v_{j}^{(1)} (t) \frac{\partial ψ_{i} [h (t); F; t]}{\partial h_{j} (t)} d t \int_{t}^{t_{f}} a_{i}^{(1)} (τ) φ_{i} (F; τ) d τ . \end{array}

(13)

Placing the result obtained in Equation (13) into the left side of Equation (12) yields the following relation for the left side of Equation (12):

\begin{array}{l} \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} a_{i}^{(1)} (t) v_{i}^{(1)} (t) d t - {\{\sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} a_{i}^{(1)} (t) φ_{i} (F; t) d t \sum_{j = 1}^{T H} \int_{t_{0}}^{t} \frac{\partial ψ_{i} [h (τ); F; τ]}{\partial h_{j} (τ)} v_{j}^{(1)} (τ) d τ\}}_{(h^{0}, F^{0})} \\ = \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} v_{i}^{(1)} (t) {\{a_{i}^{(1)} (t) - \sum_{k = 1}^{T H} \frac{\partial ψ_{k} [h (t); F; t]}{\partial h_{i} (t)} \int_{t}^{t_{f}} a_{k}^{(1)} (τ) φ_{k} (F; τ) d τ\}}_{(h^{0}, F^{0})} d t . \end{array}

(14)

The term on the right side of Equation (14) is now required to represent the “indirect-effect” term defined in Equation (7), which is achieved by requiring that the components of the function

a^{(1)} (t)

satisfy the following system of equations for

i = 1, \dots, T H

:

a_{i}^{(1)} (t) - \sum_{k = 1}^{T H} \frac{\partial ψ_{k} [h (t); F; t]}{\partial h_{i} (t)} \int_{t}^{t_{f}} a_{k}^{(1)} (τ) φ_{k} (F; τ) d τ = \frac{\partial D [h (t); F (θ); t]}{\partial h_{i}} .

(15)

The Volterra-like neural system obtained in Equation (15) will be called the “1st-Level Adjoint Sensitivity System”, and its solution,

a^{(1)} (t)

, will be called the “1st-level adjoint sensitivity function”. The 1st-LASS is to be solved using the nominal/optimal values for the parameters and for the function

h (t)

, but this fact has not been explicitly indicated in order to simplify the notation. The 1st-LASS is linear in

a^{(1)} (t)

but it is, in general, nonlinear in

h (t; x)

. Notably, the 1st-LASS is independent of any parameter variations and needs to be solved once only to determine the 1st-level adjoint sensitivity function

a^{(1)} (t)

. The 1st-LASS is a “final-value problem” because the computation of the adjoint function

a^{(1)} (t)

will commence at

t = t_{f}

, with the known values

a_{i}^{(1)} (t_{f}) = {\{\partial D [h (t); F (θ); t] / \partial h_{i} (t)\}}_{t = t_{f}} .

It follows from Equations (12)–(15) that the indirect-effect term defined in Equation (7) can be expressed in terms of the 1st-level adjoint sensitivity function

a^{(1)} (t)

, as follows:

{\{δ R (h^{0}; F^{0}; a^{(1)})\}}_{i n d} = \sum_{k = 1}^{T F} {\{\sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} a_{i}^{(1)} (t) q_{i k} (F; t) d t\}}_{(h^{0}; F^{0})} δ F_{k} .

(16)

Using the results obtained in Equations (16) and (6) in Equation (5) yields the following expression for the G-variation

δ R (h^{0}; F^{0}; v^{(1)}; δ F)

, which is seen to be linear in

δ F

:

\begin{array}{l} δ R (h^{0}; F^{0}; v^{(1)}; δ F) = \sum_{j = 1}^{T F} \int_{t_{0}}^{t_{f}} {\{\frac{\partial D [h (t); F (θ); t]}{\partial F_{j}} δ F_{j}\}}_{(h^{0}; F^{0})} d t \\ + {\{\sum_{i = 1}^{T H} \sum_{j = 1}^{T F} [\int_{t_{0}}^{t_{f}} a_{i}^{(1)} (t) q_{i j} (F; t) d t] δ F_{j}\}}_{(h^{0}; F^{0})} ≜ \sum_{j = 1}^{T F} {\{\frac{\partial R}{\partial F_{j}}\}}_{(h^{0}; F^{0})} δ F_{j} . \end{array}

(17)

Identifying in Equation (17) the expressions that multiply the variations

δ F_{j}

yields the following expressions for the sensitivities

\partial R / \partial F_{j}

of the response

R [h; F (θ)]

with respect to the components

F_{j} (θ)

of the feature function

F (θ)

for

j = 1, \dots, T F

:

\begin{array}{l} \frac{\partial R [h; F (θ)]}{\partial F_{j}} = \int_{t_{0}}^{t_{f}} \frac{\partial D [h (t); F (θ); t]}{\partial F_{j}} d t + \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} a_{i}^{(1)} (t) \frac{\partial g_{i} (F; t)}{\partial F_{j}} d t \\ + \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} a_{i}^{(1)} (t) \{\frac{\partial φ_{i} (F; t)}{\partial F_{j}} \int_{t_{0}}^{t} ψ_{i} [h (τ); F; τ] d τ + φ_{i} (F; t) \int_{t_{0}}^{t} \frac{\partial ψ_{i} [h (τ); F; τ]}{\partial F_{j}} d τ\} d t . \end{array}

(18)

The expression on the right-side of Equation (18) is to be evaluated at the nominal/optimal values for the respective model parameters, but this fact has not been indicated explicitly in order to simplify the notation.

The sensitivities with respect to the primary model parameters can be obtained by using the result obtained in Equation (18) together with the “chain rule” of differentiating compound functions, as follows:

\frac{\partial R}{\partial θ_{j}} = \sum_{i = 1}^{T F} \frac{\partial R}{\partial F_{i}} \frac{\partial F_{i}}{\partial θ_{j}}, j = 1, \dots, T W .

(19)

The sensitivities

\partial R / \partial F_{j}

are obtained from Equation (18), while the derivatives

\partial F_{i} / \partial θ_{j}

are obtained analytically, exactly, from the known expressions of the feature functions

F_{i} (θ)

.

Particular Case: The First-Order Comprehensive Adjoint Sensitivity Analysis Methodology for Neural Integral Equations of the Volterra Type (1st-CASAM-NIE-V)

When no feature functions can be constructed from the model parameters/weights, the feature functions become identical to the parameters, i.e.,

F_{i} (θ) \equiv θ_{i}

for all

i = 1, \dots, T F ≜ T W

. In this case, the expression obtained in Equation (18) yields directly the first-order sensitivities

\partial R / \partial θ_{j}

of the decoder response with respect to the model weights/parameters for all

j = 1, \dots, T W

, taking on the following specific form:

\begin{array}{l} \frac{\partial R [h; F (θ)]}{\partial θ_{j}} = \int_{t_{0}}^{t_{f}} \frac{\partial D [h (t); F (θ); t]}{\partial θ_{j}} d t + \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} a_{i}^{(1)} (t) \frac{\partial g_{i} (F; t)}{\partial θ_{j}} d t \\ + \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} a_{i}^{(1)} (t) \{\frac{\partial φ_{i} (F; t)}{\partial θ_{j}} \int_{t_{0}}^{t} ψ_{i} [h (τ); F; τ] d τ + φ_{i} (F; t) \int_{t_{0}}^{t} \frac{\partial ψ_{i} [h (τ); F; τ]}{\partial θ_{j}} d τ\} d t . \end{array}

(20)

Because the 1st-LASS is independent of any parameter variations, the 1st-level adjoint sensitivity function

a^{(1)} (t) ≜ {[a_{1}^{(1)} (t), \dots, a_{T H}^{(1)} (t)]}^{†}

, which appears in Equation (20), remains the solution of the 1st-LASS defined by Equation (15). In this case, however, all of the sensitivities

\partial R / \partial θ_{j}

for all

j = 1, \dots, T W

would be obtained by computing integrals using quadrature formulas. Thus, when there are no feature functions of parameters, the 1st-FASAM-NIE-V reduces to the “First-Order Comprehensive Adjoint Sensitivity Analysis Methodology [16] applied to Neural Integral Equations of Volterra-Type” (1st-CASAM-NIE-V). On the other hand, when features of parameters can be constructed, only

T F

(T F < T W)

numerical computations of integrals using quadrature formulas are required, using Equation (18) to obtain the sensitivities

\partial R / \partial F_{j}

,

j = 1, \dots, T F

. Subsequently, the sensitivities with respect to the model’s weights/parameters are obtained analytically using the chain rule provided in Equation (19).

3. Illustrative Application of the 1st-CASAM-NIE-V and 1st-FASAM-NIE-V Methodologies to Neutron Slowing Down in an Infinite Homogeneous Hydrogenous Medium

The illustrative model considered in this section is a Volterra-type integral equation that describes the energy distribution of neutrons in a homogeneous hydrogenous medium (such as a water-moderated/cooled reactor system) containing ²³⁸U (among other materials), which is a heavy element that strongly absorbs neutrons. The distribution of collided neutrons in such a medium is described [19,20,21] by the following linear integral equation of the Volterra type, customarily called the “neutron slowing down equation” for the neutron collision density, denoted as

C (E)

:

C (E) = \frac{Σ_{s} (E_{s}) S}{Σ_{t} (E_{s}) E_{s}} + \int_{E}^{E_{s}} C (e) \frac{Σ_{s} (e)}{Σ_{t} (e)} \frac{d e}{e}

(21)

The various quantities that appear in Equation (21) are defined as follows:

(i): The quantity $S$ denotes the rate at which the source neutrons, considered to be monoenergetic, are emitted at the “source energy” $E_{s}$ . Neutron upscattering is considered to be negligeable; therefore, $E_{s}$ is the highest energy in the medium.
(ii): The quantity $E$ , $0 < E_{l} \leq E \leq E_{s}$ , denotes the instantaneous energy of the collided neutrons; $E_{l}$ denotes the lowest neutron energy in the model.
(iii): The quantity $Σ_{s} (E)$ denotes the medium’s macroscopic scattering cross-section, which is defined as follows:

$Σ_{s} (E) ≜ \sum_{i = 1}^{M} w_{i} N_{i} σ_{s}^{(i)} (E),$

(22)

where M denotes the number of materials in the medium, $w_{i}$ denotes the relative weighting of the ith-material in the medium, $N_{i}$ denotes the number density of the ith-material, and $σ_{s}^{(i)} (E)$ denotes the energy-dependent scattering microscopic cross-section of the ith-material.
(iv): The quantity $Σ_{t} (E)$ denotes the medium’s macroscopic scattering cross-section, which is defined as follows:

$Σ_{t} (E) ≜ \sum_{i = 1}^{M} w_{i} N_{i} σ_{t}^{(i)} (E),$

(23)

where $σ_{t}^{(i)} (E) \geq σ_{s}^{(i)} (E)$ denotes the energy-dependent total microscopic cross-section of the ith-material. The quantities $w_{i}$ , $N_{i}$ , $σ_{s}^{(i)} (E)$ , $σ_{t}^{(i)} (E)$ are subject to uncertainties because they are determined from experimentally obtained data.

Notably, the Volterra-type Equation (21) is a “final-value problem” because the computation is started at the highest energy value,

C (E_{s}) = Σ_{s} (E_{s}) S / Σ_{t} (E_{s}) E_{s}

, and progresses towards the lowest energy value,

E_{l}

. Customarily, the solution of Equation (21) is written in the following form:

C (E) = \frac{Σ_{s} (E_{s}) S}{Σ_{t} (E_{s}) E} e x p \{- \int_{E}^{E_{s}} \frac{Σ_{a} (e)}{Σ_{t} (e)} \frac{d e}{e}\},

(24)

where

Σ_{a} (E) ≜ Σ_{t} (E) - Σ_{s} (E)

denotes the medium’s macroscopic absorption cross-section. The expression provided in Equation (24) is amenable to computations of the loss of neutrons due to absorbing materials, particularly in the so-called “resonance” energy region.

A typical “decoder response” for the NIE-Volterra network modeled by Equation (21) is the energy-averaged collision density, denoted below as

R [C (E)]

, which would be measured by a detector having an interaction cross-section

Σ_{d}

. Mathematically, this detector response can be expressed as follows:

R [C (E)] ≜ \int_{E_{l}}^{E_{s}} Σ_{d} (E) C (E) d E; Σ_{d} (E) ≜ N_{d} σ_{d} (E),

(25)

where

N_{d}

and

σ_{d} (E)

denote, respectively, the detector material’s atomic number density and the microscopic cross-section describing the interaction (e.g., absorption) of neutrons with the detector’s material;

N_{d}

and

σ_{d} (E)

can be considered the “weights” that characterize the neural net’s “decoder”.

Because the energy dependence of the cross-sections does not play a significant role in the sensitivity analysis of the NIE-Volterra modeled using Equation (21), the respective microscopic cross-sections will henceforth be considered to be energy-independent for the purpose of illustrating the application of the 1st-FASAM-NIE-V in order to simplify the ensuing derivations. For energy-independent cross-sections, Equations (21) and (25) take on the following forms, respectively:

C (E) = F (θ) [\frac{S}{E_{s}} + \int_{E}^{E_{s}} C (e) \frac{d e}{e}],

(26)

R [C (E)] = Σ_{d} (θ) \int_{E_{l}}^{E_{s}} C (E) d E .

(27)

In Equations (26) and (27), the source strength

S

is an imprecisely known “weight” that characterizes the neural net’s “encoder”. Furthermore, the (column) vector of parameters denoted as

θ ≜ {(θ_{1}, \dots, θ_{T W})}^{†}

comprises as components the “imprecisely known primary model parameters” (or “weights”, as they are customarily called when referring to neural nets) and is defined as follows:

θ ≜ {(θ_{1}, \dots, θ_{T W})}^{†} ≜ {(w_{1}, \dots, w_{M}; N_{1}, \dots, N_{M}; σ_{t}^{(1)}, \dots, σ_{t}^{(M)}; σ_{s}^{(1)}, \dots, σ_{s}^{(M)}; N_{d}, σ_{d})}^{†};

(28)

where

T W ≜ 4 \times M + 2

denotes the “total number of imprecisely-known weights/parameters”. These primary model parameters/weights are not known exactly but are affected by uncertainties because they stem from experimental procedures, which determine the nominal/mean/optimal values and the second-order moments of their unknown joint distributions; their third- and higher-order moments are rarely known. It is convenient to denote the nominal values of these primary model parameters/weights by using the superscript “zero”, as follows:

θ^{0} ≜ {(θ_{1}^{0}, \dots, θ_{T W}^{0})}^{†} ≜ {(w_{1}^{0}, \dots, w_{M}^{0}; N_{1}^{0}, \dots, N_{M}^{0}; σ_{t}^{(1, 0)}, \dots, σ_{t}^{(M, 0)}; σ_{s}^{(1, 0)}, \dots, σ_{s}^{(M, 0)}; N_{d}^{0}, σ_{d}^{0})}^{†};

(29)

The “feature function of primary parameters”,

F (θ)

, is defined as follows:

F (θ) ≜ Σ_{s} (θ) / Σ_{t} (θ) \leq 1 .

(30)

The closed-form solution of Equation (26) has the following expression in terms of the feature function

F (θ)

:

C (E) = \frac{S}{E_{s}} F (θ) {(\frac{E_{s}}{E})}^{F (θ)} .

(31)

The closed-form expression of the decoder response can be readily obtained by replacing the result obtained in Equation (31) in Equation (27) and performing the integration over the energy variable to obtain

R [C (E)] = S Σ_{d} (θ) \frac{F (θ)}{1 - F (θ)} [1 - {(\frac{E_{s}}{E_{l}})}^{F (θ) - 1}] .

(32)

The expression obtained in Equation (32) reveals that the imprecisely known quantities that affect the decoder response

R [C (E)]

are as follows:

(i): the source strength $S$ ;
(ii): the detector interaction macroscopic cross-section $Σ_{d} (θ)$ , which can be considered to be a “feature function” of the model parameters $θ$ ;
(iii): the feature function $F (θ) ≜ Σ_{s} (θ) / Σ_{t} (θ)$ .

3.1. Application of 1st-CASAM-NIE-V to Directly Compute the First-Order Sensitivities of the Decoder Response with Respect to the Primary Model Parameters

The first-order sensitivities of the decoder response

R [C (E)]

with respect to the model parameters are obtained by applying the definition of the G-differential to Equation (26) for arbitrary parameter variations

δ θ ≜ {(δ θ_{1}, \dots, δ θ_{T W})}^{†} ≜ θ - θ^{0} ≜ {(θ_{1} - θ_{1}^{0}, \dots, θ_{T W} - θ_{T W}^{0})}^{†}

around the parameters’ nominal values. These parameter variations will induce variations

δ C (E) ≜ C (E) - C^{0} (E)

in the neutron collision density around the nominal value

C^{0} (E)

of the neutron collision density. The variations

δ θ

and

δ C (E)

will induce variations

δ R [C^{0} (E); θ^{0}; δ C (E); δ θ]

in the decoder’s response.

The first-order Gateaux (G-)variation

δ R [C^{0} (E); θ^{0}; δ C (E); δ θ]

is obtained, by definition, from Equation (27) as follows:

\begin{array}{l} δ R (C^{0}; θ^{0}; δ C; δ θ) ≜ \frac{d}{d ε} {\{[Σ_{d} (θ^{0} + ε δ θ)] \int_{E_{l}}^{E_{s}} [C^{0} (E) + ε δ C (E)] d E\}}_{ε = 0} \\ = {\{δ R (C^{0}; θ^{0}; δ θ)\}}_{d i r} + {\{δ R (C^{0}; θ^{0}; δ C)\}}_{i n d}, \end{array}

(33)

where the “direct effect” term

{\{δ R (C^{0}; θ^{0}; δ θ)\}}_{d i r}

arises directly from parameter variations

δ θ

and is defined as follows:

\begin{array}{l} {\{δ R (C^{0}; θ^{0}; δ θ)\}}_{d i r} ≜ [\int_{E_{l}}^{E_{s}} C^{0} (E) d E] {\sum_{i = 1}^{T W} \{\frac{\partial Σ_{d} (θ)}{\partial θ_{i}}\}}_{θ = θ^{0}} δ θ_{i} \\ = {\{(δ N_{d}) σ_{d} + N_{d} (δ σ_{d})\}}_{θ = θ^{0}} \int_{E_{l}}^{E_{s}} C^{0} (E) d E, \end{array}

(34)

Meanwhile, the indirect-effect term arises from the variations

δ C (E)

and is defined as follows:

{\{δ R (C^{0}; θ^{0}; δ C)\}}_{i n d} ≜ Σ_{d} (θ^{0}) \int_{E_{l}}^{E_{s}} δ C (E) d E

(35)

As indicated in Equations (34) and (35), both the direct-effect and the indirect-effect terms are to be evaluated at the nominal parameter values.

The first-order relation between the variation

δ C (E)

and the parameter variations

δ θ_{i}

is obtained by evaluating the G-variation of Equation (26) for variations

δ θ

around the nominal parameter values

θ^{0}

, which yields, by definition, the following NIE-Volterra equation for

δ C (E)

:

\begin{array}{l} δ C (E) ≜ \frac{d}{d ε} {\{F (θ^{0} + ε δ θ) \frac{S^{0} + ε δ S}{E_{s}} + F (θ^{0} + ε δ θ) \int_{E}^{E_{s}} [C^{0} (e) + ε δ C (e)] \frac{d e}{e}\}}_{ε = 0} \\ = F (θ^{0}) \int_{E}^{E_{s}} δ C (e) \frac{d e}{e} + Q (E), \end{array}

(36)

where

\begin{array}{l} Q (E) ≜ \frac{F (θ^{0}) δ S}{E_{s}} + {\{[\frac{S^{0}}{E_{s}} + \int_{E}^{E_{s}} C^{0} (e) \frac{d e}{e}] [\sum_{i = 1}^{T W} \frac{\partial F (θ)}{\partial θ_{i}} δ θ_{i}]\}}_{θ = θ^{0}} \\ = \frac{F (θ^{0}) δ S}{E_{s}} + {\{\frac{S^{0}}{E_{s}} {(\frac{E_{s}}{E})}^{F (θ)} [\sum_{i = 1}^{T W} \frac{\partial F (θ)}{\partial θ_{i}} δ θ_{i}]\}}_{θ = θ^{0}} . \end{array}

(37)

The second equality in Equation (37) has been obtained by using Equations (26) and (31) to eliminate the integral term involving

C^{0} (e)

.

The particular form of the first-order derivative

\partial F (θ) / \partial θ_{i}

, which appears in Equation (37), is obtained by using the definition of

F (θ)

provided in Equation (30), which yields the following expression:

\frac{\partial F (θ)}{\partial θ_{i}} = \frac{1}{Σ_{t} (θ)} \frac{\partial Σ_{s} (θ)}{\partial θ_{i}} - \frac{Σ_{s} (θ)}{Σ_{t}^{2} (θ)} \frac{\partial Σ_{t} (θ)}{\partial θ_{i}},

(38)

In view of the definition provided in Equation (22), the derivatives

\partial Σ_{s} (θ) / \partial θ_{i}

have the following particular expressions:

f o r i = 1, \dots, M : θ_{i} ≜ w_{i}; \Rightarrow \frac{\partial Σ_{s} (θ)}{\partial θ_{i}} = N_{i} σ_{s}^{(i)};

(39)

f o r i = M + 1, \dots, 2 M : θ_{i} ≜ N_{i}; \Rightarrow \frac{\partial Σ_{s} (θ)}{\partial θ_{i}} = w_{i} σ_{s}^{(i)};

(40)

f o r i = 3 M + 1, \dots, 4 M : θ_{i} ≜ σ_{s}^{(i)}; \Rightarrow \frac{\partial Σ_{s} (θ)}{\partial θ_{i}} = w_{i} N_{i} .

(41)

In view of the definition provided in Equation (23), the derivatives

\partial Σ_{t} (θ) / \partial θ_{i}

have the following particular expressions:

f o r i = 1, \dots, M : θ_{i} ≜ w_{i}; \Rightarrow \frac{\partial Σ_{t} (θ)}{\partial θ_{i}} = N_{i} σ_{t}^{(i)};

(42)

f o r i = M + 1, \dots, 2 M : θ_{i} ≜ N_{i}; \Rightarrow \frac{\partial Σ_{t} (θ)}{\partial θ_{i}} = w_{i} σ_{t}^{(i)};

(43)

f o r i = 2 M + 1, \dots, 3 M : θ_{i} ≜ σ_{t}^{(i)}; \Rightarrow \frac{\partial Σ_{t} (θ)}{\partial θ_{i}} = w_{i} N_{i} .

(44)

The NIE-Volterra net represented by Equation (36) will be called the “1st-Level Variational Sensitivity System (1st-LVSS)” and its solution,

δ C (E)

, will be called the “1st-level variational sensitivity function”. It is evident that Equation (36) would need to be solved

(T W + 1)

times in order to obtain the variation

δ C (E)

for the source variation

δ S

and for every parameter variation

δ θ_{i}

This need to repeatedly solve Equation (36) can be circumvented by applying the principles of the 1st-CASAM-NIE-V, generally outlined in Section 2, to eliminate the appearance of the variation

δ C (E)

in the indirect-effect term defined in Equation (35) while expressing this indirect-effect term as a functional of a first-level adjoint function that does not depend on any parameter variation, as follows.

Consider that the function $δ C (E) \in H_{1} (Ω_{E})$ belongs to a Hilbert space denoted as $H_{1} (Ω_{E})$ , which is defined on the domain $Ω_{E} ≜ [E_{l}, E_{s}]$ . The inner product in $H_{1} (Ω_{E})$ of two functions, $u (E) \in H_{1} (Ω_{E})$ and $v (E) \in H_{1} (Ω_{E})$ , will be denoted as ${〈u (E), v (E)〉}_{1}$ , and it is defined as follows:

${〈u (E), v (E)〉}_{1} ≜ \int_{E_{l}}^{E_{0}} u (E) v (E) d E$

(45)
Form the inner product of Equation (36) with a vector $a^{(1)} (E) \in H_{1} (E)$ , where the superscript “(1)” indicates “1st-Level”, to obtain the following relationship:

${〈a^{(1)} (E), δ C (E) - F (θ^{0}) \int_{E}^{E_{s}} δ C (e) \frac{d e}{e}〉}_{1} = {〈a^{(1)} (E), Q (E)〉}_{1}$

(46)
Transform the left side of Equation (46) as follows:

$\begin{array}{l} {〈a^{(1)} (E), δ C (E) - F (θ^{0}) \int_{E}^{E_{s}} δ C (e) \frac{d e}{e}〉}_{1} = \int_{E_{l}}^{E_{s}} a^{(1)} (E) δ C (E) d E \\ - F (θ^{0}) \int_{E_{l}}^{E_{s}} a^{(1)} (E) d E \int_{E}^{E_{s}} δ C (e) \frac{d e}{e} = \int_{E_{l}}^{E_{s}} δ C (E) a^{(1)} (E) d E \\ - F (θ^{0}) \int_{E_{l}}^{E_{s}} δ C (E) \frac{d E}{E} \int_{E_{l}}^{E} a^{(1)} (e) d e = \int_{E_{l}}^{E_{s}} δ C (E) [a^{(1)} (E) - \frac{F (θ^{0})}{E} \int_{E_{l}}^{E} a^{(1)} (e) d e] d E . \end{array}$

(47)

In obtaining the expression on the right side of the last equality in Equation (47), the well-known “integration by parts” formula

\int g^{'} (x) f (x) d x = g (x) f (x) - \int g (x) f^{'} (x) d x

has been used to reverse the integration order in the double integral

\int_{E_{l}}^{E_{s}} a^{(1)} (E) d E \int_{E}^{E_{s}} δ C (e) \frac{d e}{e}

, as follows:

\begin{array}{l} g^{'} (E) ≜ v (E) ≜ a^{(1)} (E); g (E) = \int_{E_{l}}^{E} v (x) d x = \int_{E_{l}}^{E} a^{(1)} (E) d E; \\ f (E) ≜ \int_{E}^{E_{s}} u (x) d x; u (E) ≜ δ C (E) / E; f^{'} (E) ≜ - u (E); \end{array}

(48)

\begin{array}{l} \int_{E_{l}}^{E_{s}} v (E) d E \int_{E}^{E_{s}} u (x) d x = {\{\int_{E_{l}}^{E} v (E) d E \int_{E}^{E_{s}} u (x) d x\}}_{E = E_{s}} - {\{\int_{E_{l}}^{E} v (E) d E \int_{E}^{E_{s}} u (x) d x\}}_{E = E_{l}} \\ - \int_{E_{l}}^{E_{s}} [- u (E) d E] \int_{E_{l}}^{E} v (x) d x = \int_{E_{l}}^{E_{s}} u (E) d E \int_{E_{l}}^{E} v (x) d x . \end{array}

(49)

4.: Require the last term in Equation (47) to represent the indirect-effect term defined in Equation (35), which yields the following “1st-Level Adjoint Sensitivity System (1st-LASS)” for the first-level adjoint sensitivity function $a^{(1)} (E)$ :

$a^{(1)} (E) - \frac{F (θ^{0})}{E} \int_{E_{l}}^{E} a^{(1)} (e) d e = Σ_{d} (θ^{0})$

(50)

The 1st-LASS represented by Equation (50) is a linear NIE-Volterra net, which is independent of any parameter variation and needs to be solved just once to obtain the first-level adjoint sensitivity function $a^{(1)} (E)$ . Notably, the 1st-LASS is an “initial-value problem”, in that the computation of $a^{(1)} (E)$ commences at the lowest energy value, where $a^{(1)} (E_{l}) = Σ_{d} (θ^{0})$ , and progresses towards the highest energy value, $E_{s}$ . For further reference, the closed-form solution of Equation (50) can be obtained by differentiating this equation with respect to $E$ and subsequently integrating the resulting first-order linear differential equation to obtain the following exact expression:

$a^{(1)} (E) = \frac{Σ_{d} (θ)}{1 - F (θ)} [1 - F (θ) {(\frac{E}{E_{l}})}^{F (θ) - 1}]$

(51)

The expression on the right side of Equation (51) is to be evaluated at the nominal parameter values $θ^{0}$ , but the superscript “zero” has been omitted for notational simplicity.
5.: Using Equations (46), (47), and (50) yields the following expression for the indirect-effect term defined in Equation (35):

$\begin{array}{l} {\{δ R (C; θ; δ C)\}}_{i n d} = \int_{E_{l}}^{E_{s}} a^{(1)} (E) Q (E) d E = \frac{F (θ) δ S}{E_{s}} \int_{E_{l}}^{E_{s}} a^{(1)} (E) d E \\ + \frac{S}{E_{s}} [\sum_{i = 1}^{T W} \frac{\partial F (θ)}{\partial θ_{i}} δ θ_{i}] \int_{E_{l}}^{E_{s}} a^{(1)} (E) d E {(\frac{E_{s}}{E})}^{F (θ)} . \end{array}$

(52)

The expression on the right side of Equation (52) is to be evaluated at the nominal parameter values $θ^{0}$ , but the superscript “zero” has been omitted for notational simplicity.
6.: Adding the expression obtained in Equation (52) to the expression of the direct-effect term represented by Equation (34) yields the following expression for the first-order G-variation $δ R (C^{0}; θ^{0}; δ C; δ θ)$ :

$\begin{array}{l} δ R (C^{0}; θ^{0}; δ C; δ θ) = [(δ N_{d}) σ_{d} + N_{d} (δ σ_{d})] \int_{E_{l}}^{E_{s}} C (E) d E \\ + \frac{F (θ) δ S}{E_{s}} \int_{E_{l}}^{E_{s}} a^{(1)} (E) d E + \frac{S}{E_{s}} [\sum_{i = 1}^{T W} \frac{\partial F (θ)}{\partial θ_{i}} δ θ_{i}] \int_{E_{l}}^{E_{s}} a^{(1)} (E) d E {(\frac{E_{s}}{E})}^{F (θ)} . \end{array}$

(53)
7.: It follows from Equation (53) that the first-order sensitivities of the decoder response with respect to the (encoder’s) source strength and the optimal weights/parameters have the following expressions:

$\frac{\partial R}{\partial S} = \frac{F (θ)}{E_{s}} \int_{E_{l}}^{E_{s}} a^{(1)} (E) d E;$

(54)

$\frac{\partial R}{\partial N_{d}} = σ_{d} \int_{E_{l}}^{E_{s}} C (E) d E;$

(55)

$\frac{\partial R}{\partial σ_{d}} = N_{d} \int_{E_{l}}^{E_{s}} C (E) d E;$

(56)

$\frac{\partial R}{\partial θ_{i}} = \frac{S}{E_{s}} \frac{\partial F (θ)}{\partial θ_{i}} \int_{E_{l}}^{E_{s}} a^{(1)} (E) {(\frac{E_{s}}{E})}^{F (θ)} d E; i = 1, \dots, T W - 2 .$

(57)

Inserting into Equations (54)–(57) the closed-form expression for the neutron collision density obtained in Equation (31) yields the following closed-form explicit expressions for the first-order sensitivities of the decoder response with respect to the (encoder’s) source strength and the optimal weights/parameters:

\frac{\partial R}{\partial S} = Σ_{d} (θ) \frac{F (θ)}{1 - F (θ)} [1 - {(\frac{E_{s}}{E_{l}})}^{F (θ) - 1}];

(58)

\frac{\partial R}{\partial N_{d}} = σ_{d} S \frac{F (θ)}{1 - F (θ)} [1 - {(\frac{E_{s}}{E_{l}})}^{F (θ) - 1}];

(59)

\frac{\partial R}{\partial σ_{d}} = N_{d} S \frac{F (θ)}{1 - F (θ)} [1 - {(\frac{E_{s}}{E_{l}})}^{F (θ) - 1}];

(60)

\begin{array}{l} \frac{\partial R}{\partial θ_{i}} = \frac{\partial F (θ)}{\partial θ_{i}} \frac{S Σ_{d} (θ)}{1 - F (θ)} \{\frac{1}{1 - F (θ)} [1 - {(\frac{E_{s}}{E_{l}})}^{F (θ) - 1}] - F (θ) {(\frac{E_{s}}{E_{l}})}^{F (θ) - 1} \ln (\frac{E_{s}}{E_{l}})\}; \\ i = 1, \dots, T W . \end{array}

(61)

The correctness of the expressions obtained in Equations (58)–(61) can be readily verified by differentiating the expressions of the decoder’s response obtained in Equation (32).

In practice, only the exact mathematical expression of the 1st-LASS, namely, Equation (50), and the exact mathematical expression of the first-order sensitivities obtained in Equations (54)–(57) are available. The solution of the 1st-LASS, which is a linear NIE-Volterra net for the first-level adjoint sensitivity function

a^{(1)} (E)

, would need to be obtained numerically in practice. The numerical solution for

a^{(1)} (E)

would be used to determine the first-order sensitivities stemming from the “indirect-effect” term by using quadrature formulas to evaluate the integrals obtained in Equations (54) and (57). It is very important to note that a single “large-scale” computation for determining numerically the adjoint function

a^{(1)} (E)

by solving the 1st-LASS (a NIE-Volterra type equation would be needed for evaluating all of the first-order sensitivities. The numerical computations using quadrature formulas for evaluating the integrals in Equations (54) and (57) are considered to be “small-scale” computations.

As it has been already observed in the brief remarks following Equation (37), the computation of the first-order sensitivities of the decoder response with respect to the encoder source strength S and model weights/parameters could also have been computed by repeatedly numerically solving the NIE-Volterra net (1st-LVSS) represented by Equation (36). This procedure would be very expensive computationally, as it would require

(T W + 1)

large-scale computations to solve the 1st-LVSS defined by Equation (36) in order to obtain the variation

δ C (E)

for every parameter variation

δ θ_{i}

and the source variation

δ S

. In addition, the same amount of “quadrature” computations would need to be performed using Equation (35) as would be needed for evaluating the first-order sensitivities using Equations (54) and (57).

3.2. Efficient Indirect Computation Using the 1st-FASAM-NIE-V of the First-Order Sensitivities of the Decoder Response with Respect to Primary Model Parameters

When feature functions of model parameters, such as

Σ_{d} (θ)

and

F (θ)

, can be identified, as is the case with the NIE-Volterra net and decoder response represented by Equations (26) and (27), respectively, it is considerably more efficient to determine the first-order sensitivities of the decoder response with respect to the feature functions and subsequently derive analytically the sensitivities with respect to the primary model parameters by using the “chain rule of differentiation”, as will be shown in this Section. Thus, considering arbitrary variations

δ Σ_{d} (θ) ≜ Σ_{d} (θ) - Σ_{d} (θ^{0})

and

δ F (θ) ≜ F (θ) - F (θ^{0})

around the nominal values

Σ_{d} (θ^{0})

and, respectively,

F (θ^{0})

, the first-order G-variation of the decoder response has the following expression:

\begin{array}{l} δ R (C^{0}; θ^{0}; δ C; δ Σ_{d}) ≜ \frac{d}{d ε} {\{[Σ_{d} (θ^{0}) + ε δ Σ_{d} (θ^{0})] \int_{E_{l}}^{E_{s}} [C^{0} (E) + ε δ C (E)] d E\}}_{ε = 0} \\ = (δ Σ_{d}) {\{\int_{E_{l}}^{E_{s}} C (E) d E\}}_{θ = θ^{0}} + {\{δ R (C^{0}; θ^{0}; δ C)\}}_{i n d}, \end{array}

(62)

where the expression of the indirect-effect term is defined in Equation (35). The first-order relation between the variation

δ C (E)

and the variations

δ Σ_{d} (θ)

and

δ F (θ)

is obtained, by definition, from Equation (26) as follows:

\begin{array}{l} δ C (E) ≜ \frac{d}{d ε} \{[F (θ^{0}) + ε δ F (θ)] \frac{S^{0} + ε δ S}{E_{s}} + [F (θ^{0}) + ε δ F (θ)] \\ {\times \int_{E}^{E_{s}} [C^{0} (e) + ε δ C (e)] \frac{d e}{e}\}}_{ε = 0} = F (θ^{0}) \int_{E}^{E_{s}} δ C (e) \frac{d e}{e} + Q (E), \end{array}

(63)

where

Q (E) ≜ \frac{F (θ^{0}) δ S}{E_{s}} + {\{\frac{S^{0}}{E_{s}} {(\frac{E_{s}}{E})}^{F (θ)} [δ F (θ)]\}}_{θ = θ^{0}} .

(64)

Comparing Equation (63) to Equation (36) indicates that the only difference between these equations is the expression of the term

Q (E)

, which is expressed in terms of

δ F (θ)

in Equation (64). Consequently, the first-level adjoint sensitivity function that corresponds to the variational function

δ C (E)

is determined by following the same procedure as outlined in Equations (46)–(50), ultimately obtaining the same 1st-LASS as was obtained in Equation (50), having as solution the same expression for

a^{(1)} (E)

as was obtained in Equation (51). It further follows that the expression of the indirect-effect term will have the following expression:

{\{δ R (C; θ; δ C)\}}_{i n d} = (δ S) \frac{F (θ)}{E_{s}} \int_{E_{l}}^{E_{s}} a^{(1)} (E) d E + [δ F (θ)] \frac{S}{E_{s}} \int_{E_{l}}^{E_{s}} a^{(1)} (E) d E {(\frac{E_{s}}{E})}^{F (θ)} .

(65)

It follows from Equations (62) and (65) that the first-order G-variation

δ R (C^{0}; θ^{0}; δ C; δ Σ_{d})

has the following expression:

\begin{array}{l} δ R (C^{0}; θ^{0}; δ C; δ Σ_{d}) = (δ Σ_{d}) {\{\int_{E_{l}}^{E_{s}} C (E) d E\}}_{θ = θ^{0}} \\ + (δ S) \frac{F (θ)}{E_{s}} \int_{E_{l}}^{E_{s}} a^{(1)} (E) d E + [δ F (θ)] \frac{S}{E_{s}} \int_{E_{l}}^{E_{s}} a^{(1)} (E) d E {(\frac{E_{s}}{E})}^{F (θ)} . \end{array}

(66)

As indicated by the expression obtained in Equation (66), the first-order sensitivities of the decoder response with respect to the feature functions and the encoder’s source strength are as follows:

\frac{\partial R}{\partial F (θ)} = \frac{S}{E_{s}} \int_{E_{l}}^{E_{s}} a^{(1)} (E) {(\frac{E_{s}}{E})}^{F (θ)} d E;

(67)

\frac{\partial R}{\partial Σ_{d} (θ)} = \int_{E_{l}}^{E_{s}} C (E) d E;

(68)

\frac{\partial R}{\partial S} = \frac{F (θ)}{E_{s}} \int_{E_{l}}^{E_{s}} a^{(1)} (E) d E .

(69)

The closed-form expressions of the above sensitivities are readily determined by using in Equations (67)–(69) the expressions obtained in Equations (51) and (24), and by performing the respective integrations to obtain

\frac{\partial R}{\partial F (θ)} = \frac{S Σ_{d} (θ)}{1 - F (θ)} \{\frac{1}{1 - F (θ)} [1 - {(\frac{E_{s}}{E_{l}})}^{F (θ) - 1}] - F (θ) {(\frac{E_{s}}{E_{l}})}^{F (θ) - 1} \ln (\frac{E_{s}}{E_{l}})\};

(70)

\frac{\partial R}{\partial Σ_{d} (θ)} = S \frac{F (θ)}{1 - F (θ)} [1 - {(\frac{E_{s}}{E_{l}})}^{F (θ) - 1}];

(71)

\frac{\partial R}{\partial S} = Σ_{d} (θ) \frac{F (θ)}{1 - F (θ)} [1 - {(\frac{E_{s}}{E_{l}})}^{F (θ) - 1}] .

(72)

The first-order sensitivities with respect to the primary parameters are obtained analytically from Equations (67) and (68), respectively, by using the following “chain rule” of differentiation:

\frac{\partial R}{\partial N_{d}} = \frac{\partial R}{\partial Σ_{d} (θ)} \frac{\partial Σ_{d} (θ)}{\partial N_{d}} = σ_{d} \frac{\partial R}{\partial Σ_{d} (θ)};

(73)

\frac{\partial R}{\partial σ_{d}} = \frac{\partial R}{\partial Σ_{d} (θ)} \frac{\partial Σ_{d} (θ)}{\partial σ_{d}} = N_{d} \frac{\partial R}{\partial Σ_{d} (θ)};

(74)

\frac{\partial R}{\partial θ_{i}} = \frac{\partial R}{\partial F (θ)} \frac{\partial F (θ)}{\partial θ_{i}}; i = 1, \dots, 4 M .

(75)

The specific expressions of the first-order sensitivities

\partial R / \partial θ_{i}

,

i = 1, \dots, 4 M

, are obtained by using Equation (75) in conjunction with Equation (69) and Equations (38)–(44).

3.3. Discussion: Direct Versus Indirect Computation of the First-Order Sensitivities of Decoder Response with Respect to the Primary Model Parameters

The principles of the 1st-CASAM-NIE-V were applied in Section 3.1 to determine the first-order sensitivities of the decoder response directly with respect to the model’s primary parameters/weights. It has been shown that this procedure requires a single “large-scale” computation to solve an NIE-Volterra equation in order to determine the (single) first-level adjoint sensitivity function

a^{(1)} (E)

, which is subsequently used in

4 M + 1

integrals that are computed using quadrature formulas. The two additional first-order sensitivities with respect to the components of

Σ_{d} (θ)

require a single quadrature involving the forward function

C (E)

.

The principles of the 1st-FASAM-NIE-V were applied in Section 3.2 to determine the first-order sensitivities of the decoder response with respect to the feature functions. This path required just two (as opposed to

4 M + 1

) numerical evaluations of (two) integrals using quadrature formulas involving the first-level adjoint sensitivity function

a^{(1)} (E)

. The sensitivities of the decoder response with respect to the primary parameters/weights were subsequently determined analytically, using the “chain rule of differentiation” of the explicitly known expression of the feature function

F (θ)

. Evaluating the two additional first-order sensitivities with respect to the components of

Σ_{d} (θ)

requires a single quadrature involving the forward function

C (E)

, as in Section 3.1. Evidently, the indirect path presented in Section 3.2 is computationally more efficient, as it requires substantially fewer numerical quadratures than the path presented in Section 3.1. The superiority of the indirect path, via “feature functions”, over the direct computation of sensitivities with respect to the model parameters will be considerably more evident for the computation of second-order sensitivities, as will be shown in the forthcoming Section 4 and Section 5, below.

Of course, when no feature functions can be identified, the 1st-FASAM-NIE-V methodology becomes identical to the 1st-CASAM-NIE-V methodology.

4. The Second-Order Features Adjoint Sensitivity Analysis Methodology for Neural Integral Equations of the Volterra Type (2nd-FASAM-NIE-V)

The second-order sensitivities of the response

R [h; F (θ)]

defined in Equation (3) will be computed by conceptually using their basic definitions as being the “first-order sensitivities of the first-order sensitivities”. Thus, the second-order sensitivities stemming from the first-order sensitivities

\partial R [h; F (θ)] / \partial F_{j}

are obtained from the first-order G-differential of Equation (18), for

j = 1, \dots, T F

, as follows:

\begin{array}{l} δ (\frac{\partial R}{\partial F_{j}}) ≜ {\{\frac{d}{d ε} [\int_{t_{0}}^{t_{f}} \frac{\partial D [h^{0} (t) + ε v^{(1)} (t); F (θ^{0}) + ε δ F; t]}{\partial F_{j}} d t]\}}_{ε = 0} \\ + {\{\sum_{i = 1}^{T H} \frac{d}{d ε} \int_{t_{0}}^{t_{f}} [a_{i}^{(1, 0)} (t) + ε δ a_{i}^{(1)} (t)] \frac{\partial g_{i} (F^{0} + ε δ F; t)}{\partial F_{j}} d t\}}_{ε = 0} \\ + \{\sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} [a_{i}^{(1, 0)} (t) + ε δ a_{i}^{(1)} (t)] \frac{\partial φ_{i} (F^{0} + ε δ F; t)}{\partial F_{j}} \\ {\times \int_{t_{0}}^{t} ψ_{i} [h^{0} (τ) + ε v^{(1)} (τ); F^{0} + ε δ F; τ] d τ\}}_{ε = 0} \\ + \{\sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} [a_{i}^{(1, 0)} (t) + ε δ a_{i}^{(1)} (t)] φ_{i} (F^{0} + ε δ F; t) d t \\ \times {\int_{t_{0}}^{t} \frac{\partial ψ_{i} [h^{0} (τ) + ε v^{(1)} (τ); F^{0} + ε δ F; τ]}{\partial F_{j}} d τ\}}_{ε = 0} \\ ≜ δ {(\partial R / \partial F_{j})}_{d i r} + δ {(\partial R / \partial F_{j})}_{i n d}; j = 1, \dots, T F . \end{array}

(76)

In Equation (76), the expression of the direct-effect term

δ {(\partial R / \partial F_{j})}_{d i r}

is obtained after performing the operations with respect to the scalar

ε

and comprises the variations

δ F

(stemming from variations in the model parameters), being defined as follows:

\begin{array}{l} δ {(\frac{\partial R}{\partial F_{j}})}_{d i r} ≜ \sum_{k = 1}^{T F} \int_{t_{0}}^{t_{f}} [\frac{\partial^{2} D [h (t); F (θ); t]}{\partial F_{k} \partial F_{j}} δ F_{k}] d t + \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} [a_{i}^{(1)} (t) \sum_{k = 1}^{T F} \frac{\partial^{2} g_{i} (h; F)}{\partial F_{k} \partial F_{j}} δ F_{k}] d t \\ + \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} a_{i}^{(1)} (t) [\sum_{k = 1}^{T F} \frac{\partial^{2} φ_{i} (F; t)}{\partial F_{k} \partial F_{j}} δ F_{k}] \int_{t_{0}}^{t} ψ_{i} [h (τ); F; τ] d τ \\ + \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} a_{i}^{(1)} (t) \frac{\partial φ_{i} (F; t)}{\partial F_{j}} \int_{t_{0}}^{t} [\sum_{k = 1}^{T F} \frac{\partial ψ_{i} [h (τ); F; τ]}{\partial F_{k}} δ F_{k}] d τ \\ + \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} a_{i}^{(1)} (t) [\sum_{k = 1}^{T F} \frac{\partial φ_{i} (F; t)}{\partial F_{k}} δ F_{k}] d t \int_{t_{0}}^{t} \frac{\partial ψ_{i} [h (τ); F; τ]}{\partial F_{j}} d τ \\ + \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} a_{i}^{(1)} (t) φ_{i} (F; t) d t [\sum_{k = 1}^{T F} \frac{\partial^{2} ψ_{i} [h (τ); F; τ]}{\partial F_{k} \partial F_{j}} δ F_{k}]; j = 1, \dots, T F . \end{array}

(77)

The expression on the right side of Equation (77) is to be evaluated at the nominal/optimal values for the respective model parameters, but this fact has not been indicated explicitly in order to simplify the notation.

The expression of the indirect-effect term

δ {(\partial R / \partial F_{j})}_{i n d}

defined in Equation (76) is obtained after performing the operations with respect to the scalar

ε

and comprises the variations

v^{(1)} (t)

and

δ a^{(1)} (t) ≜ {[δ a_{1}^{(1)} (t), \dots, δ a_{T H}^{(1)} (t)]}^{†}

, as follows:

\begin{array}{l} δ {(\frac{\partial R}{\partial F_{j}})}_{i n d} ≜ \sum_{k = 1}^{T H} \int_{t_{0}}^{t_{f}} \frac{\partial^{2} D [h (t); F (θ); t]}{\partial h_{k} (t) \partial F_{j}} v_{k} (t) d t + \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} δ a_{i}^{(1)} (t) \frac{\partial g_{i} (F; t)}{\partial F_{j}} d t \\ + \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} δ a_{i}^{(1)} (t) \frac{\partial φ_{i} (F; t)}{\partial F_{j}} \int_{t_{0}}^{t} ψ_{i} [h (τ); F; τ] d τ \\ + \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} a_{i}^{(1)} (t) \frac{\partial φ_{i} (F; t)}{\partial F_{j}} \int_{t_{0}}^{t} \sum_{k = 1}^{T H} \frac{\partial ψ_{i} [h (τ); F; τ]}{\partial h_{k} (τ)} v_{k} (τ) d τ \\ + \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} δ a_{i}^{(1)} (t) φ_{i} (F; t) d t \int_{t_{0}}^{t} \frac{\partial ψ_{i} [h (τ); F; τ]}{\partial F_{j}} d τ \\ + \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} a_{i}^{(1)} (t) φ_{i} (F; t) d t \int_{t_{0}}^{t} \sum_{k = 1}^{T H} [\frac{\partial^{2} ψ_{i} [h (τ); F; τ]}{\partial h_{k} (τ) \partial F_{j}} v_{k} (τ)] d τ; j = 1, \dots, T F . \end{array}

(78)

The expressions in Equation (78) are to be evaluated at the nominal values of the respective functions and parameters, but the respective indication (i.e., the superscript “zero”) has been omitted in order to simplify the notation.

The direct-effect term

δ {(\partial R / \partial F_{j})}_{d i r}

can be evaluated at this time for all variations

δ F

, but the indirect-effect term

δ {(\partial R / \partial F_{j})}_{i n d}

can be evaluated only after having determined the variations

v^{(1)} (t)

and

δ a^{(1)} (t)

. The variation

v^{(1)} (t)

is the solution of the 1st-LVSS defined by Equation (9). On the other hand, the variational function

δ a^{(1)} (t)

is the solution of the system of equations obtained by G-differentiating the 1st-LASS. By definition, the G-differential of Equation (15) is obtained as follows for

i = 1, \dots, T H

:

\begin{array}{l} {\{\frac{d}{d ε} [a_{i}^{(1, 0)} (t) + ε δ a_{i}^{(1)} (t)]\}}_{ε = 0} - \{\frac{d}{d ε} [\sum_{k = 1}^{T H} \frac{\partial ψ_{k} [h^{0} (t) + ε v^{(1)} (t); F^{0} + ε δ F; t]}{\partial h_{i} (t)}] \\ \times {\int_{t}^{t_{f}} [a_{k}^{(1, 0)} (τ) + ε δ a_{k}^{(1)} (τ)] φ_{k} (F^{0} + ε δ F; τ) d τ\}}_{ε = 0} \\ = {\{\frac{d}{d ε} \frac{\partial D [h^{0} (t) + ε v^{(1)} (t); F^{0} + ε δ F; t]}{\partial h_{i} (t)}\}}_{ε = 0} . \end{array}

(79)

Performing the operations indicated in Equation (79) and rearranging the various terms yields the following relations for

i = 1, \dots, T H

:

\begin{array}{l} δ a_{i}^{(1)} (t) - \sum_{k = 1}^{T H} \frac{\partial ψ_{k} [h (t); F; t]}{\partial h_{i} (t)} \int_{t}^{t_{f}} δ a_{k}^{(1)} (τ) φ_{k} (F; τ) d τ - \sum_{m = 1}^{T H} \frac{\partial^{2} D [h (t); F; t]}{\partial h_{m} (t) \partial h_{i} (t)} v_{m}^{(1)} (t) \\ - \sum_{k = 1}^{T H} [\sum_{m = 1}^{T H} \frac{\partial^{2} ψ_{k} [h (t); F; t]}{\partial h_{m} (t) \partial h_{i} (t)} v_{m}^{(1)} (t)] \int_{t}^{t_{f}} a_{k}^{(1)} (τ) φ_{k} (F; τ) d τ = \sum_{n = 1}^{T F} S_{i n} (F; t) δ F_{n}, \end{array}

(80)

where

\begin{array}{l} \sum_{n = 1}^{T F} S_{i n} (F; t) δ F_{n} ≜ \sum_{k = 1}^{T H} [\sum_{n = 1}^{T F} \frac{\partial^{2} ψ_{k} [h (t); F; t]}{\partial F_{n} \partial h_{i} (t)} δ F_{n}] \int_{t}^{t_{f}} a_{k}^{(1)} (τ) φ_{k} (F; τ) d τ \\ + \sum_{k = 1}^{T H} \frac{\partial ψ_{k} [h (t); F; t]}{\partial h_{i} (t)} \sum_{n = 1}^{T F} \int_{t}^{t_{f}} [a_{k}^{(1)} (τ) \frac{\partial φ_{k} (F; τ)}{\partial F_{n}} δ F_{n}] d τ + \sum_{n = 1}^{T F} \frac{\partial^{2} D [h (t); F; t]}{\partial F_{n} \partial h_{i} (t)} δ F_{n} . \end{array}

(81)

As indicated by the result obtained in Equation (80), the variations

δ a^{(1)} (t)

are coupled to the variations

v^{(1)} (t)

. Therefore, they can be obtained by simultaneously solving Equations (80) and (9), which together will be called the “2nd-Level Variational Sensitivity System (2nd-LVSS)”. The solution of the 2nd-LVSS, namely, the vector

v^{(2)} (t) ≜ {[δ a^{(1)} (t), v^{(1)} (t)]}^{†}

, will be called the “2nd-level variational sensitivity function”. Because the 2nd-LVSS depends on the variations

δ F

(stemming from variations in the model parameters), it would need to be solved anew for each such variation. The repeated solving of the 2nd-LVSS can be avoided by following the general principles underlying the 2nd-FASAM [16], which considers the function

v^{(2)} (t) ≜ {[v^{(1)} (t), δ a^{(1)} (t)]}^{†}

to be an element in a Hilbert space, denoted as

H_{2} (Ω_{t})

. The Hilbert space

H_{2} (Ω_{t})

is considered to be endowed with an inner product denoted as

{〈χ^{(2)}, η^{(2)}〉}_{2}

between two vectors

χ^{(2)} (t) ≜ {[χ_{1}^{(2)} (t), χ_{2}^{(2)} (t)]}^{†} \in H_{2} (Ω_{t})

and

η^{(2)} (t) = {[η_{1}^{(2)} (t), η_{2}^{(2)} (t)]}^{†} \in H_{2} (Ω_{t})

, with

η_{1}^{(2)} (t) ≜ {[η_{1, 1}^{(2)} (t), \dots, η_{1, T H}^{(2)} (t)]}^{†}

,

η_{2}^{(2)} (t) ≜ {[η_{2, 1}^{(2)} (t), \dots, η_{2, T H}^{(2)} (t)]}^{†}

,

χ_{1}^{(2)} (t) ≜ {[χ_{1, 1}^{(2)} (t), \dots, χ_{1, T H}^{(2)} (t)]}^{†}

, and

χ_{2}^{(2)} (t) ≜ {[χ_{2, 1}^{(2)} (t), \dots, χ_{2, T H}^{(2)} (t)]}^{†}

, which is defined as follows:

\begin{array}{l} {〈χ^{(2)}, η^{(2)}〉}_{2} ≜ \int_{t_{0}}^{t_{f}} {[χ^{(2)} (t)]}^{†} η^{(2)} (t) d t = {〈χ_{1}^{(2)}, η_{1}^{(2)}〉}_{1} + {〈χ_{2}^{(2)}, η_{2}^{(2)}〉}_{1} \\ = \sum_{j = 1}^{T H} \int_{t_{0}}^{t_{f}} χ_{1, j}^{(2)} (t) η_{1, j}^{(2)} (t) d t + \sum_{j = 1}^{T H} \int_{t_{0}}^{t_{f}} χ_{2, j}^{(2)} (t) η_{2, j}^{(2)} (t) d t . \end{array}

(82)

Following the general principles underlying the 2nd-FASAM [16], the function

v^{(2)} (t) ≜ {[v^{(1)} (t), δ a^{(1)} (t)]}^{†}

will be eliminated from the expression of each indirect-effect term

δ {(\partial R / \partial F_{j})}_{i n d}

,

j = 1, \dots, T F

, defined in Equation (78). This elimination is achieved by considering, for each index

j = 1, \dots, T F

, a vector-valued function denoted as

a^{(2)} (t; j) = {[a_{1}^{(2)} (t; j), a_{2}^{(2)} (t; j)]}^{†} \in H_{2} (Ω_{t})

, with

a_{1}^{(2)} (t; j) ≜ {[a_{1, 1}^{(2)} (t; j), \dots, a_{1, T H}^{(2)} (t; j)]}^{†}

and

a_{2}^{(2)} (t; j) ≜ {[a_{2, 1}^{(2)} (t; j), \dots, a_{2, T H}^{(2)} (t; j)]}^{†}

. Using the definition provided in Equation (82), we construct the inner product of Equations (9) and (80) with the vector

a^{(2)} (t; j) = {[a_{1}^{(2)} (t; j), a_{2}^{(2)} (t; j)]}^{†}

to obtain the following relation:

\begin{array}{l} \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} a_{1, i}^{(2)} (t; j) v_{i}^{(1)} (t) d t - \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} a_{1, i}^{(2)} (t; j) φ_{i} (F; t) d t \sum_{k = 1}^{T H} \int_{t_{0}}^{t} \frac{\partial ψ_{i} [h (τ); F; τ]}{\partial h_{k} (τ)} v_{k}^{(1)} (τ) d τ \\ + \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} a_{2, i}^{(2)} (t; j) δ a_{i}^{(1)} (t) d t - \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} a_{2, i}^{(2)} (t; j) \sum_{k = 1}^{T H} \frac{\partial ψ_{k} [h (t); F; t]}{\partial h_{i} (t)} d t \int_{t}^{t_{f}} δ a_{k}^{(1)} (τ) φ_{k} (F; τ) d τ \\ - \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} a_{2, i}^{(2)} (t; j) \sum_{m = 1}^{T H} \frac{\partial^{2} D [h (t); F; t]}{\partial h_{m} (t) \partial h_{i} (t)} v_{m}^{(1)} (t) d t \\ - \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} a_{2, i}^{(2)} (t; j) \sum_{k = 1}^{T H} [\sum_{m = 1}^{T H} \frac{\partial^{2} ψ_{k} [h (t); F; t]}{\partial h_{m} (t) \partial h_{i} (t)} v_{m}^{(1)} (t)] d t \int_{t}^{t_{f}} a_{k}^{(1)} (τ) φ_{k} (F; τ) d τ = Q^{(2)}, \end{array}

(83)

where

Q^{(2)} ≜ \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} a_{1, i}^{(2)} (t) d t [\sum_{k = 1}^{T F} q_{i k} (F; t) δ F_{k}] + \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} a_{2, i}^{(2)} (t) d t [\sum_{k = 1}^{T F} S_{i k} (F; t) δ F_{k}] .

(84)

Following the principles of the 2nd-CASAM [16], the left side of Equation (83) will be identified with the indirect-effect term defined in Equation (78), thereby determining the (yet undetermined) functions

a^{(2)} (t; j) = {[a_{1}^{(2)} (t; j), a_{2}^{(2)} (t; j)]}^{†}

. For this purpose, the right side of Equation (78) is cast in the form of the inner product

{〈v^{(2)} (t), []〉}_{2} = {〈v^{(1)} (t), []〉}_{1} + {〈δ a^{(1)} (t), []〉}_{1}

. The terms on the right side of Equation (78) involving the components of the function

δ a^{(1)} (t)

are already in the desired format, but the terms involving the components of the function

v^{(1)} (t)

must be re-arranged, as follows.

(i): The fourth term on the right side of Equation (78) is recast by using “integration by parts” as follows:

$\begin{array}{l} \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} a_{i}^{(1)} (t) \frac{\partial φ_{i} (F; t)}{\partial F_{j}} \int_{t_{0}}^{t} \sum_{k = 1}^{T H} \frac{\partial ψ_{i} [h (τ); F; τ]}{\partial h_{k} (τ)} v_{k} (τ) d τ \\ = {\sum_{i = 1}^{T H} \sum_{k = 1}^{T H} \{[\int_{t_{0}}^{t} a_{i}^{(1)} (t) \frac{\partial φ_{i} (F; t)}{\partial F_{j}} d t] [\int_{t_{0}}^{t} \frac{\partial ψ_{i} [h (τ); F; τ]}{\partial h_{k} (τ)} v_{k} (τ) d τ]\}}_{t = t_{f}} \\ - {\sum_{i = 1}^{T H} \sum_{k = 1}^{T H} \{[\int_{t_{0}}^{t} a_{i}^{(1)} (t) \frac{\partial φ_{i} (F; t)}{\partial F_{j}} d t] [\int_{t_{0}}^{t} \frac{\partial ψ_{i} [h (τ); F; τ]}{\partial h_{k} (τ)} v_{k} (τ) d τ]\}}_{t = t_{0}} \\ - \sum_{i = 1}^{T H} \sum_{k = 1}^{T H} \int_{t_{0}}^{t_{f}} \frac{\partial ψ_{i} [h (t); F; t]}{\partial h_{k} (t)} v_{k} (t) \int_{t_{0}}^{t} a_{i}^{(1)} (τ) \frac{\partial φ_{i} (F; τ)}{\partial F_{j}} d τ \\ = \sum_{i = 1}^{T H} \sum_{k = 1}^{T H} \int_{t_{0}}^{t_{f}} \frac{\partial ψ_{i} [h (t); F; t]}{\partial h_{k} (t)} v_{k} (t) d t \int_{t}^{t_{f}} a_{i}^{(1)} (τ) \frac{\partial φ_{i} (F; τ)}{\partial F_{j}} d τ . \end{array}$

(85)
(ii): The sixth (last) term on the right side of Equation (78) is recast by using “integration by parts”, as above, to obtain the following relation:

$\begin{array}{l} \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} a_{i}^{(1)} (t) φ_{i} (F; t) d t \int_{t_{0}}^{t} \sum_{k = 1}^{T H} [\frac{\partial^{2} ψ_{i} [h (τ); F; τ]}{\partial h_{k} (τ) \partial F_{j}} v_{k} (τ)] d τ \\ = \sum_{i = 1}^{T H} \sum_{k = 1}^{T H} \int_{t_{0}}^{t_{f}} \frac{\partial^{2} ψ_{i} [h (t); F; t]}{\partial h_{k} (t) \partial F_{j}} v_{k} (t) d t \int_{t}^{t_{f}} a_{i}^{(1)} (τ) φ_{i} (F; τ) d τ . \end{array}$

(86)

Using in Equation (78) the results obtained in Equations (85) and (86) yields the following expression for the indirect-effect term for

j = 1, \dots, T F

:

\begin{array}{l} δ {(\frac{\partial R}{\partial F_{j}})}_{i n d} ≜ \sum_{k = 1}^{T H} \int_{t_{0}}^{t_{f}} v_{k} (t) d t \{\frac{\partial^{2} D [h (t); F (θ); t]}{\partial h_{k} (t) \partial F_{j}} \\ + \sum_{i = 1}^{T H} \frac{\partial ψ_{i} [h (t); F; t]}{\partial h_{k} (t)} \int_{t}^{t_{f}} a_{i}^{(1)} (τ) \frac{\partial φ_{i} (F; τ)}{\partial F_{j}} d τ + \sum_{i = 1}^{T H} \frac{\partial^{2} ψ_{i} [h (t); F; t]}{\partial h_{k} (t) \partial F_{j}} \int_{t}^{t_{f}} a_{i}^{(1)} (τ) φ_{i} (F; τ) d τ\} \\ + \sum_{k = 1}^{T H} \int_{t_{0}}^{t_{f}} δ a_{k}^{(1)} (t) \{\frac{\partial g_{k} (F; t)}{\partial F_{j}} + \frac{\partial φ_{k} (F; t)}{\partial F_{j}} \int_{t_{0}}^{t} ψ_{k} [h (τ); F; τ] d τ + φ_{k} (F; t) \int_{t_{0}}^{t} \frac{\partial ψ_{k}}{\partial F_{j}} d τ\} d t . \end{array}

(87)

The left side of Equation (83) is now recast in the form of the inner product

{〈v^{(2)} (t), []〉}_{2}

by performing the following operations:

(i): The second term on the left side of Equation (83) is rearranged by using “integration by parts”, as follows:

$\begin{array}{l} \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} a_{1, i}^{(2)} (t; j) φ_{i} (F; t) d t \sum_{k = 1}^{T H} \int_{t_{0}}^{t} \frac{\partial ψ_{i} [h (τ); F; τ]}{\partial h_{k} (τ)} v_{k}^{(1)} (τ) d τ \\ = \sum_{k = 1}^{T H} \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} v_{k} (t) \frac{\partial ψ_{i} [h (t); F; t]}{\partial h_{k} (t)} d t \int_{t}^{t_{f}} a_{1, i}^{(2)} (τ; j) φ_{i} (F; τ) d τ . \end{array}$

(88)
(ii): The fourth term on the left side of Equation (83) is rearranged by using “integration by parts”, as follows:

$\begin{array}{l} \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} a_{2, i}^{(2)} (t; j) \sum_{k = 1}^{T H} \frac{\partial ψ_{k} [h (t); F; t]}{\partial h_{i} (t)} d t \int_{t}^{t_{f}} δ a_{k}^{(1)} (τ) φ_{k} (F; τ) d τ \\ = {\sum_{i = 1}^{T H} \sum_{k = 1}^{T H} \{[\int_{t_{0}}^{t} a_{2, i}^{(2)} (t; j) \sum_{k = 1}^{T H} \frac{\partial ψ_{k} [h (t); F; t]}{\partial h_{i} (t)} d t] [\int_{t}^{t_{f}} δ a_{k}^{(1)} (τ) φ_{k} (F; τ) d τ]\}}_{t = t_{f}} \\ - {\sum_{i = 1}^{T H} \sum_{k = 1}^{T H} \{[\int_{t_{0}}^{t} a_{2, i}^{(2)} (t; j) \sum_{k = 1}^{T H} \frac{\partial ψ_{k} [h (t); F; t]}{\partial h_{i} (t)} d t] [\int_{t}^{t_{f}} δ a_{k}^{(1)} (τ) φ_{k} (F; τ) d τ]\}}_{t = t_{0}} \\ - \sum_{i = 1}^{T H} \sum_{k = 1}^{T H} \int_{t_{0}}^{t_{f}} d t [- δ a_{k}^{(1)} (t) φ_{k} (F; t)] [\int_{t_{0}}^{t} a_{2, i}^{(2)} (τ; j) \frac{\partial ψ_{k} [h (τ); F; τ]}{\partial h_{i} (τ)} d τ] \\ = \sum_{k = 1}^{T H} \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} δ a_{k}^{(1)} (t) φ_{k} (F; t) d t \int_{t_{0}}^{t} a_{2, i}^{(2)} (τ; j) \frac{\partial ψ_{k} [h (τ); F; τ]}{\partial h_{i} (τ)} d τ \end{array}$

(89)
(iii): The fifth term on the left side of Equation (83) is rearranged as follows:

$\begin{array}{l} \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} a_{2, i}^{(2)} (t; j) \sum_{k = 1}^{T H} \frac{\partial^{2} D [h (t); F; t]}{\partial h_{k} (t) \partial h_{i} (t)} v_{k}^{(1)} (t) d t \\ = \sum_{k = 1}^{T H} \int_{t_{0}}^{t_{f}} v_{k}^{(1)} (t) [\sum_{i = 1}^{T H} \frac{\partial^{2} D [h (t); F; t]}{\partial h_{k} (t) \partial h_{i} (t)} a_{2, i}^{(2)} (t; j)] d t . \end{array}$

(90)
(iv): The sixth term on the left side of Equation (83) is rearranged as follows:

$\begin{array}{l} \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} a_{2, i}^{(2)} (t) \sum_{k = 1}^{T H} [\sum_{m = 1}^{T H} \frac{\partial^{2} ψ_{k} [h (t); F; t]}{\partial h_{m} (t) \partial h_{i} (t)} v_{m}^{(1)} (t)] d t \int_{t}^{t_{f}} a_{k}^{(1)} (τ) φ_{k} (F; τ) d τ \\ = \sum_{m = 1}^{T H} \sum_{i = 1}^{T H} \sum_{k = 1}^{T H} \int_{t_{0}}^{t_{f}} v_{m}^{(1)} (t) [\frac{\partial^{2} ψ_{k} [h (t); F; t]}{\partial h_{m} (t) \partial h_{i} (t)} a_{2, i}^{(2)} (t) \int_{t}^{t_{f}} a_{k}^{(1)} (τ) φ_{k} (F; τ) d τ] d t \\ = \sum_{k = 1}^{T H} \int_{t_{0}}^{t_{f}} v_{k}^{(1)} (t) \sum_{i = 1}^{T H} \sum_{n = 1}^{T H} [\frac{\partial^{2} ψ_{n} [h (t); F; t]}{\partial h_{k} (t) \partial h_{i} (t)} a_{2, i}^{(2)} (t) \int_{t}^{t_{f}} a_{n}^{(1)} (τ) φ_{n} (F; τ) d τ] d t \end{array}$

(91)

Inserting the results obtained in Equations (88)–(91) into the left side of Equation (83) yields the following relation:

\begin{array}{l} Q^{(2)} = \sum_{k = 1}^{T H} \int_{t_{0}}^{t_{f}} d t v_{k}^{(1)} (t) \{a_{1, k}^{(2)} (t; j) - \sum_{i = 1}^{T H} \frac{\partial ψ_{i} [h (t); F; t]}{\partial h_{k} (t)} \int_{t}^{t_{f}} a_{1, i}^{(2)} (τ; j) φ_{i} (F; τ) d τ \\ - \sum_{i = 1}^{T H} \frac{\partial^{2} D [h (t); F; t]}{\partial h_{k} (t) \partial h_{i} (t)} a_{2, i}^{(2)} (t; j) - \sum_{i = 1}^{T H} \sum_{n = 1}^{T H} \frac{\partial^{2} ψ_{n} [h (t); F; t]}{\partial h_{k} (t) \partial h_{i} (t)} a_{2, i}^{(2)} (t) \int_{t}^{t_{f}} a_{n}^{(1)} (τ) φ_{n} (F; τ) d τ\} \\ + \sum_{k = 1}^{T H} \int_{t_{0}}^{t_{f}} d t δ a_{k}^{(1)} (t) \{a_{2, k}^{(2)} (t; j) - \sum_{i = 1}^{T H} φ_{k} (F; t) d t \int_{t_{0}}^{t} a_{2, i}^{(2)} (τ; j) \frac{\partial ψ_{k} [h (τ); F; τ]}{\partial h_{i} (τ)} d τ\} . \end{array}

(92)

The right side of Equation (92) can now be required to represent the indirect-effect term defined in Equation (87) by imposing the requirement that the hitherto arbitrary function

a^{(2)} (t; j) = {[a_{1}^{(2)} (t; j), a_{2}^{(2)} (t; j)]}^{†}

be the solution of the following NIE-Volterra equations for

i = 1, \dots, T H; j = 1, \dots, T F

:

\begin{array}{l} a_{1, k}^{(2)} (t; j) - \sum_{i = 1}^{T H} \frac{\partial ψ_{i} [h (t); F; t]}{\partial h_{k} (t)} \int_{t}^{t_{f}} a_{1, i}^{(2)} (τ; j) φ_{i} (F; τ) d τ - \sum_{i = 1}^{T H} \frac{\partial^{2} D [h (t); F; t]}{\partial h_{k} (t) \partial h_{i} (t)} a_{2, i}^{(2)} (t; j) \\ - \sum_{i = 1}^{T H} \sum_{n = 1}^{T H} \frac{\partial^{2} ψ_{n} [h (t); F; t]}{\partial h_{k} (t) \partial h_{i} (t)} a_{2, i}^{(2)} (t) \int_{t}^{t_{f}} a_{n}^{(1)} (τ) φ_{n} (F; τ) d τ = \frac{\partial^{2} D [h (t); F (θ); t]}{\partial h_{k} (t) \partial F_{j}} \\ + \sum_{i = 1}^{T H} \frac{\partial ψ_{i} [h (t); F; t]}{\partial h_{k} (t)} \int_{t}^{t_{f}} a_{i}^{(1)} (τ) \frac{\partial φ_{i} (F; τ)}{\partial F_{j}} d τ + \sum_{i = 1}^{T H} \frac{\partial^{2} ψ_{i} [h (t); F; t]}{\partial h_{k} (t) \partial F_{j}} \int_{t}^{t_{f}} a_{i}^{(1)} (τ) φ_{i} (F; τ) d τ; \end{array}

(93)

\begin{array}{l} a_{2, k}^{(2)} (t; j) - \sum_{i = 1}^{T H} φ_{k} (F; t) d t \int_{t_{0}}^{t} a_{2, i}^{(2)} (τ; j) \frac{\partial ψ_{k} [h (τ); F; τ]}{\partial h_{i} (τ)} d τ \\ = \frac{\partial g_{k} (F; t)}{\partial F_{j}} + \frac{\partial φ_{k} (F; t)}{\partial F_{j}} \int_{t_{0}}^{t} ψ_{k} [h (τ); F; τ] d τ + φ_{k} (F; t) \int_{t_{0}}^{t} \frac{\partial ψ_{k}}{\partial F_{j}} d τ . \end{array}

(94)

It follows from Equations (92)–(94) that the indirect-effect term

δ {(\partial R / \partial F_{j})}_{i n d}

defined by Equation (78) or, equivalently, Equation (87) can be expressed in terms of the function

a^{(2)} (t; j) = {[a_{1}^{(2)} (t; j), a_{2}^{(2)} (t; j)]}^{†}

as follows for

j = 1, \dots, T F

:

\begin{array}{l} δ {(\frac{\partial R}{\partial F_{j}})}_{i n d} = \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} a_{1, i}^{(2)} (t; j) d t [\sum_{n = 1}^{T F} q_{i n} (F; t) δ F_{n}] + \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} a_{2, i}^{(2)} (t; j) d t [\sum_{n = 1}^{T F} S_{i n} (F; t) δ F_{n}] \\ = \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} a_{1, i}^{(2)} (t; j) d t \sum_{n = 1}^{T F} δ F_{n} \{\frac{\partial g_{i} (F; t)}{\partial F_{n}} + \frac{\partial φ_{i} (F; t)}{\partial F_{n}} \int_{t_{0}}^{t} ψ_{i} [h (τ); F; τ] d τ \\ + φ_{i} (F; t) \int_{t_{0}}^{t} \frac{\partial ψ_{i} [h (τ); F; τ]}{\partial F_{n}} d τ\} + \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} a_{2, i}^{(2)} (t; j) d t \{\sum_{n = 1}^{T F} \frac{\partial^{2} D [h (t); F; t]}{\partial F_{n} \partial h_{i} (t)} δ F_{n} \\ + \sum_{k = 1}^{T H} [\sum_{n = 1}^{T F} \frac{\partial^{2} ψ_{k} [h (t); F; t]}{\partial F_{n} \partial h_{i} (t)} δ F_{n}] \int_{t}^{t_{f}} a_{k}^{(1)} (τ) φ_{k} (F; τ) d τ \\ + \sum_{k = 1}^{T H} \frac{\partial ψ_{k} [h (t); F; t]}{\partial h_{i} (t)} \sum_{n = 1}^{T F} \int_{t}^{t_{f}} [a_{k}^{(1)} (τ) \frac{\partial φ_{k} (F; τ)}{\partial F_{n}} δ F_{n}] d τ\} . \end{array}

(95)

The second-order sensitivities

\partial^{2} R [h; F (θ)] / \partial F_{j} \partial F_{n}

of the decoder response with respect to the components of the feature function are obtained by adding the expression of the indirect-effect term obtained in Equation (95) to the expression for the direct-effect term obtained in Equation (77) and subsequently identifying the expressions that multiply the variations

δ F_{n}

. The expressions thus obtained for

\partial^{2} R [h; F (θ)] / \partial F_{j} \partial F_{n}

for

j, n = 1, \dots, T F

are as follows:

\begin{array}{l} \frac{\partial^{2} R [h; F (θ)]}{\partial F_{j} \partial F_{n}} ≜ \frac{\partial^{2} D [h (t); F (θ); t]}{\partial F_{n} \partial F_{j}} + \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} a_{i}^{(1)} (t) \frac{\partial^{2} g_{i} (h; F)}{\partial F_{n} \partial F_{j}} d t \\ + \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} a_{i}^{(1)} (t) \frac{\partial^{2} φ_{i} (F; t)}{\partial F_{n} \partial F_{j}} \int_{t_{0}}^{t} ψ_{i} [h (τ); F; τ] d τ + \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} a_{i}^{(1)} (t) \frac{\partial φ_{i} (F; t)}{\partial F_{j}} \int_{t_{0}}^{t} \frac{\partial ψ_{i} [h (τ); F; τ]}{\partial F_{n}} d τ \\ + \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} a_{i}^{(1)} (t) \frac{\partial φ_{i} (F; t)}{\partial F_{n}} d t \int_{t_{0}}^{t} \frac{\partial ψ_{i} [h (τ); F; τ]}{\partial F_{j}} d τ + \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} a_{i}^{(1)} (t) φ_{i} (F; t) d t \frac{\partial^{2} ψ_{i} [h (τ); F; τ]}{\partial F_{n} \partial F_{j}} \\ + \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} a_{1, i}^{(2)} (t; j) d t \{\frac{\partial g_{i} (F; t)}{\partial F_{n}} + \frac{\partial φ_{i} (F; t)}{\partial F_{n}} \int_{t_{0}}^{t} ψ_{i} [h (τ); F; τ] d τ \\ + φ_{i} (F; t) \int_{t_{0}}^{t} \frac{\partial ψ_{i} [h (τ); F; τ]}{\partial F_{n}} d τ\} + \sum_{i = 1}^{T H} \int_{t_{0}}^{t_{f}} a_{2, i}^{(2)} (t; j) d t \{\frac{\partial^{2} D [h (t); F; t]}{\partial F_{n} \partial h_{i} (t)} \\ + \sum_{k = 1}^{T H} \frac{\partial^{2} ψ_{k} [h (t); F; t]}{\partial F_{n} \partial h_{i} (t)} \int_{t}^{t_{f}} a_{k}^{(1)} (τ) φ_{k} (F; τ) d τ + \sum_{k = 1}^{T H} \frac{\partial ψ_{k} [h (t); F; t]}{\partial h_{i} (t)} \int_{t}^{t_{f}} a_{k}^{(1)} (τ) \frac{\partial φ_{k} (F; τ)}{\partial F_{n}} d τ\} . \end{array}

(96)

The NIE-Volterra system presented in Equations (93) and (94) is called the “2nd-Level Adjoint Sensitivity System (2nd-LASS)” and its solution,

a^{(2)} (j; t) = {[a_{1}^{(2)} (j; t), a_{2}^{(2)} (j; t)]}^{†}

,

j = 1, \dots, T F

, is called the “2nd-level adjoint sensitivity function”. Because the sources on the right sides of Equations (93) and (94) stem from the first-order sensitivities

\partial R [h; F (θ)] / \partial F_{j}

,

j = 1, \dots, T F

, they are dependent on the index “j”, which implies that for each first-order sensitivity

\partial R [h; F (θ)] / \partial F_{j}

, there will correspond a distinct 2nd-LASS with a distinct solution

a^{(2)} (j; t) = {[a_{1}^{(2)} (j; t), a_{2}^{(2)} (j; t)]}^{†}

, a fact that has been emphasized by using the index “j” in the list of arguments of this second-level adjoint sensitivity function. Therefore, there will be as many second-level adjoint functions as there are distinct first-order sensitivities

\partial R [h; F (θ)] / \partial F_{j}

, which is equivalent to the number of components

F_{j}

of the “feature-function”

F (θ)

. Notably, the integral operators on the left sides of Equations (93) and (94) do not depend on the index “j”, which means that the same left-hand side needs to be inverted for computing the second-level adjoint function, regardless of the source term on the right side (which corresponds to the particular component of the feature function) of Equations (93) and (94). Therefore, if the inverses of the operators appearing on the left sides of Equations (93) and (94) could be stored, they would not need to be inverted repeatedly, so the various second-level adjoint functions would be computed most efficiently.

The second-order sensitivities of the decoder response with respect to the optimal weights/parameters

θ_{k}, k = 1, \dots, T W

are obtained analytically by using the chain rule in conjunction with the expressions obtained in Equations (96) and (18), as follows:

\frac{\partial^{2} R [F (θ)]}{\partial θ_{k} \partial θ_{j}} = \frac{\partial}{\partial θ_{k}} \{\sum_{i = 1}^{T F} \frac{\partial R [F (θ)]}{\partial F_{i} (θ)} \frac{\partial F_{i} (θ)}{\partial θ_{j}}\}, j, k = 1, \dots, T W .

(97)

When there are no feature functions but only individual model parameters, i.e., when

F_{i} (θ) \equiv θ_{i}

for all

i = 1, \dots, T F ≜ T W

, the expression obtained in Equation (96) yields directly the second-order sensitivities

\partial^{2} R / \partial θ_{i} \partial θ_{j}

for all

i, j = 1, \dots, T W

. In this case, the 2nd-LASS would need to be solved

T W

times rather than just

T F

times

(T F < T W)

when

T F

feature functions can be constructed.

5. Illustrative Application of the 2nd-FASAM-NIE-V to Neutron Slowing Down in an Infinite Homogeneous Hydrogenous Medium

The 2nd-FASAM-NIE-V methodology developed in Section 4 will be applied in this section to the illustrative model (considered in Section 3) that describes the energy distribution of neutrons in a homogeneous hydrogenous medium. As discussed in Section 4, the second-order sensitivities of the decoder response,

R [C (E)]

, with respect to the feature functions

F (θ)

,

Σ_{d} (θ)

, and S, will be determined by considering the second-order sensitivities to be “the first-order sensitivities of the first-order sensitivities”. Thus, the first-order sensitivities obtained in Equations (67)–(69) will play the role of “responses” in the application of the 2nd-FASAM-NIE-V methodology.

5.1. Computation of Second-Order Sensitivities Stemming from $\partial R / \partial F (θ)$

The second-order sensitivities stemming from the first-order sensitivity

\partial R / \partial F (θ)

will be determined from the expression of the first-order G-variation

δ \{\partial R / \partial F (θ)\}

, which is by definition obtained from Equation (67), as shown below:

\begin{array}{l} δ \{\frac{\partial R}{\partial F (θ)}\} = \frac{d}{d ε} {\{\frac{S^{0} + ε δ S}{E_{s}} \int_{E_{l}}^{E_{s}} [a^{(1, 0)} (E) + ε δ a^{(1)} (E)] {(\frac{E_{s}}{E})}^{[F (θ^{0}) + ε δ F (θ)]} d E\}}_{ε = 0} \\ = δ {\{\frac{\partial R}{\partial F (θ)}\}}_{d i r} + δ {\{\frac{\partial R}{\partial F (θ)}\}}_{i n d}, \end{array}

(98)

where the direct-effect term

δ {\{\partial R / \partial F (θ)\}}_{d i r}

is defined as follows:

\begin{array}{l} δ {\{\frac{\partial R}{\partial F (θ)}\}}_{d i r} ≜ {\{\frac{(δ S)}{E_{s}} \int_{E_{l}}^{E_{s}} a^{(1)} (E) {(\frac{E_{s}}{E})}^{F (θ)} d E\}}_{θ = θ^{0}} \\ + {\{[δ F (θ)] \frac{S}{E_{s}} \int_{E_{l}}^{E_{s}} a^{(1)} (E) {(\frac{E_{s}}{E})}^{F (θ)} \ln (\frac{E_{s}}{E}) d E\}}_{θ = θ^{0}}, \end{array}

(99)

while the indirect-effect term

δ {\{\partial R / \partial F (θ)\}}_{i n d}

is defined as follows:

δ {\{\frac{\partial R}{\partial F (θ)}\}}_{i n d} ≜ {\{\frac{S}{E_{s}} \int_{E_{l}}^{E_{s}} δ a^{(1)} (E) {(\frac{E_{s}}{E})}^{F (θ)} d E\}}_{θ = θ^{0}} .

(100)

The direct-effect term can be computed at this time. On the other hand, the indirect-effect term can be computed after determining the variational function

δ a^{(1)} (E)

, which is the solution of the G-differentiated 1st-LASS defined in Equation (50). By definition, the G-differential of Equation (50) is provided by the following equation:

\begin{array}{l} \frac{d}{d ε} {\{[a^{(1, 0)} (E) + ε δ a^{(1)} (E)]\}}_{ε = 0} - \frac{d}{d ε} {\{\frac{F (θ^{0}) + ε δ F (θ)}{E} \int_{E_{l}}^{E} [a^{(1, 0)} (e) + ε δ a^{(1)} (e)] d e\}}_{ε = 0} \\ = \frac{d}{d ε} {\{Σ_{d} (θ^{0}) + ε δ Σ_{d} (θ)\}}_{ε = 0} . \end{array}

(101)

Performing the operations indicated in Equation (101) yields the following equation to be satisfied by the function

δ a^{(1)} (E)

at the nominal parameter values (the superscript “zero” will be omitted for notational simplicity):

δ a^{(1)} (E) - \frac{F (θ)}{E} \int_{E_{l}}^{E} δ a^{(1)} (e) d e = \frac{δ F (θ)}{E} \int_{E_{l}}^{E} a^{(1)} (e) d e + δ Σ_{d} (θ) .

(102)

Evidently, Equation (102) would need to be solved repeatedly for every parameter variation in order to compute the function

δ a^{(1)} (E)

that would correspond to the respective parameter variation. As was shown in Section 4, the need for repeatedly solving Equation (102) can be avoided by deriving an alternative expression for the indirect-effect term that does not involve

δ a^{(1)} (E)

. This alternative expression is derived by introducing the 2nd-LASS, the solution of which would replace the function

δ a^{(1)} (E)

in the alternative expression for the indirect-effect term. Notably, Equation (102) is independent of variations in the forward function,

δ C (E)

; therefore, the second-level adjoint sensitivity function will comprise a single component, denoted as

a^{(2)} (E; 1)

, and the 2nd-LASS will comprise a single equation for this component.

Following the principles of the 2nd-FASAM-NIE-V, we use Equation (45) to construct the inner product of Equation (102) with a function

a^{(2)} (E; 1)

, where the argument “1” indicates that this second-level adjoint sensitivity function will correspond to the first-order sensitivity

\partial R [C (E)] / \partial F (θ)

, to obtain the following relation:

\begin{array}{l} \int_{E_{l}}^{E_{s}} a^{(2)} (E; 1) δ a^{(1)} (E) d E - F (θ) \int_{E_{l}}^{E_{s}} a^{(2)} (E; 1) \frac{d E}{E} \int_{E_{l}}^{E} δ a^{(1)} (e) d e \\ = δ F (θ) \int_{E_{l}}^{E_{s}} a^{(2)} (E; 1) \frac{d E}{E} \int_{E_{l}}^{E} a^{(1)} (e) d e + δ Σ_{d} (θ) \int_{E_{l}}^{E_{s}} a^{(2)} (E; 1) d E . \end{array}

(103)

The function

a^{(2)} (E; 1)

will be determined by requiring the left side of Equation (103) to represent the indirect-effect term defined in Equation (100). For this purpose, the left side will be recast using “integration by parts” into the following form:

\begin{array}{l} \int_{E_{l}}^{E_{s}} a^{(2)} (E; 1) δ a^{(1)} (E) d E - F (θ) \int_{E_{l}}^{E_{s}} a^{(2)} (E; 1) \frac{d E}{E} \int_{E_{l}}^{E} δ a^{(1)} (e) d e \\ = \int_{E_{l}}^{E_{s}} δ a^{(1)} (E) [a^{(2)} (E; 1) - F (θ) \int_{E}^{E_{s}} a^{(2)} (e; 1) \frac{d e}{e}] d E . \end{array}

(104)

Requiring the right side of Equation (104) to represent the indirect-effect term defined in Equation (100) yields the following Volterra-type 2nd-LASS to be satisfied by the second-level adjoint sensitivity function

a^{(2)} (E; 1)

:

a^{(2)} (E; 1) - F (θ) \int_{E}^{E_{s}} a^{(2)} (e; 1) \frac{d e}{e} = \frac{S}{E_{s}} {(\frac{E_{s}}{E})}^{F (θ)} .

(105)

The above 2nd-LASS is to be satisfied at the nominal parameter values

θ^{0}

. Notably, Equation (105) is a “final-value problem” as the computation of

a^{(2)} (E; 1)

commences at the highest energy, where

a^{(2)} (E_{s}; 1) = S / E_{s}

, and proceeds towards the lowest energy value,

E_{l}

. For subsequent verification purposes, the closed-form explicit expression of

a^{(2)} (E; 1)

obtained by solving Equation (105) is as follows:

a^{(2)} (E; 1) = E^{- F (θ)} [- S F (θ) E_{s}^{F (θ) - 1} \ln E + S F (θ) E_{s}^{F (θ) - 1} \ln E_{s} + S E_{s}^{F (θ) - 1}] .

(106)

It follows from Equations (103)–(105) that the indirect-effect term

δ {\{\partial R / \partial F (θ)\}}_{i n d}

is given by the following expression involving the second-level adjoint sensitivity function

a^{(2)} (E; 1)

:

δ {\{\frac{\partial R}{\partial F (θ)}\}}_{i n d} = δ F (θ) \int_{E_{l}}^{E_{s}} a^{(2)} (E; 1) \frac{d E}{E} \int_{E_{l}}^{E} a^{(1)} (e) d e + δ Σ_{d} (θ) \int_{E_{l}}^{E_{s}} a^{(2)} (E; 1) d E .

(107)

Adding the expression for the indirect-effect term

δ {\{\partial R / \partial F (θ)\}}_{i n d}

obtained in Equation (107) to the expression for the direct-effect term

δ {\{\partial R / \partial F (θ)\}}_{d i r}

obtained in Equation (99) yields the following expression for the G-differential

δ \{\partial R / \partial F (θ)\}

:

\begin{array}{l} δ \{\frac{\partial R}{\partial F (θ)}\} = δ F (θ) \int_{E_{l}}^{E_{s}} a^{(2)} (E; 1) \frac{d E}{E} \int_{E_{l}}^{E} a^{(1)} (e) d e + δ Σ_{d} (θ) \int_{E_{l}}^{E_{s}} a^{(2)} (E; 1) d E \\ + \frac{(δ S)}{E_{s}} \int_{E_{l}}^{E_{s}} a^{(1)} (E) {(\frac{E_{s}}{E})}^{F (θ)} d E + [δ F (θ)] \frac{S}{E_{s}} \int_{E_{l}}^{E_{s}} a^{(1)} (E) {(\frac{E_{s}}{E})}^{F (θ)} \ln (\frac{E_{s}}{E}) d E . \end{array}

(108)

The above expression is to be evaluated at the nominal parameter values

θ^{0}

.

It follows from the expression obtained in Equation (108) that the second-order sensitivities (of the decoder response with respect to the feature functions) that stem from the first-order sensitivity

\partial R [C (E)] / \partial F (θ)

have the following expressions to be evaluated at the nominal parameter values

θ^{0}

:

\frac{\partial^{2} R}{\partial^{2} F (θ)} = \int_{E_{l}}^{E_{s}} a^{(2)} (E; 1) \frac{d E}{E} \int_{E_{l}}^{E} a^{(1)} (e) d e + \frac{S}{E_{s}} \int_{E_{l}}^{E_{s}} a^{(1)} (E) {(\frac{E_{s}}{E})}^{F (θ)} \ln (\frac{E_{s}}{E}) d E;

(109)

\frac{\partial^{2} R}{\partial Σ_{d} (θ) \partial F (θ)} = \frac{1}{E_{s}} \int_{E_{l}}^{E_{s}} a^{(2)} (E; 1) d E;

(110)

\frac{\partial^{2} R}{\partial S \partial F (θ)} = \frac{1}{E_{s}} \int_{E_{l}}^{E_{s}} a^{(1)} (E) {(\frac{E_{s}}{E})}^{F (θ)} d E .

(111)

Because the first-level adjoint sensitivity function

a^{(1)} (E)

is already available, the sensitivity

\partial^{2} R / \partial S \partial F (θ)

can be computed. The closed-form explicit expressions for the above second-order sensitivities are obtained by inserting the expression obtained in Equation (106) for

a^{(2)} (E; 1)

and the expression obtained in Equation (51) for

a^{(1)} (E)

and performing the respective integrations. Carrying out these operations yields the following expressions:

\begin{array}{l} \frac{\partial^{2} R}{\partial F (θ) \partial F (θ)} = [S Σ_{d} (θ)] \{- 2 {[1 - F (θ)]}^{- 3} [1 - {(\frac{E_{s}}{E_{l}})}^{F (θ) - 1}] \\ - 2 {[1 - F (θ)]}^{- 2} {(\frac{E_{s}}{E_{l}})}^{F (θ) - 1} \ln (\frac{E_{s}}{E_{l}}) - \frac{F (θ)}{1 - F (θ)} {(\frac{E_{s}}{E_{l}})}^{F (θ) - 1} {[\ln (\frac{E_{s}}{E_{l}})]}^{2}\}; \end{array}

(112)

\frac{\partial^{2} R}{\partial Σ_{d} (θ) \partial F (θ)} = \frac{S}{1 - F (θ)} \{\frac{1}{1 - F (θ)} [1 - {(\frac{E_{s}}{E_{l}})}^{F (θ) - 1}] - F (θ) {(\frac{E_{s}}{E_{l}})}^{F (θ) - 1} \ln (\frac{E_{s}}{E_{l}})\};

(113)

\frac{\partial^{2} R}{\partial S \partial F (θ)} = \frac{Σ_{d} (θ)}{1 - F (θ)} \{\frac{1}{1 - F (θ)} [1 - {(\frac{E_{s}}{E_{l}})}^{F (θ) - 1}] - F (θ) {(\frac{E_{s}}{E_{l}})}^{F (θ) - 1} \ln (\frac{E_{s}}{E_{l}})\} .

(114)

5.2. Computation of Second-Order Sensitivities Stemming from $\partial R / \partial Σ_{d} (θ)$

The second-order sensitivities stemming from the first-order sensitivity

\partial R / \partial Σ_{d} (θ)

will be determined from the expression of the first-order G-variation

δ \{\partial R / \partial Σ_{d} (θ)\}

, which is by definition obtained from Equation (68), as shown below:

δ \{\frac{\partial R}{\partial Σ_{d} (θ)}\} = \frac{d}{d ε} {\{\int_{E_{l}}^{E_{s}} [C^{0} (E) + ε δ C (E)] d E\}}_{ε = 0} = \int_{E_{l}}^{E_{s}} δ C (E) d E .

(115)

Comparing Equation (115) with Equation (35), it becomes apparent that the following relation holds:

δ \{\partial R / \partial Σ_{d} (θ)\} = {\{δ R (C; θ; δ C)\}}_{i n d} / Σ_{d} (θ) .

(116)

Replacing the expression obtained for

{\{δ R (C; θ; δ C)\}}_{i n d}

in Equation (65) in Equation (116) yields the following expression:

δ \{\frac{\partial R}{\partial Σ_{d} (θ)}\} = \frac{F (δ S)}{E_{s} Σ_{d}} \int_{E_{l}}^{E_{s}} a^{(1)} (E) d E + \frac{S (δ F)}{E_{s} Σ_{d}} \int_{E_{l}}^{E_{s}} a^{(1)} (E) d E {(\frac{E_{s}}{E})}^{F (θ)} .

(117)

It follows from Equation (117) that the second-order sensitivities that stem from

\partial R / \partial Σ_{d}

have the following expressions:

\frac{\partial^{2} R}{\partial F (θ) \partial Σ_{d} (θ)} = \frac{S}{E_{s} Σ_{d} (θ)} \int_{E_{l}}^{E_{s}} a^{(1)} (E) d E {(\frac{E_{s}}{E})}^{F (θ)};

(118)

\frac{\partial^{2} R}{δ Σ_{d} (θ) \partial Σ_{d} (θ)} = 0;

(119)

\frac{\partial^{2} R}{δ S \partial Σ_{d} (θ)} = \frac{F (θ)}{E_{s} Σ_{d} (θ)} \int_{E_{l}}^{E_{s}} a^{(1)} (E) d E .

(120)

The explicit closed-form expressions of the above second-order sensitivities are obtained by replacing the expression of

a^{(1)} (E)

in Equations (118) and (120), respectively, and performing the respective integrations to obtain

\frac{\partial^{2} R}{\partial F (θ) \partial Σ_{d} (θ)} = \frac{S}{1 - F (θ)} \{\frac{1}{1 - F (θ)} [1 - {(\frac{E_{s}}{E_{l}})}^{F (θ) - 1}] - F (θ) {(\frac{E_{s}}{E_{l}})}^{F (θ) - 1} \ln (\frac{E_{s}}{E_{l}})\};

(121)

\frac{\partial^{2} R}{δ S \partial Σ_{d} (θ)} = \frac{F (θ)}{1 - F (θ)} [1 - {(\frac{E_{s}}{E_{l}})}^{F (θ) - 1}]

(122)

The expression for

\partial^{2} R / \partial F (θ) \partial Σ_{d} (θ)

in Equation (118) must be equivalent to the expression for

\partial^{2} R / \partial Σ_{d} (θ) \partial F (θ)

in Equation (110). Notably, therefore, these mixed second-order sensitivities are computed twice using distinct expressions involving distinct adjoint functions, which provides an intrinsic mechanism for the stringent verification of the accuracy of the computations used for obtaining the numerical values of the respective adjoint functions.

5.3. Computation of Second-Order Sensitivities Stemming from $\partial R / \partial S$

The second-order sensitivities stemming from the first-order sensitivity

\partial R / \partial S

will be determined from the expression of the first-order G-variation

δ \{\partial R / \partial S\}

, which is by definition obtained from Equation (69), as shown below:

\begin{array}{l} δ \{\frac{\partial R}{\partial S}\} = \frac{d}{d ε} {\{\frac{[F (θ^{0}) + ε δ F (θ)]}{E_{s}} \int_{E_{l}}^{E_{s}} [a^{(1, 0)} (E) + ε δ a^{(1)} (E)] d E\}}_{ε = 0} \\ = δ {\{\frac{\partial R}{\partial S}\}}_{d i r} + δ {\{\frac{\partial R}{\partial S}\}}_{i n d}, \end{array}

(123)

where the direct-effect term

δ {\{\partial R / \partial S\}}_{d i r}

is defined as follows:

δ {\{\frac{\partial R}{\partial S}\}}_{d i r} ≜ {\{\frac{δ F (θ)}{E_{s}} \int_{E_{l}}^{E_{s}} a^{(1)} (E) d E\}}_{θ = θ^{0}},

(124)

while the indirect-effect term

δ {\{\partial R / \partial S\}}_{i n d}

is defined as follows:

δ {\{\frac{\partial R}{\partial S}\}}_{i n d} ≜ {\{\frac{F (θ)}{E_{s}} \int_{E_{l}}^{E_{s}} δ a^{(1)} (E) d E\}}_{θ = θ^{0}}

(125)

The direct-effect term

δ {\{\partial R / \partial S\}}_{d i r}

can be computed at this time. To determine the indirect-effect term

δ {\{\partial R / \partial S\}}_{i n d}

without needing to compute the variational function

δ a^{(1)} (E)

by solving repeatedly Equation (102), the steps outlined in Section 5.1 are applied using a second-order adjoint sensitivity function denoted as

a^{(2)} (E; 3)

, where the argument “3” indicates that this function will correspond to the first-order sensitivity

\partial R / \partial S

. Thus, following the conceptual steps outlined in Equations (103)–(107) but for the function

a^{(2)} (E; 3)

as the counterpart of the function

a^{(2)} (E; 1)

and for the indirect-effect term

δ {\{\partial R / \partial S\}}_{i n d}

as the counterpart of the indirect-effect term

δ {\{\partial R / \partial F (θ)\}}_{i n d}

leads to the following expression for the indirect-effect term

δ {\{\partial R / \partial S\}}_{i n d}

:

δ {\{\frac{\partial R}{\partial S}\}}_{i n d} ≜ {\{δ F (θ) \int_{E_{l}}^{E_{s}} a^{(2)} (E; 3) \frac{d E}{E} \int_{E_{l}}^{E} a^{(1)} (e) d e + δ Σ_{d} (θ) \int_{E_{l}}^{E_{s}} a^{(2)} (E; 3) d E\}}_{θ = θ^{0}}

(126)

where the second-order adjoint sensitivity function

a^{(2)} (E; 3)

is the solution of the following Volterra-type 2nd-LASS:

a^{(2)} (E; 3) - F (θ) \int_{E}^{E_{s}} a^{(2)} (e; 3) \frac{d e}{e} = \frac{F (θ)}{E_{s}} .

(127)

For verification purposes, the explicit closed-form expression of the solution of Equation (127) is provided below.

a^{(2)} (E; 3) = \frac{F (θ)}{E_{s}} {(\frac{E_{s}}{E})}^{F (θ)} .

(128)

Adding the expressions obtained in Equations (126) and (124) yields the following expression for the first G-variation

δ \{\partial R / \partial S\}

:

\begin{array}{l} δ \{\frac{\partial R}{\partial S}\} = \frac{δ F (θ)}{E_{s}} \int_{E_{l}}^{E_{s}} a^{(1)} (E) d E + δ F (θ) \int_{E_{l}}^{E_{s}} a^{(2)} (E; 3) \frac{d E}{E} \int_{E_{l}}^{E} a^{(1)} (e) d e \\ + δ Σ_{d} (θ) \int_{E_{l}}^{E_{s}} a^{(2)} (E; 3) d E . \end{array}

(129)

It follows from Equation (129) that the second-order sensitivities stemming from the first-order sensitivity

\partial R / \partial S

have the following expressions:

\frac{\partial^{2} R}{\partial F (θ) \partial S} = \frac{1}{E_{s}} \int_{E_{l}}^{E_{s}} a^{(1)} (E) d E + \int_{E_{l}}^{E_{s}} a^{(2)} (E; 3) \frac{d E}{E} \int_{E_{l}}^{E} a^{(1)} (e) d e

(130)

\frac{\partial^{2} R}{\partial Σ_{d} (θ) \partial S} = \int_{E_{l}}^{E_{s}} a^{(2)} (E; 3) d E .

(131)

\frac{\partial^{2} R}{\partial S \partial S} = 0

(132)

Inserting the expressions of

a^{(2)} (E; 3)

and

a^{(1)} (E)

, respectively, into Equations (130) and (131) and performing the respective integrations yields the following closed-form explicit expressions for the respective second-order sensitivities:

\frac{\partial R}{\partial F (θ) \partial S} = \frac{Σ_{d} (θ)}{1 - F (θ)} \{\frac{1}{1 - F (θ)} [1 - {(\frac{E_{s}}{E_{l}})}^{F (θ) - 1}] - F (θ) {(\frac{E_{s}}{E_{l}})}^{F (θ) - 1} \ln (\frac{E_{s}}{E_{l}})\};

(133)

\frac{\partial^{2} R}{\partial Σ_{d} (θ) \partial S} = \frac{F (θ)}{1 - F (θ)} [1 - {(\frac{E_{s}}{E})}^{F (θ) - 1}]

(134)

The expression for

\partial^{2} R / \partial S F (θ)

in Equation (111) must be equivalent to the expression for

\partial^{2} R / \partial F (θ) \partial S

in Equation (130). The expression for

\partial^{2} R / \partial S \partial Σ_{d} (θ)

in Equation (122) must be equivalent to the expression for

\partial^{2} R / \partial Σ_{d} (θ) \partial S

in Equation (131). The equivalences of these corresponding expressions provide stringent verification criteria for the accuracy of the computation of the respective adjoint functions.

5.4. Discussion: Direct Computation of Second-Order Sensitivities Versus Their Indirect Computation via Feature Functions

Notably, the nine s-order sensitivities of the decoder response with respect to the three feature functions,

F (θ)

,

Σ_{d} (θ)

, and

S

were computed using three adjoint computations; each of these adjoint computations corresponds to one of the three first-order sensitivities of the decoder response with respect to the feature functions. Only six of these nine s-order sensitivities have distinct values; the mixed sensitivities were computed twice using different adjoint functions, thus providing stringent verification criteria for the numerical computation of these functions. The second-order sensitivities of the decoder response with respect to the primary model parameters are obtained by applying the “chain-rule of differentiation” provided in Equation (97) to the second-order sensitivities with respect to the feature functions.

On the other hand, the computation of the second-order sensitivities of the decoder response directly with respect to the primary model parameters would be performed by treating each of the first-order sensitivities defined in Equations (54)–(57) as a “decoder/model response”. A shown in Section 3.1, there would be

T W + 1

first-order sensitivities in this case, which means that there would be

T W + 1

“2nd-level adjoint sensitivity systems” to be solved, each one having a source term that would correspond to one of the

T W + 1

first-order sensitivities. Evidently, it is considerably more advantageous computationally to consider compute the second-order sensitivities via “feature functions”, whenever possible, rather than directly with respect to the primary model parameters.

6. Discussion and Conclusions

This work has introduced the general mathematical framework of the 2nd-FASAM-NIE-V methodology. The acronym “2nd-FASAM-NIE-V” stands for “Second-Order Features Adjoint Sensitivity Analysis Methodology for Neural Integral Equations of Volterra Type”. The 2nd-FASAM-NIE-V encompasses the mathematical framework of the (first-order) 1st-FASAM-NIE-V methodology, which enables the most efficient computation of the exact expressions of all first-order sensitivities with respect to the feature functions and also with respect to the optimal values of the NIE-net’s parameters/weights after the respective NIE-Volterra-net was optimized to represent the underlying physical system. The 1st-FASAM-NIE-V methodology requires a single large-scale computation for determining the first-level adjoint sensitivity function that is subsequently used for computing the sensitivities using conventional numerical quadrature formulas. The 2nd-FASAM-NIE-V requires as many large-scale computations (to solve the 2nd-Level Adjoint Sensitivity System) as there are first-order sensitivities of the decoder response with respect to the feature functions. Subsequently, the second-order sensitivities of the decoder response with respect to the primary model parameters are obtained by applying the “chain-rule of differentiation” to the second-order sensitivities with respect to the feature functions.

The application of the 1st-FASAM-NIE-V and the 2nd-FASAM-NIE-V methodologies has been illustrated by using a well-known model for neutron slowing down in a homogeneous hydrogenous medium. This model has been chosen because the application of the 1st-FASAM-NIE-V and the 2nd-FASAM-NIE-V yields tractable explicit exact expressions for all quantities of interest, including the various adjoint sensitivity functions and first- and second-order sensitivities of the decoder response with respect to all feature functions and also with respect to the primary model parameters. This illustrative application highlights the unsurpassed efficiency of the 1st-FASAM-NIE-V and the 2nd-FASAM-NIE-V for second-order sensitivity analysis of NIE-Volterra nets. Ongoing research aims at developing the “Second-Order Features Adjoint Sensitivity Analysis Methodology for Neural Integro-Differential Equations”, which will enable, in premiere, the exact computations of second-order sensitivities of decoder responses with respect to optimized weights/parameters based on the NIDE-models introduced in [22].

Funding

This research received no external funding.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The author declares no conflicts of interest.

References

Chen, R.T.Q.; Rubanova, Y.; Bettencourt, J.; Duvenaud, D.K. Neural ordinary differential equations. In Advances in Neural Information Processing Systems; Curran Associates, Inc.: New York, NY, USA, 2018; Volume 31, pp. 6571–6583. [Google Scholar] [CrossRef]
Ruthotto, L.; Haber, E. Deep neural networks motivated by partial differential equations. J. Math. Imaging Vis. 2018, 62, 352–364. [Google Scholar]
Lu, Y.; Zhong, A.; Li, Q.; Dong, B. Beyond finite layer neural networks: Bridging deep architectures and numerical differential equations. In Proceedings of the International Conference on Machine Learning, Stockholm, Sweden, 10–15 July 2018; pp. 3276–3285. [Google Scholar]
Dupont, E.; Doucet, A.; The, Y.W. Augmented neural odes. In Proceedings of the Advances in Neural Information Processing Systems, Vancouver, BC, Canada, 8–14 December 2019; Volume 32, pp. 14–15. [Google Scholar]
Grathwohl, W.; Chen, R.T.Q.; Bettencourt, J.; Sutskever, I.; Duvenaud, D. Ffjord: Free-form continuous dynamics for scalable reversible generative models. In Proceedings of the International Conference on Learning Representations, New Orleans, LA, USA, 6–9 May 2019. [Google Scholar]
Zhong, Y.D.; Dey, B.; Chakraborty, A. Symplectic ode-net: Learning Hamiltonian dynamics with control. In Proceedings of the International Conference on Learning Representations, Addis Ababa, Ethiopia, 30 April 2020. [Google Scholar]
Kidger, P.; Morrill, J.; Foster, J.; Lyons, T. Neural controlled differential equations for irregular time series. In Proceedings of the Advances in Neural Information Processing Systems, Virtual, 6–12 December 2020; Volume 33, pp. 6696–6707. [Google Scholar]
Morrill, J.; Salvi, C.; Kidger, P.; Foster, J. Neural rough differential equations for long time series. In Proceedings of the International Conference on Machine Learning, Virtual, 18–24 July 2021; pp. 7829–7838. [Google Scholar]
Kidger, P. On Neural Differential Equations. arXiv 2022, arXiv:2202.02435. [Google Scholar]
Rokhlin, V. Rapid solution of integral equations of classical potential theory. J. Comput. Phys. 1985, 60, 187–207. [Google Scholar]
Rokhlin, V. Rapid solution of integral equations of scattering theory in two dimensions. J. Comput. Phys. 1990, 86, 414–439. [Google Scholar]
Greengard, L.; Kropinski, M.C. An integral equation approach to the incompressible Navier-Stokes equations in two dimensions. SIAM J. Sci. Comput. 1998, 20, 318–336. [Google Scholar]
Effati, S.; Buzhabadi, R. A neural network approach for solving Fredholm integral equations of the second kind. Neural Comput. Appl. 2012, 21, 843–852. [Google Scholar] [CrossRef]
Zappala, E.; de Oliveira Fonseca, A.H.; Caro, J.O.; van Dijk, D. Neural Integral Equations. arXiv 2023, arXiv:2209.15190v4. [Google Scholar]
Xiong, Y.; Zeng, Z.; Chakraborty, R.; Tan, M.; Fung, G.; Li, Y.; Singh, V. Nystromformer: A nystrom-based algorithm for approximating self-attention. In Proceedings of the Proceedings of the AAAI Conference on Artificial Intelligence, Vancouver, BC, Canada, 2–9 February 2021; Volume 35, p. 14138. [Google Scholar]
Cacuci, D.G. Introducing the nth-Order Features Adjoint Sensitivity Analysis Methodology for Nonlinear Systems (nth-FASAM-N): I. Mathematical Framework. Am. J. Comput. Math. 2024, 14, 11–42. [Google Scholar] [CrossRef]
Cacuci, D.G. Introducing the Second-Order Features Adjoint Sensitivity Analysis Methodology for Neural Ordinary Differential Equations. I: Mathematical Framework. Processes 2024, 12, 2660. [Google Scholar] [CrossRef]
Cacuci, D.G. “Introducing the Second-Order Features Adjoint Sensitivity Analysis Methodology for Fredholm-Type Neural Integral Equations. Mathematics 2025, 13, 14. [Google Scholar] [CrossRef]
Weinberg, A.M.; Wigner, E.P. The Physical Theory of Neutron Chain Reactors; The University of Chicago Press: Chicago, IL, USA, 1958. [Google Scholar]
Lamarsh, J.R. Introduction to Nuclear Reactor Theory; Addison-Wesley Publishing Company: Reading, MA, USA, 1966. [Google Scholar]
Duderstadt, J.J.; Hamilton, L.J. Nuclear Reactor Analysis; John Wiley & Sons: New York, NY, USA, 1976. [Google Scholar]
Zappala, E.; de Oliveira Fonseca, A.H.; Moberly, A.H.; Higley, J.M.; Abdallah, C.; Cardin, J.; Van Dijk, D. Neural Integro-Differential Equations. arXiv 2022, arXiv:2206.14282. [Google Scholar]

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Cacuci, D.G. Introducing the Second-Order Features Adjoint Sensitivity Analysis Methodology for Neural Integral Equations of the Volterra Type: Mathematical Methodology and Illustrative Application to Nuclear Engineering. J. Nucl. Eng. 2025, 6, 8. https://doi.org/10.3390/jne6020008

AMA Style

Cacuci DG. Introducing the Second-Order Features Adjoint Sensitivity Analysis Methodology for Neural Integral Equations of the Volterra Type: Mathematical Methodology and Illustrative Application to Nuclear Engineering. Journal of Nuclear Engineering. 2025; 6(2):8. https://doi.org/10.3390/jne6020008

Chicago/Turabian Style

Cacuci, Dan Gabriel. 2025. "Introducing the Second-Order Features Adjoint Sensitivity Analysis Methodology for Neural Integral Equations of the Volterra Type: Mathematical Methodology and Illustrative Application to Nuclear Engineering" Journal of Nuclear Engineering 6, no. 2: 8. https://doi.org/10.3390/jne6020008

APA Style

Cacuci, D. G. (2025). Introducing the Second-Order Features Adjoint Sensitivity Analysis Methodology for Neural Integral Equations of the Volterra Type: Mathematical Methodology and Illustrative Application to Nuclear Engineering. Journal of Nuclear Engineering, 6(2), 8. https://doi.org/10.3390/jne6020008

Article Menu

Introducing the Second-Order Features Adjoint Sensitivity Analysis Methodology for Neural Integral Equations of the Volterra Type: Mathematical Methodology and Illustrative Application to Nuclear Engineering

Abstract

1. Introduction

2. First-Order Features Adjoint Sensitivity Analysis Methodology for Neural Integral Equations of the Volterra Type (1st-FASAM-NIE-V)

Particular Case: The First-Order Comprehensive Adjoint Sensitivity Analysis Methodology for Neural Integral Equations of the Volterra Type (1st-CASAM-NIE-V)

3. Illustrative Application of the 1st-CASAM-NIE-V and 1st-FASAM-NIE-V Methodologies to Neutron Slowing Down in an Infinite Homogeneous Hydrogenous Medium

3.1. Application of 1st-CASAM-NIE-V to Directly Compute the First-Order Sensitivities of the Decoder Response with Respect to the Primary Model Parameters

3.2. Efficient Indirect Computation Using the 1st-FASAM-NIE-V of the First-Order Sensitivities of the Decoder Response with Respect to Primary Model Parameters

3.3. Discussion: Direct Versus Indirect Computation of the First-Order Sensitivities of Decoder Response with Respect to the Primary Model Parameters

4. The Second-Order Features Adjoint Sensitivity Analysis Methodology for Neural Integral Equations of the Volterra Type (2nd-FASAM-NIE-V)

5. Illustrative Application of the 2nd-FASAM-NIE-V to Neutron Slowing Down in an Infinite Homogeneous Hydrogenous Medium

5.1. Computation of Second-Order Sensitivities Stemming from $\partial R / \partial F (θ)$

5.2. Computation of Second-Order Sensitivities Stemming from $\partial R / \partial Σ_{d} (θ)$

5.3. Computation of Second-Order Sensitivities Stemming from $\partial R / \partial S$

5.4. Discussion: Direct Computation of Second-Order Sensitivities Versus Their Indirect Computation via Feature Functions

6. Discussion and Conclusions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Introducing the Second-Order Features Adjoint Sensitivity Analysis Methodology for Neural Integral Equations of the Volterra Type: Mathematical Methodology and Illustrative Application to Nuclear Engineering

Abstract

1. Introduction

2. First-Order Features Adjoint Sensitivity Analysis Methodology for Neural Integral Equations of the Volterra Type (1st-FASAM-NIE-V)

Particular Case: The First-Order Comprehensive Adjoint Sensitivity Analysis Methodology for Neural Integral Equations of the Volterra Type (1st-CASAM-NIE-V)

3. Illustrative Application of the 1st-CASAM-NIE-V and 1st-FASAM-NIE-V Methodologies to Neutron Slowing Down in an Infinite Homogeneous Hydrogenous Medium

3.1. Application of 1st-CASAM-NIE-V to Directly Compute the First-Order Sensitivities of the Decoder Response with Respect to the Primary Model Parameters

3.2. Efficient Indirect Computation Using the 1st-FASAM-NIE-V of the First-Order Sensitivities of the Decoder Response with Respect to Primary Model Parameters

3.3. Discussion: Direct Versus Indirect Computation of the First-Order Sensitivities of Decoder Response with Respect to the Primary Model Parameters

4. The Second-Order Features Adjoint Sensitivity Analysis Methodology for Neural Integral Equations of the Volterra Type (2nd-FASAM-NIE-V)

5. Illustrative Application of the 2nd-FASAM-NIE-V to Neutron Slowing Down in an Infinite Homogeneous Hydrogenous Medium

5.1. Computation of Second-Order Sensitivities Stemming from ∂ R / ∂ F θ

5.2. Computation of Second-Order Sensitivities Stemming from ∂ R / ∂ Σ d θ

5.3. Computation of Second-Order Sensitivities Stemming from ∂ R / ∂ S

5.4. Discussion: Direct Computation of Second-Order Sensitivities Versus Their Indirect Computation via Feature Functions

6. Discussion and Conclusions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

5.1. Computation of Second-Order Sensitivities Stemming from $\partial R / \partial F (θ)$

5.2. Computation of Second-Order Sensitivities Stemming from $\partial R / \partial Σ_{d} (θ)$

5.3. Computation of Second-Order Sensitivities Stemming from $\partial R / \partial S$