Abstract
This work presents the mathematical framework of the nth-Order Comprehensive Adjoint Sensitivity Analysis Methodology for Response-Coupled Forward/Adjoint Linear Systems (abbreviated as “nth-CASAM-L”), which is conceived for obtaining the exact expressions of arbitrarily-high-order (nth-order) sensitivities of a generic system response with respect to all of the parameters (including boundary and initial conditions) underlying the respective forward/adjoint systems. Since many of the most important responses for linear systems involve the solutions of both the forward and the adjoint linear models that correspond to the respective physical system, the sensitivity analysis of such responses makes it necessary to treat linear systems in their own right, rather than treating them as particular cases of nonlinear systems. This is in contradistinction to responses for nonlinear systems, which can depend only on the forward functions, since nonlinear operators do not admit bona-fide adjoint operators (only a linearized form of a nonlinear operator admits an adjoint operator). The nth-CASAM-L determines the exact expression of arbitrarily-high order sensitivities of responses to the parameters underlying both the forward and adjoint models of a nonlinear system, thus enable the most efficient and accurate computation of such sensitivities. The mathematical framework underlying the nth-CASAM is developed in linearly increasing higher-dimensional Hilbert spaces, as opposed to the exponentially increasing “parameter-dimensional” spaces in which response sensitivities are computed by other methods, thus providing the basis for overcoming the “curse of dimensionality” in sensitivity analysis and all other fields (uncertainty quantification, predictive modeling, etc.) which need such sensitivities. In particular, for a scalar-valued valued response associated with a nonlinear model comprising TP parameters, the 1st-−CASAM-L requires 1 additional large-scale adjoint computation (as opposed to TP large-scale computations, as required by other methods) for computing exactly all of the 1st-−order response sensitivities. All of the (mixed) 2nd-order sensitivities are computed exactly by the 2nd-CASAM-L in at most TP computations, as opposed to TP(TP + 1)/2 computations required by all other methods, and so on. For every lower-order sensitivity of interest, the nth-CASAM-L computes the “TP next-higher-order” sensitivities in one adjoint computation performed in a linearly increasing higher-dimensional Hilbert space. Very importantly, the nth-CASAM-L computes the higher-level adjoint functions using the same forward and adjoint solvers (i.e., computer codes) as used for solving the original forward and adjoint systems, thus requiring relatively minor additional software development for computing the various-order sensitivities.
  1. Introduction
The functional derivatives of results (customarily called “responses”) produced by computational models of physical systems with respect to the model’s parameters are customarily called the “response sensitivities.” The first-order sensitivities have been used for a variety of purposes, including: (i) understanding the model by ranking the importance of the various parameters; (ii) designing and optimizing the system; (iii) performing “reduced-order modeling” by eliminating unimportant parameters; (iv) quantifying the uncertainties induced in a model response due to model parameter uncertainties; (v) performing data assimilation and model calibration; (vi) performing “model validation” by comparing computations to experiments to address the question “does the model represent reality?” (vii) prioritizing improvements in the model; (viii) performing “forward and inverse” predictive modeling” to obtain best-estimate predicted results with reduced predicted uncertainties.
As is well known, non-linear operators do not admit adjoint operators; only linear operators admit corresponding adjoint operators. For this reason, many of the most important responses for linear systems involve the solutions of both the forward and the adjoint linear models that correspond to the respective physical system. Included among the widest used system responses that involve both the forward and adjoint functions are the various forms of Lagrangian functionals, the Raleigh quotient for computing eigenvalues and/or separation constants when solving partial differential equations, the Schwinger functional for first-order “normalization-free” solutions [,,,,]. These functionals play a fundamental role in optimization and control procedures, derivation of numerical methods for solving equations (differential, integral, integro-differential), etc. The analysis of responses that simultaneously involve both forward and adjoint functions makes it necessary to treat linear systems in their own right, rather than treating them as particular cases of nonlinear systems. This is in contradistinction to responses for nonlinear systems, which can depend only on the forward functions, since nonlinear operators do not admit bona-fide adjoint operators—only a linearized form of a nonlinear operator admits an adjoint operator.
As is well known, even the approximate determination of the first-order sensitivities  of a model response  to  parameters  using conventional finite-difference methods would require at least  large-scale computations with altered parameter values. The computation of the distinct second-order response sensitivities of a would require  large-scale computations, which rapidly becomes unfeasible for large-scale models comprising many parameters, even using supercomputers. The computation of higher-order sensitivities by conventional methods is limited in practice by the so-called “curse of dimensionality” [] since the number of large-scale computations needed by conventional, including statistical, methods for computing higher-order response sensitivities increases exponentially with the order of the response sensitivities.
This work presents the “nth-Order Comprehensive Adjoint Sensitivity Analysis Methodology for Response-Coupled Forward and Adjoint Linear Systems,” which will be abbreviated as “nth-CASAM-L.” The nth-CASAM-L enables the efficient computation of the exact expressions of the 1st-, 2nd-, 3rd- and 4th-order sensitivities of a generic system response which depends on both the forward and adjoint state functions with respect to all of the parameters underlying the respective systems. The mathematical framework of the nth-CASAM-L is constructed upon the fundamental principles introduced by Cacuci [,,,,] for the first- and second-order adjoint sensitivity analysis methodology. The qualifier “comprehensive” is used because the original methodology [,,,,] was extended to provide exact expressions for the sensitivities of a system response not only to the system’s internal parameters but also to the system’s uncertain boundaries and internal interfaces in phase-space [,]. The principles underlying the 2nd-CASAM-L were applied to an OECD/NEA reactor physics benchmark (a reflected plutonium sphere), for which the response of interest was the experimentally-measured [] neutron leakage through the sphere’s outer surface. The nuclear data used in the computational modeling of the neutron transport processes through this spherical benchmark included 21,976 imprecisely known model parameters, as follows: 180 group-averaged total microscopic cross sections, 21,600 group-averaged scattering microscopic cross sections, 120 parameters describing the fission process, 60 parameters describing the fission spectrum, 10 parameters describing the system’s sources, and 6 isotopic number densities. The results obtained for the 1st- and 2nd-order sensitivities of this benchmark’s leakage response to all of these 21,976 uncertain model parameters were presented in [,,,,,]. Notably, the 2nd-order sensitivities caused the expected value of the leakage response to be significantly larger than the computed value of the leakage response. The importance of the 2nd-order sensitivities increased as the relative standard deviations for the cross sections increase. For fully correlated cross sections, for example, neglecting the 2nd-order sensitivities would cause an error as large as 2000% in the expected value of the leakage response, and up to 6000% in the variance of the leakage response.
Subsequently, Cacuci [] has developed the 3rd-order CASAM-L, which was applied to compute [,,] the largest 3rd-order sensitivities of the OECD/NEA benchmark, which turned out to be among the (180)3 = 5,832,000 third-order mixed sensitivities (of which 988,260 are distinct) of this benchmark’s leakage response with respect to the benchmark’s 180 microscopic total cross sections. The results obtained in [,,] indicated that the absolute value of the largest 3rd-order relative sensitivity is ca. 437 times larger than the largest 2nd-order sensitivity and is ca. 20,000 times larger than the largest 1st-order sensitivity. These results have motivated the recent development by Cacuci [] of the 4th-order CASAM-L, which was applied [,] to compute the 45,212,895 distinct fourth-order sensitivities of this benchmark’s leakage response with respect to the benchmark’s 180 microscopic total cross sections.
This work is structured as follows: Section 2 presents the general mathematical formulation of the “nth-Order Comprehensive Adjoint Sensitivity Analysis Methodology for Response-Coupled Forward/Adjoint Linear Systems” (nth-CASAM-L). This innovative methodology has no comparable counterpart in the literature and is applicable to any linear system. The general mathematical formulation of the nth-CASAM-L will be proven to be correct by using “proof by induction,” which comprises the usual steps, namely: (i) establish the pattern underlying the nth-CASAM-L for n = 1; (ii) assume that the pattern is valid for an arbitrarily high-order, n; (iii) prove that the pattern is valid for n + 1. The starting point for this proof by induction is the 4th-order CASAM-L developed by Cacuci [], which is summarized, for convenient referencing, in Section 4 and Section 5.
Section 3 presents the framework of the “5th-Order Comprehensive Adjoint Sensitivity Analysis Methodology Response for Coupled Forward/Adjoint Linear Systems” (5th-CASAM-L). The 5th-CASAM-L will be developed from the 4th-CASAM-L, and its mathematical formalism will also be shown to coincide with the particular formulas obtained by setting n = 5 in the general nth-CASAM-L framework. The 5th-CASAM-L is also innovative and has no counterpart in the literature. Finally, Section 4 offers conclusions regarding the significance of this work’s novel results in the quest to overcome the curse of dimensionality in sensitivity analysis, uncertainty quantification and predictive modeling. The application of the nth-CASAM-L is illustrated in an accompanying work [] by considering a paradigm model which describes the transmission of particles produced by a distributed source through a shield which surrounds the source.
2. The nth-Order Comprehensive Adjoint Sensitivity Analysis Methodology for Response-Coupled Forward/Adjoint Linear Systems (nth-CASAM-L)
The general framework of the “nth-Order Comprehensive Adjoint Sensitivity Analysis Methodology for Response-Coupled Forward/Adjoint Linear Systems (nth-CASAM-L)” will be established by using a “proof by induction.” This proof will comprise the usual steps, as follows:
- Establish the pattern underlying the nth-CASAM-L for n = 1;
 - Assume that the pattern is valid for an arbitrarily high-order, n;
 - Prove that the pattern is valid for n + 1.
 
2.1. The Pattern Underlying the nth-CASAM-L for n = 1, 2, 3, 4
The mathematical framework of the 4th-CASAM-L, which also comprises the results for the 1st-, 2nd-, and 3rd-CASAM-L, has been conceived by Cacuci []. For convenient referencing, the pattern underlying the mathematical framework of the 4th-CASAM-L is succinctly reviewed in Section 4 and Section 5. The main features of the structures of the mathematical frameworks for n = 1, 2, 3, 4, are summarized in Table 1, Table 2, Table 3 and Table 4. This information will be used in the next Subsection to surmise the general mathematical framework underlying the nth-CASAM-L, for any order n, to enable the computations of response sensitivities of any order, n.
       
    
    Table 1.
    First-Order Sensitivities (n = 1).
  
       
    
    Table 2.
    Second-Order Sensitivities (n = 2).
  
       
    
    Table 3.
    Third-Order Sensitivities (n = 3).
  
       
    
    Table 4.
    Fourth-Order Sensitivities (n = 4).
  
2.2. The Pattern Underlying the nth-CASAM-L for Arbitrarily High-Order n
Based on the pattern displayed in Table 1, Table 2, Table 3 and Table 4, the pattern displayed in Table 5, below, is expected to be valid for any integer n.
       
    
    Table 5.
    nth-Order Sensitivities.
  
2.3. Proving That the Framewok for the nth-CASAM-L also Holds for the (n + 1)th-CASAM-L
The pattern presented in Table 5 implies that the generic nth-order sensitivity of the response  to model parameters has the following form, for ;;…:
      
        
      
      
      
      
    
        where the function  is the solution of the following nth-Level Forward System (nth-LFS):
      
        
      
      
      
      
    
        and where the nth-level adjoint function  is the solution of the following nth-Level Adjoint Sensitivity System (nth-LASS):
      
        
      
      
      
      
    
Concatenating Equations (2) and (3) yields the following (n + 1)th-Level Forward Sensitivity System [abbreviated as (n + 1)th-LFSS] for the function :
      
        
      
      
      
      
    
      
        
      
      
      
      
    
        where:
In terms of the function , each nth-order sensitivity can be written as follows:
      
        
      
      
      
      
    
The total G-differential of  is obtained, by definition, as follows:
        
      
        
      
      
      
      
    
        where the direct effect term  depends directly on the vector of parameter variations , and is defined as follows, for :
      
        
      
      
      
      
    
        and where the indirect-effect term  is defined as follows: 
      
        
      
      
      
      
    
        with
        
      
        
      
      
      
      
    
The vector of variations  is the solution of the G-differentiated (n + 1)th-LFS represented by Equations (4) and (5), which is equivalent concatenating the nth-LVSS to the system of equations obtained by G-differentiating the nth-LASS while being subject to the corresponding boundary conditions. The nth-LVSS is assumed to be correct as given in Table 5, namely:
      
        
      
      
      
      
    
      
        
      
      
      
      
    
The G-differential of the nth-LASS is obtained from Equation (3), in the following form:
      
        
      
      
      
      
    
      
        
      
      
      
      
    
        where
        
      
        
      
      
      
      
    
Concatenating Equations (12)−(15) yields the following (n + 1)th-Level Variational Sensitivity System [(n + 1)th-LVSS] for the function :
      
        
      
      
      
      
    
      
        
      
      
      
      
    
        where:
      
        
      
      
      
      
    
      
        
      
      
      
      
    
In principle, the (n + 1)th-LVSS, comprising Equations (17) and (18), could be solved to determine the variational vector  which could be used, in turn, to determine the indirect-effect term . On the other hand, the need for solving the (n + 1)th-LVSS can be avoided by expressing the indirect-effect term  in an alternative way, which does not involve the function . This alternative expression will be obtained by using the solution of a (n + 1)th-Level Adjoint Sensitivity Systems [(n + 1)th-LASS], which will be constructed below by implementing the same sequence of logical steps as were followed for constructing the 1st-LASS (and all of the other lower-level adjoint sensitivity systems).
The (n + 1)th-LASS is constructed in a Hilbert space, denoted as , comprising block-vector elements of the form , with elements having the following structure: . The inner product between two elements,  and , of the Hilbert space , will be denoted as  and is defined as follows:
      
        
      
      
      
      
    
Using the definition provided in Equation (21), form the inner product in  of Equation (12) with a yet undefined vector-valued function , , ; , to obtain the following relation:
        
      
        
      
      
      
      
    
        where  denotes the bilinear concomitant defined on the phase-space boundary , evaluated at the nominal values of the model parameter and respective functions, and where:
      
        
      
      
      
      
    
The first term on right-side of the second equality in Equation (22) is now required to represent the indirect-effect term , which is achieved by requiring that the (n + 1)th-level adjoint function , be the solution of the following (n + 1)th-Level Adjoint Sensitivity System, for ; ;…: 
      
        
      
      
      
      
    
      
        
      
      
      
      
    
        where the vector , , comprises  components defined as follows, for each ; ;…;:
      
        
      
      
      
      
    
      
        
      
      
      
      
    
      
        
      
      
      
      
    
      
        
      
      
      
      
    
      
        
      
      
      
      
    
      
        
      
      
      
      
    
      
        
      
      
      
      
    
The (n + 1)th-level adjoint boundary conditions represented by Equation (25) are selected so as to eliminate, in conjunction with the boundary conditions represented by Equation (18), all of the unknown values of the functions  in the expression of the bilinear concomitant . This bilinear concomitant may vanish after implementing the boundary conditions represented by Equations (18) and (25); if it does not vanish, this bilinear concomitant will be reduced to a residual quantity which will comprise only known values of , ,  and , and will be denoted as .
Using the equations underlying the 4th-LASS together with the relation provided in Equation in Equation yields the following expression for the indirect-effect term  in terms of the (n + 1)th-level adjoint functions , for each ; ;…;:
      
        
      
      
      
      
    
Adding the result obtained in Equation (33) for the indirect effect term to the result provided in Equation (9) for the direct effect term yields the following expression for the total nth-order G-variation (of the response) defined in Equation (8):
        
      
        
      
      
      
      
    
        where  denotes the (n + 1)th-order partial sensitivity of the response  with respect to the model parameters, evaluated at the nominal parameter values . The results obtained in this Subsection are summarized in Table 6.
       
    
    Table 6.
    (n + 1)th-Order Sensitivities.
  
The results summarized in Table 6 were obtained by applying the general principles underlying adjoint sensitivity analysis methodology to the nth-order sensitivities, considered as “model responses.” The expression of the (n + 1)th-order sensitivities were obtained by determining the first-order differential of the nth-order sensitivities, which yields an expression that is identical to the expression that would be obtained by advancing the index, from n to (n + 1), of the expressions of the nth-order sensitivities. Thus, the proof by induction of the general framework is thereby completed.
3. Particular Case: The 5th-Order Comprehensive Adjoint Sensitivity Analysis Methodology for Coupled Forward/Adjoint Linear Systems (5th-CASAM-L)
As an additional application of the general framework of the nth-CASAM, this Subsection will present the “5th-Order Comprehensive Adjoint Sensitivity Analysis Methodology for Coupled Forward/Adjoint Linear Systems” (5th-CASAM-L). The derivation of the 5th-CASAM-L will be performed explicitly, by starting with the results produced by the 4th-CASAM-L, which are presented in Section 5 and summarized in Table 4. It will be shown that the results thus obtained for the 5th-CASAM-L coincide with the results that would be predicted by the nth-CASAM-L (Table 5) for the particular value n = 5.
The 5th-CASAM-L commences by recalling the end-products of the 4th-CASAM-L, namely the expressions of the 4th-order partial sensitivity  of the response with respect to the model parameters, which was obtained in Equation (203) in Section 5, and which can be written in the following form:
      
        
      
      
      
      
    
The function  is the solution of the following 5th-Level Forward System (5th-LFS), which is obtained by concatenating Equations (179), (180), (195) and (196) from Section 5, which are written in block-matrix form as follows:
      
        
      
      
      
      
    
      
        
      
      
      
      
    
      where:
      
        
      
      
      
      
    
The 5th-order total differential of the response  is obtained by G-differentiating the 4th-order total G-differential of the response, to obtain the following expression:
      
        
      
      
      
      
    
      where the direct effect term  depends directly on the vector of parameter variations , and is defined as follows, for :
      
        
      
      
      
      
    
      and where the indirect-effect term  is defined as follows:
      
        
      
      
      
      
    
      with
      
      
        
      
      
      
      
    
The vector of variations  is the solution of the G-differentiated 5th-LFS represented by Equations (36) and (37), which is equivalent concatenating the 4th-LVSS to the system of equations obtained by G-differentiating the 4th-LASS while being subject to the corresponding boundary conditions. The 4th-LVSS has been determined in Section 5 and is provided in Equations (186) and (187). On the other hand, the G-differential of the 4th-LASS is obtained from Equations (195) and (196), and has the following form:
      
        
      
      
      
      
    
      
        
      
      
      
      
    
      where
      
      
        
      
      
      
      
    
Concatenating the 4th-LVSS with the 4th-LASS yields the following 5th-Level Variational Sensitivity System (5th-LVSS) for the function :
      
        
      
      
      
      
    
      
        
      
      
      
      
    
      where:
      
        
      
      
      
      
    
      
        
      
      
      
      
    
In principle, the 5th-LVSS, comprising Equations (46) and (47), could be solved to determine the variational vector  which could be used, in turn, to determine the indirect-effect term . However, the need for solving the 5th-LVSS can be avoided by expressing the indirect-effect term  in terms of the solution of a 5th-Level Adjoint Sensitivity Systems (5th-LASS), which is constructed in a Hilbert space, denoted as , comprising block-vector elements of the form ,, with elements having the following structure: . The inner product between two elements,  and  will be denoted as  and is defined as follows:
      
        
      
      
      
      
    
Using the definition provided in Equation (50), form the inner product in  of Equation (46) with a yet undefined vector-valued function ,, ;…;, to obtain the following relation:
      
        
      
      
      
      
    
      where  denotes the bilinear concomitant defined on the phase-space boundary , evaluated at the nominal values of the model parameter and respective functions, and where:
      
        
      
      
      
      
    
The first term on right-side of the second equality in Equation (51) is now required to represent the indirect-effect term , which is achieved by requiring that the 5th-level adjoint function , be the solution of the following 5th-Level Adjoint Sensitivity System (5th-LASS), for ; ;…;:
      
        
      
      
      
      
    
      
        
      
      
      
      
    
      where the vector , , comprises  components defined as follows, for each ; ;…;:
      
        
      
      
      
      
    
      
        
      
      
      
      
    
      
        
      
      
      
      
    
      
        
      
      
      
      
    
      
        
      
      
      
      
    
      
        
      
      
      
      
    
The 5th-level adjoint boundary conditions represented by Equation (54) are selected so as to eliminate, in conjunction with the boundary conditions represented by Equation (47), all of the unknown values of the functions  in the expression of . This bilinear concomitant may vanish after implementing the boundary conditions represented by Equations (47) and (54); if it does not vanish, this bilinear concomitant will be reduced to a residual quantity which will comprise only known values of , ,  and , and will be denoted as .
Using the equations underlying the 4th-LASS together with the relation provided in Equation (51) in Equation (41) yields the following expression for the indirect-effect term  in terms of the 5th-level adjoint functions , for each ; ;…;:
      
        
      
      
      
      
    
Adding the result obtained in Equation (61) for the indirect effect term to the result provided in Equation (40) for the direct effect term yields the following expression for the total nth-order G-variation (of the response) defined in Equation (39):
      
        
      
      
      
      
    
      where  denotes the 5th-order partial sensitivity of the response  with respect to the model parameters, evaluated at the nominal parameter values . The results obtained in this Subsection are summarized in Table 7.
       
    
    Table 7.
    Fifth-Order Sensitivities (n = 5).
  
4. Mathematical Modeling of Response-Coupled Linear Forward and Adjoint Systems
The mathematical model of a physical system comprises independent variables (e.g., space, time, etc.), dependent variables (aka “state functions”; e.g., temperature, mass, momentum, etc.) and various parameters (appearing in correlations, coordinates of physical boundaries, etc.), which are all interrelated by equations that usually represent conservation laws. The model parameters usually stem from processes that are external to the system under consideration and are seldom, if ever, known precisely. Without loss of generality, the model parameters can be considered to be real scalar quantities, having known nominal (or mean) values and, possibly, known higher-order moments or cumulants (i.e., variance/covariances, skewness, kurtosis), which are usually determined from experimental data and/or processes external to the physical system under consideration. These imprecisely known model parameters will be denoted as ,…,, where the subscript “TP” indicates “Total (number of) Parameters”. It will be convenient to consider that these parameters are components of a “vector of parameters” denoted as , where  denotes the TP-dimensional subset of the set of real scalars. These model parameters are considered to include imprecisely known geometrical parameters that characterize the physical system’s boundaries in the phase-space of the model’s independent variables. Thus,  is a TP-dimensional column vector and its components are considered to include all of the uncertain model parameters, including those that may enter into defining the system’s boundary in the phase-space of independent variables. For subsequent developments, it is convenient to denote matrices and vectors using capital and, respectively, lower case bold letters. The symbol “” will be used to denote “is defined as” or “is by definition equal to”. Transposition will be indicated by a dagger  superscript.
The model is considered to comprise  independent variables which will be denoted as , and which will be considered as the components of a -dimensional column vector denoted as , where the sub/superscript “” denotes the “Total (number of) Independent variables”. The vector  of independent variables is considered to be defined on a phase-space domain, denoted as , . The lower boundary-point of an independent variable is denoted as  (e.g., the inner radius of a sphere or cylinder, the lower range of an energy-variable, etc.), while the corresponding upper boundary-point is denoted as  (e.g., the outer radius of a sphere or cylinder, the upper range of an energy-variable, etc.). A typical example of boundary conditions that depend on imprecisely known model and geometrical parameters is provided by models based on diffusion theory where the boundaries of the physical domain are facing vacuum. For such models, the boundary conditions for the respective state variables (i.e., particle flux and/or current) are imposed not on the physical boundary but on the “extrapolated boundary” of the respective spatial domain. The “extrapolated boundary” depends both on the imprecisely known physical dimensions of the problem’s domain and also on the medium’s properties, i.e., atomic number densities and microscopic transport cross sections. The boundary of , which will be denoted as , comprises the set of all of the endpoints  of the respective intervals on which the components of  are defined, i.e., .
A linear physical system is generally modeled by a system of coupled linear operator-equations which can be generally represented as follows:
      
        
      
      
      
      
    
      where  is a -dimensional column vector of dependent variables, where the sub/superscript “” denotes the “Total (number of) Dependent variables”. The functions , denote the system’s “dependent variables” (also called “state functions”). The matrix  , has dimensions . The components  are operators that act linearly on the dependent variables  and also depend (in general, nonlinearly) on the uncertain parameters . The components of the -dimensional column vector  represent inhomogeneous source terms. The components  of , where the subscript “” indicates sources associated with the “forward” system of equations, are also nonlinear functions of . Since the right-side of Equation (63) may contain distributions, the equality in this equation is considered to hold in the weak (i.e., “distributional”) sense. Similarly, all of the equalities that involve differential equations in this work will be considered to hold in the distributional sense.
When  contains differential operators, a set of boundary and/or initial conditions which define the domain of  must also be given. Since the complete mathematical model is considered to be linear in , the boundary and/or initial conditions needed to define the domain of  must also be linear in . Such linear boundary and/or initial conditions are represented in the following operator form:
      
        
      
      
      
      
    
In Equation (64), the components , of the -dimensional matrix-operator  are operators that act linearly on  and nonlinearly on ; the quantity  denotes the total number of boundary and initial conditions. The operator  is a -dimensional vector comprising components that are operators which in general act nonlinearly on . The subscript “” in Equation (64) indicates boundary conditions associated with the forward state function . In this work, capital bold letters will be used to denote matrices while lower case bold letter will be used to denote vectors; the components of these matrices and vectors may be operators rather than just functions.
Physical problems modeled by linear systems and/or operators are naturally defined in Hilbert spaces. The dependent variables , for the physical system represented by Equations (63) and (64) are considered to be square-integrable functions of the independent variables and are considered to belong to a Hilbert space which will be denoted as , where the subscript “zero” denotes “zeroth-level“ or “original”. Higher-level Hilbert spaces, which will be denoted as  etc., will also be introduced and used in this work. The Hilbert space  is considered to be endowed with the following inner product, denoted as , between two elements  and :
      
        
      
      
      
      
    
The dot in Equation (65) indicates the “scalar product of two vectors”, which is defined as follows:
      
        
      
      
      
      
    
      while the product-notation  in Equation (65) denotes the respective multiple integrals. The linear operator  admits an adjoint operator, denoted as , which is defined through the following relation for a vector :
      
        
      
      
      
      
    
In Equation (67), the formal adjoint operator  is the  matrix comprising elements  obtained by transposing the formal adjoints of the forward operators , i.e.,
      
      
        
      
      
      
      
    
Hence, the system adjoint to the linear system represented by Equations (63) and (64) can generally be represented as follows:
      
        
      
      
      
      
    
      
        
      
      
      
      
    
When  comprises differential operators, the operations (e.g., integration by parts) that implement the transition from the left-side to the right side of Equation (67) give rise to boundary terms which are collectively called the “bilinear concomitant”. The domain of  is determined by selecting adjoint boundary and/or initial conditions so as to ensure that the bilinear concomitant vanishes when the selected adjoint boundary conditions are implemented together with the forward boundary conditions given in Equation (64). The adjoint boundary conditions thus selected are represented in operator form by Equation (70), where the subscript “” indicates adjoint boundary and/or initial conditions associated with the adjoint state function . In most practical situations, the Hilbert space  is self-dual.
The system of equations comprising Equations (63), (64), (69) and (70) will be called the 1st-Level Forward System (1st-LFS), since it is the starting point for the computation of the model’s response, and its solution will be needed for the subsequent computation of the first-order sensitivities of the model’s response with respect to the model parameters. The 1st-LFS can be written compactly in block-matrix form as follows:
      
        
      
      
      
      
    
      
        
      
      
      
      
    
      where the following definitions were used:
      
        
      
      
      
      
    
      
        
      
      
      
      
    
The nominal solution of Equations (63) and (64), which will be denoted as , and the nominal solution of Equations (69) and (70), which will be denoted as , are obtained by solving these equations at the nominal (or mean) values of the model parameter . The superscript “zero” will henceforth be used to denote “nominal” (or, equivalently, “expected” or “mean” values). Thus, the vectors ,  and  satisfy the following equations:
      
        
      
      
      
      
    
      or, equivalently,
      
      
        
      
      
      
      
    
      
        
      
      
      
      
    
In Equations (75) and (76), the notation  has been used to indicate that the quantity within the brackets is to be evaluated at the nominal values of the parameters and state functions. This simplified notation is justified by the fact that when the parameters take on their nominal values, it implicitly means that the corresponding state functions also take on their corresponding nominal values. This simplified notation will be used throughout this work.
The relationship shown in Equation (67), which is the basis for defining the adjoint operator, also provides the following fundamental “reciprocity-like” relation between the sources of the forward and the adjoint equations, cf. Equations (63) and (69), respectively:
      
        
      
      
      
      
    
The functional on the right-side of Equation (77) represents a “detector response”, i.e., reaction-rate between the particles and the medium represented by  which is equivalent to the “number of counts” of particles incident on a detector of particles that “measures” the particle flux . Thus, the source term  in Equation (77) is usually associated with the “result of interest” to be measured and/or computed, which is customarily called the system’s “response”. In particular, if , then , which means that, in such a case, the right-side of Equation (77) provides the value of the dependent variable (particle flux, temperature, velocity, etc.) at the point in phase-space where the respective measurement is performed.
The relation given in Equation (77) indicates that for linear physical systems, the system’s response may depend not only on the model’s state-functions and on the system parameters, but may also depend on the adjoint function The system’s response will be denoted as  and, in the most general case, is a nonlinear operator acting on the model’s forward and adjoint state functions, as well as on imprecisely known parameters, both directly and indirectly through the state functions. The nominal value of the response, , is determined by using the nominal parameter values , the nominal value  of the forward state function and the nominal value  of the adjoint function.
In general, a function-valued (operator) response  can be represented by a spectral expansion in multidimensional orthogonal polynomials or Fourier series of the form:
      
        
      
      
      
      
    
      where the quantities , , denote the corresponding spectral functions (e.g., orthogonal polynomials or Fourier exponential/trigonometric functions) and where the spectral Fourier coefficients  are defined as follows:
      
        
      
      
      
      
    
The coefficients  can themselves be considered as system responses since the spectral polynomials  are perfectly well known while the expansion coefficients will contain all of the dependencies of the respective response on the imprecisely known model and response parameters. This way, the sensitivity analysis of operator-valued responses can be reduced to the sensitivity analysis of scalar-valued responses.
A measurement of a physical quantity can be represented as a response  located at a specific point, , in phase-space, having the following form:
      
        
      
      
      
      
    
      where  denotes the multidimensional Dirac-delta functional. Responses that occur in many fields (e.g., optimization, control, model verification, data assimilation, model validation, model calibration, predictive modeling) are Lagrangians having the following functional form:
      
        
      
      
      
      
    
A particular form of the response defined in Equation (81) is the Schwinger “normalization-free Lagrangian”, which takes on the following form [,]:
      
        
      
      
      
      
    
The system responses defined in Equation (78) through (82) indicate that their sensitivities (i.e., functional derivatives) with respect to the model parameters can be studied by considering a generic scalar-valued response, which will be denoted as , of the following form:
      
        
      
      
      
      
    
      where  denotes a suitably Gateaux- (G-) differentiable function of the indicated arguments. In general,  is nonlinear in , , and , and the components of  are considered to also include parameters that may specifically appear only in the definition of the response under consideration (but which might not appear in the definition of the model). Thus, the (physical) “system” is defined in this work to comprise both the system’s computational model and the system’s response.
The generic response defined in Equation (83) provides the basis for constructing any other responses of specific interest and will therefore be used for the generic “nth-Order Comprehensive Adjoint Sensitivity Analysis Methodology for Response-Coupled Forward/Adjoint Linear Systems”, which will be abbreviated as “nth-CASAM-L” (where “n” is a finite number that indicates any desired, arbitrarily-high, order) and which will be developed in the remainder of this work. Note that the generic response defined in Equation (83) is, in general, a nonlinear function of all of its arguments, i.e.,  is nonlinear in , , and .
5. The nth-CASAM-L Methodology for n = 1, 2, 3, 4
5.1. The 1st-CASAM-L: Summary
The model parameters  are imprecisely known quantities, so their actual values may differ from their nominal values by quantities denoted as  Since the model parameters  and the state functions are related to each other through the forward and adjoint systems, it follows that variations  in the model parameters will cause corresponding variations  and , ,  in the forward and, respectively, adjoint state functions. In turn, the variations , , and  will cause a response variation  around the nominal response value .
The first-order total G-variation  of a function  for arbitrary variations  is defined as follows:
      
        
      
      
      
      
    
The first-order G-variation  is an operator defined on the same domain as , and has the same range as . The G-variation  satisfies the relation  with  The existence of the G-variation  does not guarantee its numerical computability. Numerical methods (e.g., Newton’s method and variants thereof) most often require that  be linear in the variations  in a neighborhood  around . The necessary and sufficient conditions for the G-differential  of a nonlinear operator  to be linear in the variations  in a neighborhood  around  are as follows:
- (i).
 - satisfies a weak Lipschitz condition at ; i.e.,
 - (ii).
 - For two arbitrary vectors of variations and , the operator satisfies the relation
 
In practice, it is not necessary to investigate if  satisfies the conditions presented in Equations (85) and (86) since it is usually evident if the right-side of Equation (84) is linear (or not) in the variations . A total variation  which is linear in  is called the “total differential of ” and is usually denoted as . In this case, the partial variation  is called the “partial differential  of  with respect to ”. Thus, the total differential  is linear in the variations  and the following representation holds:
      
        
      
      
      
      
    
        where the quantities  denote the partial derivatives of  with respect to its arguments . It will henceforth be assumed that all of the operators considered in this work satisfy the conditions presented in Equations (85) and (86), and therefore admit partial G-derivatives such that the representation shown in Equation (87) exists.
Since this work ultimately aims at deriving the explicit expressions of the nth-order (i.e., arbitrarily-high order) sensitivities of the response  with respect to the model parameters, the proliferation of indices, superscripts and subscripts is unavoidable. To avoid the use of “subscripted-subscripts” as much as possible, the subscripts which will be used for denoting the sensitivities (i.e., partial G-derivatives) of the model response with respect to the model parameters will have the following form: (i) , for first-order sensitivities; (ii) , for 2nd-order sensitivities; (iii) , for 3rd-order sensitivities, and so on. Thus, the indices  will be used for the nth-order sensitivities.
The 1st-order Gateaux- (G-) variation, denoted as , of the response  for arbitrary variations  in the model parameters and state functions, in a neighborhood  around , where  is a real scalar ( denotes the underlying field of scalars), is defined as follows:
      
        
      
      
      
      
    
In Equation (88), the “indirect-effect” term  depends only on the variations  and  in the state functions, and is defined as follows:
      
        
      
      
      
      
    
        while the “direct-effect” term  depends only on the parameter variations  and is defined as follows:
        
      
        
      
      
      
      
    
        where
        
      
        
      
      
      
      
    
In Equations (89) and (90), the notation  is used to indicate that the quantity within the brackets is to be evaluated at the nominal values of the parameters and state functions. As has been previously mentioned, this simplified notation is justified by the fact that when the parameters take on their nominal values, it implicitly means that the corresponding state functions also take on their corresponding nominal values. The “direct effect” term  defined in Equation (90) depends directly on the parameter variations , and can be computed immediately since it does not depend on the variations  and . On the other hand, the “indirect effect” term  defined in Equation (89) depends indirectly on the parameter variations , through the variations  and  in the forward state functions  and , and can be computed only after having computed the values of the variations  and .
The variations  and  are the solutions of the equations obtained by the G-differentiating Equations (63), (64), (69) and (70), which yields the following system:, which will be called the 1st-Level Variational Sensitivity System (1st-LVSS):
      
        
      
      
      
      
    
      
        
      
      
      
      
    
      where
      
      
        
      
      
      
      
    
      
        
      
      
      
      
    
The matrices  and , which appear on the right-side of Equation (94), are defined as follows:
	  
      
        
      
      
      
      
    
The matrices  and , which appear on the right-side of Equation (96), have the same structures as the matrices  and , respectively.
Considering the definitions provided in Equations (91), (94) and (95), it follows from Equation (92) that the state functions  and  are implicit functions of the model parameters ; therefore, the variations of the variations  and  can be considered to represent the following total differentials:
      
        
      
      
      
      
    
Using the results provided in Equations (91), (94)−(97) into Equations (92) and (93), and identifying the expressions that multiply each parameter variation  yields the following system of equations:
      
        
      
      
      
      
    
      with the functions  and  subject to the corresponding boundary conditions derived from Equation (93). It is evident from Equation (98) that the determinations of the functions  and  by solving the 1st-LVSS would require  large-scale computations, which would be prohibitively expensive in terms of computational resources.
The alternative path for determining the indirect-effect term  defined in Equation (89), which avoids the need to compute the functions  and , is provided by the first-order adjoint sensitivity analysis methodology which was conceived by Cacuci [,], which will be presented in this subsection in its extended framework, including the computation of response sensitivities to parameters underlying boundary and interface conditions. This extended framework is called the “Adjoint Sensitivity Analysis Methodology for Response-Coupled Forward and Adjoint Linear Systems,” and is abbreviated as 1st-CASAM-L. The starting point for the 1st-CASAM-L is the 1st-LVSS, cf. Equations (92) and (93), which will be written in the following block-matrix form:
      
        
      
      
      
      
    
      
        
      
      
      
      
    
      where
      
      
        
      
      
      
      
    
The solution  of the “1st-Level Variational Sensitivity System” (1st-LVSS) will be called the “1st-level variational sensitivity function, as highlighted by the superscript “(1)”.
The indirect-effect term  defined in Equation (89) can be computed by applying of the “First-Order Comprehensive Adjoint Sensitivity Analysis Methodology for Response-Coupled Forward and Adjoint Linear Systems”, abbreviated as 1st-CASAM-L, which avoids the need for computing the functions  and . The 1st-CASAM-L uses a Hilbert space, denoted as , which comprises square-integrable functions vector-valued elements of the form , , , and which is endowed with an inner product between two elements, , , denoted as  and defined as follows:
      
        
      
      
      
      
    
In the Hilbert , form the inner product of Equation (92) with a yet undefined vector-valued function  to obtain the following relation:
      
        
      
      
      
      
    
      where  denotes the bilinear concomitant defined on the phase-space boundary , and where  is the operator formally adjoint to , i.e.,
      
      
        
      
      
      
      
    
Require the first term on right-side of the second equality in Equation (103) to represent the indirect-effect term defined in (89), to obtain the following relation:
      
        
      
      
      
      
    
      where the subscript “A” denotes “adjoint” and where the source term  is defined as follows:
      
        
      
      
      
      
    
Implement the boundary conditions given in Equation (93) into Equation (103), thereby eliminating the remaining unknown boundary-values of the functions  and  from the expression of the bilinear concomitant  by selecting appropriate boundary conditions for the function , to ensure that Equation (105) is well-posed while being independent of unknown values of , , and . The boundary conditions thus chosen for the function  can be represented in operator form as follows:
      
        
      
      
      
      
    
The selection of the boundary conditions for the adjoint function  represented by (107) eliminates the appearance of the unknown values of  in  and reduces this bilinear concomitant to a residual quantity that contains boundary terms involving only known values of , , ,  and . This residual quantity will be denoted as . In general, this residual quantity does not automatically vanish, although it may do so occasionally.
The system of equations comprising Equation (105) together with the boundary conditions represented by Equation (107) constitutes the 1st-Level Adjoint Sensitivity System (1st-LASS). The solution  of the 1st-LASS is called the 1st-level adjoint function. The 1st-LASS is called “first-level” (as opposed to “first-order”) because it does not contain any differential or functional-derivatives, but its solution, , will be used below to compute the first-order sensitivities of the response with respect to the model parameters. This terminology will be also used in the sequel, when deriving the expressions for the higher-order sensitivities.
It follows from Equations (103) and (105) that the indirect-effect term  can be expressed in terms of the 1st-level adjoint function  as follows:
	  
      
        
      
      
      
      
    
As indicated by the last equality in Equation (108), the variations  and  have been eliminated from the original expression of the indirect-effect term, which now instead depends on the 1st-level adjoint function . As indicated in Equation (105), solving the 1st-LASS entails the following operations: (i) inverting (i.e., solving) the original left-side of the adjoint equation with the source  to obtain the 1st-level adjoint function ; and (ii) inverting the original left-side of the forward equation with the source  to obtain the 1st-level adjoint function . It is very important to note that the 1st-LASS is independent of parameter variations . Hence, the 1st-LASS needs to be solved only once (as opposed to the 1st-LVSS, which would need to be solved anew for each parameter variation) to determine the 1st-level adjoint function . Subsequently, the “indirect-effect term”  is computed efficiently and exactly by simply performing the integrations over the adjoint function , as indicated on the right-side of Equation (108).
The total 1st-order sensitivity of the response  to the model parameters is obtained by adding the expressions of the direct-effect term defined in Equation (90) and indirect-effect term as obtained in Equation (108), which yields the following expression:
        
      
        
      
      
      
      
    
As indicated in the last equality in Equation (109), the expression of the first-order G-variation  no longer depends on the variations  and , which are expensive to compute, but instead depends on the 1st-level adjoint function , which is as inexpensive to compute as the original state function(s). In particular, the expression in Equation (109) also reveals that the sensitivities of the response  to parameters that characterize the system’s boundary and/or internal interfaces can arise both from the direct-effect and indirect-effect terms. It also follows from Equation (109) that the total 1st-order G-variation  can be expressed in terms of the 1st-level adjoint functions as follows:
      
        
      
      
      
      
    
      where the quantities  represent the 1st-order response sensitivities with respect to the model parameters, evaluated at the nominal values of the state functions and model parameters. For the subsequent computation of higher-order responses sensitivities, it is convenient to represent the 1st-order response sensitivities as follows:
      
        
      
      
      
      
    
In particular, if the residual bilinear concomitant is non-zero, the functions  would contain suitably defined Dirac delta-functionals for expressing the respective non-zero boundary terms as volume-integrals over the phase-space of the independent variables. Dirac-delta functionals would also be needed to represent, within , the terms containing the derivatives of the boundary end-points with respect to the model and/or response parameters. As indicated by Equation (111), the computation of the 1st-order response sensitivities to all model parameters requires two large-scale computations (which is just as many as solving the original system) namely: one large-scale computation for computing the first-level adjoint state function  and one large-scale computation for computing the first-level adjoint state function .
5.2. The 2nd-CASAM-L: Summary
For each value of the index , the first-order sensitivity  will be considered to play the role of a “model response.” The starting point for the development of the 2nd-CASAM-L is provided by the expressions of the sensitivities provided in terms of the 1st-level adjoint functions, namely Equation (111), which indicates that the computation of the first-order sensitivities  using the 1st-CASAM-L requires the determination of the functions  and . In view of the developments underlying the 1st-CASAM-L, the functions  and  are the solutions of the following system of equations, written in block-matrix form:
      
        
      
      
      
      
    
      
        
      
      
      
      
    
        where
        
      
        
      
      
      
      
    
The system of equations represented by Equations (112) and (113) will be called the 2nd-Level Forward System, (2nd-LFS) since it plays the same role as the First-Level (original) Forward System (1st-LFS) has played for the computation of the first-order response sensitivities. In terms of the function , each 1st-order sensitivity can be compactly written, for each , as follows:
      
        
      
      
      
      
    
There are  distinct 2nd-order sensitivities of the response with respect to the model and response parameters. The 2nd-order total sensitivity of the model response is obtained by applying the definition of the 1st-order G-differential to Equation (111), which yields the following result, for each :
        
      
        
      
      
      
      
    
        where the direct-effect term  is defined as follows:
      
        
      
      
      
      
    
        while the indirect-effect term  is defined as follows:
      
        
      
      
      
      
    
        where
        
      
        
      
      
      
      
    
As indicated by the subscript “zero,” the expression in Equation (118) is to be computed at nominal parameter and state function values.
The direct-effect term  can be computed immediately. The indirect-effect term, , however, depends on the variation . In turn, the variation  is the solution of G-differentiated 2nd-LFS or, equivalently, the solution of the system obtained by concatenating the 1st-LVSS with the equations obtained by G-differentiating the 1st-LASS, cf. Equations (105) and (107). Either way, one obtains the following system of equations:
      
        
      
      
      
      
    
      
        
      
      
      
      
    
        where:
		
      
        
      
      
      
      
    
      
        
      
      
      
      
    
      
        
      
      
      
      
    
      
        
      
      
      
      
    
      
        
      
      
      
      
    
The system comprising Equations (120) and (121) will be called the “2nd-Level Variational Sensitivity System” (2nd-LVSS), and its solution, , will be called the 2nd-level variational function.
All of the components of the matrices and vectors defined in Equations (120) and (121) are to be computed at nominal parameter and state function values. The matrix  depends only the system’s response and is responsible for coupling the forward and adjoint systems. Although the forward and adjoint systems are coupled, they could nevertheless be solved successively rather than simultaneously, because the matrix  is a 4 × 4 block-diagonal matrix.
In principle, the 2nd-LVSS could be solved for each possible component , , of the vector of parameter variations  to obtain the 2nd-level variational function . Subsequently,  could be used together with the known parameter variations  in Equation (118) to compute the indirect-effect term . Computing  would require at least  large-scale computations for solving the 2nd-LVSS for every independent component (, ,  ) of the vectors of parameter variations .
Following the principles introduced by Cacuci (2018), the application of the 2nd-CASAM-L avoids the need for solving the 2nd-LVSS, by expressing the indirect-effect term  defined in Equation (118) in an alternative manner, in terms of the solution of a 2nd-Level Adjoint Sensitivity System (2nd-LASS), which is constructed so as to eliminate the appearance of the 2nd-level variational vector  in the alternative expression of the indirect-effect term . The construction of the requisite 2nd-LASS commences by introducing a Hilbert space, denoted as , comprising square-integrable vector-valued elements of the form , with components . The inner product between two elements,  and , in this Hilbert space will be denoted as  and is defined as follows:
		
      
        
      
      
      
      
    
In the Hilbert , form the inner product of Equation (120) with a yet undefined function ,, to obtain the following relation:
		
      
        
      
      
      
      
     where  denotes the bilinear concomitant defined on the phase-space boundary  and where  is the operator formally adjoint to , having the following form:
		
      
        
      
      
      
      
    
The first term on right-side of the second equality in Equation (128) is now required to represent the indirect-effect term  defined in Equation (118). This requirement is satisfied requiring that the 2nd-level adjoint function , be the solution of the following 2nd-Level Adjoint Sensitivity System (2nd-LASS):
      
        
      
      
      
      
    
      
        
      
      
      
      
    
        where the following definitions were used for each :
		
      
        
      
      
      
      
    
The adjoint boundary conditions represented by Equation (131) are selected so as to eliminate, in conjunction with the boundary conditions provided in Equation (121), all of the unknown values of the functions  in the expression of the bilinear concomitant . The bilinear concomitant may vanish after these boundary conditions are implemented, but if it does not, it will be reduced to a residual quantity which will be denoted as  and which will comprise only known values of , ,  and .
The results provided in Equation (130), (131) and (128) are employed in Equation (118) to obtain the following expression for the indirect-effect term  in terms of the 2nd-level adjoint functions :
      
        
      
      
      
      
    
Inserting the expressions that define the vector  from Equation (122) into Equation (133) and adding the resulting expression for the indirect-effect term with the expression of the direct-effect term given in Equation (117) yields the following expression for the total second-order G-differential of the response :
		
      
        
      
      
      
      
     where  denotes the 2nd-order partial sensitivity of the response with respect to the model parameters, evaluated at the nominal parameter values , and has the following expression for :
		
      
        
      
      
      
      
    
Since the 2nd-LASS is independent of parameter variations , the exact computation of all of the partial second-order sensitivities  requires at most  large-scale (adjoint) computations using the 2nd-LASS, rather than  large-scale computations as would be required by forward methods. It is important to note that by solving the 2nd-LASS -times, the “off-diagonal” 2nd-order mixed sensitivities  will be computed twice, in two different ways (i.e., using distinct 2nd-level adjoint functions), thereby providing an independent intrinsic (numerical) verification that the 1st- and 2nd-order response sensitivities are computed accurately. The information provided by the 1st-order sensitivities usually indicates which 2nd-order sensitivities are important and which could be neglected. Therefore, it is useful to prioritize the computation of the 2nd-order sensitivities by using the rankings of the relative magnitudes of the 1st-order sensitivities as a “priority indicator”: the larger the magnitude of the relative 1st-order sensitivity, the higher the priority for computing the corresponding 2nd-order sensitivities. Also, since vanishing 1st-order sensitivities may indicate critical points of the response in the phase-space of model parameters, it is also of interest to compute the 2nd-order sensitivities that correspond to vanishing 1st-order sensitivities. In practice, only those 2nd-order partial sensitivities which are deemed important would need to be computed.
Dirac delta-functionals in Equation (135) may need to be used for expressing the non-zero residual terms in the respective residual bilinear concomitant and/or the terms containing derivatives with respect to the lower- and upper-boundary points, so that the expression of the partial second-order sensitivities  can be written in the following form, in preparation for computing the 3rd-order sensitivities:
      
        
      
      
      
      
    
5.3. The 3rd-CASAM-L: Summary
For each value of the indices ,, the 2nd-order sensitivity  will be considered to play the role of a “model response.” The starting point for the development of the 3rd-CASAM-L is provided by the expressions of the sensitivities provided in terms of the 1st-level adjoint functions, namely Equation (136), which indicates that the computation of the 2nd-order sensitivities  using the 2nd-CASAM-L requires the determination of the functions  and . Recalling the developments of the 2nd-CSASM-L presented in Section 5.2, the functions  and  are the solutions of the following system of equations, written in block-matrix form:
      
        
      
      
      
      
    
      
        
      
      
      
      
    
        where:
		
      
        
      
      
      
      
    
The system of equations represented by Equations (137) and (138) can be called the 3rd-Level Forward System. In terms of the function , each 2nd-order sensitivity can be compactly written as follows:
      
        
      
      
      
      
    
There are  distinct 3rd-order sensitivities of the response with respect to the model and response parameters. The third-order sensitivities of the response  with respect to the model parameters are obtained by determining the first-order G-differential of the 2nd-order sensitivities , which are defined in Equations (136) or (140). By definition, the total G-differential of  is obtained as follows:
		
      
        
      
      
      
      
    
        where the direct-effect term  depends directly on the parameter variations and is defined as follows:
      
        
      
      
      
      
    
        while the “indirect-effect term”  depends indirectly on the parameter variations through the variations in the forward and adjoint state functions and is defined as follows:
      
        
      
      
      
      
    
        where
        
      
        
      
      
      
      
    
The indirect-effect term defined in Equation (143) can be computed only after having determined the vector of variations , which is the solution of the G-differentiated 3rd-LFS. The variation  is the solution of the G-differentiated 2nd-LASS, while  is the solution of the 2nd-LVSS. Thus, the G-differentiated 3rd-LFS is obtained by concatenating the 2nd-LVSS with the G-differentiated 2nd-LASS (including the corresponding boundary/initial conditions), which yields the following system of 8 coupled equations, which can be represented in the block-matrix form, for , as follows:
      
        
      
      
      
      
    
      
        
      
      
      
      
    
The system represented by Equations (145) and (146) is called the 3rd-Level Variational Sensitivity System (3rd-LVSS) and the solution, , of the 3rd-LVSS will be called the “3rd-level variational sensitivity function.” The components of the matrices and vectors which appear in the 3rd-LVSS are to be computed at nominal parameter and state function values, although the corresponding indication has been omitted in order to simplify the notation. The matrices and vectors which appear in the 3rd-LVSS, Equations (145) and (146), are defined as follows:
      
        
      
      
      
      
    
      
        
      
      
      
      
    
      
        
      
      
      
      
    
      
        
      
      
      
      
    
      
        
      
      
      
      
    
      
        
      
      
      
      
    
The components  of the matrix  are defined as follows:
      
        
      
      
      
      
    
      
        
      
      
      
      
    
      
        
      
      
      
      
    
      
        
      
      
      
      
    
      
        
      
      
      
      
    
      
        
      
      
      
      
    
      
        
      
      
      
      
    
      
        
      
      
      
      
    
      
        
      
      
      
      
    
      
        
      
      
      
      
    
The need for solving the 3rd-LVSS can be avoided by expressing the indirect-effect term  defined in Equation (143) in terms of the solutions of a 3rd-Level Adjoint Sensitivity System (3rd-LASS), which does not involve any variations in the state functions and which is constructed by implementing the same sequence of logical steps as used to construct the 1st-LASS and the 2nd-LASS. The 3rd-LASS is constructed in a Hilbert space, denoted as , which comprises vector-valued elements of the form , with . The inner product between two elements,  and , is denoted as  and is defined as follows:
      
        
      
      
      
      
    
In the Hilbert , form the inner product of Equation (145) with a yet undefined vector-function ,, to obtain the following relation:
		
      
        
      
      
      
      
    
		where  denotes the bilinear concomitant defined on the phase-space boundary  and where  has the following form:
      
        
      
      
      
      
    
The first term on right-side of the second equality in Equation (164) is now required to represent the indirect-effect term  defined in Equation (142). This requirement is satisfied by requiring that the 3rd-level adjoint function   , be the solution of the following 3rd-Level Adjoint Sensitivity System (3rd-LASS):
      
        
      
      
      
      
    
      
        
      
      
      
      
    
The block-vector  on the right-side of Equation (166) comprises, for each , eight components (which are themselves block-vectors) defined as follows:
      
        
      
      
      
      
    
      
        
      
      
      
      
    
      
        
      
      
      
      
    
      
        
      
      
      
      
    
      
        
      
      
      
      
    
      
        
      
      
      
      
    
      
        
      
      
      
      
    
      
        
      
      
      
      
    
The adjoint boundary conditions represented by Equation (167) are selected so as to eliminate, in conjunction with the boundary conditions represented by Equation (146), all of the unknown values of the functions  in the expression of the bilinear concomitant , which may vanish but if it does not, it will be reduced to a residual quantity which will be denoted as  and which will comprise only known values of , ,  and .
The equations underlying the 3rd-LASS together with the relation expressed by Equation (164) are employed in Equation (142) to obtain the following expression for the indirect-effect term  in terms of the 3rd-level adjoint function , for :
      
        
      
      
      
      
    
As the identity in the last line of Equation (176) indicates, the dependence of the indirect-effect term  on the variations  has been eliminated, by having replaced this dependency by the dependence on the 3rd-level adjoint function .
Adding the expression for the indirect-effect term  provided in Equation (176) to the expression of the direct-effect term given in Equation (142) yields the following expression for the total third-order G-differential of the response , for , which was defined in Equation (141):
		
      
        
      
      
      
      
    
		where
        
      
        
      
      
      
      
    
        denotes the 3rd-order partial sensitivity of the response with respect to the model parameters, evaluated at the nominal parameter values .
5.4. The 4th-CASAM-L: Summary
The fourth-order sensitivities of the response  with respect to the model parameters are obtained by determining the first-order G-differential of the 3rd-order sensitivities , which are defined in Equation (178). Recalling the developments of the 3rd-CSASM-L presented in Section 5.3, the functions  and  are the solutions of the following system of equations, written in block-matrix form:
      
        
      
      
      
      
    
      
        
      
      
      
      
    
        where:
      
        
      
      
      
      
    
The system of equations represented by Equations (179) and (180) will be called the 4th-Level Forward System.
There are  distinct 4th-order sensitivities of the response with respect to the model and response parameters. In view of the definition introduced in Equation (181), the 3rd-order sensitivity  becomes a function of , and its total G-differential is obtained, by definition, as follows:
		
      
        
      
      
      
      
    
		where the direct-effect term  depends directly on the parameter variations and is defined as follows:
      
        
      
      
      
      
    
        while the “indirect-effect term”  is defined as follows:
      
        
      
      
      
      
    
        where
        
      
        
      
      
      
      
    
The vectors of variations  and  are related to each other and must be determined by solving simultaneously the coupled system of equations obtained by concatenating the 3rd-LVSS to the system of equations obtained by G-differentiating the 3rd-LASS while being subject to the boundary conditions obtained by concatenating the boundary conditions which belong to the 3rd-LVSS with the boundary conditions obtained by G-differentiating the boundary conditions belonging to the 3rd-LASS. Concatenating all of these equations and boundary conditions yields the following 16 × 16 block-matrix equation which constitutes the 4th-Level Variational Sensitivity System (4th-LVSS):
      
        
      
      
      
      
    
      
        
      
      
      
      
    
        where:
      
        
      
      
      
      
    
      
        
      
      
      
      
    
      
        
      
      
      
      
    
      
        
      
      
      
      
    
The need for solving the 4th-LVSS can be avoided by expressing the indirect-effect term  defined in Equation (184) an alternative way, which does not involve the function . This alternative expression will be obtained by using the solution of a 4th-Level Adjoint Sensitivity Systems (4th-LASS), which is constructed by implementing the same sequence of logical steps as were followed for constructing the 1st, 2nd, and 3rd-LASS. The 4th-LASS is constructed in a Hilbert space, denoted as , comprising block-vector elements of the form , with , for . The inner product between two elements,  and , of the Hilbert space , will be denoted as  and is defined as follows:
      
        
      
      
      
      
    
Using the definition provided in Equation (192), form the inner product in  of Equation (186) with a yet undefined vector-valued function denoted as ,, to obtain the following relation:
		
      
        
      
      
      
      
    
		where  denotes the bilinear concomitant defined on the phase-space boundary  and where:
		
      
        
      
      
      
      
    
The first term on right-side of the second equality in Equation (193) is now required to represent the indirect-effect term , which is achieved by requiring that the 4th-level adjoint function  be the solution of the following 4th-Level Adjoint Sensitivity System, for :
      
        
      
      
      
      
    
      
        
      
      
      
      
    
        where the vector  comprises, for each , sixteen components defined as follows:
      
        
      
      
      
      
    
      
        
      
      
      
      
    
      
        
      
      
      
      
    
      
        
      
      
      
      
    
      
        
      
      
      
      
    
The 4th-level adjoint boundary conditions represented by Equation (196) are selected so as to eliminate, in conjunction with the boundary conditions represented by Equation (187), all of the unknown values of the functions  in the expression of the bilinear concomitant . This bilinear concomitant may vanish after implementing the boundary conditions represented by Equations (196) and (187); if it does not vanish, it will be reduced to a residual quantity which will comprise only known values of , ,  and , which will be denoted as .
Using the equations underlying the 4th-LASS together with the relation provided in Equation (193) in Equation (184) yields the following expression for the indirect-effect term  in terms of the 4th-level adjoint functions , for :
        
      
        
      
      
      
      
    
Adding the result obtained in Equation (202) for the indirect effect term to the result in Equation (183) for the direct effect term yields the following expression for the total 3rd-order G-variation defined in Equation (182):
        
      
        
      
      
      
      
    
        where  denotes the 4th-order partial sensitivity of the response with respect to the model parameters.
6. Discussion and Conclusions
This work has presented the “nth-Order Comprehensive Adjoint Sensitivity Analysis Methodology for Response-Coupled Linear Systems” (abbreviation: nth-CASAM-L), where “n” is a finite number that indicates any desired, arbitrarily-high, order. The nth-CASAM-L enables the efficient computation of the exact expressions of the nth-order functional derivatives (“sensitivities”) of a general system response, which depends on both the forward and adjoint state functions, with respect to all of the parameters (including boundary and initial conditions) underlying the respective forward and adjoint systems. Since nonlinear operators do not admit adjoint operators (only linearized versions of nonlinear operators can admit a bona-fide adjoint operator), responses that simultaneously depend on forward and adjoint functions can arise only in conjunction with linear systems. Particularly important model responses that involve both the forward and adjoint functions are various forms of Lagrangian functionals, the Roussopoulos functional for computing reaction rates, the Raleigh quotient for computing eigenvalues and/or separation constants when solving partial differential equations, the Schwinger functional for “normalization-free” solutions, and many others, (e.g., [,]). These functionals play a fundamental role in various optimization and control procedures, derivation of numerical methods for solving (differential, integral, integro-differential) equations, etc. The sensitivity analysis of responses that simultaneously involve both forward and adjoint state functions makes it necessary to treat linear models/systems in their own right, rather than treating them as particular cases of nonlinear systems (in which the responses can depend only on the forward functions).
This work has shown that the mathematical framework underlying the nth-CASAM-L is set in linearly increasing higher-dimensional Hilbert spaces, as opposed to exponentially increasing parameter-dimensional spaces. In particular, for a scalar-valued valued response associated with a nonlinear model comprising TP parameters, the 1st-CASAM-L requires 1 additional large-scale adjoint computation (as opposed to TP large-scale computations, as required by other methods) for computing exactly all of the 1st-order response sensitivities. All of the (mixed) 2nd-order sensitivities are computed exactly by the 2nd-CASAM-L in at most TP computations, as opposed to TP(TP + 1)/2 computations required by all other methods, and so on. For every lower-order sensitivity of interest, the nth-CASAM-L computes the “TP next-higher-order” sensitivities in one adjoint computation performed in a linearly increasing higher-dimensional Hilbert space. Very importantly, the nth-CASAM-L computes the higher-level adjoint functions using the same forward and adjoint solvers (i.e., computer codes) as used for solving the original forward and adjoint systems, thus requiring relatively minor additional software development for computing the various-order sensitivities.
The nth-CASAM presented in this work is the only practically implementable methodology for obtaining the exact expressions (i.e., free of methodologically-introduced approximations) of arbitrarily-high order sensitivities (functional derivatives) of model responses to model parameters, for coupled forward/adjoint linear systems. By enabling the practical computation of any arbitrarily-order response sensitivities to model parameters, the nth-CASAM-L makes it possible to compare the relative values of the sensitivities of various order, in order to assess which sensitivities are important and which may actually be neglected, thus enabling future investigations of the convergence of the (multivariate) Taylor series expansion of the response in terms of parameter variations, as well as investigating the actual validity of expressions that are derived from Taylor-expansion of the response (e.g., response variances/covariance, skewness, kurtosis, etc.) as a function of the model’s parameters. The larger the number of model parameters, the more efficient the C-ASAM-L becomes for computing arbitrarily high-order response sensitivities. The nth-CASAM-L also enables the direct derivation, on paper, of the expression of a specific high-order sensitivity of interest, which can subsequently be computed directly; such a direct computation is not possible with any statistical method.
The nth-CASAM-L presented in this work provides a fundamentally important step in the quest to overcome the “curse of dimensionality” in sensitivity analysis, uncertainty quantification and predictive modeling. Ongoing work aims at developing the “nth-Order Comprehensive Adjoint Sensitivity Analysis Methodology for Nonlinear Systems” (abbreviation: nth-CASAM-N) for the practical, efficient, and exact computation of arbitrarily-high order sensitivities of responses to model parameters for nonlinear systems. The nth-CASAM-L, together with the nth-CASAM-N, are expected to revolutionize all of the fields of activities which require response sensitivities, including the fields of uncertainty quantification, model validation, optimization, data assimilation, model calibration, sensor fusion, reduced-order modeling, inverse problems, and predictive modeling.
Funding
This research received no external funding.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
Not applicable.
Conflicts of Interest
The author declares no conflict of interest.
References
- Lewins, J. Importance: The Adjoint Function; Pergamon Press Ltd.: Oxford, UK, 1965. [Google Scholar]
 - Stacey, W.M. Variational Methods in Nuclear Reactor Physics; Academic Press Inc.: New York, NY, USA, 1974. [Google Scholar]
 - Cacuci, D.G. Second-order sensitivities of a general functional of the forward and adjoint fluxes in a multiplying nuclear system with source. Nucl. Eng. Des. 2019, 344, 83–106. [Google Scholar] [CrossRef]
 - Cacuci, D.G. The Roussopoulos and Schwinger Functionals for Nuclear Systems Involving Imprecisely Known Fluxes and Parameters: Distinctions and Equivalences. Nucl. Sci. Eng. 2019, 193, 681–721. [Google Scholar] [CrossRef] [Green Version]
 - Cacuci, D.G. Second-order adjoint sensitivity analysis of ratios of functionals of the forward and adjoint fluxes in a multi-plying nuclear system with source. Ann. Nucl. Energy 2020, 135, 106956. [Google Scholar] [CrossRef]
 - Bellman, R.E. Dynamic Programming; Rand Corporation: Santa Monica, CA, USA, 1957; Princeton University Press: Princeton, NJ, USA, 1957; ISBN1 978-0-691-07951-6. Republished by Courier Dover Publications: Mineola, NJ, USA, 2003; ISBN2 978-0-486-42809-3. [Google Scholar]
 - Cacuci, D.G. Sensitivity theory for nonlinear systems: I. Nonlinear functional analysis approach. J. Math. Phys. 1981, 22, 2794–2802. [Google Scholar] [CrossRef]
 - Cacuci, D.G. Sensitivity theory for nonlinear systems. II. Extensions to additional classes of responses. J. Math. Phys. 1981, 22, 2803–2812. [Google Scholar] [CrossRef]
 - Cacuci, D.G. Sensitivity and Uncertainty Analysis: Theory; Chapman & Hall/CRC: Boca Raton, FL, USA, 2003; Volume 1. [Google Scholar]
 - Cacuci, D.G. Second-order adjoint sensitivity analysis methodology (2nd-ASAM) for computing exactly and efficiently first- and second-order sensitivities in large-scale linear systems: I. Computational methodology. J. Comput. Phys. 2015, 284, 687–699. [Google Scholar] [CrossRef] [Green Version]
 - Cacuci, D.G. Second-order adjoint sensitivity analysis methodology (2nd-ASAM) for large-scale nonlinear systems: I. Theory. Nucl. Sci. Eng. 2016, 184, 16–30. [Google Scholar] [CrossRef]
 - Cacuci, D.G. The Second-Order Adjoint Sensitivity Analysis Methodology; CRC Press, Taylor & Francis Group: Boca Raton, FL, USA, 2018. [Google Scholar]
 - Cacuci, D.G. Second-Order Adjoint Sensitivity Analysis Methodology for Computing Exactly Response Sensitivities to Uncertain Parameters and Boundaries of Linear Systems: Mathematical Framework. Am. J. Comput. Math. 2020, 10, 329–354. [Google Scholar] [CrossRef]
 - Valentine, T.E. Polyethylene-Reflected Plutonium Metal Sphere Subcritical Noise Measurements, SUB-PU-METMIXED-001. In International Handbook of Evaluated Criticality Safety Benchmark Experiments; NEA/NSC/DOC(95)03/I-IX; Organization for Economic Co-Operation and Development, Nuclear Energy Agency: Paris, France, 2006. [Google Scholar]
 - Cacuci, D.G.; Fang, R.; Favorite, J.A.; Badea, M.C.; Di Rocco, F. Comprehensive Second-Order Adjoint Sensitivity Analysis Methodology (2nd-ASAM) Applied to a Subcritical Experimental Reactor Physics Benchmark: I. Effects of Imprecisely Known Microscopic Total and Capture Cross Sections. Energies 2019, 12, 4219. [Google Scholar] [CrossRef] [Green Version]
 - Fang, R.; Cacuci, D.G. Comprehensive Second-Order Adjoint Sensitivity Analysis Methodology (2nd-ASAM) Applied to a Subcritical Experimental Reactor Physics Benchmark: II. Effects of Imprecisely Known Microscopic Scattering Cross Sections. Energies 2019, 12, 4114. [Google Scholar] [CrossRef] [Green Version]
 - Cacuci, D.G.; Fang, R.; Favorite, J.A.; Badea, M.C.; Di Rocco, F. Comprehensive Second-Order Adjoint Sensitivity Analysis Methodology (2nd-ASAM) Applied to a Subcritical Experimental Reactor Physics Benchmark: III. Effects of Imprecisely Known Microscopic Fission Cross Sections and Average Number of Neutrons per Fission. Energies 2019, 12, 4100. [Google Scholar] [CrossRef] [Green Version]
 - Fang, R.; Cacuci, D.G. Comprehensive Second-Order Adjoint Sensitivity Analysis Methodology (2nd-ASAM) Applied to a Subcritical Experimental Reactor Physics Benchmark: IV. Effects of Imprecisely Known Source Parameters. Energies 2020, 13, 1431. [Google Scholar] [CrossRef] [Green Version]
 - Fang, R.; Cacuci, D.G. Comprehensive Second-Order Adjoint Sensitivity Analysis Methodology (2nd-ASAM) Applied to a Subcritical Experimental Reactor Physics Benchmark: V. Computation of Mixed 2nd-Order Sensitivities Involving Isotopic Number Densities. Energies 2020, 13, 2580. [Google Scholar] [CrossRef]
 - Cacuci, D.G.; Fang, R.; Favorite, J.A. Comprehensive Second-Order Adjoint Sensitivity Analysis Methodology (2nd-ASAM) Applied to a Subcritical Experimental Reactor Physics Benchmark. VI: Overall Impact of 1st- and 2nd-Order Sensitivities on Response Uncertainties. Energies 2020, 13, 1674. [Google Scholar] [CrossRef] [Green Version]
 - Cacuci, D.G. Third order adjoint sensitivity analysis of reaction rate responses in a multiplying nuclear system with source. Ann. Nucl. Energy 2021, 151, 107924. [Google Scholar] [CrossRef]
 - Cacuci, D.G.; Fang, R. Third-Order Adjoint Sensitivity Analysis of an OECD/NEA Reactor Physics Benchmark: I. Mathematical Framework. Am. J. Comput. Math. 2020, 10, 503–528. [Google Scholar] [CrossRef]
 - Fang, R.; Cacuci, D.G. Third-Order Adjoint Sensitivity Analysis of an OECD/NEA Reactor Physics Benchmark: II. Computed Sensitivities. Am. J. Comput. Math. (AJCM) 2020, 10, 529–558. [Google Scholar] [CrossRef]
 - Fang, R.; Cacuci, D.G. Third-order adjoint sensitivity analysis of an OECD/NEA reactor physics benchmark: III. Response moments. Am. J. Comput. Math. (AJCM) 2020, 10, 559–570. [Google Scholar] [CrossRef]
 - Cacuci, D. Fourth-Order Comprehensive Adjoint Sensitivity Analysis (4th-CASAM) of Response-Coupled Linear Forward/Adjoint Systems: I. Theoretical Framework. Energies 2021, 14, 3335. [Google Scholar] [CrossRef]
 - Cacuci, D.G.; Fang, R. Fourth-Order Adjoint Sensitivity Analysis of an OECD/NEA Reactor Physics Benchmark: I. Mathematical Expressions and CPU-Time Comparisons for Computing 1st-, 2nd- and 3rd-Order Sensitivities. Am. J. Comput. Math. (AJCM) 2021, 11, 94–132. [Google Scholar] [CrossRef]
 - Cacuci, D.G.; Fang, R. Fourth-order adjoint sensitivity analysis of an OECD/NEA reactor physics benchmark: II. Mathematical expressions and CPU-time comparisons for computing 4th-order sensitivities. Am. J. Comput. Math. (AJCM) 2021, 11, 133–156. [Google Scholar] [CrossRef]
 - Cacuci, D.G. The nth-Order Comprehensive Adjoint Sensitivity Analysis Methodology for Response-Coupled For-ward/Adjoint Linear Systems (nth-CASAM-L): II. Illustrative Application. Energies 2021. submitted for publication. [Google Scholar]
 
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.  | 
© 2021 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).