1. Introduction
In practice, the system under consideration is modeled by learning an operator that can reproduce the system from data sampled from it. Typical operator learning problems are formulated on finite grids, using finite-difference methods that approximate the domain of the operator under investigation. Recovering the continuous limit is a challenging undertaking, particularly since irregularly sampled data may alter the evaluation of the learned operator. The use of differential equation solvers to learn dynamics through continuous deep learning models of neural networks, called “Neural Ordinary Differential Equations” (NODE), has been introduced by Chen et al. [1]. As demonstrated by various applications [1,2,3,4,5,6,7,8,9], NODE models provide an explicit connection between deep feed-forward neural networks and dynamical systems, offering flexible trade-offs between efficiency, memory costs and accuracy while bridging modern deep learning and traditional numerical modelling. However, NODE models are limited to describing systems that are instantaneous, since each time-step is determined locally in time, without contributions from the state of the system at other times.
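The local-in-time property noted above can be made concrete with a deliberately minimal sketch: the state advances using only its current value. The forward-Euler loop, tanh dynamics, and weight matrix `W` below are illustrative assumptions, not the adaptive-solver implementation of Chen et al. [1].

```python
import numpy as np

def f(h, t, W):
    """Illustrative 'neural' dynamics: a single tanh layer with weights W."""
    return np.tanh(W @ h)

def node_forward(h0, W, t0=0.0, tf=1.0, n_steps=100):
    """Integrate dh/dt = f(h, t, W) from t0 to tf with forward Euler.
    Each step uses only the current state h(t) -- the 'instantaneous',
    local-in-time property of NODE models noted in the text."""
    h = np.asarray(h0, dtype=float).copy()
    t = t0
    dt = (tf - t0) / n_steps
    for _ in range(n_steps):
        h = h + dt * f(h, t, W)   # no dependence on states at other times
        t += dt
    return h

W = np.array([[0.0, -1.0], [1.0, 0.0]])   # illustrative (rotation-like) weights
h_final = node_forward(np.array([1.0, 0.0]), W)
```

In a trained NODE, `W` would be the optimized weights and the Euler loop would be replaced by an adaptive ODE solver; the local-in-time structure of the update is the point here.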
In contradistinction to differential equations, integral equations (IE) model global spatio-temporal relations, which are learned through an IE-solver [see, e.g., 10] which samples the domain of integration continuously. Due to their non-local behavior, IE-solvers are suitable for modeling complex dynamics. The problem of learning dynamics from data through integral equations has been addressed by Zappala et al. [11], who have introduced the Neural Integral Equation (NIE) and the Attentional Neural Integral Equation (ANIE). The NIE and the ANIE can be used to generate dynamics and can also be used to infer the spatio-temporal relations that generated the data, thus enabling the continuous learning of non-local dynamics with arbitrary time resolution [11,12]. Often, ordinary and/or partial differential equations can be recast in integral-equation forms that can be solved more efficiently using IE-solvers, as exemplified in scattering theory [13], fluid flow [14], and integral neutron and photon transport [15].
Zappala et al. [16] have also developed a deep learning method called the Neural Integro-Differential Equation (NIDE), which “learns” an integro-differential equation (IDE) whose solution approximates data sampled from given non-local dynamics. The motivation for using NIDE stems from the need to model systems that present spatio-temporal relations which transcend local modeling, as illustrated by the pioneering works of Volterra on population dynamics [17]. Combining the properties of differential and integral equations, IDEs also present properties that are unique to their non-local behavior [18,19,20], with applications in computational biology, physics, engineering and applied sciences [18,19,20,21,22,23].
All neural nets are trained by minimizing a “loss functional” which aims at representing the discrepancy between a “reference solution” and the output produced by the respective net’s decoder. The neural net is optimized to reproduce the underlying physical system as closely as possible. However, the physical system modeled by a neural net comprises parameters that stem from measurements and/or computations which are subject to uncertainties. Therefore, even if the neural net modeled the system’s parameters perfectly, the uncertainties inherent in these parameters would propagate to the subsequent results of interest, which are various functionals of the net’s decoder output rather than some “loss functional.” Hence, it is important to quantify the uncertainties induced in the decoder’s output by the uncertainties that afflict the parameters/weights underlying the physical system modeled by the respective neural net. The quantification of the uncertainties in the net’s decoder and derived results (called “responses”) of interest requires the computation of the sensitivities of the decoder’s response with respect to the optimized weights/parameters comprised within the neural net.
Neural nets comprise not only scalar-valued weights/parameters but also scalar-valued functions (e.g., correlations, material properties, etc.) of the model’s scalar parameters. It is convenient to refer to such scalar-valued functions as “features of primary model parameters.” Cacuci [24] has recently introduced the “nth-Order Features Adjoint Sensitivity Analysis Methodology for Nonlinear Systems (nth-FASAM-N),” which enables the most efficient computation of the exact expressions of arbitrarily high-order sensitivities of model responses with respect to the model’s “features.” Subsequently, the sensitivities of the responses with respect to the primary model parameters are determined, analytically and trivially, by applying the “chain-rule” to the expressions obtained for the response sensitivities with respect to the model’s features/functions of parameters.
Based on the general framework of the nth-FASAM-N methodology [24], Cacuci has developed specific sensitivity analysis methodologies for NODE-nets, as follows: the “First-Order Features Adjoint Sensitivity Analysis Methodology for Neural Ordinary Differential Equations (1st-FASAM-NODE)” [25] and the “Second-Order Features Adjoint Sensitivity Analysis Methodology for Neural Ordinary Differential Equations (2nd-FASAM-NODE)” [26]. The 1st-FASAM-NODE and the 2nd-FASAM-NODE are pioneering sensitivity analysis methodologies which enable the computation, with unparalleled efficiency, of exactly-determined first-order and, respectively, second-order sensitivities of the decoder response with respect to the optimized/trained weights involved in the NODE’s decoder, hidden layers, and encoder.
Two important families of IDEs are the Volterra and the Fredholm equations. In a Volterra IDE, the interval of integration grows linearly during the system’s dynamics, while in a Fredholm IDE the interval of integration is fixed during the dynamic history of the system, but at any given time instance within this interval, the system depends on the past, present and future states of the system. By applying the general concepts underlying the nth-FASAM-N methodology [24], Cacuci [27,28] has also developed the general methodologies underlying the “Second-Order Features Adjoint Sensitivity Analysis Methodology for Neural Integral Equations of Fredholm-Type (2nd-FASAM-NIE-F)” and the “Second-Order Features Adjoint Sensitivity Analysis Methodology for Neural Integral Equations of Volterra-Type (2nd-FASAM-NIE-V).” The 2nd-FASAM-NIE-F encompasses the “First-Order Features Adjoint Sensitivity Analysis Methodology for Neural Integral Equations of Fredholm-Type (1st-FASAM-NIE-F),” while the 2nd-FASAM-NIE-V encompasses the “First-Order Features Adjoint Sensitivity Analysis Methodology for Neural Integral Equations of Volterra-Type (1st-FASAM-NIE-V).” The 1st-FASAM-NIE-F and 1st-FASAM-NIE-V methodologies, respectively, enable the computation, with unparalleled efficiency, of exactly-determined first-order sensitivities of the decoder response with respect to the NIE-parameters, requiring a single “large-scale” computation for solving the 1st-Level Adjoint Sensitivity System (1st-LASS), regardless of the number of weights/parameters underlying the NIE-net. The 2nd-FASAM-NIE-F and 2nd-FASAM-NIE-V methodologies, respectively, enable the computation (with unparalleled efficiency) of exactly-determined second-order sensitivities of the decoder response with respect to the NIE-parameters, requiring only as many “large-scale” computations as there are first-order sensitivities with respect to the feature functions.
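The distinction between the two integration domains can be illustrated numerically: a Volterra-type integral term integrates the state history over the growing interval [t0, t], whereas a Fredholm-type term integrates over the fixed interval [t0, tf], so future states also contribute. The kernel, state history, and trapezoidal quadrature below are illustrative assumptions, not constructs taken from the cited methodologies.

```python
import numpy as np

def trapezoid(vals, s):
    """Trapezoidal quadrature of sampled values `vals` over nodes `s`."""
    return float(np.sum(0.5 * (vals[1:] + vals[:-1]) * np.diff(s)))

def integral_term(y, t_grid, t_index, kernel, kind):
    """Integral term of an IDE at global time t = t_grid[t_index].
    Volterra: the local-time domain grows with t (integrate over [t0, t]).
    Fredholm: the domain is the fixed interval [t0, tf], so past, present
    and future states of the system all contribute."""
    t = t_grid[t_index]
    if kind == "volterra":
        s = t_grid[: t_index + 1]
        return trapezoid(kernel(t, s) * y[: t_index + 1], s)
    s = t_grid
    return trapezoid(kernel(t, s) * y, s)

t_grid = np.linspace(0.0, 1.0, 201)
y = np.sin(np.pi * t_grid)                     # illustrative state history
kern = lambda t, s: np.exp(-(t - s) ** 2)      # illustrative smooth kernel

mid = 100                                      # t = 0.5, mid-interval
fredholm_term = integral_term(y, t_grid, mid, kern, "fredholm")
volterra_term = integral_term(y, t_grid, mid, kern, "volterra")
```

With this positive integrand, the Fredholm term exceeds the Volterra term at mid-interval precisely because the Fredholm domain also covers the "future" half of the interval.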
This work presents the “First- and Second-Order Features Adjoint Sensitivity Analysis Methodology for Neural Integro-Differential Equations of Fredholm-Type,” abbreviated as “1st-FASAM-NIDE-F” and “2nd-FASAM-NIDE-F,” respectively. These methodologies are also based on the general framework of the nth-FASAM-N methodology [24]. The 1st-FASAM-NIDE-F is presented in Section 2, while the 2nd-FASAM-NIDE-F is presented in Section 3.
Section 4 presents an illustrative application of the 1st-FASAM-NIDE-F and 2nd-FASAM-NIDE-F methodologies to a heat transfer model. This illustrative model has been chosen because it can be formulated either as a first-order integro-differential equation of Fredholm type or as a conventional second-order “neural ordinary differential equation (NODE),” while admitting exact closed-form solutions/expressions for all quantities of interest, including state functions and first-order and second-order sensitivities. The availability of these alternative formulations, either as a NIDE-F or as a NODE, of the illustrative paradigm heat conduction model makes it possible to compare the detailed, step-by-step applications of the 1st-FASAM-NIDE-F versus the 1st-FASAM-NODE methodologies (for computing most efficiently the exact expressions of the first-order sensitivities of the decoder response with respect to the model parameters) and, subsequently, to compare the applications of the 2nd-FASAM-NIDE-F versus the 2nd-FASAM-NODE methodologies (for computing most efficiently the exact expressions of the second-order sensitivities of the decoder response with respect to the model parameters).
The discussion offered in Section 5 concludes this work by highlighting the unparalleled efficiency of the 1st-FASAM-NIDE-F and 2nd-FASAM-NIDE-F methodologies for computing exact first- and second-order sensitivities, respectively, of decoder responses to model parameters in optimized NIDE-F networks. Ongoing work aims at developing the “First- and Second-Order Features Adjoint Sensitivity Analysis Methodologies for Neural Integro-Differential Equations of Volterra-Type” (1st-FASAM-NIDE-V and 2nd-FASAM-NIDE-V, respectively), which will enable, for the first time, the most efficient computation of the exact expressions of the first- and second-order sensitivities of decoder responses with respect to the optimized network’s weights/parameters for NIDE-V neural nets.
2. First-Order Features Adjoint Sensitivity Analysis Methodology for Neural Integro-Differential Equations of Fredholm-Type (1st-FASAM-NIDE-F)
The mathematical expression of the network of nonlinear Fredholm-type Neural Integro-Differential Equations (NIDE-F) considered in this work generalizes the NIDE-net model introduced in [16] and is represented in component form by the following system of Nth-order integro-differential equations:
The boundary conditions, imposed at the “initial time” and/or “final time” on the functions and their time-derivatives associated with the encoder of the NIDE-F net represented by Equation (1), are represented in operator form as follows:
The quantities appearing in Equations (1) and (2) are defined as follows:
- (i)
The real-valued scalar quantities and , , are time-like independent variables which parameterize the dynamics of the hidden/latent neuron units. Customarily, the variable is called the “global time” while the variable is called the “local time.” The initial time-value is denoted as while the stopping time-value is denoted as . Thus, the dynamics modeled by Equation (1) depends both on non-local effects and on instantaneous information.
- (ii)
The components of the -dimensional vector-valued function represent the hidden/latent neural networks; denotes the total number of components of . In this work, the symbol “” will be used to denote “is defined as” or, equivalently, “is by definition equal to.” The various vectors will be considered to be column vectors. Typically, vectors will be denoted using bold lower-case letters. The dagger “” symbol will be used to denote “transposition.”
- (iii)
The components of the column-vector represent the “primary” network parameters, namely scalar learnable/adjustable parameters/weights, in all of the latent neural nets, including the encoder(s) and decoder(s), where denotes the total number of adjustable parameters/weights.
- (iv)
The scalar-valued components , , of the vector-valued function represent the “features/functions of the primary model parameters.” The quantity denotes the total number of such feature functions comprised in the NIDE-F. In particular, all of the model parameters that might appear solely in the boundary and/or initial conditions are considered to be included among the components of the vector . In general, is a nonlinear vector-valued function of . The total number of feature functions must necessarily be smaller than the total number of primary parameters (weights), i.e., . When the NIDE-F comprises only primary parameters, it is considered that for all .
- (v)
The functions model the dynamics of the neurons in a latent space where the local time integration occurs, while the functions map the local space back to the original data space. The functions model additional dynamics in the original data space. In general, these functions are nonlinear in their arguments.
- (vi)
The functions are coefficient-functions, which may depend nonlinearly on the functions and , associated with the order,, of the time-derivatives of the functions .
- (vii)
The operators , , represent boundary conditions associated with the encoder and/or decoder, imposed at and/or at on the functions and on their time-derivatives; the quantity “BC” denotes the “total number of boundary conditions.”
Customarily, the NIDE-F net is “trained” by minimizing a user-chosen loss functional representing the discrepancy between a reference solution (“target data”) and the output produced by the NIDE-F decoder. The “training” process produces “optimal” values for the primary parameters , which will be denoted in this work by using the superscript “zero,” as follows: . Using these optimal/nominal parameter values to evaluate the NIDE-F net yields the optimal/nominal solution , which will satisfy the following form of Equation (1):
subject to the following optimized/trained boundary conditions:
After the NIDE-F net is optimized to reproduce the underlying physical system as closely as possible, the subsequent responses of interest are no longer “loss functionals” but become specific functionals of the NIDE-F’s “decoder” output, which can be generally represented by the functional defined below:
The function models the decoder. The scalar-valued quantity is a functional of and , and represents the NIDE-F’s decoder-response. At the optimal/nominal parameter values, i.e., at , the decoder response takes on the following formal form:
The physical system modeled by the NIDE-F net comprises parameters that stem from measurements and/or computations. Consequently, even if the NIDE-F net models perfectly the underlying physical system, the NIDE-F’s optimal weights/parameters are unavoidably afflicted by uncertainties stemming from the parameters underlying the physical system. Hence, it is important to quantify the uncertainties induced in the decoder output, , by the uncertainties that afflict the parameters/weights underlying the physical system modeled by the NIDE-F net. The relative contributions of the uncertainties afflicting the optimal parameters to the total uncertainty in the decoder response are quantified by the sensitivities of the NIDE-F decoder-response with respect to the optimized NIDE-F parameters. The general methodology for computing the first-order sensitivities of the decoder output, , with respect to the components of the feature function , and with respect to the primary model parameters , will be presented in this Section.
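The role that these first-order sensitivities play in uncertainty quantification can be sketched with the standard first-order (“sandwich”) propagation formula, var(R) ≈ sᵀCs, where s collects the response sensitivities and C is the parameter covariance matrix. The numerical values below are illustrative placeholders, not quantities from this work.

```python
import numpy as np

# First-order propagation of parameter uncertainties to the decoder response:
#   var(R) ~= s^T C s,
# where s_j = dR/dalpha_j are the first-order sensitivities and C is the
# covariance matrix of the (optimized) parameters.  Illustrative numbers only.
s = np.array([2.0, -1.0, 0.5])            # sensitivities dR/dalpha_j
C = np.array([[0.04, 0.01, 0.00],         # parameter covariance matrix
              [0.01, 0.09, 0.00],
              [0.00, 0.00, 0.01]])

var_R = float(s @ C @ s)                  # induced response variance
std_R = float(np.sqrt(var_R))             # induced response standard deviation
```

This is why the sensitivities (rather than the loss functional itself) are the quantities of interest after training: they weight each parameter’s uncertainty contribution to the response uncertainty.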
The known nominal values of the primary model parameters (“weights”) characterizing the NIDE-F net will differ from the true but unknown values of the respective weights by variations denoted as . The variations will induce corresponding variations , , in the feature functions. The variations and will induce, through Equation (1), variations around the nominal/optimal functions . In turn, the variations and will induce variations in the NIDE-F decoder’s response.
The “First-Order Features Adjoint Sensitivity Analysis Methodology for Neural Integro-Differential Equations of Fredholm-Type (1st-FASAM-NIDE-F)” aims at obtaining the exact expressions of the first-order sensitivities (i.e., functional derivatives) of the decoder’s response with respect to the feature function and the primary model parameters, followed by the most efficient computation of these sensitivities. The 1st-FASAM-NIDE-F will be established by applying the same principles as those underlying the 1st-FASAM-N methodology [24]. The fundamental concept for defining the sensitivity of an operator-valued quantity with respect to variations in a neighborhood around the nominal values has been shown in 1981 by Cacuci [29] to be provided by the first-order Gateaux (G-) variation of , which is defined as follows:
for a scalar and for arbitrary vectors in a neighborhood around . When the G-variation is linear in the variation , it can be written in the form , where denotes the first-order G-derivative of with respect to , evaluated at .
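The defining limit of the G-variation can be checked numerically for a simple functional: it is the ordinary derivative of R(e⁰ + εv) with respect to the scalar ε at ε = 0. The quadratic functional below is an illustrative assumption, chosen because its G-variation is linear in the direction v and known in closed form.

```python
import numpy as np

def g_variation(R, e0, v, eps=1e-6):
    """Numerical first-order Gateaux variation of a functional R at e0 in
    the direction v: d/d(eps) R(e0 + eps*v) at eps = 0, approximated here
    by a central difference in the scalar eps."""
    return (R(e0 + eps * v) - R(e0 - eps * v)) / (2.0 * eps)

# Illustrative functional R(e) = sum_i e_i^2.  Its exact G-variation in the
# direction v is 2 * (e0 . v), which is linear in v, so the G-derivative
# exists and equals the gradient 2*e0 paired with v.
R = lambda e: float(np.dot(e, e))
e0 = np.array([1.0, 2.0])
v = np.array([0.5, -1.0])

numeric = g_variation(R, e0, v)
exact = 2.0 * float(np.dot(e0, v))   # closed-form G-variation for this R
```

For this quadratic functional the central difference is exact up to rounding, since the O(ε²) terms cancel; for a general nonlinear functional the agreement holds only in the ε → 0 limit.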
Applying the definition provided in Equation (7) to Equation (5) yields the following expression for the first-order G-variation of the response :

where the “direct effect term” arises directly from variations and is defined as follows:

and where the “indirect effect term,” which arises indirectly through the variations in the hidden state functions , is defined as follows:
The direct-effect term can be quantified using the nominal values but the indirect-effect term can be quantified only after determining the variations , which are caused by the variations through the NIDE-F net defined in Equation (1).
The first-order relationship between the variations and is obtained from the first-order G-variations of Equations (1) and (2), which are obtained, by definition, as follows:
Carrying out the operations indicated in Equations (11) and (12) yields the following NIDE-F net of Fredholm-type for the function :
where:
The NIDE-F net represented by Equations (13) and (14) is called [24] the “1st-Level Variational Sensitivity System” (1st-LVSS) and its solution, , is called [24] the “1st-level variational function.” All of the quantities in Equations (13) and (14) are to be computed at the nominal parameter values, but the respective indication has not been shown explicitly in order to simplify the notation.
It is important to note that the 1st-LVSS is linear in the variational function . Therefore, the 1st-LVSS represented by Equation (13) can be written in matrix-vector form as follows:
where the -dimensional rectangular matrix comprises as components the quantities defined in Equation (15), while the components of the square matrix are operators (algebraic, differential, integral) defined below, for :
Note that the 1st-LVSS would need to be solved anew for each variation , , in order to determine the corresponding function , which is prohibitively expensive computationally if is a large number. The need for repeatedly solving the 1st-LVSS can be avoided if the variational function could be eliminated from appearing in the expression of the indirect-effect term defined in Equation (10). This goal can be achieved [24] by expressing the right-side of Equation (10) in terms of the solutions of the “1st-Level Adjoint Sensitivity System (1st-LASS)” to be constructed next. The construction of this 1st-LASS will be performed in a Hilbert space comprising elements of the same form as , defined on the domain . This Hilbert space is endowed with an inner product of two elements and , denoted as and defined as follows:
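A quadrature sketch of such an inner product is given below. The precise form is fixed by Equation (19); here it is assumed, as is standard for such constructions, to be the sum over components of the time integral of the pointwise products over [t0, tf], with trapezoidal quadrature as an illustrative discretization.

```python
import numpy as np

def inner_product(a, b, t_grid):
    """Assumed Hilbert-space inner product of two vector-valued functions
    sampled on t_grid:  <a, b> = sum_i  int_{t0}^{tf} a_i(t) b_i(t) dt.
    a, b: arrays of shape (n_components, len(t_grid))."""
    prod = np.sum(a * b, axis=0)                 # pointwise sum over components
    dt = np.diff(t_grid)
    return float(np.sum(0.5 * (prod[1:] + prod[:-1]) * dt))  # trapezoid rule

t_grid = np.linspace(0.0, 1.0, 1001)
a = np.vstack([np.ones_like(t_grid), t_grid])    # components (1, t)
b = np.vstack([t_grid, np.ones_like(t_grid)])    # components (t, 1)
ip = inner_product(a, b, t_grid)                 # int(t) + int(t) over [0,1]
```

For the linear integrands above the trapezoid rule is exact, so the discrete inner product equals the analytic value 1/2 + 1/2 = 1.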
The next step is to construct the inner product of Equation (13) with a vector , where the superscript “(1)” indicates “1st-Level”, to obtain the following relationship:
The terms appearing in Equation (20) are to be computed at the nominal values but the respective notation has been omitted for simplicity.
Using the definition of the adjoint operator in , the term on the left-side of Equation (20) is integrated by parts and the order of summations is reversed to obtain the following relation:

where the operator denotes the formal adjoint of the operator and where represents the scalar-valued bilinear concomitant evaluated on the boundary and/or . Note that the matrix-valued operator acts linearly on the vector . The “star” superscript (*) will be used in this work to denote “formal adjoint operator.”
It follows from Equations (20) and (21) that the following relation holds:
The term on the left-side of Equation (22) is now required to represent the indirect effect term defined in Equation (10) by imposing the following relation:
Using Equations (22) and (23) in Equation (10) yields the following expression for the indirect effect term:
The boundary conditions accompanying Equation (23) for the function are now chosen at the time values and/or so as to eliminate all unknown values of the 1st-level variational function from the bilinear concomitant which remain after implementing the initial conditions provided in Equation (2). These boundary conditions for the function can be represented in operator form as follows:
The Fredholm-like NIDE net represented by Equations (23) and (25) will be called the “1st-Level Adjoint Sensitivity System” and its solution, , will be called the “1st-level adjoint sensitivity function.” The 1st-LASS is solved using the nominal/optimal values for the parameters and for the function , but this fact has not been indicated explicitly in order to simplify the notation. Notably, the 1st-LASS is independent of any parameter variations, so it needs to be solved just once to obtain the 1st-level adjoint sensitivity function . The 1st-LASS is linear in but is, in general, nonlinear in .
Adding the result obtained in Equation (24) for the indirect-effect term to the result obtained in Equation (9) for the direct-effect term yields the following expression for the first-order G-differential of the response :
where denotes the first-order sensitivity of the response with respect to the components of the “feature.” Each sensitivity is obtained by identifying the expression that multiplies the corresponding variation and can be represented formally in the following integral form:
The functions will be subsequently used for determining the exact expressions of the second-order sensitivities of the response with respect to the components of the feature function of model parameters.
In the following subsections, the detailed forms of the 1st-LASS will be provided for first-order (n=1) and, respectively, second-order (n=2) Fredholm-like NIDE.
2.1. First-Order Neural Integro-Differential Equations of Fredholm-Type (1st-NIDE-F)
The representation of the first-order neural integro-differential equations of Fredholm-type (1st-NIDE-F) is provided below, for :
The typical boundary conditions provided at (“encoder”) are as follows:

where the scalar values are known, albeit imprecisely, since they are considered to stem from experiments and/or computations. Equations (28) and (29) are customarily considered an “initial value (NIDE-F) problem,” although the independent variable t could represent some other physical entity (e.g., space, energy, etc.) rather than time.
The 1st-LVSS for the function is obtained by G-differentiating Equations (28) and (29), and has the following particular forms of Equations (13) and (14) for :
where:
The 1st-LASS is constructed by using Equation (19) to form the inner product of Equation (30) with a vector to obtain the following relationship:
Examining the structure of the left-side of Equation (33) reveals that the bilinear concomitant will arise from the integration by parts of the first term on the left-side of Equation (33), which yields the following relation:
where the bilinear concomitant has, by definition, the following expression:
The second term on the left-side of Equation (33) will be recast in its “adjoint form” by reversing the order of summations so as to transform the inner product involving the function into an inner product involving the function , as follows:
The third term on the left-side of Equation (33) is now recast in its “adjoint form” by reversing the order of summations and integrations so as to transform the inner product involving the function into an inner product involving the function , as follows:
The fourth term on the left-side of Equation (33) will be recast in its “adjoint form” by reversing the order of summations and integrations so as to transform the inner product involving the function into an inner product involving the function , as follows:
Using the results obtained in Equations (34)‒(38) in the left-side of Equation (33) yields the following relation:
The relation in Equation (39) is rearranged as follows:
The term on the right-side of Equation (40) is now required to represent the “indirect-effect” term defined in Equation (10), which is achieved by requiring the components of the function to satisfy the following system of first-order NIDE-F equations:
The relation obtained in Equation (41) is the explicit form of the relation provided in Equation (23) for the particular case when , i.e., when considering first-order neural integral equations of Fredholm-type (1st-NIDE-F).
The unknown values in the bilinear concomitant in Equation (40) are eliminated by imposing the following final-time conditions:
It follows from Equations (33)‒(42) and (31) that the indirect-effect term defined in Equation (10) has the following expression in terms of the 1st-level adjoint sensitivity function :
The first-order NIDE-F obtained in Equations (41) and (42) represents the explicit form, for the particular case n = 1, of the 1st-LASS represented in general by Equations (23) and (25). To obtain the 1st-level adjoint sensitivity function , the 1st-LASS is solved backwards in time (globally) using the nominal/optimal values for the parameters and for the function , but this fact has not been indicated explicitly in order to simplify the notation. Notably, the 1st-LASS is independent of any parameter variations, so it needs to be solved just once to obtain the 1st-level adjoint sensitivity function . The 1st-LASS is linear in but is, in general, nonlinear in .
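The backwards-in-time character of the 1st-LASS can be illustrated on a deliberately simple scalar surrogate (an illustrative assumption, not the NIDE-F net itself): for dh/dt = −λh with response R = h(tf), the formal adjoint of L = d/dt + λ is L* = −d/dt + λ, so the adjoint function ψ satisfies dψ/dt = λψ with a final-time condition at tf and is integrated from tf back to t0.

```python
import numpy as np

# Scalar surrogate problem: forward model dh/dt = -lam*h on [0, tf],
# response R = h(tf).  The adjoint function psi satisfies dpsi/dt = lam*psi
# with the FINAL-time condition psi(tf) = dR/dh(tf) = 1, so it is marched
# backwards in time, mirroring the backward solve of the 1st-LASS.
lam, tf, n = 0.5, 1.0, 4000
dt = tf / n

psi = 1.0                       # final-time condition imposed at t = tf
for _ in range(n):              # march backwards: t -> t - dt
    psi -= dt * (lam * psi)     # Euler step on dpsi/dt = lam*psi, reversed

# For this surrogate, the sensitivity of R with respect to the initial
# condition h(0) is psi(0), known analytically to be exp(-lam*tf).
analytic = float(np.exp(-lam * tf))
```

The backward march recovers ψ(0) ≈ e^(−λ·tf), the sensitivity of the response to the initial condition, without ever solving a variational (forward) system per parameter.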
Using the results obtained in Equations (43) and (9) in Equation (8) yields the following expression for the G-variation , which is seen to be linear in the variations , , in the model’s feature functions (induced by variations in the model’s primary parameters) and in the variations , , in the encoder’s initial conditions:
The expression in Equation (44) is to be satisfied at the nominal/optimal values for the respective model parameters, but this fact has not been indicated explicitly in order to simplify the notation.
Identifying in Equation (44) the expressions that multiply the variations yields the following expressions for the decoder response sensitivities with respect to the encoder’s initial conditions:
It is apparent from Equation (45) that the sensitivities are functionals of the form predicted in Equation (27). It is also apparent from Equation (45) that the sensitivities are proportional to the values of the respective component of the 1st-level adjoint function evaluated at the initial-time . This relation provides an independent mechanism for verifying the correctness of solving the 1st-LASS from to (backwards in time) since the sensitivities can be computed independently of the 1st-LASS by using finite differences of appropriately high-order in conjunction with known variations and the correspondingly induced variations in the decoder response. Special attention needs to be devoted, however, to ensure that the respective finite-difference formula is accurate, which may need several trials with different values chosen for the variation .
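The verification mechanism described above can be sketched on a scalar surrogate model (an illustrative assumption, not the NIDE-F net): the sensitivity of the response to the initial condition, written here in closed form in place of the adjoint value at the initial time, is compared against central finite differences computed with several trial step sizes, as recommended.

```python
import numpy as np

def response(w0, lam=0.5, tf=1.0, n=2000):
    """Toy stand-in for the decoder response: integrate dh/dt = -lam*h from
    h(0) = w0 with forward Euler and return R = h(tf)."""
    h, dt = float(w0), tf / n
    for _ in range(n):
        h += dt * (-lam * h)
    return h

lam, tf, w0 = 0.5, 1.0, 2.0
# For this surrogate, dR/dw0 is known in closed form; it stands in here for
# the initial-time adjoint value that the 1st-LASS route would deliver:
adjoint_route = float(np.exp(-lam * tf))

# Central differences with several trial step sizes, since the accuracy of a
# finite-difference formula must itself be checked:
fd_estimates = [(response(w0 + dw) - response(w0 - dw)) / (2.0 * dw)
                for dw in (1e-2, 1e-3, 1e-4)]
```

Agreement of the finite-difference estimates with the adjoint-route value, across several step sizes, is the independent consistency check described in the text.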
It also follows from Equations (44) and (32) that the sensitivities of the response with respect to the components of the feature function have the following expressions, written in the form of Equation (27):
where
The subscript “1” attached to the quantity indicates that this quantity refers to a “first-order” NIDE-F net, while the superscript “(1)” indicates that this quantity refers to “first-order” sensitivities.
The sensitivities with respect to the primary model parameters can be obtained by using the result shown in Equation (46) together with the “chain rule” of differentiating compound functions, as follows:
When there are only model parameters (i.e., there are no feature functions of model parameters), then for all , and the expression obtained in Equation (46) yields directly the first-order sensitivities , for all . In this case, all of the sensitivities , for all , would be obtained by computing integrals (using quadrature formulas). In contradistinction, when features of parameters can be established, only integrals would need to be computed (using quadrature formulas) to obtain the , ; the sensitivities with respect to the model parameters would subsequently be obtained analytically using the chain-rule provided in Equation (48).
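The chain-rule step amounts to one matrix-vector product: the feature sensitivities are multiplied by the Jacobian of the features with respect to the primary parameters. The two feature functions, three parameters, and feature-sensitivity values below are illustrative assumptions.

```python
import numpy as np

# Illustrative features of primary parameters alpha = (a1, a2, a3):
#   F1 = a1*a2,  F2 = a3**2,
# i.e., two feature functions of three primary parameters/weights.
def feature_jacobian(alpha):
    """Analytic Jacobian dF_i/dalpha_j (shape 2 x 3)."""
    a1, a2, a3 = alpha
    return np.array([[a2,  a1,  0.0],
                     [0.0, 0.0, 2.0 * a3]])

# Suppose the adjoint computation delivered the two feature sensitivities
# dR/dF_i (illustrative numbers standing in for the quadratures of Eq. (46)):
dR_dF = np.array([3.0, -1.5])

alpha0 = np.array([1.0, 2.0, 0.5])
# Chain rule:  dR/dalpha_j = sum_i (dR/dF_i) * (dF_i/dalpha_j)
dR_dalpha = dR_dF @ feature_jacobian(alpha0)
```

Note that only the two feature sensitivities require quadratures; the three (or, in general, many more) parameter sensitivities then follow analytically from the product above, which is the computational advantage emphasized in the text.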
Occasionally, the boundary conditions may be provided through a measurement at the boundary (“decoder”), as follows:
where the scalar values are known, albeit imprecisely, since they are considered to stem from experiments and/or computations. In such a case, the determination of the first-order sensitivities of the response with respect to the components of the feature function follows the same steps as in Section 2.1.2, above, yielding the following results:
- (i)
The 1st-LASS will become an “initial value problem” comprising Equation (41), subject not to the conditions shown in Equation (42) but to the following “initial conditions”:
- (ii)
The sensitivities of the response with respect to the components of the feature function will have the same formal expressions as in Equation (46) but the components of the 1st-level adjoint function will be the solution of Equations (41) and (50).
- (iii)
The sensitivities of the response with respect to the boundary conditions at will have the following expressions:
2.2. Second-Order Neural Integro-Differential Equations of Fredholm-Type (2nd-NIDE-F)
The representation of the second-order neural integro-differential equations of Fredholm-type (2nd-NIDE-F) is provided below, for :
There are several combinations of boundary conditions that can be provided, either for the function and/or for its first-derivative , , at either (encoder) or at (decoder), or a combination thereof. For illustrative purposes, consider that the boundary conditions are as follows:
The 1st-LVSS is obtained by taking the G-variations of Equations (52) and (53) to obtain the following system, comprising the forms taken on for by Equations (13) and (14), respectively:
where for and :
The 1st-LASS is constructed by using Equation (19) to form the inner product of Equation (54) with a vector to obtain the following relationship:
Examining the structure of the left-side of Equation (57) reveals that the bilinear concomitant will arise from the integration by parts of the first and third terms on the left-side of Equation (57), as follows:
where the bilinear concomitant has the following expression:
The remaining terms on the left-side of Equation (57) will be recast into their corresponding “adjoint form” by using the results obtained in Equations (34)‒(38). Using these results together with the results obtained in Equations (58) and (59) yields the following expression for the left-side of Equation (57):
Using Equation (58) and rearranging the terms on the right-side of Equation (60) yields the following relation:
The term on the right-side of Equation (61) is now required to represent the “indirect-effect” term defined in Equation (10), which is achieved by requiring the components of the function to satisfy the following 1st-LASS:
The relation obtained in Equation (62) is the explicit form of the relation provided in Equation (23) for the particular case when , i.e., when considering second-order neural integro-differential equations of Fredholm-type (2nd-NIDE-F).
The unknown values involving the function
in the bilinear concomitant
defined in Equation (59) are eliminated by imposing the following conditions:
It follows from Equations (33)‒(42) and (31) that the indirect-effect term defined in Equation (10) has the following expression in terms of the 1st-level adjoint sensitivity function
:
where the boundary quantity
contains the known remaining terms after having implemented the known boundary conditions given in Equations (55) and (63), and has the following explicit expression:
Using the results obtained in Equations (64), (65), (56) and (9) in Equation (8) yields the following expression for the G-variation
, which is seen to be linear in the variations
,
(
) and
(
):
The expression in Equation (66) is to be satisfied at the nominal/optimal values for the respective model parameters, but this fact has not been indicated explicitly in order to simplify the notation.
It also follows from Equations (66) and (56) that the sensitivities
of the response
with respect to the components
of the feature function
have the following expressions, written in the form of Equation (27):
where
The subscript “2” attached to the quantity indicates that this quantity refers to a “second-order” NIDE-F net, while the superscript “(1)” indicates that this quantity refers to “first-order” sensitivities. As expected, the expression of reduces to the expression of when the “second-order NIDE-F net” reduces to the “first-order NIDE-F net” in the case when .
Identifying in Equation (66) the expressions that multiply the variations
yields the following expressions for the decoder response sensitivities with respect to the encoder’s initial-time conditions:
Identifying in Equation (66) the expressions that multiply the variations
yields the following expressions for the decoder response sensitivities with respect to the final-time conditions:
If the boundary conditions imposed on the forward functions and/or the first-derivatives , , differ from the illustrative ones selected in Equation (53), then the corresponding boundary conditions for the 1st-level adjoint function would also differ from the ones shown in Equation (63), as would be expected. The components of would consequently have different values; therefore, all of the first-order sensitivities would have values different from those computed using Equation (68), even though the formal mathematical expressions of the respective sensitivities would remain unchanged. Of course, the sensitivities and would have expressions that would differ from those in Equations (69) and (70), respectively, if the boundary conditions in Equation (53), and consequently those in Equation (63), were different, since the residual bilinear concomitant would have a different expression from that shown in Equation (65).
3. Second-Order Features Adjoint Sensitivity Analysis Methodology for Neural Integro-Differential Equations of Fredholm Type (2nd-FASAM-NIDE-F)
The second-order sensitivities of the response
defined in Equation (5) will be computed by conceptually using their basic definitions as being the “first-order sensitivities of the first-order sensitivities.” Recall that the generic expression of the first-order sensitivities,
,
, of the response with respect to the components of the feature function
is provided in Equation (46). It follows that the second-order sensitivities of the response with respect to the components of the feature function will be provided by the first-order G-differential
of
, which is by definition obtained as follows:
where the indirect-effect term
comprises all dependencies on the vectors
and
of variations in the state functions
and
, around the respective nominal values denoted as
and
, respectively, which are computed at the nominal parameter values
. This indirect-effect term is defined as follows:
The variational function
is the solution of the system of equations obtained by G-differentiating the 1st-LASS defined in Equations (23) and (25), which is by definition obtained as follows:
Carrying out the operations indicated in Equations (73) and (74) yields the following relations:
For subsequent derivations, it is convenient to represent the relations in Equation (75) in matrix-vector form, as follows:
where
As indicated by Equation (78), the variational functions
and
are the solutions of the system of matrix equations obtained by concatenating the 1st-LVSS defined by Equations (14) and (16) with Equations (77) and (78). The concatenated system thus obtained will be called the 2nd-Level Variational Sensitivity System (2nd-LVSS) and has the block-matrix form provided below:
To distinguish block-matrices from block-vectors, two bold capital-letters have been used (and will henceforth be used) to denote block-matrices, as in the case of “the second-level variational matrix” . The “2nd-level” is indicated by the superscript “(2)”. The argument “”, which appears in the list of arguments of , indicates that this matrix is a -dimensional block-matrix comprising four submatrices, each of dimensions . The structure of the block-matrix is provided below:
The argument “2” which appears in the list of arguments of the vector
and of the “variational vector”
in Equation (80) indicates that each of these vectors is a 2-block column vector, each block comprising a column-vector of dimension
; the vectors
and
are defined as follows:
The 2-block vector
is defined as follows:
The 2-block column vector in Equation (81) represents the concatenated boundary/initial conditions provided in Equations (14) and (77), evaluated at the nominal parameter values. The argument “2” in the expression in Equation (81) indicates that this expression is a two-block column vector comprising two vectors, each of which has -components, all of which are zero-valued.
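Although the operators entering the 2nd-LVSS are problem-specific, the block structure described above can be sketched numerically as follows (NumPy; every submatrix is a placeholder stand-in, not one of the actual G-differentiated operators):

```python
import numpy as np

# Sketch of the 2nd-LVSS block structure; every submatrix below is a
# placeholder (the actual blocks are the problem-specific G-derivatives).
n = 3  # dimension of each block

A11 = np.eye(n)              # block acting on the first variational vector
A21 = 0.5 * np.ones((n, n))  # coupling block from the G-differentiated 1st-LASS
A22 = np.eye(n)

# "Second-level variational matrix": (2n) x (2n), built from four n x n blocks.
VM = np.block([[A11, np.zeros((n, n))],
               [A21, A22]])

# 2-block variational vector: two concatenated n-dimensional column vectors.
v = np.concatenate([np.ones(n), 2.0 * np.ones(n)])

# 2-block boundary/initial-condition vector: two n-component zero vectors.
bc = np.zeros(2 * n)

assert VM.shape == (2 * n, 2 * n)
assert v.shape == (2 * n,) and np.all(bc == 0.0)
```

The block lower-triangular coupling shown here is only illustrative of how two n-dimensional systems concatenate into one 2n-dimensional system.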
The need for solving the 2nd-LVSS is circumvented by deriving an alternative expression for the indirect-effect term
defined in Equation (72), in which the function
is replaced by a 2nd-level adjoint function that is independent of variations in the model parameter and state functions. This 2nd-level adjoint function will be the solution of a 2nd-Level Adjoint Sensitivity System (2nd-LASS), which will be constructed by using the same principles as employed for deriving the 1st-LASS. The 2nd-LASS is constructed in a Hilbert space
,
, comprising block-vectors having the same structure as
that can generically be represented as follows:
, with
, for
. The Hilbert space
is endowed with the following inner product of two vectors
and
:
The inner product defined in Equation (85) will be used to construct the 2nd-Level Adjoint Sensitivity System (2nd-LASS) for a 2nd-level adjoint function
,
,
, by implementing the following sequence of steps, which are conceptually similar to those implemented in
Section 2 for constructing the 1st-FASAM-NIDE-F methodology:
Using Equation (85), construct the inner product of the yet undetermined function
with Equation (80) to obtain the following relation:
Use the definition of the operator adjoint to
in the Hilbert space
to transform the inner product on the left-side of Equation (86) as follows:
where the quantity
denotes the corresponding bilinear concomitant on the domain’s boundary, evaluated at the nominal values for the parameters and respective state functions, and where the operator
denotes the formal adjoint of the matrix-valued operator
, comprising
block-matrices, each of dimensions
, having the following block-matrix structure:
-
Require the inner product on the right-side of Equation (87) to represent the indirect-effect term
defined in Equation (72) by imposing the following relation:
where
Since the source-term on the right-side of Equation (89) is a distinct quantity for each value of the index , this index has been added to the list of arguments of the function in order to emphasize that a distinct function will correspond to each index . Of course, the adjoint operator that acts on the function is independent of the index and could, in principle, be inverted just once and stored for subsequent repeated applications to the -dependent source terms for computing the corresponding functions .
-
The definition of the function
is completed by requiring it to satisfy adjoint boundary/initial conditions represented in operator form as follows:
The boundary/initial conditions represented by Equation (91) are determined by imposing the following requirements:
(a) they must be independent of unknown values of ;
(b) the substitution of the boundary and/or initial conditions represented by Equations (81) and (91) into the expression of the bilinear concomitant must cause all terms containing unknown boundary/initial values of to vanish.
The NIDE-F net comprising Equations (89) and (91) is called the “2nd-Level Adjoint Sensitivity System (2nd-LASS)” and its solution, , , is called the “2nd-level adjoint sensitivity function.” The unique properties of the 2nd-LASS will be highlighted below.
Using in Equation (72) the relations defining 2nd-LASS together with the 2nd-LVSS and the relation provided in Equation (87) yields the following alternative expression for the indirect-effect term, involving the 2nd-level adjoint sensitivity function
instead of the 2nd-level variational function
:
where
denotes known residual (non-zero) boundary terms which may not have vanished after having used the boundary and/or initial conditions represented by Equations (81) and (91).
Replacing the expression obtained in Equation (92) into Equation (71) yields the following expression:
The expressions of the second-order sensitivities of the response with respect to the components of the feature function are obtained by performing the following sequence of operations:
- (i)
Use Equation (84) to recast the second term on the right-side of Equation (93) as follows:
- (ii)
Recall that
, where the quantities
were defined in Equation (15). Recall that
where the quantities
were defined in Equation (76). Insert these expressions in Equation (94) to obtain the following relation:
- (iii)
Insert into Equation (93) the equivalent expression obtained in Equation (95), and subsequently identify the quantities that multiply the variations
, to obtain the following expression for the second-order sensitivities
:
It is important to note that the 2nd-LASS is independent of parameter variations and of variations in the respective state functions. It is also important to note that the -dimensional matrix is independent of the index . Only the source-term depends on the index . Therefore, the same solver can be used to invert the matrix in order to solve numerically the 2nd-LASS for each -dependent source, thereby obtaining the corresponding -dependent -dimensional 2nd-level adjoint function . Computationally, it would be most efficient to store, if possible, the inverse matrix , and to multiply it directly with the corresponding source term , for each index , to obtain the corresponding -dependent -dimensional 2nd-level adjoint function .
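The invert-once/reuse strategy described above can be sketched as follows (NumPy; the operator and the index-dependent source terms are random stand-ins for the actual adjoint operator and sources):

```python
import numpy as np

rng = np.random.default_rng(42)
n = 40

# Stand-in for the discretized adjoint operator (well-conditioned by construction).
A = np.eye(n) + 0.1 * rng.standard_normal((n, n))

# One distinct source term per index i (stand-ins for the i-dependent sources).
sources = [rng.standard_normal(n) for _ in range(6)]

# Invert (or factorize) the operator once ...
A_inv = np.linalg.inv(A)

# ... and apply it repeatedly to each i-dependent source term.
adjoint_functions = [A_inv @ s for s in sources]

# Each result solves the corresponding adjoint system A psi = s.
for psi, s in zip(adjoint_functions, sources):
    assert np.allclose(A @ psi, s)
```

In practice an LU factorization would typically be stored instead of the explicit inverse, but the computational pattern (one expensive factorization, many cheap applications) is the same.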
Since the adjoint matrix is block-diagonal, solving the 2nd-LASS is equivalent to solving two 1st-LASS, with two different source terms. Thus, the “solvers” and the computer program used for solving the 1st-LASS can also be used for solving the 2nd-LASS. The 2nd-LASS was designated as the “second-level” rather than the “second-order” adjoint sensitivity system, since the 2nd-LASS does not involve any explicit 2nd-order G-derivatives of the operators underlying the original system but involves the inversion of the same operators that need to be inverted for solving the 1st-LASS.
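The equivalence between one block-diagonal 2nd-LASS solve and two independent 1st-LASS solves can be verified on a small stand-in system (the matrix below is a hypothetical discretized 1st-LASS operator, not the model's actual operator):

```python
import numpy as np

rng = np.random.default_rng(0)
n = 30

# Stand-in for the (discretized) 1st-LASS operator.
A = np.eye(n) + 0.1 * rng.standard_normal((n, n))

# Block-diagonal 2nd-LASS operator: two copies of the 1st-LASS operator.
Z = np.zeros((n, n))
A2 = np.block([[A, Z], [Z, A]])

# Two different source terms, concatenated into one 2-block right-hand side.
s1, s2 = rng.standard_normal(n), rng.standard_normal(n)

# Solving the 2nd-LASS in one shot ...
z = np.linalg.solve(A2, np.concatenate([s1, s2]))

# ... is equivalent to two independent 1st-LASS solves with different sources.
z1, z2 = np.linalg.solve(A, s1), np.linalg.solve(A, s2)
assert np.allclose(z, np.concatenate([z1, z2]))
```

This is why the same solver and computer program used for the 1st-LASS suffice for the 2nd-LASS.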
If the 2nd-LASS is solved -times, the 2nd-order mixed sensitivities will be computed twice, in two different ways, in terms of two distinct 2nd-level adjoint functions. Consequently, the symmetry property provides an intrinsic (numerical) verification that the 1st-level adjoint function and the components of the 2nd-level adjoint function and are computed accurately.
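The symmetry-based verification can be illustrated on a hypothetical two-parameter response (a stand-in for the actual decoder response): the mixed second-order sensitivity is computed in both differentiation orders and the two results are compared.

```python
import numpy as np

def response(w1, w2):
    # Hypothetical two-parameter response (stand-in for the decoder response).
    return np.exp(-w1) * np.sin(w2) + w1 * w2 ** 2

def mixed(f, a, b, h=1e-4, order="12"):
    # Central finite differences for the mixed second derivative,
    # taken in either of the two possible orders of differentiation.
    if order == "12":
        dfdb = lambda x: (f(x, b + h) - f(x, b - h)) / (2 * h)
        return (dfdb(a + h) - dfdb(a - h)) / (2 * h)
    dfda = lambda y: (f(a + h, y) - f(a - h, y)) / (2 * h)
    return (dfda(b + h) - dfda(b - h)) / (2 * h)

s12 = mixed(response, 0.7, 1.3, order="12")  # d/dw1 of (dR/dw2)
s21 = mixed(response, 0.7, 1.3, order="21")  # d/dw2 of (dR/dw1)

# The two orders agree, mirroring the intrinsic symmetry-based verification.
assert abs(s12 - s21) < 1e-6

# Cross-check against the analytic mixed derivative.
analytic = -np.exp(-0.7) * np.cos(1.3) + 2 * 1.3
assert abs(s12 - analytic) < 1e-4
```

In the adjoint setting the two computations use two distinct 2nd-level adjoint functions rather than two finite-difference orderings, but the agreement criterion is the same.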
The second-order sensitivities of the decoder-response with respect to the optimal weights/parameters
, are obtained analytically by using the chain rule in conjunction with the expressions obtained in Equations (46) and (96), as follows:
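The second-order chain rule invoked here can be sketched on hypothetical (illustrative, not the model's) feature and response functions, verifying the chain-rule Hessian against a finite-difference Hessian of the composite response:

```python
import numpy as np

# Hypothetical feature function h(alpha) and response R(h).
def h(a):
    return np.array([a[0] * a[1], a[0] + a[1] ** 2])

def R(hv):
    return hv[0] ** 2 * hv[1]

def g(a):  # response as a function of the primary parameters
    return R(h(a))

a0 = np.array([0.8, 1.2])
h0 = h(a0)

# Analytic ingredients of the chain rule for these illustrative functions.
dR_dh = np.array([2 * h0[0] * h0[1], h0[0] ** 2])          # gradient of R w.r.t. h
HR = np.array([[2 * h0[1], 2 * h0[0]], [2 * h0[0], 0.0]])  # Hessian of R w.r.t. h
J = np.array([[a0[1], a0[0]], [1.0, 2 * a0[1]]])           # Jacobian dh/dalpha
Hh1 = np.array([[0.0, 1.0], [1.0, 0.0]])                   # Hessian of h1
Hh2 = np.array([[0.0, 0.0], [0.0, 2.0]])                   # Hessian of h2

# Second-order chain rule:
#   d^2R/(da_i da_j) = J^T (HR) J + sum_k (dR/dh_k) * Hessian(h_k)
H_chain = J.T @ HR @ J + dR_dh[0] * Hh1 + dR_dh[1] * Hh2

# Finite-difference Hessian of the composite response, for verification.
eps = 1e-5
H_fd = np.zeros((2, 2))
for i in range(2):
    for j in range(2):
        e_i, e_j = np.eye(2)[i] * eps, np.eye(2)[j] * eps
        H_fd[i, j] = (g(a0 + e_i + e_j) - g(a0 + e_i - e_j)
                      - g(a0 - e_i + e_j) + g(a0 - e_i - e_j)) / (4 * eps ** 2)

assert np.allclose(H_chain, H_fd, atol=1e-4)
```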
4. Illustrative Application of the 1st-FASAM-NIDE-F and 2nd-FASAM-NIDE-F Methodologies to a Heat Transfer Model
The application of the 1st-FASAM-NIDE-F Methodology will be illustrated in this Section by considering a model of linear steady-state heat conduction through a homogeneous slab of thickness
, having a constant thermal conductivity denoted as
and involving a distributed heat source that is proportional to the temperature distribution within the slab; the proportionality constant is denoted as
. The slab is considered to be insulated on one side, which is held at a temperature
. The temperature distribution within the slab, denoted as
, is thus modeled by the following linear heat conduction equation:
Consider that the model response of interest, denoted as
, is the average temperature within the slab, which is defined as follows:
The model’s primary parameters are
, which can be subject to uncertainties, but their nominal/optimal values
are considered to be known. These parameters are considered to be components of the following (column) “vector of model parameters”:
The solution of Equation (98) has the following expression:
The quantity
is a “feature function” of the primary model parameters. Using in Equation (99) the result obtained in Equation (101) yields the following closed form expression for the model response:
At the nominal parameter values, the nominal value of the temperature distribution and of the average temperature response, respectively, have the following expressions:
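Since the displayed equations are not reproduced here, the following sketch assumes a concrete form of the paradigm model, namely k*T''(x) + q*T(x) = 0 on [0, L] with T(0) = T0 and T'(0) = 0 (insulated side held at T0), whose solution is T(x) = T0*cos(x*sqrt(q/k)); these assumptions are illustrative only:

```python
import numpy as np

# Illustrative-only assumptions: k*T''(x) + q*T(x) = 0 on [0, L], with
# T(0) = T0 and T'(0) = 0, so that T(x) = T0*cos(x*sqrt(q/k)) and the
# feature of the primary parameters is f = q/k.
k, q, L, T0 = 2.0, 1.0, 1.0, 300.0
f = q / k
omega = np.sqrt(f)

x = np.linspace(0.0, L, 2001)
T = T0 * np.cos(omega * x)  # closed-form temperature distribution

# Response: average temperature over the slab (composite trapezoidal rule).
dx = x[1] - x[0]
R_numeric = dx * (T[0] / 2 + T[1:-1].sum() + T[-1] / 2) / L

# Closed-form response: R = T0*sin(L*omega)/(L*omega).
R_exact = T0 * np.sin(L * omega) / (L * omega)
assert abs(R_numeric - R_exact) < 1e-4
```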
4.1. Applying the 1st-FASAM-NIDE-F Methodology to Obtain the First-Order Response Sensitivities to the Primary Model Parameters
The heat conduction equation presented in Equation (98) can be recast into the following equivalent NIDE-F form:
The first-order sensitivities of the response
will be determined from the first-order Gateaux- (G-) differential, denoted as
, of
, which is obtained by applying the definition of the G-differential to Equation (99), as follows:
The variation
is the solution of the 1st-Level Variational Sensitivity System (1st-LVSS) obtained by G-differentiating Equation (105), which yields the following NIDE-F for arbitrary variations
and
around the nominal values
:
Performing the operations indicated in Equation (107) yields the following form for the 1st-LVSS:
For subsequent reference, it is noted that the solution of the above 1st-LVSS has the following expression:
The 1st-LVSS would need to be solved repeatedly, using every possible parameter variation, in order to determine the corresponding value of the temperature variation . These repeated computations can be avoided by eliminating the appearance of the variation in Equation (106); this aim can be achieved by deriving an alternative expression for the response variation that would not involve the variation . This alternative expression for will be constructed in terms of the first-level adjoint function, which is, in turn, obtained as the solution of the 1st-Level Adjoint Sensitivity System (1st-LASS) to be constructed next by using the inner product defined in Equation (10) for the single-component function . Forming the inner product of Equation (109) with a yet undefined function yields the following relation:
The relation obtained in Equation (111) is satisfied at the nominal/optimal parameter values but this fact has not been explicitly indicated in order to simplify the notation. Integrating by parts the first term on the left-side of Equation (111) and rearranging the second term on the left-side of Equation (111) yields the following relation:
The function
will now be determined as follows: (i) require that the last term on the right-side of Equation (112) be identical to the G-differential
defined in Equation (106); and (ii) eliminate the unknown quantity
in Equation (112). These requirements lead to the following NIDE-F for the function
:
The NIDE-F-net represented by Equations (113) and (114) constitutes the 1st-Level Adjoint Sensitivity System (1st-LASS) for the 1st-level adjoint sensitivity function . The 1st-LASS is satisfied at the nominal parameter values but this fact has not been explicitly indicated in order to simplify the notation.
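The computational rationale for the 1st-LASS (one adjoint solve replacing one variational solve per parameter variation) can be sketched on a generic linear algebraic stand-in A(p)u = b with response r = c^T u; the matrices below are hypothetical, not the model's operators:

```python
import numpy as np

# Generic sketch of the adjoint trick: for A(p) u = b with response r = c^T u,
# solve the single adjoint system A^T lam = c and use dr/dp_j = -lam^T (dA/dp_j) u,
# instead of one variational solve per parameter.
rng = np.random.default_rng(1)
n, n_params = 20, 5

A0 = np.eye(n) + 0.1 * rng.standard_normal((n, n))
dA = [0.01 * rng.standard_normal((n, n)) for _ in range(n_params)]  # dA/dp_j
b = rng.standard_normal(n)
c = rng.standard_normal(n)

def response(p):
    A = A0 + sum(pj * dAj for pj, dAj in zip(p, dA))
    return c @ np.linalg.solve(A, b)

p0 = np.zeros(n_params)
u = np.linalg.solve(A0, b)      # one forward solve
lam = np.linalg.solve(A0.T, c)  # one adjoint solve

# All n_params sensitivities follow from the single adjoint solution.
sens_adj = np.array([-lam @ (dAj @ u) for dAj in dA])

# Finite-difference verification (this route needs n_params extra solves).
eps = 1e-6
sens_fd = np.array([(response(eps * np.eye(n_params)[j]) - response(p0)) / eps
                    for j in range(n_params)])
assert np.allclose(sens_adj, sens_fd, atol=1e-5)
```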
Using Equations (111)‒(114) in conjunction with Equation (106) yields the following alternative expression for the G-differential in terms of :
Using in Equation (115) the expression provided for
in Equation (108) and identifying the expressions that multiply the variations
and
yields the following expressions for the first-order sensitivities of the response with respect to the initial condition
and the feature function
, respectively:
The expressions obtained in Equations (116) and (117) can be evaluated after having determined the 1st-level adjoint sensitivity function
. Also, these expressions are to be evaluated using the nominal/optimal parameter values, but this fact has not been explicitly indicated in order to simplify the notation. Notably, the 1st-LASS is independent of parameter variations, so it needs to be solved only once to determine
. The closed-form explicit expression of the solution of the 1st-LASS represented by Equations (113) and (114) is provided below:
Using the expression obtained in Equation (118) into Equations (116) and (117), respectively, and performing the respective integrations yields the following closed-form expressions:
As expected, the expressions obtained in Equations (119) and (120) coincide with the expressions that would be obtained by the direct differentiation of the expression for the model response obtained in Equation (102) with respect to and , respectively. Of course, the closed-form exact expression for the model response in terms of the model’s primary parameters and/or feature functions is not available in practice.
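Both the verification against direct differentiation and the chain-rule conversion to primary-parameter sensitivities can be sketched numerically, assuming (hypothetically) the feature f = q/k and the closed-form response R(f) = T0*sin(L*sqrt(f))/(L*sqrt(f)):

```python
import numpy as np

# Hypothetical closed-form ingredients (illustrative only): feature f = q/k
# and response R(f) = T0*sin(L*sqrt(f))/(L*sqrt(f)).
T0, L = 300.0, 1.0

def R_of_f(f):
    w = np.sqrt(f) * L
    return T0 * np.sin(w) / w

k0, q0 = 2.0, 1.0
f0 = q0 / k0
eps = 1e-6

# First-order sensitivity with respect to the feature (central difference,
# standing in for the adjoint-based expression).
dR_df = (R_of_f(f0 + eps) - R_of_f(f0 - eps)) / (2 * eps)

# Chain rule: f = q/k, so df/dq = 1/k and df/dk = -q/k**2.
dR_dq_chain = dR_df / k0
dR_dk_chain = dR_df * (-q0 / k0 ** 2)

# Direct finite differences in the primary parameters, for verification.
dR_dq_fd = (R_of_f((q0 + eps) / k0) - R_of_f((q0 - eps) / k0)) / (2 * eps)
dR_dk_fd = (R_of_f(q0 / (k0 + eps)) - R_of_f(q0 / (k0 - eps))) / (2 * eps)

assert abs(dR_dq_chain - dR_dq_fd) < 1e-5
assert abs(dR_dk_chain - dR_dk_fd) < 1e-5
```

Only the assumed closed forms are hypothetical; the chain-rule pattern itself is the one invoked in the text.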
The sensitivities of the model response with respect to the primary parameters are obtained by using the result obtained in Equation (120) in conjunction with the following “chain-rule of differentiation”:
4.2. Applying the 1st-FASAM-NODE Methodology to Obtain the First-Order Response Sensitivities to the Primary Model Parameters
The traditional form of the heat conduction model provided in Equation (98) is a neural ordinary differential equation (NODE) which can be analyzed directly by using the “First-Order Features Adjoint Sensitivity Analysis Methodology for Neural Ordinary Differential Equations (1st-FASAM-NODE)” introduced by Cacuci [
25]. The G-differential of Equation (98) yields the following 1st-LVSS in NODE-form satisfied by the temperature variation
:
The 1st-LASS corresponding to the above 1st-LVSS is obtained by implementing the same steps as outlined in the previous Subsection, by constructing the inner-product of a yet undetermined function
with Equation (123) to obtain the following relation:
The relation obtained in Equation (125) is to be evaluated at the nominal/optimal parameter values but this fact has not been explicitly indicated in order to simplify the notation.
Integrating by parts the first term on the left-side of Equation (125) yields the following relation:
Identifying the last term on the right-side of Equation (126) with the G-differential provided in Equation (106), using the conditions provided in Equation (124) and eliminating the unknown boundary values on the right-side of Equation (126) yields the following expression for the G-differential in terms of the function :
where the 1st-level adjoint sensitivity function
is the solution of the following 1st-Level Adjoint Sensitivity System (1st-LASS):
Identifying the quantities that multiply the variations
and
in Equation (127) yields the following expressions for the sensitivities of the model response with respect to
and
:
The 1st-LASS can be readily solved to obtain the following expression for the 1st-level adjoint sensitivity function
:
Using in Equations (130) and (131) the expression for
obtained above yields the following expressions:
All of the results obtained in Equations (125)‒(134) are to be evaluated at the nominal parameter values but this fact has not been explicitly indicated in order to simplify the notation.
4.3. Comparison: Applying the 1st-FASAM-NODE Methodology Versus Applying the 1st-FASAM-NIDE-F Methodology
In cases where the model can be equivalently expressed in either NODE or NIDE-F form, as shown in Equation (98) or Equation (105), respectively, it is important to highlight the similarities and differences between applying the 1st-FASAM-NODE methodology versus applying the 1st-FASAM-NIDE-F methodology for determining the first-order response sensitivities to the underlying model parameters. Evidently, the final results obtained in Equations (133) and (134) by treating the heat conduction model as a NODE, cf. Equation (98), are identical to the corresponding results obtained in Equations (119) and (120) by having treated the heat conduction model as a NIDE-F, cf. Equation (105). Furthermore, even though the form of the 1st-LVSS produced by the NODE methodology, namely Equations (123) and (124), differs from the form of the 1st-LVSS produced by the NIDE-F methodology, namely Equations (108) and (109), the solutions to these 1st-LVSS are identical to each other, having the expression provided in Equation (110).
However, the 1st-LASS corresponding to the NODE heat conduction model differs from the 1st-LASS corresponding to the NIDE-F heat conduction model, so that the corresponding 1st-level adjoint sensitivity function for the NODE-model, namely Equation (132), differs from the 1st-level adjoint sensitivity function for the NIDE-F heat conduction model, which is provided in Equation (118). Consequently, the expressions obtained in terms of the respective 1st-level adjoint sensitivity functions of the sensitivities of the model response with respect to the primary parameters and feature function for the NODE-representation, namely Equations (130) and (131), differ from those obtained for the NIDE-F representation, namely Equations (116) and (117). The structure of the 1st-LASS and expressions for sensitivities appear to be simpler in the NODE-representation than in the NIDE-F representation, but the choice of representation/framework will be largely influenced by the neural-net software available to the individual user.
4.4. Illustrative Application of the 2nd-FASAM-NIDE-F Methodology Versus the 2nd-FASAM-NODE Methodology for Computing the Second-Order Response Sensitivities to Model Features and Parameters
The general principles underlying the 2nd-FASAM-NIDE-F methodology presented in
Section 3 will be applied to the paradigm heat conduction model considered in this Section in order to highlight the salient issues arising when applying this methodology to determine the second-order sensitivities of model responses to model features and parameters.
4.4.1. Application of the 2nd-FASAM-NIDE-F methodology
When applying the 2nd-FASAM-NIDE-F, the second-order sensitivities arise from the first-order sensitivities obtained in Equations (116) and (117). Thus, the second-order sensitivities arising from Equation (116) are provided by its G-differential for arbitrary variations around the nominal parameter and function values (indicated by the use of the superscript “zero”). Using in Equation (116) the result obtained in Equation (118) and applying the definition of the G-differential to the resulting expression yields the relation below:
where the expressions for the above direct-effect and, respectively, indirect-effect terms are obtained as shown below:
The direct-effect term can be evaluated immediately. The indirect-effect term depends on the variational function
, which is the solution of the G-differentiated 1st-LASS, comprising Equations (113) and (114), obtained by definition as follows:
Performing the operations indicated in Equation (138) yields the following NIDE-F, to be evaluated at the nominal parameter values:
Since the indirect-effect term only depends on the variational function
but does not depend on the variational function
, the relations presented in Equations (139) and (140) constitute the 2nd-LVSS for the function
, which is dependent on parameter variations and would need to be solved anew for each parameter variation of interest. The need for computing
can be avoided by expressing the indirect-effect term defined by Equation (137) in terms of a 2nd-level adjoint sensitivity function that is independent of parameter variations. This adjoint function will be denoted as
, where the argument “1” indicates that this adjoint function corresponds to the first-order sensitivity
, which was chosen in this case to be the “first” 1st-order sensitivity to be considered. The 2nd-LASS to be satisfied by
will be constructed by applying the 2nd-FASAM-NIDE-F, which commences by forming the inner product of
with Equation (140), to obtain the following relation:
Integrating by parts the first term on the left-side of Equation (141) and reversing the order of integrations in the remaining terms yields the following relation:
The first term on the left-side of Equation (142) is now required to represent the indirect-effect term defined in Equation (137) to obtain the relation below:
The unknown quantity is eliminated from Equation (142) by imposing the following condition:
Replacing the results obtained in Equations (139), (143) and (144) into Equation (142) yields the following alternative expression for the indirect-effect term:
where the 2nd-level adjoint sensitivity function
is the solution of the 2nd-Level Adjoint Sensitivity System (2nd-LASS) comprising Equations (143) and (144). The 2nd-LASS is a NIDE-F net that does not depend on parameter variations and needs to be solved once only at the nominal parameter values; its solution,
, is used in Equation (145).
Adding the expressions obtained in Equations (145) and (136) yields the following expression:
It follows from Equation (146) that:
The 2nd-LASS represented by Equations can be solved to obtain the following closed-form expression, to be evaluated at the nominal parameter values, for its solution:
Inserting the above expression for
into Equation (147) and performing the respective integrations yields the following closed-form expression for the mixed second-order sensitivity:
The validity of the above expression can be readily verified by taking the appropriate derivative of either of the first-order sensitivities provided in Equations (133) and (134).
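This verification can be sketched numerically, assuming (hypothetically) the closed-form response R(q, k) = T0*sin(L*sqrt(q/k))/(L*sqrt(q/k)) as a stand-in for the model, and differentiating either first-order sensitivity to obtain the mixed second-order sensitivity:

```python
import numpy as np

# Verifying a mixed second-order sensitivity by differentiating either of the
# two first-order sensitivities; R(q, k) is a hypothetical stand-in response.
T0, L = 300.0, 1.0

def R(q, k):
    w = np.sqrt(q / k) * L
    return T0 * np.sin(w) / w

q0, k0, h = 1.0, 2.0, 1e-4

# d/dk of (dR/dq): differentiate the first-order q-sensitivity w.r.t. k.
dR_dq = lambda k: (R(q0 + h, k) - R(q0 - h, k)) / (2 * h)
mixed_a = (dR_dq(k0 + h) - dR_dq(k0 - h)) / (2 * h)

# d/dq of (dR/dk): differentiate the first-order k-sensitivity w.r.t. q.
dR_dk = lambda q: (R(q, k0 + h) - R(q, k0 - h)) / (2 * h)
mixed_b = (dR_dk(q0 + h) - dR_dk(q0 - h)) / (2 * h)

# Agreement of the two computations provides the intrinsic verification.
assert abs(mixed_a - mixed_b) < 1e-6
```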
The second-order sensitivities arising from Equation (117) are provided by its G-differential for arbitrary variations around the nominal parameter and function values (indicated by the use of the superscript “zero”), which is by definition obtained as follows:
where the direct-effect and, respectively, indirect-effect terms have the following expressions:
The variational function
is the solution of Equations (108) and (109) while the variational function
is the solution of Equations (139) and (140). Altogether, these four equations constitute the 2nd-LVSS for the two-component vector-valued variational function
. The need for repeatedly solving this 2nd-LVSS for all parameter variations of interest is circumvented by eliminating the appearance of
in the expression of the indirect-effect term defined in Equation (152), by constructing an alternative expression for this term using the solution of the 2nd-LASS, to be constructed by applying the steps outlined in
Section 3, as follows:
Consider the two-component vector function
, where the first argument denotes the component number and the second argument (“2”) indicates that this function will correspond to the “second” first-order sensitivity
. Using the inner product defined in Equation (85), construct the inner product of
with Equations (108) and (140), respectively, to obtain the following relation:
Integrate by parts the first and third terms on the left-side of Equation (153) and rearrange the terms to obtain the following relation:
Require the third and fourth terms on the left-side of Equation (154) to represent the indirect-effect term defined in Equation (152) by imposing the following relations:
Eliminate the unknown terms
on the left-side of Equation (154) by imposing the following boundary conditions:
Insert the boundary conditions represented by Equations (109) and (139) into Equation (154) and use the relations underlying the 2nd-LASS to obtain the following expression for the indirect-effect term defined in Equation (152):
Add the expression obtained in Equation (158) to the expression of the direct-effect term provided in Equation (151) to obtain the following expression:
Insert the expression of
into the second term on the right-side of Equation (159) and collect the terms multiplying the variations
and
, respectively, to obtain the following expressions:
The algebraic manipulations involved in obtaining the closed-form expressions of the second-order sensitivities presented in Equations (160) and (161) are straightforward but involve a large amount of algebra stemming from the fact that the 2nd-LASS involves the two-component 2nd-level adjoint sensitivity function
. The reason for needing such a two-component adjoint function stems from the expression of the first-order sensitivity
provided in Equation (117), which involves both the original function
and the 1st-level adjoint sensitivity function
. A significant amount of algebraic manipulations could be avoided by eliminating the appearance of either
or
in the expression of
. If either of these functions were eliminated from appearing in the expression of
, then the G-differential of
would depend either just on
or just on
, which are “single-component” (as opposed to “two-component”) variational sensitivity functions. In such a case, the corresponding 2nd-LASS would also comprise just a single-component (as opposed to a “two-component”) 2nd-level adjoint sensitivity function. These considerations will be illustrated in the following by using Equation (101) to eliminate the appearance of the function
in the expression provided in Equation (117) for
, which would consequently take on the following simplified expression:
Applying the definition of the G-differential to Equation (162) yields the following expression:
where the direct-effect and the indirect-effect terms are defined below:
The appearance in Equation (165) of the variational function
is eliminated by following the same procedure as followed in the foregoing for the indirect-effect term
. Ultimately, the indirect-effect term
will have the following expression in terms of a 2nd-level adjoint sensitivity function denoted as
:
where the 2nd-level adjoint sensitivity function
is the solution of the following 2nd-LASS:
Adding the expressions obtained in Equations (164) and (166) yields the following expression for the G-differential
:
It follows from Equation (169) that the respective second-order sensitivities have the following expressions:
The mixed second-order sensitivity
in Equation (170) does not depend on the 2nd-level adjoint sensitivity function
and was therefore evaluated immediately. Solving Equations (167) and (168) yields the following expression, to be evaluated at the nominal parameter values, for the 2nd-level adjoint sensitivity function
:
Inserting the result obtained in Equation (172) into Equation (171) and performing the respective operations yields the following expression:
It is evident from Equation (147) and Equation (160) or, alternatively, Equation (173) that the mixed second-order sensitivity is computed twice, employing distinct expressions involving distinct 2nd-level adjoint sensitivity functions. This mechanism provides a stringent verification of the accuracy of the computation of the respective adjoint sensitivity functions.
In practice, the closed-form analytical expressions of the original functions, such as those provided in Equation (101), are seldom available. Nevertheless, if such expressions are available, they can be advantageously used to reduce the amount of computation involved in determining the response sensitivities, as shown in the foregoing.
4.4.2. Alternative derivation of the second-order sensitivities by applying the 2nd-FASAM-NODE-F methodology
When applying the 2nd-FASAM-NODE methodology, the second-order sensitivities arise from the first-order sensitivities obtained in Equations (130) and (131). Thus, the second-order sensitivities arising from Equation (130) are provided by its G-differential for arbitrary variations around the nominal parameter and function values (indicated by the use of the superscript “zero”), which is by definition obtained as follows:
where
denotes the derivative of the Dirac-delta functional. The variational function
is the solution of the following 2nd-LVSS, obtained by G-differentiating Equations (128) and (129):
The above 2nd-LVSS for the function
is to be satisfied at the nominal parameter values, but the superscript “zero” (which has been used to denote this fact) has been omitted to simplify the notation. The need for repeatedly solving this 2nd-LVSS for all parameter variations of interest is circumvented by eliminating the appearance of
in Equation (174). This aim will be accomplished by expressing
in terms of the solution of the 2nd-LASS to be constructed by applying the steps outlined in
Section 3. Thus, consider an adjoint function that will be denoted as
, where the argument “1” indicates that this adjoint function corresponds to the first-order sensitivity
, which is chosen in this case to be the “first” 1st-order sensitivity to be considered. The 2nd-LASS to be satisfied by
will be constructed by applying the 2nd-FASAM-NODE, which commences by forming the inner product of
with Equation (175), to obtain the following relation:
Integrating by parts the first term on the left-side of Equation (177) and rearranging the terms yields the following relation:
The last term on the left-side of Equation (178) is now required to represent the G-differential defined in Equation (174) to obtain the relation below:
The unknown boundary terms are eliminated from Equation (178) by imposing the following conditions:
The system of equations comprising Equations (179) and (180) constitutes the 2nd-LASS for the 2nd-level adjoint sensitivity function.
Replacing the results obtained in Equations (176), (179) and (180) into Equation (178) yields the following alternative expression for the indirect-effect term:
where the 2nd-level adjoint sensitivity function
is the solution of the 2nd-Level Adjoint Sensitivity System (2nd-LASS) comprising Equations (179) and (180). The 2nd-LASS is a NODE net that does not depend on parameter variations and needs to be solved once only at the nominal parameter values; its solution,
, is used in Equation (181) to determine the respective second-order response sensitivities.
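The solve-once-and-reuse character of an adjoint sensitivity system can be sketched, at first order for brevity, on a toy linear ODE; the equation, response, and parameter values below are illustrative assumptions, not the 2nd-LASS constructed in this section. A single adjoint solution, obtained at the nominal parameter values, yields the sensitivities with respect to every parameter through inner products:

```python
import numpy as np

def trapezoid(y, t):
    # Composite trapezoidal quadrature (avoids version-dependent NumPy names)
    return np.sum((y[1:] + y[:-1]) * np.diff(t)) / 2.0

a, b, u0, T = 0.7, 0.3, 1.0, 1.0   # illustrative nominal values
t = np.linspace(0.0, T, 2001)

# Forward solution of du/dt = -a*u + b, u(0) = u0 (exact, for simplicity)
u = (u0 - b / a) * np.exp(-a * t) + b / a
# Adjoint solution of dlam/dt = a*lam, lam(T) = 1, solved only once
lam = np.exp(-a * (T - t))

# The single adjoint solve serves all parameters: each sensitivity of the
# response R = u(T) is an inner product of lam with the corresponding
# partial derivative of the right-hand side, df/da = -u and df/db = 1
dR_da = trapezoid(lam * (-u), t)
dR_db = trapezoid(lam * np.ones_like(t), t)

# Analytic check: dR/db = (1 - exp(-a*T))/a for this toy problem
assert abs(dR_db - (1 - np.exp(-a * T)) / a) < 1e-6
```

The design point mirrors the text: the adjoint system does not depend on the parameter variations, so its cost is incurred once, after which each additional sensitivity costs only a quadrature.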
Identifying in Equation (181) the quantities that multiply the respective parameter variations yields the following expressions:
Solving the 2nd-LASS represented by Equations (179) and (180) yields the following expression for the 2nd-level adjoint sensitivity function
:
where
denotes the Heaviside functional. Using in Equation (182) the results obtained in Equations (183) and (132) yields the following expression:
As expected, the expression obtained in Equation (184) is identical to the expressions obtained in Equations (170) and (149).
The second-order sensitivities arising from the first-order sensitivity represented by Equation (131) are obtained from its G-differential for arbitrary variations around the nominal parameter and function values. Thus, applying the definition of the G-differential to Equation (131) yields the following expression:
where the direct-effect and indirect-effect terms have the expressions below:
The indirect-effect term will be recast in terms of an alternative expression that will not involve the variational functions and by applying the principles of the 2nd-FASAM-NODE, which are fundamentally the same as those underlying the 2nd-FASAM-NODE-F, as follows:
Adding the expression obtained in Equation (194) to the expression of the direct-effect term provided in Equation (186) yields the following expression for the G-differential
:
It follows from the expression obtained in Equation (195) that:
Solving the 2nd-LASS represented by Equations (191)‒(193) yields the following expressions for the components of
:
Using in Equation (196) the result obtained in Equation (198) yields the following expression:
As expected, the above expression coincides with the expression obtained, successively, in Equations (149), (170) and (184). Evidently, the expression of the mixed second-order sensitivity can be determined in several distinct ways, using distinct adjoint sensitivity functions, thus providing alternatives for verifying the computational accuracy of the respective adjoint functions, when these functions are computed numerically, as is the case in practice.
Inserting the results obtained in Equations (101), (183), (198) and (199) into Equation (197) and performing the respective integrations yields the following expression:
As expected, the above expression coincides with the expression obtained in Equation (173).