Abstract
For the inverse problem in physical models, one measures the solution and infers the model parameters using information from the collected data. Oftentimes, these data are inadequate and render the inverse problem ill-posed. We study the ill-posedness in the context of optical imaging, which is a medical imaging technique that uses light to probe (bio-)tissue structure. Depending on the intensity of the light, the forward problem can be described by different types of equations. High-energy light scatters very little, and one uses the radiative transfer equation (RTE) as the model; low-energy light scatters frequently, so the diffusion equation (DE) suffices to be a good approximation. A multiscale approximation links the hyperbolic-type RTE with the parabolic-type DE. The inverse problems for the two equations have a multiscale passage as well, so one expects that as the energy of the photons diminishes, the inverse problem changes from well- to ill-posed. We study this stability deterioration using the Bayesian inference. In particular, we use the Kullback–Leibler divergence between the prior distribution and the posterior distribution based on the RTE to prove that the information gain from the measurement vanishes as the energy of the photons decreases, so that the inverse problem is ill-posed in the diffusive regime. In the linearized setting, we also show that the mean square error of the posterior distribution increases as we approach the diffusive regime.
1. Introduction
Optical tomography is a well-defined inverse problem. In the lab, laser beams with high-energy photons are injected into bio-tissues to detect the interior optical property [1]. This helps identify unhealthy bio-tissues for treatment. Mathematically, this amounts to reconstructing the optical coefficients, such as the scattering and the absorption coefficients in the radiative transfer equation (RTE), which is a model equation that describes the propagation of photon particles [2]. The equation, in its simplest form, reads
The unknown describes the number of photon particles at time , location , and traveling with velocity . The dimension of the problem is . v is the travel direction of the particles and therefore belongs to a unit ball. The two terms in the equation represent the transport and the scattering effect, respectively. The transport term characterizes , while the scattering term describes the way the photon particles interact with the media. When the temperature is fixed, this operator is a linear operator, whereas if the laser beam heats up the tissue, reflects the nonlinear dependence on the temperature. This is the term that encodes the optical property of the media.
In the steady state, the term is dropped, and the equation balances the transport term and the scattering term. The equation is well-posed if equipped with Dirichlet-type boundary condition [3,4,5,6]:
where collects the coordinates on the physical boundary with the velocity either pointing in or out of the domain:
In practice, the laser light is shined into the domain, meaning that is prescribed. Then, one detects the number of photons scattered out of the domain by measuring . We term this operator the albedo operator:
where the subindex reflects the influence of . Therefore, optical tomography amounts to reconstructing the coefficients in using the information in the operator . The general well-posedness theory of such an inverse problem was addressed in the pioneering papers [7,8]. The result on the stability was established in [9,10,11], and see [12] for a review.
One key parameter in Equation (1) is , which is the Knudsen number. It describes the regime the system is in. By definition, the Knudsen number is the ratio of the mean-free path and the domain length. Physically, if a photon has low energy (visible or near-infrared light), it travels only a short distance before being scattered, and the mean-free path is short compared to the domain length, leading to a small . When this happens, one typically observes a diffused light phenomenon, and the received images are blurred. The situation is termed the diffusion effect, and in this regime, RTE, either linear or nonlinear, can be asymptotically approximated by a diffusive equation that characterizes macroscopic quantities such as the density . Depending on the form of , this limiting equation for is accordingly adjusted. Details regarding the diffusion limit can be found in [13,14].
Intuitively, as one decreases the photon energy, the received picture loses its crisp, and the reconstruction becomes more unstable. The perturbation observed in the measurement is enlarged in the reconstructed coefficients. This phenomenon has been been numerically observed in [1,15,16] and proved rigorously in [17], in which the authors investigate the stability deterioration of the inverse problem as . However, all these results heavily depend on the theoretical framework where it is assumed that one has the access to the full map. This amounts to sending in all kinds of incoming data and taking the measurement over the entire .
These theoretical results are helpful in building the foundation for understanding the stability deterioration, but the setup is infeasible numerically or practically. In the lab, a finite number of incoming configurations can be taken, and the detectors can measure outgoing photons in a finite number of locations. Therefore, new theory needs to be developed to account for this real situation [18,19]. To put the problem in a mathematical framework, we denote as the incoming data and as the location of the detectors; then, the measurements are
Equivalently, denoting as the solution to (1) with the boundary condition in (2) being , then:
With the finite amount of data points, to find the coefficients in , one can either adopt the PDE-constrained minimization algorithms [20,21,22] or employ the Bayesian inference [23,24,25]. While this finite setting greatly affects the well-posedness argument [26,27,28], the physical intuition still holds true in the sense that the reconstruction is expected to become more and more unstable as diminishes.
In this paper, we quantify the stability deterioration in the Bayesian inference framework with a finite number of data points. To quantify “stability”, we propose two measures: one is a global measure that evaluates the information gain by comparing the posterior and prior distributions, and the other is a local measure that characterizes the “flatness” of the posterior distribution around the MAP (maximum a posteriori) point. More particularly, at the global level, we measure the difference between the posterior and prior distributions using the KL (Kullback–Leibler) divergence. Since the posterior distribution of the coefficients takes both the prior knowledge and the measured data into account, this divergence essentially characterizes by how much the data are driving the final guess away from the prior guess. At the local level, we estimate the Hessian of the posterior distribution around the MAP point, which essentially describes the uncertainty level of the optimizer. For both measures, we analyze their dependence on , and we reveal the stability relation between two regimes, the transport regime and diffusive regime, in a more rigorous way. We will present our theory through the lens of both linear and nonlinear RTEs, but we shall mention that other multiscale kinetic models may have such a relation in the inverse setting; see for instance in [29,30,31].
The rest of the paper is organized as follows. In the next section, we recall some formulation in the Bayesian inference problem and demonstrate a general relation in the KL divergence between prior and posterior distribution, through which we will address the stability deterioration from the global point of view. In Section 3, we consider the inverse problem for the linear radiative transfer equation, and we show that the KL divergence is of order , which diminishes as we approach the diffusive regime. We extend the investigation to the nonlinear RTE in Section 4 and obtain similar results. In Section 5, we focus on the local viewpoint by estimating the second derivative of the parameter-to-measurement map, and we show that it decreases at the order of , indicating that the posterior distribution becomes flatter near the diffusive regime and therefore is less sensitive to the measurement data, rendering the inverse problem more unstable. We summarize our theoretical results and provide some numerical evidence in Section 6. A final conclusion of the paper is drawn in Section 7.
All the discussion in these sections finally achieve the following aim of the paper: to demonstrate, in the Bayesian inference framework, the stability deterioration of optical imaging when the impinging light uses low-energy photons.
2. Bayesian Formulation Basics
Bayesian inference is one of the most popular numerical methods for inferring unknown parameters. In this section, we give a quick overview of definitions to be used in this paper.
To start, we define the parameter to measurement map and denote the measurement error:
where the measurement , the measurement error , and , the admissible set in a pre-defined Banach space :
Note that can be a function space specified in (10), and is a function in this space. Throughout the paper, we assume that is an additive noise generated by a Gaussian distribution , and is a dimensional matrix.
In Bayesian inference, one needs to prepare a prior distribution for . Denote as the -algebra of B and as the prior probability measure on B. According to the Bayes’ rule, the posterior distribution is given according to its Radon–Nikodym derivative with respect to :
where Z is the normalization constant:
and the mismatch is weighted by
Suppose are two distinct probability measures; then, the KL divergence measures the distance between them:
when this definition is used in Bayesian inference to quantify the relative gain through the measurement process, one defines the relative entropy, and in the particular setting of (4), we have the following proposition.
Proposition 1.
Assume ; then, the KL divergence between and has the following estimate:
for some positive constant C independent of B.
Proof.
Since and are mutually absolutely continuous, according to (4), we have
Denoting , we proceed with:
where we used the Lipschitz continuity of in a bounded interval. The constant C depends on the size of d and the boundedness of . Since , according to (3), , making . The inequality (5) holds if we plug in the expression of to get:
□
In view of (5), given that and are at least bounded, the difference between and is controlled by the difference between and . This means if is a slow-varying map over the whole admissible set B, then only slightly differs from , indicating that the information gain is small. In the following sections, we justify this property for RTE in both linear and nonlinear settings in the regime.
3. Global View Example 1: Linear Radiative Transfer Equation
The first example we consider is the linear radiative transfer equation:
where , , is the collision operator, is the absorption coefficient, and is the scattering coefficient. Both and here are seen as functions supported on . The boundary denotes the incoming () and outgoing () boundaries respectively, and thus, incoming boundary conditions are considered here.
In the following, we first establish the the parameter-to-measurement map . Then, according to Proposition 1, the core of the quantification lies in analyzing how fast varies with respect to , for which we derive its Fréchet derivative and estimate its dependence on .
3.1. First Derivative of the Parameter-to-Solution Map
Let be the short hand notation, , and assume that only a finite series of incoming data are generated in the experiments, and the outgoing data are collected only at K discrete boundary locations ; then, is defined as:
We also define the parameter-to-solution map as:
then and are related via:
The above relation can be understood more precisely component-wise. We write and , then
Consequently, finding the derivative of in the direction amounts to finding the derivative of the map ,
where the subscript in f denotes the parameter used in obtaining the solution f, and is an admissible variation of defined below. To abbreviate the notation, we denote
and omit the subscripts j, k when the calculations are valid for all j and k.
We now specify the set of as follows:
Then, from [32] (see Proposition 3.1), we have that for , . We then call a parameter an admissible variation of if the perturbed parameter for sufficiently small t.
Hereafter, assume that the boundary condition . We cite the following two results from [4]. The first one indicates that from (8) is well defined, and the second one provides an estimate of w defined in (9).
Lemma 1.
Let ; then, the operator is a Lipschitz continuous mapping from to , where is defined as
Proposition 2.
Denote
For and admissible variation , w is the unique solution of the following equation:
where is the solution to (6) with parameter σ. Moreover, for , there holds
3.2. Dependence on Knudsen Number
We discuss the dependence of the first derivative of the parameter-to-measurement map on the Knudsen number and use it to build the asymptotic connection in the KL divergence between and . The proofs are carried out by asymptotic analysis. First, we recall the diffusion limit of the RTE.
Theorem 1.
Suppose satisfies Equation (6) with a smooth boundary condition
where , and solves
Then, , and as . Moreover, we have
The proof is very similar to that in [32] (see the Appendix therein), but we still include the details here as it will be used in proving the next proposition.
Proof.
We decompose the solution as
where , , and are chosen as
In order for to be well defined above, we need to be in the range of , which then leads to
Equipping it with the boundary condition determines . Note that , the standard elliptic estimate gives the global boundedness of , and . Since is a bounded operator on , is uniformly bounded. Since and commute, similar analysis can be applied to show that is uniformly bounded. Plugging the expansion (14) into (6), we obtain the equation for :
with boundary condition . Consequently, satisfies RTE with boundary condition and source; therefore, from the maximum principle, . Recalling (14), we have
Using the boundedness of , we conclude the theorem. □
Remark 1.
In our case, we suppress the boundary layer by proposing a special boundary condition for f as defined in (12). This result can be extended to more general boundary conditions, in which case boundary layer analysis is inevitable, see [13] for details.
The proposition below demonstrates the dependence of the first derivative of on .
Proposition 3.
For every and admissible variation , there holds
Proof.
Considering (11) with perturbation , we expand w and f as
Then, at , we have , which indicates that
At , we have , then using the form of , we obtain , and thus
where we have used . To get an evolution equation for , we look at the level and obtain: . Substituting (16) and (17), and taking the average in v leads to
From the zero boundary condition of w, we also have
Substituting (15) into (11), we get
with boundary condition . Again from the maximum principle, . Here, the boundedness of , are again from the elliptic estimate of (18).
Now, recalling the definition of the first derivative of the forward map, we have
which concludes the proof. Here, the first term vanishes due to the zero boundary condition (19). □
Theorem 2.
For a linear RTE problem, the KL divergence between and vanishes as , i.e.,
Proof.
From (5), we have that
Then, from Proposition 3 and boundedness of , the result is immediate. □
4. Global View Example 2: Nonlinear Radiative Transfer Equation
Along the same vein, we extend our investigation to the nonlinear RTE in this section:
Here, , , is again the radiation intensity and is the temperature. is the total intensity. is the Knudsen number, which is defined to be the ratio of the mean free path to a characteristic length of the problem [33]. The boundary conditions for f are on the incoming boundary .
In the following, we first recall the diffusion limit of (21); then, we analyze the first derivative of the parameter to the measurement map in the inverse setting and investigate its dependence on so as to build a connection between and .
4.1. First Derivative of the Parameter-to-Solution Map
For the nonlinear RTE problem, the inverse problem is again set up to recover the scattering coefficient by measuring the outgoing intensity flux. Therefore, the parameter-to-measurement map is defined as
More particularly, assume that only a finite series of incoming data, , are generated in the experiments and injected into the tissue, and the outgoing data are collected only at K discrete boundary locations ; then
We also define the parameter-to-solution maps and
then, the forward map is written
Consequently, finding the derivative of amounts to finding the derivative of the map , namely,
where we use the same notation to index the parameter of f as what we did for the linear RTE. Since f is nonlinearly coupled to T, we also need to find the derivative of the map , which is defined similarly as
Here, defined in (10) and is an admissible variation of .
To abbreviate the notation, we denote
and omit the subscript j, k when the calculations are valid for all j and k.
Proposition 4.
Proof.
We follow Theorem 3.3 in [4] and extend the proof to the nonlinear RTE. Let and be admissible, and denote be corresponding solutions of (21):
and
For sufficiently small t, we write the perturbed solutions
then, it is straightforward to show that satisfy
From Theorem 3.1 in [33], for , let be that , there exists a unique solution to (27) that satisfies the following thanks to the maximum principle:
□
As an immediate consequence, we have the following control of the first derivative of the parameter-to-solution map.
Proposition 5.
For and admissible variation , and are the unique solutions of the following system:
where and satisfy (21) with parameter σ and , respectively. Moreover, there holds:
where and is a constant depending on .
4.2. Dependence on Knudsen Number
To analyze the dependence of on , we first recall the asymptotic result in the forward setting.
Theorem 3.
Let solve (21); then, as , the pair converges to , with solving the nonlinear diffusion equation,
where is a constant depending on dimension. is determined from and by solving a Milne problem.
This result is rigorously proved in [33], and we only sketch the main steps here.
Proof.
Consider the asymptotic expansion in in the bulk area:
Then, at the leading order , we have
which implies that is uniform in v. The next order leads to
At , we have
Taking the average in v for the first equation and subtracting it from the second one, we have
Using (32) in (34) implies that
where depends on the dimension.
To determine the boundary condition for , we need to take into account the boundary layer effect. More precisely, let be the interior of the domain so that is the boundary area. As before, we let be the stretching variable, where and n is the unit outer normal direction at ; then, . For , let , , and they satisfy the Milne problem:
Then, solving the Milne problem, the boundary condition for (35) is given by
□
With this theorem in mind, we delve into the dependence of the derivatives of on in the following proposition.
Proposition 6.
For every , , and admissible variation , there holds:
Proof.
The proof is carried out by asymptotic analysis. We write the expansions:
Plugging (36) into (21), we have as before:
and solving these readily gives , , and . The remaining and are obtained by solving the rest of the system:
Next, we plug both (36) and (37) into (29), and at leading order :
At , we equate
which implies
Then, the next order is:
taking the average in v of (44) and adding the result to gives the equation for
Here, the boundary condition for is again determined by the boundary layer asymptotic analysis, so we omit the details.
The main theorem for the nonlinear case is now in order.
Theorem 4.
For the nonlinear RTE problem, the KL divergence between and vanishes as , i.e.,
With the result from Proposition 6, the proof is identical to that of Theorem 2.
5. Local Behavior around the MAP Point
The KL divergence between the prior and the posterior distribution is a global quantity: it characterizes the information gain from the measured data in the whole distribution. We are also concerned about the local behavior of the posterior distribution, especially around the maximum a posteriori (MAP) point, which is denoted by . Suppose the posterior distribution around the MAP point is rather “flat”; then, the probability is unchanged in a fairly large area around , meaning that all the configuration in this flat area can be approximately taken as the optimal point, and the reconstruction is insensitive to data, demonstrating the instability.
This behavior can be characterized well if the problem is linear. Suppose ; then, assuming the prior distribution is a Gaussian centered at with covariance , the posterior distribution is uniquely determined by and by:
The “flatness” of a Gaussian is characterized by its covariance matrix. Indeed, with a quick derivation, one can show that:
Therefore, the less informative the forward map is, the smaller G is, and then the bigger gets, indicating the higher mean-square error. Geometrically, it means the Gaussian is flatter.
We would like to understand this local behavior around the MAP point. However, the forward map we have is nonlinear, so the argument above only serves as a guidance. To start, we denote:
Then, the convexity of the posterior distribution is uniquely determined by the hessian of A. For that, we quote:
Proposition 7.
Let be admissible and be an admissible variation. The Hessian of the posterior distribution is expressed in terms of the forward map as:
Proof.
We begin by expanding out :
where the remainder term has no dependence. Expand around for small t:
where . Plug this in the expression of A and collect the second power of t; then, we have:
concluding the proof. □
This formula holds true for all that is admissible and thus is valid at MAP as well. According to this proposition, to show the “flatness” of the distribution at MAP, we essentially need to show the smallness of both and for all i. All derivations below are to justify this statement for both the linear and nonlinear RTE.
5.1. Linear RTE
We begin with the linear RTE. The following proposition, due to [4], characterizes the second derivative of the parameter-to-solution map , which we will compute its dependence on .
Proposition 8.
Proof.
Considering the second derivative of G in Equation (24), all satisfy the RTE with different as shown below, and the same incoming boundary condition g,
and combining these equations gives
Here, we have used the fact that C is linear in . Dividing by and taking the limit as then gives
The boundedness is seen from Theorem 3.7 in [4]. □
We next determine the dependence of the forward map’s second derivative in terms of .
Proposition 9.
For every , admissible , and admissible variation , there holds:
Proof.
Considering (48) with fixed and boundary measurement location , we expand
Using (48), at , we obtain
From Proposition 2, and have no velocity dependence, which gives ; thus, has no velocity dependence. Additionally, from the zero boundary condition of H, we have .
At ,
and thus
Since and , we have
Finally at , we have
Integrating over velocity, we obtain an equation for the leading order term , which is in closed form, since are written in terms of (see [4])
Upon averaging over velocity, we obtain the second derivative of the operator with the perturbation,
The contribution from the term becomes zero due to its trivial boundary condition, and H is bounded from [4]. □
Directly from Propositions 3, 7 and 9, we have the following corollary.
Corollary 1.
For and admissible variation , the diagonal elements of the Hessian of the posterior distribution satisfies:
5.2. Nonlinear RTE
We repeat the analysis for the nonlinear RTE, for the case when has been linearized. Using similar notatinos, we let
and have the following proposition regarding the boundedness of and .
Proposition 10.
For and admissible variations , are the unique solution to the following system:
where denotes the first derivative of the parameter-to-solution map for f in the direction, and similarly for , and . Moreover, there holds:
where and depends on .
Proof.
We let , and . Each satisfies the nonlinear RTE with a variety of shown below:
where .
We begin by computing various combinations of . First, we recall:
and expand
Similarly for and respectively,
Combining these,
so that
To derive the equations for and , we take (49), subtract (50) and (51), and add (52). Using (53),
To show the boundedness, we note that solve the nonlinear RTE with a modified right-hand side only containing bounded terms, so we again use Theorem 3.1 from [33] to obtain the boundedness:
□
Proposition 11.
For every and functions in the admissible set, the second derivative of the forward map is of order ϵ,
Proof.
We start with arbitrary directions and and later choose the direction. To find the dependence of and , we expand:
and plug this in to (54). At order , we obtain
subtracting these two equations, we obtain
so loses its velocity dependence. At , we obtain
so
Finally, at , we obtain
and
Corollary 2.
For and admissible variation , the diagonal elements of the Hessian of the posterior distribution based on the nonlinear RTE satisfies:
This is a direct corollary from Propositions 3, 7, and 11.
6. Research Results and Discussion
As is well known, the radiative transfer equation, in the high-energy regime where the scattering is dominating, is well approximated by the diffusion equation. This asymptotic relation constitutes a major part in model reduction for the forward setting. On the contrary, it has an adverse effect for the inverse problem. In particular, the reconstruction becomes increasingly unstable as we approach the diffusive regime. This relation has been investigated numerically in [1,15,16] and analytically in [17], where a full data to measurement map, named the Albedo operator, is assumed to be given.
However, in practice, only partial data are available, and the previously obtained well-posedness results are no longer feasible. We focus our investigation in this paper for this scenario, using Bayesian inference as the basic tool. We have proposed two new measures to characterize the stability and its deterioration in the diffusion regime. One is the KL divergence between the posterior and prior distribution, which implies the overall information gain from the data. We show that the information gain is less in the small Knudsen number regime. The other is the Hessian of the posterior distribution, which is related to the mean square error of the posterior and quantifies the uncertainty of the optimizer; therefore, it serves as a local measure around the MAP. We find that the posterior distribution is more and more “flat” and thus carries no useful information when the diffusion effect is strong. Both measures are taken for the linear RTE and are extended to, for the first time in the literature, treat the nonlinear RTE as well.
Although the paper is completely theoretical, it gives guidance on conducting numerical experiments. The discussion in this paper essentially suggests that the problem is intrinsically bad in the diffusion regime and thus rules out all possible algorithms that could potentially deliver a good numerical recovery.
There are some natural follow-up questions that need to be answered in the near future. One task is to clearly identify the number and the quality of the measurements for a unique reconstruction. Suppose the reconstructed function is represented by an N dimensional vector; then, how many experiments and measurements exactly are needed for a unique reconstruction, and where should the source and detector be located? The answer to this question for EIT (electrical impedance tomography) was given in [26], but one needs to tune the process to fit the situation for the radiative transfer equation used in the current paper. Another question concerns the numerical stability. While it is true that the information gain deteriorates as becomes small, there might be numerical approaches that gradually incorporate information from high-energy to low-energy photons. One possible strategy is to use a large number of experimental measurements conducted with low-energy photons to build a rough initial guess as a “warm start” before adding in information obtained at a high-energy level. The warm start given by the low-energy level results serves as a good initial guess toward a final convergence. The process is parallel to Bayesian tempering, which saw some good developments in the past few years [34,35], and the inverse scattering problem that combines information from multiple frequencies [36]. Another approach is to use a hybrid image modality, such as photoacoustic tomography. where acoustic measurement generated by the photoacoustic effect is used to infer the optimal properties of the media. In this case, an improved stability is anticipated [37,38].
Numerical Evidence
Here, we conduct two numerical tests to further support our theoretical findings. In particular, we consider the following linear radiative transfer equation:
The data are prepared as follows. For the i-th experiment, we prescribe incoming data and measure the output intensity f at location j on , which is denoted as . Here, and detect location j are chosen randomly, and number of experiments are conducted with number of receivers for each experiment. These data pairs are kept fixed for various choices of Knudsen number for a fair comparison.
In the first test, we assume has the form
where is a two-dimensional normal distribution with mean zero and a fixed covariance matrix . The parameter is the parameter to be inferred. We assume the ground-truth , and the ground-truth medium is plotted on the left of Figure 1. The inverse problem aims at inferring this ground-truth from the prescribed data pairs. To illustrate the stability of the inverse problem, we show how, for different choices of away from the truth, the measurements differ from the true measurements. More precisely, we denote
as the perturbation in the measurement, where is the measurement with a fixed for the i-th experiment and j-th measurement, and is the measurement with true . In the right panel of Figure 1, we plot the difference in measurement data (58) versus the perturbation in for three different . It is clear that with smaller , the difference in measurement becomes more indistinguishable, which means that a smaller perturbation in measurement can lead to larger deviation in reconstruction. This is in consistent with our theory on the stability deterioration with decreasing .
Figure 1.
Test 1. (Left): ground truth (57) with . (Right): perturbation in measured data versus perturbation in .
In the second test, we consider a different form of :
Here, the to-be-reconstructed parameters are and . We choose and to be the ground-truth and display the medium in Figure 2. Similar to the previous case, we plot the difference in measurement
with respect to and . It is again evident that for smaller , becomes flatter, leading to a deterioration in stability, as shown in Figure 2. In addition, we identify the data that are above in Figure 3. Here, the horizontal axis is the index for the experiment, and the vertical axis is the index for the receiver. It is seen that a larger gives a sparse dataset, whereas small gives a dense dataset, meaning that all receivers receive some information about the source. This indicates that the data are more spread out for smaller , which makes the inverse problem harder to solve.
Figure 2.
Test 2: The panel on the left shows the ground-truth medium with and in (59). The two plots on the right show, for different , the perturbation in the measured data versus the perturbation in .
Figure 3.
Test 2. Visualization of measured data whose intensity value is above .
7. Conclusions
In this paper, we examine the stability deterioration for the multiscale inverse radiative transfer equation (RTE) in the Bayesian framework. Even though the instability on the continuous level was discussed in the literature [31] and the multiscale convergence was shown in [16,17,32], the instability representation in the Bayesian framework was open. The current paper constitutes the first result that fills the gap. The results presented suggest that one should not use Bayesian inference for conducting optical tomography when the photon energy is low. The numerical algorithm, without fine tuning, would carry very limited use in reconstructing the ground-truth media.
Author Contributions
Conceptualization: Q.L.; methodology, Q.L. and L.W.; validation, Q.L. and L.W.; formal analysis, K.N. and L.W.; investigation, Q.L. and L.W.; resources, Q.L. and L.W.; writing—original draft preparation, K.N.; writing—review and editing, Q.L. and L.W.; visualization, Q.L.; supervision, Q.L.; project administration, Q.L.; funding acquisition, Q.L. and L.W. All authors have read and agreed to the published version of the manuscript.
Funding
Q.L. acknowledges support from Vilas Early Career award. The research is supported in part by NSF via grant DMS-1750488 and the Office of the Vice Chancellor for Research and Graduate Education at the University of Wisconsin Madison with funding from the Wisconsin Alumni Research Foundation. K.N. acknowledges support from NSF grant DMS-1750488. L.W. acknowledges support from NSF via grant DMS-1846854.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
Data is contained within the article.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Arridge, S.R. Optical tomography in medical imaging. Inverse Probl. 1999, 15, R41–R93. [Google Scholar] [CrossRef] [Green Version]
- Case, K.M.; Zweifel, P.F. Linear Transport Theory; Addison-Wesley Publishing Company: Boston, MA, USA, 1967. [Google Scholar]
- Case, K.; Zweifel, P. Existence and uniqueness theorems for the neutron transport equation. J. Math. Phys. 1963, 4, 1376–1385. [Google Scholar] [CrossRef] [Green Version]
- Egger, H.; Schlottbom, M. Numerical methods for parameter identification in stationary radiative transfer. Comput. Optim. Appl. 2015, 62, 67–83. [Google Scholar] [CrossRef] [Green Version]
- Egger, H.; Schlottbom, M. A mixed variational framework for the radiative transfer equation. Math. Model. Methods Appl. Sci. 2012, 22, 1150014. [Google Scholar] [CrossRef]
- Dautray, R.; Lions, J.L. Mathematical Analysis and Numerical Methods for Science and Technology: Volume 1 Physical Origins and Classical Methods; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2012. [Google Scholar]
- Choulli, M.; Stefanov, P. An inverse boundary value problem for the stationary transport equation. Osaka J. Math. 1999, 36, 87–104. [Google Scholar]
- Stefanov, P.; Tamasan, A. Uniqueness and non-uniqueness in inverse radiative transfer. Proc. Am. Math. Soc. 2009, 137, 2335–2344. [Google Scholar] [CrossRef] [Green Version]
- Wang, J. Stability estimates of an inverse problem for the stationary transport equation. Ann. Inst. Henri Poincaré Phys. Théor. 1999, 70, 473–495. [Google Scholar]
- Bal, G.; Langmore, I.; Monard, F. Inverse transport with isotropic sources and angularly averaged measurements. Inverse Probl. Imaging 2008, 2, 23–42. [Google Scholar] [CrossRef]
- Bal, G.; Jollivet, A. Generalized stability estimates in inverse transport theory. Inverse Probl. Imaging 2018, 12, 59–90. [Google Scholar] [CrossRef] [Green Version]
- Bal, G. Inverse transport theory and applications. Inverse Probl. 2009, 25, 48. [Google Scholar] [CrossRef] [Green Version]
- Bardos, C.; Santos, R.; Sentis, R. Diffusion Approximation and Computation of the Critical Size. Trans. Am. Math. Soc. 1984, 284, 617–649. [Google Scholar] [CrossRef]
- Bensoussan, A.; Lions, J.L.; Papanicolaou, G. Boundary layers and homogenization of transport processes. Publ. Res. Inst. Math. Sci. 1979, 15, 53–157. [Google Scholar] [CrossRef] [Green Version]
- Hielscher, A.; Alcouffe, R.; Barbour, R. Comparison of finite-difference transport and diffusion calculations for photon migration in homogeneous and heterogeneous tissues. Phys. Med. Biol. 1998, 43, 1285–1302. [Google Scholar] [CrossRef]
- Chen, K.; Li, Q.; Wang, L. Stability of stationary inverse transport equation in diffusion scaling. Inverse Probl. 2018, 34, 025004. [Google Scholar] [CrossRef] [Green Version]
- Lai, R.Y.; Li, Q.; Uhlmann, G. Inverse Problems for the Stationary Transport Equation in the Diffusion Scaling. SIAM J. Appl. Math. 2019, 79, 2340–2358. [Google Scholar] [CrossRef]
- Nusken, N.; Reich, S.; Rozdeba, P.J. State and Parameter Estimation from Observed Signal Increments. Entropy 2019, 21, 505. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Jiang, S.W.; Harlim, J. Parameter Estimation with Data-Driven Nonparametric Likelihood Functions. Entropy 2019, 21, 559. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Biegler, L.T.; Ghattas, O.; Heinkenschloss, M.; van Bloemen Waanders, B. Large-Scale PDE-Constrained Optimization: An Introduction. In Large-Scale PDE-Constrained Optimization; Biegler, L.T., Heinkenschloss, M., Ghattas, O., van Bloemen Waanders, B., Eds.; Springer: Berlin/Heidelberg, Germany, 2003; pp. 3–13. [Google Scholar]
- Rees, T.; Dollar, H.S.; Wathen, A.J. Optimal Solvers for PDE-Constrained Optimization. SIAM J. Sci. Comput. 2010, 32, 271–298. [Google Scholar] [CrossRef] [Green Version]
- De los Reyes, J. Numerical PDE-Constrained Optimization; SpringerBriefs in Optimization; Springer International Publishing: Cham, Switzerland, 2015. [Google Scholar]
- Idier, J. Bayesian Approach to Inverse Problems; ISTE Wiley: Hoboken, NJ, USA, 2013. [Google Scholar]
- Stuart, A.M. Inverse problems: A Bayesian perspective. Acta Numer. 2010, 19, 451–559. [Google Scholar] [CrossRef] [Green Version]
- Dashti, M.; Stuart, A.M. The Bayesian Approach to Inverse Problems. In Handbook of Uncertainty Quantification; Ghanem, R., Higdon, D., Owhadi, H., Eds.; Springer International Publishing: Cham, Switzerland, 2017; pp. 311–428. [Google Scholar]
- Bui-Thanh, T.; Li, Q.; Zepeda-Núñez, L. Bridging and Improving Theoretical and Computational Electric Impedance Tomography via Data Completion. arXiv 2021, arXiv:2105.00554. [Google Scholar]
- Wang, J.; Li, M.; Cheng, J.; Guo, Z.; Li, D.; Wu, S. Exact reconstruction condition for angle-limited computed tomography of chemiluminescence. Appl. Opt. 2021, 60, 4273–4281. [Google Scholar] [CrossRef]
- Harris, I. Direct Sampling for Recovering Sound Soft Scatterers from Point Source Measurements. Computation 2021, 9, 120. [Google Scholar] [CrossRef]
- Li, Q.; Sun, W. Applications of kinetic tools to inverse transport problems. Inverse Probl. 2020, 36, 035011. [Google Scholar] [CrossRef] [Green Version]
- Hellmuth, K.; Klingenberg, C.; Li, Q.; Tang, M. Multiscale Convergence of the Inverse Problem for Chemotaxis in the Bayesian Setting. Computation 2021, 9, 119. [Google Scholar] [CrossRef]
- Li, Q.; Newton, K. Diffusion Equation-Assisted Markov Chain Monte Carlo Methods for the Inverse Radiative Transfer Equation. Entropy 2019, 21, 291. [Google Scholar] [CrossRef] [Green Version]
- Newton, K.; Li, Q.; Stuart, A.M. Diffusive optical tomography in the Bayesian framework. Multiscale Model. Simul. 2020, 18, 589–611. [Google Scholar] [CrossRef] [Green Version]
- Klar, A.; Schmeiser, C. Numerical passage from radiative heat transfer to nonlinear diffusion models. Math. Model. Methods Appl. Sci. 2001, 11, 749–767. [Google Scholar] [CrossRef]
- Chandra, R.; Azam, D.; Kapoor, A.; Müller, R.D. Surrogate-assisted Bayesian inversion for landscape and basin evolution models. Geosci. Model Dev. 2020, 13, 2959–2979. [Google Scholar] [CrossRef]
- Latz, J.; Madrigal-Cianci, J.P.; Nobile, F.; Tempone, R. Generalized parallel tempering on Bayesian inverse problems. Stat. Comput. 2021, 31, 1–26. [Google Scholar] [CrossRef]
- Bao, G.; Li, P.; Lin, J.; Triki, F. Inverse scattering problems with multi-frequencies. Inverse Probl. 2015, 31, 093001. [Google Scholar] [CrossRef] [Green Version]
- Rabanser, S.; Neumann, L.; Haltmeier, M. Stochastic proximal gradient algorithms for multi-source quantitative photoacoustic tomography. Entropy 2018, 20, 121. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Frederick, C.; Ren, K.; Vallélian, S. Image Reconstruction in Quantitative Photoacoustic Tomography with the Simplified P_2 Approximation. SIAM J. Imaging Sci. 2018, 11, 2847–2876. [Google Scholar] [CrossRef]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).