QEKI: A Quantum–Classical Framework for Efficient Bayesian Inversion of PDEs

Yong, Jiawei; Tang, Sihai

doi:10.3390/e28020156

Open AccessArticle

QEKI: A Quantum–Classical Framework for Efficient Bayesian Inversion of PDEs

by

Jiawei Yong

and

Sihai Tang

^*

School of Information Science and Technology, ShanghaiTech University, Shanghai 201210, China

^*

Author to whom correspondence should be addressed.

Entropy 2026, 28(2), 156; https://doi.org/10.3390/e28020156

Submission received: 12 December 2025 / Revised: 25 January 2026 / Accepted: 28 January 2026 / Published: 30 January 2026

(This article belongs to the Special Issue Quantum Computation, Quantum AI, and Quantum Information)

Download

Browse Figures

Versions Notes

Abstract

Solving Bayesian inverse problems efficiently stands as a major bottleneck in scientific computing. Although Bayesian Physics-Informed Neural Networks (B-PINNs) have introduced a robust way to quantify uncertainty, the high-dimensional parameter spaces inherent in deep learning often lead to prohibitive sampling costs. Addressing this, our work introduces Quantum-Encodable Bayesian PINNs trained via Classical Ensemble Kalman Inversion (QEKI), a framework that pairs Quantum Neural Networks (QNNs) with Ensemble Kalman Inversion (EKI). The core advantage lies in the QNN’s ability to act as a compact surrogate for PDE solutions, capturing complex physics with significantly fewer parameters than classical networks. By adopting the gradient-free EKI for training, we mitigate the barren plateau issue that plagues quantum optimization. Through several benchmarks on 1D and 2D nonlinear PDEs, we show that QEKI yields precise inversions and substantial parameter compression, even in the presence of noise. While large-scale applications are constrained by current quantum hardware, this research outlines a viable hybrid framework for including quantum features within Bayesian uncertainty quantification.

Keywords:

physics-informed neural networks; quantum neural networks; ensemble Kalman inversion; noisy data

1. Introduction

Partial Differential Equations (PDEs) are important tools for describing the dynamics of physical, chemical, and engineering systems. In addition to state variables, these models often depend on multiple parameters that characterize material properties, source terms, or boundary conditions. In many real-world scenarios, some of these parameters are often unknown and must be inferred from limited and noisy measurement data, which are often referred to as PDE inverse problems.

Physics-Informed Neural Networks (PINNs) [1,2] provide a flexible framework for solving partial differential equations by embedding the governing physical laws into the loss function. This formulation has been widely used in both forward and inverse PDE problems. In parallel, operator learning methods have been developed to approximate mappings between function spaces. DeepONet [3] adopts a dual-network architecture, in which a branch network encodes input functions and a trunk network represents spatial coordinates, enabling the approximation of nonlinear operators. Another representative approach is the Fourier Neural Operator (FNO) [4], which parameterizes integral kernels in the frequency domain using Fourier representations and has been applied to a range of problems involving complex physical systems.

To further enable uncertainty quantification and Bayesian inference, the Bayesian Physics-Informed Neural Network (B-PINNs) framework [5] extends PINNs by treating network parameters probabilistically. B-PINNs originally rely on Hamiltonian Monte Carlo (HMC), which provides asymptotically exact posterior samples but is often computationally expensive. More recent work [6,7] instead adopts the Ensemble Kalman Inversion (EKI) [8,9], a gradient-free and efficient alternative that approximates posterior updates through ensemble-based dynamics.

The rise of Quantum Neural Networks (QNNs) [10,11] has brought new opportunities for deep learning and scientific computing. QNNs achieve a nonlinear representation and probabilistic modeling in a high-dimensional Hilbert space through a quantum feature mapping and variational quantum circuits. The strength of QNNs comes from their ability to combine quantum superposition with the intrinsic parallelism of neural networks, enabling a computational framework with significant potential [12,13]. It has been demonstrated in Ref. [14] that quantum models capable of realizing all sets of Fourier coefficients can act as simulators of universal functions. Several recent studies [15,16,17] have begun to explore integrating QNNs with Physics-Informed Neural Networks. However, training QNNs with gradient-based methods can suffer from the barren plateau phenomenon [18,19], where gradients vanish exponentially with the number of qubits or the depth of the circuit, posing challenges for optimization. Although previous research [20,21,22] has yielded a range of effective algorithms to mitigate this phenomenon, the quest for ever-improving, highly effective solutions remains of paramount importance.

Motivated by these developments, this paper proposes a Quantum-Encodable Bayesian PINNs trained via Classical Ensemble Kalman Inversion (QEKI) framework, which integrates QNNs into the traditional B-PINNs structure. Notably, the inherent structure of QNNs reduces the number of trainable parameters, which improves the representational efficiency of the inverse problem, allowing the architecture to achieve accurate posterior inference with fewer parameters. Furthermore, the EKI-based parameter update does not require gradient computation, which not only accelerates QNN training but also helps mitigate the barren plateau phenomenon. Although current experiments rely on simulated quantum devices, the accuracy and efficiency of the method are expected to improve further with the deployment of future quantum hardware.

The main contributions of this paper are summarized as follows:

(1): We propose a hybrid Quantum-Encodable Bayesian PINNs trained via Classical Ensemble Kalman Inversion (QEKI) framework that combines QNN-based agent modeling with EKI to achieve gradient-free training.
(2): We reduce the trainable parameters by more than an order of magnitude while achieving comparable accuracy. In addition, the method effectively avoids barren plateau behavior, offering a practical optimization method for QNNs.
(3): We provide numerical evidence on nonlinear Poisson, diffusion–reaction, and Burgers equations showing superior robustness to observation noise.

The remainder of the paper is organized as follows. In Section 2, we introduce the problem formulation, including the B-PINNs, HMC, EKI, and the principles of QNNs, and then present the algorithm of the proposed QEKI method. In Section 3, we describe the experimental setups and present the corresponding results for three representative test cases. Finally, in Section 4, we provide concluding remarks and discuss directions for future research.

2. Methodology

The problem we considered is formulated as follows.

\begin{matrix} N_{x} (u (x); λ) & = f (x), & x \in D, \\ B_{x} (u (x); λ) & = b (x), & x \in \partial D, \end{matrix}

(1)

where

N_{x}

is the differential operator that describes the physical process, and

B_{x}

is the boundary operator applied to the boundary.

D \subseteq R^{d}

is the d-dimensional physical domain with boundary

\partial D

, and

λ \in R^{N_{λ}}

represents a vector of unknown physical parameters. In addition,

f (x)

is the forcing function,

b (x)

is the boundary function, and

u (x)

is the solution of the PDE. The aim of the inverse problem is to infer the solution function

u (x)

and the unknown parameters

λ

by integrating observational data with physical equations. The available data consists of the following three sets:

\begin{matrix} D_{f} = {\{x_{f}^{i}, f^{i}\}}_{i = 1}^{N_{f}}, D_{b} = {\{x_{b}^{i}, b^{i}\}}_{i = 1}^{N_{b}}, D_{u} = {\{x_{u}^{i}, u^{i}\}}_{i = 1}^{N_{u}}, \end{matrix}

(2)

where

x_{f}, x_{u} \in D

and

x_{b} \in \partial D

,

f_{i}

,

b_{i}

,

u_{i}

correspond to residual data, boundary-condition data and solution measurement in points

x_{f}^{i}, x_{b}^{i}, x_{u}^{i}

, respectively,

N_{f}

,

N_{b}

,

N_{u}

denote the number of residual, boundary and solution data points.

2.1. Bayesian Physics-Informed Neural Network

The Bayesian Physics-Informed Neural Network (B-PINNs) employs a fully connected neural network as a surrogate model

\tilde{u} (x; θ)

for the PDE solution

u (x)

, where

θ \in R^{N_{θ}}

represents the parameters of the neural network. Set

ξ = \{θ, λ\}

collect both the surrogate model parameters and the unknown physical parameters. By Bayes’ theorem, the posterior distribution of

ξ

is conditioned on the solution measurements

D_{u}

, residual data

D_{f}

, and boundary data

D_{b}

, which can be obtained as follows:

p (ξ ∣ D_{u}, D_{f}, D_{b}) \propto p (ξ) p (D_{u} ∣ ξ) p (D_{f} ∣ ξ) p (D_{b} ∣ ξ) .

(3)

We assume that the parameters are independent and that

θ

follows a Gaussian distribution with zero mean. Therefore, we can express the prior as follows:

p (ξ) = p (λ) p (θ) = p (λ) \prod_{i = 1}^{N_{θ}} p (θ^{i}), p (θ^{i}) \sim N (0, {σ_{θ}^{i}}^{2}),

(4)

where

σ_{θ}^{i}

is the standard deviation of the corresponding neural network parameter

θ_{i}

. In addition, the likelihood can be expressed as follows:

\begin{matrix} p (D_{u} ∣ ξ) & = \prod_{i = 1}^{N_{u}} p (u^{i} ∣ ξ), p (D_{f} ∣ ξ) = \prod_{i = 1}^{N_{f}} p (f^{i} ∣ ξ), p (D_{b} ∣ ξ) = \prod_{i = 1}^{N_{b}} p (b^{i} ∣ ξ), \\ p (u^{i} ∣ ξ) & = \frac{1}{\sqrt{2 π σ_{u}^{2}}} exp (- \frac{{(u^{i} - \tilde{u} (x_{u}^{i}; θ))}^{2}}{2 σ_{u}^{2}}), \\ p (f^{i} ∣ ξ) & = \frac{1}{\sqrt{2 π σ_{f}^{2}}} exp (- \frac{{(f^{i} - N_{x} (\tilde{u} (x_{f}^{i}; θ); λ))}^{2}}{2 σ_{f}^{2}}), \\ p (b^{i} ∣ ξ) & = \frac{1}{\sqrt{2 π σ_{b}^{2}}} exp (- \frac{{(b^{i} - B_{x} (\tilde{u} (x_{b}^{i}; θ); λ))}^{2}}{2 σ_{b}^{2}}), \end{matrix}

(5)

where

σ_{u}, σ_{f}, σ_{b}

. The prior distribution of the physical parameters

λ

is problem-dependent. After obtaining the prior and likelihood functions, various sampling methods can be used to derive the posterior distribution. Within the B-PINNs framework, efficient posterior sampling is essential for parameter estimation. HMC enables thorough posterior exploration, while EKI provides a gradient-free iterative update. Excessive trainable parameters may reduce sampling accuracy and increase computational cost. QNNs can serve as a partial remedy by lowering the number of parameters, potentially improving sampling efficiency and estimation quality.

2.2. Hamiltonian Monte Carlo

Hamiltonian Monte Carlo (HMC) is an efficient Markov Chain Monte Carlo (MCMC) method specifically designed for sampling from complex high-dimensional probability distributions, and has been applied to inverse problems in Bayesian Neural Networks. By introducing auxiliary momentum variables and simulating Hamiltonian dynamics, HMC is capable of generating states that are widely separated while maintaining a high acceptance probability.

Suppose that the target posterior distribution of

ξ

conditioned on the observations

D_{u}, D_{f}, D_{b}

is given by

p (ξ ∣ D_{u}, D_{f}, D_{b}) ≃ exp (- V (ξ)),

(6)

where

V (ξ) = - ln p (D_{u}, D_{f}, D_{b} | ξ) - ln p (ξ)

represents the potential energy. Then the Hamiltonian dynamics can be defined as follows:

H (ξ, r) = V (ξ) + \frac{1}{2} r^{T} M^{- 1} r,

(7)

where

r

is an auxiliary momentum vector, and

M

is the corresponding mass matrix, which is set to be the identity matrix

I

.

\frac{1}{2} r^{T} M^{- 1} r

represents the kinetic energy. The Hamiltonian dynamics evolve the system according to

\begin{matrix} d ξ & = - M^{- 1} r d t, \\ d r & = - \nabla V (ξ) d t . \end{matrix}

(8)

We use the Leapfrog integration to perform updating, and the Metropolis–Hastings acceptance test determines whether the proposed sample is accepted. The complete HMC procedure is summarized in Algorithm 1.

2.3. Ensemble Kalman Inversion

Ensemble Kalman Inversion (EKI) is a gradient-free inversion method based on an ensemble of samples, originally developed from the Ensemble Kalman Filter (EnKF). Unlike traditional Bayesian inverse problem solutions, EKI does not require explicit gradient computation. Instead, it iteratively updates an ensemble of samples under observation constraints, progressively approximating the high-probability region of the posterior distribution.

For an inverse problem with known observation data

y_{obs}

and a forward model

G

, the goal is to estimate the unknown parameters

ξ

:

y_{obs} = G (ξ) + η,

(9)

where

η \sim N (0, R)

is the observation noise, and R is the observation covariance matrix. According to Bayes’ theorem, the posterior can be expressed in the following form:

p (ξ ∣ y_{obs}) \propto p (y_{obs} ∣ ξ) p (ξ) .

(10)

Algorithm 1 Hamiltonian Monte Carlo (HMC)

Require: initial states $ξ^{0}$ , time step size $δ t$ , leapfrog steps I, total running steps J
for $j = 0, 1, \dots, (J - 1)$ do
Sample $r^{j}$ from $N (0, M)$
$ξ_{0} \leftarrow ξ^{j}$
$r_{0} \leftarrow r^{j}$
for $i = 0, 1, \dots, (I - 1)$ do
$r_{i} \leftarrow r_{i} - \frac{δ t}{2} \nabla V (ξ_{i})$
$ξ_{i + 1} \leftarrow ξ_{i} + δ t M^{- 1} r_{i}$
$r_{i + 1} \leftarrow r_{i} - \frac{δ t}{2} \nabla V (ξ_{i + 1})$
end for
Sample $p$ from $U (0, 1)$
$α \leftarrow min [1, exp (H (ξ_{I}, r_{I}) - H (ξ^{j}, r^{j}))]$
if $p \geq α$ then
$ξ^{j + 1} \leftarrow ξ_{I}$
else
$ξ^{j + 1} \leftarrow ξ^{j}$
end if
end for
Return: $ξ^{1}, \dots, ξ^{J}$

Assuming the prior distribution of the parameters as

ξ \sim N (ξ_{0}, C_{0})

, the likelihood can be expressed as

p (y_{obs} ∣ ξ) \propto exp (- \frac{{∥R^{- 1 / 2} (y - G (ξ))∥}_{2}^{2}}{2}) .

(11)

To facilitate iterative inversion, we reformulate the Bayesian inverse problem as an artificial dynamical system:

\begin{matrix} ξ_{i} & = ξ_{i - 1} + ϵ_{i}, & ϵ_{i} & \sim N (0, Q), \\ y_{i} & = G (ξ_{i}) + η_{i}, & η_{i} & \sim N (0, R) . \end{matrix}

(12)

where

ϵ_{i}

represents an artificial parameter noise with covariance Q and

η_{i}

denotes the observation error with covariance R. In this formulation, the parameters are treated as state variables that evolve incrementally through the iterative process, while the observation equations remain unchanged. To efficiently approximate the posterior distribution, EKI employs the Kalman gain formula for Gaussian posterior distributions to iteratively update the sample set as in Ref. [9]. Let

{\{ξ_{0}^{j}\}}_{j = 1}^{J}

denote the initial ensemble of J members drawn from the prior distribution. Then, at iteration i, the j-th ensemble member is updated as

ξ_{i + 1}^{j} = ξ_{i}^{j} + ϵ_{i}^{j} + C_{i}^{ξ y} {(C_{i}^{y y} + R)}^{- 1} (y_{obs} + η_{i}^{j} - G (ξ_{i}^{j})),

(13)

where

ϵ_{i}^{j}

and

η_{i}^{j}

represent the corresponding parameter and the observation noise. The sample covariances are defined as

\begin{matrix} C_{i}^{y y} & = \frac{1}{J - 1} \sum_{j = 1}^{J} (y_{i}^{j} - {\bar{y}}_{i}) {(y_{i}^{j} - {\bar{y}}_{i})}^{T}, \\ C_{i}^{ξ y} & = \frac{1}{J - 1} \sum_{j = 1}^{J} (ξ_{i}^{j} - {\bar{ξ}}_{i}) {(y_{i}^{j} - {\bar{y}}_{i})}^{T} . \end{matrix}

(14)

At each iteration, the ensemble is updated according to the EKI scheme, producing a new set of parameter samples that progressively approximate the target posterior distribution.

2.4. Quantum Model

The quantum model

f_{θ} (x)

is formally defined as the expectation value of an observable O with respect to a state evolved by the quantum circuit [14]

U (x, θ)

as follows:

f_{θ} (x) = 〈 0 | U^{†} (x, θ) O U (x, θ) | 0 〉,

(15)

where

| 0 〉

denotes the initial quantum state and

U (x, θ)

is the quantum circuit, which depends on the input

x

and the parameter

θ

, and takes the following form:

U (x, θ) = W^{(L + 1)} (θ) S (x) W^{(L)} (θ) \dots W^{(2)} (θ) S (x) W^{(1)} (θ) .

(16)

The quantum circuit is composed of sequential layers L, where each layer includes a data-encoding circuit block

S (x)

and a trainable variational circuit block

W (θ)

. An example quantum circuit with 3 qubits and 2 sequential layers is shown in Figure 1.

The trainable and encoding layers of the quantum circuit are constructed using rotational gates [23]. The matrix representation of these gates is given below:

\begin{matrix} R_{X} (θ) = (\begin{matrix} cos (\frac{θ}{2}) & - i sin (\frac{θ}{2}) \\ - i sin (\frac{θ}{2}) & cos (\frac{θ}{2}) \end{matrix}), \\ R_{Y} (θ) = (\begin{matrix} cos (\frac{θ}{2}) & - sin (\frac{θ}{2}) \\ sin (\frac{θ}{2}) & cos (\frac{θ}{2}) \end{matrix}), \\ R_{Z} (θ) = (\begin{matrix} exp (- i \frac{θ}{2}) & 0 \\ 0 & exp (i \frac{θ}{2}) \end{matrix}) . \end{matrix}

(17)

Applying these rotation gates to the basis states

| 0 〉

and

| 1 〉

produces:

\begin{matrix} R_{X} (θ) | 0 〉 & = cos (\frac{θ}{2}) | 0 〉 - i sin (\frac{θ}{2}) | 1 〉, \\ R_{X} (θ) | 1 〉 & = - i sin (\frac{θ}{2}) | 0 〉 + cos (\frac{θ}{2}) | 1 〉, \\ R_{Y} (θ) | 0 〉 & = cos (\frac{θ}{2}) | 0 〉 + sin (\frac{θ}{2}) | 1 〉, \\ R_{Y} (θ) | 1 〉 & = - sin (\frac{θ}{2}) | 0 〉 + cos (\frac{θ}{2}) | 1 〉, \\ R_{Z} (θ) | 0 〉 & = [cos (\frac{θ}{2}) - i sin (\frac{θ}{2})] | 0 〉, \\ R_{Z} (θ) | 1 〉 & = [cos (\frac{θ}{2}) + i sin (\frac{θ}{2})] | 1 〉 . \end{matrix}

(18)

The data-encoding block

S (x)

is mathematically defined as a tensor product of rotations

R_{z}

:

\begin{matrix} S (x) = ⨂_{i = 1}^{n} R_{Z} (x_{i}) \end{matrix}

(19)

where n is the number of qubits and

x_{i}

is the i-th component of the input vector

x

. Applying the encoding block to the initial n-qubit state

⨂_{i = 1}^{n} | 0 〉

, we obtain the encoded state

| x 〉 = ⨂_{i = 1}^{n} | x_{i} 〉

as follows:

\begin{matrix} | x 〉 = S (x) (⨂_{i = 1}^{n} | 0 〉) = ⨂_{i = 1}^{n} (R_{Z} (x_{i}) | 0 〉) = ⨂_{i = 1}^{n} ([cos (\frac{x_{i}}{2}) - i sin (\frac{x_{i}}{2})] | 0 〉) . \end{matrix}

(20)

We use strongly entangling layers [24] as the trainable circuit block and training blocks

W (θ)

depend on the parameters

θ

that can be classically optimized. A strongly entangling layer with three qubits that contains 9 trainable variables takes the form as in Figure 2.

Applying the strongly entangling layer to the encoded state produces the following result:

\begin{matrix} W (θ) | x 〉 = U_{C N O T}^{31} U_{C N O T}^{23} U_{C N O T}^{12} (⨂_{i = 1}^{3} R_{Z} (θ_{i}^{1}) R_{Y} (θ_{i}^{2}) R_{Z} (θ_{i}^{3}) | x_{i} 〉), \end{matrix}

(21)

where

U_{C N O T}^{i j}

denotes a controlled-NOT gate acting on qubit i (control) and j (target). Applying the quantum circuit

U (x, θ)

to the initial state

| 0 〉

produces the final state

| \hat{x} 〉

:

\begin{matrix} | \hat{x} 〉 = U (x, θ) | 0 〉 . \end{matrix}

(22)

This final state encodes both the classical input

x

and the variational parameters

θ

, and serves as the basis for subsequent measurements to obtain the model output. Then the output of the quantum model is obtained by measuring a Hermitian observable O, whose expectation value defines the model prediction [25]:

\begin{matrix} 〈 O 〉 = 〈 \hat{x} | O | \hat{x} 〉, \end{matrix}

(23)

where O is a Hermitian operator representing the measured observable. We choose O to be a tensor product of Pauli-Z operators

⨂_{i = 1}^{n} Z_{i}

, which allows efficient readout of the model output from the quantum state. In practice,

〈 O 〉

is estimated by repeated measurements of the observable O on multiple runs of the circuit.

2.5. Quantum-Encodable Bayesian PINNs Trained via Classical Ensemble Kalman Inversion

Gradient-based optimization of QNNs often encounters the barren plateau phenomenon, where the variance of gradients decays exponentially with the number of qubits or the depth of the circuit. As a result, randomly initialized circuits often produce gradients that are effectively zero, causing extremely slow or even stalled learning, particularly in deep or noisy quantum circuits. As a gradient-free update method, EKI naturally circumvents these difficulties by reducing the need for explicit gradient computation, thus avoiding the gradient concentration problems associated with barren plateaus, which enhances the training stability and accelerates the overall optimization of the QNN.

Motivated by these advantages, we incorporate QNNs into the EKI–BPINNs framework as surrogate models to solve PDE inverse problems. In this framework, the QNN surrogate approximates the forward solution of the governing PDE. Its parameters and the unknown PDE parameters are treated as part of the ensemble, which evolves during the inversion process. The EKI update infers the unknown physical parameters by assimilating noisy observational data while maintaining ensemble diversity. Furthermore, to fully exploit the input data for the QNN, we first perform a classical preprocessing step that constructs the network inputs as learnable linear combinations of the raw PDE data. Specifically, the QNN input

H_{i n}

is defined as

H_{i n} = H_{0} W + b

, where

H_{0}

represents the raw data, including residual data, boundary-condition data and solution measurement, and W and b are learnable parameters, which serve as a classical linear layer to map the raw data into a vector whose dimension matches the number of qubits used in the quantum circuit.

Algorithm 2 summarizes the complete training and inversion procedure. In this study, all computational stages are implemented and executed on classical hardware, where the QNN is simulated using the PennyLane v0.43 [26] quantum circuit simulator. In the current framework, the inversion algorithm, ensemble perturbation, and residual-based updates are entirely classical, while the quantum role is confined to the QNN-based surrogate representation and its circuit evaluation.

The classical–quantum interaction is organized as an iterative parameter-update scheme. In each step, the BPINNs input is mapped to a quantum state using angle encoding, and the parameterized variational layers transform the state according to the current QNN weights. The circuit is evaluated repeatedly with several measurement shots to estimate expectation values, which serve as the circuit output. These outputs are passed to the classical EKI routine, where the residual between the observations and the quantum surrogate predictions is used to generate the next ensemble update.

Within B-PINNs, the parameters

ξ = \{θ, λ\}

are considered, with the forward operator

G

defining the model mapping. The quantity

y_{i}^{j}

represents the result of evaluating the forward model

G (ξ_{i}^{j})

in one step, which is defined as

\begin{matrix} y_{i}^{j} = G (ξ_{i}^{j}) = [\begin{matrix} u (x; θ_{i}^{j}) \\ N_{x} (u (x; θ_{i}^{j}); λ_{i}^{j}) \\ B_{x} (u (x; θ_{i}^{j}); λ_{i}^{j}) \end{matrix}] . \end{matrix}

(24)

Algorithm 2 Quantum-Encodable Bayesian PINNs trained via Classical Ensemble Kalman Inversion (QEKI)

Require: Observations $y_{obs}$ , initial J ensemble states ${\{ξ_{0}^{j}\}}_{j = 1}^{J}$ , observation covariance R, parameter covariance Q, training data points $x$ , iteration index $i = 0$ .
while not converge do
for $j = 1, 2, \dots, J$ do
Sample $ϵ_{i}^{j}$ from $N (0, Q)$
Update each ensemble state $ξ_{i}^{j} \leftarrow ξ_{i}^{j} + ϵ_{i}^{j}$
$(θ_{i}^{j}, λ_{i}^{j}) \leftarrow ξ_{i}^{j}$
Apply quantum circuit $| \hat{x} 〉 \leftarrow U (x, θ) | 0 〉$
Evaluate the expectation value ${〈 O 〉}_{i}^{j} \leftarrow 〈 \hat{x} | O | \hat{x} 〉$
$y_{i}^{j} \leftarrow N_{x} ({〈 O 〉}_{i}^{j}; λ_{i}^{j})$
end for
$C_{i}^{y y} \leftarrow \frac{1}{J - 1} \sum_{j = 1}^{J} (y_{i}^{j} - {\bar{y}}_{i}) {(y_{i}^{j} - {\bar{y}}_{i})}^{T}$
$C_{i}^{ξ y} \leftarrow \frac{1}{J - 1} \sum_{j = 1}^{J} (ξ_{i}^{j} - {\bar{ξ}}_{i}) {(y_{i}^{j} - {\bar{y}}_{i})}^{T}$
for $j = 1, 2, \dots, J$ do
Sample $η_{i}^{j}$ from $N (0, R)$
$ξ_{i + 1}^{j} \leftarrow ξ_{i}^{j} + C_{i}^{ξ y} {(C_{i}^{y y} + R)}^{- 1} (y_{obs} + η_{i}^{j} - y_{i}^{j})$
end for
end while
$\hat{I} \leftarrow i + 1$
Return: $ξ_{\hat{I}}^{1}, \dots, ξ_{\hat{I}}^{J}$

Correspondingly, the observations defined in (2) can be formulated as follows:

\begin{matrix} y_{obs} = {[\begin{matrix} {\{u^{i}\}}_{i = 1}^{N_{u}} {\{f^{i}\}}_{i = 1}^{N_{f}} {\{b^{i}\}}_{i = 1}^{N_{b}} \end{matrix}]}^{T} . \end{matrix}

(25)

The physics-based residuals are evaluated via PennyLane’s interface, which maps the quantum circuit to a classically differentiable node. This setup enables the use of classical automatic differentiation while maintaining compatibility with native quantum gradient methods, such as the parameter-shift rule, for future execution on quantum processing units.

The choice of covariance matrices Q and R can significantly affect the performance of the algorithm. Previous studies have explored automatic estimation techniques for these matrices. In this work, following the strategy proposed in Ref. [6], we set R to be the covariance matrix as defined in (5):

R = [\begin{matrix} σ_{u}^{2} I_{N_{u}} & 0 & 0 \\ 0 & σ_{f}^{2} I_{N_{f}} & 0 \\ 0 & 0 & σ_{b}^{2} I_{N_{b}} \end{matrix}]

(26)

An appropriate choice of Q is essential to preserve ensemble spread, so we set Q to be

Q = [\begin{matrix} σ_{θ}^{2} I_{N_{θ}} & 0 \\ 0 & σ_{λ}^{2} I_{N_{λ}} \end{matrix}]

(27)

where

σ_{θ}

and

σ_{λ}

represent the standard deviations of the artificial process noise introduced to preserve ensemble spread for the QNN parameters

θ

and the physical parameters

λ

, respectively.

To determine when QEKI iterations should terminate, we employ a discrepancy-based stopping rule inspired by classical iterative regularization methods. The basic idea is to monitor how well the prediction of the current ensemble-averaged model matches the observed data, measured by a weighted residual norm as defined in Ref. [6]. Let

\begin{matrix} D_{i} = ∥R^{- 1 / 2} (y_{obs} - \frac{1}{J} \sum_{j = 1}^{J} y_{i}^{j})∥, \end{matrix}

(28)

denote the discrepancy at iteration i, where

y_{i}^{j}

is the predicted observation produced by the j-th ensemble member at iteration i. Specifically, we define a sliding window of length W and terminate the iteration once the discrepancy no longer exhibits sufficient improvement, i.e.,

\begin{matrix} max_{j \in {i - W, \dots, i}} \frac{|D_{j} - D_{i}|}{D_{i}} < τ . \end{matrix}

(29)

Complexity Analysis. The computational cost of QEKI is lower than that of HMC, as it performs direct ensemble updates. The per-iteration computational complexity of QEKI is given by:

\begin{matrix} O (N_{y}^{3} + J N_{y} N_{ξ} + J N_{y}^{2}) . \end{matrix}

(30)

This complexity can be decomposed into the following components:

(1): $O (J N_{y}^{2})$ —Cost of constructing the observation covariance matrix $C_{i}^{y y}$
(2): $O (J N_{y} N_{ξ})$ —Cost of constructing the observation covariance matrix $C_{i}^{ξ y}$
(3): $O (N_{y}^{3} + J N_{y} N_{ξ} + J N_{y}^{2})$ —Cost of updating all ensemble members via the Kalman gain, $C_{i}^{ξ y} {(C_{i}^{y y} + R)}^{- 1} (y_{obs} + η_{i}^{j} - y_{i}^{j})$ .

In conclusion, the computational analysis indicates that the primary bottleneck of QEKI lies in the observation dimension

N_{y}

, specifically due to the

O (N_{y}^{3})

cost associated with the matrix inversion in the Kalman gain. Conversely, the algorithm exhibits linear scalability with respect to both the high-dimensional parameter space

N_{ξ}

and the ensemble size J. This structural characteristic allows for rapid ensemble updates without prohibitive computational costs.

This completes the description of the proposed QEKI framework. The next section presents numerical experiments that demonstrate its performance on representative PDE inverse problems.

3. Experiments

In this section, we present the experimental setup and numerical studies to evaluate the performance of the proposed QEKI framework. To ensure a fair and systematic comparison, we use the classical HMC-based B-PINNs as a baseline. This allows us to evaluate how well the quantum-encodable architecture can reproduce the posterior distribution under a well-understood classical inversion procedure. We note that due to current quantum-training bottlenecks and hardware limitations, QEKI does not achieve the same training speed as classical EKI-trained DNN surrogates. Nevertheless, a key advantage of the proposed approach is that it can maintain comparable posterior accuracy while using fewer trainable parameters, demonstrating the representational efficiency of the quantum-encodable architecture. All computations are executed on the open-source PennyLane v0.43 [26] and JAX 0.8.0 [27] platforms on a single GPU (NVIDIA GeForce RTX 4090 with 24 GB of memory).

In the HMC case, we adopt the variant of HMC described in [28]. The surrogate model is a neural network with two hidden layers, each containing 50 neurons and equipped with the activation function tanh. The number of neural network parameters is listed for each experiment. For the HMC implementation, the leapfrog integration step is set to

I = 50

, the initial time step is

δ t = 0.1

, the burn-in period consists of 1000 steps, and a total of 2000 posterior samples are collected.

In the EKI case, we employ a QNN architecture, as described in Ref. [14], consisting of four qubits and six entangling layers, while in Experiment 3, we use a circuit with six qubits and eight entangling layers to accommodate the increased model complexity introduced by the nonlinear advection term in the Burgers equation. The size of the ensemble is set to

J = 200

, and the initial ensemble states

{\{ξ_{0}^{j}\}}_{j = 1}^{J}

are drawn from the prior distribution, the standard deviations of artificial dynamics of the parameters are chosen to be

σ_{λ} = 0.01

and

σ_{θ} = 0.01

. For the stopping rule, we choose the size of the sliding window to be

W = 25

and the tolerance

τ = 0.05

.

Synthetic observation data are generated by first solving the target PDEs and then adding Gaussian noise with two different levels (

σ_{u} = σ_{f} = σ_{b}

= 0.1 and 0.01) to test robustness against observation uncertainty. For the 1D problems, both the solution measurements and the PDE residual points are placed on uniformly spaced grids. For the 2D examples, the solution measurements are randomly sampled in the physical domain, while the residual points inside the domain are generated using Latin hypercube sampling, and the boundary points are uniformly sampled along the boundary. The priors are all standard Gaussian distributions with zero mean.

To assess the accuracy of the solution and the estimation of the physical parameter, we compute the following error:

e_{u} = \frac{∥ u - \bar{u} ∥_{L_{2}}}{{∥ u ∥}_{L_{2}}}, e_{λ} = \frac{∥ λ - \bar{λ} ∥_{L_{1}}}{{∥ λ ∥}_{L_{1}}},

(31)

where

u

and

λ

are the reference solution and reference physical parameter,

\bar{u}

and

\bar{λ}

are the sample means of the approximate solution and approximate physical parameter. In all experiments considered in this work, we restrict

λ

to a scalar parameter. To evaluate computational performance, we compare the mean walltime of QEKI and HMC over 10 independent trials. We further compare the number of trainable parameters, with the numbers in parentheses indicating the total number of parameters in the linear classical layers at the input of the QNN.

3.1. One-Dimensional Nonlinear Poisson Equation

We first consider the 1D nonlinear Poisson problem:

λ u_{x x} + k tanh (u) = f, x \in [- 0.7, 0.7] .

(32)

In this experiment, we set

λ = 0.01

, and set

k = 0.7

as the unknown physical parameter. The forward problem admits the analytical solution

u (x) = s i n^{3} (x)

, from which the source term and the boundary conditions are exactly derived. The inverse task is to recover the parameter k and the solution

u (x)

, together with their associated uncertainty estimates. To enforce the PDE constraints, we employ 8 interior points, 2 boundary points, and 32 collocation points.

As shown in Table 1, both QEKI and HMC achieve an accurate recovery of the physical parameter k, with the true value

k = 0.7

consistently lying within one standard deviation band of their posterior mean estimates in both noise settings. This indicates that both Bayesian frameworks can provide statistically reliable uncertainty quantification for the inverse problem.

Figure 3 further compares the sample mean and standard deviation of the reconstructed surrogate solutions; the QEKI results show a consistently smaller posterior variance.

Table 2 further reports the relative errors in the estimated means of u and k, together with the computational walltime and the number of trainable parameters for each method. Notably, QEKI achieves comparable or better accuracy while requiring substantially fewer trainable parameters and reduced computational cost, highlighting its potential as a lightweight yet effective quantum-assisted inversion framework.

3.2. Two-Dimensional Nonlinear Diffusion–Reaction Equation

Here, we consider the following 2D nonlinear PDE:

\begin{matrix} λ Δ u + k u^{2} & = f, (x, y) \in {[- 1, 1]}^{2}, \\ u (x, - 1) & = u (x, 1) = 0, \\ u (- 1, y) & = u (1, y) = 0 . \end{matrix}

(33)

In this experiment, we set

λ = 0.01

and set

k = 1

as the unknown physical parameter to be inferred. The forward problem admits the analytical solution

u (x) = s i n (π x) s i n (π y)

, from which the source term and the boundary conditions are analytically derived. The inverse problem aims to recover both the parameter k and the solution

u (x)

with uncertainty estimates. To enforce the PDE constraints, we employ 100 interior points, 100 boundary points, and 100 collocation points. The true solution and the observation points are illustrated in Figure 4.

Table 3 shows that both QEKI and HMC successfully recover the physical parameter k, with the true value

k = 1

consistently lying within the two standard deviation bands of their posterior mean estimates in both noise settings. This indicates that both Bayesian frameworks provide reliable uncertainty quantification for the inverse problem.

Figure 5 compares the sample mean and standard deviation of the surrogate solutions obtained by the two approaches. The QEKI mean closely matches the reference solution, demonstrating its effectiveness in reconstructing the forward solution, while HMC similarly captures the overall trend but exhibits slightly larger posterior variance.

Table 4 shows the relative error of the estimated means for both u and k, along with the computational walltime and the number of trainable parameters for each method. Notably, QEKI achieves comparable or slightly better accuracy while requiring fewer trainable parameters and reduced computational cost, highlighting its potential as a computationally efficient quantum-assisted inversion framework. These results collectively demonstrate that QEKI can provide accurate and stable posterior estimates for both the physical parameter and the surrogate solution, even under different noise levels.

3.3. Burgers Equation

Here, we consider the following Burgers equation as presented in Ref. [15]:

u_{t} + u u_{x} = ν u_{x x},

(34)

with a viscosity of

ν = 0.01

,which we set to be the unknown physical parameter, in a computational domain of

(x, t) = [- 1, 1] \times [0, 1]

. The exact solution under the Dirichlet boundary condition is given by

u (x, t) = \frac{\frac{x}{t + 1}}{1 + \sqrt{\frac{t + 1}{t_{0}}} e^{\frac{x^{2}}{4 ν (t + 1)}}},

(35)

with

t_{0} = exp (\frac{1}{8 μ})

. The initial and boundary conditions can be derived exactly. To enforce the PDE constraints, we employ 50 initial points, 100 boundary points, 200 collocation points, and 200 interior points. The true solution and the observation points are illustrated in Figure 6.

Table 5 shows that both QEKI and HMC successfully recover the physical parameter

ν

, with the true value

ν = 0.01

consistently lying within the one standard deviation interval of the posterior mean under both noise levels. This indicates that both Bayesian inversion frameworks remain effective even in the presence of nonlinearity introduced by the Burgers equation.

Figure 7 further compares the surrogate predictions obtained from the two methods. The QEKI produces a mean solution that closely matches the reference, while the HMC captures the overall trend.

In addition, Table 6 summarizes the relative errors of the posterior mean estimates for both u and

ν

, together with the computational wall-time and the number of trainable parameters for each method. Notably, although the QNN architecture is enlarged for this nonlinear problem, the computational speed of QEKI experiences a slight decrease due to quantum hardware limitations, yet it still requires fewer trainable parameters compared to HMC. These results indicate that QEKI can reliably provide accurate posterior estimates for both the physical parameter and the surrogate solution, even in more challenging nonlinear scenarios, while maintaining overall computational efficiency.

3.4. Capacity Analysis

The focus of this experiment is to assess representational efficiency under a similar number of trainable parameters for the classical DNN-based method in comparison with the proposed QEKI architecture. In particular, we first examine whether the DNN model remains trainable with which architectures are constrained to a similar parameter budget as QEKI. In practice, we observe that the DNN-based method exhibits insufficient expressive power under this equal-capacity constraint and fails to achieve stable convergence, resulting in unreliable posterior estimates. In contrast, the QEKI model remains trainable and produces stable posterior inference under the same parameter budget. For this reason, additional DNN-based models with increased parameter counts are included to evaluate how much model capacity is required for classical architectures to recover comparable posterior accuracy and training behavior.

Specifically, we consider neural network architectures of three different sizes, corresponding to two-hidden-layer fully connected networks with 10, 30, and 50 neurons per hidden layer, resulting in 152, 1052, and 2752 trainable parameters, respectively. For each network configuration, both EKI and HMC are employed as inference engines. All experiments are conducted on the Burgers equation, as in Section 3.3, with the observation noise level fixed to 0.01. In all cases, the associated hyperparameters are selected following standard practice and kept consistent across models to ensure a fair comparison.

As shown in Table 7, we observe that networks with 10 and 30 neurons per layer fail to achieve stable convergence, while the network with 50 neurons per layer successfully converges and produces posterior estimates comparable to those obtained with QEKI, as shown in Figure 8. These results indicate that the QEKI architecture can maintain meaningful posterior inference under a much more restrictive parameter budget, highlighting the representational efficiency of QEKI.

4. Conclusions

This work introduced QEKI, a hybrid inversion framework that combines Ensemble Kalman Inversion (EKI) with a Quantum Neural Network (QNN) to solve PDE-based inverse problems. Experiments demonstrate that QEKI can achieve the same level of accuracy as classical EKI-based approaches while utilizing significantly fewer trainable parameters. To address current difficulties in training QNNs with gradients, EKI, a gradient-free optimization strategy for QNN parameter updates, demonstrates that reliable inversion can still be achieved without back-propagated quantum gradients.

The study shows promise, but there are still constraints that need to be addressed.

(1): The feasible circuit size is capped by the number of qubits and circuit depth. These two factors set the practical ceiling for what we can simulate today rather than the learning method itself. As the circuit grows, the simulation burden quickly outweighs the benefit of testing larger models.
(2): High-dimensional PDE inverse problems cannot yet be handled directly, since current QNN encoding capacity is limited by available qubits.
(3): The EKI update process can be unstable, and we observed that QEKI inherits this issue, leading to fluctuating convergence behavior in some cases.

Looking ahead, progress can be made even before quantum hardware scales up. A promising direction is to explore more efficient quantum network designs—for example, circuit ansatzes that are smaller, better structured, and tailored to PDE solution patterns. By improving how information and parameters are arranged in the circuit, we hope to expand simulation capacity without relying solely on adding qubits or depth. Second, strengthening the stability of the EKI update, for example, by adaptive ensemble re-scaling, improved noise modeling, or regularized Kalman updates, could lead to more predictable convergence. Finally, to evaluate high-dimensional inverse tasks under qubit constraints, dimensionality reduction techniques, such as KL expansion, VAE, or other latent-space parameterizations, can be incorporated, enabling QEKI to test surrogate inversion of high-dimensional parameters in a lower-dimensional space.

In summary, QEKI demonstrates that quantum models can achieve classical inversion accuracy with fewer parameters, and that QNNs can be optimized without requiring explicit quantum gradients. Future work should focus on improving model scalability, algorithm stability, and testing high-dimensional problems through the use of reduced representations.

Author Contributions

Conceptualization, J.Y. and S.T.; methodology, J.Y. and S.T.; software, J.Y.; validation, J.Y.; formal analysis, J.Y.; writing—original draft preparation, J.Y. and S.T.; writing—review and editing, J.Y. and S.T.; visualization, J.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Data is contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Karniadakis, G.E.; Kevrekidis, I.G.; Lu, L.; Perdikaris, P.; Wang, S.; Yang, L. Physics-informed machine learning. Nat. Rev. Phys. 2021, 3, 422–440. [Google Scholar] [CrossRef]
Raissi, M.; Perdikaris, P.; Karniadakis, G.E. Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. J. Comput. Phys. 2019, 378, 686–707. [Google Scholar] [CrossRef]
Lu, L.; Jin, P.; Karniadakis, G.E. Deeponet: Learning nonlinear operators for identifying differential equations based on the universal approximation theorem of operators. arXiv 2019, arXiv:1910.03193. [Google Scholar]
Li, Z.; Kovachki, N.; Azizzadenesheli, K.; Liu, B.; Bhattacharya, K.; Stuart, A.; Anandkumar, A. Fourier neural operator for parametric partial differential equations. arXiv 2020, arXiv:2010.08895. [Google Scholar]
Yang, L.; Meng, X.; Karniadakis, G.E. B-PINNs: Bayesian physics-informed neural networks for forward and inverse PDE problems with noisy data. J. Comput. Phys. 2021, 425, 109913. [Google Scholar] [CrossRef]
Pensoneault, A.; Zhu, X. Efficient Bayesian Physics Informed Neural Networks for inverse problems via Ensemble Kalman Inversion. J. Comput. Phys. 2024, 508, 113006. [Google Scholar] [CrossRef]
Gao, Z.; Karniadakis, G.E. Scalable Bayesian Physics-Informed Kolmogorov-Arnold Networks. SIAM-ASA J. Uncertain. Quantif. 2025, 13, 1543–1577. [Google Scholar] [CrossRef]
Iglesias, M.A.; Law, K.J.H.; Stuart, A.M. Ensemble Kalman methods for inverse problems. Inverse Probl. 2013, 29, 045001. [Google Scholar] [CrossRef]
Kovachki, N.B.; Stuart, A.M. Ensemble Kalman inversion: A derivative-free technique for machine learning tasks. Inverse Probl. 2019, 35, 095005. [Google Scholar] [CrossRef]
Schuld, M.; Sinayskiy, I.; Petruccione, F. The quest for a Quantum Neural Network. Quantum Inf. Process. 2014, 13, 2567–2586. [Google Scholar] [CrossRef]
Du, Y.; Hsieh, M.H.; Liu, T.; You, S.; Tao, D. Learnability of Quantum Neural Networks. PRX Quantum 2021, 2, 040337. [Google Scholar] [CrossRef]
Panella, M.; Martinelli, G. Neural networks with quantum architecture and quantum learning. Int. J. Circ. Theor. App. 2011, 39, 61–77. [Google Scholar] [CrossRef]
Rigatos, G.; Tzafestas, S. Neurodynamics and attractors in quantum associative memories. Integr. Comput.-Aided Eng. 2007, 14, 225–242. [Google Scholar] [CrossRef]
Schuld, M.; Sweke, R.; Meyer, J.J. Effect of data encoding on the expressive power of variational quantum-machine-learning models. Phys. Rev. A 2021, 103, 032430. [Google Scholar] [CrossRef]
Xiao, Y.; Yang, L.M.; Shu, C.; Chew, S.C.; Khoo, B.C.; Cui, Y.D.; Liu, Y.Y. Physics-informed quantum neural network for solving forward and inverse problems of partial differential equations. Phys. Fluids 2024, 36, 097145. [Google Scholar] [CrossRef]
Berger, S.; Hosters, N.; Möller, M. Trainable embedding quantum physics informed neural networks for solving nonlinear PDEs. Sci. Rep. 2025, 15, 18823. [Google Scholar] [CrossRef]
Trahan, C.; Loveland, M.; Dent, S. Quantum Physics-Informed Neural Networks. Entropy 2024, 26, 649. [Google Scholar] [CrossRef]
McClean, J.R.; Boixo, S.; Smelyanskiy, V.N.; Babbush, R.; Neven, H. Barren plateaus in quantum neural network training landscapes. Nat. Commun. 2018, 9, 4812. [Google Scholar] [CrossRef]
Larocca, M.; Thanasilp, S.; Wang, S.; Sharma, K.; Biamonte, J.; Coles, P.J.; Cincio, L.; McClean, J.R.; Holmes, Z.; Cerezo, M. Barren plateaus in variational quantum computing. Nat. Rev. Phys. 2025, 7, 174–189. [Google Scholar] [CrossRef]
Grant, E.; Wossnig, L.; Ostaszewski, M.; Benedetti, M. An initialization strategy for addressing barren plateaus in parametrized quantum circuits. Quantum 2019, 3, 214. [Google Scholar] [CrossRef]
Sack, S.H.; Medina, R.A.; Michailidis, A.A.; Kueng, R.; Serbyn, M. Avoiding Barren Plateaus Using Classical Shadows. PRX Quantum 2022, 3, 020365. [Google Scholar] [CrossRef]
Patti, T.L.; Najafi, K.; Gao, X.; Yelin, S.F. Entanglement devised barren plateau mitigation. Phys. Rev. Res. 2021, 3, 033090. [Google Scholar] [CrossRef]
Nielsen, M.A.; Chuang, I.L. Quantum Computation and Quantum Information; Cambridge University Press: Cambridge, UK, 2010. [Google Scholar]
Schuld, M.; Bocharov, A.; Svore, K.M.; Wiebe, N. Circuit-centric quantum classifiers. Phys. Rev. A 2020, 101, 032308. [Google Scholar] [CrossRef]
Farhi, E.; Neven, H. Classification with Quantum Neural Networks on Near Term Processors. arXiv 2018, arXiv:1802.06002. [Google Scholar] [CrossRef]
Bergholm, V.; Izaac, J.; Schuld, M.; Gogolin, C.; Ahmed, S.; Ajith, V.; Alam, M.S.; Alonso-Linaje, G.; AkashNarayanan, B.; Asadi, A.; et al. Pennylane: Automatic differentiation of hybrid quantum-classical computations. arXiv 2018, arXiv:1811.04968. [Google Scholar]
Bradbury, J.; Frostig, R.; Hawkins, P.; Johnson, M.J.; Leary, C.; Maclaurin, D.; Necula, G.; Paszke, A.; VanderPlas, J.; Wanderman-Milne, S.; et al. JAX: Composable Transformations of Python+NumPy Programs, Version 0.2.5; Google LLC: Mountain View, CA, USA, 2018. [Google Scholar]
Zou, Z.; Meng, X.; Psaros, A.F.; Karniadakis, G.E. NeuralUQ: A Comprehensive Library for Uncertainty Quantification in Neural Differential Equations and Operators. SIAM Rev. 2024, 66, 161–190. [Google Scholar] [CrossRef]

Figure 1. Quantum model architecture with 3 qubits, 2 sequential layers, and a measurement layer.

Figure 2. Circuit layout of a three-qubit strongly entangling layer using rotation gates and CNOT gates.

Figure 3. Section 3.1: Reference solution, observation samples, sample mean and standard deviation by QEKI and HMC for

σ_{u}

= 0.01, 0.1 noise levels.

Figure 3. Section 3.1: Reference solution, observation samples, sample mean and standard deviation by QEKI and HMC for

σ_{u}

= 0.01, 0.1 noise levels.

Figure 4. Section 3.2: Observation points of the forward solution and the corresponding boundary measurements (represented by black dots).

Figure 5. Section 3.2: Comparison of QEKI and HMC under different noise levels

σ_{u} = 0.01, 0.1

. The prediction values obtained by both methods (top row), the corresponding standard deviations (middle row), the absolute error between the ground-truth solution (bottom row).

Figure 5. Section 3.2: Comparison of QEKI and HMC under different noise levels

σ_{u} = 0.01, 0.1

. The prediction values obtained by both methods (top row), the corresponding standard deviations (middle row), the absolute error between the ground-truth solution (bottom row).

Figure 6. Section 3.3: Observation points of the forward solution and the corresponding initial and boundary measurements (represented by black dots).

Figure 7. Section 3.3: Comparison of QEKI and HMC under different noise levels

σ_{u} = 0.01, 0.1

. The prediction values obtained by both methods (top row), the corresponding standard deviations (middle row), the absolute error between the ground-truth solution (bottom row).

Figure 7. Section 3.3: Comparison of QEKI and HMC under different noise levels

σ_{u} = 0.01, 0.1

. The prediction values obtained by both methods (top row), the corresponding standard deviations (middle row), the absolute error between the ground-truth solution (bottom row).

Figure 8. Comparison of QEKI and HMC, EKI with 50-neuron-DNN under noise levels

σ_{u} = 0.01

. The prediction values obtained by these methods (top row), the corresponding standard deviations (middle row), the absolute error between the ground-truth solution (bottom row).

Figure 8. Comparison of QEKI and HMC, EKI with 50-neuron-DNN under noise levels

σ_{u} = 0.01

. The prediction values obtained by these methods (top row), the corresponding standard deviations (middle row), the absolute error between the ground-truth solution (bottom row).

Table 1. Section 3.1: Sample mean and standard deviation of parameter k for QEKI and HMC for

σ_{u}

= 0.01, 0.1 noise levels. The true value of k is 0.7.

Table 1. Section 3.1: Sample mean and standard deviation of parameter k for QEKI and HMC for

σ_{u}

= 0.01, 0.1 noise levels. The true value of k is 0.7.

		k (Mean ± Std)
$σ_{u} = 0.01$	QEKI	$0.699 \pm 0.006$
$σ_{u} = 0.01$	HMC	$0.702 \pm 0.007$
$σ_{u} = 0.1$	QEKI	$0.689 \pm 0.011$
$σ_{u} = 0.1$	HMC	$0.687 \pm 0.018$

Table 2. Section 3.1: Relative errors

e_{u}

of the forward solution u and

e_{k}

of parameter k for the noise levels

σ_{u}

= 0.01, 0.1, average walltime and trainable parameters.

Table 2. Section 3.1: Relative errors

e_{u}

of the forward solution u and

e_{k}

of parameter k for the noise levels

σ_{u}

= 0.01, 0.1, average walltime and trainable parameters.

		$e_{u}$	$e_{k}$	Walltime (s)	Trainable Parameters
$σ_{u} = 0.01$	QEKI	$1.23 %$	$0.14 %$	$13.01$	$85 (+ 8)$
$σ_{u} = 0.01$	HMC	$1.13 %$	$0.15 %$	$45.32$	5252
$σ_{u} = 0.1$	QEKI	$8.03 %$	$1.57 %$	$13.11$	$85 (+ 8)$
$σ_{u} = 0.1$	HMC	$9.21 %$	$1.85 %$	$46.02$	5252

Table 3. Section 3.2: Sample mean and standard deviation of the parameter k for QEKI and HMC for

σ_{u}

= 0.01, 0.1 noise levels. The true value of k is 1.

Table 3. Section 3.2: Sample mean and standard deviation of the parameter k for QEKI and HMC for

σ_{u}

= 0.01, 0.1 noise levels. The true value of k is 1.

		k (Mean ± Std)
$σ_{u} = 0.01$	QEKI	$0.992 \pm 0.008$
$σ_{u} = 0.01$	HMC	$0.988 \pm 0.012$
$σ_{u} = 0.1$	QEKI	$1.029 \pm 0.022$
$σ_{u} = 0.1$	HMC	$0.964 \pm 0.035$

Table 4. Section 3.2: Relative errors

e_{u}

of the forward solution u and

e_{k}

of the unknown parameter k for noise levels

σ_{u}

= 0.01, 0.1, including average walltime and number of trainable parameters.

Table 4. Section 3.2: Relative errors

e_{u}

of the forward solution u and

e_{k}

of the unknown parameter k for noise levels

σ_{u}

= 0.01, 0.1, including average walltime and number of trainable parameters.

		$e_{u}$	$e_{k}$	Walltime (s)	Trainable Parameters
$σ_{u} = 0.01$	QEKI	$1.03 %$	$0.88 %$	$23.42$	$85 (+ 12)$
$σ_{u} = 0.01$	HMC	$1.21 %$	$1.21 %$	$51.22$	5302
$σ_{u} = 0.1$	QEKI	$2.77 %$	$3.41 %$	$24.52$	$85 (+ 12)$
$σ_{u} = 0.1$	HMC	$2.82 %$	$3.62 %$	$52.03$	5302

Table 5. Section 3.3: Sample mean and standard deviation of the parameter

ν

for QEKI and HMC for

σ_{u}

= 0.01, 0.1 noise levels. The true value of

ν

is 0.01.

Table 5. Section 3.3: Sample mean and standard deviation of the parameter

ν

for QEKI and HMC for

σ_{u}

= 0.01, 0.1 noise levels. The true value of

ν

is 0.01.

		$ν (\times 10^{3})$ (Mean ± Std)
$σ_{u} = 0.01$	QEKI	$10.257 \pm 0.327$
$σ_{u} = 0.01$	HMC	$9.719 \pm 0.502$
$σ_{u} = 0.1$	QEKI	$10.614 \pm 0.823$
$σ_{u} = 0.1$	HMC	$10.740 \pm 0.908$

Table 6. Section 3.3: Relative errors

e_{u}

of the forward solution u and

e_{ν}

of the unknown parameter

ν

for noise levels

σ_{u}

= 0.01, 0.1, including average walltime and number of trainable parameters.

Table 6. Section 3.3: Relative errors

e_{u}

of the forward solution u and

e_{ν}

of the unknown parameter

ν

for noise levels

σ_{u}

= 0.01, 0.1, including average walltime and number of trainable parameters.

		$e_{u}$	$e_{ν}$	Walltime (s)	Trainable Parameters
$σ_{u} = 0.01$	QEKI	$1.41 %$	$2.57 %$	$44.88$	$163 (+ 18)$
$σ_{u} = 0.01$	HMC	$1.58 %$	$2.81 %$	$56.33$	5302
$σ_{u} = 0.1$	QEKI	$4.22 %$	$6.41 %$	$45.03$	$163 (+ 18)$
$σ_{u} = 0.1$	HMC	$4.72 %$	$7.41 %$	$56.09$	5302

Table 7. Comparison of QEKI and EKI, HMC with classical DNNs of different sizes under noise level

σ_{u}

= 0.01, together with convergence, relative errors

e_{u}

and

e_{ν}

of the unknown parameter

ν

.

Table 7. Comparison of QEKI and EKI, HMC with classical DNNs of different sizes under noise level

σ_{u}

= 0.01, together with convergence, relative errors

e_{u}

and

e_{ν}

of the unknown parameter

ν

.

	Trainable Parameters	Converged	$e_{u}$	$e_{ν}$
QEKI	163 (+18)	yes	$1.41 %$	$2.57 %$
HMC	152	no	-	-
	1052	no	-	-
	2752	yes	$1.68 %$	$2.88 %$
EKI	152	no	-	-
	1052	no	-	-
	2752	yes	$1.72 %$	$2.97 %$

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Yong, J.; Tang, S. QEKI: A Quantum–Classical Framework for Efficient Bayesian Inversion of PDEs. Entropy 2026, 28, 156. https://doi.org/10.3390/e28020156

AMA Style

Yong J, Tang S. QEKI: A Quantum–Classical Framework for Efficient Bayesian Inversion of PDEs. Entropy. 2026; 28(2):156. https://doi.org/10.3390/e28020156

Chicago/Turabian Style

Yong, Jiawei, and Sihai Tang. 2026. "QEKI: A Quantum–Classical Framework for Efficient Bayesian Inversion of PDEs" Entropy 28, no. 2: 156. https://doi.org/10.3390/e28020156

APA Style

Yong, J., & Tang, S. (2026). QEKI: A Quantum–Classical Framework for Efficient Bayesian Inversion of PDEs. Entropy, 28(2), 156. https://doi.org/10.3390/e28020156

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

QEKI: A Quantum–Classical Framework for Efficient Bayesian Inversion of PDEs

Abstract

1. Introduction

2. Methodology

2.1. Bayesian Physics-Informed Neural Network

2.2. Hamiltonian Monte Carlo

2.3. Ensemble Kalman Inversion

2.4. Quantum Model

2.5. Quantum-Encodable Bayesian PINNs Trained via Classical Ensemble Kalman Inversion

3. Experiments

3.1. One-Dimensional Nonlinear Poisson Equation

3.2. Two-Dimensional Nonlinear Diffusion–Reaction Equation

3.3. Burgers Equation

3.4. Capacity Analysis

4. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI