Quantum Simulation of Variable-Speed Multidimensional Wave Equations via Clifford-Assisted Pauli Decomposition

Arseniev, Boris; Zacharov, Igor

doi:10.3390/quantum7040047

Open AccessArticle

Quantum Simulation of Variable-Speed Multidimensional Wave Equations via Clifford-Assisted Pauli Decomposition

by

Boris Arseniev

^*

and

Igor Zacharov

Independent Researcher, Moscow 121205, Russia

^*

Author to whom correspondence should be addressed.

Quantum Rep. 2025, 7(4), 47; https://doi.org/10.3390/quantum7040047

Submission received: 12 September 2025 / Revised: 2 October 2025 / Accepted: 10 October 2025 / Published: 13 October 2025

Download

Browse Figures

Versions Notes

Abstract

The simulation of multidimensional wave propagation with variable material parameters is a computationally intensive task, with applications from seismology to electromagnetics. While quantum computers offer a promising path forward, their algorithms are often analyzed in the abstract oracle model, which can mask the high gate-level complexity of implementing those oracles. We present a framework for constructing a quantum algorithm for the multidimensional wave equation with a variable speed profile. The core of our method is a decomposition of the system Hamiltonian into sets of mutually commuting Pauli strings, paired with a dedicated diagonalization procedure that uses Clifford gates to minimize simulation cost. Within this framework, we derive explicit bounds on the number of quantum gates required for Trotter–Suzuki-based simulation. Our analysis reveals significant computational savings for structured block-model speed profiles compared to general cases. Numerical experiments in three dimensions confirm the practical viability and performance of our approach. Beyond providing a concrete, gate-level algorithm for an important class of wave problems, the techniques introduced here for Hamiltonian decomposition and diagonalization enrich the general toolbox of quantum simulation.

Keywords:

quantum algorithms; Hamiltonian simulation; Pauli decomposition; wave equation; mutual diagonalization; band matrices; commuting Pauli sets

1. Introduction

Wave equations are fundamental for modeling a wide range of physical phenomena, from acoustic and seismic wave propagation to electromagnetic fields and quantum mechanics [1,2]. Numerically simulating these equations, especially in high dimensions and in the presence of heterogeneous material properties, remains computationally demanding for classical computers. While finite-difference and finite-element methods are widely used, they often suffer from the “curse of dimensionality”, where computational costs increase exponentially with the number of spatial dimensions [2,3]. This scaling imposes a significant bottleneck on applications requiring high-resolution models in three or more dimensions, such as full-waveform inversion in geophysics [4,5] or detailed optical simulations [6].

Quantum computing presents a promising avenue for overcoming these limitations. By exploiting the inherent parallelism of quantum states, quantum algorithms can achieve exponential speedups for certain linear algebra tasks and differential equation solvers [7,8]. Recent advancements have made significant strides in creating quantum algorithms tailored for partial differential equations (PDEs), notably the wave equation. Previous studies have demonstrated the efficacy of quantum algorithms employing an oracle to probe the wave speed profile [9,10]. In contrast, different studies have concentrated on more practical applications concerning one-dimensional scenarios where the speed remains constant [11,12,13]. An important challenge remains in creating clear, non-oracle methods for high-dimensional wave equations with changing speed coefficients, which are vital for accurate physical modeling. A recent study introduced an effective oracle-based solution for this task [14], employing an oracle technique that, while scalable, necessitates more ancilla qubits and incurs further overhead.

Our approach addresses this gap and builds on prior methods for Pauli decomposition of sparse matrices [15,16], extending them to arbitrary multidimensional wave problems with general and block-structured variable speed profiles. We derive explicit expressions for the number of Pauli strings, the structure of mutually commuting sets, and the corresponding diagonalization circuits. Exploiting these structures, we provide upper bounds on the number of one- and two-qubit gates required for first-order and higher-order Trotter approximations. Furthermore, by incorporating block-structured velocity profiles—common in practical applications such as layered media [5]—we demonstrate substantial reductions in circuit complexity, highlighting the advantage of leveraging problem-specific structure.

To support the theoretical scaling, we offer numerical simulations for a 3D wave equation featuring block-structured speed profiles. The results illustrate the effectiveness of our decomposition method and confirm the predicted scaling of Pauli term counts. Our analysis shows that the proposed quantum algorithm alleviates the classical “curse of dimensionality”, providing a practical approach to modeling wave dynamics in high-dimensional spaces.

The article begins by outlining the general problem of the multidimensional wave equation as framed for quantum algorithms in Section 2, including essential definitions. The main results are detailed in Section 3, with Section 3.1 covering Pauli decomposition, Section 3.2 focusing on diagonalization, Section 3.3 addressing the scaling of the multidimensional problem, and Section 3.4 examining the block speed scenario. Numerical simulations for the three-dimensional wave equation are presented in Section 3.5, followed by a discussion in Section 4 and concluding remarks in Section 5. Details on discretization and vectorization are provided in Appendix A; Appendix B contains the example of quantum algorithm construction, while Appendix C compares the classical finite difference approach with a developed algorithm in case of standing wave in terms of number of operations. Finally proofs of propositions are in Appendix D.

2. Materials and Methods

The wave equation set in D dimensions, characterized by variable speed and adhering to zero boundary conditions, is expressed as follows

\begin{matrix} \frac{\partial^{2} u (t, \vec{x})}{\partial t^{2}} = \sum_{j = 1}^{D} \frac{\partial}{\partial x_{j}} (c^{2} (\vec{x}) \frac{\partial u (t, \vec{x})}{\partial x_{j}}), \\ u (t = 0, \vec{x}) = f (\vec{x}), \\ \frac{\partial u (t = 0, \vec{x})}{\partial t} = g (\vec{x}), \\ u (t, x_{j} = 0) = 0, j = 1, \dots, D, \\ u (t, x_{j} = l_{j}) = 0, j = 1, \dots, D . \end{matrix}

(1)

where

\vec{x} = (x_{1}, \dots, x_{D})

and

l_{j}

is the length of the corresponding dimension.

After discretization and vectorization denoted as

vec (\cdot)

(details are shown in Appendix A), we can write this in matrix–vector format as

\begin{matrix} \frac{\partial^{2}}{\partial t^{2}} vec (U (t)) = {\tilde{L}}_{D} (c (\vec{x})) vec (U (t)), \\ vec (U (t = 0)) = vec (F), \\ \frac{\partial}{\partial t} vec (U (t = 0)) = vec (G), \end{matrix}

(2)

where

U (t)

is the tensor for the function

u (t, \vec{x})

in a given domain, F is the tensor for the function

f (\vec{x})

in a given domain, and G is the tensor for the function

g (\vec{x})

in a given domain. The discrete operator is

{\tilde{L}}_{D} (c (\vec{x})) = - \sum_{j = 1}^{D} (I_{1} \otimes \dots \otimes B_{j} \otimes \dots I_{D}) S^{2} (I_{1} \otimes \dots \otimes B_{j}^{T} \otimes \dots I_{D}),

(3)

where S is the diagonal matrix with diagonal given by vectorized speed profile tensor C, which is discrete representation of

c (\vec{x})

in a given domain, that is,

S = diag (vec (C))

, and

B_{j}

is the first order forward first derivative approximation operator with explicit boundary conditions (first and last rows are zeros, as well as first and last columns). With this choice of

B_{j}

, the boundary conditions are incorporated into

{\tilde{L}}_{D}

.

2.1. Formulation as Schrödinger Equation

Following [9,16], consider the Schrödinger equation

i \partial_{t} ψ = H_{D} ψ

using the Hamiltonian

H_{D} = (\begin{matrix} 0 & {\tilde{B}}_{1} & {\tilde{B}}_{2} & {\tilde{B}}_{3} & \dots & {\tilde{B}}_{D} \\ {\tilde{B}}_{1}^{†} & 0 & 0 & 0 & \dots & 0 \\ {\tilde{B}}_{2}^{†} & 0 & 0 & 0 & \dots & 0 \\ {\tilde{B}}_{3}^{†} & 0 & 0 & 0 & \dots & 0 \\ \dots & \dots & \dots & \dots & \dots & \dots \\ {\tilde{B}}_{D}^{†} & 0 & 0 & 0 & \dots & 0 \end{matrix}),

(4)

where

{\tilde{B}}_{j} = I_{1} \otimes \dots \otimes B_{j} \otimes \dots \otimes I_{D}

, with

B_{j} \in R^{N \times N}

,

N = 2^{n}

, being the matrix approximation of the first derivative with incorporated zero boundary conditions in a manner consistent with [16]. It is important to mention that typically, the size of matrix

B_{j}

may vary across dimensions; however, for the present discussion, we assume it is the same. The matrices

\tilde{B}

have the dimensions

R^{N^{D} \times N^{D}}

, while the Hamiltonian

H_{D}

is sized

R^{(D + 1) N^{D} \times (D + 1) N^{D}}

. At present, we set

D + 1 = 2^{n_{D}}

; extending this setup is straightforward since

H_{D}

can be supplemented with zero matrices, which have no effect on the Pauli decomposition.

In order to incorporate a variable speed profile in this Hamiltonian, we consider

{\tilde{H}}_{D} = (\begin{matrix} 0 & {\tilde{B}}_{1} S & {\tilde{B}}_{2} S & {\tilde{B}}_{3} S & \dots & {\tilde{B}}_{D} S \\ S {\tilde{B}}_{1}^{†} & 0 & 0 & 0 & \dots & 0 \\ S {\tilde{B}}_{2}^{†} & 0 & 0 & 0 & \dots & 0 \\ S {\tilde{B}}_{3}^{†} & 0 & 0 & 0 & \dots & 0 \\ \dots & \dots & \dots & \dots & \dots & \dots \\ S {\tilde{B}}_{D}^{†} & 0 & 0 & 0 & \dots & 0 \end{matrix}) = \tilde{S} H_{D} \tilde{S}, \tilde{S} = diag (I, \underset{D}{\underset{︸}{S, \dots, S}}),

(5)

where

S = diag (vec (C))

, and I is the identity matrix of the same size.

Differentiating the Schrödinger equation over time, given the Hamiltonian in (5), we get

\frac{d^{2}}{d t^{2}} (\begin{matrix} vec (U_{V}) \\ vec (G_{1}) \\ ⋮ \\ vec (G_{D}) \end{matrix}) = - (\begin{matrix} - {\tilde{L}}_{D} & 0 & \dots & 0 \\ 0 & A_{11} & \dots & A_{1 D} \\ ⋮ & ⋮ & \dots & ⋮ \\ 0 & A_{D 1} & \dots & A_{D D} \end{matrix}) (\begin{matrix} vec (U_{V}) \\ vec (G_{1}) \\ ⋮ \\ vec (G_{D}) \end{matrix}),

(6)

where

{\tilde{L}}_{D} = - \sum_{j = 1}^{D} {\tilde{B}}_{j} S^{2} {\tilde{B}}_{j}^{†}

,

vec (G_{1}), \dots, vec (G_{D})

are some additional components in the wavefunction and

A_{i j} = S {\tilde{B}}_{i}^{†} {\tilde{B}}_{j} S

. Note that if matrix B is a finite difference approximation of the first derivative, then its negative transpose, and

- B^{†}

is also an approximation. Specifically,

- B^{†}

employs the mirror scheme: it yields a backward scheme if B uses a forward scheme, a forward scheme if B uses a backward scheme, and remains a central scheme if B is central. We can see now that the first component

vec (U_{V})

indeed evolves according to (2). One way to set the initial condition of the Schrödinger equation is

\begin{matrix} vec (U_{V} (t = 0)) = vec (F), \\ vec (G_{j} (t = 0)) = \frac{1}{D} i {({\tilde{B}}_{j} S)}^{- 1} vec (G), j = 1, \dots D . \end{matrix}

(7)

Putting this together, the quantum algorithm for solving (1) reduces to preparing and evolving the state

| ψ (t) 〉 = exp (- i t {\tilde{H}}_{D}) | ψ (0) 〉, | ψ (t) 〉 = (\begin{matrix} vec (U_{V}) (t) \\ vec (G_{1}) (t) \\ ⋮ \\ vec (G_{D}) (t) \end{matrix}) .

(8)

We then postselect on the first component of (

| ψ (t) 〉

). While postselection, initialization, and the design of a suitable cost function are all very challenging tasks, in this work we focus on implementing the propagator

exp (- i t {\tilde{H}}_{D})

using one- and two-qubit gates.

Remark 1.

If the right hand side of the wave Equation (1) is given by

c^{2} (\vec{x}) \sum_{j = 1}^{D} (\frac{\partial^{2} u (t, \vec{x})}{\partial x_{j}^{2}})

, one can use

\tilde{S} = diag (S, {\underset{︸}{I, \dots, I}}_{D})

and consider the first component in the wavefunction as

vec (U_{V}) = S^{- 1} vec (U_{W})

. This way the component

vec (U_{W})

changes over time according to

L_{D} = - S^{2} \sum_{j = 1}^{D} {\tilde{B}}_{j} {\tilde{B}}_{j}^{†}

, which is effectively a form of

c^{2} (\vec{x}) \sum_{j = 1}^{D} (\frac{\partial^{2} u (t, \vec{x})}{\partial x_{j}^{2}})

. Initial conditions in this case can be set as

vec (U_{V} (t = 0)) = S^{- 1} vec (F)

, and

vec (G_{j} (t = 0)) = \frac{1}{D} i {(S^{2} {\tilde{B}}_{j})}^{- 1} vec (G), j = 1, \dots, D

. This does not affect the results presented here since they are based on decompositions of S and

H_{D}

.

2.2. Propagator Implementation Technique

To execute the propagator

exp (- i t {\tilde{H}}_{D})

on a quantum device, it must be broken down into operations that can be performed on such a device. This can be achieved by expressing

{\tilde{H}}_{D}

as a sum of Pauli strings, which are tensor products of Pauli matrices, and then applying the Trotter formula [17,18]. For comprehensive definitions, the reader is directed to Section 2 of [15]; here, we will only present the key concepts.

To express Pauli strings with X and Z matrices, we use bit strings

z, x \in B^{n}

; with them, an arbitrary Pauli string can be defined as the image of the extended Pauli string operator (Walsh function)

\hat{W} : B^{n} \times B^{n} \to P_{n}

as follows

\hat{W} (x, z) = ı^{x \cdot z} X^{x} Z^{z} = ⨂_{j = 1}^{n} ı^{x_{j} z_{j}} X^{x_{j}} Z^{z_{j}},

(9)

with the matrix product between

X^{x}

and

Z^{z}

and

x \cdot z = \sum_{l = 1}^{n} x_{l} z_{l}

denoting the inner product of two bitstrings. It can be seen that the Walsh function is bijective. Thus, each Pauli string can be encoded with a unique pair

(x, z)

, and decomposition in the Pauli basis can be rewritten as

{\tilde{H}}_{D} = \frac{1}{2^{n}} \sum_{x, z \in B^{n}} β_{x, z} \hat{W} (x, z),

(10)

where

β_{x, z} \in C

, and as shown in the proof of Proposition 4 of [15], the coefficients can be expressed in the form

β_{x, z} = ı^{x \cdot z} \sum_{p \in B} {(- 1)}^{z \cdot p} \cdot h_{p, p \oplus x},

(11)

where ⊕ is a binary XOR (addition modulo 2) operation and

h_{p, p \oplus x}

are elements of

{\tilde{H}}_{D}

, which makes it possible to compute coefficients in decomposition with

O (n 2^{n})

operations.

As the strings in the decomposition do not all commute with each other, the Trotter formula can be utilized, offering different accuracy levels depending on the formula’s order. This approach is employed to manage the non-commutative characteristics of the operators involved in the decomposition. Building upon prior studies [15,16], we express

{\tilde{H}}_{D}

as

\sum_{γ = 1}^{Γ} H_{γ}

, where each

H_{γ}

consists of mutually commuting Pauli strings. The Trotter formulas for orders 1, 2, and higher even orders

p = 2 k

, where

k = 2, 3, 4, \dots

, are represented as

\begin{matrix} S_{1} (t) & = e^{- i t H_{Γ}} \dots e^{- i t H_{1}}, \end{matrix}

(12)

\begin{matrix} S_{2} (t) & = e^{\frac{- i t}{2} H_{1}} \dots e^{\frac{- i t}{2} H_{Γ}} e^{\frac{- i t}{2} H_{Γ}} \dots e^{\frac{- i t}{2} H_{1}}, \end{matrix}

(13)

\begin{matrix} S_{2 k} (t) & = S_{2 k - 2}^{2} (s_{k} t) S_{2 k - 2} ((1 - 4 s_{k}) t) S_{2 k - 2}^{2} (s_{k} t), \end{matrix}

(14)

where

s_{k} = 1 / (4 - 4^{1 / (2 k - 1)})

. The approximation accuracy is determined by the number of Trotterization steps r, the evolution time t, the order p of the Trotter formula, and the Hamiltonian norm.

3. Results

This section outlines our findings in the form of propositions and corollaries. We express the propagator for the multidimensional wave equation

exp (- i t {\tilde{H}}_{D})

using single and two qubit operations as follows:

We expand on the decomposition of Pauli strings as described in [16] to apply it to the wave equation in D dimensions (Proposition 1).
We present a method for efficiently diagonalizing mutually commuting groups that emerge during decomposition (Proposition 2).
We establish an upper bound and scaling on the complexity involved in solving the multidimensional wave equation with a variable speed profile using the Trotterization algorithm in terms of single- and two-qubit operations (Corollaries 2 and 3).
We examine the practical scenario of a wave equation in D dimensions, characterized by a variable speed profile with a block structure and establish its upper bound and scaling (Corollary 4).
We demonstrate the algorithm’s numerical performance on the three-dimensional wave equation with a block-structured speed profile (Section 3.5). We also compare the finite-difference method (FDM) with the quantum algorithm presented here for a three-dimensional standing wave, reporting the number of operations required to achieve the same accuracy (Appendix C).

3.1. Pauli String Decomposition

We extend the results presented in Proposition 1 from [16], particularly focusing on the case of a block Hamiltonian H with dimensions

(D + 1) N \times (D + 1) N

.

Proposition 1

(Decomposition of a block Hamiltonian). The only Pauli strings that can have non-zero coefficients in the decomposition of matrix

H = (\begin{matrix} 0 & B_{1} & B_{2} & B_{3} & \dots & B_{D} \\ B_{1}^{†} & 0 & 0 & 0 & \dots & 0 \\ B_{2}^{†} & 0 & 0 & 0 & \dots & 0 \\ B_{3}^{†} & 0 & 0 & 0 & \dots & 0 \\ \dots & \dots & \dots & \dots & \dots & \dots \\ B_{D}^{†} & 0 & 0 & 0 & \dots & 0 \end{matrix}),

(15)

where

B_{p} \in C^{N \times N}

,

p = 1, \dots, D

, are d-diagonal matrices with

N = 2^{n}

, described by

W (x_{p, k, j}, z)

, where

z \in B^{n + | \hat{B} (D) |}

and

x_{p, k, j} = {\hat{B}}_{l} (| \hat{B} (D) |, p) * {\hat{B}}_{l} (n - s, 2^{j} - 1) * {\hat{B}}_{l} (s, 2^{s} - k),

(16)

where

p \in {1, \dots, D}

,

k \in {1, \dots, d}

,

j \in {1, \dots, n - s}

,

s = ⌈ {log}_{2} (k) ⌉

,

d \leq 2^{n - 1}

. Moreover, the main diagonals of

B_{p}

are represented by

W (x_{p, 0}, z)

, with

x_{p, 0} = {\hat{B}}_{l} (| \hat{B} (D) |, p) * {\hat{B}}_{l} (n, 0),

(17)

where

p \in {1, \dots, D}

.

Here, function

| \hat{B} (D) |

returns the number of bits necessary to represent D in binary form. Meanwhile,

{\hat{B}}_{l} (a, b)

provides the binary form of b with a specified length a, padding with leading zeros if necessary. The symbol ∗ denotes the concatenation of binary strings.

It is noteworthy that if

D + 1

is not a power of 2, this proposition can still be utilized by appending matrices

B_{p} = 0

for

p = D + 1, \dots, D^{'}

, where

D^{'}

is defined as

2^{⌈ {log}_{2} (D) ⌉}

. The d-diagonal matrices referenced in the proposition allow for enhanced accuracy of differential operators. Specifically, the three-diagonal matrices with d = 1 (where d denotes the count of diagonals above the main diagonal) will represent the first-order accurate differential operators.

All Pauli strings with non-zero coefficients in the decomposition are part of the sets labeled by

x_{p, k, j}

and

x_{p, 0}

, as described below:

\begin{matrix} S_{p, k, j} & = {W (x_{p, k, j}, z) | z \in B^{n + | \hat{B} (D) |}}, \\ S_{p, 0} & = {W (x_{p, 0}, z) | z \in B^{n + | \hat{B} (D) |}} . \end{matrix}

(18)

In each set represented by

x_{p, k, j}

and

x_{0}

, the elements are generated with

z \in B^{n + | \hat{B} (D) |}

, which are the binary representation of numbers from 0 to

2^{n + | \hat{B} (D) |} - 1

of length

n + | \hat{B} (D) |

. Therefore, the cardinality of each set is

2^{n + | \hat{B} (D) |}

. Furthermore, if

D + 1

is a power of 2, the cardinality can also be expressed as

(D + 1) 2^{n}

. Utilizing this proposition alongside Proposition 2 from [16], we formulate the ensuing corollary.

Corollary 1

(Number of sets in a block Hamiltonian). The total count of sets

S_{p, k, j}

(including

S_{p, 0}

) in the decomposition of Hamiltonian (15) with a d-band matrices

B_{p} \in C^{N \times N}, p = 1, \dots, D

, where

N = 2^{n}

is given by

s (D, d, n) = D (2^{| \hat{B} (d) |} + [n - | \hat{B} (d) |] d),

(19)

where

| \hat{B} (d) |

is the binary length of d.

We group the sets from Equation (18) into mutually commuting subsets, a result that aligns with previous findings [15,16].The assignment to a subset is determined by whether a Pauli string contains an odd or even number of Y matrices (i.e., the value of

x \cdot z mod 2

), as stated in Corollary 2 of [16]. In the specific case of a real-valued matrix, all Pauli strings have an even number of Y matrices, which leads to

s (D, d, n)

mutually commuting sets.

3.2. Mutual Diagonalization

In Proposition 1 we have provided strings

x_{p, k, j}

, which generate mutually commuting sets based on parity of Y operators (value of

x \cdot z (\mod 2)

). We outline the diagonalization process for these sets, drawing upon [19,20], and present it as the subsequent proposition.

Proposition 2

(Diagonalization of sets). Given the set of mutually commuting operators, characterized by a string

x \in B^{n}

, that is,

{W (x, z) | z \in B^{n}, x \cdot z = 0 (\mod 2)},

or

{W (x, z) | z \in B^{n}, x \cdot z = 1 (\mod 2)},

one can construct diagonalization operator

\hat{D}

, such that

W (x, z) = {(- 1)}^{r} {\hat{D}}^{†} W (0, \tilde{z}) \hat{D}

, with

r = (\frac{x \cdot z + (x \cdot z mod 2)}{2})

as

\begin{matrix} \hat{D} & = H_{k_{1}} \prod_{j = 2}^{M} C N O T (k_{1}, k_{j}), i f x \cdot z = 0 (\mod 2), \\ \hat{D} & = H_{k_{1}} S_{k_{1}} \prod_{j = 2}^{M} C N O T (k_{1}, k_{j}), i f x \cdot z = 1 (\mod 2), \end{matrix}

(20)

where

C N O T (c, t)

is a controlled-NOT operation with control on the c-th qubit and target on the t-th qubit;

H_{k}

and

S_{k}

are Hadamard and phase gates acting on the k-th qubit, respectively. The product runs over all indices

k_{j}

of the nonzero bits in

x

, where M is the total number of these bits and

k_{1}

is the index of the first nonzero bit. This procedure yields the string

\tilde{z}

, which is identical to

z

except that its

k_{1}

-th bit is set to 1.

This proposition enables the diagonalization of any set of mutually commuting operators using only the corresponding string

x

. The number of one- and two-qubit gates required for this diagonalization is determined by the parity of

x \cdot z

: it is M if

x \cdot z = 0 mod 2

and

M + 1

if

x \cdot z = 1 mod 2

, where M is the number of nonzero bits (Hamming weight) in

x

. Additionally, the proposition provides the outcome of the diagonalization, that is, string

\tilde{z}

.

3.3. Scaling of Multidimensional Wave Equation Quantum Algorithm

Using general results presented in previous sections, we formulate the following proposition made specifically for the multidimensional wave equation.

Proposition 3

(Decomposition of multidimensional wave equation Hamiltonian). The Hamiltonian for the D-dimensional wave problem stated as (1) can be written as

{\tilde{H}}_{D} = (\begin{matrix} 0 & {\tilde{B}}_{1} S & {\tilde{B}}_{2} S & {\tilde{B}}_{3} S & \dots & {\tilde{B}}_{D} S \\ S {\tilde{B}}_{1}^{†} & 0 & 0 & 0 & \dots & 0 \\ S {\tilde{B}}_{2}^{†} & 0 & 0 & 0 & \dots & 0 \\ S {\tilde{B}}_{3}^{†} & 0 & 0 & 0 & \dots & 0 \\ \dots & \dots & \dots & \dots & \dots & \dots \\ S {\tilde{B}}_{D}^{†} & 0 & 0 & 0 & \dots & 0 \end{matrix}) = \tilde{S} H_{D} \tilde{S}, \tilde{S} = diag (I, \underset{D}{\underset{︸}{S, \dots, S}}),

(21)

where

{\tilde{B}}_{j} = I_{1} \otimes \dots \otimes B_{j} \otimes \dots I_{D}

with

B_{j} \in R^{N \times N}, j = 1, \dots, D

- d-diagonal matrices, with

N = 2^{n}

, and diagonal matrix

S \in R^{N^{D} \times N^{D}}

. Given Pauli decomposition

\tilde{S} = \sum_{j = 1}^{T} α_{j} Z^{z_{j}}

, where

Z^{z} = ⨂_{k = 1}^{D n + | \hat{B} (D) |} Z^{z_{k}}

, with

z_{k}

being the k-th bit in

z

, the number of Pauli strings that can have non-zero coefficients in the decomposition of matrix

{\tilde{H}}_{D}

is bounded by

g_{H} \leq Γ min (T^{2} K, (D + 1) N^{D} / 2)

, where

Γ

is the number of mutually commuting sets in

H_{D}

, and K is the number of Pauli strings in the single set. T is the number of Pauli strings in the decomposition of

\tilde{S}

.

The Hamiltonian

H_{D}

can be directly subjected to Proposition 1 and Corollary 1 with no alterations. The form of the Pauli strings in the decomposition remains essentially the same. To convert H into

H_{D}

, one simply needs to insert extra identity (I) matrices at the qubit locations that align with the structure of the

{\tilde{B}}_{j}

operators. In the binary representation of the Pauli strings using vectors

x

and

z

, this operation corresponds to adding zeros at the appropriate bit locations.

It is also observed that the number of mutually commuting sets in

{\tilde{H}}_{D}

is identical to that in

H_{D}

because

\tilde{S}

represents a diagonal matrix. According to Corollary 1, this count is denoted by

Γ = s (D, d, n)

, owing to the real-valued nature of

H_{D}

. If the number of diagonals d above the main is significantly less than N, a situation common with finite difference approximation matrices

B_{j}

, the number of commuting sets can be approximated by

Γ = O (D d n)

. The count of Pauli strings K contained within a single commuting set can be bounded by

K \leq 2^{n + | \hat{B} (D) | - 1} = (D + 1) N / 2

since only strings with an even number of Y Pauli matrices participate in the decomposition. Consequently, the overall number of Pauli strings that appear in

{\tilde{H}}_{D} = \tilde{S} H_{D} \tilde{S}

can be expressed as

O (D^{2} d n N min (T^{2}, N^{D - 1}))

, where T denotes the number of Pauli strings used in the decomposition of

\tilde{S}

. Furthermore, in cases where

T^{2} \leq N^{D - 1}

, the estimation may be modified to

g_{H} = O (D^{2} d n T^{2} N) .

(22)

From this result, we can estimate the number of one- and two-qubit gates required for a single Trotter step that realizes the time evolution operator

exp (- i {\tilde{H}}_{D} t)

. The construction of one Trotter step requires the diagonalization of each of the

s (D, d, n)

mutually commuting sets of Pauli strings. Recall that a Pauli string

x

can contain non-zero bits solely within the initial

| \hat{B} (D) |

places and the n locations specified by j in

{\tilde{B}}_{j} = I_{1} \otimes \dots \otimes B_{j} \otimes \dots I_{D}

. Based on Proposition 2 and given the condition

x \cdot z \equiv 0 (mod 2)

, the count of one- and two-qubit gates necessary for the diagonalization and reverse diagonalization of a single set is limited by

g_{d} \leq 2 (n + | \hat{B} (D) |) .

(23)

Diagonalized Pauli strings are composed solely of Z operators. These can be executed using multi-qubit rotations denoted by

R_{\tilde{z}} (θ) \equiv exp (- i \frac{θ}{2} Z^{\tilde{z}})

, where

\tilde{z}

represents the modified string

z

in each respective diagonal group. The sequence of these rotations is organized in such a way as to optimize their implementation by reducing the number of CNOT gates required, achieved through minimizing the Hamming distance between successive

\tilde{z}

strings [21] (see a circuit example for a single group in Appendix B). Without particular assumptions about the configuration of

\tilde{S}

, we employ a worst-case scenario in which every

\tilde{z}

sequence is composed entirely of ones, with no cancellations occurring. This provides an upper limit of

2 (D n + | \hat{B} (D) | - 1)

CNOT gates for each rotation since the maximum number of CNOTs needed for an m-qubit rotation is

2 (m - 1)

[21]. Hence, the total number of gates necessary for realizing all rotations within a single diagonal configuration is bounded by

g_{z} \leq 2 (D n + | \hat{B} (D) |) min (T^{2} K, (D + 1) N^{D} / 2)

.

By bringing these components together, the total number

g_{1}

of one- and two-qubit gates required for each single step of the first-order Trotter–Suzuki decomposition to approximate

exp (- i {\tilde{H}}_{D} t)

is constrained by

g_{1} \leq Γ \cdot (g_{d} + g_{z}),

(24)

where

$Γ = s (D, d, n) = D (2^{| \hat{B} (d) |} + (n - | \hat{B} (d) |) d)$ represents the number of sets of Pauli strings that internally mutually commute;
$g_{d} \leq 2 (n + | \hat{B} (D) |)$ is the gate count for diagonalization operators per set;
$g_{z} \leq (D n + | \hat{B} (D) |) (D + 1) N min (T^{2}, N^{D - 1})$ represents the gate count for implementing the diagonal rotations per set. We relied on the estimation $K \leq (D + 1) N / 2$ as suggested by Proposition 1.

Therefore, the explicit bound in case

T^{2} < N^{D - 1}

for one step of first order Trotter formula is

g_{1} \leq s (D, d, n) [2 (n + | \hat{B} (D) |) + (D n + | \hat{B} (D) |) N (D + 1) T^{2}] .

(25)

For higher-order Trotter–Suzuki formulas of even order p [17,18], the gate count scales as

g_{p} = 2 \cdot 5^{⌊ p / 2 ⌋ - 1} g_{1}, for p = 2, 4, 6, \dots

(26)

We formalize the scaling for a single Trotter step in the following corollary.

Corollary 2

(One Trotter step scaling). Given the Hamiltonian

{\tilde{H}}_{D} = (\begin{matrix} 0 & {\tilde{B}}_{1} S & {\tilde{B}}_{2} S & {\tilde{B}}_{3} S & \dots & {\tilde{B}}_{D} S \\ S {\tilde{B}}_{1}^{†} & 0 & 0 & 0 & \dots & 0 \\ S {\tilde{B}}_{2}^{†} & 0 & 0 & 0 & \dots & 0 \\ S {\tilde{B}}_{3}^{†} & 0 & 0 & 0 & \dots & 0 \\ \dots & \dots & \dots & \dots & \dots & \dots \\ S {\tilde{B}}_{D}^{†} & 0 & 0 & 0 & \dots & 0 \end{matrix}) = \tilde{S} H_{D} \tilde{S}, \tilde{S} = diag (I, \underset{D}{\underset{︸}{S, \dots, S}}),

(27)

where

{\tilde{B}}_{j} = I_{1} \otimes \dots \otimes B_{j} \otimes \dots I_{D}

with

B_{j} \in R^{N \times N}, j = 1, \dots, D

- d-diagonal matrices, with

N = 2^{n}

, and diagonal matrix

\tilde{S} = \sum_{j = 1}^{T} α_{j} Z^{z_{j}} \in R^{N^{D} \times N^{D}}

with T strings in decomposition. The number of one- and two-qubit gates for one step of first order Trotter formula for

exp (- i {\tilde{H}}_{D} t)

with assumptions

d ≪ N

and

T^{2} < N^{D - 1}

scales as

\begin{matrix} g_{1} = O (D^{3} d n^{2} N T^{2}) \\ g_{p} = O (5^{⌊ p / 2 ⌋} D^{3} d n^{2} N T^{2}), p = 2, 4, 6 \dots . \end{matrix}

(28)

This process can be made more efficient with a few methods. To start, the Hamming distance between neighboring bitstrings

\tilde{z}

should be reduced. This configuration aids in eliminating CNOT gates used for executing the multi-qubit

R_{\tilde{z}}

rotations. Secondly, an analogous improvement is realized by tactically reordering the sets, which facilitates the elimination of CNOT gates produced during the diagonalization process. Moreover, the arrangement of these sets constituting

\tilde{S}

can be utilized to lessen the number of gates, as illustrated in Section 3.4.

Finally, by employing Corollary 2, we shall illustrate the scaling of one- and two-qubit gates required for executing the propagator

exp (- i {\tilde{H}}_{D} t)

using the Trotter formula of order p. The scaling can be expressed as

g_{t r} = g_{p} r

, where

g_{p}

denotes the count of gates per trotter step of p-th order, and r represents the total number of steps needed to achieve an error of

ϵ = {exp (- i {\tilde{H}}_{D} t) - U (t, r, p)}_{2}

. In this context,

U (t, r, p)

serves as the Trotter approximation for

exp (- i {\tilde{H}}_{D} t)

. To determine r, we utilize the scaling outlined by the authors in [17], specifically

r = O ({(2 Γ 5^{⌊ p / 2 ⌋ - 1} {\tilde{H}}_{D} t)}^{1 + 1 / p} {(1 / ϵ)}^{1 / p}),

(29)

where

Γ

denotes the quantity of easy to implement Hamiltonians (in our case, we choose it to be mutually commuting sets) within

{\tilde{H}}_{D}

. We can approximate the Hamiltonian norm by

{\tilde{H}}_{D} = O (d N)

because the matrices

B_{j}

in

{\tilde{H}}_{D}

function as finite difference operators, which inherently contain the inverse of the discretization step size

1 / h = O (N)

. The subsequent corollary outlines the scaling requirements essential for constructing a quantum circuit that sets up the propagator for the D-dimensional wave equation, denoted as

exp (- i {\tilde{H}}_{D} t)

, with a margin of error

ϵ

.

Corollary 3

(Scaling for multidimensional wave equation quantum algorithm). Given the Hamiltonian for D-dimensional wave equation as

{\tilde{H}}_{D} = (\begin{matrix} 0 & {\tilde{B}}_{1} S & {\tilde{B}}_{2} S & {\tilde{B}}_{3} S & \dots & {\tilde{B}}_{D} S \\ S {\tilde{B}}_{1}^{†} & 0 & 0 & 0 & \dots & 0 \\ S {\tilde{B}}_{2}^{†} & 0 & 0 & 0 & \dots & 0 \\ S {\tilde{B}}_{3}^{†} & 0 & 0 & 0 & \dots & 0 \\ \dots & \dots & \dots & \dots & \dots & \dots \\ S {\tilde{B}}_{D}^{†} & 0 & 0 & 0 & \dots & 0 \end{matrix}) = \tilde{S} H_{D} \tilde{S}, \tilde{S} = diag (I, \underset{D}{\underset{︸}{S, \dots, S}}),

(30)

where

{\tilde{B}}_{j} = I_{1} \otimes \dots \otimes B_{j} \otimes \dots I_{D}

with

B_{j} \in R^{N \times N}, j = 1, \dots, D

- d-diagonal matrices, with

N = 2^{n}

, and diagonal matrix

\tilde{S} = \sum_{j = 1}^{T} α_{j} Z^{z_{j}} \in R^{N^{D} \times N^{D}}

with T strings in decomposition, the number of one- and two-qubit gates required to implement a p-th order Trotter approximation

U (t, r, p)

with r steps, satisfying

{e^{- i {\tilde{H}}_{D} t} - U (t, r, p)}_{2} \leq ϵ

scales as

g_{t r} = O (t 5^{p} D^{4} d^{3} n^{3} N^{2} T^{2} {(\frac{2}{5} D d^{2} n N t / ϵ)}^{1 / p}),

(31)

assuming

d ≪ N

and

T^{2} < N^{D - 1}

.

The suggested approach mitigates the “curse of dimensionality” in a manner similar to the oracle-based method discussed in [9]. According to Corollary 2, the gate complexity for one Trotter step predominantly scales with the

N T^{2}

factor, which varies linearly with the system size, where

N = 2^{n}

. Nevertheless, the total complexity is primarily determined by the number of steps needed to reach the desired precision. When utilizing a p-th order Trotter formula, the dominant scaling factor is

N^{2 + 1 / p} T^{2}

. The extra scaling related to N stems from the finite difference approximation, which requires a step size that is contingent upon N.

3.4. Scaling of Multidimensional Wave Equation Quantum Algorithm with Block-Model Speed Profile

In this section, we adopt a block-model representation in which the domain is partitioned into regions of constant wave speed (acoustic approximation). This assumption is widely used because it simplifies the forward problem, provides clear correspondence between model parameters and geological interfaces, and enables efficient numerical implementation. However, it should be noted that real Earth materials rarely consist of perfectly homogeneous blocks. In practice, velocity and density typically vary smoothly due to compaction, mineral composition, and fluid content, and fine-scale heterogeneity introduces scattering and waveform complexity that are not captured in block models [22]. As a result, the block approximation can overestimate reflection amplitudes at sharp interfaces and underestimate energy redistribution into scattered or coda waves. Nevertheless, when seismic wavelengths are large relative to heterogeneity scales, block models offer a reasonable first-order description of wave kinematics, particularly for traveltime and phase analyses.

Consider a computational domain of size

2^{n_{x}} \times 2^{n_{y}} \times 2^{n_{z}}

, discretized uniformly into

2^{m_{x}}

,

2^{m_{y}}

, and

2^{m_{z}}

segments along the x, y, and z axes, respectively. The total number of distinct material blocks, and thus the number of independent variables in the system, is given by

T = 2^{m_{x} + m_{y} + m_{z}}

. Moreover, the vectorized velocity profile

S = diag (vec (C))

, with C being a tensor representation of

c (\vec{x})

, can be expressed as a linear combination of Pauli operators as follows:

S = \sum_{\begin{matrix} z_{x} \in B^{m_{x}}, \\ z_{y} \in B^{m_{y}}, \\ z_{z} \in B^{m_{z}} \end{matrix}} α_{z_{x}, z_{y}, z_{z}} \underset{n_{x}}{\underset{︸}{Z^{z_{x}} \otimes I^{\otimes n_{x} - m_{x}}}} \otimes \underset{n_{y}}{\underset{︸}{Z^{z_{y}} \otimes I^{\otimes n_{y} - m_{y}}}} \otimes \underset{n_{z}}{\underset{︸}{Z^{z_{z}} \otimes I^{\otimes n_{z} - m_{z}}}},

(32)

where

Z^{z} = ⨂_{k = 1} Z^{z_{k}}

, with

z_{k}

being the k-th bit in

z

. This formulation demonstrates that the operator S is decomposed into a sum of T Pauli terms. Each term corresponds to a diagonal Pauli string (composed of I and Z operators) acting on the

m = m_{x} + m_{y} + m_{z}

qubits that encode the model parameterization, while identity operators act on the remaining

(n_{x} + n_{y} + n_{z} - m)

qubits that correspond to a size of block in particular direction.

The number of terms in this decomposition is precisely equal to the number of variables, T, and the generalization to the multidimensional case is straightforward. To estimate the gate complexity of implementing the operator

\tilde{S}

, one must account for an ancillary register of size

⌈ {log}_{2} (D + 1) ⌉ = | \hat{B} (D) |

qubits. Crucially, the block structure of S given by Equation (32) provides a significant reduction in computational complexity. We leverage this by reformulating Corollaries 2 and 3 for the block-model speed profile.

Corollary 4

(Scaling for block-model speed profile). Given the Hamiltonian

{\tilde{H}}_{D} = (\begin{matrix} 0 & {\tilde{B}}_{1} S & {\tilde{B}}_{2} S & {\tilde{B}}_{3} S & \dots & {\tilde{B}}_{D} S \\ S {\tilde{B}}_{1}^{†} & 0 & 0 & 0 & \dots & 0 \\ S {\tilde{B}}_{2}^{†} & 0 & 0 & 0 & \dots & 0 \\ S {\tilde{B}}_{3}^{†} & 0 & 0 & 0 & \dots & 0 \\ \dots & \dots & \dots & \dots & \dots & \dots \\ S {\tilde{B}}_{D}^{†} & 0 & 0 & 0 & \dots & 0 \end{matrix}) = \tilde{S} H_{D} \tilde{S}, \tilde{S} = diag (I, \underset{D}{\underset{︸}{S, \dots, S}}),

(33)

where

{\tilde{B}}_{j} = I_{1} \otimes \dots \otimes B_{j} \otimes \dots I_{D}

with

B_{j} \in R^{N \times N}, j = 1, \dots, D

- d-diagonal matrices, with

N = 2^{n}

, and a real diagonal matrix S, given as

S = \sum_{z_{j} \in B^{m}, j = 1, \dots, D} α_{z_{j}, \dots, z_{D}} \underset{n}{\underset{︸}{Z^{z_{1}} \otimes I^{\otimes n - m}}} \otimes \dots \otimes \underset{n}{\underset{︸}{Z^{z_{D}} \otimes I^{\otimes n - m}}} .

(34)

The number of one- and two-qubit gates for one step of first order Trotter formula for

exp (- i {\tilde{H}}_{D} t)

with assumption

d ≪ N

scales as

\begin{matrix} g_{1} = O (D^{2} d n N 2^{(D - 1) m}) \\ g_{p} = O (5^{⌊ p / 2 ⌋} D^{2} d n N 2^{(D - 1) m}), p = 2, 4, 6 \dots . \end{matrix}

(35)

Consequently, under the same assumptions the number of one- and two-qubit gates required to implement a p-th order Trotter approximation

U (t, r, p)

with r steps, satisfying

{e^{- i {\tilde{H}}_{D} t} - U (t, r, p)}_{2} \leq ϵ

, scales as

g_{t r} = O (t 5^{p} D^{3} d^{3} n^{2} N^{2} 2^{(D - 1) m} {(\frac{2}{5} D d^{2} n N t / ϵ)}^{1 / p}) .

(36)

The manner in which the system scales is dictated by the configuration of the resulting sets of Pauli operators. According to Proposition 1, one can systematically generate the entire set of possible bit strings

x

.

Given a binary string

x

, the strings

z

are formulated in the following manner:

In the case where $m = 0$ , the non-zero elements of the $z$ strings are constrained to the positions directly specified by the structure of the matrix ${\tilde{B}}_{j}$ . For example, given ${\tilde{B}}_{1} = B \otimes I \otimes \dots \otimes I$ , the Pauli strings associated with ${\tilde{B}}_{1}$ are the same as in decomposition of B but are padded with identity operators (I). This corresponds to appending zeros to both the $x$ and $z$ strings.
For a general parameter $m \geq 0$ , if the velocity profile adheres to the structure defined in Equation (34), the number of positions in the $z$ strings that can support non-zero values increases. Specifically, an additional $D m$ bits must be considered. These correspond to the first m bits in each of the D spatial directions within the $z$ string.

Therefore, the number of Pauli strings in a single set can be bounded as

K^{'} \leq 2^{n + | \hat{B} (D) | + (D - 1) m - 1}

, where the term

(D - 1) m

appears because, for a single dimension in the considered set, all bits are already accounted for in the implementation of B. Moreover, in this scenario, Gray code ordering can be applied within each mutually commuting set, allowing the elimination of certain CNOT gates and thus reducing the number of gates required to implement rotations for a single group to

g_{z} \leq 2 K^{'}

.

To illustrate this we present the full set of strings

x

and corresponding masks for

z

as described above for a three-dimensional wave equation—discretized with 2 qubits per dimension and with finite difference operators approximated by tridiagonal matrices (

d = 1

) in Table 1 for different values of m.

The number of

z

strings that must be considered can be reduced by leveraging the decomposition of the specific matrix

H_{D}

. However, in this work, we consider an arbitrary diagonal structure as a more general approach that enables the explicit encoding of boundary conditions, as demonstrated in [16].

3.5. A Three-Dimensional Wave Equation in a Numerical Example

As a practical example of our approach, we conducted a numerical simulation focusing on the three-dimensional wave Equation (

D = 3

). A finite-difference scheme is employed to discretize the problem on a regular grid, with each spatial dimension depicted by n qubits, leading to

N = 2^{n}

grid points for each dimension. The tridiagonal (

d = 1

) matrices provide an approximation for the finite-difference operators, and we utilize the block speed model described in Equation (32). The task at hand involves computing the implementation of the propagator

exp (- i t {\tilde{H}}_{3})

, where the Hamiltonian

{\tilde{H}}_{3}

is defined as

{\tilde{H}}_{3} = (\begin{matrix} 0 & {\tilde{B}}_{x} S & {\tilde{B}}_{y} S & {\tilde{B}}_{z} S \\ S {\tilde{B}}_{x}^{T} & 0 & 0 & 0 \\ S {\tilde{B}}_{y}^{T} & 0 & 0 & 0 \\ S {\tilde{B}}_{z}^{T} & 0 & 0 & 0 \end{matrix}) = \tilde{S} H_{3} \tilde{S}, \tilde{S} = (\begin{matrix} I & 0 & 0 & 0 \\ 0 & S & 0 & 0 \\ 0 & 0 & S & 0 \\ 0 & 0 & 0 & S \end{matrix}),

(37)

with

{\tilde{B}}_{x} = B \otimes I \otimes I

,

{\tilde{B}}_{y} = I \otimes B \otimes I

,

{\tilde{B}}_{z} = I \otimes I \otimes B

. Here,

B, I \in R^{N \times N}

, and B is a tridiagonal matrix incorporating zero boundary conditions [15].

Direct classical simulation of

exp (- i t {\tilde{H}}_{3})

is computationally hard. For instance, a discretization with

n = 6

qubits (

N = 64

points) per dimension requires exponentiating a matrix of size

4 N^{3} = 2^{3 n + 2} = 2^{20} \approx 10^{6}

. For realistic scenarios requiring high resolution (e.g.,

n \geq 10

, or

N \geq 1000

points per dimension), direct classical computation of the propagator becomes intractable.

For this physically meaningful resolutions—e.g.,

n \geq 10

so

N = 2^{10} \approx 10^{3}

grid points per axis—the state dimension is

4 N^{3} \sim 4 \times 10^{9}

, and even a single sparse matvec becomes impractical on commodity hardware. These constraints motivate our validation approach: rather than attempting end-to-end classical time evolution at large sizes, we (i) normalize the geometry and evolution time (

l_{x} = l_{y} = l_{z} = 1

,

t = 1

) to isolate algorithmic—not unit—scaling; (ii) adopt a randomized block speed field on a

2^{m} \times 2^{m} \times 2^{m}

partition so that

{\tilde{H}}_{3}

is representative (avoiding degenerate cases that would understate Trotter costs); and (iii) evaluate quantities that directly determine quantum resource scaling—namely, the number of Pauli terms and the gate count per first-order Trotter step—without reconstructing the full propagator. Where feasible (e.g.,

n \leq 5

), we still perform a functional check against a classical reference using expm_multiply, but the principal evidence comes from structural counts that (by Propositions 3 and Corollary 4) control the asymptotic behavior. This strategy preserves fidelity to the “hard part” of the problem—captured by the term structure and

{\tilde{H}}_{3}

—while remaining computationally tractable across the range of

(n, m)

needed to reveal the predicted scaling trends.

Therefore, to efficiently yet rigorously examine the scaling characteristics of our quantum algorithm, we employ a representative parameterization that encapsulates the essential complexity:

Time of evolution $t = 1$ .
Domain lengths $l_{x} = l_{y} = l_{z} = 1$ .
A block speed model with random elements from the range $(0, 1]$ , encoded with an equal number of qubits m per direction.
An equal number of qubits per dimension n.

For the Hamiltonian

{\tilde{H}}_{3}

, the maximum possible number of gates (Proposition 3) is represented by

g_{H} \leq Γ K^{'}

, where

Γ

is the number of sets containing mutually commuting Pauli strings (Equation (19)), and

K^{'}

is the number of Pauli strings per set. In the block speed model (Section 3.4), we find that

K^{'} \leq (D + 1) 2^{n + 2 m - 1} = 2^{n + 2 m + 1} = 2 N 4^{m}

. Thus, the limit turns into

g_{H} \leq 3 (n + 1) 2^{n + 2 m + 1} = 6 (n + 1) N 4^{m} .

(38)

Figure 1 illustrates the findings of the numerical simulation. We expressed the Hamiltonian

{\tilde{H}}_{3}

as a sum of Pauli strings and identified how many of these terms had non-zero coefficients. This decomposition was achieved by forming terms as outlined in Proposition 1 and evaluating their respective coefficients. For systems with

n < 6

qubits per dimension, we fully reconstructed the Hamiltonian from its Pauli decomposition and verified its equivalence to the original matrix to validate our method. For

n \geq 6

, direct reconstruction becomes computationally prohibitive; therefore, we relied solely on the coefficient computation routine without full matrix reconstruction. The plot displays the counts of the resulting Pauli terms as crosses, and the theoretical upper limit from Equation (38) is represented by a dotted line.

The empirical results reveal that the calculated upper bound closely estimates the actual count of Pauli strings. This difference arises from the distinctive configuration of the finite-difference matrix B used in our simulation. While the theoretical limit is formulated for any tridiagonal matrix, our approach uses a typical first-order stencil with matrix B having

- 1

on the main diagonal and 1 on the upper diagonal. This specific arrangement results in a more efficient Pauli decomposition, needing about

2^{n + 1}

strings for each operator, as opposed to the general maximum of

(n + 1) 2^{n - 1}

.

The number of gates required for a single iteration of a p-th order Trotterization is based on the first-order equation as presented in Equation (26). Consequently, our analysis prioritizes the numerical validation of the first-order estimate. For the Hamiltonian

{\tilde{H}}_{3}

, Equation (24) provides the upper limit on the gate count for one step of a first-order Trotterization, expressed as

g_{1} \leq Γ (g_{d} + g_{z})

. Here,

Γ

denotes the number of sets containing commuting Pauli strings,

g_{d}

represents the gate count required to diagonalize an individual set, and

g_{z}

signifies the gate count to execute the diagonal rotations in a diagonalized set.

For the block speed model, it is possible to utilize Gray code ordering within each mutually commuting set. This approach allows for further elimination of CNOT gates, thereby reducing the limit to

g_{z} \leq 2 K^{'}

. Previously, in Corollary 4 it was established that the block speed model restricts the number of Pauli strings per set by

K^{'} \leq 2 N 4^{m}

. Thus, the limit on the number of one- and two-qubit gates for a first-order step is

g_{1} \leq 3 (n + 1) [2 (n + 2) + 4 N 4^{m}] = 6 (n + 1) [n + 2 + 2 N 4^{m}] .

(39)

Figure 2 illustrates the results of the numerical simulation, showing how the gate counts vary with the number of qubits per dimension. We created a software program designed to produce a circuit corresponding to one Trotter step and for each collection of commuting Pauli strings. Each group’s circuit is output in QASM format for later integration into several Trotter steps. Following this, the overall gate count was ascertained from these output files. In systems where the qubit count is small, specifically

n \leq 4

, we confirmed that the circuit constructed for a single group of mutually commuting operators accurately realizes

exp (- i t H_{γ})

. Here,

H_{γ}

represents a Hamiltonian made up exclusively of Pauli strings from that group.

In Figure 2, the experimental results, indicated by crosses, are slightly beneath the theoretical limit outlined in Equation (39), represented by the dotted line. This is attributed to circuit optimization techniques applied during compilation. The CNOT gates in the diagonalization circuits of each set can cancel each other out if placed in a specific order, providing a slight optimization benefit. However, the main impact comes from performing the diagonal component.

Additionally, to validate the functional correctness of the algorithm, we examined a specific instance with fixed parameters

n = 5

and

m = 2

. This corresponds to solving the wave equation on a

32 \times 32 \times 32

grid (

N = 2^{5} = 32

), partitioned into

2^{m} = 4

velocity blocks per dimension, resulting in a total of 64 individual blocks, each of size

8 \times 8 \times 8

and assigned a unique velocity value. For this configuration, we verified that the implemented quantum circuit accurately approximates the action of the target unitary

exp (- i {\tilde{H}}_{3} t)

.

We assessed the accuracy by setting the initial condition to a random wavefunction

| ψ (0) 〉

and comparing the statevector produced by our algorithm against a classical reference computed using the “scipy.sparse.linalg.expm_multiply” function in Python. Specifically, we compute the error

ϵ = {| | U (t, r, p) | ψ (0) 〉 - exp (- i t {\tilde{H}}_{3}) | ψ (0) 〉}_{2} | |,

(40)

where

U (t, r, p) | ψ (0) 〉

is the state obtained from our quantum algorithm using r Trotter steps of order p.

Figure 3 illustrates how the error varies with both the quantity of Trotter steps and the Trotter order. As anticipated, the error diminishes when the number of steps or the formula’s order is raised. This outcome confirms our implementation and offers a pragmatic calculation of the Trotter steps needed to reach a specified error threshold.

Conducting a more thorough validation requires calculating the operator norm error

{U (t, r, p) - exp (- i t {\tilde{H}}_{3})}_{2}

and assessing it against the bound in Corollary 4. Nevertheless, this calculation turns out to be computationally infeasible using classical methods for the matrix dimensions we are dealing with, even when n and m are relatively small.

4. Discussion

We introduced a structured quantum algorithm for simulating the multidimensional variable-coefficient wave equation. The method begins with a Hamiltonian formulation and a decomposition into Pauli strings—tensor products of

I, X, Y, Z

. This representation affords a clear physical interpretation: Z terms encode diagonal energy shifts, whereas X and Y terms generate off-diagonal transitions between basis states.

Our main contribution is a systematic extension of bounds on only relevant Pauli strings and their generation for Hamiltonians with a special block structure, together with a partition of these strings into commuting sets that admit efficient diagonalization (Propositions 1 and 2). This customizes and generalizes prior decomposition strategies [15,16] to the setting of variable-coefficient, multi-dimensional wave equations.

A second contribution is the establishment of precise gate complexity limits for calculating time evolution operators expressed as

exp (- i {\tilde{H}}_{D} t)

. Our analysis indicates that, given assumptions applicable to physical models, the gate complexity for a single Trotter step increases linearly with the size of one dimension N (Corollary 2). Nevertheless, employing Trotter–Suzuki formulas adds extra overhead: the gate complexity for the complete algorithm utilizing a p-th order Trotter formula scales as

N^{2 + 1 / p}

with respect to the size of a single dimension (Corollary 3). This result is significant because it tackles the challenge posed by the “curse of dimensionality”, which hinders the functionality of conventional wave equation solvers that typically scale as

N^{D}

[1,2].

In contrast to oracle-based strategies [9] (with a 1D implementation in [12]), our algorithm specifies deterministic circuits in terms of one- and two-qubit gates. Although oracle-based methods may exhibit superior asymptotic query complexity, their constant factors and ancilla requirements can be substantial in regimes relevant to near-term problems [12]. Consequently, despite worse asymptotics, our explicit construction is often more practical at modest scales and enables precise resource estimation. Related oracle-dependent works [10,14] demonstrate scalability but at the cost of extra ancillas. For constant-speed models, ref [13] employs the Bell-basis approach to efficiently diagonalize each term of the Hamiltonian, yielding the same diagonalization as our method, while [11] demonstrates hardware experiments using Fourier transforms; however, these techniques do not directly extend to variable-speed profiles. In summary, relative to these lines of work, our algorithm provides explicit higher-dimensional resource bounds while accommodating variable coefficients.

For block-structured speed profiles, our analysis (Corollary 4) shows that respecting physical structure can materially reduce cost: the complexity simplifies to

2^{(D - 1) m}

when m qubits encode each block parameter (for a total of

2^{D m}

distinct speeds). Such piecewise-homogeneous models are common in seismic and acoustic imaging [5], making this specialization particularly relevant.

Three-dimensional simulations corroborate the predicted trends and indicate that rotation scheduling within each diagonal group can further reduce cost [21]. These optimizations are compatible with our commuting-set framework and offer practical gains without altering asymptotics.

Our use of Trotter–Suzuki decomposition entails the usual trade-off between accuracy and depth. Techniques such as qubitization [23] and linear combinations of unitaries [24] may improve asymptotic performance but generally require additional ancillas and more elaborate control logic. Moreover, our present estimates omit measurement overhead, cost-function construction, noise, and limited connectivity, all of which are consequential in the NISQ regime [25,26]. Closing these gaps is a necessary step toward end-to-end application benchmarks.

Beyond PDEs, the proposed Pauli decomposition and commuting-set diagonalization extend naturally to Hamiltonians with short-range interactions, where matrix elements cluster near the diagonal. Reducing non-commuting terms directly lowers measurement cost in variational and hybrid schemes [27,28]. The framework is also complementary to advances in randomized simulation [29], qubitization [23], and Schrödingerization [30]. We anticipate fruitful combinations that retain the structural advantages of our decomposition while improving asymptotic scaling.

Overall, the results advance quantum simulation of the wave equation in higher dimensions with variable coefficients, delivering explicit gate counts, deterministic circuit structure, and a pathway to exploiting block heterogeneity. We expect the practical advantage over oracle-based techniques observed in lower-dimensional, structured instances [15] to persist in 3D, though a like-for-like quantitative comparison remains open due to the absence of clear oracle constructions in this setting. Future work will integrate advanced simulation primitives and incorporate hardware-aware compilation and error mitigation to translate the present complexity gains into experimentally validated performance.

5. Conclusions

We have developed a general framework for quantum simulation of the multidimensional wave equation using Pauli string decomposition, diagonalization of commuting operator sets, and Trotterized time evolution. The method provides rigorous gate-complexity bounds and shows favorable scaling, particularly for the block speed model. Numerical validation in three dimensions confirms both the theoretical predictions and the potential for additional reductions through circuit-level optimization.

These results demonstrate that quantum computing can offer practical advantages for simulating high-dimensional wave dynamics, a class of problems that remain computationally demanding for classical solvers. Future directions include integrating qubitization and other advanced simulation techniques as well as designing hardware-aware circuit optimizations. The presented approach establishes a foundation for scalable quantum algorithms with applications in geophysics, nondestructive testing, and materials science.

Author Contributions

Conceptualization, B.A. and I.Z.; software, B.A.; validation, B.A. and I.Z.; formal analysis, B.A.; writing, review and editing, B.A. and I.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The original data presented in the study are openly available in GitHub repository at https://github.com/barseniev/d_dim_quantum_wave_solver (accessed on 9 October 2025).

Acknowledgments

The authors acknowledge the use of Skoltech’s Zhores supercomputer [31] for obtaining the numerical results presented in this paper.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

PDE	Partial differential equation
FDM	Finite difference method

Appendix A. Discretization and Vectorization Conventions

This appendix details the spatial discretization scheme and vectorization procedure employed to transform the continuous three dimensional wave equation, Equation (1) (

D = 3

), into a discrete system suitable for numerical computation and quantum algorithm construction. While the focus is on three dimensions, the methodology generalizes straightforwardly to other cases.

The continuous scalar field

u (t, x, y, z)

is discretized onto a structured grid with

N_{x}

,

N_{y}

, and

N_{z}

points in the x, y, and z directions, respectively. We adopt the convention where the discrete solution is stored as a three-dimensional tensor

U (t) \in R^{N_{z} \times N_{y} \times N_{x}}

, such that its elements are defined by

u_{i, j, k} (t) \equiv u (t, x_{k}, y_{j}, z_{i}),

where the tensor indices

(i, j, k)

correspond to the spatial coordinates

(z, y, x)

. This specific index ordering is chosen to yield a canonical and efficient structure for the resulting discrete differential operators. A schematic representation of this data structure is provided in Figure A1.

Figure A1. Schematic example of a three dimensional data tensor

U \in R^{Z, Y, X}

, i.e., storing points corresponding to different coordinates as an array. Thus, an element of the array

u_{i, j, k}

corresponds to a point with coordinates

u (x_{k}, y_{j}, z_{i})

. For discretization, 4 points per direction are used.

Figure A1. Schematic example of a three dimensional data tensor

U \in R^{Z, Y, X}

, i.e., storing points corresponding to different coordinates as an array. Thus, an element of the array

u_{i, j, k}

corresponds to a point with coordinates

u (x_{k}, y_{j}, z_{i})

. For discretization, 4 points per direction are used.

To interface with standard numerical linear algebra routines, which typically operate on vectors, the tensor U is transformed into a state vector via a vectorization operation, denoted

vec (U)

. We adopt the Fortran-style (column-major) vectorization convention. This process maps the multi-dimensional array into a column vector

vec (U (t)) \in R^{N_{x} N_{y} N_{z}}

. The sequence of this mapping is illustrated schematically in Figure A2.

This chosen discretization and vectorization ordering induces a highly structured form for the discrete Laplacian operator. The right-hand side of Equation (1) is approximated as

(\frac{\partial^{2} u}{\partial x^{2}} + \frac{\partial^{2} u}{\partial y^{2}} + \frac{\partial^{2} u}{\partial z^{2}}) \approx L_{3} vec (U),

(A1)

where the discrete 3D Laplacian

L_{3 d}

is given by the sum of Kronecker products:

L_{3} = L_{x} \otimes I_{y} \otimes I_{z} + I_{x} \otimes L_{y} \otimes I_{z} + I_{x} \otimes I_{y} \otimes L_{z} .

(A2)

Here,

L_{x}, L_{y}, L_{z}

are matrix representations of one-dimensional second-derivative operators (e.g., from a finite-difference method) incorporating the desired boundary conditions, and

I_{x}, I_{y}, I_{z}

are identity matrices of appropriate sizes. Crucially, the order of the Kronecker product factors (x, y, z) is a direct consequence of the chosen vectorization convention, ensuring consistent and efficient application of the operator.

Figure A2. Illustration of the vectorization process for the tensor U from Figure A1. The red dashed lines indicate the order in which elements are sequenced into the resulting column vector (from top to bottom).

Consequently, the discrete form of the wave equation initial value problem, incorporating a spatially varying wave speed

c (\vec{x})

, is expressed as the following system of ordinary differential equations:

\begin{matrix} \frac{\partial^{2}}{\partial t^{2}} vec (U (t)) = L_{3} (c (\vec{x})) vec (U (t)), \\ vec (U (0)) = vec (F), \\ \frac{\partial}{\partial t} vec (U (0)) = vec (G), \end{matrix}

(A3)

where

vec (U (t))

is the state vector, and F and G are the tensor representations of the initial conditions

f (\vec{x})

and

g (\vec{x})

, respectively. The dependence of the operator

L_{3}

on the wave speed profile

c (\vec{x})

is implied.

Appendix B. Decomposition Example

This section provides a detailed, step-by-step example of the proposed decomposition method. We consider the three-dimensional (

D = 3

) wave equation with a block velocity profile:

\begin{matrix} \frac{\partial^{2} u (t, \vec{x})}{\partial t^{2}} = \frac{\partial}{\partial x} (c^{2} (\vec{x}) \frac{\partial u (t, \vec{x})}{\partial x}) + \frac{\partial}{\partial y} (c^{2} (\vec{x}) \frac{\partial u (t, \vec{x})}{\partial y}) + \frac{\partial}{\partial z} (c^{2} (\vec{x}) \frac{\partial u (t, \vec{x})}{\partial z}), \\ u (t = 0, \vec{x}) = f (\vec{x}), \\ \frac{\partial u (t = 0, \vec{x})}{\partial t} = g (\vec{x}), \\ u (t, x = 0) = u (t, y = 0) = u (t, z = 0) = 0, \\ u (t, x = l_{x}) = u (t, y = l_{y}) = u (t, z = l_{z}) = 0 . \end{matrix}

(A4)

The velocity profile is defined as

c (x, y, z) = \{\begin{matrix} c_{1} & if z \leq l_{z} / 2, \\ c_{2} & if z > l_{z} / 2 . \end{matrix}

(A5)

This equation can be transformed into the Schrödinger form,

i \partial_{t} ψ = \tilde{H} 3 ψ

, with the Hamiltonian:

{\tilde{H}}_{3} = (\begin{matrix} 0 & {\tilde{B}}_{x} S & {\tilde{B}}_{y} S & {\tilde{B}}_{z} S \\ S {\tilde{B}}_{x}^{T} & 0 & 0 & 0 \\ S {\tilde{B}}_{y}^{T} & 0 & 0 & 0 \\ S {\tilde{B}}_{z}^{T} & 0 & 0 & 0 \end{matrix}) = \tilde{S} H_{3} \tilde{S}, \tilde{S} = (\begin{matrix} I & 0 & 0 & 0 \\ 0 & S & 0 & 0 \\ 0 & 0 & S & 0 \\ 0 & 0 & 0 & S \end{matrix}),

(A6)

where

{\tilde{B}}_{x} = B \otimes I \otimes I

,

{\tilde{B}}_{y} = I \otimes B \otimes I

, and

{\tilde{B}}_{z} = I \otimes I \otimes B

. The matrices

B, I \in R^{N \times N}

, with

N = 2^{n}

, arise from discretizing each spatial dimension, and B is a tridiagonal matrix incorporating the zero boundary conditions.

The next step is to decompose this Hamiltonian for quantum simulation. We set

n = 2

qubits per dimension. Following Proposition 1, the Pauli strings

x

that may appear in the decomposition are listed in the first column of Table A1. The second step involves analyzing the block velocity profile. In this case, the profile is split only along the z-direction, implying

m_{x} = 0

,

m_{y} = 0

, and

m_{z} = 1

. Consequently, the matrix S can be written as

S = diag (vec (C))) = \frac{c_{1} + c_{2}}{2} I \otimes I \otimes I \otimes I \otimes I \otimes I + \frac{c_{1} - c_{2}}{2} I \otimes I \otimes I \otimes I \otimes Z \otimes I .

(A7)

From this, we derive the masks for the

z

strings, shown in the second column of Table A1.

Table A1. Pauli strings for the three-dimensional wave equation, discretized with

n = 2

qubits per dimension and finite-difference operators approximated by tridiagonal matrices (

d = 1

). The strings

x_{p, k, j}

(with

k = 1

for

d = 1

) are generated per Proposition 1. The corresponding masks for the

z

strings are shown for a two-block velocity model; a “∗” denotes a position that can be 0 or 1. Bits are grouped for readability: the first

| \hat{B} (D) | = 2

bits are for the Hamiltonian, and each subsequent block of

n = 2

bits encodes a spatial direction. The third column lists the diagonalization operators for case where

x \cdot z = 0 (\mod 2)

, according to Proposition 2. Here,

H_{k}

denotes a Hadamard gate on qubit k, and

C N O T (c, t)

is a controlled-NOT gate with control c and target t.

Table A1. Pauli strings for the three-dimensional wave equation, discretized with

n = 2

qubits per dimension and finite-difference operators approximated by tridiagonal matrices (

d = 1

). The strings

x_{p, k, j}

(with

k = 1

for

d = 1

) are generated per Proposition 1. The corresponding masks for the

z

strings are shown for a two-block velocity model; a “∗” denotes a position that can be 0 or 1. Bits are grouped for readability: the first

| \hat{B} (D) | = 2

bits are for the Hamiltonian, and each subsequent block of

n = 2

bits encodes a spatial direction. The third column lists the diagonalization operators for case where

x \cdot z = 0 (\mod 2)

, according to Proposition 2. Here,

H_{k}

denotes a Hadamard gate on qubit k, and

C N O T (c, t)

is a controlled-NOT gate with control c and target t.

$x_{p, 1, j}$	z	Diagonalization Operator D
$x_{1, 0} = 01 \| 00 \| 00 \| 00$	$z = * * \| * * \| 00 \| * 0$	$H_{2}$
$x_{1, 1, 1} = 01 \| 01 \| 00 \| 00$	$z = * * \| * * \| 00 \| * 0$	$H_{2} C N O T (2, 4)$
$x_{1, 1, 2} = 01 \| 11 \| 00 \| 00$	$z = * * \| * * \| 00 \| * 0$	$H_{2} C N O T (2, 3) C N O T (2, 4)$
$x_{2, 0} = 10 \| 00 \| 00 \| 00$	$z = * * \| 00 \| * * \| * 0$	$H_{1}$
$x_{2, 1, 1} = 10 \| 00 \| 01 \| 00$	$z = * * \| 00 \| * * \| * 0$	$H_{1} C N O T (1, 6)$
$x_{2, 1, 2} = 10 \| 00 \| 11 \| 00$	$z = * * \| 00 \| * * \| * 0$	$H_{1} C N O T (1, 5) C N O T (1, 6)$
$x_{3, 0} = 11 \| 00 \| 00 \| 00$	$z = * * \| 00 \| 00 \| * *$	$H_{1} C N O T (1, 2)$
$x_{3, 1, 1} = 11 \| 00 \| 00 \| 01$	$z = * * \| 00 \| 00 \| * *$	$H_{1} C N O T (1, 2) C N O T (1, 8)$
$x_{3, 1, 2} = 11 \| 00 \| 00 \| 11$	$z = * * \| 00 \| 00 \| * *$	$H_{1} C N O T (1, 2) C N O T (1, 7) C N O T (1, 8)$

To illustrate, consider the row for

x_{3, 1, 1} = 11 | 00 | 00 | 01

with the mask

z = * * | 00 | 00 | * *

. This mask indicates that only the Pauli strings listed in the second column of Table A2 may appear in the decomposition. Since these Pauli strings all commute, we can apply Proposition 2 to construct a diagonalization circuit. The required operator D is given in the last column of Table A1, and the result of applying this diagonalization is shown in the final column of Table A2.

Table A2. Pauli strings generated for

x_{3, 1, 1} = 11 | 00 | 00 | 01

and its corresponding mask

z = * * | 00 | 00 | * *

, where “∗” denotes a position that can be 0 or 1. Only strings with an even number of Y operators (

x \cdot z = 0 (\mod 2)

) are retained as the Hamiltonian

{\tilde{H}}_{3}

is real. The first column lists the specific

z

strings generated from the mask. The second column shows the corresponding Pauli strings in the decomposition. The third column lists the

\tilde{z}

strings after diagonalization per Proposition 2, and the last column shows the resulting diagonalized Pauli strings with diagonalization operator

D = H_{1} C N O T (1, 2) C N O T (1, 8)

, and its conjugate transpose

D^{†} = C N O T (1, 8) C N O T (1, 2) H_{1}

, since both the CNOT and Hadamard gates are Hermitian. Bits and Pauli matrices are grouped for readability.

Table A2. Pauli strings generated for

x_{3, 1, 1} = 11 | 00 | 00 | 01

and its corresponding mask

z = * * | 00 | 00 | * *

, where “∗” denotes a position that can be 0 or 1. Only strings with an even number of Y operators (

x \cdot z = 0 (\mod 2)

) are retained as the Hamiltonian

{\tilde{H}}_{3}

is real. The first column lists the specific

z

strings generated from the mask. The second column shows the corresponding Pauli strings in the decomposition. The third column lists the

\tilde{z}

strings after diagonalization per Proposition 2, and the last column shows the resulting diagonalized Pauli strings with diagonalization operator

D = H_{1} C N O T (1, 2) C N O T (1, 8)

, and its conjugate transpose

D^{†} = C N O T (1, 8) C N O T (1, 2) H_{1}

, since both the CNOT and Hadamard gates are Hermitian. Bits and Pauli matrices are grouped for readability.

$z = * * \| 00 \| 00 \| * *$	$W (x, z)$	$\tilde{z} = 1 * \| 00 \| 00 \| * *$	$W (0, \tilde{z}) = {(- 1)}^{(x \cdot z) / 2} DW (x, z) D^{†}$
$00 \| 00 \| 00 \| 00$	$X X \| I I \| I I \| I X$	$10 \| 00 \| 00 \| 00$	$+ Z I \| I I \| I I \| I I$
$00 \| 00 \| 00 \| 10$	$X X \| I I \| I I \| Z X$	$10 \| 00 \| 00 \| 10$	$+ Z I \| I I \| I I \| Z I$
$01 \| 00 \| 00 \| 01$	$X Y \| I I \| I I \| I Y$	$11 \| 00 \| 00 \| 01$	$- Z Z \| I I \| I I \| I Z$
$01 \| 00 \| 00 \| 11$	$X Y \| I I \| I I \| Z Y$	$11 \| 00 \| 00 \| 11$	$- Z Z \| I I \| I I \| Z Z$
$10 \| 00 \| 00 \| 01$	$Y X \| I I \| I I \| I Y$	$10 \| 00 \| 00 \| 01$	$- Z I \| I I \| I I \| I Z$
$10 \| 00 \| 00 \| 11$	$Y X \| I I \| I I \| Z Y$	$10 \| 00 \| 00 \| 11$	$- Z I \| I I \| I I \| Z Z$
$11 \| 00 \| 00 \| 00$	$Y Y \| I I \| I I \| I X$	$11 \| 00 \| 00 \| 00$	$- Z Z \| I I \| I I \| I I$
$11 \| 00 \| 00 \| 10$	$Y Y \| I I \| I I \| Z X$	$11 \| 00 \| 00 \| 10$	$- Z Z \| I I \| I I \| Z I$

We express the total Hamiltonian as a sum

{\tilde{H}}_{D} = \sum_{γ = 1}^{Γ} H_{γ}

, where each

H_{γ}

is a mutually commuting group of Pauli strings characterized by a single

x

. Since these groups do not all commute with each other, we employ a Trotter–Suzuki decomposition to approximate the time evolution.

A single Trotter step can be implemented using methods from [21]. The quantum circuit for the propagator

exp (- i H_{γ} t)

, corresponding to the group characterized by

x_{3, 1, 1}

, is shown in Figure A3. The circuit emphasizes the gate arrangement; the specific rotation angles for the

R_{z}

gates (which are defined in our implementation) are omitted for visual clarity. Using a Gray code ordering minimizes the number of CNOT gates in the diagonalization circuit.

The circuits for each group

x

are combined to form a single Trotter step. These steps are then repeated according to the chosen Trotter formula order to achieve the desired simulation accuracy.

Figure A3. Quantum circuit for the propagator

exp (- i H_{γ} t)

, where

H_{γ}

is characterized by

x_{3, 1, 1}

. The

z

strings are processed in the following order (left to right): 00000000, 10000001, 10000011, 00000010, 11000010, 01000011, 01000001, 11000000. That is, in the the Gray code order for

\tilde{z}

. The rotation angles are

θ = 2 α t_{t r}

, where

α

is the coefficient of the corresponding Pauli string and

t_{t r}

is the scaled time from the Trotter decomposition (e.g., for a first-order formula,

t_{t r} = t / r

, with r being the number of steps). The figure shows gate arrangement; the numerical values inside the

R_{z}

gates are not relevant.

Figure A3. Quantum circuit for the propagator

exp (- i H_{γ} t)

, where

H_{γ}

is characterized by

x_{3, 1, 1}

. The

z

strings are processed in the following order (left to right): 00000000, 10000001, 10000011, 00000010, 11000010, 01000011, 01000001, 11000000. That is, in the the Gray code order for

\tilde{z}

. The rotation angles are

θ = 2 α t_{t r}

, where

α

is the coefficient of the corresponding Pauli string and

t_{t r}

is the scaled time from the Trotter decomposition (e.g., for a first-order formula,

t_{t r} = t / r

, with r being the number of steps). The figure shows gate arrangement; the numerical values inside the

R_{z}

gates are not relevant.

Appendix C. Application to a 3D Standing Wave Problem

This section demonstrates the application of our proposed quantum algorithm to a three-dimensional standing wave problem. We consider a constant wave velocity

c = 1

and compare the computational complexity of our method against a standard finite-difference method (FDM) for the accuracy level. The problem is defined by the following wave equation and initial/boundary conditions:

\begin{matrix} \frac{\partial^{2} u (t, \vec{x})}{\partial t^{2}} = \frac{\partial^{2} u (t, \vec{x})}{\partial x^{2}} + \frac{\partial^{2} u (t, \vec{x})}{\partial y^{2}} + \frac{\partial^{2} u (t, \vec{x})}{\partial z^{2}}, \\ u (0, \vec{x}) = sin (π x) sin (π y) sin (π z), \\ \frac{\partial u (0, \vec{x})}{\partial t} = 0, \\ u (t, 0, y, z) = u (t, x, 0, z) = u (t, x, y, 0) = 0, \\ u (t, 1, y, z) = u (t, x, 1, z) = u (t, x, y, 1) = 0 . \end{matrix}

(A8)

An exact analytical solution is known for this problem:

u_{exact} (x, y, z, t) = sin (π x) sin (π y) sin (π z) cos (\sqrt{3} π t) .

(A9)

We compare the solution from our quantum method against a classical second-order FDM using an identical spatial discretization. The classical FDM solution is constructed iteratively for

u

—the discretized and vectorized version of

u (t, \vec{x})

:

\begin{matrix} u (0) = u (Δ t) = f, \\ u (k Δ t) = Δ t^{2} L_{3} u ((k - 1) Δ t) + 2 u ((k - 1) Δ t) - u ((k - 2) Δ t), k \geq 2 . \end{matrix}

(A10)

Here,

f

is the discretized and vectorized initial condition

u (0, \vec{x})

. The discrete Laplacian

L_{3}

is defined as

L_{3} = L_{x} \otimes I_{y} \otimes I_{z} + I_{x} \otimes L_{y} \otimes I_{z} + I_{x} \otimes I_{y} \otimes L_{z},

(A11)

where

L_{x} = - B_{x}^{T} B_{x}

,

L_{y} = - B_{y}^{T} B_{y}

, and

L_{z} = - B_{z}^{T} B_{z}

. The matrix

B_{x}

(and similarly

B_{y}

,

B_{z}

) is a first-order finite-difference operator (same as in Hamiltonian) with zero boundary conditions. For example, for

n = 2

we have

B_{x} = (\begin{matrix} 0 & 0 & 0 & 0 \\ 0 & - 1 & 1 & 0 \\ 0 & 0 & - 1 & 0 \\ 0 & 0 & 0 & 0 \end{matrix})

. For the classical FDM simulation, the number of time steps

N_{t}

is chosen to satisfy the Courant–Friedrichs–Lewy (CFL) condition for numerical stability, specifically

N_{t} = ⌈ \sqrt{3} N_{x} ⌉

, where

N_{x} = N_{y} = N_{z}

.

To quantify accuracy, we use the discrete

L^{2}

-norm error:

ϵ_{n u m} (u, v) = \sqrt{Δ V \sum_{j} {(u_{j} - v_{j})}^{2}}, Δ V = Δ x Δ y Δ z .

(A12)

After computing the error

ϵ_{n u m}

between the FDM and analytical solutions, we estimate the number of Trotter steps r required for the quantum algorithm to achieve the same accuracy. This estimate is derived from the data plotted in Figure A4, where the crosses represent the quantum algorithm’s error (error between the quantum and analytical solutions) as a function of r. For each spatial discretization level (i.e., for each number of qubits per dimension), we locate the fixed FDM error level on the y-axis. We then determine the point where a horizontal line at this FDM error value intersects the interpolated trend (dashed line) of the quantum error data on the log–log plot. The x-coordinate of this intersection gives the estimated number of Trotter steps r needed for the quantum solution to match the classical FDM’s precision. These intersection points are marked with dots in the figure.

Figure A4. Estimation of the quantum Trotter steps required for the second order formula to match classical FDM accuracy. The crosses show

ϵ_{num}

— error between quantum algorithm and analytical solution — plotted against the number of Trotter steps r for varying qubits per dimension n, corresponding to discretization points

N = 2^{n}

. For a fixed FDM error, the required Trotter steps are determined by the intersection with the error trend (dashed line). These intersections, marked by dots, provide the estimated r required for the quantum algorithm to achieve the same accuracy as the FDM.

Figure A4. Estimation of the quantum Trotter steps required for the second order formula to match classical FDM accuracy. The crosses show

ϵ_{num}

— error between quantum algorithm and analytical solution — plotted against the number of Trotter steps r for varying qubits per dimension n, corresponding to discretization points

N = 2^{n}

. For a fixed FDM error, the required Trotter steps are determined by the intersection with the error trend (dashed line). These intersections, marked by dots, provide the estimated r required for the quantum algorithm to achieve the same accuracy as the FDM.

Finally, we compare the computational cost, measured in the total number of operations. For the classical FDM, the cost is estimated as

g_{c l} = 3 N_{t} N^{3} = 3 \sqrt{3} N^{4}

as the matrix

L_{3}

has 3 non-zero entries per row, and the solution is evolved over

N_{t}

time steps. For the quantum algorithm, the cost is the total number of one- and two-qubit gates, estimated as

g_{q} = r \cdot g_{1}

, where r is the number of Trotter steps and

g_{1}

is the gate count for a single step (can be evaluate as in Equation (39)). This comparison is shown in Figure A5, where it can be seen that the quantum algorithm requires fewer operations than the standard FDM for this problem.

Figure A5. Comparison of the total number of operations (gate count for quantum, floating-point operations for classical) required by the quantum algorithm with the second order formula versus the classical FDM.

Appendix D. Proofs

Proof of Proposition 1.

Without loss of generality, we assume

D + 1 = 2^{n_{d}}

for some integer

n_{d}

. If this is not the case, one can extend the Hamiltonian by adding matrices

B_{p} = 0

for

p = D + 1, \dots, D^{'}

, where

D^{'} = 2^{⌈ {log}_{2} (D) ⌉}

, ensuring

D^{'}

is a power of two. This padding does not alter the structure or Pauli decomposition of the original

H_{D}

.

To prove the proposition, we utilize the standard basis matrices

E_{i, j}^{n}

, defined as the

2^{n} \times 2^{n}

matrices that are zero everywhere except for a 1 at position

(i, j)

. These matrices possess a crucial tensor product structure:

E_{i, j}^{n} = E_{i_{0}, j_{0}}^{1} \otimes \dots \otimes E_{i_{n - 1}, j_{n - 1}}^{1},

(A13)

where

i_{k}

and

j_{k}

are the bits in the n-bit binary representations of i and j, respectively, i.e.,

{\hat{B}}_{l} (n, i)

and

{\hat{B}}_{l} (n, j)

. Using this basis, the Hamiltonian

H_{D}

can be expressed as

H_{D} = \sum_{p = 1}^{D} (E_{0, p}^{n_{d}} \otimes B_{p} + E_{p, 0}^{n_{d}} \otimes B_{p}^{†}) .

(A14)

Since the Pauli strings constituting

B_{p}

and

B_{p}^{†}

are identical (the difference is only in coefficients of these matrices), we can denote their combined Pauli support simply as

P_{p}

. Moreover, according to Proposition 1 in [16], the matrix

B_{p}

admits a decomposition into Pauli strings characterized by binary vectors of the form

x_{k, j} = {\hat{B}}_{l} (n - s, 2^{j} - 1) * {\hat{B}}_{l} (s, 2^{s} - k),

(A15)

for

k \in {1, \dots, d}

,

j \in {1, \dots, n - s}

,

s = ⌈ {log}_{2} (k) ⌉

, and

d \leq 2^{n - 1}

, plus an additional diagonal string

x_{0} = {\hat{B}}_{l} (n, 0)

.

The single-qubit basis matrices have well-known Pauli decompositions:

\begin{matrix} E_{0, 0}^{1} & = (I + Z) / 2, & E_{1, 1}^{1} & = (I - Z) / 2, \\ E_{0, 1}^{1} & = (X + i Y) / 2, & E_{1, 0}^{1} & = (X - i Y) / 2 . \end{matrix}

Crucially,

E_{0, 0}^{1}

and

E_{1, 1}^{1}

are diagonal and are represented by the binary string

x = 0

(indicating I or Z), while

E_{0, 1}^{1}

and

E_{1, 0}^{1}

are off-diagonal and are represented by

x = 1

(indicating X or Y).

Now, applying the tensor product property (A13), we analyze the term

E_{0, p}^{n_{d}}

:

E_{0, p}^{n_{d}} = E_{0, p_{0}}^{1} \otimes \dots \otimes E_{0, p_{n_{d} - 1}}^{1},

(A16)

where

p_{0} \dots p_{n_{d} - 1} \equiv {\hat{B}}_{l} (n_{d}, p)

is the

n_{d}

-bit binary representation of p. The Pauli string representation of this matrix is determined by the

x

-component of each single-qubit factor. Since each factor

E_{0, p_{k}}^{1}

is represented by the binary value

p_{k}

, the full string for

E_{0, p}^{n_{d}}

is simply

x = {\hat{B}}_{l} (n_{d}, p)

. An identical argument shows that

E_{p, 0}^{n_{d}}

is also represented by the same string,

x = {\hat{B}}_{l} (n_{d}, p)

.

Therefore, the overall Pauli string representation for a term

(E_{0, p}^{n_{d}} + E_{p, 0}^{n_{d}}) \otimes P_{p}

in

H_{D}

is given by the concatenation of the string for the first subsystem and the string for

P_{p}

. This yields the general form

x_{p, k, j} = {\hat{B}}_{l} (n_{d}, p) * {\hat{B}}_{l} (n - s, 2^{j} - 1) * {\hat{B}}_{l} (s, 2^{s} - k),

(A17)

for

p \in {1, \dots, D}

,

k \in {1, \dots, d}

,

j \in {1, \dots, n - s}

,

s = ⌈ {log}_{2} (k) ⌉

. Additionally, the diagonal contributions from

P_{p}

lead to strings of the form

x_{p, 0} = {\hat{B}}_{l} (n_{d}, p) * {\hat{B}}_{l} (n, 0) .

(A18)

This completes the characterization of all Pauli strings present in the decomposition of

H_{D}

. □

Proof of Proposition 2.

We employ the standard tableau method for the simultaneous diagonalization of sets of commuting Pauli strings, as detailed in [19,20]. For a set of m commuting Pauli operators, each acting on n qubits, the tableau is a data structure of size

m \times 2 n

bits. Each row k of the tableau corresponds to a Pauli string

P_{k}

and is represented as a pair

(x_{k}, z_{k})

, where the binary vectors

x_{k}, z_{k} \in B^{n}

encode the corresponding Pauli string. The tableau for our set, characterized by a single

x

, is structured as follows:

[\begin{matrix} x_{1, 1} & \dots & x_{1, n} & | & z_{1, 1} & \dots & z_{1, n} \\ ⋮ & ⋱ & ⋮ & | & ⋮ & ⋱ & ⋮ \\ x_{1, 1} & \dots & x_{1, n} & | & z_{m, 1} & \dots & z_{m, n} \end{matrix}] .

Our goal is to construct a Clifford circuit that maps these operators to strings consisting solely of Z and I operators. This is equivalent to transforming the tableau such that the

x

-block becomes zero.

The transformation rules for the tableau under elementary Clifford gates are well-established [19,20]:

A Hadamard gate on qubit j swaps the j-th columns of the $x$ and $z$ blocks for all rows.
A CNOT gate with control c and target t performs the following for all rows:
- $x_{k, t} \leftarrow x_{k, c} \oplus x_{k, t}$
- $z_{k, c} \leftarrow z_{k, c} \oplus z_{k, t}$
A Phase gate on qubit j performs
- $z_{k, j} \leftarrow z_{k, j} \oplus x_{k, j}$

The diagonalization procedure processes can be described as follows:

Identify Pivot: Find the first column j (the pivot column) for which $x_{k, j} = 1$ .
Eliminate Other Entries: For every other columns $l > j$ with $x_{k, l} = 1$ , apply a CNOT gate with control j and target l. This operation adds column j to column l in the $x$ -block, zeroing out the entry $x_{k, l}$ .
Clear the Pivot: After step 2, only the pivot column j has a 1 in $x$ -block. At the same time only j-th column of the $z$ -block has changed. Moreover its values are equal to $z_{k, j} = ⨁_{l = 1}^{M} z_{k, l}$ , where l runs over all non zero bit in the initially given $x$ . It can be seen that $⨁_{l = 1}^{M} z_{k, l} = x \cdot z_{k}$ .
- If we are given a set, characterized by $x \cdot z_{k} = 0 (\mod 2)$ , apply a Hadamard gate on qubit j. This swaps the 1 in the $x$ -block with the 0 in the $z$ -block, completing the zeroing of $x$ block.
- If we are given a set, characterized by $x \cdot z_{k} = 1 (\mod 2)$ , first apply a Phase gate on qubit j. This sets $z_{k, j} \leftarrow z_{k, j} \oplus x_{k, j} = 0$ (since $z_{k, j} = 1$ and $x_{k, j} = 1$ ). Then, apply a Hadamard gate on qubit j to swap the 1 in $x$ with the 0 in $z$ .

The final result is a tableau with

x = 0

, meaning all Pauli strings have been diagonalized. Additionally, it is easy to see that after the diagonalization procedure only a single bit is changed in

z_{k}

strings. That is, we set pivot column bit j to 1 in each

z_{k}

.

We now analyze the cumulative sign change from the specific CNOT operations applied in Step 2 of the diagonalization procedure. In this step, for a fixed target qubit t, a CNOT gate with control c (the pivot) and target t is applied to every other column k in the set for which

x_{t, k} = 1

.

Substituting these values simplifies the update rule significantly:

r_{k} \leftarrow r_{k} \oplus (1 \cdot z_{k, t} \cdot (1 \oplus z_{k, c} \oplus 1)) = r_{k} \oplus z_{k, t} z_{k, c} .

(A19)

Using it directly based on step 2 in the diagonalization procedure, one can obtain following result

r = ⨁_{i = 1}^{M} z_{i} (⨁_{j = 0}^{i - 1} z_{j}) = \sum_{0 \leq j < i \leq M} z_{i} z_{j} (\mod 2) = C_{t}^{2} mod 2,

(A20)

where

z_{i}

denotes i-th index in

z

(that is i-th column in table), and

C_{t}^{2}

is number of combinations with

t = x \cdot z

being the number of bits in

z

equal to 1. Simplifying it further we can get

r = C_{t}^{2} mod 2 = \frac{t (t - 1)}{2} mod 2 = ⌊ \frac{t}{2} mod 2 ⌋,

(A21)

Additionally, the sign bit

r_{k}

remains unchanged after the application of a Hadamard gate. The sign update rule for a Hadamard gate on qubit t is

r_{k} \leftarrow r_{k} \oplus (x_{k, t} \cdot z_{k, t})

. This update is trivial because a Hadamard gate is only applied to a qubit t when its state in the tableau is prepared such that either

x_{k, t} = 0

or

z_{k, t} = 0

, ensuring the product

x_{k, t} \cdot z_{k, t}

is always zero.

In contrast, the sign update rule for a Phase (S) gate is identical:

r_{k} \leftarrow r_{k} \oplus (x_{k, t} \cdot z_{k, t})

. However, an S gate is applied under precisely the opposite condition when both

x_{k, t} = 1

and

z_{k, t} = 1

. Consequently, the product

x_{k, t} \cdot z_{k, t}

equals 1, and the sign bit

r_{k}

is flipped.

That is the sign that appears in diagonalization is

{(- 1)}^{r}

, where

r = \{\begin{matrix} \frac{t}{2} mod 2 & if t = x \cdot z = 0 (\mod 2), \\ \frac{t + 1}{2} mod 2 & if t = x \cdot z = 1 (\mod 2) . \end{matrix}

(A22)

or,

r = (\frac{t + (t mod 2)}{2}) mod 2 .

(A23)

Finally the sign that appears during diagonalization is given in the form

{(- 1)}^{r} = {(- 1)}^{(\frac{t + (t mod 2)}{2}) mod 2} = {(- 1)}^{(\frac{t + (t mod 2)}{2})}

(A24)

□

Proof of Proposition 3.

Let the velocity-model operator be decomposed as

\tilde{S} = \sum_{j = 1}^{T} α_{j} Z^{z_{j}}

, where

Z^{z} = ⨂_{k = 1}^{D n + ⌈ {log}_{2} (D + 1) ⌉} Z^{z_{k}}

is a Pauli string composed solely of I and Z matrices, and the Hamiltonian as

H_{D} = \sum_{j = 1}^{K} β_{j} P_{j}

, where

P_{j}

are Pauli strings.

We note that the operators

{\tilde{B}}_{j} = I \otimes \dots \otimes B_{j} \otimes \dots \otimes I

are constructed by embedding

B_{j}

into a larger tensor product. Consequently, the Pauli strings constituting

{\tilde{B}}_{j}

are those of

B_{j}

, padded with zeros in the

x

and

z

vectors at positions corresponding to the identity matrices. Therefore, the structure of the commuting sets in

H_{D}

, as established by Proposition 1 and Corollary 1, remains valid. The Hamiltonian

H_{D}

can be expressed as a sum over

Γ

mutually commuting sets:

H_{D} = \sum_{γ = 1}^{Γ} X^{x_{γ}} (\sum_{k = 1}^{K_{γ}} i^{x_{γ} \cdot z_{k}} β_{k} Z^{z_{k}}),

(A25)

where the phase factor

i^{x_{γ} \cdot z_{k}}

arises from the Walsh operator definition (Equation (9)). According to Corollary 1, and since

H_{D}

is real-valued, the number of sets is

Γ = s (D, d, n) = D (2^{⌈ {log}_{2} d ⌉} + (n - ⌈ {log}_{2} d ⌉) d)

, and the number of strings per set is

K_{γ} = 2^{n + ⌈ {log}_{2} (D + 1) ⌉ - 1}

, where the term

- 1

accounts for the constraint of an even number of Y operators.

We now analyze the transformed Hamiltonian

{\tilde{H}}_{D} = \tilde{S} H_{D} \tilde{S}

:

\begin{matrix} \tilde{S} H_{D} \tilde{S} & = (\sum_{j = 1}^{T} α_{j} Z^{z_{j}}) [\sum_{γ = 1}^{Γ} X^{x_{γ}} (\sum_{k = 1}^{K_{γ}} i^{x_{γ} \cdot z_{k}} β_{k} Z^{z_{k}})] (\sum_{l = 1}^{T} α_{l} Z^{z_{l}}) \\ = \sum_{γ = 1}^{Γ} \sum_{j, l = 1}^{T} α_{j} α_{l} Z^{z_{j}} X^{x_{γ}} (\sum_{k = 1}^{K_{γ}} i^{x_{γ} \cdot z_{k}} β_{k} Z^{z_{k}}) Z^{z_{l}} . \end{matrix}

(A26)

Using the Pauli commutation relation

Z^{z} X^{x} = {(- 1)}^{x \cdot z} X^{x} Z^{z}

and the fact that Z operators commute, we simplify:

\begin{matrix} \tilde{S} H_{D} \tilde{S} & = \sum_{γ = 1}^{Γ} \sum_{j, l = 1}^{T} \sum_{k = 1}^{K_{γ}} i^{x_{γ} \cdot z_{k}} {(- 1)}^{x_{γ} \cdot z_{j}} α_{j} α_{l} β_{k} X^{x_{γ}} Z^{z_{j} \oplus z_{k} \oplus z_{l}} \\ = \sum_{γ = 1}^{Γ} X^{x_{γ}} (\sum_{p = 1}^{K_{γ}^{'}} ν_{p} Z^{z_{p}}) . \end{matrix}

(A27)

In the final expression, the index p runs over all unique binary vectors

z_{p} = z_{j} \oplus z_{k} \oplus z_{l}

generated by the indices

j, l \in {1, \dots, T}

and

k \in {1, \dots, K_{γ}}

. The coefficient

ν_{p}

is the sum of all terms

i^{x_{γ} \cdot z_{k}} {(- 1)}^{x_{γ} \cdot z_{j}} α_{j} α_{l} β_{k}

for which

z_{j} \oplus z_{k} \oplus z_{l} = z_{p}

.

Two key observations follow from Equation (A27):

The commuting sets of $\tilde{S} H_{D} \tilde{S}$ are identical to those of $H_{D}$ as they are characterized by the same $x_{γ}$ strings.
The number of Pauli strings $K_{γ}^{'}$ within each set $γ$ is bounded by the number of unique $z_{p}$ vectors that can be generated. This number is at most the minimum of two values:
- The total number of combinations: $T^{2} K_{γ}$ .
- The total number of all possible diagonal Pauli strings of length $L = D n + ⌈ {log}_{2} (D + 1) ⌉$ , which is $(D + 1) N^{D}$ . However, since $\tilde{S} H_{D} \tilde{S}$ remains a real-valued matrix, its Pauli decomposition can only contain strings with an even number of Y operators. For a fixed $x_{γ}$ , this restricts the associated $z_{p}$ vectors, effectively halving the number of possibilities to $(D + 1) N^{D} / 2$ .

Thus,

K_{γ}^{'} \leq min (T^{2} K_{γ}, (D + 1) N^{D} / 2)

, and the total number of Pauli strings in the decomposition of

{\tilde{H}}_{D}

is bounded by

g_{H} \leq Γ \cdot min (T^{2} K_{γ}, (D + 1) N^{D} / 2) .

(A28)

Substituting the expressions

Γ \approx D d n

(for

d ≪ N

),

K_{γ} = (D + 1) N / 2

, and

N^{D} = 2^{D n}

, the scaling of

g_{H}

can be written as

g_{H} = O (D^{2} d n N min (T^{2}, N^{D - 1})) .

(A29)

□

References

Virieux, J.; Operto, S. An overview of full-waveform inversion in exploration geophysics. Geophysics 2009, 74, WCC1–WCC26. [Google Scholar] [CrossRef]
Taflove, A.; Hagness, S.C.; Piket-May, M. Computational electromagnetics: The finite-difference time-domain method. In The Electrical Engineering Handbook; Elsevier: Amsterdam, The Netherlands, 2005; Volume 3, p. 15. [Google Scholar]
LeVeque, R.J. Finite Difference Methods for Ordinary and Partial Differential Equations: Steady-State and Time-Dependent Problems; Society for Industrial and Applied Mathematics (SIAM): Philadelphia, PA, USA, 2007. [Google Scholar]
Tarantola, A. Inverse Problem Theory and Methods for Model Parameter Estimation; Society for Industrial and Applied Mathematics (SIAM): Philadelphia, PA, USA, 2005. [Google Scholar]
Pratt, R.G. Seismic waveform inversion in the frequency domain, Part 1: Theory and verification in a physical scale model. Geophysics 1999, 64, 888–901. [Google Scholar] [CrossRef]
Yee, K. Numerical solution of initial boundary value problems involving Maxwell’s equations in isotropic media. IEEE Trans. Antennas Propag. 1966, 14, 302–307. [Google Scholar]
Harrow, A.W.; Hassidim, A.; Lloyd, S. Quantum algorithm for linear systems of equations. Phys. Rev. Lett. 2009, 103, 150502. [Google Scholar] [CrossRef]
Berry, D.W. High-order quantum algorithm for solving linear differential equations. J. Phys. A Math. Theor. 2014, 47, 105301. [Google Scholar] [CrossRef]
Costa, P.C.S.; Jordan, S.; Ostrander, A. Quantum algorithm for simulating the wave equation. Phys. Rev. A 2019, 99, 012323. [Google Scholar] [CrossRef]
Bösch, C.; Schade, M.; Aloisi, G.; Keating, S.D.; Fichtner, A. Quantum wave simulation with sources and loss functions. Phys. Rev. Res. 2025, 7, 033225. [Google Scholar] [CrossRef]
Wright, L.; Mc Keever, C.; First, J.T.; Johnston, R.; Tillay, J.; Chaney, S.; Rosenkranz, M.; Lubasch, M. Noisy intermediate-scale quantum simulation of the one-dimensional wave equation. Phys. Rev. Res. 2024, 6, 043169. [Google Scholar] [CrossRef]
Suau, A.; Staffelbach, G.; Calandra, H. Practical quantum computing: Solving the wave equation using a quantum approach. ACM Trans. Quantum Comput. 2021, 2, 1–35. [Google Scholar] [CrossRef]
Sato, Y.; Kondo, R.; Hamamura, I.; Onodera, T.; Yamamoto, N. Hamiltonian simulation for hyperbolic partial differential equations by scalable quantum circuits. Phys. Rev. Res. 2024, 6, 033246. [Google Scholar] [CrossRef]
Sato, Y.; Tezuka, H.; Kondo, R.; Yamamoto, N. Quantum algorithm for partial differential equations of nonconservative systems with spatially varying parameters. Phys. Rev. Appl. 2025, 23, 014063. [Google Scholar] [CrossRef]
Arseniev, B.; Guskov, D.; Sengupta, R.; Biamonte, J.; Zacharov, I. Tridiagonal matrix decomposition for Hamiltonian simulation on a quantum computer. Phys. Rev. A 2024, 109, 052629. [Google Scholar] [CrossRef]
Arseniev, B.; Guskov, D.; Sengupta, R.; Zacharov, I. High-order schemes for solving partial differential equations on a quantum computer. Phys. Rev. A 2025, 111, 042625. [Google Scholar] [CrossRef]
Berry, D.W.; Ahokas, G.; Cleve, R.; Sanders, B.C. Efficient quantum algorithms for simulating sparse Hamiltonians. Commun. Math. Phys. 2007, 270, 359–371. [Google Scholar] [CrossRef]
Childs, A.M.; Su, Y.; Tran, M.C.; Wiebe, N.; Zhu, S. Theory of trotter error with commutator scaling. Phys. Rev. X 2021, 11, 011020. [Google Scholar] [CrossRef]
Kawase, Y.; Fujii, K. Fast classical simulation of Hamiltonian dynamics by simultaneous diagonalization using Clifford transformation with parallel computation. Comput. Phys. Commun. 2023, 288, 108720. [Google Scholar] [CrossRef]
Van Den Berg, E.; Temme, K. Circuit optimization of Hamiltonian simulation by simultaneous diagonalization of Pauli clusters. Quantum 2020, 4, 322. [Google Scholar] [CrossRef]
Welch, J.; Greenbaum, D.; Mostame, S.; Aspuru-Guzik, A. Efficient quantum circuits for diagonal unitaries without ancillas. New J. Phys. 2014, 16, 033040. [Google Scholar] [CrossRef]
Sato, H.; Fehler, M.C.; Maeda, T. Seismic Wave Propagation and Scattering in the Heterogeneous Earth; Springer: Berlin/Heidelberg, Germany, 2012; Volume 496. [Google Scholar]
Low, G.H.; Chuang, I.L. Optimal Hamiltonian simulation by quantum signal processing. Phys. Rev. Lett. 2017, 118, 010501. [Google Scholar] [CrossRef]
Berry, D.W.; Childs, A.M.; Cleve, R.; Kothari, R.; Somma, R.D. Simulating Hamiltonian dynamics with a truncated Taylor series. Phys. Rev. Lett. 2015, 114, 090502. [Google Scholar] [CrossRef]
Kandala, A.; Temme, K.; Córcoles, A.D.; Mezzacapo, A.; Chow, J.M.; Gambetta, J.M. Error mitigation extends the computational reach of a noisy quantum processor. Nature 2019, 567, 491–495. [Google Scholar] [CrossRef]
Preskill, J. Quantum computing in the NISQ era and beyond. Quantum 2018, 2, 79. [Google Scholar] [CrossRef]
Cerezo, M.; Arrasmith, A.; Babbush, R.; Benjamin, S.C.; Endo, S.; Fujii, K.; McClean, J.R.; Mitarai, K.; Yuan, X.; Cincio, L.; et al. Variational quantum algorithms. Nat. Rev. Phys. 2021, 3, 625–644. [Google Scholar] [CrossRef]
Peruzzo, A.; McClean, J.; Shadbolt, P.; Yung, M.H.; Zhou, X.Q.; Love, P.J.; Aspuru-Guzik, A.; O’brien, J.L. A variational eigenvalue solver on a photonic quantum processor. Nat. Commun. 2014, 5, 4213. [Google Scholar] [CrossRef] [PubMed]
Childs, A.M.; Ostrander, A.; Su, Y. Faster quantum simulation by randomization. Quantum 2019, 3, 182. [Google Scholar] [CrossRef]
Jin, S.; Liu, N.; Yu, Y. Quantum simulation of partial differential equations via Schrödingerization. Phys. Rev. Lett. 2024, 133, 230602. [Google Scholar] [CrossRef]
Zacharov, I.; Arslanov, R.; Gunin, M.; Stefonishin, D.; Bykov, A.; Pavlov, S.; Panarin, O.; Maliutin, A.; Rykovanov, S.; Fedorov, M. “Zhores”—Petaflops supercomputer for data—Driven modeling, machine learning and artificial intelligence installed in Skolkovo Institute of Science and Technology. Open Eng. 2019, 9, 512–520. [Google Scholar] [CrossRef]

Figure 1. Number of non-zero Pauli terms in the decomposition of the Hamiltonian

{\tilde{H}}_{3}

given by Equation (37). Empirical data (crosses), obtained with randomized velocity profiles for various block sizes m, are compared against the theoretical upper bound from Equation (38) (dotted line). The results demonstrate the scaling of the term count with the number of discretization qubits n for different values of m.

Figure 1. Number of non-zero Pauli terms in the decomposition of the Hamiltonian

{\tilde{H}}_{3}

given by Equation (37). Empirical data (crosses), obtained with randomized velocity profiles for various block sizes m, are compared against the theoretical upper bound from Equation (38) (dotted line). The results demonstrate the scaling of the term count with the number of discretization qubits n for different values of m.

Figure 2. Gate count for a single first-order Trotter step of the Hamiltonian

{\tilde{H}}_{3}

. Empirical data (crosses), obtained from compiled QASM circuits, are compared against the theoretical upper bound from Equation (39) (dotted line). The results demonstrate the scaling of the gate count with the number of discretization qubits n for different velocity block sizes m.

Figure 2. Gate count for a single first-order Trotter step of the Hamiltonian

{\tilde{H}}_{3}

. Empirical data (crosses), obtained from compiled QASM circuits, are compared against the theoretical upper bound from Equation (39) (dotted line). The results demonstrate the scaling of the gate count with the number of discretization qubits n for different velocity block sizes m.

Figure 3. Trotter error scaling for the Hamiltonian

{\tilde{H}}_{3}

with fixed parameters

n = 5

,

m = 2

. The error

ϵ

given by (40) is plotted against the number of Trotter steps r for different Trotter orders p. The results validate the algorithm and demonstrate the expected convergence with increasing r and p.

Figure 3. Trotter error scaling for the Hamiltonian

{\tilde{H}}_{3}

with fixed parameters

n = 5

,

m = 2

. The error

ϵ

given by (40) is plotted against the number of Trotter steps r for different Trotter orders p. The results validate the algorithm and demonstrate the expected convergence with increasing r and p.

Table 1. Example of all possible Pauli strings for the three-dimensional wave equation, discretized with 2 qubits per dimension and with finite difference operators approximated by tridiagonal matrices (

d = 1

). The strings

x_{p, k, j}

(where

k = 1

due to

d = 1

) are generated according to Proposition 1. The corresponding masks for the

z

strings are shown for different values of m in the block speed model; a “∗” denotes a position that can be either 0 or 1. Bits are grouped for readability: the first

| \hat{B} (D) | = 2

bits are used for the Hamiltonian construction, and each subsequent block of

n = 2

bits encodes a spatial direction.

Table 1. Example of all possible Pauli strings for the three-dimensional wave equation, discretized with 2 qubits per dimension and with finite difference operators approximated by tridiagonal matrices (

d = 1

). The strings

x_{p, k, j}

(where

k = 1

due to

d = 1

) are generated according to Proposition 1. The corresponding masks for the

z

strings are shown for different values of m in the block speed model; a “∗” denotes a position that can be either 0 or 1. Bits are grouped for readability: the first

| \hat{B} (D) | = 2

bits are used for the Hamiltonian construction, and each subsequent block of

n = 2

bits encodes a spatial direction.

$x_{p, 1, j}$	z, $m = 0$	z, $m = 1$	z, $m = 2$
$x_{1, 0} = 01 \| 00 \| 00 \| 00$	$z = * * \| * * \| 00 \| 00$	$z = * * \| * * \| * 0 \| * 0$	$z = * * \| * * \| * * \| * *$
$x_{1, 1, 1} = 01 \| 01 \| 00 \| 00$	$z = * * \| * * \| 00 \| 00$	$z = * * \| * * \| * 0 \| * 0$	$z = * * \| * * \| * * \| * *$
$x_{1, 1, 2} = 01 \| 11 \| 00 \| 00$	$z = * * \| * * \| 00 \| 00$	$z = * * \| * * \| * 0 \| * 0$	$z = * * \| * * \| * * \| * *$
$x_{2, 0} = 10 \| 00 \| 00 \| 00$	$z = * * \| 00 \| * * \| 00$	$z = * * \| * 0 \| * * \| * 0$	$z = * * \| * * \| * * \| * *$
$x_{2, 1, 1} = 10 \| 00 \| 01 \| 00$	$z = * * \| 00 \| * * \| 00$	$z = * * \| * 0 \| * * \| * 0$	$z = * * \| * * \| * * \| * *$
$x_{2, 1, 2} = 10 \| 00 \| 11 \| 00$	$z = * * \| 00 \| * * \| 00$	$z = * * \| * 0 \| * * \| * 0$	$z = * * \| * * \| * * \| * *$
$x_{3, 0} = 11 \| 00 \| 00 \| 00$	$z = * * \| 00 \| 00 \| * *$	$z = * * \| * 0 \| * 0 \| * *$	$z = * * \| * * \| * * \| * *$
$x_{3, 1, 1} = 11 \| 00 \| 00 \| 01$	$z = * * \| 00 \| 00 \| * *$	$z = * * \| * 0 \| * 0 \| * *$	$z = * * \| * * \| * * \| * *$
$x_{3, 1, 2} = 11 \| 00 \| 00 \| 11$	$z = * * \| 00 \| 00 \| * *$	$z = * * \| * 0 \| * 0 \| * *$	$z = * * \| * * \| * * \| * *$

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Arseniev, B.; Zacharov, I. Quantum Simulation of Variable-Speed Multidimensional Wave Equations via Clifford-Assisted Pauli Decomposition. Quantum Rep. 2025, 7, 47. https://doi.org/10.3390/quantum7040047

AMA Style

Arseniev B, Zacharov I. Quantum Simulation of Variable-Speed Multidimensional Wave Equations via Clifford-Assisted Pauli Decomposition. Quantum Reports. 2025; 7(4):47. https://doi.org/10.3390/quantum7040047

Chicago/Turabian Style

Arseniev, Boris, and Igor Zacharov. 2025. "Quantum Simulation of Variable-Speed Multidimensional Wave Equations via Clifford-Assisted Pauli Decomposition" Quantum Reports 7, no. 4: 47. https://doi.org/10.3390/quantum7040047

APA Style

Arseniev, B., & Zacharov, I. (2025). Quantum Simulation of Variable-Speed Multidimensional Wave Equations via Clifford-Assisted Pauli Decomposition. Quantum Reports, 7(4), 47. https://doi.org/10.3390/quantum7040047

$z = * * \| 00 \| 00 \| * *$	$W (x, z)$	$\tilde{z} = 1 * \| 00 \| 00 \| * *$	$W (0, \tilde{z}) = {(- 1)}^{(x \cdot z) / 2} DW (x, z) D^{†}$
$00 \| 00 \| 00 \| 00$	$X X \| I I \| I I \| I X$	$10 \| 00 \| 00 \| 00$	$+ Z I \| I I \| I I \| I I$
$00 \| 00 \| 00 \| 10$	$X X \| I I \| I I \| Z X$	$10 \| 00 \| 00 \| 10$	$+ Z I \| I I \| I I \| Z I$
$01 \| 00 \| 00 \| 01$	$X Y \| I I \| I I \| I Y$	$11 \| 00 \| 00 \| 01$	$- Z Z \| I I \| I I \| I Z$
$01 \| 00 \| 00 \| 11$	$X Y \| I I \| I I \| Z Y$	$11 \| 00 \| 00 \| 11$	$- Z Z \| I I \| I I \| Z Z$
$10 \| 00 \| 00 \| 01$	$Y X \| I I \| I I \| I Y$	$10 \| 00 \| 00 \| 01$	$- Z I \| I I \| I I \| I Z$
$10 \| 00 \| 00 \| 11$	$Y X \| I I \| I I \| Z Y$	$10 \| 00 \| 00 \| 11$	$- Z I \| I I \| I I \| Z Z$
$11 \| 00 \| 00 \| 00$	$Y Y \| I I \| I I \| I X$	$11 \| 00 \| 00 \| 00$	$- Z Z \| I I \| I I \| I I$
$11 \| 00 \| 00 \| 10$	$Y Y \| I I \| I I \| Z X$	$11 \| 00 \| 00 \| 10$	$- Z Z \| I I \| I I \| Z I$

$x_{p, 1, j}$	z, $m = 0$	z, $m = 1$	z, $m = 2$
$x_{1, 0} = 01 \| 00 \| 00 \| 00$	$z = * * \| * * \| 00 \| 00$	$z = * * \| * * \| * 0 \| * 0$	$z = * * \| * * \| * * \| * *$
$x_{1, 1, 1} = 01 \| 01 \| 00 \| 00$	$z = * * \| * * \| 00 \| 00$	$z = * * \| * * \| * 0 \| * 0$	$z = * * \| * * \| * * \| * *$
$x_{1, 1, 2} = 01 \| 11 \| 00 \| 00$	$z = * * \| * * \| 00 \| 00$	$z = * * \| * * \| * 0 \| * 0$	$z = * * \| * * \| * * \| * *$
$x_{2, 0} = 10 \| 00 \| 00 \| 00$	$z = * * \| 00 \| * * \| 00$	$z = * * \| * 0 \| * * \| * 0$	$z = * * \| * * \| * * \| * *$
$x_{2, 1, 1} = 10 \| 00 \| 01 \| 00$	$z = * * \| 00 \| * * \| 00$	$z = * * \| * 0 \| * * \| * 0$	$z = * * \| * * \| * * \| * *$
$x_{2, 1, 2} = 10 \| 00 \| 11 \| 00$	$z = * * \| 00 \| * * \| 00$	$z = * * \| * 0 \| * * \| * 0$	$z = * * \| * * \| * * \| * *$
$x_{3, 0} = 11 \| 00 \| 00 \| 00$	$z = * * \| 00 \| 00 \| * *$	$z = * * \| * 0 \| * 0 \| * *$	$z = * * \| * * \| * * \| * *$
$x_{3, 1, 1} = 11 \| 00 \| 00 \| 01$	$z = * * \| 00 \| 00 \| * *$	$z = * * \| * 0 \| * 0 \| * *$	$z = * * \| * * \| * * \| * *$
$x_{3, 1, 2} = 11 \| 00 \| 00 \| 11$	$z = * * \| 00 \| 00 \| * *$	$z = * * \| * 0 \| * 0 \| * *$	$z = * * \| * * \| * * \| * *$

Article Menu

Quantum Simulation of Variable-Speed Multidimensional Wave Equations via Clifford-Assisted Pauli Decomposition

Abstract

1. Introduction

2. Materials and Methods

2.1. Formulation as Schrödinger Equation

2.2. Propagator Implementation Technique

3. Results

3.1. Pauli String Decomposition

3.2. Mutual Diagonalization

3.3. Scaling of Multidimensional Wave Equation Quantum Algorithm

3.4. Scaling of Multidimensional Wave Equation Quantum Algorithm with Block-Model Speed Profile

3.5. A Three-Dimensional Wave Equation in a Numerical Example

4. Discussion

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

Appendix A. Discretization and Vectorization Conventions

Appendix B. Decomposition Example

Appendix C. Application to a 3D Standing Wave Problem

Appendix D. Proofs

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI