Two Unitary Quantum Process Tomography Algorithms Robust to Systematic Errors

François Verdeil; Yannick Deville

doi:10.3390/psf2022005029


Institut de Recherche en Astrophysique et Planétologie (IRAP), Université Paul Sabatier (UPS)-Centre National de la Recherche Scientifique (CNRS)-Centre National d’Études Spatiales (CNES)-Observatoire Midi-Pyrénées (OMP), Université de Toulouse, 31400 Toulouse, France

* Author to whom correspondence should be addressed.

† Presented at the 41st International Workshop on Bayesian Inference and Maximum Entropy Methods in Science and Engineering, Paris, France, 18–22 July 2022.

Phys. Sci. Forum 2022, 5(1), 29; https://doi.org/10.3390/psf2022005029

This article belongs to the Proceedings The 41st International Workshop on Bayesian Inference and Maximum Entropy Methods in Science and Engineering


Abstract

Quantum process tomography (QPT) methods aim at identifying a given quantum process. QPT is a major quantum information processing tool, since it allows one, in particular, to characterize the actual behavior of quantum gates, which are the building blocks of quantum computers. The present paper focuses on the estimation of a unitary process. This class is of particular interest because quantum mechanics postulates that the evolution of any closed quantum system is described by a unitary transformation. Unitary processes have significantly fewer parameters than general quantum processes ($2^{2 n_{qb}}$ vs. $2^{4 n_{qb}} - 2^{2 n_{qb}}$ real independent parameters for $n_{qb}$ qubits). By assuming that the process is unitary, we develop two methods that scale better with the size of the system. In the present paper, we stay as close as possible to the standard setup of QPT: the operator has to prepare copies of different input states. The properties those states have to satisfy in order for our method to achieve QPT are very mild. Therefore, we choose to operate with copies of $2^{n_{qb}}$ initially unknown pure input states. In order to perform QPT without knowing the input states, we perform measurements on half the copies of each state, and let the other half be transformed by the system before measuring them (each copy is only measured once). This setup has the advantage of removing the issue of systematic errors (i.e., errors that are the same on all the copies of a state) entirely, because it does not require the process input to take predefined values. We develop a straightforward analytical solution that first estimates the states from the averaged measurements and then finds the unitary matrix (representing the process) coherent with those estimates by using our analytical solution to an extended version of Wahba’s problem. This estimate may then be used as an initial point for a fine-tuning algorithm that maximizes the likelihood of the measurements. Simulation results show the effectiveness of the proposed methods.

Keywords:

quantum process tomography; unitary process; unitarily-constrained least squares; maximum likelihood

1. Prior Work and Problem Statement

System identification and system inversion are well-known problems, especially for classical systems. These problems are less challenging in the “nonblind”/“supervised” case [1] where the aim is, e.g., to identify the considered system by using the known input and the measured output. In contrast, in the “blind”/“unsupervised” case [2], the input values are unknown and uncontrolled, but some hypotheses are sometimes made on the input signal(s).

For quantum systems, non-blind system identification methods were first introduced in 1997 in [3], which came up with the name quantum process tomography (QPT); see also [4]. They use copies of a set of known pure input states that are transformed by the process. Those transformed states are then measured and estimated using quantum state tomography (QST, which aims at estimating a quantum state from measurements). From there, the parameters of the process can be estimated by what is essentially a regression. This method scales poorly when the number of qubits increases, and is only experimentally feasible for one or two qubits. This is to be expected because, in general, a quantum process has $d^4 - d^2$ independent real parameters ([4], p. 391), with $d$ the dimension of the Hilbert space (for an $n_{qb}$-qubit system, $d = 2^{n_{qb}}$). This method would later be called standard QPT (SQPT), in contrast to non-standard QPT, which uses ancilla qubits and weak measurements (see [5] for a survey). Ref. [6] introduces an SQPT approach that scales better with the number of qubits by assuming that the process is sparse. Like Baldwin et al. in most of [7], we choose to restrict ourselves to unitary processes. This class is of particular interest because the evolution of any closed quantum system is described by a unitary transformation. A unitary process has $d^2$ independent real parameters.

A significant problem of SQPT is the need to precisely prepare the copies of the input states. Any systematic error on the input state has huge consequences for the precision. In 2015, we introduced the blind version of QPT (BQPT) in [8], then detailed it in [9], and more recently in [10]. In those papers, we focused on the tomography of the two-qubit cylindrical-symmetry Heisenberg coupling process. For those algorithms, the operator has to prepare one or several copies of an unknown set of initial states. This requires a preparation procedure to be known and reproducible, so that several copies of each used state may be prepared. This is not a violation of the no-cloning theorem, which does not apply when we ourselves prepared the state that we want to reproduce. This idea removes the issue of systematic errors (with respect to a desired state) during the preparation. The system is identified by processing output measurements associated with $n_s$ different unknown input states going through the system. Generally, we need to perform QST, or at least to estimate some measurement outcome probabilities, for each of the $n_s$ output states. For the approaches of [8,9], this kind of QST requires $n_c$ copies of each considered output state. Therefore, for each one of the $n_s$ states, the same experiment has to be repeated $n_c$ times with the same input state value, for $n_s \times n_c$ input state preparations in total. The most recent paper [10] also proposes “single-preparation BQPT methods” (SBQPT), i.e., methods which can operate with only one instance of each considered input state ($n_c = 1$).

In [11] (2021), we introduced the setup that will be further developed in the current paper. In Ref. [11], we considered copies of a single 2-qubit state (initially unentangled) being transformed by a unitary process and measured at 5 different time delays ($\Delta_t, \ldots, 5\Delta_t$). In the current paper, we consider a setup closer to standard QPT where only two times are considered (see Figure 1). The unit-norm $d$-dimensional vectors $v_1, \ldots, v_d$ represent the initial quantum pure states. Those initial states are considered unknown. We simply assume that they are pure, unentangled, linearly independent, and that at least one of the states is non-orthogonal to every other state (i.e., $\exists j$ such that $\forall k \neq j$, $v_j \not\perp v_k$). These are reasonable hypotheses: as long as the qubits are prepared separately, the states are unentangled, and $d$ random states are linearly independent and mutually non-orthogonal with probability 1 in the $d$-dimensional Hilbert space. After waiting $\Delta_t$, each input state vector $v_j$ is multiplied by the unitary $d \times d$ matrix $M$, thus yielding the output state $w_j = M v_j$.

Figure 1. Considered setup.

We assume that enough types of measurements are performed on copies of all $2d$ states to achieve QST on each state. The present paper does not focus on the measurements performed and the QST algorithm. We simply assume that each state is recovered up to a global phase and a low residual error. For the numerical simulations, we will use the first QST algorithm of [12], which is suited to pure states and has the advantage of only requiring unentangled measurements on each qubit. However, the current paper is not bound to [12] and any pure-state QST algorithm [13,14] can be used. The fact that we perform measurements on the input states means that our algorithm is not blind, but since their values are not imposed by the proposed method, we keep the main advantage of the blind approaches (resilience to systematic error).

Section 2 briefly describes the system states and measurements. Section 3 describes a straightforward method that does not require an initialization and achieves QPT using the estimates of the states. Section 4 describes a method that improves the first estimate by maximizing the likelihood of the measurements. Finally, Section 5 contains some numerical results.

2. States and Measurements

2.1. Considered States

We hereafter consider an $n_{qb}$-qubit system, typically composed of distinguishable spins 1/2. Any pure state $|\varphi\rangle$ of that system is here expressed in the basis defined as the tensor product of the standard bases associated with each qubit. The components of $|\varphi\rangle$ in that basis can be stored in a $d$-element vector $v$, with $d = 2^{n_{qb}}$. The components of $v$ are complex and the norm of $v$ is 1. The global phase of $|\varphi\rangle$ has no physical meaning, so we can assume that the first non-zero component of $v$ is a real strictly positive number. In the rest of the paper, we consider the vector $v$ instead of the state $|\varphi\rangle$.

2.2. Considered Types of Measurements

First focusing on a single qubit, we perform measurements based on the three Pauli operators $\sigma_x$, $\sigma_y$, and $\sigma_z$ [4], related, e.g., to spin 1/2 components along the $X$, $Y$, and $Z$ axes. For each such direction, we define the eigenvector matrix whose first and second columns are the eigenvectors of the considered Pauli operator associated with eigenvalues $+1$ and $-1$, respectively, in the standard basis. These eigenvector matrices may be shown to read:

$$P_X = \frac{1}{\sqrt{2}} \begin{pmatrix} 1 & 1 \\ 1 & -1 \end{pmatrix}, \quad P_Y = \frac{1}{\sqrt{2}} \begin{pmatrix} 1 & 1 \\ i & -i \end{pmatrix}, \quad P_Z = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}. \tag{1}$$

The probabilities of the outcomes $+1$ and $-1$ when performing a measurement for state $v$ along $D \in \{X, Y, Z\}$ are, respectively, the first and second elements of $|P_D^{\dagger} v|^2$, where $|\cdot|^2$ is the element-wise squared modulus and $\cdot^{\dagger}$ is the conjugate transpose.

When considering $n_{qb}$ qubits, we perform the above-defined measurements in parallel for all qubits. Each such type $T$ of measurements corresponds to a given direction $D_m \in \{X, Y, Z\}$ for the $m$-th qubit, for each $m$ in $\{1, \ldots, n_{qb}\}$ ($T = D_1 \ldots D_{n_{qb}}$). For each set of eigenvectors $e_1, \ldots, e_{n_{qb}}$ (each $e_m$ is a column of the matrix $P_{D_m}$ of (1)) and eigenvalues $a_1, \ldots, a_{n_{qb}}$ ($a_m$ is $+1$ if $e_m$ is the first column of $P_{D_m}$ and $-1$ if it is the second), respectively associated with each qubit, the probability that a measurement on $v$ yields these eigenvalues reads:

$$p_{a_1 \ldots a_{n_{qb}}} = \left| (e_1 \otimes \cdots \otimes e_{n_{qb}})^{\dagger} v \right|^2$$

(where $\otimes$ is the tensor product). Those $d$ probabilities (from $p_{+1 \ldots +1}$ to $p_{-1 \ldots -1}$) therefore form the vector $|P_T^{\dagger} v|^2$, where $P_T$ is the eigenvector matrix associated with the measurement along the directions $D_1 \ldots D_{n_{qb}}$ of $T$. It is expressed as the tensor (i.e., Kronecker) product of the one-qubit matrices of (1):

$$P_T = \bigotimes_{m=1}^{n_{qb}} P_{D_m} \quad \text{with } D_m \in \{X, Y, Z\} \text{ s.t. } T = D_1 \ldots D_{n_{qb}}. \tag{2}$$

For example, with $n_{qb} = 2$ qubits, measuring the first one along $D_1 = Z$ and the second one along $D_2 = X$ ($T = ZX$) yields the following eigenvector matrix:

$$P_T = P_Z \otimes P_X = \frac{1}{\sqrt{2}} \begin{pmatrix} 1 & 1 & 0 & 0 \\ 1 & -1 & 0 & 0 \\ 0 & 0 & 1 & 1 \\ 0 & 0 & 1 & -1 \end{pmatrix}.$$

Those measurements are not multi-qubit Pauli measurements (used in (8.149) in [4]) because the latter only have 2 outcomes whereas the former have $d$ outcomes (they are the concatenations of $n_{qb}$ 2-outcome measurements). In the rest of the paper, this type of measurement will be referred to as a string of $X$, $Y$, and $Z$ (in this example, $T = ZX$).
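The construction of Equations (1) and (2) can be sketched numerically as follows. This is a minimal illustration in Python with NumPy, not the authors' code: the function names `eigenvector_matrix` and `outcome_probabilities` and the example state are assumptions introduced here.

```python
import numpy as np

# One-qubit eigenvector matrices of Eq. (1); columns are the +1 and -1 eigenvectors.
P = {
    "X": np.array([[1, 1], [1, -1]], dtype=complex) / np.sqrt(2),
    "Y": np.array([[1, 1], [1j, -1j]], dtype=complex) / np.sqrt(2),
    "Z": np.eye(2, dtype=complex),
}

def eigenvector_matrix(measurement_type):
    """Kronecker product of Eq. (2) for a string such as "ZX" (hypothetical helper)."""
    P_T = np.array([[1.0 + 0j]])
    for direction in measurement_type:
        P_T = np.kron(P_T, P[direction])
    return P_T

def outcome_probabilities(v, measurement_type):
    """Vector |P_T^dagger v|^2 holding the d outcome probabilities."""
    P_T = eigenvector_matrix(measurement_type)
    return np.abs(P_T.conj().T @ v) ** 2

# Illustrative 2-qubit product state |0> (x) |+> (an assumed example, not from the paper).
v = np.kron(np.array([1, 0]), np.array([1, 1]) / np.sqrt(2)).astype(complex)
probs = outcome_probabilities(v, "ZX")  # first qubit along Z, second along X
```

For this state the first qubit is an eigenvector of $\sigma_z$ and the second an eigenvector of $\sigma_x$, both with eigenvalue $+1$, so all the probability mass falls on the first outcome.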

For $n_{qb}$ qubits, there are $3^{n_{qb}}$ of those measurements. Since we are dealing with pure states, we can work with only 4 types of measurements: $T_1 = Z \ldots Z$, $T_2 = Y \ldots Y$, $T_3 = X \ldots X$, $T_4 = XY \ldots$ ($T_4$ is $X$ on every odd-numbered qubit and $Y$ on every even-numbered one; all the others apply the same measurement type to all qubits). In Ref. [12], Sections 3 and 5, we explain how to perform QST with those measurements. We will not mention it again in the rest of the paper, but if $n_{qb} = 1$ we perform 3 types of measurements instead of 4 (along directions $X$, $Y$, and $Z$), as $T_4 = T_3 = X$.

Those measurements are performed on $\{v_j\}_{j \in \{1, \ldots, d\}}$ and $\{w_j\}_{j \in \{1, \ldots, d\}}$. In total, $2d$ states are measured with 4 types of measurements. To estimate probabilities, each measurement is performed a given number of times, which we call $n_c$; the total number of measurements performed is therefore $8 d n_c$. For each one of the $8d$ distinct measurements, the numbers of times each one of the $d$ outcomes was observed are stored in the $d$-dimensional vector $n_{j,k,\ell}$, where $j \in \{1, \ldots, d\}$ is the index of the measured state, $k \in \{1, 2, 3, 4\}$ defines the type of the measurement, and $\ell \in \{0, 1\}$ is 0 if $v_j$ is measured and 1 if $w_j$ is. Thus, $n_{j,k,\ell}$ contains the measurement counts for the state $M^{\ell} v_j$ along direction $T_k$. The expected value of $n_{j,k,\ell}$ is $n_c |P_{T_k}^{\dagger} M^{\ell} v_j|^2$.
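To make the counting model concrete, the sketch below draws one count vector for a single measurement type. It assumes the $n_c$ repetitions are independent, so that the counts follow a multinomial distribution; the helper name `simulate_counts` and the probability vector are illustrative choices, not taken from the paper.

```python
import numpy as np

def simulate_counts(probabilities, n_c, rng):
    """Draw the d outcome counts of n_c independent repetitions (hypothetical helper)."""
    p = np.asarray(probabilities, dtype=float)
    p = p / p.sum()  # guard against rounding so that the probabilities sum to 1
    return rng.multinomial(n_c, p)

rng = np.random.default_rng(0)
d, n_c = 4, 625                          # 2 qubits, 625 repetitions per measurement
p = np.array([0.5, 0.25, 0.125, 0.125])  # assumed outcome probabilities |P_T^dagger v|^2
counts = simulate_counts(p, n_c, rng)    # one vector n_{j,k,l}; its expected value is n_c * p
total_measurements = 8 * d * n_c         # 2d states times 4 measurement types times n_c
```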

3. QST-Based Solution

3.1. Main Idea

We assume that QST is performed properly for the states of Figure 1. It yields:

$$\hat{v}_j = v_j e^{i \xi_j^{v}} + \varepsilon_j^{QST} \;\; \forall j \in \{1, \ldots, d\} \quad \text{and} \quad \hat{w}_j = w_j e^{i \xi_j^{w}} + \varepsilon_{d+j}^{QST} \;\; \forall j \in \{1, \ldots, d\}$$

where $\xi_j^{v}$ and $\xi_j^{w}$ are unknown phases and $\varepsilon_j^{QST}$ is the residual error, such that $E(\|\varepsilon_j^{QST}\|_2) \to 0$ as $n_c \to +\infty$ ($E$ is the expected value). For the rest of this section, we consider $\varepsilon_j^{QST} = 0$ unless stated otherwise. In Section 2.1, we stated that the global phases of the states do not matter. This is true if the states are considered independently, and this is the reason why the QST cannot recover the global phase. However, when the states are considered together (in order to find $M$), the differences between the global phases of the different states matter.

We know that $w_j = M v_j$ $\forall j \in \{1, \ldots, d\}$; therefore, with $\xi_j = \xi_j^{v} - \xi_j^{w}$ $\forall j$, we have $e^{i \xi_j} \hat{w}_j = M \hat{v}_j$ $\forall j \in \{1, \ldots, d\}$. Changing $M$ to $M e^{-i \xi_1}$ and $\xi_j$ to $\xi_j - \xi_1$ $\forall j \in \{1, \ldots, d\}$ does not change the equality, so we can also assume $\xi_1 = 0$ and accept that $M$ can only be recovered up to a global phase.

In the next section, we explain how to estimate the other phases $e^{i \hat{\xi}_j}$ $\forall j \in \{2, \ldots, d\}$. From that, we can define $\tilde{w}_j = \hat{w}_j e^{i \hat{\xi}_j}$, with which an estimate of $M$ can easily be found as the problem becomes:

$$\tilde{w}_j = M \hat{v}_j \quad \forall j \in \{1, \ldots, d\}. \tag{3}$$

The matrix $\hat{M} = [\tilde{w}_1, \ldots, \tilde{w}_d] [\hat{v}_1, \ldots, \hat{v}_d]^{-1}$ works as a solution. However, it is generally not a unitary solution because of the QST errors. Finding $\hat{M} \in U_3(\mathbb{R})$ that is the least-squares solution of $\hat{a}_j = M \hat{b}_j$ $\forall j \in \{1, \ldots, n\}$ with $\hat{a}_j, \hat{b}_j \in \mathbb{R}^3$ is a well-known problem in the aerospace community. It is called Wahba’s problem, after Wahba, who first posed it in 1965 [15]. We have adapted its solution for $M \in U_d(\mathbb{C})$ (details will be provided in a future paper). This yields:

$$B = [\tilde{w}_1, \ldots, \tilde{w}_d] [\hat{v}_1, \ldots, \hat{v}_d]^{\dagger} \;\longrightarrow\; B = U S V^{\dagger} \;\longrightarrow\; \hat{M}_{LS} = U V^{\dagger} \tag{4}$$

where $U S V^{\dagger}$ is the singular value decomposition of $B$. We showed that this solution is optimal in the least-squares (LS) sense. The solution is unique if, and only if, both $[\hat{v}_1, \ldots, \hat{v}_d]$ and $[\tilde{w}_1, \ldots, \tilde{w}_d]$ are of full rank. This QPT method can be extended to any larger number of input states, but with fewer than $d$ states the solution is not unique.
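Equation (4) translates almost directly into code. The following is a minimal sketch assuming the phases have already been corrected (i.e., the columns of `W_tilde` are the $\tilde{w}_j$); the function name `unitary_least_squares` and the noiseless synthetic data are assumptions made for illustration.

```python
import numpy as np

def unitary_least_squares(W_tilde, V_hat):
    """Unitary LS estimate of Eq. (4): B = W V^dagger, B = U S Vh, M_LS = U Vh."""
    B = W_tilde @ V_hat.conj().T
    U, _, Vh = np.linalg.svd(B)  # NumPy returns Vh = V^dagger directly
    return U @ Vh

# Synthetic noiseless data (assumed for the example): a random unitary and d random unit vectors.
rng = np.random.default_rng(1)
d = 4
M_true, _ = np.linalg.qr(rng.standard_normal((d, d)) + 1j * rng.standard_normal((d, d)))
V = rng.standard_normal((d, d)) + 1j * rng.standard_normal((d, d))
V /= np.linalg.norm(V, axis=0)   # normalize each column (state) to unit norm
W = M_true @ V                   # output states w_j = M v_j, phases already aligned
M_ls = unitary_least_squares(W, V)
```

With noiseless, full-rank inputs the estimate coincides with the true matrix; with QST errors it returns the closest unitary matrix in the least-squares sense rather than the generally non-unitary product with the inverse.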

3.2. Phase Recovery

The aim of the current section is to find $e^{i \hat{\xi}_j}$ $\forall j \in \{2, \ldots, d\}$, given the vectors $\hat{v}_j, \hat{w}_j$ $\forall j \in \{1, \ldots, d\}$, such that there exists a unitary matrix $M$ that realizes (3), with $\xi_1$ assumed to be 0.

The basic idea is to use the fact that $e^{i \xi_{j_1}} \hat{w}_{j_1} + e^{i \xi_{j_2}} \hat{w}_{j_2} = M (\hat{v}_{j_1} + \hat{v}_{j_2})$ $\forall j_1, j_2 \in \{1, \ldots, d\}$ and that $M$, being unitary, does not change the norm. Therefore, $\xi_{j_1, j_2} = \xi_{j_2} - \xi_{j_1}$ is subject to:

$$\begin{aligned} & \| \hat{w}_{j_1} + e^{i \xi_{j_1, j_2}} \hat{w}_{j_2} \|_2^2 = \| \hat{v}_{j_1} + \hat{v}_{j_2} \|_2^2 \\ \Leftrightarrow\; & \| \hat{w}_{j_1} \|_2^2 + \| \hat{w}_{j_2} \|_2^2 + r_{j_1, j_2} \cos(\xi_{j_1, j_2}) + i_{j_1, j_2} \sin(\xi_{j_1, j_2}) = \| \hat{v}_{j_1} + \hat{v}_{j_2} \|_2^2 \end{aligned} \tag{5}$$

with $r_{j_1, j_2}$ and $i_{j_1, j_2}$ the real and imaginary parts of $2 \hat{w}_{j_2}^{\dagger} \hat{w}_{j_1}$, respectively. Equation (5) is solvable if, and only if, $\hat{w}_{j_1}$ and $\hat{w}_{j_2}$ are not orthogonal. By writing $\cos(\xi_{j_1, j_2}) = \frac{1 - t^2}{1 + t^2}$ and $\sin(\xi_{j_1, j_2}) = \frac{2t}{1 + t^2}$ with $t = \tan\left(\frac{\xi_{j_1, j_2}}{2}\right)$, (5) becomes a quadratic equation (when both sides are multiplied by $1 + t^2$) with two real solutions for $t$, corresponding to two solutions for $\xi_{j_1, j_2}$ that we call $\xi_{j_1, j_2}^{1}$ and $\xi_{j_1, j_2}^{2}$ (they can have the same value). It is numerically possible to have no real solution, but this never happens if there are no QST errors. If there are no real solutions, we consider $\xi_{j_1, j_2}^{1}$ and $\xi_{j_1, j_2}^{2}$ both set to the real part of the complex solutions.
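The quadratic obtained from (5) can be written out explicitly. Writing $A = \|\hat{w}_{j_1}\|_2^2 + \|\hat{w}_{j_2}\|_2^2$ and $C = \|\hat{v}_{j_1} + \hat{v}_{j_2}\|_2^2$, multiplying (5) by $1 + t^2$ gives $(A - C - r)\,t^2 + 2 i_{j_1,j_2}\, t + (A - C + r) = 0$. The sketch below solves it with `numpy.roots`; the function name, the synthetic data and the handling of complex roots (taking the real part of $t$) are assumptions of this illustration, and it is not the more robust algorithm the authors mention.

```python
import numpy as np

def phase_candidates(v1, v2, w1, w2):
    """Candidate values of xi_{j1,j2} solving Eq. (5) through t = tan(xi / 2)."""
    z = 2 * np.vdot(w2, w1)                  # 2 w2^dagger w1 (vdot conjugates its first argument)
    r, im = z.real, z.imag
    A = np.linalg.norm(w1) ** 2 + np.linalg.norm(w2) ** 2
    C = np.linalg.norm(v1 + v2) ** 2
    # (A - C - r) t^2 + 2 im t + (A - C + r) = 0; if the leading coefficient
    # vanishes (xi = pi is a solution), np.roots returns a single root.
    roots = np.roots([A - C - r, 2 * im, A - C + r])
    return [float(2 * np.arctan(np.real(t))) for t in roots]

# Noiseless synthetic example (assumed): xi_true is the phase difference to recover.
rng = np.random.default_rng(2)
d, xi_true = 4, 0.7
M, _ = np.linalg.qr(rng.standard_normal((d, d)) + 1j * rng.standard_normal((d, d)))
v1, v2 = (x / np.linalg.norm(x)
          for x in rng.standard_normal((2, d)) + 1j * rng.standard_normal((2, d)))
w1 = M @ v1
w2 = np.exp(-1j * xi_true) * (M @ v2)        # so that w1 + e^{i xi_true} w2 = M (v1 + v2)
candidates = phase_candidates(v1, v2, w1, w2)
```

One of the two returned candidates matches the true phase difference; the other is the spurious solution that the third pair of vectors is used to discard.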

In order to choose between $\xi_{j_1, j_2}^{1}$ and $\xi_{j_1, j_2}^{2}$, we have to consider a third pair of vectors: $\{\hat{v}_{j_3}, \hat{w}_{j_3}\}$. Solving (5) for the 3 possible pairs of indices ($\{j_1, j_2\}$, $\{j_1, j_3\}$, $\{j_2, j_3\}$) gives us $2^3 = 8$ possibilities for $\xi_{j_1, j_2}$, $\xi_{j_1, j_3}$, $\xi_{j_2, j_3}$. However, by definition, $\xi_{j_2, j_3} = \xi_{j_1, j_3} - \xi_{j_1, j_2}$, and it may be hoped that only one of the 8 possibilities satisfies this. We keep the solution that comes the closest.
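The selection among the 8 combinations can be sketched as a small exhaustive search. The helper names and the candidate values below are hypothetical, and measuring closeness by the wrapped angular residual is an assumption of this example (the paper only states that the closest solution is kept).

```python
import itertools
import numpy as np

def wrap(angle):
    """Wrap an angle to the interval (-pi, pi]."""
    return np.angle(np.exp(1j * angle))

def select_consistent_phases(cands_12, cands_13, cands_23):
    """Pick the triple that best satisfies xi_23 = xi_13 - xi_12 among all combinations."""
    return min(
        itertools.product(cands_12, cands_13, cands_23),
        key=lambda c: abs(wrap(c[2] - (c[1] - c[0]))),
    )

# Made-up candidate pairs: only (0.3, 1.0, 0.7) satisfies 0.7 = 1.0 - 0.3.
best = select_consistent_phases([0.3, -1.2], [1.0, 2.5], [0.7, -0.4])
```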

We can apply this method for $(j_1, j_2, j_3) \in \{(1, 2, 3), (3, 4, 5), \ldots, (d - 2, d - 1, d)\}$. We would thus know all the differences between the phases and, since $\xi_1$ is 0, we would know all the phases.

In practice, doing this would work as long as one of the $v_j$ is not orthogonal to all the others (otherwise (5) is not solvable for enough indices $j_1, j_2$), but this would not be robust to a realistic QST error. The actual algorithm we use will be described in a future longer paper. It is based on the same idea: finding the two solutions of (5) for all $\{j_1, j_2\}$ indices. We improve the robustness by considering more than 3 pairs of well-chosen indices.

4. Fine Tuning

4.1. Problem Statement

Section 3 describes a method to achieve QPT using the results of the QST on every state. The current section details a different approach that requires an initial estimate of $M$ (we will use $\hat{M}_{LS}$ from (4)) and finds the unitary matrix $\hat{M}_{ML}$ and initial states $\hat{V}_{ML} = [\hat{v}_1, \ldots, \hat{v}_d]$ that maximize the likelihood of the measurements. Formally, $(\hat{M}_{ML}, \hat{V}_{ML}) = \arg\max_{M, V} \mathcal{L}(M, V, \mathcal{M})$, where $\mathcal{M} = \{n_{j,k,\ell}\}_{j,k,\ell}$ represents the measurement results and $\mathcal{L}$ is the log-likelihood, which we maximize in order to maximize the likelihood. The problem is actually simpler if we perform the maximization successively, i.e., find the best $V$ for each $M$ of which we compute the likelihood, $\hat{M}_{ML} = \arg\max_{M} \left( \max_{V} \mathcal{L}(M, V, \mathcal{M}) \right)$, because optimizing $V$ knowing $M$ (i.e., computing $\max_{V} \mathcal{L}(M, V, \mathcal{M})$) can be performed independently on all the $v_j$: $\max_{V} \mathcal{L}(M, V, \mathcal{M}) = \sum_{j=1}^{d} \max_{v_j} \left( \mathcal{L}(v_j, \mathcal{M}_{v_j}) + \mathcal{L}(M v_j, \mathcal{M}_{w_j}) \right)$, where $\mathcal{M}_{v_j} = \{n_{j,k,0}\}_{k}$ and $\mathcal{M}_{w_j} = \{n_{j,k,1}\}_{k}$ are the measurements performed on $v_j$ and $w_j$, respectively. This is the case because the $\{\mathcal{M}_{v_1}, \mathcal{M}_{w_1}, \ldots, \mathcal{M}_{v_d}, \mathcal{M}_{w_d}\}$ are statistically independent and involve different arguments to be maximized for different $j$. Considering this, the problem becomes:

$$\hat{M}_{ML} = \arg\max_{M} \sum_{j=1}^{d} \max_{v_j} \left( \mathcal{L}(v_j, \mathcal{M}_{v_j}) + \mathcal{L}(M v_j, \mathcal{M}_{w_j}) \right). \tag{6}$$

In order to solve (6), we first need to be able to compute the likelihood of the measurements. Since most gradient-based optimization algorithms can only be performed with a real-number vector as argument, we also need to find real-number parametrizations for $M$ and $v_j$. Those two points are the focuses of the following two subsections.

4.2. Statistical Model for the Measurements

In [16], the formula for the likelihood of samples from multiple-outcome measurements is given (albeit for a mixed state represented by a density matrix, which we would have to replace by $v v^{\dagger}$ or $w w^{\dagger}$). Once we remove additive constants, the log-likelihood boils down to $\mathcal{L}(n_1^{o}, \ldots, n_d^{o}) = \sum_{m=1}^{d} n_m^{o} \log(p_m)$, where $p_m$ is the theoretical probability of the $m$-th outcome and $n_m^{o}$ is the number of times the $m$-th outcome has been measured. If the measurement whose likelihood we want to compute has $P_{T_k}$ as eigenvector matrix ($k \in \{1, 2, 3, 4\}$) and is performed on $v_j$, then $[p_1 \ldots p_d]^{T} = |P_{T_k}^{\dagger} v_j|^2$ and $[n_1^{o} \ldots n_d^{o}]^{T} = n_{j,k,0}$ (see the definitions of $n_{j,k,\ell}$ and $P_{T_k}$ in Section 2.2; $\cdot^{T}$ stands for transpose). If, instead of $v_j$, we measure $w_j$, then $[p_1 \ldots p_d]^{T} = |P_{T_k}^{\dagger} w_j|^2$ and $[n_1^{o} \ldots n_d^{o}]^{T} = n_{j,k,1}$. Let us rewrite $\mathcal{L}$ using the notation adapted to our measurements: $\mathcal{L}(M^{\ell} v_j, n_{j,k,\ell}) = n_{j,k,\ell}^{T} \log(|P_{T_k}^{\dagger} M^{\ell} v_j|^2)$ ($\ell$ is either 0 or 1, so $M^{\ell} v_j$ is either $v_j$ or $w_j$). We can replace $\mathcal{L}$ in (6) by its expression (knowing $\mathcal{M}_{M^{\ell} v_j} = \{n_{j,1,\ell}, \ldots, n_{j,4,\ell}\}$); this yields:

$$\hat{M}_{ML} = \arg\max_{M} \sum_{j=1}^{d} \max_{v_j} \sum_{k=1}^{4} \sum_{\ell=0}^{1} n_{j,k,\ell}^{T} \log\left( |P_{T_k}^{\dagger} M^{\ell} v_j|^2 \right). \tag{7}$$
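A single term of the sum in (7) can be evaluated as in the sketch below. The function name `log_likelihood` and the small constant `eps`, added inside the logarithm to avoid $\log(0)$ for outcomes of zero probability, are assumptions of this example and not part of the paper's formula.

```python
import numpy as np

def log_likelihood(counts, P_T, state, eps=1e-12):
    """One term n^T log(|P_T^dagger state|^2) of Eq. (7); eps is an assumed numerical guard."""
    probs = np.abs(P_T.conj().T @ state) ** 2
    return float(np.asarray(counts, dtype=float) @ np.log(probs + eps))

# One-qubit illustration with a Z measurement (P_Z is the identity) and made-up counts.
P_Z = np.eye(2, dtype=complex)
counts = np.array([90, 10])
state_a = np.array([np.sqrt(0.9), np.sqrt(0.1)], dtype=complex)  # predicts (0.9, 0.1)
state_b = np.array([1, 1], dtype=complex) / np.sqrt(2)           # predicts (0.5, 0.5)
ll_a = log_likelihood(counts, P_Z, state_a)
ll_b = log_likelihood(counts, P_Z, state_b)
```

The state whose predicted probabilities match the observed frequencies obtains the larger log-likelihood, which is the property the fine-tuning step exploits.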

4.3. Parametrization of the Arguments

For a given $j \in \{1, \ldots, d\}$, $v_j$ represents an unentangled state. By definition, it can be decomposed as a tensor product of $n_{qb}$ 1-qubit states: $v_j = q_{j,1} \otimes \cdots \otimes q_{j,n_{qb}}$. Each $q_{j,h}$, $h \in \{1, \ldots, n_{qb}\}$, has 2 real parameters, $r_{j,h}$ and $\theta_{j,h}$: $q_{j,h} = \begin{bmatrix} r_{j,h} & \sqrt{1 - r_{j,h}^2}\, e^{i \theta_{j,h}} \end{bmatrix}^{T}$. Therefore, $v_j$ can be parameterized with $2 n_{qb}$ real parameters: $v_j = f_{v_j}(r_{j,1}, \theta_{j,1}, \ldots, r_{j,n_{qb}}, \theta_{j,n_{qb}})$.
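The mapping $f_{v_j}$ can be sketched as follows, assuming the parameters are passed in the interleaved order $(r_{j,1}, \theta_{j,1}, \ldots)$ used above and that each $r_{j,h}$ lies in $[0, 1]$; the function name `product_state` is an assumption of this example.

```python
import numpy as np

def product_state(params):
    """Build v_j from (r_1, theta_1, ..., r_n, theta_n), with each r in [0, 1]."""
    v = np.array([1.0 + 0j])
    for r, theta in zip(params[0::2], params[1::2]):
        q = np.array([r, np.sqrt(1 - r ** 2) * np.exp(1j * theta)])  # one-qubit state q_{j,h}
        v = np.kron(v, q)
    return v

# Two-qubit example: (r, theta) = (1, 0) gives |0>, (1/sqrt(2), 0) gives |+>.
v = product_state([1.0, 0.0, 1 / np.sqrt(2), 0.0])
```

Each one-qubit factor has unit norm by construction, so the resulting $d$-element vector is normalized without any further constraint in the optimization.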

$M$ is a unitary matrix. Hence, it can be shown that there exists a Hermitian matrix $H$ such that $M = \exp(iH)$, where $\exp$ is the matrix exponential. Therefore, $M$ can be parameterized with $d^2$ real parameters $(h_1, \ldots, h_{d^2})$, where $h_1, \ldots, h_{d^2}$ is the parametrization of $H$ starting with the $d(d+1)/2$ real parts of the components that are on or above the diagonal ($H_{1,1}, H_{1,2}, \ldots, H_{d,d}$, where $H_{i_r, i_c}$ is the element on row $i_r$ and column $i_c$ of $H$) and ending with the $d(d-1)/2$ imaginary parts of the components that are strictly above the diagonal ($H_{1,2}, H_{1,3}, \ldots, H_{d-1,d}$). Accounting for the fact that $M$ can only be recovered up to a global phase, we can assume that $h_1$, corresponding to the top left element of $H$, is 0 and remove it from the parametrization. Indeed, $H - h_1 I_d$ ($I_d$ is the $d \times d$ identity matrix) has a 0 for its top left element, and $\exp(iH)$ and $\exp(iH - i h_1 I_d)$ only differ by a global phase. Therefore, as far as the optimization algorithm is concerned, $M$ has $d^2 - 1$ real parameters: $M = f_M(h_2, \ldots, h_{d^2})$.
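A possible implementation of $f_M$ is sketched below. It assumes a row-major ordering of the upper-triangular entries and computes the matrix exponential through the eigendecomposition of $H$ (valid because $H$ is Hermitian); both function names are hypothetical and the authors' own implementation may differ.

```python
import numpy as np

def hermitian_from_params(h, d):
    """H from d*d reals: d(d+1)/2 real parts (on/above diagonal), then d(d-1)/2 imaginary parts."""
    h = np.asarray(h, dtype=float)
    n_real = d * (d + 1) // 2
    H = np.zeros((d, d), dtype=complex)
    H[np.triu_indices(d)] = h[:n_real]             # real parts, row-major order
    H[np.triu_indices(d, k=1)] += 1j * h[n_real:]  # imaginary parts, strictly above diagonal
    return H + np.triu(H, k=1).conj().T            # fill the lower triangle by Hermitian symmetry

def unitary_from_params(h_free, d):
    """M = exp(iH) from (h_2, ..., h_{d^2}), with h_1 fixed to 0."""
    H = hermitian_from_params(np.concatenate(([0.0], h_free)), d)
    eigvals, Q = np.linalg.eigh(H)                 # H = Q diag(eigvals) Q^dagger
    return (Q * np.exp(1j * eigvals)) @ Q.conj().T # exp(iH) = Q diag(e^{i eigvals}) Q^dagger

rng = np.random.default_rng(3)
M_example = unitary_from_params(rng.standard_normal(3), 2)  # d = 2, so d^2 - 1 = 3 parameters
```

Any real parameter vector yields a unitary matrix, so the optimizer can run unconstrained.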

4.4. Optimization

In order to find the real parameters of $M$ that solve (7), we use the BFGS quasi-Newton algorithm [17], initialized at the $d^2 - 1$ parameters that yield $\hat{M}_{LS}$ up to a global phase. This algorithm is implemented with the fminunc Matlab function; we provide it with the analytical expressions of the gradients of the criterion in order to make it run faster. At each step of the optimization of $M$, $d$ optimizations are performed on $2 n_{qb}$ arguments in order to find the $\{v_j\}$ (to solve the max inside the first sum in (7)). Those optimizations are also performed using the BFGS quasi-Newton algorithm with the analytical gradient provided. The latter algorithm is initialized at the real parameters of the unentangled state that is the closest to $\frac{1}{2} \hat{v}_j + \frac{1}{2} M^{-1} \tilde{w}_j$, where $M^{-1}$ is the inverse of the $M$ at the current state of the optimization (the $M$ whose likelihood we are computing in order to maximize it), $j$ is the index of the $v_j$ we are optimizing, and $\hat{v}_j$ and $\tilde{w}_j$ are defined in Section 3.1. The optimization algorithms stop when the norm of the difference between the arguments at two successive iterations is lower than $10^{-30}$. Moreover, the optimization of $M$ stops after 700 iterations if the previous criterion is not met. For 3 qubits or fewer, the optimization of $M$ always stops before the 700 iterations. For 4 and 5 qubits, this is not always the case, but the BFGS algorithm decreases the criterion at every step, so even if the algorithm has not properly converged, the final estimate $\hat{M}_{ML}$ is still more likely than all the previous iterates and, in particular, more likely than $\hat{M}_{LS}$.

5. Numerical Results

Our algorithm is tested by simulating a random matrix $M_{true}$, which is a random complex matrix (composed of independent realizations of $X_1 + i X_2$ with $X_1$ and $X_2$ independent standard normal variables) to which the Gram–Schmidt process has been applied in order to make it unitary. The states $\{v_j\}$ are generated randomly by applying $f_{v_j}$ (defined in Section 4.3) to $2 n_{qb}$ random parameters generated uniformly on the intervals on which they are defined.
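The generation of $M_{true}$ can be sketched as below. The example uses the modified Gram–Schmidt variant, chosen here for numerical stability; the paper only says that the Gram–Schmidt process is applied, so this choice and the function name `random_unitary` are assumptions.

```python
import numpy as np

def random_unitary(d, rng):
    """Orthonormalize the columns of a complex Gaussian matrix by modified Gram-Schmidt."""
    A = rng.standard_normal((d, d)) + 1j * rng.standard_normal((d, d))  # entries X1 + i X2
    Q = np.zeros((d, d), dtype=complex)
    for k in range(d):
        q = A[:, k].copy()
        for m in range(k):
            q -= np.vdot(Q[:, m], q) * Q[:, m]  # remove the component along column m
        Q[:, k] = q / np.linalg.norm(q)
    return Q

M_true = random_unitary(16, np.random.default_rng(4))  # 4 qubits, d = 16
```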

We then simulate the associated measurements and apply the algorithms of Section 3 and Section 4 in order to obtain the estimates $\hat{M}_{LS}$ and $\hat{M}_{ML}$. With $n_c = 10{,}000$, the computation time on one thread of an Intel Xeon Silver 4214 2.4-GHz processor is much shorter for $\hat{M}_{LS}$ (around 30 s for 5 qubits and less than 10 s for fewer qubits) than for $\hat{M}_{ML}$ (around 7 h for 5 qubits, 15 min for 4 qubits, and less than a minute for fewer qubits).

We choose to perform further tests with 4 qubits. 500 matrices $M_{true}$ are generated, and the associated $\hat{M}_{LS}$ and $\hat{M}_{ML}$ are computed with $n_c = 625$ and $n_c = 2500$ for 2 qubits and $n_c = 2500$ and $n_c = 10{,}000$ for 4 qubits. The associated numbers of copies of states to be measured are $8d$ times greater, so 20,000 and 80,000 for 2 qubits and 320,000 and 1,280,000 for 4 qubits. We also compute $\hat{M}_{ref}$, which is the result of the likelihood maximization initialized at $M_{true}$ (only available in simulation) instead of $\hat{M}_{LS}$.
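As a quick check of the figures quoted above, and assuming the $8 d n_c$ total of Section 2.2 with $d = 2^{n_{qb}}$, the copy counts work out as follows:

$$\begin{aligned} n_{qb} = 2,\ d = 4: &\quad 8 \times 4 \times 625 = 20{,}000, \qquad 8 \times 4 \times 2500 = 80{,}000 \\ n_{qb} = 4,\ d = 16: &\quad 8 \times 16 \times 2500 = 320{,}000, \qquad 8 \times 16 \times 10{,}000 = 1{,}280{,}000 \end{aligned}$$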

The metric we use in order to quantify the proximity between $M_{true}$ and its estimate $\hat{M}$ (either $\hat{M}_{LS}$ or $\hat{M}_{ML}$) is $\frac{1}{\sqrt{2d}} \| M_{true} - \hat{M} e^{i \phi} \|$, where $\phi$ is the angle that minimizes this distance (it accounts for the fact that $M_{true}$ can only be recovered up to a global phase) and $\| \cdot \|$ is the Frobenius norm. This metric is between 0 (if $\hat{M}$ and $M_{true}$ are the same up to a global phase) and 1 (if they are orthogonal with respect to the Hilbert–Schmidt inner product).
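The phase $\phi$ need not be found by a numerical search. Assuming $\phi$ is chosen to make the distance as small as possible (which is what makes the metric 0 for matrices equal up to a global phase), expanding the squared Frobenius norm gives $\min_{\phi} \| M_{true} - \hat{M} e^{i\phi} \|^2 = \| M_{true} \|^2 + \| \hat{M} \|^2 - 2\, |\mathrm{tr}(M_{true}^{\dagger} \hat{M})|$. The sketch below uses this closed form; the function name `qpt_error` is an assumption.

```python
import numpy as np

def qpt_error(M_true, M_hat):
    """Phase-optimized distance (1/sqrt(2d)) * min_phi ||M_true - M_hat e^{i phi}||_F."""
    d = M_true.shape[0]
    overlap = np.trace(M_true.conj().T @ M_hat)  # Hilbert-Schmidt inner product
    sq = np.linalg.norm(M_true) ** 2 + np.linalg.norm(M_hat) ** 2 - 2 * abs(overlap)
    return float(np.sqrt(max(sq, 0.0) / (2 * d)))  # clip tiny negative rounding errors

I2 = np.eye(2, dtype=complex)
err_same = qpt_error(I2, np.exp(0.4j) * I2)                     # equal up to a global phase
err_orth = qpt_error(I2, np.diag([1.0, -1.0]).astype(complex))  # zero Hilbert-Schmidt overlap
```

For unitary matrices both squared norms equal $d$, so the metric reduces to $\sqrt{1 - |\mathrm{tr}(M_{true}^{\dagger} \hat{M})| / d}$, which makes the bounds 0 and 1 explicit.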

The empirical cumulative distribution function (cdf) of our metric (called error) is displayed in Figure 2. We note that:

Figure 2. Empirical cdf of the errors of the 3 maximum likelihood estimators with 4 qubits.

$\hat{M}_{ML}$ is very similar to its reference $\hat{M}_{ref}$ (especially with $n_c = 10{,}000$). This means that the likelihood algorithm converges towards the global optimum (so 700 iterations are enough and $\hat{M}_{LS}$ is a good enough initial point).
$\hat{M}_{LS}$ is worse than $\hat{M}_{ML}$. This means that the costly likelihood maximization is not performed in vain.
The errors with $n_c = 10{,}000$ are roughly half the errors with $n_c = 2500$. So we are in the classic case where the error is inversely proportional to the square root of the number of measurements. Additionally, the same graph for any $n_c > 2500$ could be deduced from Figure 2.

6. Conclusions and Future Work

In this paper, we introduced two QPT methods that do not require the initial states to be set to predetermined values, but work with randomly selected initial states that are measured beforehand. The first method uses QST to estimate the input and output states, lifts the phase ambiguities and finds the unitary matrix which fits the estimated states the best. The second method finds the unitary matrix that is the most likely according to the statistical distribution of the measurements. The latter method is more precise but slower and uses the result of the first method as an initialization.

We intend to perform more extensive tests and compare our method to non-blind methods such as [7]. We also want to link this algorithm with that of [11] by considering fewer initial states and more time delays than in Figure 1.

Author Contributions

The work described in the paper was performed during the PhD of François Verdeil under the supervision of Y.D. (PhD director). Both authors exchanged ideas to create the algorithms. Y.D. had the idea to take part in the MaxEnt22 conference. F.V. wrote the first draft of the paper and the code. Y.D.’s input and experience were instrumental in improving the paper. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable here.

Acknowledgments

The authors would like to thank Alain Deville for helpful discussions.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Ljung, L. System Identification: Theory for the User; PTR Prentice Hall: Upper Saddle River, NJ, USA, 1999; p. 540.
2. Abed-Meraim, K.; Qiu, W.; Hua, Y. Blind system identification. Proc. IEEE 1997, 85, 1310–1322.
3. Nielsen, M.A.; Chuang, I.L. Prescription for experimental determination of the dynamics of a quantum black box. J. Mod. Opt. 2018, 44, 2455–2467.
4. Nielsen, M.A.; Chuang, I.L. Quantum Computation and Quantum Information; Cambridge University Press: Cambridge, UK, 2000.
5. Mohseni, M.; Rezakhani, A.T.; Lidar, D.A. Quantum-process tomography: Resource analysis of different strategies. Phys. Rev. A 2008, 77, 032322.
6. Shabani, A.; Kosut, R.; Mohseni, M.; Rabitz, H.; Broome, M.; Almeida, M.; Fedrizzi, A.; White, A. Efficient measurement of quantum dynamics via compressive sensing. Phys. Rev. Lett. 2011, 106, 100401.
7. Baldwin, C.H.; Kalev, A.; Deutsch, I.H. Quantum process tomography of unitary and near-unitary maps. Phys. Rev. A 2014, 90, 012110.
8. Deville, Y.; Deville, A. From blind quantum source separation to blind quantum process tomography. In Proceedings of the International Conference on Latent Variable Analysis and Signal Separation, Liberec, Czech Republic, 25–28 August 2015; Springer: Berlin/Heidelberg, Germany, 2015; pp. 184–192.
9. Deville, Y.; Deville, A. The blind version of quantum process tomography: Operating with unknown input values. IFAC-PapersOnLine 2017, 50, 11731–11737.
10. Deville, Y.; Deville, A. Quantum process tomography with unknown single-preparation input states: Concepts and application to the qubit pair with internal exchange coupling. Phys. Rev. A 2020, 101, 042332.
11. Verdeil, F.; Deville, Y.; Deville, A. Two-Qubit Unitary Quantum Process Tomography by Multiple-Delay Output Measurements for One Unknown Input Pure State Value. In Proceedings of the 2021 IEEE Statistical Signal Processing Workshop (SSP), Rio de Janeiro, Brazil, 11–14 July 2021; IEEE: New York, NY, USA, 2021; pp. 161–165.
12. Verdeil, F.; Deville, Y. Pure state tomography with parallel unentangled measurements. Phys. Rev. A, 2022; (to appear).
13. Goyeneche, D.; Cañas, G.; Etcheverry, S.; Gómez, E.; Xavier, G.; Lima, G.; Delgado, A. Five Measurement Bases Determine Pure Quantum States on Any Dimension. Phys. Rev. Lett. 2015, 115, 090401.
14. Finkelstein, J. Pure-state informationally complete and “really” complete measurements. Phys. Rev. A 2004, 70, 052107.
15. Wahba, G. A Least Squares Estimate of Satellite Attitude. SIAM Rev. 1965, 7, 409.
16. Hradil, Z.; Řeháček, J.; Fiurášek, J.; Ježek, M. 3 Maximum-Likelihood Methods in Quantum Mechanics. In Quantum State Estimation; Springer: Berlin/Heidelberg, Germany, 2004; pp. 59–112.
17. Broyden, C.G. The convergence of a class of double-rank minimization algorithms. IMA J. Appl. Math. 1970, 6, 76–90.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
