Abstract
Quantum process tomography (QPT) methods aim at identifying a given quantum process. QPT is a major quantum information processing tool, since it especially allows one to characterize the actual behavior of quantum gates, which are the building blocks of quantum computers. The present paper focuses on the estimation of a unitary process. This class is of particular interest because quantum mechanics postulates that the evolution of any closed quantum system is described by a unitary transformation. Unitary processes have significantly fewer parameters than general quantum processes ( vs. real independent parameters for qubits). By assuming that the process is unitary we develop two methods that scale better with the size of the system. In the present paper, we stay as close as possible to the standard setup of QPT: the operator has to prepare copies of different input states. The properties those states have to satisfy in order for our method to achieve QPT are very mild. Therefore, we choose to operate with copies of initially unknown pure input states. In order to perform QPT without knowing the input states, we perform measurements on half the copies of each state, and let the other half be transformed by the system before measuring them (each copy is only measured once). This setup has the advantage of removing the issue of systematic (i.e., same on all the copies of a state) errors entirely because it does not require the process input to take predefined values. We develop a straightforward analytical solution that first estimates the states from the averaged measurements and then finds the unitary matrix (representing the process) coherent with those estimates by using our analytical solution to an extended version of Wahba’s problem. This estimate may then be used as an initial point for a fine tuning algorithm that maximizes the likelihood of the measurements. Simulation results show the effectiveness of the proposed methods.
1. Prior Work and Problem Statement
System identification and system inversion are well-known problems, especially for classical systems. These problems are less challenging in the “nonblind”/“supervised” case [1] where the aim is, e.g., to identify the considered system by using the known input and the measured output. In contrast, in the “blind”/“unsupervised” case [2], the input values are unknown and uncontrolled, but some hypotheses are sometimes made on the input signal(s).
For quantum systems, non-blind system identification methods were first introduced in 1997 in [3] that came up with the name quantum process tomography (QPT), see [4]. They use copies of a set of known pure input states that are transformed by the process. Those transformed states are then measured and estimated using quantum state tomography (QST aims at estimating a quantum state using measurements). From there, the parameters of the process can be estimated from what is essentially a regression. This method scales poorly when the number of qubits increases, and is only experimentally feasible for one or two qubits. This is to be expected because, in general, a quantum process has independent real parameters ([4], p. 391), with d the dimension of the Hilbert space (for an -qubit system ). This method would later be called standard QPT (SQPT), in contrast to non-standard QPT that uses ancilla qubits and weak measurements (see [5] for a survey). In Ref. [6], a SQPT approach that scales better with the number of qubits by assuming that the process is sparse is introduced. Like Baldwin et al., in most of [7], we choose to restrict ourselves to unitary processes. This class is of particular interest because the evolution of any closed quantum system is described by a unitary transformation. A unitary process has independent real parameters.
A significant problem of SQPT is the need to precisely prepare the copies of the input states. Any systematic error on the input state has huge consequences for the precision. In 2015, we introduced the blind version of QPT (BQPT) in [8], then detailed it in [9], and more recently in [10]. In those papers, we focused on the tomography of the two-qubit cylindrical-symmetry Heisenberg coupling process. For those algorithms, the operator has to prepare one or several copies of an unknown set of initial states. This requires a preparation procedure to be known and reproducible, so that several copies of each used state may be prepared. It is not a violation of the no cloning theorem, the latter does not apply if we prepared the state that we want to reproduce. This idea removes the issue of systematic errors (with respect to a desired state) during the preparation. The system is identified by processing output measurements associated with different unknown input states going through the system. Generally, we need to perform QST or at least to estimate some measurement outcome probabilities for each of the output states. For the approaches of [8,9], this kind of QST requires copies of each considered output state. Therefore, for each one of the states the same experiment has to be repeated times with the same input state value, for input state preparations in total. The most recent paper [10] also proposes “single-preparation BQPT methods” (SBQPT), i.e., methods which can operate with only one instance of each considered input state, .
In [11] (2021), we introduced the setup that will be further developed in the current paper. In Ref. [11] we considered copies of a single 2-qubit state (initially unentangled) being transformed by a unitary process and measured at 5 different time delays (). In the current paper, we consider a setup closer to standard QPT where only two times are considered (see Figure 1). The unit-norm d-dimensional vectors represent the initial quantum pure states. Those initial states are considered unknown. We simply assume that they are pure, unentangled, linearly independent, and that at least one of the states is not orthogonal to all the others (i.e., ). These are reasonable hypotheses, as long as the qubits are prepared separately, the states are unentangled; and d random states are always (probability 1) linearly independent and not orthogonal in the d-dimensional Hilbert space. After waiting , each input state vector is multiplied by the unitary matrix , thus yielding the output state .
Figure 1.
Considered setup.
We assume that enough types of measurements are performed on copies of all states to achieve QST on each state. The present paper does not focus on the measurements performed and the QST algorithm. We simply assume that each state is recovered up to a global phase and a low residual error. For the numerical simulations, we will use the first QST algorithm of [12] which is suited to pure states and has the advantage of only requiring unentangled measurements on each qubit. However, the current paper is not bound to [12] and any pure state QST algorithm [13,14] can be performed. The fact that we perform measurements on the input states means that our algorithm is not blind, but since their values are not imposed by the proposed method, we keep the main advantage of the blind approaches (resilience to systematic error).
Section 2 briefly describes the system states and measurements. Section 3 describes a straightforward method that does not require an initialization and achieves QPT using the estimates of the states. Section 4 describes a method that improves the first estimate by maximizing the likelihood of the measurements. Finally, Section 5 contains some numerical results.
2. States and Measurements
2.1. Considered States
We hereafter consider an -qubit system, typically composed of distinguishable spins 1/2. Any pure state of that system is here expressed in the basis defined as the tensor product of the standard bases associated with each qubit. The components of in that basis can be stored in a d-element vector , with . The components of are complex and the norm of is 1. The global phase of has no physical meaning, so we can assume that the first non-zero component of is a real strictly positive number. In the rest of the paper, we consider the vector instead of the state .
2.2. Considered Types of Measurements
First focusing on a single qubit, we perform measurements based on the three Pauli operators , and [4] and, e.g., related to spin 1/2 components along the , and Z axes. For each such direction, we define the eigenvector matrix whose first and second columns are the eigenvectors of the considered Pauli operator, respectively, associated with eigenvalues and in the standard basis. These eigenvector matrices may be shown to read:
The probabilities of the outcomes and when performing a measurement for state along are, respectively, the first and second elements of where is the element-wise squared modulus and is the trans conjugate.
When considering qubits, we perform the above-defined measurements in parallel for all qubits. Each such type of measurements corresponds to a given direction for the m-th qubit for each m in (). For each set of eigenvectors (each is a column of one of the matrix of of (1)) and eigenvalue ( is either if is the first column of and if it is the second), respectively, associated with each qubit, the probability that a measurement on yields these eigenvalues reads: (where ⊗ is the tensor product). Those d probabilities (from to ) therefore form the vector where is the eigenvector matrix associated with the measurement along the directions of . It is expressed as the tensor (i.e., Kronecker) product of one-qubit matrices of (1)
For example with qubits, measuring the first one along and the second one along () yields the following eigenvector matrix . Those measurements are not multi-qubit Pauli measurements (used in (8.149) in [4]) because the latter only have 2 outcomes whereas the former have d outcomes (they are the concatenations of 2-outcome measurements). In the rest of the paper, this type of measurement will be referred to as a string of , and Z (in this example, ).
For qubits, there are of those measurements. Since we are dealing with pure states, we can work with only 4 types of measurements: ( is X on every odd numbered qubit and Y on the even numbered, all the others are the same measurement types on all qubits). In Ref. [12] we explain how to perform QST with those measurements in Section 3 and Section 5. We will not mention it again in the rest of the paper, but if we perform 3 types of measurements instead of 4 (along directions X, Y, and Z), as .
Those measurements are performed on and . In total, states are measured with 4 types of measurements. To estimate probabilities, each measurement is performed a given number of times that we call , the total number of measurements performed is . For each one of the distinct measurements, the numbers of times each one of the d outcomes was observed are stored in the d-dimensional vector where is the index if the measured state, defines the type of the measurement and is 0 if is measured and 1 if it is . Thus, contains the measurement counts for the state along direction . The expected value of is .
3. QST-Based Solution
3.1. Main Idea
We assume that QST is performed properly for the states of Figure 1. It yields:
where and are unknown phases and is the residual error, such that (E is the expected value). For the rest of this section, we consider unless stated otherwise. In Section 2.1, we stated that the global phases of the states do not matter. This is true if the states are considered independently and this is the reason why the QST cannot recover the global phase. However, when the states are considered together (in order to find ) the differences between the global phases of the different states matter.
We know that , therefore, with , we have: . Changing to and to does not change the equality, so we can also assume and accept that can only be recovered up to a global phase.
In the next section, we explain how to estimate the other phases . From that, we can define with which an estimate of can easily be found as the problem becomes:
works as a solution. However, it is generally not a unitary solution because of the QST errors. Finding that is the least square solution of with is a well known problem in the aerospace community. It is called Wahba’s problem after Wahba who first posed it in 1965 [15]. We have adapted its solution for (details will be provided in a future paper). This yields:
where is the singular value decomposition of . We showed that this solution is optimal in the least square (LS) sense. The solution is unique if, and only if, both and are of full rank. This QPT method can be extended to any higher number of input states but with fewer than d, the solution is not unique.
3.2. Phase Recovery
The aim of the current section is to find given the vectors, such that there exist a unitary matrix that realizes (3), with assumed to be 0.
The basic idea is to use the fact that and is unitary so does not change the norm. Therefore, is subject to:
with and the real and imaginary part of , respectively. Equation (5) is solvable if, and only if, and are not orthogonal. By writing and with , (5) becomes a quadratic equation (when both sides are multiplied by ) with two real solutions for t, corresponding to two solutions for that we call and (they can have the same value). It is numerically possible to have no real solution but this never happens if there are no QST errors. If there are no real solutions, we consider and both set to the real part of the complex solutions.
In order to choose between and , we have to consider a third pair of vectors: . Solving (5) for the 3 possible pairs of indices () gives us possibilities for , , . However, by definition and it may be hoped that there is only one of the 8 possibilities that satisfies this. We keep the solution that comes the closest.
We can apply this method for . We would thus know all the differences between the phases, and, since is 0, we would know all the phases.
In practice, doing this would work as long as one of the is not orthogonal to all the others (otherwise (5) is not solvable for enough indices ); but this would not be robust to a realistic QST error. The actual algorithm we use will be described in a future longer paper. It is based on the same idea: finding the two solutions of (5) for all indices. We improve the robustness by considering more than 3 pairs of well chosen indices.
4. Fine Tuning
4.1. Problem Statement
Section 3 describes a method to achieve QPT using the results of the QST on every state. The current section details a different approach that requires an initial estimate of (we will use from (4)) and finds the unitary matrix and initial states that maximize the likelihood of the measurements. Formally: , where represents the measurements results and is the log-likelihood which we maximize in order to maximize the likelihood. The problem is actually simpler if we perform the maximization successively, i.e., find the best for each of which we compute the likelihood, , because optimizing knowing (i.e., computing ) can be performed independently on all the : , where and are the measurements performed on and , respectively. This is the case because the are statistically independent and involve different arguments to be maximized for different j. Considering this, the problem becomes:
In order to solve (6) we first need to be able to compute the likelihood of the measurements. Since most gradient based optimization algorithms can only be performed with a real number vector as argument, we also need to find real number parametrization for and . Those two points are the focuses of the following two subsections.
4.2. Statistical Model for the Measurements
In [16], the formula for the likelihood of samples from multiple outcome measurements is given (albeit for a mixed state represented by a density matrix which we would have to replace by or ). Once we remove additive constants, the log-likelihood boils down to: , where is the theoretical probabilities of the m-th outcome, and is the number of times the m-th outcome has been measured. If the measurement whose likelihood we want to compute has as eigenvectors matrix () and is performed on , then and (see the definition of and in Section 2.2, stands for transpose). If, instead of , we measure , then and . Let us rewrite using the notation adapted to our measurements: (ℓ is either 0 or 1 so is either or ). We can replace in (6) by its expression (knowing ), this yields:
4.3. Parametrization of the Arguments
For a given represents an unentangled state. By definition, it can be decomposed as a tensor product of 1-qubit states: . Each has 2 real parameters, and . Therefore, can be parameterized with real parameters: .
is a unitary matrix. Hence, it can be shown that there exists a Hermitian matrix , such that where exp is the matrix exponential. Therefore, can be parameterized with real parameters: , where is the parametrization of starting with the real parts of the components that are on or above the diagonal () where is the element on row and column of ) and ending with the imaginary parts of the components that are strictly above the diagonal (). Accounting for the fact that can only be recovered up to a global phase, we can assume that corresponding to the top left element of is 0 and remove it from the parametrization. Indeed, ( is the identity matrix) has a 0 for its top left element and and only differ by a global phase. Therefore, as far as the optimization algorithm is concerned, has real parameters: .
4.4. Optimization
In order to find the real parameters of that solve (7) we use the BFGS quasi-Newton algorithm [17] initialized at the parameters that yield up to a global phase. This algorithm is implemented with the fminunc Matlab function, we provide it with the analytical expressions of the gradients of the criterion in order to make it run faster. At each step of the optimization of , d optimizations are performed on arguments in order to find the (to solve the max inside the first sum in (7)). Those optimizations are also performed using the BFGS quasi-Newton algorithm with the analytical gradient provided. The latter algorithm is initialized at the real parameters of the unentangled state that is the closest to , where is the inverse of the at the current state of the optimization (the whose likelihood we are computing in order to maximize it), j is the index of the we are optimizing and and are defined in Section 3.1. The optimization algorithms stop when the norm of the difference between the arguments at two successive iterations is lower than . Moreover, for the optimization of it stops after 700 iterations if the previous criterion is not met. For 3 qubits or less, the optimization of always stops before the 700 iterations. For 4 and 5 qubits, this is not always the case but the BFGS algorithm decreases the criterion at every step so even if the algorithm has not properly converged, the final estimate is still more likely than all the others, and, in particular, more likely than .
5. Numerical Results
Our algorithm is tested by simulating a random matrix which is a random complex matrix (composed of independent realizations of with and independent standard normal variables) to which the Gram–Schmidt process has been applied in order to make it unitary. The states are generated randomly by applying (defined in Section 4.3) to the random parameters generated uniformly on the intervals on which they are defined.
We then simulate the associated measurements and apply the algorithms of Section 3 and Section 4 in order to obtain estimates of and . With , the computation time on one thread on an Intel Xeon silver 4214 2.4-GHz processor is way shorter for (around 30 s for 5 qubits and less than 10 s for fewer qubits) than for (around 7 h for 5 qubits, 15 mn for 4 qubits and less than a minute for fewer qubits).
We choose to perform further tests with 4 qubits. 500 matrices are generated, and the associated and are computed with and for 2 qubits and and for 4 qubits. The associated numbers of copies of states to be measured are times greater, so and for 2 qubits and and for 4 qubits. We also compute with is the result of the likelihood maximization initialized at (only available in simulation) instead of .
The metric we use in order to quantify the proximity between and its estimate (either or ) is where is the angle that maximizes our metric (it accounts for the fact that can only be recovered up to a global phase) and is the Frobenius norm. This metric is between 0 (if and are the same up to a global phase) and 1 (if they are orthogonal with respect to the Hilbert–Schmidt inner product).
The cumulative density function (cdf) of our metric (called error) is displayed in Figure 2. We note that:
Figure 2.
Empirical cdf of the errors of the 3 maximum likelihood estimators with 4 qubits.
- is very similar to its reference (especially with ). This means that the likelihood algorithm converges towards the global minimum (so 700 iterations is enough and is a good enough initial point).
- is worse than . This means that the costly likelihood maximization is not made in vain.
- The errors with are roughly twice smaller than the errors with . So we are in the classic linear case where the error is proportional to the square root of the number of measurements. Additionally, the same graph with any could be deduced from Figure 2.
6. Conclusions and Future Work
In this paper, we introduced two QPT methods that do not require the initial states to be set to predetermined values, but work with randomly selected initial states that are measured beforehand. The first method uses QST to estimate the input and output states, lifts the phase ambiguities and finds the unitary matrix which fits the estimated states the best. The second method finds the unitary matrix that is the most likely according to the statistical distribution of the measurements. The latter method is more precise but slower and uses the result of the first method as an initialization.
We intend to perform more extensive tests and compare our method to non-blind methods such as [7]. We also want to link this algorithm with that of [11] by considering fewer initial states and more time delays than in Figure 1.
Author Contributions
The work described in the paper was performed during the PhD of François Verdeil under the supervision of Y.D. (PhD director). Both authors exchanged ideas to create the algorithms. Y.D. had the idea to take part in the MaxEnt22 conference. F.V. wrote the first draft of the paper and the code. Y.D.’s input and experience was instrumental in improving the paper. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
Not applicable here.
Acknowledgments
The authors would like to thank Alain Deville for helpful discussions.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Ljung, L. System Identification: Theory for the User; PTR Prentice Hall: Upper Saddle River, NJ, USA, 1999; p. 540. [Google Scholar]
- Abed-Meraim, K.; Qiu, W.; Hua, Y. Blind system identification. Proc. IEEE 1997, 85, 1310–1322. [Google Scholar] [CrossRef]
- Nielsen, M.A.; Chuang, I.L. Prescription for experimental determination of the dynamics of a quantum black box. J. Mod. Opt. 2018, 44, 2455–2467. [Google Scholar]
- Nielsen, M.A.; Chuang, I.L. Quantum Computation and Quantum Information; Cambridge University Press: Cambridge, UK, 2000. [Google Scholar]
- Mohseni, M.; Rezakhani, A.T.; Lidar, D.A. Quantum-process tomography: Resource analysis of different strategies. Phys. Rev. A 2008, 77, 032322. [Google Scholar] [CrossRef]
- Shabani, A.; Kosut, R.; Mohseni, M.; Rabitz, H.; Broome, M.; Almeida, M.; Fedrizzi, A.; White, A. Efficient measurement of quantum dynamics via compressive sensing. Phys. Rev. Lett. 2011, 106, 100401. [Google Scholar] [CrossRef] [PubMed]
- Baldwin, C.H.; Kalev, A.; Deutsch, I.H. Quantum process tomography of unitary and near-unitary maps. Phys. Rev. A 2014, 90, 012110. [Google Scholar] [CrossRef]
- Deville, Y.; Deville, A. From blind quantum source separation to blind quantum process tomography. In Proceedings of the International Conference on Latent Variable Analysis and Signal Separation, Liberec, Czech Republic, 25–28 August 2015; Springer: Berlin/Heidelberg, Germany, 2015; pp. 184–192. [Google Scholar]
- Deville, Y.; Deville, A. The blind version of quantum process tomography: Operating with unknown input values. IFAC-PapersOnLine 2017, 50, 11731–11737. [Google Scholar] [CrossRef]
- Deville, Y.; Deville, A. Quantum process tomography with unknown single-preparation input states: Concepts and application to the qubit pair with internal exchange coupling. Phys. Rev. A 2020, 101, 042332. [Google Scholar] [CrossRef]
- Verdeil, F.; Deville, Y.; Deville, A. Two-Qubit Unitary Quantum Process Tomography by Multiple-Delay Output Measurements for One Unknown Input Pure State Value. In Proceedings of the 2021 IEEE Statistical Signal Processing Workshop (SSP), Rio de Janeiro, Brazil, 11–14 July 2021; IEEE: New York, NY, USA, 2021; pp. 161–165. [Google Scholar]
- Verdeil, F.; Deville, Y. Pure state tomography with parallel unentangled measurements. Phys. Rev. A, 2022; (to appear). [Google Scholar]
- Goyeneche, D.; Cañas, G.; Etcheverry, S.; Gómez, E.; Xavier, G.; Lima, G.; Delgado, A. Five Measurement Bases Determine Pure Quantum States on Any Dimension. Phys. Rev. Lett. 2015, 115, 090401. [Google Scholar] [CrossRef] [PubMed]
- Finkelstein, J. Pure-state informationally complete and “really” complete measurements. Phys. Rev. A 2004, 70, 052107. [Google Scholar] [CrossRef]
- Wahba, G. A Least Squares Estimate of Satellite Attitude. SIAM Rev. 1965, 7, 409. [Google Scholar] [CrossRef]
- Hradil, Z.; Řeháček, J.; Fiurášek, J.; Ježek, M. 3 Maximum-Likelihood Methods in Quantum Mechanics. In Quantum State Estimation; Springer: Berlin/Heidelberg, Germany, 2004; pp. 59–112. [Google Scholar]
- Broyden, C.G. The convergence of a class of double-rank minimization algorithms. IMA J. Appl. Math. 1970, 6, 76–90. [Google Scholar] [CrossRef]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).