Information Geometry of Randomized Quantum State Tomography

Fujiwara, Akio; Yamagata, Koichi

doi:10.3390/e20080609


by Akio Fujiwara ¹,* and Koichi Yamagata ²

¹ Department of Mathematics, Osaka University, Toyonaka, Osaka 560-0043, Japan

² Graduate School of Informatics and Engineering, The University of Electro-Communications, Chofu, Tokyo 182-8585, Japan

* Author to whom correspondence should be addressed.

Entropy 2018, 20(8), 609; https://doi.org/10.3390/e20080609

Submission received: 29 June 2018 / Revised: 5 August 2018 / Accepted: 13 August 2018 / Published: 16 August 2018

(This article belongs to the Special Issue Entropy: From Physics to Information Sciences and Geometry)

Abstract

Suppose that a d-dimensional Hilbert space

H ≃ C^{d}

admits a full set of mutually unbiased bases

\{| 1^{(a)} ⟩, \dots, | d^{(a)} ⟩\}

, where

a = 1, \dots, d + 1

. A randomized quantum state tomography is a scheme for estimating an unknown quantum state on

H

through iterative applications of measurements

M^{(a)} = \{| 1^{(a)} ⟩ ⟨ 1^{(a)} |, \dots, | d^{(a)} ⟩ ⟨ d^{(a)} |\}

for

a = 1, \dots, d + 1

, where the numbers of applications of these measurements are random variables. We show that the space of the resulting probability distributions enjoys a mutually orthogonal dualistic foliation structure, which provides us with a simple geometrical insight into the maximum likelihood method for the quantum state tomography.

Keywords:

quantum state tomography; mutually unbiased bases; information geometry; dualistic foliation; mixed coordinate system

1. Introduction

Quantum state tomography is a method of estimating an unknown quantum state represented on some Hilbert space

H

, consisting of a fixed set of measurements that provides sufficient information about the unknown quantum state, as well as a data processing that maps each measurement outcome into the quantum state space

S (H)

on

H

[1]. A set of measurements that fulfils this requirement is sometimes called a measurement basis. For mathematical simplicity, we restrict ourselves to Hilbert spaces of finite dimensions.

To elucidate our motivation, let us treat the simplest case when

H ≃ C^{2}

. It is well known that there is a one-to-one affine correspondence between the qubit state space

S (C^{2}) : = \{ρ \in C^{2 \times 2} | ρ \geq 0, Tr ρ = 1\}

and the unit ball (called the Bloch ball)

B : = \{x = (x_{1}, x_{2}, x_{3}) \in R^{3}| {∥ x ∥}^{2} : = {(x_{1})}^{2} + {(x_{2})}^{2} + {(x_{3})}^{2} \leq 1\} .

In fact, the correspondence is explicitly given by the Stokes parametrization

x ⟼ ρ_{x} = \frac{1}{2} (I + x_{1} σ_{1} + x_{2} σ_{2} + x_{3} σ_{3}),

where

σ_{1}

,

σ_{2}

, and

σ_{3}

are the standard Pauli matrices. Since

E_{ρ_{x}} [σ_{i}] : = Tr ρ_{x} σ_{i} = x_{i}

for

i \in {1, 2, 3}

, the set

σ = (σ_{1}, σ_{2}, σ_{3})

of observables is regarded as an unbiased estimator [2,3,4] for the Stokes parameter

x = (x_{1}, x_{2}, x_{3})

. This is the basic idea behind the standard qubit state tomography, which runs as follows: suppose that, among N independent experiments, the ith Pauli matrix

σ_{i}

was measured

N / 3

times, and outcomes

+ 1

(spin-up) and

- 1

(spin-down) were obtained

n_{i}^{+}

and

n_{i}^{-}

times, respectively. Then a naive estimate for the true value of the parameter

x = (x_{1}, x_{2}, x_{3})

is

\hat{x} = ({\hat{x}}_{1}, {\hat{x}}_{2}, {\hat{x}}_{3}) : = (\frac{n_{1}^{+} - n_{1}^{-}}{N / 3}, \frac{n_{2}^{+} - n_{2}^{-}}{N / 3}, \frac{n_{3}^{+} - n_{3}^{-}}{N / 3}) .

When the estimate

\hat{x} \in {[- 1, 1]}^{3}

falls outside the Bloch ball B, it needs to be corrected so that the new estimate lies in the Bloch ball B. The maximum likelihood method is a canonical one to obtain a corrected estimate [2,5,6,7,8,9,10]. From the point of view of information geometry [11,12,13], the maximum likelihood estimate (MLE) is the orthogonal projection from the temporary estimate

\hat{x}

onto the Bloch ball B with respect to the standard Fisher metric along the

\nabla^{(m)}

-geodesic [14], (cf., Appendix A).

Now let us deal with a slightly generalized situation: suppose that the ith Pauli matrix

σ_{i}

was measured

N_{i}

times and outcomes

+ 1

and

- 1

were obtained

n_{i}^{+}

and

n_{i}^{-}

times, respectively, where

\{N_{i}\}_{i = 1, 2, 3}

were random variables. Such a situation arises in an actual experiment due to unexpected particle loss [15]. We shall call such a generalized estimation scheme a randomized state tomography. A naive estimate in this case is the following:

\hat{x} = ({\hat{x}}_{1}, {\hat{x}}_{2}, {\hat{x}}_{3}) : = (\frac{n_{1}^{+} - n_{1}^{-}}{N_{1}}, \frac{n_{2}^{+} - n_{2}^{-}}{N_{2}}, \frac{n_{3}^{+} - n_{3}^{-}}{N_{3}}) .

One may invoke the maximum likelihood method when

\hat{x}

falls outside the Bloch ball. It is then interesting to ask if there is also a useful geometrical picture for the MLE even when the numbers

N_{i}

of measurements are random variables.
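The naive estimate above can be sketched in a few lines. This is only an illustration: the function names and the counts are hypothetical and are not taken from the paper. Each axis is described by its pair of counts, so that N_i = n_i^+ + n_i^- may differ from axis to axis.

```python
def naive_stokes_estimate(counts):
    """Return the naive estimate x_hat from per-axis counts.

    counts[i] = (n_plus, n_minus) for the i-th Pauli matrix, so that
    N_i = n_plus + n_minus is allowed to differ between axes.
    """
    x_hat = []
    for n_plus, n_minus in counts:
        n_i = n_plus + n_minus
        if n_i == 0:
            # With no data on an axis the naive estimate is undefined.
            raise ValueError("axis was never measured")
        x_hat.append((n_plus - n_minus) / n_i)
    return tuple(x_hat)


def in_bloch_ball(x):
    """True when x lies in the closed unit ball of R^3."""
    return sum(c * c for c in x) <= 1.0


# Hypothetical counts with N_1 = 50, N_2 = 50, N_3 = 30.
counts_example = [(48, 2), (45, 5), (20, 10)]
x_hat_example = naive_stokes_estimate(counts_example)
needs_correction = not in_bloch_ball(x_hat_example)
```

With these counts the estimate lies inside the cube [-1, 1]^3 but outside the Bloch ball, which is exactly the situation in which a corrected estimate is needed.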

The above mentioned problem is naturally extended to quantum state tomography on an arbitrary Hilbert space that admits a full set of mutually unbiased bases [16,17]. In a d-dimensional Hilbert space

H ≃ C^{d}

, k orthonormal bases

{\{| α^{(1)} 〉\}}_{α \in {1, \dots, d}}, {\{| β^{(2)} 〉\}}_{β \in {1, \dots, d}}, \dots, {\{| γ^{(k)} 〉\}}_{γ \in {1, \dots, d}}

are called mutually unbiased if they satisfy

{|〈α^{(a)} | β^{(b)}〉|}^{2} = \frac{1}{d}

for all

a, b \in {1, \dots, k}

with

a \neq b

, and

α, β \in {1, \dots, d}

. It is known that the number k of mutually unbiased bases (MUBs) is at most

d + 1

[18]. If there are

d + 1

MUBs, the Hilbert space

H

is said to admit a full set of MUBs. For example, when the dimension d of

H

is a power of a prime,

H

admits a full set of MUBs [19]. Whether or not any Hilbert space admits a full set of MUBs is an open question [16].
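The defining condition can be checked numerically for small examples. The sketch below is a minimal illustration (the helper names are hypothetical): it tests orthonormality within each basis and the overlap condition 1/d between different bases, using the eigenvectors of the three Pauli matrices as the test case.

```python
import math


def inner(u, v):
    """Hermitian inner product <u|v> of two coordinate tuples."""
    return sum(complex(a).conjugate() * complex(b) for a, b in zip(u, v))


def are_mutually_unbiased(bases, tol=1e-12):
    """Check orthonormality of each basis and |<u|v>|^2 = 1/d across bases."""
    d = len(bases[0])
    for a, basis_a in enumerate(bases):
        for i, u in enumerate(basis_a):
            for j, v in enumerate(basis_a):
                expected = 1.0 if i == j else 0.0
                if abs(abs(inner(u, v)) ** 2 - expected) > tol:
                    return False
        for basis_b in bases[a + 1:]:
            for u in basis_a:
                for v in basis_b:
                    if abs(abs(inner(u, v)) ** 2 - 1.0 / d) > tol:
                        return False
    return True


r = 1.0 / math.sqrt(2.0)
# Eigenvectors of sigma_1, sigma_2, sigma_3 (d = 2, so d + 1 = 3 bases).
pauli_bases = [
    [(r, r), (r, -r)],
    [(r, -1j * r), (r, 1j * r)],
    [(1, 0), (0, 1)],
]
qubit_is_full_set = are_mutually_unbiased(pauli_bases)
```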

In what follows, unless otherwise stated, we assume that the Hilbert space

H ≃ C^{d}

under consideration admits a full set of MUBs. As demonstrated in Appendix B (cf., [17,20]), each density operator

ρ \in S (H)

can be uniquely represented as

ρ = ρ (ξ) : = \sum_{a = 1}^{d + 1} \{\sum_{α = 1}^{d - 1} ξ_{α}^{(a)} M_{α}^{(a)} + (1 - \sum_{β = 1}^{d - 1} ξ_{β}^{(a)}) M_{d}^{(a)}\} - I,

(1)

where

M^{(a)} : = \{M_{1}^{(a)}, \dots, M_{d}^{(a)}\} = \{| 1^{(a)} 〉 〈 1^{(a)} |, \dots, | d^{(a)} 〉 〈 d^{(a)} |\}

is the projection-valued measure (PVM) associated with the ath orthogonal basis in the MUBs, and

ξ : = {(ξ_{α}^{(a)})}_{(a, α) \in {1, \dots, d + 1} \times {1, \dots, d - 1}}

is a

(d^{2} - 1)

-dimensional real parameter that is chosen so that

ρ (ξ) \geq 0

. A simple calculation shows that, if the ath measurement

M^{(a)}

is applied to the state

ρ (ξ)

, one obtains each outcome

α \in {1, \dots, d}

with probability

p_{α}^{(a)} = Tr ρ (ξ) M_{α}^{(a)} = \begin{cases} ξ_{α}^{(a)}, & \text{for } α = 1, \dots, d - 1, \\ 1 - \sum_{β = 1}^{d - 1} ξ_{β}^{(a)}, & \text{for } α = d . \end{cases}

(2)

This implies that the parametrization

ξ \mapsto ρ (ξ)

establishes an affine isomorphism between the quantum state space

S (C^{d}) : = \{ρ \in C^{d \times d} | ρ \geq 0, Tr ρ = 1\}

and the convex set

B : = \{ξ \in R^{d^{2} - 1} | ρ (ξ) \geq 0\} .
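Relation (2) can be verified numerically. The sketch below builds ρ(ξ) from (1) for d = 3, using the four bases listed in Section 4.2, and checks that the trace is 1 and that each outcome probability returns the corresponding coordinate. The function names and the value of ξ are hypothetical. The chosen ξ is not claimed to satisfy ρ(ξ) ≥ 0, because (2) is a purely linear identity and does not need positivity.

```python
import cmath
import math

W = cmath.exp(2j * math.pi / 3)  # primitive third root of unity
R = 1.0 / math.sqrt(3.0)

# The four mutually unbiased bases of C^3 listed in Section 4.2.
MUBS_3 = [
    [(1, 0, 0), (0, 1, 0), (0, 0, 1)],
    [(R, R, R), (R, R * W, R * W**2), (R, R * W**2, R * W)],
    [(R * W, R, R), (R, R * W, R), (R, R, R * W)],
    [(R * W**2, R, R), (R, R * W**2, R), (R, R, R * W**2)],
]


def projector(v):
    """Rank-one projector |v><v| as a nested list."""
    return [[complex(a) * complex(b).conjugate() for b in v] for a in v]


def rho_from_xi(xi, mubs):
    """Equation (1); xi[a] holds the first d - 1 coordinates of basis a."""
    d = len(mubs[0])
    # Start from -I and add the weighted projectors of every basis.
    rho = [[(-1.0 if i == j else 0.0) + 0j for j in range(d)] for i in range(d)]
    for basis, xi_a in zip(mubs, xi):
        weights = list(xi_a) + [1.0 - sum(xi_a)]
        for v, w in zip(basis, weights):
            p = projector(v)
            for i in range(d):
                for j in range(d):
                    rho[i][j] += w * p[i][j]
    return rho


def outcome_probability(rho, v):
    """Tr(rho |v><v|) = <v|rho|v>."""
    d = len(v)
    value = sum(
        complex(v[i]).conjugate() * rho[i][j] * complex(v[j])
        for i in range(d)
        for j in range(d)
    )
    return value.real


# Hypothetical coordinates, chosen only to exercise the identity.
xi_example = [(0.5, 0.3), (0.2, 0.4), (0.3, 0.3), (0.25, 0.35)]
rho_example = rho_from_xi(xi_example, MUBS_3)
```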

Incidentally, the Stokes parametrization

x \mapsto ρ_{x}

for the qubit state space

S (C^{2})

is regarded as a special case of the above parametrization

ξ \mapsto ρ (ξ)

for

S (C^{d})

. In fact, the eigenvectors of the Pauli matrices

σ_{1}

,

σ_{2}

,

σ_{3}

form a full set of MUBs on

C^{2}

, and the Stokes parametrization

x = (x_{1}, x_{2}, x_{3})

is related to the above parametrization

ξ = (ξ_{1}^{(1)}, ξ_{1}^{(2)}, ξ_{1}^{(3)})

as

ξ_{1}^{(a)} = \frac{x_{a} + 1}{2}, (a = 1, 2, 3) .

Now that a standard affine parametrization

ξ \mapsto ρ (ξ)

has been established on an arbitrary Hilbert space

H ≃ C^{d}

that admits a full set of MUBs, the scheme of randomized state tomography is naturally extended to

H

as follows. Suppose that the ath measurement

M^{(a)}

was applied

N^{(a)}

times and the outcome

α \in {1, \dots, d}

was obtained

n_{α}^{(a)}

times, where

{\{N^{(a)}\}}_{a = 1, \dots, d + 1}

were random variables. Then, due to (2), a naive estimate for the parameter

ξ_{α}^{(a)}

is

{\hat{ξ}}_{α}^{(a)} = \frac{n_{α}^{(a)}}{N^{(a)}} .

When the estimate

\hat{ξ} : = ({\hat{ξ}}_{α}^{(a)}) \in {[0, 1]}^{d^{2} - 1}

falls outside the parameter space B, one may invoke the maximum likelihood method to obtain a corrected estimate.

The objective of the present paper is to clarify that the

\nabla^{(m)}

-projection interpretation for the MLE is still valid for the randomized state tomography by changing the standard Fisher metric into a deformed one depending on the realization of the random variables

N^{(a)}

, which might as well be called a randomized Fisher metric. Such a geometrical picture is expected to provide useful insight into quantum metrology.

The paper is organized as follows. In Section 2, we first introduce a statistical model on an extended sample space

Ω

that represents the randomized state tomography. We then clarify that the probability simplex

P (Ω)

is decomposed into a mutually orthogonal dualistic foliation by means of certain

\nabla^{(m)}

- and

\nabla^{(e)}

-autoparallel submanifolds. In Section 3, we give a statistical interpretation of the above-mentioned dualistic foliation structure. In particular, we point out that the MLE is the

\nabla^{(m)}

-projection with respect to a deformed Fisher metric that depends on the realization of the random variables

N^{(a)}

. These results are demonstrated by several illustrative examples in Section 4. Finally, some concluding remarks are presented in Section 5. For the reader’s convenience, some background information is provided in Appendix A and Appendix B, including information geometry of the MLE and affine parametrization of a quantum state space

S (H)

.

2. Geometry of Randomized State Tomography

We identify the randomized state tomography on

H ≃ C^{d}

with the following scheme [21]: at each step of the measurement, one chooses a PVM

M^{(a)}

at random with probability

s^{(a)}

, (

a = 1, \dots, d + 1

), and applies the chosen PVM to yield an outcome

α \in \{1, \dots, d\}

. The sample space

Ω

for this statistical picture is

Ω = \{(a, α) | a \in {1, \dots, d + 1}, α \in {1, \dots, d}\} .

Suppose that the unknown state

ρ

is specified by the coordinate

ξ \in B

as in (1). Then the corresponding probability distribution on

Ω

is represented by the

d (d + 1)

-dimensional probability vector

\begin{matrix} p_{(s, ξ)} : = (s^{(1)} (ξ_{1}^{(1)}, \dots, ξ_{d - 1}^{(1)}, 1 - \sum_{α = 1}^{d - 1} ξ_{α}^{(1)}), \dots, s^{(d)} (ξ_{1}^{(d)}, \dots, ξ_{d - 1}^{(d)}, 1 - \sum_{α = 1}^{d - 1} ξ_{α}^{(d)}), \\ (1 - \sum_{a = 1}^{d} s^{(a)}) (ξ_{1}^{(d + 1)}, \dots, ξ_{d - 1}^{(d + 1)}, 1 - \sum_{α = 1}^{d - 1} ξ_{α}^{(d + 1)})) \end{matrix}

where the parameter

s : = (s^{(1)}, \dots, s^{(d)})

belongs to the domain

D : = \{s \in R^{d} | s^{(a)} > 0 for a \in {1, \dots, d}, and \sum_{a = 1}^{d} s^{(a)} < 1\} .

Note that the family

\{p_{(s, ξ)} | s \in D, ξ \in Ξ\}

with

\begin{matrix} Ξ : = \{ξ \in R^{d^{2} - 1} | ξ_{α}^{(a)} > 0 for (a, α) \in {1, \dots, d + 1} \times {1, \dots, d - 1}, \\ and \sum_{α = 1}^{d - 1} ξ_{α}^{(a)} < 1 for a \in {1, \dots, d + 1}\} \end{matrix}

forms a

(d^{2} + d - 1)

-dimensional open probability simplex

P (Ω)

, and the parameters

(s, ξ)

form a coordinate system of

P (Ω)

. Since we are only interested in estimating the parameter

ξ \in Ξ

, the remaining parameter

s \in D

is understood as a set of nuisance parameters [2,12]. In what follows, we regard

P (Ω)

as a statistical manifold endowed with the standard dualistic structure

(g, \nabla^{(e)}, \nabla^{(m)})

, where g is the Fisher metric, and

\nabla^{(e)}

and

\nabla^{(m)}

are the exponential and mixture connections [12].

Let us consider the following submanifolds of

P (Ω)

:

M (s) : = \{p_{(s, ξ)} ∣ ξ \in Ξ\}

for each

s \in D

, and

E (ξ) : = \{p_{(s, ξ)} ∣ s \in D\}

for each

ξ \in Ξ

. Since

M (s)

and

E (ξ)

are convex subsets of

P (Ω)

, they are both

\nabla^{(m)}

-autoparallel. In addition, we have the following.

Proposition 1.

For each

ξ \in Ξ

, the submanifold

E (ξ)

is

\nabla^{(e)}

-autoparallel. Furthermore, for each

s \in D

and

ξ \in Ξ

, the submanifolds

M (s)

and

E (ξ)

are mutually orthogonal with respect to the Fisher metric g.

Proof.

Let us change the coordinate system

(s, ξ)

into

(η_{〈a〉}, η_{〈b, α〉})

, where

η_{〈a〉} : = s^{(a)}

for

a \in {1, \dots, d}

, and

η_{〈b, α〉} : = s^{(b)} ξ_{α}^{(b)}

for

(b, α) \in {1, \dots, d + 1} \times {1, \dots, d - 1}

. With this coordinate transformation, the probability vector

p_{(s, ξ)}

is rewritten as

p_{η} = ⨁_{a = 1}^{d + 1} (η_{〈a, 1〉}, \dots, η_{〈a, d - 1〉}, η_{〈a〉} - \sum_{α = 1}^{d - 1} η_{〈a, α〉}) .

(3)

Here,

η_{〈d + 1〉}

is a function of

\{η_{〈a〉}\}_{a \in \{1, \dots, d\}}

defined by

η_{〈d + 1〉} : = 1 - \sum_{a = 1}^{d} η_{〈a〉},

and is not a component of the coordinate system

η : = (η_{〈a〉}, η_{〈b, α〉})

. We see from the representation (3) that the coordinate system

η

is

\nabla^{(m)}

-affine. The potential function for

η

is given by the negative entropy

\begin{matrix} φ (η) & : = & \sum_{ω \in Ω} p_{η} (ω) \log p_{η} (ω) \\ & = & \sum_{a = 1}^{d + 1} \{\sum_{α = 1}^{d - 1} η_{〈a, α〉} \log η_{〈a, α〉} + (η_{〈a〉} - \sum_{β = 1}^{d - 1} η_{〈a, β〉}) \log (η_{〈a〉} - \sum_{β = 1}^{d - 1} η_{〈a, β〉})\} \end{matrix}

and the dual

\nabla^{(e)}

-affine coordinate system

θ

is given by

θ^{〈a〉} = \frac{\partial φ}{\partial η_{〈a〉}} = log \frac{s^{(a)}}{(1 - \sum_{b = 1}^{d} s^{(b)})} + log \frac{(1 - \sum_{β = 1}^{d - 1} ξ_{β}^{(a)})}{(1 - \sum_{β = 1}^{d - 1} ξ_{β}^{(d + 1)})}

for

a \in {1, \dots, d}

, and

θ^{〈b, α〉} = \frac{\partial φ}{\partial η_{〈b, α〉}} = log \frac{ξ_{α}^{(b)}}{(1 - \sum_{β = 1}^{d - 1} ξ_{β}^{(b)})}

for

(b, α) \in {1, \dots, d + 1} \times {1, \dots, d - 1}

. Thus, fixing

ξ

is equivalent to fixing the coordinates

{(θ^{〈b, α〉})}_{(b, α) \in {1, \dots, d + 1} \times {1, \dots, d - 1}}

, and the submanifold

E (ξ)

is generated by changing the remaining parameters

{(θ^{〈a〉})}_{a \in {1, \dots, d}}

. This implies that

E (ξ)

is

\nabla^{(e)}

-autoparallel, proving the first part of the claim.

To prove the second part, let us introduce a mixed coordinate system [11]

{(η_{〈a〉}; θ^{〈b, α〉})}_{a \in {1, \dots, d}, (b, α) \in {1, \dots, d + 1} \times {1, \dots, d - 1}}

of

P (Ω)

. Since

η_{〈a〉} = s^{(a)}

, the submanifold

M (s)

is rewritten as

M (s) = \{p_{(s, ξ)} | {(η_{〈a〉})}_{a \in {1, \dots, d}} are fixed and {(θ^{〈b, α〉})}_{(b, α) \in {1, \dots, d + 1} \times {1, \dots, d - 1}} are arbitrary\} .

On the other hand, the submanifold

E (ξ)

is rewritten as

E (ξ) = \{p_{(s, ξ)} | {(θ^{〈b, α〉})}_{(b, α) \in {1, \dots, d + 1} \times {1, \dots, d - 1}} are fixed and {(η_{〈a〉})}_{a \in {1, \dots, d}} are arbitrary\} .

Thus, the orthogonality of

M (s)

and

E (ξ)

is an immediate consequence of the orthogonality of the dual affine coordinate systems

θ

and

η

with respect to the Fisher metric g. ☐

Proposition 1 implies that the manifold

P (Ω)

is decomposed into a mutually orthogonal dualistic foliation based on the submanifolds

M (s)

and

E (ξ)

, as illustrated in Figure 1. We shall exploit this geometrical structure in the next section.
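The orthogonality asserted in Proposition 1 can also be checked directly in the original coordinates (s, ξ), without passing to the mixed coordinate system. The following computation is a sketch along those lines. It uses the formal conventions s^{(d+1)} := 1 - \sum_{a} s^{(a)} and ξ_{d}^{(b)} := 1 - \sum_{α} ξ_{α}^{(b)}, with a \in \{1, \dots, d\}, c \in \{1, \dots, d+1\} and α \in \{1, \dots, d-1\}.

```latex
\begin{align*}
\frac{\partial p_{(s,\xi)}(b,\beta)}{\partial s^{(a)}}
  &= \bigl(\delta_{ab} - \delta_{b,d+1}\bigr)\,\xi_{\beta}^{(b)}, \\
\frac{\partial p_{(s,\xi)}(b,\beta)}{\partial \xi_{\alpha}^{(c)}}
  &= \delta_{bc}\, s^{(b)} \bigl(\delta_{\beta\alpha} - \delta_{\beta d}\bigr), \\
g\!\left(\frac{\partial}{\partial s^{(a)}},
         \frac{\partial}{\partial \xi_{\alpha}^{(c)}}\right)
  &= \sum_{(b,\beta)\in\Omega}
     \frac{1}{p_{(s,\xi)}(b,\beta)}\,
     \frac{\partial p_{(s,\xi)}(b,\beta)}{\partial s^{(a)}}\,
     \frac{\partial p_{(s,\xi)}(b,\beta)}{\partial \xi_{\alpha}^{(c)}} \\
  &= \sum_{b=1}^{d+1} \bigl(\delta_{ab} - \delta_{b,d+1}\bigr)\,\delta_{bc}
     \sum_{\beta=1}^{d} \bigl(\delta_{\beta\alpha} - \delta_{\beta d}\bigr)
   = 0 .
\end{align*}
```

The inner sum over β equals 1 - 1 = 0 for every α ≤ d - 1, so every tangent vector of E(ξ) is orthogonal to every tangent vector of M(s) at their common point.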

3. Estimation of the Parameter $ξ$

Let us proceed to the problem of estimating the unknown parameter

ξ

using the randomized tomography. Suppose that, among N independent repetitions of experiments, the ath measurement

M^{(a)}

was applied

N^{(a)}

times and outcomes

α \in {1, \dots, d}

were obtained

n_{α}^{(a)}

times. Then temporary estimates

(\hat{s}, \hat{ξ})

for the parameters

(s, ξ)

are given by

{\hat{s}}^{(a)} : = \frac{N^{(a)}}{N}

for

a \in {1, \dots, d}

, and

{\hat{ξ}}_{β}^{(b)} : = \frac{n_{β}^{(b)}}{N^{(b)}}

for

(b, β) \in {1, \dots, d + 1} \times {1, \dots, d - 1}

. If

\hat{ξ}

has fallen outside the physical domain B, one may seek a corrected estimate by the maximum likelihood method. Observe that, due to (2), the empirical distribution

{\hat{q}}_{N} \in P (Ω)

is represented as

{\hat{q}}_{N} = p_{(\hat{s}, \hat{ξ})} .

(4)
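The temporary estimates and relation (4) can be sketched as follows. The function names and counts are hypothetical. For convenience the lists below carry all d + 1 components of s and all d components of each ξ^{(a)}, including the redundant last ones, and every N^{(a)} is assumed to be positive.

```python
def temporary_estimates(counts):
    """counts[a][alpha] = n_alpha^(a); returns (s_hat, xi_hat)."""
    n_per_basis = [sum(row) for row in counts]
    n_total = sum(n_per_basis)
    s_hat = [n_a / n_total for n_a in n_per_basis]
    xi_hat = [[n / n_a for n in row] for row, n_a in zip(counts, n_per_basis)]
    return s_hat, xi_hat


def model_distribution(s, xi):
    """p_(s, xi) on Omega as a dict keyed by (a, alpha), both 1-based."""
    return {
        (a + 1, alpha + 1): s[a] * xi[a][alpha]
        for a in range(len(s))
        for alpha in range(len(xi[a]))
    }


def empirical_distribution(counts):
    """Relative frequency of each pair (a, alpha) among all N trials."""
    n_total = sum(sum(row) for row in counts)
    return {
        (a + 1, alpha + 1): n / n_total
        for a, row in enumerate(counts)
        for alpha, n in enumerate(row)
    }


# Hypothetical qubit data: N = 100 with N^(1) = 40, N^(2) = 20, N^(3) = 40.
counts_example = [[30, 10], [12, 8], [25, 15]]
s_hat_example, xi_hat_example = temporary_estimates(counts_example)
p_hat = model_distribution(s_hat_example, xi_hat_example)
q_hat = empirical_distribution(counts_example)
```

The two dictionaries agree entry by entry, because the product of N^{(a)}/N and n_α^{(a)}/N^{(a)} is n_α^{(a)}/N.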

On the other hand, the physical domain B in the parameter space

Ξ

corresponds to the subset

B : = \{p_{(s, ξ)} | s \in D, ξ \in B\}

of

P (Ω)

, (see Figure 1). The MLE

p^{*}

in

P (Ω)

is then given by

p^{*} = \underset{p \in B}{\arg \min} D ({\hat{q}}_{N} ∥ p),

(5)

where

D (\cdot ∥ \cdot)

is the Kullback-Leibler divergence (cf., Appendix A). A crucial observation is the following.

Proposition 2.

The minimum in (5) is achieved on

M (\hat{s}) \cap B

.

Proof.

Let us take a point

p_{(s, ξ)} \in B

arbitrarily. It then follows from the mutually orthogonal dualistic foliation of

P (Ω)

established in Proposition 1 that

\begin{matrix} D ({\hat{q}}_{N} ∥ p_{(s, ξ)}) & = & D (p_{(\hat{s}, \hat{ξ})} ∥ p_{(s, ξ)}) \\ & = & D (p_{(\hat{s}, \hat{ξ})} ∥ p_{(\hat{s}, ξ)}) + D (p_{(\hat{s}, ξ)} ∥ p_{(s, ξ)}) \\ & \geq & D (p_{(\hat{s}, \hat{ξ})} ∥ p_{(\hat{s}, ξ)}) . \end{matrix}

In the second equality, the generalized Pythagorean theorem was used. Consequently,

min_{ξ \in B} D (p_{(\hat{s}, \hat{ξ})} ∥ p_{(s, ξ)}) \geq min_{ξ \in B} D (p_{(\hat{s}, \hat{ξ})} ∥ p_{(\hat{s}, ξ)})

for all

s \in D

, and equality holds if and only if

s = \hat{s}

. ☐
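The Pythagorean decomposition used in the proof can be checked numerically on a small example. The sketch below takes d = 2 with hypothetical parameter values. As in the previous conventions, the lists contain all d + 1 components of s and all d components of each ξ^{(a)}. It also checks that the second term reduces to the divergence between ŝ and s, which vanishes exactly when s = ŝ.

```python
import math


def kl(p, q):
    """Kullback-Leibler divergence D(p || q) for finite distributions."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def joint(s, xi):
    """Flatten p_(s, xi) into a probability vector on Omega."""
    return [s_a * x for s_a, xi_a in zip(s, xi) for x in xi_a]


# Hypothetical values for d = 2 (three bases, two outcomes each).
s_hat = [0.5, 0.3, 0.2]
xi_hat = [[0.9, 0.1], [0.8, 0.2], [0.6, 0.4]]
s = [0.2, 0.3, 0.5]
xi = [[0.7, 0.3], [0.6, 0.4], [0.5, 0.5]]

total = kl(joint(s_hat, xi_hat), joint(s, xi))
along_m = kl(joint(s_hat, xi_hat), joint(s_hat, xi))  # inside M(s_hat)
along_e = kl(joint(s_hat, xi), joint(s, xi))          # inside E(xi)
```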

The geometrical implication of Proposition 2 is illustrated in Figure 2. The MLE

p^{*} = p_{(\hat{s}, ξ^{*})}

is the

\nabla^{(m)}

-projection from the empirical distribution

p_{(\hat{s}, \hat{ξ})}

to

B

, and is on the section

M (\hat{s})

specified by the temporary estimate

\hat{s}

.

Now we arrive at a geometrical picture behind the parameter estimation based on randomized state tomography. Suppose we are given a temporary estimate

(\hat{s}, \hat{ξ})

with

\hat{ξ} \notin B

. Due to Proposition 2, we can restrict ourselves to section

M (\hat{s})

as the search space for the MLE

p^{*}

. Since each section

M (\hat{s})

is affinely isomorphic to the parameter space

Ξ

, we can introduce a dualistic structure

(\tilde{g}, {\tilde{\nabla}}^{(e)}, {\tilde{\nabla}}^{(m)})

on

Ξ

in the following way. Firstly, we identify the metric

\tilde{g}

with the Fisher metric g restricted to

M (\hat{s})

, that is,

\begin{matrix} {\tilde{g}}_{(\hat{s}, ξ)} (\frac{\partial}{\partial ξ_{α}^{(a)}}, \frac{\partial}{\partial ξ_{β}^{(b)}}) & = & \left. \frac{\partial η_{〈a^{'}, α^{'}〉}}{\partial ξ_{α}^{(a)}} \frac{\partial η_{〈b^{'}, β^{'}〉}}{\partial ξ_{β}^{(b)}} g_{(s, ξ)} (\frac{\partial}{\partial η_{〈a^{'}, α^{'}〉}}, \frac{\partial}{\partial η_{〈b^{'}, β^{'}〉}}) \right|_{s = \hat{s}} \\ & = & \left. s^{(a)} s^{(b)} \frac{\partial^{2} φ (η)}{\partial η_{〈a, α〉} \partial η_{〈b, β〉}} \right|_{s = \hat{s}} \\ & = & δ_{a b} {\hat{s}}^{(a)} (\frac{1}{ξ_{d}^{(a)}} + \frac{δ_{α β}}{ξ_{α}^{(a)}}), \end{matrix}

for

a, b \in {1, \dots, d + 1}

and

α, β \in {1, \dots, d - 1}

, where

{\hat{s}}^{(d + 1)}

and

ξ_{d}^{(a)}

are formally defined as

{\hat{s}}^{(d + 1)} : = 1 - \sum_{a = 1}^{d} {\hat{s}}^{(a)}, ξ_{d}^{(a)} : = 1 - \sum_{α = 1}^{d - 1} ξ_{α}^{(a)} .

Secondly, the mixture connection

{\tilde{\nabla}}^{(m)}

on

Ξ

is defined through the natural affine isomorphism between

M (\hat{s})

and

Ξ

. Finally, the dual connection

{\tilde{\nabla}}^{(e)}

is defined by the duality

\tilde{g} ({\tilde{\nabla}}_{X}^{(e)} Y, Z) : = X \tilde{g} (Y, Z) - \tilde{g} (Y, {\tilde{\nabla}}_{X}^{(m)} Z) .

Thus, the MLE

ξ^{*}

in the parameter space

Ξ

is interpreted as the

{\tilde{\nabla}}^{(m)}

-projection from

\hat{ξ}

to the physical domain B with respect to the metric

\tilde{g}

.
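To connect this projection with the likelihood, it may help to write the divergence on the section M(\hat{s}) explicitly. The following is a sketch that uses only the definitions above, with ξ_{d}^{(a)} and \hat{s}^{(d+1)} understood in the formal sense just introduced, together with the identity \hat{s}^{(a)} \hat{ξ}_{α}^{(a)} = n_{α}^{(a)} / N.

```latex
\begin{align*}
D\bigl(p_{(\hat{s},\hat{\xi})} \,\big\|\, p_{(\hat{s},\xi)}\bigr)
  &= \sum_{a=1}^{d+1} \hat{s}^{(a)} \sum_{\alpha=1}^{d}
     \hat{\xi}_{\alpha}^{(a)}
     \log \frac{\hat{\xi}_{\alpha}^{(a)}}{\xi_{\alpha}^{(a)}} \\
  &= -\frac{1}{N} \sum_{a=1}^{d+1} \sum_{\alpha=1}^{d}
     n_{\alpha}^{(a)} \log \xi_{\alpha}^{(a)}
     + (\text{terms independent of } \xi), \\
\left.
\frac{\partial^{2}}{\partial \xi_{\alpha}^{(a)} \partial \xi_{\beta}^{(b)}}
D\bigl(p_{(\hat{s},\hat{\xi})} \,\big\|\, p_{(\hat{s},\xi)}\bigr)
\right|_{\hat{\xi} = \xi}
  &= \delta_{ab}\, \hat{s}^{(a)}
     \left( \frac{1}{\xi_{d}^{(a)}}
          + \frac{\delta_{\alpha\beta}}{\xi_{\alpha}^{(a)}} \right).
\end{align*}
```

Minimizing the divergence over ξ \in B is therefore the same as maximizing the log-likelihood \sum_{a, α} n_{α}^{(a)} \log ξ_{α}^{(a)}, and the Hessian of the divergence at \hat{ξ} = ξ reproduces the metric \tilde{g}, in which the ath block is weighted by the observed fraction \hat{s}^{(a)} = N^{(a)} / N.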

4. Examples

In this section, we present some examples that demonstrate the implication of Proposition 2 as well as the general diagram given in Figure 2.

4.1. When $dim H = 2$

Let us first study the simplest case when

H = C^{2}

. A full set of MUBs is given by

\begin{matrix} \{| 1^{(1)} 〉, | 2^{(1)} 〉\} = \{\frac{1}{\sqrt{2}} (\begin{matrix} 1 \\ 1 \end{matrix}), \frac{1}{\sqrt{2}} (\begin{matrix} 1 \\ - 1 \end{matrix})\}, \\ \{| 1^{(2)} 〉, | 2^{(2)} 〉\} = \{\frac{1}{\sqrt{2}} (\begin{matrix} 1 \\ - i \end{matrix}), \frac{1}{\sqrt{2}} (\begin{matrix} 1 \\ i \end{matrix})\}, \\ \{| 1^{(3)} 〉, | 2^{(3)} 〉\} = \{(\begin{matrix} 1 \\ 0 \end{matrix}), (\begin{matrix} 0 \\ 1 \end{matrix})\} . \end{matrix}

With these bases, the parameter representation (1) becomes

ρ = \frac{1}{2} (\begin{matrix} 1 + x_{3} & x_{1} - i x_{2} \\ x_{1} + i x_{2} & 1 - x_{3} \end{matrix}),

where

x = (x_{1}, x_{2}, x_{3})

is the standard Stokes parameter, which is related to

ξ = (ξ_{1}^{(1)}, ξ_{1}^{(2)}, ξ_{1}^{(3)})

as

x_{a} = 2 ξ_{1}^{(a)} - 1

for

a = 1, 2, 3

.

Figure 3 demonstrates how the

{\tilde{\nabla}}^{(m)}

-projection is realized. Here, the trajectories of

{\tilde{\nabla}}^{(m)}

-projections that give the MLE

p^{*}

are plotted only on the

x_{1} x_{2}

-plane. The left and right panels correspond to the cases when

N^{(1)} : N^{(2)} = 1 : 1

and

N^{(1)} : N^{(2)} = 5 : 1

, respectively. The change of

x_{1}

-coordinate relative to the change of

x_{2}

-coordinate along each trajectory is less noticeable in the right panel than in the left panel. This is because a tomography with

N^{(1)} / N^{(2)} = 5

provides us with more information about

x_{1}

-coordinate, relative to

x_{2}

-coordinate, as compared with the case when

N^{(1)} / N^{(2)} = 1

.
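The qualitative behaviour described for Figure 3 can be reproduced with a crude numerical sketch. The code below is not the authors' procedure. It assumes that the problem is confined to the x_1 x_2-plane, that the temporary estimate lies in the open first quadrant outside the unit disk, and that the minimizer therefore lies on the first-quadrant arc of the unit circle. It then minimizes the weighted sum of binary Kullback-Leibler divergences by a plain grid search. The function names, the grid size and the temporary estimate (0.9, 0.9) are hypothetical.

```python
import math


def binary_kl(p, q):
    """D((p, 1 - p) || (q, 1 - q)) for 0 <= p <= 1 and 0 < q < 1."""
    total = 0.0
    for a, b in ((p, q), (1.0 - p, 1.0 - q)):
        if a > 0.0:
            total += a * math.log(a / b)
    return total


def mle_on_arc(x_hat, weights, grid=20000):
    """Grid search over the open first-quadrant arc of the unit circle.

    weights[a] plays the role of N^(a) (only the ratio matters), and
    xi_1^(a) = (1 + x_a) / 2 converts Stokes to probability coordinates.
    """
    best_cost, best_x = None, None
    for k in range(1, grid):
        theta = 0.5 * math.pi * k / grid
        x = (math.cos(theta), math.sin(theta))
        cost = sum(
            w * binary_kl((1.0 + xh) / 2.0, (1.0 + xc) / 2.0)
            for w, xh, xc in zip(weights, x_hat, x)
        )
        if best_cost is None or cost < best_cost:
            best_cost, best_x = cost, x
    return best_x


x_hat = (0.9, 0.9)                      # hypothetical temporary estimate
mle_equal = mle_on_arc(x_hat, (1, 1))   # N^(1) : N^(2) = 1 : 1
mle_skewed = mle_on_arc(x_hat, (5, 1))  # N^(1) : N^(2) = 5 : 1
```

With equal weights the corrected estimate stays on the diagonal. With the ratio 5 : 1 it keeps x_1 closer to the temporary value and moves x_2 further, in line with the discussion above.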

4.2. When $dim H = 3$

The space

H = C^{3}

admits a full set of MUBs; for example,

\begin{matrix} \{| 1^{(1)} 〉, | 2^{(1)} 〉, | 3^{(1)} 〉\} = \{(\begin{matrix} 1 \\ 0 \\ 0 \end{matrix}), (\begin{matrix} 0 \\ 1 \\ 0 \end{matrix}), (\begin{matrix} 0 \\ 0 \\ 1 \end{matrix})\}, \\ \{| 1^{(2)} 〉, | 2^{(2)} 〉, | 3^{(2)} 〉\} = \{\frac{1}{\sqrt{3}} (\begin{matrix} 1 \\ 1 \\ 1 \end{matrix}), \frac{1}{\sqrt{3}} (\begin{matrix} 1 \\ ω \\ ω^{2} \end{matrix}), \frac{1}{\sqrt{3}} (\begin{matrix} 1 \\ ω^{2} \\ ω \end{matrix})\}, \\ \{| 1^{(3)} 〉, | 2^{(3)} 〉, | 3^{(3)} 〉\} = \{\frac{1}{\sqrt{3}} (\begin{matrix} ω \\ 1 \\ 1 \end{matrix}), \frac{1}{\sqrt{3}} (\begin{matrix} 1 \\ ω \\ 1 \end{matrix}), \frac{1}{\sqrt{3}} (\begin{matrix} 1 \\ 1 \\ ω \end{matrix})\}, \\ \{| 1^{(4)} 〉, | 2^{(4)} 〉, | 3^{(4)} 〉\} = \{\frac{1}{\sqrt{3}} (\begin{matrix} ω^{2} \\ 1 \\ 1 \end{matrix}), \frac{1}{\sqrt{3}} (\begin{matrix} 1 \\ ω^{2} \\ 1 \end{matrix}), \frac{1}{\sqrt{3}} (\begin{matrix} 1 \\ 1 \\ ω^{2} \end{matrix})\}, \end{matrix}

where

ω = (- 1 + i \sqrt{3}) / 2

is a primitive third root of unity. With these bases, the parameter representation (1) becomes

ρ = (\begin{matrix} ξ_{1}^{(1)} & a_{12} - i b_{12} & a_{13} - i b_{13} \\ a_{12} + i b_{12} & ξ_{2}^{(1)} & a_{23} - i b_{23} \\ a_{13} + i b_{13} & a_{23} + i b_{23} & 1 - ξ_{1}^{(1)} - ξ_{2}^{(1)} \end{matrix}),

where

\begin{matrix} a_{12} = \frac{1}{2} (1 + ξ_{1}^{(2)} - ξ_{1}^{(3)} - ξ_{2}^{(3)} - ξ_{1}^{(4)} - ξ_{2}^{(4)}), \\ a_{13} = \frac{1}{2} (- 1 + ξ_{1}^{(2)} + ξ_{2}^{(3)} + ξ_{2}^{(4)}), \\ a_{23} = \frac{1}{2} (- 1 + ξ_{1}^{(2)} + ξ_{1}^{(3)} + ξ_{1}^{(4)}), \\ b_{12} = \frac{\sqrt{3}}{6} (1 - ξ_{1}^{(2)} - 2 ξ_{2}^{(2)} + ξ_{1}^{(3)} - ξ_{2}^{(3)} - ξ_{1}^{(4)} + ξ_{2}^{(4)}), \\ b_{13} = \frac{\sqrt{3}}{6} (- 1 + ξ_{1}^{(2)} + 2 ξ_{2}^{(2)} + 2 ξ_{1}^{(3)} + ξ_{2}^{(3)} - 2 ξ_{1}^{(4)} - ξ_{2}^{(4)}), \\ b_{23} = \frac{\sqrt{3}}{6} (1 - ξ_{1}^{(2)} - 2 ξ_{2}^{(2)} + ξ_{1}^{(3)} + 2 ξ_{2}^{(3)} - ξ_{1}^{(4)} - 2 ξ_{2}^{(4)}) . \end{matrix}

The physical domain B that corresponds to the state space

S (C^{3})

is a compact convex subset of the parameter space

Ξ (\subset R^{8})

, and the extreme points of B form an algebraic variety with respect to the parameters

ξ = (ξ_{1}^{(1)}, ξ_{2}^{(1)}, ξ_{1}^{(2)}, ξ_{2}^{(2)}, ξ_{1}^{(3)}, ξ_{2}^{(3)}, ξ_{1}^{(4)}, ξ_{2}^{(4)}) .

A numerical example of a

{\tilde{\nabla}}^{(m)}

-projection that gives the MLE is illustrated in Figure 4, where no probe particle is lost, that is, when

\hat{s} = (\frac{1}{4}, \frac{1}{4}, \frac{1}{4}, \frac{1}{4}) .

In Figure 4, the dot lying outside the greyish region indicates the empirical distribution, i.e., the temporary estimate

\hat{ξ} = (0.100, 0.100, 0.066, 0.333, 0.333, 0.333, 0.333, 0.333),

and the corresponding MLE is

ξ_{*} = (0.122, 0.122, 0.108, 0.329, 0.299, 0.327, 0.327, 0.299) .

Furthermore, the greyish region represents the physical domain B cut by a two-dimensional affine subspace of

Ξ

specified by the equation

ξ = (1 - s) \hat{ξ} + s ξ_{*} + t v .

The vector v was chosen randomly under the condition that

v ⊥ \hat{ξ} - ξ_{*} and ∥ v ∥ = ∥ \hat{ξ} - ξ_{*} ∥,

where the orthogonality ⊥ and the norm

∥ \cdot ∥

are understood relative to the standard Euclidean structure of

R^{8}

. In Figure 4, the vector v was taken to be

v_{1} = (- 0.036, - 0.038, 0.012, - 0.026, - 0.038, 0.011, 0.002, 0.005)

in the left panel, and

v_{2} = (0.028, 0.000, - 0.006, 0.034, - 0.024, - 0.022, 0.034, 0.030)

in the right panel.
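One way to draw a vector v satisfying the two conditions above is to project a random vector onto the orthogonal complement of \hat{ξ} - ξ_{*} and then rescale it. The sketch below does this with Gaussian components. The choice of distribution, the seed and the function name are assumptions, since the text does not state how v was sampled.

```python
import math
import random


def random_orthogonal_direction(u, rng):
    """Random vector orthogonal to u with the same Euclidean norm as u."""
    g = [rng.gauss(0.0, 1.0) for _ in u]
    uu = sum(c * c for c in u)
    # Remove the component of g along u (one Gram-Schmidt step).
    coef = sum(a * b for a, b in zip(g, u)) / uu
    w = [a - coef * b for a, b in zip(g, u)]
    # Rescale so that the result has the same norm as u.
    scale = math.sqrt(uu) / math.sqrt(sum(c * c for c in w))
    return [scale * c for c in w]


xi_hat = (0.100, 0.100, 0.066, 0.333, 0.333, 0.333, 0.333, 0.333)
xi_star = (0.122, 0.122, 0.108, 0.329, 0.299, 0.327, 0.327, 0.299)
u = [a - b for a, b in zip(xi_hat, xi_star)]
v = random_orthogonal_direction(u, random.Random(0))
```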

Figure 4 also demonstrates that the sections of the physical domain B show a variety of shapes. Unfortunately, due to this asymmetry of B, we were unable to find a (nontrivial) two-dimensional affine subspace on which every

{\tilde{\nabla}}^{(m)}

-projection runs. This difficulty stands in sharp contrast to the simplest case

H ≃ C^{2}

, where the set B is rotationally symmetric and the

{\tilde{\nabla}}^{(m)}

-projections can be displayed on any two-dimensional section of B that passes through the origin of B, as in Figure 3.

4.3. When $dim H \geq 4$

The space

H = C^{4}

is also known to admit a full set of MUBs since

dim H = 4

is the second power of the prime number 2; for example [22],

\begin{matrix} \{| 1^{(1)} 〉, | 2^{(1)} 〉, | 3^{(1)} 〉, | 4^{(1)} 〉\} = \{(\begin{matrix} 1 \\ 0 \\ 0 \\ 0 \end{matrix}), (\begin{matrix} 0 \\ 1 \\ 0 \\ 0 \end{matrix}), (\begin{matrix} 0 \\ 0 \\ 1 \\ 0 \end{matrix}), (\begin{matrix} 0 \\ 0 \\ 0 \\ 1 \end{matrix})\}, \\ \{| 1^{(2)} 〉, | 2^{(2)} 〉, | 3^{(2)} 〉, | 4^{(2)} 〉\} = \{\frac{1}{2} (\begin{matrix} 1 \\ 1 \\ 1 \\ 1 \end{matrix}), \frac{1}{2} (\begin{matrix} 1 \\ 1 \\ - 1 \\ - 1 \end{matrix}), \frac{1}{2} (\begin{matrix} 1 \\ - 1 \\ - 1 \\ 1 \end{matrix}), \frac{1}{2} (\begin{matrix} 1 \\ - 1 \\ 1 \\ - 1 \end{matrix})\}, \\ \{| 1^{(3)} 〉, | 2^{(3)} 〉, | 3^{(3)} 〉, | 4^{(3)} 〉\} = \{\frac{1}{2} (\begin{matrix} 1 \\ - 1 \\ - i \\ - i \end{matrix}), \frac{1}{2} (\begin{matrix} 1 \\ - 1 \\ i \\ i \end{matrix}), \frac{1}{2} (\begin{matrix} 1 \\ 1 \\ i \\ - i \end{matrix}), \frac{1}{2} (\begin{matrix} 1 \\ 1 \\ - i \\ i \end{matrix})\}, \\ \{| 1^{(4)} 〉, | 2^{(4)} 〉, | 3^{(4)} 〉, | 4^{(4)} 〉\} = \{\frac{1}{2} (\begin{matrix} 1 \\ - i \\ - i \\ - 1 \end{matrix}), \frac{1}{2} (\begin{matrix} 1 \\ - i \\ i \\ 1 \end{matrix}), \frac{1}{2} (\begin{matrix} 1 \\ i \\ i \\ - 1 \end{matrix}), \frac{1}{2} (\begin{matrix} 1 \\ i \\ - i \\ 1 \end{matrix})\}, \\ \{| 1^{(5)} 〉, | 2^{(5)} 〉, | 3^{(5)} 〉, | 4^{(5)} 〉\} = \{\frac{1}{2} (\begin{matrix} 1 \\ - i \\ - 1 \\ - i \end{matrix}), \frac{1}{2} (\begin{matrix} 1 \\ - i \\ 1 \\ i \end{matrix}), \frac{1}{2} (\begin{matrix} 1 \\ i \\ - 1 \\ i \end{matrix}), \frac{1}{2} (\begin{matrix} 1 \\ i \\ 1 \\ - i \end{matrix})\} . \end{matrix}

It is straightforward to calculate the parameter representation (1) of a state

ρ \in S (C^{4})

; however, the corresponding density matrix is rather complicated, and we do not display it here.

When

H = C^{6}

, or more generally, when

dim H

is not a power of a prime, we do not know whether

H

admits a full set of MUBs. Let us touch upon a situation where a Hilbert space

H

, if such a space exists, does not admit a full set of MUBs. In this case, there is no measurement basis

M^{(a)}

that allows a parametrization

ξ

of the state space

S (H)

having a direct connection to the probability distribution of the outcomes as in (2). Such a situation is comparable to the case when the Gell-Mann matrices [23] are used as the measurement basis for estimating an unknown state on

H = C^{3}

. A state

ρ \in S (C^{3})

is represented as

ρ = ρ_{x} : = \frac{1}{3} (I + \sqrt{3} \sum_{i = 1}^{8} x_{i} λ_{i}),

where

λ_{1}, \dots, λ_{8}

are the Gell-Mann matrices, and

x = (x_{1}, \dots, x_{8})

is a set of real parameters. The physical domain

B = \{x \in R^{8}| ρ_{x} \geq 0\}

forms a compact convex subset of the unit ball in

R^{8}

. With the state

ρ_{x}

, the probability distribution of obtaining the eigenvalues

(- 1, 0, 1)

of the observable

λ_{1}

is

(\frac{1 - \sqrt{3} x_{1} + x_{8}}{3}, \frac{1 - 2 x_{8}}{3}, \frac{1 + \sqrt{3} x_{1} + x_{8}}{3}),

while the probability distribution of obtaining the eigenvalues

(- 1, 0, 1)

of the observable

λ_{2}

is

(\frac{1 - \sqrt{3} x_{2} + x_{8}}{3}, \frac{1 - 2 x_{8}}{3}, \frac{1 + \sqrt{3} x_{2} + x_{8}}{3}) .

Note that the probability of obtaining the eigenvalue 0 of

λ_{1}

is identical to that of

λ_{2}

. However, in a randomized estimation scheme in which

λ_{i}

is measured

N_{i}

times, the frequency of obtaining the eigenvalue 0 of

λ_{1}

would be different from that of

λ_{2}

. Consequently, one cannot assign a consistent temporary estimate

{\hat{x}}_{8}

for the parameter

x_{8}

in that case. Put differently, the empirical distribution

{\hat{q}}_{N}

on the extended outcome space

Ω

does not in general have a coordinate representation (4). Thus, the existence of a full set of MUBs is crucial in our analysis.
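The inconsistency can be made concrete by inverting the probability (1 - 2 x_8)/3 of the outcome 0 separately for the two observables. The counts below are hypothetical and the function name is an assumption. The point is only that two finite samples generally give two different values for the same parameter.

```python
def x8_from_zero_frequency(n_zero, n_total):
    """Invert P(outcome 0) = (1 - 2 * x_8) / 3 using the observed frequency."""
    return (1.0 - 3.0 * n_zero / n_total) / 2.0


# Hypothetical data: lambda_1 measured 100 times, lambda_2 measured 60 times.
x8_from_lambda1 = x8_from_zero_frequency(n_zero=30, n_total=100)
x8_from_lambda2 = x8_from_zero_frequency(n_zero=21, n_total=60)
```

Here the first sample suggests x_8 = 0.05 and the second x_8 = -0.025, so no single temporary estimate \hat{x}_{8} reproduces both observed frequencies.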

5. Concluding Remarks

In the present paper, we explored an information geometrical structure of the randomized quantum state tomography, assuming that the Hilbert space under consideration admits a full set of MUBs. We first introduced a classical statistical model

\{p_{(s, ξ)}\}_{s, ξ}

on an extended sample space

Ω

, and found that the probability simplex

P (Ω)

was decomposed into a mutually orthogonal dualistic foliation (Proposition 1). We then clarified that this geometrical structure had statistical importance in estimating the coordinate

ξ

of an unknown quantum state

ρ (ξ)

under the existence of the nuisance parameter s (Proposition 2). This result gave a generalized insight into the

\nabla^{(m)}

-projection interpretation for the MLE in that a similar interpretation was still valid for the randomized quantum state tomography by changing the standard Fisher metric into a deformed one. It also provided us with a new, convenient way of data processing in the actual quantum state tomography that may involve unexpected probe particle loss.

It should be noted that the existence of a full set of MUBs ensures the parametrization (1) of the quantum state space

S (H)

. Such a parametrization is distinctive in that it enables a direct correspondence between the parameter space and the probability simplex, realizing the coordinate representation (4) of the empirical distribution

{\hat{q}}_{N}

. Thus, the use of a full set of MUBs is crucial in our analysis. Nevertheless, it is often the case that the Hilbert space under consideration takes the form

H ≃ {(C^{p})}^{\otimes n}

for

p = 2

or 3 because qubits or qutrits are often regarded as building blocks of various quantum protocols. Therefore, the existence of a full set of MUBs would not be too strong a requirement in applications.

Author Contributions

The authors contributed equally to this work.

Funding

The present study was supported by JSPS KAKENHI Grant Numbers JP22340019 and JP17H02861.

Acknowledgments

The authors are grateful to Ryo Okamoto and Shigeki Takeuchi for helpful discussions.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

MUBs	mutually unbiased bases
PVM	projection-valued measure
MLE	maximum likelihood estimate

Appendix A. Information Geometry of MLE

Let

P (Ω)

denote the set of probability distributions on a finite sample space

Ω

, i.e.,

P (Ω) := \{p : Ω \to \mathbb{R} \mid p (ω) > 0 \text{ for all } ω \in Ω, \text{ and } \sum_{ω \in Ω} p (ω) = 1\} .

This set may be identified with the

(| Ω | - 1)

-dimensional (open) simplex, where

| Ω |

denotes the number of elements in

Ω

, and thus it is sometimes referred to as the probability simplex on

Ω

. The set

P (Ω)

is also regarded as a statistical manifold endowed with the dualistic structure

(g, \nabla^{(e)}, \nabla^{(m)})

, where g is the Fisher metric, and

\nabla^{(e)}

and

\nabla^{(m)}

are the exponential and mixture connections [11,12,13].

Suppose that the state of the physical system at hand belongs to a (closed) subset

M

of

P (Ω)

, but we do not know which is the true state. We further assume that the probability distributions of

M

are faithfully parametrized by a finite dimensional parameter

θ

as

M = {p_{θ} (ω) | θ \in Θ} .

In this case,

M

is called a parametric model, and our task is to estimate the true value of the parameter

θ

that specifies the true state. Suppose that, by n independent experiments, we have obtained data

(x_{1}, x_{2}, \dots, x_{n}) \in Ω^{n}

. This information is compressed into the empirical distribution, an element of

P (Ω)

defined by

\begin{aligned} {\hat{q}}_{n} (ω) &:= \frac{\text{number of occurrences of } ω \text{ in the data } (x_{1}, x_{2}, \dots, x_{n})}{n} \\ &= \frac{1}{n} \sum_{i = 1}^{n} δ_{x_{i}} (ω) \end{aligned}

for each

ω \in Ω

, where

δ_{x_{i}} (ω)

is the Kronecker delta. If

{\hat{q}}_{n}

belongs to the model

M

, then we have an estimate

{\hat{θ}}_{n}

that satisfies

p_{{\hat{θ}}_{n}} = {\hat{q}}_{n}

. However, the empirical distribution

{\hat{q}}_{n}

does not always belong to the model

M

. When

{\hat{q}}_{n} \notin M

, we need to find an alternative estimate from the data. A canonical method of finding an alternative estimate

p_{{\hat{θ}}_{n}} \in M

is the maximum likelihood method, in which one seeks the maximizer of the likelihood function

θ ⟼ p_{θ} (x_{1}) p_{θ} (x_{2}) \dots p_{θ} (x_{n}),

within the domain

Θ

of the parameter

θ

, so that

{\hat{θ}}_{n} : = \underset{θ \in Θ}{\arg \max} \{p_{θ} (x_{1}) p_{θ} (x_{2}) \dots p_{θ} (x_{n})\} .

We can rewrite this relation as follows.

\begin{aligned} {\hat{θ}}_{n} &= \underset{θ \in Θ}{\arg \max} \frac{1}{n} \sum_{i = 1}^{n} \log p_{θ} (x_{i}) \\ &= \underset{θ \in Θ}{\arg \max} \sum_{ω \in Ω} {\hat{q}}_{n} (ω) \log p_{θ} (ω) \\ &= \underset{θ \in Θ}{\arg \min} \sum_{ω \in Ω} {\hat{q}}_{n} (ω) \{\log {\hat{q}}_{n} (ω) - \log p_{θ} (ω)\} \\ &= \underset{θ \in Θ}{\arg \min} D ({\hat{q}}_{n} ∥ p_{θ}), \end{aligned}

where

D (q ∥ p) := \sum_{ω \in Ω} q (ω) \log \frac{q (ω)}{p (ω)}

is the Kullback-Leibler divergence from q to p. In other words, the maximum likelihood estimate (MLE)

p_{{\hat{θ}}_{n}}

is the point on

M

that is “closest” to the empirical distribution

{\hat{q}}_{n}

as measured by the Kullback-Leibler divergence:

p_{{\hat{θ}}_{n}} = \underset{p \in M}{\arg \min} D ({\hat{q}}_{n} ∥ p) .
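The equivalence between maximizing the likelihood and minimizing the Kullback-Leibler divergence can be checked numerically. The following is a minimal sketch, not part of the original analysis: the one-parameter model on a three-point sample space, the data, and the names `model`, `empirical_distribution`, and `kl_divergence` are all hypothetical choices made for illustration, and the optimization is a plain grid search. Terms with zero empirical frequency are dropped from the divergence, following the convention 0 log 0 = 0.

```python
import numpy as np

def empirical_distribution(data, n_outcomes):
    """Relative frequency of each outcome omega in the data."""
    counts = np.bincount(data, minlength=n_outcomes)
    return counts / len(data)

def model(theta):
    """Hypothetical parametric model p_theta on Omega = {0, 1, 2}."""
    return np.array([(1 - theta) ** 2, 2 * theta * (1 - theta), theta ** 2])

def log_likelihood(theta, data):
    """Sum of log p_theta(x_i) over the observed data."""
    return np.sum(np.log(model(theta)[data]))

def kl_divergence(q, p):
    """D(q || p), skipping outcomes with q(omega) = 0."""
    mask = q > 0
    return np.sum(q[mask] * np.log(q[mask] / p[mask]))

# Illustrative data: outcome 0 three times, 1 once, 2 six times.
data = np.array([0, 2, 2, 1, 2, 0, 2, 2, 0, 2])
q_hat = empirical_distribution(data, 3)

# Grid search over the parameter domain (0, 1).
thetas = np.linspace(0.01, 0.99, 99)
theta_ml = thetas[np.argmax([log_likelihood(t, data) for t in thetas])]
theta_kl = thetas[np.argmin([kl_divergence(q_hat, model(t)) for t in thetas])]
```

In this sketch the empirical distribution does not lie on the model, yet both searches select the same grid point, as the chain of equalities above predicts.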

Due to the generalized Pythagorean theorem, the MLE is geometrically understood as the

\nabla^{(m)}

-projection from

{\hat{q}}_{n}

to

M

or its boundary, as illustrated in Figure A1.

Figure A1. The maximum likelihood estimate

p_{{\hat{θ}}_{n}}

is the minimizer of the function

p \mapsto D ({\hat{q}}_{n} ∥ p)

with respect to

p \in M

, and is also understood as the

\nabla^{(m)}

-projection from the empirical distribution

{\hat{q}}_{n}

to

M

or its boundary.


Appendix B. Parametrization of $S (H)$

Suppose that the Hilbert space

H ≃ C^{d}

under consideration admits a full set of MUBs

{\{| α^{(a)} ⟩\}}_{α \in \{1, \dots, d\}}, \quad (a = 1, \dots, d + 1) .

For each

a \in {1, \dots, d + 1}

, let

M^{(a)} := \{M_{1}^{(a)}, \dots, M_{d}^{(a)}\} = \{| 1^{(a)} ⟩ ⟨ 1^{(a)} |, \dots, | d^{(a)} ⟩ ⟨ d^{(a)} |\} .

Then, the operators

{\{M_{α}^{(a)} - \frac{I}{d}\}}_{(a, α) \in {1, \dots, d + 1} \times {1, \dots, d - 1}}

are linearly independent and, being (d + 1)(d - 1) = d^2 - 1 in number, span the real space of self-adjoint operators with zero trace. This is easily seen from the orthogonality relation:

Tr (M_{α}^{(a)} - \frac{I}{d}) (M_{β}^{(b)} - \frac{I}{d}) = δ_{a b} (δ_{α β} - \frac{1}{d}) .
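To see why this relation implies linear independence, note that the Gram matrix of the operators is block diagonal in a, and each (d - 1) × (d - 1) block has entries δ_{α β} - 1/d, which is positive definite with smallest eigenvalue 1/d. The following sketch checks this numerically for d = 3. It rests on assumptions not made in the text: the helper `mub_projectors` is a hypothetical name, and it builds the MUBs by the standard quadratic-phase construction for odd prime d (the computational basis together with the d bases whose vectors have components ω^{k m^2 + j m} / \sqrt{d}).

```python
import numpy as np

def mub_projectors(d):
    """Projectors M[a][alpha] of a full set of d + 1 MUBs, for odd prime d.

    Assumption: quadratic-phase construction, chosen here for illustration.
    """
    omega = np.exp(2j * np.pi / d)
    m = np.arange(d)
    bases = [np.eye(d, dtype=complex)]  # computational basis, vectors as columns
    for k in range(d):
        # Column j has components omega**(k*m**2 + j*m) / sqrt(d).
        exponents = (k * m[:, None] ** 2 + m[:, None] * m[None, :]) % d
        bases.append(omega ** exponents / np.sqrt(d))
    return [[np.outer(B[:, j], B[:, j].conj()) for j in range(d)] for B in bases]

d = 3
M = mub_projectors(d)

# Traceless operators M_alpha^(a) - I/d for alpha = 1, ..., d - 1.
ops = [M[a][al] - np.eye(d) / d for a in range(d + 1) for al in range(d - 1)]

# Gram matrix with respect to the Hilbert-Schmidt inner product Tr(A B).
gram = np.array([[np.trace(A @ B).real for B in ops] for A in ops])

# Prediction of the orthogonality relation: delta_ab * (delta_alpha_beta - 1/d).
block = np.eye(d - 1) - np.ones((d - 1, d - 1)) / d
expected = np.kron(np.eye(d + 1), block)

rank = np.linalg.matrix_rank(gram)
smallest_eigenvalue = np.linalg.eigvalsh(gram).min()
```

A full-rank Gram matrix of size d^2 - 1 confirms that the operators form a basis of the traceless self-adjoint operators in this example.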

Thus, given

ρ \in S (H)

, the operator

ρ - (I / d)

is uniquely expanded as

ρ - \frac{I}{d} = \sum_{a = 1}^{d + 1} \sum_{α = 1}^{d - 1} x_{α}^{(a)} (M_{α}^{(a)} - \frac{I}{d}),

where

x_{α}^{(a)}

are real numbers. We can regard

x_{α}^{(a)}

as a coordinate system of the state space

S (H)

. When

d = 2

, this is identical to the Stokes parametrization, up to a factor of 2.

Now, let us change the coordinate system

x_{α}^{(a)}

into

ξ_{α}^{(a)}

as

x_{α}^{(a)} = ξ_{α}^{(a)} + (\sum_{β = 1}^{d - 1} ξ_{β}^{(a)}) - 1 .

We then arrive at the parametrization (1), i.e.,

ρ = \sum_{a = 1}^{d + 1} \{\sum_{α = 1}^{d - 1} ξ_{α}^{(a)} M_{α}^{(a)} + (1 - \sum_{β = 1}^{d - 1} ξ_{β}^{(a)}) M_{d}^{(a)}\} - I .

This parametrization is useful in our analysis because it gives a direct connection to the probability distribution of outcomes of the measurement

M^{(a)}

as

p_{α}^{(a)} := Tr ρ M_{α}^{(a)} = \begin{cases} ξ_{α}^{(a)}, & \text{for } α = 1, \dots, d - 1, \\ 1 - \sum_{β = 1}^{d - 1} ξ_{β}^{(a)}, & \text{for } α = d . \end{cases}
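The correspondence between the coordinates ξ and the outcome probabilities can be verified numerically. The sketch below is illustrative only and makes several assumptions beyond the text: it takes d = 5, builds the MUBs with a hypothetical helper `mub_projectors` based on the standard quadratic-phase construction for odd prime d, and draws the coordinates at random. The resulting operator ρ has unit trace and reproduces the probabilities for any such ξ, although it is a genuine quantum state only when ξ lies in the physical domain.

```python
import numpy as np

def mub_projectors(d):
    """Projectors M[a][alpha] of a full set of d + 1 MUBs, for odd prime d.

    Assumption: quadratic-phase construction, chosen here for illustration.
    """
    omega = np.exp(2j * np.pi / d)
    m = np.arange(d)
    bases = [np.eye(d, dtype=complex)]  # computational basis, vectors as columns
    for k in range(d):
        exponents = (k * m[:, None] ** 2 + m[:, None] * m[None, :]) % d
        bases.append(omega ** exponents / np.sqrt(d))
    return [[np.outer(B[:, j], B[:, j].conj()) for j in range(d)] for B in bases]

def operator_from_xi(xi, M):
    """Evaluate the parametrization (1) for coordinates xi of shape (d + 1, d - 1)."""
    d = len(M) - 1
    rho = -np.eye(d, dtype=complex)
    for a in range(d + 1):
        # Probabilities of measurement a: the last one is fixed by normalization.
        p = np.append(xi[a], 1.0 - xi[a].sum())
        rho += sum(p[al] * M[a][al] for al in range(d))
    return rho

d = 5
M = mub_projectors(d)

# Random coordinates: the first d - 1 entries of a probability vector per basis.
rng = np.random.default_rng(0)
xi = rng.dirichlet(np.ones(d), size=d + 1)[:, : d - 1]

rho = operator_from_xi(xi, M)

# Outcome probabilities Tr(rho M_alpha^(a)) for every basis a and outcome alpha.
probs = np.array([[np.trace(rho @ M[a][al]).real for al in range(d)]
                  for a in range(d + 1)])
```

Here `probs[a, :d-1]` should coincide with `xi[a]`, and the last column with one minus the row sum of `xi`, which is the content of the displayed formula.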

References

1. Nielsen, M.A.; Chuang, I.L. Quantum Computation and Quantum Information; Cambridge University Press: Cambridge, UK, 2000.
2. Lehmann, E.L.; Casella, G. Theory of Point Estimation, 2nd ed.; Springer: New York, NY, USA, 1998.
3. Helstrom, C.W. Quantum Detection and Estimation Theory; Academic Press: New York, NY, USA, 1976.
4. Holevo, A.S. Probabilistic and Statistical Aspects of Quantum Theory; North-Holland: Amsterdam, The Netherlands, 1982.
5. Hradil, Z. Quantum-State Estimation. Phys. Rev. A 1997, 55, R1561–R1564.
6. Banaszek, K.; D’Ariano, G.M.; Paris, M.G.A.; Sacchi, M.F. Maximum-likelihood estimation of the density matrix. Phys. Rev. A 1999, 61, 010304.
7. Hradil, Z.; Summhammer, J.; Badurek, G.; Rauch, H. Reconstruction of the spin state. Phys. Rev. A 2000, 62, 014101.
8. James, D.F.V.; Kwiat, P.G.; Munro, W.J.; White, A.G. Measurement of qubits. Phys. Rev. A 2001, 64, 052312.
9. De Burgh, M.D.; Langford, N.K.; Doherty, A.C.; Gilchrist, A. Choice of measurement sets in qubit tomography. Phys. Rev. A 2008, 78, 052122.
10. Blume-Kohout, R. Optimal, reliable estimation of quantum states. New J. Phys. 2010, 12, 043034.
11. Amari, S.-I.; Nagaoka, H. Methods of Information Geometry; Translations of Mathematical Monographs 191; AMS and Oxford: Providence, RI, USA, 2000.
12. Amari, S.-I. Differential-Geometrical Methods in Statistics; Lecture Notes in Statistics 28; Springer: Berlin, Germany, 1985.
13. Murray, M.K.; Rice, J.W. Differential Geometry and Statistics; Chapman & Hall: London, UK, 1993.
14. Fujiwara, A.; Yamagata, K. Data processing for qubit state tomography: An information geometric approach. arXiv, 2016; arXiv:1608.07983.
15. Fraïsse, J.M.E.; Braun, D. Quantum channel-estimation with particle loss: GHZ versus W states. Quantum Meas. Quantum Metrol. 2016, 3, 53.
16. Durt, T.; Englert, B.-G.; Bengtsson, I.; Życzkowski, K. On mutually unbiased bases. Int. J. Quantum Inf. 2010, 8, 535–640.
17. Yuan, H.; Zhou, Z.; Guo, G. Quantum state tomography via mutually unbiased measurements in driven cavity QED systems. New J. Phys. 2016, 18, 043013.
18. Wootters, W.K.; Fields, B.D. Optimal state-determination by mutually unbiased measurements. Ann. Phys. 1989, 191, 363–381.
19. Bengtsson, I. Three ways to look at mutually unbiased bases. arXiv, 2006; arXiv:quant-ph/0610216.
20. Ivanović, I.D. Geometrical description of quantal state determination. J. Phys. A 1981, 14, 3241–3245.
21. Yamagata, K. Efficiency of quantum state tomography for qubits. Int. J. Quantum Inf. 2011, 9, 1167–1183.
22. Klappenecker, A.; Rötteler, M. Constructions of mutually unbiased bases. arXiv, 2003; arXiv:quant-ph/0309120.
23. Gell-Mann, M. Symmetries of baryons and mesons. Phys. Rev. 1962, 125, 1067.

Figure 1. Mutually orthogonal dualistic foliation of

P (Ω)

based on

M (s)

and

E (ξ)

. Each section

M (s)

is affinely isomorphic to the parameter space

Ξ

. The greyish cylindrical area indicates the subset

B = {p_{(s, ξ)} | s \in D, ξ \in B}

of

P (Ω)

. In particular, for each

s \in D

, the intersection

M (s) \cap B

is affinely isomorphic to the physical domain B that corresponds to the state space

S (H)

.


Figure 2. The maximum likelihood method in the framework of randomized tomography. Given a temporary estimate

(\hat{s}, \hat{ξ})

with

\hat{ξ} \notin B

, we can restrict ourselves to the section

M (\hat{s})

as the search space for the MLE

p^{*}

, and

p^{*} = p_{(\hat{s}, ξ^{*})}

is the

\nabla^{(m)}

-projection from the empirical distribution

p_{(\hat{s}, \hat{ξ})}

to

B

on the section

M (\hat{s})

.


Figure 3. The trajectories of

{\tilde{\nabla}}^{(m)}

-projections on the Stokes parameter space when

N^{(1)} : N^{(2)} = 1 : 1

(left) and

N^{(1)} : N^{(2)} = 5 : 1

(right). The greyish disk represents the Bloch ball B.


Figure 4. A trajectory of

{\tilde{\nabla}}^{(m)}

-projection displayed on randomly chosen two-dimensional affine subspaces of

Ξ

to which both the empirical distribution (marked as a dot) and the MLE belong. The greyish region represents the physical domain B.


© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Fujiwara, A.; Yamagata, K. Information Geometry of Randomized Quantum State Tomography. Entropy 2018, 20, 609. https://doi.org/10.3390/e20080609

