1. Introduction
In this introduction we shall present a general discussion with detailed definitions and results given in later sections. A coarse-graining of an observable A is an imprecise version of A [1,2]. Generally speaking, coarse-graining means a reduction in the statistical description of a system. If A and B are observables, we say that B is a coarse-graining of A if the probability distribution of B is an affine function of the probability distribution of A. A specific type of coarse-graining is when B is an unsharp or fuzzy version of A [3,4,5].
In Section 2, we discuss a coarse-graining of probability measures that involves stochastic kernels. Since probability measures are the classical counterparts of quantum states, we can consider this as pertaining to coarse-graining in classical physics. This work is applied in Section 3 to studying coarse-graining of quantum observables and instruments. Section 3 begins with an application for the dynamics of a quantum system described by a strongly continuous unitary group. We then discuss parts and discretizations of observables and show that any two discretizations of observables coexist. Section 4 discusses finite observables and shows that in this case coarse-graining is the same as post-processing [5,6]. Rank 1, sharp and atomic observables are considered next. Sequential products and conditioned observables are treated. The concepts of this section are illustrated with the example of finite position and momentum observables where a crucial role is played by the finite Fourier transform. Section 5 is more speculative than the previous sections and we do not arrive at many definite conclusions. This section discusses symmetric informationally complete (SIC) observables. An important unsolved problem is whether SIC observables exist for every finite dimensional Hilbert space [5]. It is not even known whether high dimensional SIC observables exist. Applying some of the work in Section 4, we propose a possible method for attacking this problem. Unfortunately, we have not been able to complete the method and leave this for future work.
2. Coarse-Graining of Measures
We denote the set of probability measures on a measurable space $(\Omega,\mathcal{A})$ by $\mathcal{M}(\Omega)$. Let $(\Omega_1,\mathcal{A}_1)$, $(\Omega_2,\mathcal{A}_2)$ be measurable spaces. A map $v\colon\Omega_1\times\mathcal{A}_2\to[0,1]$ that satisfies: $x\mapsto v(x,B)$ is measurable for all $B\in\mathcal{A}_2$ and $v(x,\cdot)\in\mathcal{M}(\Omega_2)$ for all $x\in\Omega_1$ is called a stochastic kernel [5]. If v is a stochastic kernel, define $V\colon\mathcal{M}(\Omega_1)\to\mathcal{M}(\Omega_2)$ by $(V\mu)(B)=\int v(x,B)\,d\mu(x)$. We call v the stochastic kernel for V and we say that $V\mu$ is a coarse-graining of $\mu$. We think of $V\mu$ as an imprecise version of $\mu$ on $(\Omega_2,\mathcal{A}_2)$. Notice that V is an affine map because if $\mu_1,\mu_2\in\mathcal{M}(\Omega_1)$, $\lambda\in[0,1]$, then
$$V\big(\lambda\mu_1+(1-\lambda)\mu_2\big)(B)=\lambda(V\mu_1)(B)+(1-\lambda)(V\mu_2)(B)$$
for all $B\in\mathcal{A}_2$. Moreover, if $f\colon\Omega_2\to\mathbb{R}$ is measurable, then
$$\int f\,d(V\mu)=\int\!\!\int f(y)\,v(x,dy)\,d\mu(x).$$
Example 1. The map $v(x,B)=\chi_B(x)$ is a stochastic kernel from $(\Omega,\mathcal{A})$ to $(\Omega,\mathcal{A})$. The corresponding coarse-graining map satisfies $(V\mu)(B)=\int\chi_B(x)\,d\mu(x)=\mu(B)$ for all $B\in\mathcal{A}$. Hence, $V\mu=\mu$ so V is the identity map.
Example 2. Let $\nu\in\mathcal{M}(\Omega_2)$ and let $v\colon\Omega_1\times\mathcal{A}_2\to[0,1]$ be defined by $v(x,B)=\nu(B)$ for all $x\in\Omega_1$, $B\in\mathcal{A}_2$. Then v is a stochastic kernel and the corresponding coarse-graining map is $(V\mu)(B)=\int\nu(B)\,d\mu(x)=\nu(B)$. Hence, V is the constant map $V\mu=\nu$.
We define the Dirac measure at x on $(\Omega,\mathcal{A})$ by $\delta_x(A)=\chi_A(x)$, where $\delta_x(A)=1$ if and only if $x\in A$.
Lemma 1. (a) If v is a stochastic kernel for V then $v(x,B)=(V\delta_x)(B)$. (b) If V has a stochastic kernel v, then v is unique.
Proof. (a) If v is a stochastic kernel for V, then
$$(V\delta_x)(B)=\int v(y,B)\,d\delta_x(y)=v(x,B).$$
(b) follows from (a). □
It can be shown that an arbitrary affine map $V\colon\mathcal{M}(\Omega_1)\to\mathcal{M}(\Omega_2)$ need not have a stochastic kernel and hence need not be a coarse-graining. One way to accomplish this is to construct such a map V where $x\mapsto(V\delta_x)(B)$ is not measurable for some $B\in\mathcal{A}_2$. We leave the details of this to the reader. Then V does not have a stochastic kernel v because if it did, then $v(x,B)=(V\delta_x)(B)$ by Lemma 1(a), so $x\mapsto v(x,B)$ is not measurable for some $B\in\mathcal{A}_2$ which is a contradiction.
Let $(\Omega_i,\mathcal{A}_i)$, $i=1,2,3$, be measurable spaces and let $v_1\colon\Omega_1\times\mathcal{A}_2\to[0,1]$, $v_2\colon\Omega_2\times\mathcal{A}_3\to[0,1]$ be stochastic kernels. Define $v_2\circ v_1\colon\Omega_1\times\mathcal{A}_3\to[0,1]$ by $(v_2\circ v_1)(x,B)=\int v_2(y,B)\,v_1(x,dy)$. Then $v_2\circ v_1$ is a stochastic kernel.
Lemma 2. Let $V_1\colon\mathcal{M}(\Omega_1)\to\mathcal{M}(\Omega_2)$ and $V_2\colon\mathcal{M}(\Omega_2)\to\mathcal{M}(\Omega_3)$ be coarse-grainings with corresponding stochastic kernels $v_1,v_2$. Then their composition $V_2V_1$ has stochastic kernel $v_2\circ v_1$.
Proof. For all $\mu\in\mathcal{M}(\Omega_1)$, $B\in\mathcal{A}_3$ we have that
$$(V_2V_1\mu)(B)=\int v_2(y,B)\,d(V_1\mu)(y)=\int\!\!\int v_2(y,B)\,v_1(x,dy)\,d\mu(x)=\int(v_2\circ v_1)(x,B)\,d\mu(x).$$
Hence, the stochastic kernel for $V_2V_1$ is $v_2\circ v_1$. □
We say that $\nu\in\mathcal{M}(\Omega_2)$ is part of $\mu\in\mathcal{M}(\Omega_1)$ if there exists a measurable function $f\colon\Omega_1\to\Omega_2$ such that $\nu(B)=\mu\big(f^{-1}(B)\big)$ for all $B\in\mathcal{A}_2$. Define $\mu_f\in\mathcal{M}(\Omega_2)$ by $\mu_f(B)=\mu\big(f^{-1}(B)\big)$. Thus, $\nu$ is part of $\mu$ if and only if $\nu=\mu_f$ for a measurable function f. Notice that $\mu\mapsto\mu_f$ is affine because
$$\big(\lambda\mu_1+(1-\lambda)\mu_2\big)_f(B)=\lambda\mu_{1f}(B)+(1-\lambda)\mu_{2f}(B)$$
and hence, $\big(\lambda\mu_1+(1-\lambda)\mu_2\big)_f=\lambda\mu_{1f}+(1-\lambda)\mu_{2f}$.
Lemma 3. A map v is the stochastic kernel for $V\mu=\mu_f$ if and only if $v(x,B)=\chi_{f^{-1}(B)}(x)$ for all $x\in\Omega_1$, $B\in\mathcal{A}_2$.
Proof. If $v(x,B)=\chi_{f^{-1}(B)}(x)$, then v is a stochastic kernel and
$$\int v(x,B)\,d\mu(x)=\mu\big(f^{-1}(B)\big)=\mu_f(B)$$
for all $\mu\in\mathcal{M}(\Omega_1)$, $B\in\mathcal{A}_2$. Hence, v is a stochastic kernel for $\mu\mapsto\mu_f$. Since stochastic kernels are unique, the converse holds. □
Lemma 4. If $f\colon\Omega_1\to\Omega_2$ and $g\colon\Omega_2\to\Omega_3$ are measurable, $(\mu_f)_g=\mu_{g\circ f}$ and the stochastic kernel for $\mu\mapsto\mu_{g\circ f}$ is $v(x,B)=\chi_{(g\circ f)^{-1}(B)}(x)$.
Proof. For all $\mu\in\mathcal{M}(\Omega_1)$ and $B\in\mathcal{A}_3$ we have that
$$(\mu_f)_g(B)=\mu_f\big(g^{-1}(B)\big)=\mu\big(f^{-1}(g^{-1}(B))\big)=\mu\big((g\circ f)^{-1}(B)\big)=\mu_{g\circ f}(B).$$
Hence, $(\mu_f)_g=\mu_{g\circ f}$. It follows from Lemma 3 that the stochastic kernel for $\mu\mapsto\mu_{g\circ f}$ is $\chi_{(g\circ f)^{-1}(B)}(x)$. □
We say that two probability measures coexist if they are both parts of another probability measure.
Lemma 5. If $\mu_1,\mu_2\in\mathcal{M}(\Omega)$, then $\mu_1,\mu_2$ coexist.
Proof. Define $\mu\in\mathcal{M}(\Omega\times\Omega)$ by the product measure $\mu=\mu_1\times\mu_2$ and define $f\colon\Omega\times\Omega\to\Omega$ by $f(x,y)=x$, $g\colon\Omega\times\Omega\to\Omega$ by $g(x,y)=y$. Then f and g are measurable and if $B\in\mathcal{A}$ we obtain
$$\mu_f(B)=\mu\big(f^{-1}(B)\big)=\mu(B\times\Omega)=\mu_1(B)\mu_2(\Omega)=\mu_1(B).$$
Hence, $\mu_1$ is a part of $\mu$. Similarly, if $B\in\mathcal{A}$, then $\mu_g(B)=\mu_2(B)$ so $\mu_2$ is a part of $\mu$. □
Let $(\Omega,\mathcal{A})$ be a measurable space and let $(\Omega_0,\mathcal{A}_0)$ be a finite measurable space with $\Omega_0=\{1,2,\dots,n\}$. Let $\{A_1,A_2,\dots,A_n\}$ be a measurable partition of $\Omega$. That is, $A_i\cap A_j=\emptyset$ for $i\ne j$ and $\bigcup_{i=1}^nA_i=\Omega$. Define $V\colon\mathcal{M}(\Omega)\to\mathcal{M}(\Omega_0)$ by $(V\mu)(\{i\})=\mu(A_i)$. Then V is affine because
$$V\big(\lambda\mu_1+(1-\lambda)\mu_2\big)(\{i\})=\lambda\mu_1(A_i)+(1-\lambda)\mu_2(A_i)$$
so that $V(\lambda\mu_1+(1-\lambda)\mu_2)=\lambda V\mu_1+(1-\lambda)V\mu_2$. We call V a discretization map and $V\mu$ a discretization of $\mu$. A stochastic kernel v is called a 0–1 stochastic kernel if $v(x,B)\in\{0,1\}$ for all $x,B$.
Theorem 1. An affine map $V\colon\mathcal{M}(\Omega)\to\mathcal{M}(\Omega_0)$ is a discretization if and only if V has a 0–1 stochastic kernel.
Proof. Suppose V is a discretization and V has stochastic kernel v. Then by Lemma 1 we obtain for $x\in\Omega$ that
$$v(x,\{i\})=(V\delta_x)(\{i\})=\delta_x(A_i)=\chi_{A_i}(x).$$
Hence, $v(x,\{i\})\in\{0,1\}$ for all $x\in\Omega$, $i\in\Omega_0$. To show that v is actually the stochastic kernel for V we have that
$$\int v(x,\{i\})\,d\mu(x)=\int\chi_{A_i}(x)\,d\mu(x)=\mu(A_i)=(V\mu)(\{i\}).$$
Of course, v is a 0–1 stochastic kernel. Conversely, suppose v is a 0–1 stochastic kernel for $V\colon\mathcal{M}(\Omega)\to\mathcal{M}(\Omega_0)$. Then $v(x,\{i\})\in\{0,1\}$ for all $x\in\Omega$, $i\in\Omega_0$. Let $A_i$, $i=1,2,\dots,n$, be the measurable sets
$$A_i=\big\{x\in\Omega\colon v(x,\{i\})=1\big\}.$$
If $x\in A_i\cap A_j$ for $i\ne j$, then $v(x,\{i\})=v(x,\{j\})=1$ and
$$v(x,\Omega_0)\ge v(x,\{i\})+v(x,\{j\})=2$$
which is a contradiction. Hence, $A_i\cap A_j=\emptyset$ for $i\ne j$. If $x\in\Omega$ and $v(x,\{i\})=0$ for all $i$, then
$$v(x,\Omega_0)=\sum_{i=1}^nv(x,\{i\})=0$$
which is a contradiction. Hence, there exists an i such that $v(x,\{i\})=1$ so $x\in A_i$. We conclude that $\{A_1,A_2,\dots,A_n\}$ is a measurable partition of $\Omega$. Since $v(x,\{i\})=\chi_{A_i}(x)$ we have for all $\mu\in\mathcal{M}(\Omega)$ that
$$(V\mu)(\{i\})=\int v(x,\{i\})\,d\mu(x)=\mu(A_i).$$
We conclude that V is a discretization map. □
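When $\Omega$ is finite, a discretization map is just multiplication by a 0–1 column-stochastic matrix whose rows mark the cells of the partition. The following sketch (Python with NumPy; the distribution and partition are invented for illustration) shows a discretization in this sense:

```python
import numpy as np

# A distribution mu on Omega = {0,...,5} and the measurable partition
# A_1 = {0,1}, A_2 = {2,3,4}, A_3 = {5} define a discretization map V.
mu = np.array([0.10, 0.15, 0.20, 0.25, 0.20, 0.10])

# 0-1 stochastic kernel as a matrix: column j has a single 1,
# in the row i for which x_j belongs to A_i.
V = np.array([
    [1, 1, 0, 0, 0, 0],   # A_1
    [0, 0, 1, 1, 1, 0],   # A_2
    [0, 0, 0, 0, 0, 1],   # A_3
])

Vmu = V @ mu              # (V mu)({i}) = mu(A_i)
print(Vmu)                # the discretized distribution mu(A_1), mu(A_2), mu(A_3)
```

Each column of V sums to 1, so V is stochastic, and its 0–1 entries encode the partition exactly as in the theorem.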
When we consider a finite measurable space $(\Omega,\mathcal{A})$ we always assume that $\mathcal{A}=2^\Omega$ so $\mathcal{A}$ need not be specified. For $\Omega=\{x_1,x_2,\dots,x_n\}$ we identify a $\mu\in\mathcal{M}(\Omega)$ with the column vector with entries $\mu_i$ where we write $\mu_i=\mu(\{x_i\})$, $i=1,2,\dots,n$. An $m\times n$ matrix $M=[M_{ij}]$ is a stochastic matrix if $M_{ij}\ge0$ and $\sum_{i=1}^mM_{ij}=1$ for all $j$. In this finite case, the stochastic kernels are replaced by stochastic matrices. This is because, in the finite case, if $v\colon\Omega_1\times2^{\Omega_2}\to[0,1]$ is a stochastic kernel, then $M_{ij}=v\big(x_j,\{y_i\}\big)$ is a stochastic matrix and conversely, if M is a stochastic matrix, then $v(x_j,B)=\sum_{y_i\in B}M_{ij}$ is a stochastic kernel.
Theorem 2. Let $\Omega_1=\{x_1,\dots,x_n\}$, $\Omega_2=\{y_1,\dots,y_m\}$ and let $V\colon\mathcal{M}(\Omega_1)\to\mathcal{M}(\Omega_2)$ be affine. Then there exists a unique $m\times n$ stochastic matrix $M_V$ such that for every $\mu\in\mathcal{M}(\Omega_1)$ we have $V\mu=M_V\mu$. Conversely, if M is an $m\times n$ stochastic matrix, then there exists an affine map $V\colon\mathcal{M}(\Omega_1)\to\mathcal{M}(\Omega_2)$ such that $V\mu=M\mu$.
Proof. Let $V\colon\mathcal{M}(\Omega_1)\to\mathcal{M}(\Omega_2)$ be affine. Since every element of $\mathcal{M}(\Omega_1)$ is a convex combination of $\delta_{x_1},\dots,\delta_{x_n}$, we have that $\mu=\sum_{j=1}^n\mu_j\delta_{x_j}$ where $\mu_j\ge0$ and $\sum_{j=1}^n\mu_j=1$. We conclude that
$$(M_V)_{ij}=\big(V\delta_{x_j}\big)(\{y_i\})$$
is an $m\times n$ stochastic matrix since $V\delta_{x_j}\in\mathcal{M}(\Omega_2)$ implies $\sum_{i=1}^m(M_V)_{ij}=1$. Letting $\nu=V\mu$ we obtain $\nu=\sum_i\nu_i\delta_{y_i}$ where $\nu_i=(V\mu)(\{y_i\})$, $i=1,\dots,m$. Since $\mu=\sum_j\mu_j\delta_{x_j}$ and V is affine, we conclude that
$$\nu_i=(V\mu)(\{y_i\})=\sum_{j=1}^n\mu_j\big(V\delta_{x_j}\big)(\{y_i\})=\sum_{j=1}^n(M_V)_{ij}\mu_j=(M_V\mu)_i.$$
To show that $M_V$ is unique, suppose $V\mu=N\mu$ for all $\mu\in\mathcal{M}(\Omega_1)$ where N is an $m\times n$ matrix. We then obtain
$$N_{ij}=(N\delta_{x_j})_i=(V\delta_{x_j})(\{y_i\})=(M_V)_{ij}.$$
Conversely, let M be an $m\times n$ stochastic matrix. Define $V\delta_{x_j}$ by $(V\delta_{x_j})(\{y_i\})=M_{ij}$, and extend V affinely to all of $\mathcal{M}(\Omega_1)$. By our previous work, $V\mu=M\mu$. □
We conclude that in the finite case, every affine map $V\colon\mathcal{M}(\Omega_1)\to\mathcal{M}(\Omega_2)$ is a coarse-graining and is implemented by a unique stochastic matrix $M_V$. We then identify V and $M_V$.
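The identification of affine maps with stochastic matrices can be checked numerically. The sketch below (Python with NumPy; the matrix and distributions are invented for illustration) verifies affinity and recovers the matrix column-by-column from its action on point masses:

```python
import numpy as np

# A column-stochastic 2x3 matrix M implements an affine map
# V: M(Omega_1) -> M(Omega_2) via V(mu) = M mu.
M = np.array([[0.7, 0.1, 0.5],
              [0.3, 0.9, 0.5]])

mu1 = np.array([0.2, 0.3, 0.5])
mu2 = np.array([0.6, 0.1, 0.3])
lam = 0.25

# Affinity: V(lam*mu1 + (1-lam)*mu2) = lam*V(mu1) + (1-lam)*V(mu2)
lhs = M @ (lam * mu1 + (1 - lam) * mu2)
rhs = lam * (M @ mu1) + (1 - lam) * (M @ mu2)
assert np.allclose(lhs, rhs)

# Uniqueness: M is recovered from point masses delta_j,
# since (M_V)_{ij} = (V delta_j)({y_i}).
deltas = np.eye(3)
recovered = np.column_stack([M @ deltas[:, j] for j in range(3)])
assert np.allclose(recovered, M)
```

The output of V on a probability vector is again a probability vector because the columns of M sum to 1.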
3. Observables and Instruments
In this section, we employ our previous work to study coarse-graining of observables and instruments. Let H be a complex Hilbert space that represents a quantum system S. We denote the set of bounded linear operators on H by $\mathcal{L}(H)$. For $S,T\in\mathcal{L}(H)$, we write $S\le T$ if $\langle\phi,S\phi\rangle\le\langle\phi,T\phi\rangle$ for all $\phi\in H$. An operator $E\in\mathcal{L}(H)$ is an effect if $0\le E\le I$ where $0,I$ are the zero and identity operators, respectively. We denote the set of effects by $\mathcal{E}(H)$ and interpret an $E\in\mathcal{E}(H)$ as a 1–0 (true–false) measurement [5,7,8]. If $(\Omega,\mathcal{A})$ is a measurable space, an observable with outcome space $(\Omega,\mathcal{A})$ is an effect-valued measure $A\colon\mathcal{A}\to\mathcal{E}(H)$ [5,7,8]. That is, $A\big(\bigcup B_i\big)=\sum A(B_i)$ when $B_i\cap B_j=\emptyset$ for $i\ne j$, and $A(\Omega)=I$. We interpret $A(B)$ as the effect that occurs when a measurement of A results in an outcome in B. A state for S is an effect $\rho$ that satisfies $\mathrm{tr}(\rho)=1$. We denote the set of states on H by $\mathcal{S}(H)$. If $\rho\in\mathcal{S}(H)$, $E\in\mathcal{E}(H)$, we interpret $\mathrm{tr}(\rho E)$ as the probability that E occurs (is true) when S is in the state $\rho$. If A is an observable, its statistics in the state $\rho$ is given by the distribution $\Phi_\rho^A(B)=\mathrm{tr}\big(\rho A(B)\big)$ for all $B\in\mathcal{A}$. Of course, $\Phi_\rho^A\in\mathcal{M}(\Omega)$ for all $\rho\in\mathcal{S}(H)$ [5,7,8].
We now discuss a method for constructing stochastic kernels from observables. Let $(\Omega_1,\mathcal{A}_1)$, $(\Omega_2,\mathcal{A}_2)$ be measurable spaces, $\{\rho_x\colon x\in\Omega_1\}$ a collection of states and A an observable with outcome space $(\Omega_2,\mathcal{A}_2)$. We say that $\{\rho_x\}$ is measurable if $x\mapsto\mathrm{tr}\big(\rho_xA(B)\big)$ is measurable for all $B\in\mathcal{A}_2$. If $\{\rho_x\}$ is measurable, we define the stochastic kernel
$$v(x,B)=\mathrm{tr}\big(\rho_xA(B)\big)\qquad(1)$$
with the corresponding coarse-graining
$$(V\mu)(B)=\int\mathrm{tr}\big(\rho_xA(B)\big)\,d\mu(x).\qquad(2)$$
If $\rho_x$ are pure states $\rho_x=|\phi_x\rangle\langle\phi_x|$, $x\in\Omega_1$, then (1) and (2) become
$$v(x,B)=\langle\phi_x,A(B)\phi_x\rangle\qquad(3)$$
and
$$(V\mu)(B)=\int\langle\phi_x,A(B)\phi_x\rangle\,d\mu(x).\qquad(4)$$
We interpret (1) as the probability that a measurement of A results in an outcome in B when S is in the state $\rho_x$.
Example 3. Let $\Omega=\{y_1,\dots,y_m\}$ be a finite measurable space. We show that any $m\times n$ stochastic matrix M can be written in the form of the previous paragraph. Let H be a complex Hilbert space with dimension n and let $\{\phi_1,\dots,\phi_n\}$ be an orthonormal basis for H. Let A be the observable with outcome space $\Omega$ satisfying
$$A(\{y_i\})=\sum_{j=1}^nM_{ij}|\phi_j\rangle\langle\phi_j|.$$
Letting $\rho_j$ be the pure state $\rho_j=|\phi_j\rangle\langle\phi_j|$, $j=1,\dots,n$, we obtain $\mathrm{tr}\big(\rho_jA(\{y_i\})\big)=M_{ij}$.
We now give an application of the previous structure to the study of the dynamics of the system S. Suppose the dynamics of S is described by the strongly continuous unitary group $U_t=e^{-iKt}$, $t\in\mathbb{R}$, where K is the Hamiltonian for S. If $\rho$ is the initial state, then $\rho_t=U_t\rho U_t^{\,*}$ is the state at time t. We can consider $\{\rho_t\colon t\in\mathbb{R}\}$ as a collection of states indexed by the points of the measurable space $(\mathbb{R},\mathcal{B}(\mathbb{R}))$. Let A be an observable with outcome space $(\Omega,\mathcal{A})$. Since $t\mapsto U_t$ is continuous we have that $t\mapsto\mathrm{tr}\big(\rho_tA(B)\big)$ is continuous for all $B\in\mathcal{A}$. It follows that $\{\rho_t\}$ is measurable. We conclude that the map given by
$$v(t,B)=\mathrm{tr}\big(\rho_tA(B)\big)$$
is a stochastic kernel called the dynamical kernel for A. We interpret $v(t,B)$ as the probability that a measurement of A at time t results in an outcome in B. In terms of the dynamical group we have
$$v(t,B)=\mathrm{tr}\big(U_t\rho U_t^{\,*}A(B)\big).\qquad(5)$$
The observable $A_t(B)=U_t^{\,*}A(B)U_t$ which gives the time evolution of A is the Heisenberg picture of quantum mechanics while (5) gives the Schrödinger picture. The corresponding coarse-graining map satisfies
$$(V\mu)(B)=\int\mathrm{tr}\big(\rho_tA(B)\big)\,d\mu(t).$$
For a particular time $t_0$ we have $(V\delta_{t_0})(B)=\mathrm{tr}\big(\rho_{t_0}A(B)\big)$.
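A minimal sketch of a dynamical kernel for a qubit (the Hamiltonian, initial state and effect are chosen for illustration; for this K with $K^2=I$ the exponential $e^{-iKt}=\cos(t)I-i\sin(t)K$ has a closed form, so no matrix exponential routine is needed):

```python
import numpy as np

# Dynamical kernel v(t, B) = tr(rho_t A(B)) for a qubit.
K = np.array([[0.0, 1.0], [1.0, 0.0]])      # Hamiltonian K = sigma_x (hbar = 1)
rho0 = np.array([[1.0, 0.0], [0.0, 0.0]])   # initial state |0><0|
E = np.array([[1.0, 0.0], [0.0, 0.0]])      # effect A(B) = |0><0|

def U(t):
    # Since K^2 = I, e^{-iKt} = cos(t) I - i sin(t) K
    return np.cos(t) * np.eye(2) - 1j * np.sin(t) * K

def v(t):
    rho_t = U(t) @ rho0 @ U(t).conj().T     # Schrodinger-picture state at time t
    return np.trace(rho_t @ E).real

# For this choice the kernel is cos(t)^2 (a Rabi oscillation).
for t in [0.0, np.pi / 4, np.pi / 2]:
    assert np.isclose(v(t), np.cos(t) ** 2)
```

For each fixed t the map $B\mapsto v(t,B)$ extends to a probability measure, which is the kernel property used in the text.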
Let A be an observable with outcome space $(\Omega_1,\mathcal{A}_1)$ and let $(\Omega_2,\mathcal{A}_2)$ be a measurable space. If $v\colon\Omega_1\times\mathcal{A}_2\to[0,1]$ is a stochastic kernel, we define the observable $VA$ with outcome space $(\Omega_2,\mathcal{A}_2)$ by
$$(VA)(B)=\int v(x,B)\,dA(x).\qquad(6)$$
We call v the stochastic kernel for V and $VA$ is a coarse-graining of A [5,7]. We see that $(VA)(B)$ is the unique effect satisfying
$$\mathrm{tr}\big(\rho(VA)(B)\big)=\int v(x,B)\,d\Phi_\rho^A(x)$$
for all $\rho\in\mathcal{S}(H)$. We now show that this idea extends to observables.
Lemma 6. $VA$ is the unique observable with distribution $\Phi_\rho^{VA}=V\Phi_\rho^A$.
Proof. For all $\rho\in\mathcal{S}(H)$, $B\in\mathcal{A}_2$ we obtain
$$\Phi_\rho^{VA}(B)=\mathrm{tr}\big(\rho(VA)(B)\big)=\int v(x,B)\,d\Phi_\rho^A(x)=\big(V\Phi_\rho^A\big)(B).$$
The observable $VA$ is unique because two observables on H with the same distributions for every $\rho\in\mathcal{S}(H)$ are identical [5,7,8]. □
If $A_i$, $i=1,2,\dots,n$, are observables on H with the same outcome set and $\lambda_i\ge0$ with $\sum\lambda_i=1$, it is clear that $\sum\lambda_iA_i$ is again an observable. Thus, such observables form a convex set. We conclude that $A\mapsto VA$ is an affine map because
$$V\Big(\sum\lambda_iA_i\Big)(B)=\int v(x,B)\,d\Big(\sum\lambda_iA_i\Big)(x)=\sum\lambda_i(VA_i)(B).$$
Let $(\Omega_1,\mathcal{A}_1)$, $(\Omega_2,\mathcal{A}_2)$ be measurable spaces and let $\{\rho_x\colon x\in\Omega_1\}$ be measurable with corresponding stochastic kernel v and coarse-graining V given by (1) and (2). If B is an observable with outcome space $(\Omega_2,\mathcal{A}_2)$ we obtain the following result.
Lemma 7. (a)
For all we have that(b)
For all , , we have that Proof. (a) Since
for all
, we obtain
(b) For all
,
, applying (a) we obtain
□
An observable B is part of an observable A if there exists a measurable surjection $f\colon\Omega_A\to\Omega_B$ such that $B(C)=A\big(f^{-1}(C)\big)$ [6,9,10].
Lemma 8. Let A, B be observables on H with outcome spaces $(\Omega_1,\mathcal{A}_1)$, $(\Omega_2,\mathcal{A}_2)$, respectively. Then B is part of A if and only if there is a measurable surjection $f\colon\Omega_1\to\Omega_2$ such that $\Phi_\rho^B=\big(\Phi_\rho^A\big)_f$ for all $\rho\in\mathcal{S}(H)$.
Proof. If B is a part of A, there exists a measurable surjection $f\colon\Omega_1\to\Omega_2$ such that $B(C)=A\big(f^{-1}(C)\big)$. If $\rho\in\mathcal{S}(H)$, then for $C\in\mathcal{A}_2$ we obtain
$$\Phi_\rho^B(C)=\mathrm{tr}\big(\rho B(C)\big)=\mathrm{tr}\big(\rho A(f^{-1}(C))\big)=\Phi_\rho^A\big(f^{-1}(C)\big)=\big(\Phi_\rho^A\big)_f(C).$$
Conversely, if $\Phi_\rho^B=\big(\Phi_\rho^A\big)_f$ for all $\rho\in\mathcal{S}(H)$, then letting $C\in\mathcal{A}_2$ we obtain $\mathrm{tr}\big(\rho B(C)\big)=\mathrm{tr}\big(\rho A(f^{-1}(C))\big)$ by reversing the previous argument. Hence, $B(C)=A\big(f^{-1}(C)\big)$ so B is part of A. □
By Lemma 6 if B is part of A so that $B(C)=A\big(f^{-1}(C)\big)$, then $\Phi_\rho^B=\big(\Phi_\rho^A\big)_f$ and hence $\Phi_\rho^B$ is part of $\Phi_\rho^A$ for all $\rho\in\mathcal{S}(H)$. Two observables B, C coexist if there exists an observable A such that B and C are part of A [5,7,11,12]. It is well-known that unlike in Lemma 5, two observables need not coexist [5,7,12]. Let A be an observable with outcome space $(\Omega,\mathcal{A})$. If V is a discretization of $\mathcal{M}(\Omega)$, we call $VA$ a discretization of A [5]. If v is the corresponding stochastic kernel we obtain
$$(VA)(\{i\})=\int v(x,\{i\})\,dA(x)=A(A_i).\qquad(7)$$
Lemma 9. If $VA$ is a discretization of A, then $VA$ is a part of A.
Proof. Let $\Omega_0=\{1,2,\dots,n\}$ where $\{A_1,\dots,A_n\}$ is the corresponding measurable partition of $\Omega$ so the outcome space of $VA$ is $\Omega_0$. Let v be the corresponding stochastic kernel. Define $f\colon\Omega\to\Omega_0$ by $f(x)=i$ if $x\in A_i$. Then by (7) $(VA)(\{i\})=A(A_i)$ and it follows that for all $C\subseteq\Omega_0$ we obtain
$$(VA)(C)=A\Big(\bigcup_{i\in C}A_i\Big)=A\big(f^{-1}(C)\big).$$
Hence, $VA$ is part of A. □
Corollary 1. Any two discretizations of an observable coexist.
Let $\mathcal{T}(H)$ be the set of trace-class operators on H. An operation on H is a trace non-increasing, completely positive linear map $T\colon\mathcal{T}(H)\to\mathcal{T}(H)$ [5,7,8,11]. If an operation T preserves the trace, then T is called a channel on H. An instrument on H with outcome space $(\Omega,\mathcal{A})$ is an operation-valued measure $\mathcal{I}$ on $\mathcal{A}$ such that $\mathcal{I}(\Omega)$ is a channel. The statistics of an instrument $\mathcal{I}$ for a state $\rho$ is given by its distribution
$$\Phi_\rho^{\mathcal{I}}(B)=\mathrm{tr}\big(\mathcal{I}(B)\rho\big)$$
for all $B\in\mathcal{A}$. Of course, $\Phi_\rho^{\mathcal{I}}$ is a probability measure on $(\Omega,\mathcal{A})$. We say that an instrument $\mathcal{I}$ measures an observable A if $\mathcal{I}$ and A have the same outcome space and for all $\rho\in\mathcal{S}(H)$ and $B\in\mathcal{A}$ we have
$$\mathrm{tr}\big(\mathcal{I}(B)\rho\big)=\mathrm{tr}\big(\rho A(B)\big).$$
It can be shown that an instrument measures a unique observable, but an observable is measured by many instruments [5]. If $\mathcal{I}$ measures A we write $\widehat{\mathcal{I}}=A$. We think of $\mathcal{I}$ as an apparatus that can be employed to measure the observable $\widehat{\mathcal{I}}$ and conclude that there are many such apparatuses. Although $\mathcal{I}$ reproduces the statistics of $\widehat{\mathcal{I}}$, $\mathcal{I}$ gives more information than $\widehat{\mathcal{I}}$. This is because when a measurement of $\mathcal{I}$ produces a result in B the instrument $\mathcal{I}$ updates the state of the system to the new state $\mathcal{I}(B)\rho/\mathrm{tr}\big(\mathcal{I}(B)\rho\big)$ when $\mathrm{tr}\big(\mathcal{I}(B)\rho\big)\ne0$ [5,7,8].
If $\mathcal{I}$ is an instrument on $(\Omega_1,\mathcal{A}_1)$ and $v\colon\Omega_1\times\mathcal{A}_2\to[0,1]$ is a stochastic kernel, then we shall show that
$$(V\mathcal{I})(B)=\int v(x,B)\,d\mathcal{I}(x)$$
is an instrument with outcome space $(\Omega_2,\mathcal{A}_2)$ called a coarse-graining of $\mathcal{I}$. To show this we have that $B\mapsto(V\mathcal{I})(B)$ is countably additive on $\mathcal{A}_2$ and
$$(V\mathcal{I})(\Omega_2)=\int v(x,\Omega_2)\,d\mathcal{I}(x)=\mathcal{I}(\Omega_1)$$
so $(V\mathcal{I})(\Omega_2)$ is a channel. Moreover, if $\rho\in\mathcal{T}(H)$ is positive and $B\in\mathcal{A}_2$ we obtain
$$\mathrm{tr}\big((V\mathcal{I})(B)\rho\big)\le\mathrm{tr}\big(\mathcal{I}(\Omega_1)\rho\big)=\mathrm{tr}(\rho).$$
It follows that $V\mathcal{I}$ is an instrument. It is easy to check that instruments form a convex set and that $\mathcal{I}\mapsto V\mathcal{I}$ is affine.
Theorem 3. (a) $\Phi_\rho^{V\mathcal{I}}=V\Phi_\rho^{\mathcal{I}}$. (b) For instruments $\mathcal{I}_1,\mathcal{I}_2$ we have that $\Phi_\rho^{\mathcal{I}_1}=\Phi_\rho^{\mathcal{I}_2}$ for all $\rho\in\mathcal{S}(H)$ if and only if $\widehat{\mathcal{I}_1}=\widehat{\mathcal{I}_2}$. (c) If $\widehat{\mathcal{I}_1}=\widehat{\mathcal{I}_2}$, then $\widehat{V\mathcal{I}_1}=\widehat{V\mathcal{I}_2}$ for all coarse-grainings V.
Proof. (a) For all $B\in\mathcal{A}_2$ we obtain
$$\Phi_\rho^{V\mathcal{I}}(B)=\mathrm{tr}\big((V\mathcal{I})(B)\rho\big)=\int v(x,B)\,d\Phi_\rho^{\mathcal{I}}(x)=\big(V\Phi_\rho^{\mathcal{I}}\big)(B).$$
It follows that $\Phi_\rho^{V\mathcal{I}}=V\Phi_\rho^{\mathcal{I}}$. (b) If $\widehat{\mathcal{I}_1}=\widehat{\mathcal{I}_2}$, then for all $B$ we have that
$$\Phi_\rho^{\mathcal{I}_1}(B)=\mathrm{tr}\big(\rho\,\widehat{\mathcal{I}_1}(B)\big)=\mathrm{tr}\big(\rho\,\widehat{\mathcal{I}_2}(B)\big)=\Phi_\rho^{\mathcal{I}_2}(B).$$
Therefore, $\Phi_\rho^{\mathcal{I}_1}(B)=\Phi_\rho^{\mathcal{I}_2}(B)$ for all $\rho\in\mathcal{S}(H)$ so $\Phi_\rho^{\mathcal{I}_1}=\Phi_\rho^{\mathcal{I}_2}$. Conversely, if $\Phi_\rho^{\mathcal{I}_1}=\Phi_\rho^{\mathcal{I}_2}$ for all $\rho\in\mathcal{S}(H)$, then for all $B$ we obtain
$$\mathrm{tr}\big(\rho\,\widehat{\mathcal{I}_1}(B)\big)=\mathrm{tr}\big(\rho\,\widehat{\mathcal{I}_2}(B)\big).$$
Hence, $\widehat{\mathcal{I}_1}=\widehat{\mathcal{I}_2}$. (c) If $\widehat{\mathcal{I}_1}=\widehat{\mathcal{I}_2}$, then by (a) $\Phi_\rho^{V\mathcal{I}_i}=V\Phi_\rho^{\mathcal{I}_i}$. Applying (b) gives $\widehat{V\mathcal{I}_1}=\widehat{V\mathcal{I}_2}$ for all V. □
The converse of Theorem 3(c) does not hold. That is, if $\widehat{V\mathcal{I}_1}=\widehat{V\mathcal{I}_2}$ for all V, we need not have $\mathcal{I}_1=\mathcal{I}_2$. For example, let V be the identity map. Then $\widehat{V\mathcal{I}}=\widehat{\mathcal{I}}$ for all $\mathcal{I}$. However, there exist $\mathcal{I}_1\ne\mathcal{I}_2$ with $\Phi_\rho^{\mathcal{I}_1}=\Phi_\rho^{\mathcal{I}_2}$ for all $\rho$ so $\widehat{\mathcal{I}_1}=\widehat{\mathcal{I}_2}$. Applying Theorem 3, we can consider the various special types of coarse-graining for instruments.
4. Finite Observables
In this section, we restrict our attention to finite observables. If A is an observable with $\Omega_A=\{x_1,x_2,\dots,x_n\}$, then A is completely determined by $A(\{x_1\}),\dots,A(\{x_n\})$. We then define $A_i=A(\{x_i\})$ and write $A=\{A_1,A_2,\dots,A_n\}$. It follows that for all $B\subseteq\Omega_A$ we have that $A(B)=\sum_{x_i\in B}A_i$. Let $B=\{B_1,B_2,\dots,B_m\}$ be another observable and let $V\colon\mathcal{M}(\Omega_A)\to\mathcal{M}(\Omega_B)$ be an affine map. We write $B=VA$ if $\Phi_\rho^B=V\Phi_\rho^A$ for all $\rho\in\mathcal{S}(H)$. We then say that B is a post-processing of A [5,6]. Thus, post-processing is the same as coarse-graining for finite observables.
Theorem 4. If $V\colon\mathcal{M}(\Omega_A)\to\mathcal{M}(\Omega_B)$ is affine, then $B=VA$ if and only if $B_i=\sum_j(M_V)_{ij}A_j$ for all i where $M_V$ is the stochastic matrix corresponding to V.
Proof. Suppose $V$ is affine and $B=VA$. By Theorem 2 $M_V$ is a stochastic matrix and for all $\rho\in\mathcal{S}(H)$ we obtain
$$\mathrm{tr}(\rho B_i)=\Phi_\rho^B(\{y_i\})=\big(M_V\Phi_\rho^A\big)_i=\sum_j(M_V)_{ij}\mathrm{tr}(\rho A_j)=\mathrm{tr}\Big(\rho\sum_j(M_V)_{ij}A_j\Big).$$
It follows that $B_i=\sum_j(M_V)_{ij}A_j$. Conversely, suppose $B_i=\sum_j(M_V)_{ij}A_j$ for all i. Then for all $\rho\in\mathcal{S}(H)$ and i we obtain
$$\Phi_\rho^B(\{y_i\})=\mathrm{tr}(\rho B_i)=\sum_j(M_V)_{ij}\Phi_\rho^A(\{x_j\})=\big(V\Phi_\rho^A\big)(\{y_i\}).$$
Hence, $B=VA$. □
We can identify an observable with a set $A=\{A_1,\dots,A_n\}$ of effects satisfying $\sum A_i=I$. We say that A is rank 1, sharp, atomic, respectively, if the $A_i$ are rank 1, projections, 1-dimensional projections. If A is sharp, it follows that $A_iA_j=0$ for $i\ne j$ [5,6]. If A is atomic, there exists an orthonormal basis $\{\phi_1,\dots,\phi_n\}$ for H such that $A_i=|\phi_i\rangle\langle\phi_i|$, $i=1,2,\dots,n$. Notice that $A_i$ is rank 1 if and only if $A_i=\lambda_iP_i$ where $\lambda_i\in(0,1]$ and $P_i$ is a 1-dimensional projection.
If $A=\{A_1,\dots,A_n\}$, $B=\{B_1,\dots,B_m\}$ are observables on H, their sequential product is the observable with outcome space $\Omega_A\times\Omega_B$ given by [6,13]
$$(A\circ B)_{ij}=A_i^{1/2}B_jA_i^{1/2}.$$
We also define the observable B conditioned by the observable A as
$$(B\mid A)_j=\sum_{i=1}^nA_i^{1/2}B_jA_i^{1/2}.$$
It can be shown that $(B\mid A)$ coexists with A [6]. If $\nu$ is a stochastic matrix of the appropriate size, then
$$\big(A\circ\nu B\big)_{ij}=\sum_k\nu_{jk}A_i^{1/2}B_kA_i^{1/2}\qquad(8)$$
and if $\nu$ is a stochastic matrix of the appropriate size, then
$$\big(\nu A\circ B\big)_{ij}=\Big(\sum_k\nu_{ik}A_k\Big)^{1/2}B_j\Big(\sum_k\nu_{ik}A_k\Big)^{1/2}.\qquad(9)$$
Notice that (9) is much more complicated than (8). If A is sharp, then (8) and (9) become
$$\big(A\circ\nu B\big)_{ij}=\sum_k\nu_{jk}A_iB_kA_i\qquad(10)$$
and
$$\big(\nu A\circ B\big)_{ij}=\sum_{k,l}\big(\nu_{ik}\nu_{il}\big)^{1/2}A_kB_jA_l.\qquad(11)$$
If A and B are atomic with $A_i=|\phi_i\rangle\langle\phi_i|$ and $B_j=|\psi_j\rangle\langle\psi_j|$ then (8), (9) become
$$\big(A\circ\nu B\big)_{ij}=\sum_k\nu_{jk}\big|\langle\phi_i,\psi_k\rangle\big|^2\,|\phi_i\rangle\langle\phi_i|\qquad(12)$$
and
$$\big(\nu A\circ B\big)_{ij}=|c_{ij}\rangle\langle c_{ij}|\quad\text{where}\quad c_{ij}=\sum_k\nu_{ik}^{1/2}\langle\phi_k,\psi_j\rangle\phi_k.\qquad(13)$$
Notice from (12) and (13) that both $A\circ\nu B$ and $\nu A\circ B$ are rank 1 observables. The next lemma shows that post-processing and conditioning interact in a regular way.
Lemma 10. $(\nu B\mid A)=\nu(B\mid A)$.
Proof. The result follows because
$$(\nu B\mid A)_k=\sum_iA_i^{1/2}(\nu B)_kA_i^{1/2}=\sum_j\nu_{kj}\sum_iA_i^{1/2}B_jA_i^{1/2}=\sum_j\nu_{kj}(B\mid A)_j.$$
Hence, $(\nu B\mid A)=\nu(B\mid A)$. □
Example 4. This example illustrates the concepts of this section in terms of finite position and momentum observables. Let H be a finite-dimensional Hilbert space with dimension d and let $\{\phi_0,\phi_1,\dots,\phi_{d-1}\}$ be an orthonormal basis for H. The finite Fourier transform is the unitary operator on H given by
$$F\phi_j=\frac{1}{\sqrt d}\sum_{k=0}^{d-1}\omega^{jk}\phi_k$$
where $\omega=e^{2\pi i/d}$ [5]. Equivalently, F is the operator satisfying $\langle\phi_k,F\phi_j\rangle=\frac{1}{\sqrt d}\,\omega^{jk}$ for all $j,k$. We call $Q=\{Q_0,Q_1,\dots,Q_{d-1}\}$ where $Q_j=|\phi_j\rangle\langle\phi_j|$ the finite position observable and $P=\{P_0,P_1,\dots,P_{d-1}\}$ where $P_j=|\psi_j\rangle\langle\psi_j|$ with $\psi_j=F\phi_j$ the finite momentum observable. Notice that $P_j=FQ_jF^*$, $\sum Q_j=\sum P_j=I$ and $\big|\langle\phi_j,\psi_k\rangle\big|^2=\tfrac1d$ for all $j,k$. We also see that Q and P are atomic observables. The observable $Q\circ P$ has effects
$$(Q\circ P)_{jk}=Q_jP_kQ_j=\big|\langle\phi_j,\psi_k\rangle\big|^2Q_j=\frac1d\,Q_j.$$
Thus, $Q\circ P$ is a rank 1 observable and $(P\mid Q)_k=\frac1d\sum_jQ_j=\frac1d\,I$ is the trivial observable for $k=0,1,\dots,d-1$. In a similar way, $(P\circ Q)_{kj}=\frac1d\,P_k$ and $(Q\mid P)_j=\frac1d\,I$ for $j=0,1,\dots,d-1$. The distribution of Q in the state $\rho$ becomes $\Phi_\rho^Q(j)=\mathrm{tr}(\rho Q_j)=\langle\phi_j,\rho\phi_j\rangle$ for $j=0,1,\dots,d-1$.
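A short numerical sketch of this example (Python with NumPy; dimension d = 5 chosen arbitrarily) constructs F, Q and P and checks that each effect of the sequential product collapses to $Q_j/d$:

```python
import numpy as np

d = 5
omega = np.exp(2j * np.pi / d)
# Finite Fourier transform: <phi_k, F phi_j> = omega^{jk} / sqrt(d)
F = np.array([[omega ** (j * k) for j in range(d)] for k in range(d)]) / np.sqrt(d)
assert np.allclose(F @ F.conj().T, np.eye(d))          # F is unitary

phi = np.eye(d)                                        # standard basis
Q = [np.outer(phi[j], phi[j]) for j in range(d)]       # position observable
psi = F @ phi                                          # psi_j = F phi_j (columns)
P = [np.outer(psi[:, j], psi[:, j].conj()) for j in range(d)]  # momentum

# Each effect of Q o P collapses: Q_j P_k Q_j = (1/d) Q_j
for j in range(d):
    for k in range(d):
        assert np.allclose(Q[j] @ P[k] @ Q[j], Q[j] / d)
```

Summing over j then gives $(P\mid Q)_k=I/d$, the trivial observable, exactly as in the example.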
More interesting observables are obtained by post-processing. Let $\nu$ be a $d\times d$ stochastic matrix so that $\nu_{jk}\ge0$ and $\sum_j\nu_{jk}=1$ for all k. Then the post-processing observable $\nu Q$ satisfies
$$(\nu Q)_j=\sum_k\nu_{jk}Q_k=\sum_k\nu_{jk}|\phi_k\rangle\langle\phi_k|.$$
We see that the eigenvalues of $(\nu Q)_j$ are $\nu_{jk}$, $k=0,1,\dots,d-1$ with corresponding eigenvectors $\phi_k$. The distribution of $\nu Q$ in the state $\rho$ becomes
$$\Phi_\rho^{\nu Q}(j)=\sum_k\nu_{jk}\langle\phi_k,\rho\phi_k\rangle.$$
The observable $\nu Q\circ P$ satisfies
$$(\nu Q\circ P)_{jk}=|c_{jk}\rangle\langle c_{jk}|\quad\text{where}\quad c_{jk}=\frac{1}{\sqrt d}\sum_l\nu_{jl}^{1/2}\,\omega^{kl}\phi_l.\qquad(14)$$
Equation (14) also follows from (13).
5. SIC Observables
This section is more speculative than the previous ones and we do not come to many definite conclusions. A finite observable $A=\{A_1,\dots,A_n\}$ is informationally complete (IC) if $\mathrm{tr}(\rho_1A_i)=\mathrm{tr}(\rho_2A_i)$ for all i implies that $\rho_1=\rho_2$. Equivalently, A is informationally complete if $\mathrm{tr}(GA_i)=0$ for all i, for a self-adjoint operator G, implies that $G=0$. It can be shown that there exist IC observables for every finite dimensional Hilbert space H [5]. An observable A on a Hilbert space H with $\dim H=d$ is symmetric if [5]:
- (S1) $|\Omega_A|=d^2$,
- (S2) A has rank 1,
- (S3) $\mathrm{tr}(A_x)$ is the same for all x,
- (S4) $\mathrm{tr}(A_xA_y)$ is the same for all $x\ne y$.
It can be shown that $d^2$ is the smallest cardinality for the outcome space of an IC observable [5]. Furthermore, $\mathrm{tr}(A_x)=\tfrac1d$ if $\mathrm{tr}(A_x)$ is constant for all x and $\mathrm{tr}(A_xA_y)=\tfrac{1}{d^2(d+1)}$ for $x\ne y$ if $\mathrm{tr}(A_xA_y)$ is constant for $x\ne y$ [5]. A symmetric IC observable is called a SIC observable. An important unsolved problem is whether SIC observables exist for every finite dimensional Hilbert space [5]. It is not even known whether high dimensional SIC observables exist. We would like to propose a possible method for attacking this problem. Unfortunately, we have not been able to complete this method and we leave this to future work.
Let $\dim H=d$ and let $A=\{A_1,\dots,A_d\}$, $B=\{B_1,\dots,B_d\}$ be atomic observables with $A_x=|\phi_x\rangle\langle\phi_x|$, $B_y=|\psi_y\rangle\langle\psi_y|$. For a $d\times d$ stochastic matrix $\nu$ we define the observable $C=\nu A\circ B$. For example, $\nu Q\circ P$ of Example 4 is such an observable. Letting $c_{xy}$ be the vector given by
$$c_{xy}=\sum_l\nu_{xl}^{1/2}\langle\phi_l,\psi_y\rangle\phi_l\qquad(15)$$
we conclude from (13) that for all $x,y$ we have that
$$C_{xy}=|c_{xy}\rangle\langle c_{xy}|.\qquad(16)$$
It immediately follows that C satisfies (S1) and (S2). We say that a stochastic matrix $\nu$ is doubly stochastic if $\sum_y\nu_{xy}=1$ for all x in addition to $\sum_x\nu_{xy}=1$ for all y [5]. The bases $\{\phi_x\}$, $\{\psi_y\}$ are mutually unbiased bases (MUB) if $\big|\langle\phi_x,\psi_y\rangle\big|^2=\tfrac1d$ for all $x,y$ [6]. It is easy to show that there exist pairs of MUB for every finite dimension. In fact, the two bases in Example 4 are MUB.
Theorem 5. (a) If ν is doubly stochastic and $\{\phi_x\}$, $\{\psi_y\}$ are MUB, then $\mathrm{tr}(C_{xy})=\tfrac1d$ for all $x,y$. (b) If $\mathrm{tr}(C_{xy})=\tfrac1d$ for all $x,y$ then ν is doubly stochastic.
Proof. (a) Applying (15), (16) we have that
$$\mathrm{tr}(C_{xy})=\|c_{xy}\|^2=\sum_l\nu_{xl}\big|\langle\phi_l,\psi_y\rangle\big|^2$$
for all $x,y$. If ν is doubly stochastic and $\{\phi_x\}$, $\{\psi_y\}$ are MUB we conclude that $\mathrm{tr}(C_{xy})=\frac1d\sum_l\nu_{xl}=\frac1d$ for all $x,y$. (b) If $\mathrm{tr}(C_{xy})=\frac1d$ for all $x,y$, then by (a) we obtain $\sum_l\nu_{xl}\big|\langle\phi_l,\psi_y\rangle\big|^2=\frac1d$ for all $x,y$. Summing over y gives $\sum_l\nu_{xl}=1$ for all x, so ν is doubly stochastic. □
In Theorem 5(b), if $\mathrm{tr}(C_{xy})=\tfrac1d$ for all $x,y$, then $\{\phi_x\}$, $\{\psi_y\}$ need not be MUB so the converse of Theorem 5(a) does not hold. For example, suppose $\nu_{xy}=\tfrac1d$ for all $x,y$. Then $\mathrm{tr}(C_{xy})=\tfrac1d$ for all $x,y$ but $\{\phi_x\}$, $\{\psi_y\}$ can be arbitrary bases. We conclude from Theorem 5(a) that if ν is doubly stochastic and $\{\phi_x\}$, $\{\psi_y\}$ are MUB, then Condition (S3) holds.
Lemma 11. (a) Condition (S4) holds if and only if $\big|\langle c_{xy},c_{x'y'}\rangle\big|$ is the same for all $(x,y)\ne(x',y')$. (b) The observable C is IC if and only if for a self-adjoint operator G we have that $\langle c_{xy},Gc_{xy}\rangle=0$ for all $x,y$ implies that $G=0$.
Proof. (a) Applying (16) we have that
$$\mathrm{tr}\big(C_{xy}C_{x'y'}\big)=\big|\langle c_{xy},c_{x'y'}\rangle\big|^2$$
and the result follows. (b) Applying (16) we have that
$$\mathrm{tr}\big(GC_{xy}\big)=\langle c_{xy},Gc_{xy}\rangle$$
and the result follows. □
Theorem 5 and Lemma 11 give conditions under which C becomes a SIC observable.
We now illustrate our SIC method in the qubit case $d=2$. Let $\phi_1$, $\phi_2$ be the standard basis for H and let $\psi_1,\psi_2$ be a basis for H such that $\{\phi_x\}$, $\{\psi_y\}$ are MUB. For example, we could use $\psi_y=F\phi_y$ of Example 4. Define the atomic observables $A_x=|\phi_x\rangle\langle\phi_x|$, $B_y=|\psi_y\rangle\langle\psi_y|$ with $x,y\in\{1,2\}$. Let ν be the doubly stochastic matrix
$$\nu=\begin{bmatrix}\lambda&1-\lambda\\1-\lambda&\lambda\end{bmatrix},\qquad\lambda\in[0,1].$$
Define the observable $C=\nu A\circ B$ and the effects $C_{xy}=|c_{xy}\rangle\langle c_{xy}|$. Letting $c_{xy}$, $x,y\in\{1,2\}$, be the vectors defined by (15), we have by (16) that
$$c_{11}=\tfrac{1}{\sqrt2}\big(\sqrt\lambda\,\phi_1+\sqrt{1-\lambda}\,\phi_2\big),\qquad c_{12}=\tfrac{1}{\sqrt2}\big(\sqrt\lambda\,\phi_1-\sqrt{1-\lambda}\,\phi_2\big),$$
$$c_{21}=\tfrac{1}{\sqrt2}\big(\sqrt{1-\lambda}\,\phi_1+\sqrt\lambda\,\phi_2\big),\qquad c_{22}=\tfrac{1}{\sqrt2}\big(\sqrt{1-\lambda}\,\phi_1-\sqrt\lambda\,\phi_2\big).$$
We have that C satisfies Conditions (S1), (S2) and (S3). According to Lemma 11(a), C satisfies Condition (S4) if and only if $\big|\langle c_{xy},c_{x'y'}\rangle\big|$ is the same when $(x,y)\ne(x',y')$. Now
$$\big|\langle c_{11},c_{22}\rangle\big|=0\quad\text{while}\quad\big|\langle c_{11},c_{21}\rangle\big|=\sqrt{\lambda(1-\lambda)}\,.$$
Hence, (S4) is not satisfied.
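These inner products can be verified numerically. The sketch below (Python with NumPy; the value λ = 0.3 and the sign conventions for the four vectors are one consistent choice for the qubit construction with the Fourier basis as MUB partner) checks that the four effects sum to the identity and that the overlaps are unequal, so (S4) fails:

```python
import numpy as np

lam = 0.3
sl, cl = np.sqrt(lam), np.sqrt(1 - lam)

# The four vectors c_xy of the qubit construction (one choice of phases).
c = {(1, 1): np.array([sl,  cl]) / np.sqrt(2),
     (1, 2): np.array([sl, -cl]) / np.sqrt(2),
     (2, 1): np.array([cl,  sl]) / np.sqrt(2),
     (2, 2): np.array([cl, -sl]) / np.sqrt(2)}

# The effects C_xy = |c_xy><c_xy| form an observable: they sum to I.
C = sum(np.outer(v, v) for v in c.values())
assert np.allclose(C, np.eye(2))

# (S4) fails: |<c_11, c_22>| = 0 while |<c_11, c_21>| = sqrt(lam(1-lam)) != 0
assert np.isclose(abs(c[(1, 1)] @ c[(2, 2)]), 0.0)
assert np.isclose(abs(c[(1, 1)] @ c[(2, 1)]), np.sqrt(lam * (1 - lam)))
```

Since one overlap vanishes while another does not, the pairwise traces cannot be constant for any λ in (0, 1).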
If $\lambda=0$, 1 or $\tfrac12$, it is easy to check that C is not IC. Unfortunately, even when $\lambda\ne0,1,\tfrac12$, C need not be IC. For example, let $\phi_x$, $\psi_y$ be as before. We then have the following result.
Theorem 6. If $\lambda\ne0,1,\tfrac12$ and G is a self-adjoint matrix, $\langle c_{xy},Gc_{xy}\rangle=0$ for all $x,y$, if and only if $G=\alpha\begin{bmatrix}0&i\\-i&0\end{bmatrix}$ where $\alpha\in\mathbb{R}$.
Proof. By (16), $\langle c_{xy},Gc_{xy}\rangle=0$ for all $x,y$ if and only if
$$\mathrm{tr}\big(GC_{xy}\big)=0\quad\text{for all}\ x,y.\qquad(17)$$
Letting $G=\begin{bmatrix}a&b\\\bar b&c\end{bmatrix}$ with $a,c\in\mathbb{R}$ we conclude that (17) holds if and only if
$$\lambda a+(1-\lambda)c+2\sqrt{\lambda(1-\lambda)}\,\mathrm{Re}\,b=0\qquad(18)$$
$$\lambda a+(1-\lambda)c-2\sqrt{\lambda(1-\lambda)}\,\mathrm{Re}\,b=0\qquad(19)$$
$$(1-\lambda)a+\lambda c+2\sqrt{\lambda(1-\lambda)}\,\mathrm{Re}\,b=0\qquad(20)$$
$$(1-\lambda)a+\lambda c-2\sqrt{\lambda(1-\lambda)}\,\mathrm{Re}\,b=0.\qquad(21)$$
Subtracting (19) from (18) gives $4\sqrt{\lambda(1-\lambda)}\,\mathrm{Re}\,b=0$. If $\mathrm{Re}\,b\ne0$, then $\lambda(1-\lambda)=0$ so $\lambda=0$ or 1 which is a contradiction. Hence, $\mathrm{Re}\,b=0$ and it follows from (18) and (20) that $\lambda a+(1-\lambda)c=0$ and $(1-\lambda)a+\lambda c=0$. Subtracting these gives $(2\lambda-1)(a-c)=0$ so $a=c$ and then $a=c=0$. Hence, $a=c=0$, $b=i\alpha$ with $\alpha\in\mathbb{R}$ and the result follows. The converse is clear. □
Corollary 2. If $\lambda\ne0,1,\tfrac12$, then C is not IC.
Proof. Define
$$\rho_1=\frac12\begin{bmatrix}1&\tfrac i2\\[2pt]-\tfrac i2&1\end{bmatrix},\qquad\rho_2=\frac12\begin{bmatrix}1&-\tfrac i2\\[2pt]\tfrac i2&1\end{bmatrix}$$
where $\rho_1-\rho_2=\tfrac12\begin{bmatrix}0&i\\-i&0\end{bmatrix}$. Then $\rho_1\ne\rho_2$ and it is easy to check that $\rho_1,\rho_2\in\mathcal{S}(H)$ by showing that the eigenvalues of $\rho_1,\rho_2$ are $\tfrac14,\tfrac34$. Since $\rho_1-\rho_2=\alpha\begin{bmatrix}0&i\\-i&0\end{bmatrix}$ with $\alpha=\tfrac12$, it follows from Theorem 6 that $\mathrm{tr}(\rho_1C_{xy})=\mathrm{tr}(\rho_2C_{xy})$ for all $x,y$. However, $\rho_1\ne\rho_2$ so C is not IC. □
It is possible that for other ν we obtain an IC observable C. It is also possible that for higher dimensional spaces we obtain SIC observables using this method. Even though C is not IC, it satisfies two necessary (but not sufficient) conditions for IC [5] (Prop. 3.35). If C is IC these conditions are: (a) $C_{xy}$ does not have both eigenvalues 0,1 and (b) for all $(x,y)$ there exists $(x',y')\ne(x,y)$ such that $\mathrm{tr}\big(C_{xy}C_{x'y'}\big)\ne0$. Indeed, (a) is clear and (b) follows from the fact that $\langle c_{xy},c_{x'y'}\rangle\ne0$ for some $(x',y')\ne(x,y)$.