Entropy of Quantum Measurements

Gudder, Stanley

doi:10.3390/e24111686

Open AccessFeature PaperArticle

Entropy of Quantum Measurements

by

Stanley Gudder

Department of Mathematics, University of Denver, Denver, CO 80208, USA

Entropy 2022, 24(11), 1686; https://doi.org/10.3390/e24111686

Submission received: 25 October 2022 / Revised: 12 November 2022 / Accepted: 14 November 2022 / Published: 18 November 2022

(This article belongs to the Special Issue Entropy in Quantum Systems and Quantum Field Theory (QFT) II)

Download Review Reports Versions Notes

Abstract

If a is a quantum effect and

ρ

is a state, we define the

ρ

-entropy

S_{a} (ρ)

which gives the amount of uncertainty that a measurement of a provides about

ρ

. The smaller

S_{a} (ρ)

is, the more information a measurement of a gives about

ρ

. In Entropy for Effects, we provide bounds on

S_{a} (ρ)

and show that if

a + b

is an effect, then

S_{a + b} (ρ) \geq S_{a} (ρ) + S_{b} (ρ)

. We then prove a result concerning convex mixtures of effects. We also consider sequential products of effects and their

ρ

-entropies. In Entropy of Observables and Instruments, we employ

S_{a} (ρ)

to define the

ρ

-entropy

S_{A} (ρ)

for an observable A. We show that

S_{A} (ρ)

directly provides the

ρ

-entropy

S_{I} (ρ)

for an instrument

I

. We establish bounds for

S_{A} (ρ)

and prove characterizations for when these bounds are obtained. These give simplified proofs of results given in the literature. We also consider

ρ

-entropies for measurement models, sequential products of observables and coarse-graining of observables. Various examples that illustrate the theory are provided.

Keywords:

entropy; quantum measurements; effects; observables

1. Introduction

In an interesting article, D. Šafránek and J. Thingna introduce the concept of entropy for quantum instruments [1]. Various important theorems are proved and applications are given. In quantum computation and information theory one of the most important problems is to determine an unknown state by applying measurements on the system [2,3,4,5]. Entropy provides a quantification for the amount of information given to solve this so-called state discrimination problem [6,7,8]. In this article, we first define the entropy for the most basic measurement, namely a quantum effect a [2,3,9,10]. If

ρ

is a state, we define the

ρ

-entropy

S_{a} (ρ)

which gives the amount of uncertainty (or randomness) that a measurement of a provides about

ρ

. The smaller

S_{a} (ρ)

is, the more information a measurement of a provides about

ρ

. In Section 2, we give bounds on

S_{a} (ρ)

and show that if

a + b

is an effect then

S_{a + b} (ρ) \leq S_{a} (ρ) + S_{b} (ρ)

. We then prove a result concerning convex mixtures of effects. We also consider sequential products of effects and their

ρ

-entropies.

In Section 3, we employ

S_{a} (ρ)

to define the entropy

S_{A} (ρ)

for an observable A. Then

S_{A} (ρ)

gives the uncertainty that a measurement of A provides about

ρ

. We show that

S_{A} (ρ)

directly gives the

ρ

-entropy

S_{I} (ρ)

for an instrument

I

. We establish bounds for

S_{A} (ρ)

and characterize when these bounds are obtained. These give simplified proofs of results given in [1,5,11]. We also consider

ρ

-entropies for measurement models, sequential products of observables and coarse-graining of observables. Various examples that illustrate the theory are provided. In this work, all Hilbert spaces are assumed to be finite dimensional. Although this is a restriction, the work applies for quantum computation and information theory [2,3,9,10].

2. Entropy for Effects

Let H be a finite dimensional complex Hilbert space with dimension n. We denote the set of linear operators on H by

L (H)

and the set of states on H by

S (H)

. If

ρ \in S (H)

with nonzero eigenvalues

λ_{1}, λ_{2}, \dots, λ_{m}

including multiplicities, the von Neumann entropy of

ρ

is [4,6,7,8].

S (ρ) = - \sum_{i = 1}^{m} λ_{i} ln (λ_{i}) = - tr [ρ ln (ρ)]

We consider

S (ρ)

as a measure of the randomness or uncertainty of

ρ

and smaller values of

S (ρ)

indicate more information content. For example,

ρ

is the completely random state

I / n

, where I is the identity operator, if and only if

S (ρ) = ln (n)

and

ρ

is a pure state if and only if

S (ρ) = 0

. Moreover, it is well-known that

0 \leq S (ρ) \leq ln (n)

for all

ρ \in S (H)

. The following properties of S are well-known [4,6,8]:

\begin{matrix} S (U ρ U^{*}) & = S (ρ) when U is unitary \\ S (ρ_{1} \otimes ρ_{2}) & = S (ρ_{1}) + S (ρ_{2}) \\ \sum μ_{i} S (ρ_{i}) & \leq S (\sum μ_{i} ρ_{i}) \leq \sum μ_{i} S (ρ_{i}) - \sum μ_{i} ln (μ_{i}) \end{matrix}

where

0 \leq μ_{i} = 1

with

\sum μ_{i} = 1

.

An operator

a \in L (H)

that satisfies

0 \leq a \leq I

is called an effect [2,3,9,10]. We think of an effect a as a two-outcome yes-no measurement. If a measurement of a results in outcome yes we say that a occurs and if it results in outcome no then a does not occur. The effect

a^{'} = I - a

is the complement of a and

a^{'}

occurs if and only if a does not occur. We denote the set of effects by

E (H)

. If

a \in E (H)

and

ρ \in S (H)

then

0 \leq tr (ρ a) \leq 1

and we interpret

tr (ρ a)

as the probability that a occurs when the system is in state

ρ

. If

a \neq 0

we define the

ρ

-entropy of a to be

S_{a} (ρ) = - tr (ρ a) ln [\frac{tr (ρ a)}{tr (a)}]

(1)

We interpret

S_{a} (ρ)

as the amount of uncertainty that the system is in state

ρ

resulting from a measurement of a. The smaller

S_{a} (ρ)

is, the more information a measurement of a gives about

ρ

. Such information is useful for state discrimination problems [2,3,4,5].

If

ρ

is the completely random state

I / n

then (1) becomes

S_{a} (I / n) = - tr (I a / n) ln [\frac{tr (I a / n)}{tr (a)}] = - \frac{1}{n} tr (a) ln (\frac{1}{n}) = \frac{tr (a)}{n} ln (n)

Since

tr (a) \leq n

we conclude that

S_{a} (I / n) \leq S (I / n)

for all

a \in E (H)

. Another extreme case is when

a = λ I

for

0 < λ \leq 1

. We then have for any

ρ \in S (H)

that

S_{λ I} (ρ) = - tr (ρ λ I) ln [\frac{tr (ρ λ I)}{tr (λ I)}] = - λ ln [\frac{λ}{λ tr (I)}] = λ ln (n)

Thus, as

λ

gets smaller, the more information we gain.

A real-valued function with domain

D (f)

, an interval in

R

, is strictly convex if for any

x_{1}, x_{2} \in D (f)

with

x_{1} \neq x_{2}

and

0 < λ < 1

we have

f [λ x_{1} + (1 - λ) x_{2}] < λ f (x_{1}) + (1 - λ) f (x_{2})

If the opposite inequality holds, then f is strictly concave. It is clear that f is strictly convex if and only if

- f

is strictly concave. Of special importance in this work are the strictly convex functions

- ln x

and

x ln x

. We shall frequently employ Jensen’s theorem which says: if f is strictly convex and

0 \leq μ_{i} \leq 1

with

\sum_{i = 1}^{m} μ_{i} = 1

, then

f (\sum_{i = 1}^{m} μ_{i} x_{i}) \leq \sum_{i = 1}^{m} μ_{i} f (x_{i})

Moreover, we have equality if and only if

x_{i} = x_{j}

for all

i, j = 1, 2, \dots, m

[1].

Theorem 1.

If

ρ \in S (H)

with nonzero eigenvalues

λ_{i}

,

i = 1, 2, \dots, m

, and

a \in E (H)

with

tr (ρ a) \neq 0

, then

- \sum_{i} tr (P_{i} a) λ_{i} ln (λ_{i}) \leq S_{a} (ρ) \leq ln [\frac{tr (a)}{tr (ρ a)}]

where

ρ = \sum_{i} λ_{i} P_{i}

is the spectral decomposition of ρ. Moreover,

S_{a} (ρ) = ln [tr (a) / tr (ρ a)]

if and only if

tr (ρ a) = 1

in which case

S_{a} (ρ) = l n [tr (a)]

and if

S_{a} (ρ) = - \sum_{i} tr (P_{i} a) λ_{i} ln (λ_{i})

(2)

then

tr (P_{i} a) = tr (P_{j} a)

for all

i, j = 1, 2, \dots, m

and

S_{a} (ρ) = (tr (a) / m) S (ρ)

while if

tr (P_{i} a) = tr (P_{j} a)

for all

i, j = 1, 2, \dots m

then

S_{a} (ρ) = (tr (a) / m) ln (m)

.

Proof.

Letting

μ_{j} = tr (P_{j} a) / tr (a)

,

j = 1, 2, \dots, m

, we have that

0 \leq μ_{j} \leq 1

and

\sum_{j} μ_{j} = 1

. Since

- x ln (x)

is strictly concave we obtain

\begin{matrix} S_{a} (ρ) & = - tr (ρ a) ln [\frac{tr (ρ a)}{tr (a)}] = - tr (\sum_{i} λ_{i} P_{i} a) ln [\frac{tr (\sum_{j} λ_{j} P_{j} a)}{tr (a)}] \\ = - \sum λ_{i} tr (P_{i} a) ln (\sum_{j} λ_{j} μ_{j}) = tr (a) [- \sum_{i} λ_{i} μ_{i} (\sum_{j} λ_{j} μ_{j})] \\ \geq - tr (a) \sum_{i} μ_{i} λ_{i} ln (λ_{i}) = - tr (a) \sum_{i} \frac{tr (P_{i} a)}{tr (a)} λ_{i} ln (λ_{i}) \\ = - \sum_{i} tr (P_{i} a) λ_{i} ln (λ_{i}) \end{matrix}

Since

tr (ρ a) = tr (a^{1 / 2} ρ a^{1 / 2}) \leq tr (ρ) = 1

we have that

S_{a} (ρ) = tr (ρ a) ln [\frac{tr (a)}{tr (ρ a)}] \leq ln [\frac{tr (a)}{tr (ρ a)}]

If

tr (ρ a) = 1

, then

S_{a} (ρ) = - tr (ρ a) ln [\frac{tr (ρ a)}{tr (ρ a)}] = - ln [\frac{1}{tr (a)}] = ln [tr (a)]

Conversely, if

S_{a} (ρ) = ln [tr (a) / tr (ρ a)]

, then clearly

tr (ρ a) = 1

. If (2) holds, then we have equality for Jensen’s inequality. Hence,

tr (P_{i} a) = tr (P_{j} a)

for all

i, j = 1, 2, \dots, m

. Since

tr (a) = \sum_{i} tr (P_{i} a) = m tr (P_{i} a)

we conclude that

S_{a} (ρ) = - tr (P_{1} a) \sum_{i} λ_{i} ln (λ_{i}) = \frac{tr (a)}{m} S (ρ)

Finally, suppose

tr (P_{i} a) = tr (P_{j} a)

for all

i, j = 1, 2, \dots, m

. Then

tr (a) = \sum_{i} tr (P_{i} a) = m tr (P_{1} a)

We conclude that

\begin{matrix} S_{a} (ρ) & = - tr (P_{1} a) \sum_{i} λ_{i} ln [\sum_{j} λ_{j} \frac{tr (P_{1} a)}{tr (a)}] = - tr (P_{1} a) \sum_{i} λ_{i} ln (\sum_{j} λ_{j} \frac{1}{m}) \\ = - tr (P_{1} a) \sum_{i} λ_{i} ln (\frac{1}{m}) = \frac{tr (a)}{m} ln (m) e \end{matrix}

□

For

a, b \in E (H)

we write

a ⊥ b

if

a + b \in E (H)

.

Theorem 2.

If

a ⊥ b

, then

S_{a + b} (ρ) \geq S_{a} (ρ) + S_{b} (ρ)

for all

ρ \in S (H)

. Moreover,

S_{a + b} (ρ) = S_{a} (ρ) + S_{b} (ρ)

if and only if

tr (b) tr (ρ a) = tr (a) tr (ρ b)

.

Proof.

Since

- x ln x

is concave, letting

λ_{1} = tr (a) / [tr (a) + tr (b)]

,

λ_{2} = tr (b) / [tr (a) + tr (b)]

,

x_{1} = tr (ρ a) / tr (a)

,

x_{2} = tr (ρ b) / tr (b)

we obtain

\begin{matrix} S_{a + b} (ρ) & = - tr [ρ (a + b)] ln \{\frac{tr [ρ (a + b)]}{tr (a + b)}\} \\ = - tr (a + b) [\frac{tr (ρ a) + tr (ρ b)}{tr (a + b)}] ln [\frac{tr (ρ a) + tr (ρ b)}{tr (a + b)}] \\ = - tr (a + b) (λ_{1} x_{1} + λ_{2} x_{2}) ln (λ_{1} x_{1} + λ_{2} x_{2}) \\ \geq - tr (a + b) [λ_{1} x_{1} ln (x_{1}) + λ_{2} x_{2} ln (x_{2})] \\ = - tr (ρ a) ln [\frac{tr (ρ a)}{tr (a)}] - tr (ρ b) ln [\frac{tr (ρ b)}{tr (b)}] = S_{a} (ρ) + S_{b} (ρ) \end{matrix}

We have equality if and only if

x_{1} = x_{2}

which is equivalent to

tr (b) tr (ρ a) = tr (a) tr (ρ b)

. □

Corollary 1.

S_{a} (ρ) + S_{a^{'}} (ρ) \leq ln (n)

and

S_{a} (ρ) + S_{a^{'}} (ρ) = ln (n)

if and only if

tr (a) = n tr (ρ a)

.

Proof.

Applying Theorem 2 we obtain

S_{a} (ρ) + S_{a^{'}} (ρ) \leq S_{a + a^{'}} (ρ) = S_{I} (ρ) = ln (n)

\begin{matrix} We have equality & \Leftrightarrow tr (a^{'}) tr (ρ a) = tr (a) tr (ρ a^{'}) \\ \Leftrightarrow [n - tr (a)] tr (ρ a) = tr (a) [1 - tr (ρ a)] \\ \Leftrightarrow tr (a) = n tr (ρ a) e \end{matrix}

□

Corollary 2.

S_{a + b} (ρ) \geq S_{a} (ρ), S_{b} (ρ)

.

Corollary 3.

If

a \leq b

, then

S_{a} (ρ) \leq S_{b} (ρ)

for all

ρ \in S (H)

.

Proof.

If

a \leq b

, then

b = a + c

for

c = b - a \in E (H)

. Hence,

S_{b} (ρ) = S_{a + c} (ρ) \geq S_{a} (ρ) + S_{c} (ρ) \geq S_{a} (ρ)

for every

ρ \in S (H)

. □

Applying Theorem 2 and induction we obtain the following.

Corollary 4.

If

a_{1} + a_{2} + \dots + a_{m} \leq I

, then

S_{\sum a_{i}} (ρ) \geq \sum S_{a_{i}} (ρ)

. Moreover, we have equality if and only if

tr (a_{j}) tr (ρ a_{i}) = tr (a_{i}) tr (ρ a_{j})

for all

i, j = 1, 2, \dots, m

.

Notice that

E (H)

is a convex set in the sense that if

a_{i} \in E (H)

and

0 \leq λ_{i} \leq 1

with

\sum_{i = 1}^{m} λ_{i} = 1

, then

\sum λ_{i} a_{i} \in E (H)

.

Corollary 5.

(i) If

0 < λ \leq 1

and

a \in E (H)

, then

S_{λ a} (ρ) = λ S_{a} (ρ)

for all

ρ \in S (H)

. (ii) If

0 < λ_{i} \leq 1

,

a_{i} \in E (H)

, with

\sum_{i = 1}^{m} λ_{i} = 1

, then

S_{\sum λ_{i} a_{i}} (ρ) \leq \sum λ_{i} S_{a_{i}} (ρ)

for all

ρ \in S (H)

. We have equality if and only if

tr (a_{j}) tr (ρ a_{i}) = tr (a_{i}) tr (ρ a_{j})

for all

i, j = 1, 2, \dots, m

.

Proof.

(i) We have that

S_{λ a} (ρ) = - tr (ρ λ a) ln [\frac{tr (ρ λ a)}{tr (λ a)}] = - tr (ρ a) ln [\frac{λ tr (ρ a)}{λ tr (a)}] = λ S_{a} (ρ)

(ii) Applying (i) and Corollary 4 gives

S_{\sum λ_{i} a_{i}} (ρ) \geq \sum S_{λ_{i} a_{i}} (ρ) = \sum λ_{i} S_{a_{i}} (ρ)

together with the equality condition. □

As with

E (H)

,

S (H)

is a convex set and we have the following.

Theorem 3.

If

0 < λ_{i} \leq 1

ρ_{i} \in S (H)

,

i = 1, 2, \dots, m

, with

\sum_{i = 1}^{m} λ_{i} = 1

, then

S_{a} (\sum λ_{i} ρ_{i}) \geq \sum λ_{i} S_{a} (ρ_{i})

for all

a \in E (H)

. We have equality if and only if

tr (ρ_{i} a) = tr (ρ_{j} a)

for all

i, j = 1, 2, \dots, m

.

Proof.

Letting

x_{i} = tr (ρ_{i} a) / tr (a)

, since

- x ln x

is concave, we obtain

\begin{matrix} S_{a} (\sum λ_{i} ρ_{i}) & = - tr (\sum λ_{i} ρ_{i} a) ln [\frac{tr (\sum λ_{i} ρ_{i} a)}{tr (a)}] \\ = - tr (a) \sum λ_{i} \frac{tr (ρ_{i} a)}{tr (a)} ln [\frac{\sum λ_{i} tr (ρ_{i} a)}{tr (a)}] \\ = tr (a) [- \sum λ_{i} x_{i} ln (\sum λ_{j} x_{j})] \geq - tr (a) \sum λ_{i} x_{i} ln (x_{i}) \\ = - tr (a) \sum λ_{i} \frac{tr (ρ_{i} a)}{tr (a)} ln [\frac{tr (ρ_{i} a)}{tr (a)}] = - \sum λ_{i} tr (ρ_{i} a) ln [\frac{tr (ρ_{i} a)}{tr (a)}] \\ = \sum λ_{i} S_{a} (ρ_{i}) \end{matrix}

We have equality if and only if

x_{i} = x_{j}

which is equivalent to

tr (ρ_{i} a) = tr (ρ_{j} a)

for all

i, j = 1, 2, \dots, m

. □

Theorem 4.

If

a_{i} \in E (H_{i})

,

ρ_{i} \in S (H_{i})

,

i = 1, 2

, then

S_{a_{1} \otimes a_{2}} (ρ_{1} \otimes ρ_{2}) = tr (ρ_{2} a_{2}) S_{a_{1}} (ρ_{1}) + tr (ρ_{1} a_{1}) S_{a_{2}} (ρ_{2}) \leq S_{a_{1}} (ρ_{1}) + S_{a_{2}} (ρ_{2}) .

Proof.

This follows from

\begin{matrix} S_{a_{1} \otimes a_{2}} (ρ_{1} \otimes ρ_{2}) & = - tr (ρ_{1} \otimes ρ_{2} a_{1} \otimes a_{2}) ln [\frac{tr (ρ_{1} \otimes ρ_{2} a_{1} \otimes a_{2})}{tr (a_{1} \otimes a_{2})}] \\ = - tr (ρ_{1} a_{1}) tr (ρ_{2} a_{2}) ln [\frac{tr (ρ_{1} a_{1}) tr (ρ_{2} a_{2})}{tr (a_{1}) tr (a_{2})}] \\ = - tr (ρ_{1} a_{1}) tr (ρ_{2} a_{2}) \{ln [\frac{tr (ρ_{1} a_{1})}{tr (a_{1})}] + ln [\frac{tr (ρ_{2} a_{2})}{tr (a_{2})}]\} \\ = tr (ρ_{2} a_{2}) S_{a_{1}} (ρ_{1}) + tr (ρ_{1} a_{1}) S_{a_{2}} (ρ_{2}) \leq S_{a_{1}} (ρ_{1}) + S_{a_{2}} (ρ_{2}) e \end{matrix}

□

An operation on H is a completely positive linear map

I : L (H) \to L (H)

such that

tr [I (A)] \leq tr (A)

for all

A \in L (H)

[2,3,6,9,10]. If

I

is an operation we define the dual of

I

to be the unique linear map

I^{*} : L (H) \to L (H)

that satisfies

tr [I (A) B] = tr [A I^{*} (B)]

for all

A, B \in L (H)

. If

a \in E (H)

then for any

ρ \in S (H)

we have

0 \leq tr [I (ρ) a] \leq 1

and it follows that

I^{*} (a) \in E (H)

. We say that

I

measures

a \in E (H)

if

tr [I (ρ)] = tr (ρ a)

for all

ρ \in S (H)

. If

I

measures a we define the

I

-sequential product

a \circ b = I^{*} (b)

for all

b \in E (H)

[12,13]. Although

a \circ b

depends on the operation used to measure a we do not include

I

in the notation for simplicity. We interpret

a \circ b

as the effect that results from first measuring a using

I

and then measuring b.

Theorem 5.

(i) If

b ⊥ c

, then

a \circ (b + c) = a \circ b + a \circ c

. (ii)

a \circ I = a

. (iii)

a \circ b \leq a

for all

b \in E (H)

. (iv)

S_{a \circ b} (ρ) \leq S_{a} (ρ)

for all

ρ \in S (H)

.

Proof.

(i) For every

ρ \in S (H)

we obtain

\begin{matrix} tr [ρ a \circ (b + c)] & = tr [ρ I^{*} (b + c)] = tr [I (ρ) (b + c)] = tr [I (ρ) b] + tr [I (ρ) c] \\ = tr [ρ I^{*} (b)] + tr [ρ I^{*} (c)] = tr [ρ a \circ b] + tr [ρ a \circ c] \\ = tr [ρ (a \circ b + a \circ c)] \end{matrix}

Hence,

a \circ (b + c) = a \circ b + a \circ c

. (ii) For all

ρ \in S (H)

we have

tr (ρ a \circ I) = tr [ρ I^{*} (I)] = tr [I (ρ) I] = tr [I (ρ)] = tr (ρ a)

Hence,

a \circ I = a

. (iii) By (i) and (ii) we have

a \circ b + a \circ b^{'} = a \circ (b + b^{'}) = a \circ I = a

It follows that

a \circ b \leq a

. (iv) Since

a \circ b \leq a

, by Corollary 3 we obtain

S_{a \circ b} (ρ) \leq S_{a} (ρ)

for all

ρ \in S (H)

. □

Theorem 5(iv) shows that

a \circ b

gives more information than a about

ρ

. We can continue this process and make more measurements as follows. If

I^{i}

measures

a^{i}

,

i = 1, 2, \dots, m

, we have

a^{1} \circ a^{2} \circ \dots \circ a^{m} = {(I^{1})}^{*} {(I^{2})}^{*} \dots {(I^{m - 1})}^{*} (a^{m})

and it follows from Theorem 5(iv) that

S_{a^{1} \circ a^{2} \circ \dots \circ a^{m}} (ρ) \leq S_{a^{1} \circ a^{2} \circ \dots \circ a^{m - 1}} (ρ)

Notice that the probability of occurrence of the effect

a^{1} \circ a^{2} \circ \cdot \circ a^{m}

in state

ρ

is

\begin{matrix} tr (ρ a^{1} \circ a^{2} \circ \dots \circ a^{m}) & = tr [ρ {(I^{1})}^{*} {(I^{2})}^{*} \dots {(I^{m - 1})}^{*} (a^{m})] \\ = tr [I^{m - 1} I^{m - 2} \dots I^{1} (ρ) a^{m}] \end{matrix}

Thus, we begin with the input state

ρ

, then measure

a^{1}

using

I^{1}

, then measure

a^{2}

using

I^{2}, \dots

and finally measuring

a^{m}

.

Example 1.

1 For

a \in E (H)

we define the Lüders operation

L^{a} (A) = a^{1 / 2} A a^{1 / 2}

[14]. Since

tr [A {(L^{a})}^{*} (B)] = [L^{a} (A) B] = tr [a^{1 / 2} A a^{1 / 2} B] = tr (A a^{1 / 2} B a^{1 / 2})

we have

{(L^{a})}^{*} (B) = a^{1 / 2} B a^{1 / 2}

so

{(L^{a})}^{*} = L^{a}

. We have that

L^{a}

measures a because

tr [L^{a} (ρ)] = tr (a^{1 / 2} ρ a^{1 / 2}) = tr (ρ a)

for every

ρ \in S (H)

. We conclude that the

L^{a}

sequential product is

a \circ b = {(L^{a})}^{*} (b) = a^{1 / 2} b a^{1 / 2}

We also have that

\begin{matrix} S_{a \circ b} (ρ) & = - tr (ρ a \circ b) ln [\frac{tr (ρ a \circ b)}{tr (a \circ b)}] = - tr (ρ a^{1 / 2} b a^{1 / 2}) ln [\frac{tr (ρ a^{1 / 2} b a^{1 / 2})}{tr (a^{1 / 2} b a^{1 / 2})}] \\ = - tr (a \circ ρ b) ln [\frac{tr (a \circ ρ b)}{tr (a b)}] . \end{matrix}

Example 2.

2 For

a \in E (H)

,

α \in S (H)

we define the Holevo operation [15]

H^{(a, α)} (A) = tr (A a) α

. Since

\begin{matrix} tr [A {(H^{(a, α)})}^{*} (B)] & = tr [H^{(a, α)} (A) B] = tr [tr (A a) α B] = tr (A a) tr (α B) \\ = tr [A tr (α B) a] \end{matrix}

we have

{(H^{(a, α)})}^{*} (B) = tr (α B) a

. We have

H^{(a, α)}

measures a because

tr [H^{(a, α)} (ρ)] = tr (ρ a)

for every

ρ \in S (H)

. We conclude that the

H^{(a, α)}

sequential product is

a \circ b = {(H^{(a, α)})}^{*} (b) = tr (α b) a

We also have that

S_{a \circ b} (ρ) = - tr (α b) tr (ρ a) ln [\frac{tr (ρ a)}{tr (a)}] = tr (α b) S_{a} (ρ)

If

a_{i} \in E (H)

,

i = 1, 2, \dots, m

, and we measure

a_{i}

with operations

H^{(a_{i}, α_{i})}

,

i = 1, 2, \dots, m - 1

, then

\begin{matrix} a_{1} \circ a_{2} \circ \dots \circ a_{m} & = a_{1} \circ (a_{2} \circ \dots \circ a_{m}) = tr (α_{1} a_{2} \circ \dots \circ a_{m}) a_{1} \\ = tr [α_{1} tr (α_{2} a_{3} \circ \dots \circ a_{m}) a_{2}] a_{1} \\ = tr (α_{2} a_{3} \circ \dots \circ a_{m}) tr (α_{1} a_{2}) a_{1} \\ ⋮ \\ = tr (α_{m - 1} a_{m}) tr (α_{m - 2} a_{m - 1}) \dots tr (α_{1} a_{2}) a_{1} \end{matrix}

Moreover, it follows from Corollary 5(i) that

S_{a_{1} \circ \dots \circ a_{m}} (ρ) = tr (α_{m - 1} a_{m}) tr (α_{m - 2} a_{m - 1}) \dots tr (α_{1} a_{2}) S_{a_{1}} (ρ)

for all

ρ \in S (H)

.

3. Entropy of Observables and Instruments

We now extend our work on entropy of effects to entropy of observables and instruments. An observable on H is a finite collection of effects

A = \{A_{x} : x \in Ω_{A}\}

,

A_{x} \neq 0

, where

\sum_{x \in Ω_{A}} A_{x} = I

[2,3,9]. The set

Ω_{A}

is called the outcome space of A. The effect

A_{x}

occurs when a measurement of A results in the outcome x. If

ρ \in S (H)

, then

tr (ρ A_{x})

is the probability that outcome x results from a measurement of A when the system is in state

ρ

. If

Δ \subseteq Ω_{A}

, then

Φ_{ρ}^{A} (Δ) = \sum_{x \in Δ} tr (ρ A_{x})

is the probability that A has an outcome in

Δ

when the system is in state

ρ

and

Φ_{ρ}^{A}

is called the distribution of A. We also use the notation

A (Δ) = \sum \{A_{x} : x \in Δ\}

so

Φ_{ρ}^{A} (Δ) = tr [ρ A (Δ)]

for all

Δ \subseteq Ω_{A}

. In this way, an observable is a positive operation-valued measure (POVM). We say that an observable A is sharp if

A_{x}

is a projection on H for all

x \in Ω_{A}

and A is atomic if

A_{x}

is a one-dimensional projection for all

x \in Ω_{A}

.

If A is an observable and

ρ \in S (H)

the

ρ

-entropy of A is

S_{A} (ρ) = \sum S_{A_{x}} (ρ)

where the sum is over the

x \in Ω_{A}

such that

tr (ρ A_{x}) \neq 0

. Then

S_{A} (ρ)

is a measure of the information that a measurement of A gives about

ρ

. The smaller

S_{A} (ρ)

is, the more information given. Notice that if A is sharp, then

tr (A_{x}) = dim (A_{x})

and if A is atomic, then

S_{A} (ρ) = - \sum_{x} tr (ρ A_{x}) ln [tr (ρ A_{x})]

There are two interesting extremes for

S_{A} (ρ)

. If

ρ

has spectral decomposition

ρ = \sum_{i = 1}^{m} λ_{i} P_{i}

and A is the observable

A = \{P_{i} : i = 1, 2, \dots, m\}

, then

S_{A} (ρ) = - \sum_{i} tr (ρ P_{i}) ln [tr (ρ P_{i})] = - \sum λ_{i} ln (λ_{i}) = S (ρ)

As we shall see, this gives the minimum entropy (most information). For the completely random state

I / n

and any observable A we obtain

\begin{matrix} S_{A} (I / n) & = - \sum_{x} \frac{tr (A_{x})}{n} ln [\frac{tr (A_{x}) / n}{tr (A_{x})}] = - \frac{1}{n} \sum_{x} tr (A_{x}) ln (\frac{1}{n}) \\ = \frac{ln (n)}{n} \sum_{x} tr (A_{x}) = \frac{ln (n)}{n} tr (I) = ln (n) \end{matrix}

(3)

We shall also see that this gives the maximum entropy (least information).

Theorem 6.

For any observable A and

ρ \in S (H)

we have

S (ρ) \leq S_{A} (ρ) \leq ln (n)

Proof.

Applying Theorem 1 we obtain

\begin{matrix} S_{A} (ρ) & = \sum_{x \in Ω_{A}} S_{A_{x}} (ρ) \geq - \sum_{x \in Ω_{A}} \sum_{i} tr (P_{i} A_{x}) λ_{i} ln (λ_{i}) \\ = - \sum_{i} tr (P_{i} \sum_{x \in Ω_{A}} A_{x}) λ_{i} ln (λ_{i}) \\ = - \sum_{i} tr (P_{i}) λ_{i} ln (λ_{i}) = - \sum_{i} λ_{i} ln (λ_{i}) = S (ρ) \end{matrix}

Since

ln (x)

is concave and

tr (ρ A_{x}) > 0

,

\sum_{x} tr (ρ A_{x}) = 1

we have by Jensen’s inequality

\begin{matrix} S_{A} (ρ) & = \sum_{x} tr (ρ A_{x}) ln [\frac{tr (A_{x})}{tr (ρ A_{x})}] \leq ln [\sum_{x} tr (ρ A_{x}) \frac{tr (A_{x})}{tr (ρ A_{x})}] \\ = ln [\sum_{x} tr (A_{x})] = ln [tr (I)] = ln (n) e \end{matrix}

□

An observable A is trivial if

A_{x} = λ_{x} I

,

0 < λ_{x} \leq 1

,

\sum λ_{x} = 1

.

Corollary 6.

(i)

S_{A} (ρ) = ln (n)

if and only if

tr (A_{x}) tr (ρ A_{y}) = tr (A_{y}) tr (ρ A_{x})

for all

x, y \in Ω_{A}

. (ii) A is trivial if and only if

S_{A} (ρ) = ln (n)

for all

ρ \in S (H)

. (iii)

ρ = I / n

if and only if

S_{A} (ρ) = ln (n)

for all observables A. (iv)

S (ρ) = ln (n)

if and only if

ρ = I / n

.

Proof.

(i) This follows from the proof of Theorem 6 because this is the condition for equality in Jensen’s inequality. (ii) Suppose A is trivial with

A_{x} = λ_{x} I

. Then for every

ρ \in S (H)

we have

S_{A} (ρ) = - \sum_{x} tr (ρ λ_{x} I) ln [\frac{tr (ρ λ_{x} I)}{tr (λ_{x} I)}] = - \sum_{x} λ_{x} ln (\frac{λ_{x}}{n λ_{x}}) = ln (n) \sum_{x} λ_{x} = ln (n)

Conversely, suppose

S_{A} (ρ) = ln (n)

for all

ρ \in S (H)

. By (i) we have that

tr (A_{x}) tr (ρ A_{y}) = tr (A_{y}) tr (ρ A_{x})

for all

ρ \in S (H)

. It follows that

〈ϕ, A_{y} ϕ〉 = 〈ϕ, A_{x} ϕ〉 \frac{tr (A_{y})}{tr (A_{x})}

for every

ϕ \in H

,

ϕ \neq 0

. Hence,

A_{y} = (tr (A_{y})) / (tr (A_{x})) A_{x}

so that

I = \sum_{y} A_{y} = \sum_{y} \frac{tr (A_{y})}{tr (A_{x})} A_{x} = \frac{n}{tr (A_{x})} A_{x}

We conclude that

A_{x} = (tr (A_{x})) / n I

for all

x \in Ω_{A}

so A is trivial. (iii) If

ρ = I / n

, we have shown in (3) that

S_{A} (ρ) = ln (n)

for all observables A. Conversely, if

S_{A} (ρ) = ln (n)

for every observable A, as before, we have

tr (A_{x}) tr (ρ A_{y}) = tr (A_{y}) tr (ρ A_{x})

for every observable A. Letting

A_{x}

be the observable given by the spectral decomposition

ρ = \sum λ_{x} A_{x}

where A is atomic, we conclude that

λ_{x} = λ_{y}

for all

x, y \in Ω_{A}

. Hence,

λ_{x} = 1 / n

and

ρ = \sum (1 / n) A_{x} = I / n

. (iv)If

S (ρ) = ln (n)

, by Theorem 6,

S_{A} (ρ) = ln (n)

for every observable A. Applying (iii),

ρ = I / n

. Conversely, if

ρ = I / n

, then

S (ρ) = - \sum_{i = 1}^{n} \frac{1}{n} ln (\frac{1}{n}) = - ln (\frac{1}{n}) = ln (n) e

□

We now extend Corollary 5(ii) and Theorem 3 to observables. If

A^{i} = \{A_{x}^{i} : x \in Ω\}

are observables with the same outcome space

Ω

,

i = 1, 2, \dots, m

, and

0 < λ_{i} \leq 1

with

\sum_{i = 1}^{m} λ_{i} = 1

, then the observable

A = \{A_{x} : x \in Ω\}

where

A_{x} = \sum_{i = 1}^{m} λ_{i} A_{x}^{i}

is called a convex combination of the

A^{i}

[12].

Theorem 7.

(i) If A is a convex combination of

A^{i}

,

i = 1, 2, \dots, m

, then for all

ρ \in S (H)

we have

S_{A} (ρ) \geq \sum_{i = 1}^{m} λ_{i} S_{A^{i}} (ρ)

(ii) If

0 < λ_{i} \leq 1

with

\sum_{i = 1}^{m} λ_{i} = 1

,

ρ_{i} \in S (H)

,

i = 1, 2, \dots, m

, and A is an observable, then

S_{A} (\sum_{i} λ_{i} ρ_{i}) \geq \sum_{i} λ_{i} S_{A} (ρ_{i})

Proof.

(i) Applying Corollary 5(ii) gives

\begin{matrix} S_{A} (ρ) & = \sum_{x} S_{A_{x}} (ρ) = \sum_{x} S_{\sum λ_{i} A_{x}^{i}} (ρ) \geq \sum_{x} \sum_{i} λ_{i} S_{A_{x}^{i}} (ρ) \\ = \sum_{i} λ_{i} \sum_{x} S_{A_{x}^{i}} (ρ) = \sum_{i} λ_{i} S_{A^{i}} (ρ) \end{matrix}

(ii) Applying Theorem 3 gives

\begin{matrix} S_{A} (\sum_{i} λ_{i} ρ_{i}) & = \sum_{x} S_{A_{x}} (\sum_{i} λ_{i} ρ_{i}) \geq \sum_{x} \sum_{i} λ_{i} S_{A_{x}} (ρ_{i}) \\ = \sum_{i} λ_{i} \sum_{x} S_{A_{x}} (ρ_{i}) = \sum_{i} λ_{i} S_{A} (ρ_{i}) e \end{matrix}

□

We say that an observable B is a coarse-graining of an observable A if there exists a surjection

f : Ω_{A} \to Ω_{B}

such that

B_{y} = \sum \{A_{x} : f (x) = y\} = A [f^{- 1} (y)]

for every

y \in Ω_{B}

[2,12,16].

Theorem 8.

If B is a coarse-graining of A, then

S_{B} (ρ) \geq S_{A} (ρ)

for all

ρ \in S (H)

.

Proof.

Let

B_{y} = A [f^{- 1} (y)]

for all

y \in Ω_{B}

and let

p_{y} = tr (ρ B_{y})

,

p_{x}^{'} = tr (ρ A_{x})

for all

y \in Ω_{b}

,

x \in Ω_{A}

. Then

p_{y} = tr (ρ \sum_{f (x) = y} A_{x}) = \sum_{f (x) = y} tr (ρ A_{x}) = \sum_{f (x) = y} p_{x}^{'}

Let

V_{y} = tr (B_{y})

,

V_{x}^{'} = tr (A_{x})

so that

V_{y} = tr \sum (\sum_{f (x) = y} A_{x}) = \sum_{f (x) = y} tr (A_{x}) = \sum_{f (x) = y} V_{x}^{'}

Since

- x ln (x)

is concave, we conclude that

\begin{matrix} S_{B} (ρ) & = - \sum_{y} p_{y} ln (\frac{p_{y}}{V_{y}}) = - \sum_{y} \sum_{f (x) = y} p_{x}^{'} ln [\frac{\sum_{f (x) = y} p_{x}^{'}}{V_{y}}] \\ = - \sum_{y} V_{y} (\sum_{f (x) = y} \frac{p_{x}^{'} V_{x}^{'}}{V_{x}^{'} V_{y}}) ln (\sum_{f (x) = y} \frac{p_{x}^{'} V_{x}^{'}}{V_{x}^{'} V_{y}}) \\ \geq - \sum_{y} V_{y} \sum_{f (x) = y} \frac{V_{x}^{'}}{V_{y}} [\frac{p_{x}^{'}}{V_{x}^{'}} ln (\frac{p_{x}^{'}}{V_{x}^{'}})] = - \sum_{y} \sum_{f (x) = y} p_{x}^{'} ln (\frac{p_{x}^{'}}{V_{x}^{'}}) \\ = - \sum_{x} p_{x}^{'} ln (\frac{p_{x}^{'}}{V_{x}^{'}}) = S_{A} (ρ) e \end{matrix}

□

The equality condition for Jensen’s inequality gives the following.

Corollary 7.

An observable A possesses a coarse-graining

B_{y} = A [f^{- 1} (y)]

with

S_{B} (ρ) = S_{A} (ρ)

for all

ρ \in S (H)

if and only if for every

x_{1}, x_{2} \in Ω_{A}

with

f (x_{1}) = f (x_{2})

we have

tr (A_{x_{2}}) tr (ρ A_{x_{1}}) = tr (A_{x_{1}}) tr (ρ A_{x_{2}})

A trace preserving operation is called a channel. An instrument on H is a finite collection of operations

I = \{I_{x} : x \in Ω\}

such that

\sum_{x \in Ω_{I}} I_{x}

is a channel [2,3,9]. We call

Ω_{I}

the outcome space for

I

. If

I

is an instrument, there exists a unique observable A such that

tr (ρ A_{x}) = tr [I_{x} (ρ)]

for all

x \in Ω_{A} = Ω_{I}

,

ρ \in S (H)

and we say that

I

measures A. Although an instrument measures a unique observable, an observable is measured by many instruments For example, if A is an observable, the corresponding Łüders instrument [14] is defined by

L_{x}^{A} (B) = A_{x}^{1 / 2} B A_{x}^{1 / 2}

for all

B \in L (H)

. Then

L^{A}

is an instrument because

\begin{matrix} tr [\sum_{x} L_{x}^{A} (B)] & = \sum_{x} tr [L_{x}^{A} (B)] = \sum_{x} tr (A_{x}^{1 / 2} B A_{x}^{1 / 2}) = \sum_{x} tr (A_{x} B) \\ = tr (\sum_{x} A_{x} B) = tr (I B) = tr (B) \end{matrix}

for all

B \in L (H)

. Moreover,

L^{A}

measures A because

tr [L_{x}^{A} (ρ)] = tr (A_{x}^{1 / 2} ρ A_{x}^{1 / 2}) = tr (ρ A_{x})

for all

ρ \in S (H)

. Of course, this is related to Example 1. Corresponding to Example 2, we have a Holevo instrument

H^{(A, α)}

where

α_{x} \in S (H)

,

x \in Ω_{A}

and

H_{x}^{(A, α)} (B) = tr (B A_{x}) α_{x}

for all

B \in L (H)

[15]. To show that

H^{(A, α)}

is an instrument we have

\begin{matrix} tr [\sum_{x} H_{x}^{(A, α)} (B)] & = \sum_{x} tr [H_{x}^{(A, α)} (B)] = \sum_{x} tr [tr (B A_{x}) α_{x}] \\ = \sum_{x} tr (B A_{x}) = tr (B \sum_{x} A_{x}) = tr (B) \end{matrix}

Moreover,

H^{(A, α)}

measures A because

tr [H_{x}^{A, α} (ρ)] = tr [(ρ A_{x}) α_{x}] = tr (ρ A_{x}) tr (α_{x}) = tr (ρ A_{x})

Let

A, B

be observables and let

I

be an instrument that measures A. We define the

I

-sequential product

A \circ B

[12,13] by

Ω_{A \circ B} = Ω_{A} \times Ω_{B}

and

A \circ B_{(x, y)} = I_{x}^{*} (B_{y}) = A_{x} \circ B_{y}

Defining

f : Ω_{A \circ B} \to Ω_{A}

by

f (x, y) = x

,we obtain

A \circ B [f^{- 1} (x)] = \sum_{f (x, y) = x} A_{x} \circ B_{y} = \sum_{y \in Ω_{B}} I_{x}^{*} (B_{y}) = I_{α}^{*} (I) = A_{x}

We conclude that A is a coarse-graining of

A \circ B

. Applying Theorem 8 we obtain the following.

Corollary 8.

If

A, B

are observables, the

S_{A \circ B} (ρ) \leq S_{A} (ρ)

for all

ρ \in S (H)

. Equality

S_{A \circ B} (ρ) = S_{A} (ρ)

holds if and only if for every

x \in Ω_{A}

,

y_{1}, y_{2} \in Ω_{B}

we have

\frac{tr (ρ A_{x} \circ B_{y_{1}})}{tr (A_{x} \circ B_{y_{1}})} ln [\frac{tr (ρ A_{x} \circ B_{y_{1}})}{tr (A_{x} \circ B_{y_{1}})}] = \frac{tr (ρ A_{x} \circ B_{y_{2}})}{tr (A_{x} \circ B_{y_{2}})} ln [\frac{tr (ρ A_{x} \circ B_{y_{2}})}{tr (A_{x} \circ B_{y_{2}})}]

Extending this work to more than two observables, let

I^{1}, I^{2}, \dots, I^{m - 1}

be instruments that measure the observables

A^{1}, A^{2}, \dots, A^{m - 1}

, respectively. If

A^{m}

is another observable, we have that

{(A^{1} \circ A^{2} \circ \dots \circ A^{m})}_{(x_{1}, x_{2}, \dots, x_{m})} = {(I_{x_{1}}^{1})}^{*} {(I_{x_{2}}^{2})}^{*} \dots {(I_{x_{m - 1}}^{m - 1})}^{*} (A_{x_{m}}^{m})

The next result follows from Corollary 8.

Corollary 9.

If

A^{1}, A^{2}, \dots, A^{m}

are observables, then

S_{A^{1} \circ A^{2} \circ \dots \circ A^{m}} (ρ) \leq S_{A^{1} \circ A^{2} \circ \dots \circ A^{m - 1}} (ρ)

for all

ρ \in S (H)

.

If

I

is an instrument, let A be the unique observable that

I

measures so

tr [I_{x} (ρ)] = tr (ρ A_{x})

for all

x \in Ω_{I}

and

ρ \in S (H)

. We define the

ρ

-entropy of

I

as

S_{I} (ρ) = S_{A} (ρ)

. Since

A_{x} = I_{x}^{*} (I)

we have

tr (A_{x}) = tr [I_{x}^{*} (I)] = tr [I_{x} (I)]

Hence,

S_{I} (ρ) = S_{A} (ρ) = - \sum_{x} tr (ρ A_{x}) ln [\frac{tr (ρ A_{x})}{tr (A_{x})}] = - \sum_{x} tr [I_{x} (ρ)] ln \{\frac{tr [I_{x} (ρ)]}{tr [I_{x} (I)]}\}

Now let

I^{1}, I^{2}, \dots, I^{m}

be instruments and let

A^{1}, A^{2}, \dots, A^{m}

be the unique observables they measure, respectively. Denoting the composition of two instruments

I, J

by

I \circ J

we have

\begin{matrix} tr [I_{x_{m}}^{m} \circ I_{x_{m - 1}}^{m - 1} \circ \dots \circ I_{x_{1}}^{1} (ρ)] & = tr [ρ {(I_{x_{1}}^{1})}^{*} {(I_{x_{2}}^{1})}^{*} \dots {(I_{x_{m}}^{m})}^{*} (I)] \\ = tr (ρ A_{x_{1}}^{1} \circ A_{x_{2}}^{2} \circ \dots \circ A_{x_{m}}^{m}) \end{matrix}

Hence, the observable measured by

I^{m} \circ I^{m - 1} \circ \dots \circ I^{1}

is

A^{1} \circ A^{2} \circ \dots \circ A^{m}

. It follows that

S_{I^{m} \circ I^{m - 1} \circ \dots \circ I^{1}} (ρ) = S_{A^{1} \circ A^{2} \circ \dots \circ A^{m}} (ρ)

We conclude that Theorems 1, 2 and 3 [1] follow from our results. Moreover, our proofs are simpler since they come from the more basic concept of

ρ

-entropy for effects.

Let

A, B

be observables on H and let

I

be an instrument that measures A. The corresponding sequential product becomes

{(A \circ B)}_{(x, y)} = I_{x}^{*} (B_{y}) = A_{x} \circ B_{y}

The

ρ

-entropy of

A \circ B

has the form

\begin{matrix} S_{A \circ B} (ρ) & = - \sum_{x, y} tr [ρ {(A \circ B)}_{(x, y)}] ln \{\frac{tr [ρ {(A \circ B)}_{(x, y)}]}{tr [{(A \circ B)}_{(x, y)}]}\} \\ = - \sum_{x, y} tr [ρ I_{x}^{*} (B_{y})] ln \{\frac{tr [ρ I_{x}^{*} (B_{y})]}{tr [I_{x}^{*} (B_{y})]}\} \\ = - \sum_{x, y} tr [I_{x} (ρ) B_{y}] ln \{\frac{[I_{x} (ρ) B_{y}]}{tr [I_{x} (I) B_{y}]}\} \end{matrix}

If

L^{A}

is the Lüders instrument

I_{x}^{A} (ρ) = A_{x}^{1 / 2} ρ A_{x}^{1 / 2}

we have

{(A \circ B)}_{(x, y)} = A_{x}^{1 / 2} B_{y} A_{x}^{1 / 2}

and

S_{A \circ B} (ρ) = - \sum_{x, y} tr (A_{x}^{1 / 2} ρ A_{x}^{1 / 2} B_{y}) ln [\frac{tr (A_{x}^{1 / 2} ρ A_{x}^{1 / 2} B_{y})}{tr (A_{x} B_{y})}]

If

H^{(A, α)}

is the Holevo instrument

H_{x}^{(A, α)} (ρ) = tr (ρ A_{x}) α_{x}

,

α_{x} \in S (H)

we obtain

\begin{matrix} S_{A \circ B} (ρ) & = - \sum_{x, y} tr (ρ A_{x}) tr (α_{x} B_{y}) ln [\frac{tr (ρ A_{x}) tr (α_{x} B_{y})}{tr (A_{x}) tr (α_{x} B_{y})}] \\ = - \sum_{x, y} tr (ρ A_{x}) tr (α_{x} B_{y}) ln [\frac{tr (ρ A_{x})}{tr (A_{x})}] \\ = - \sum_{x} tr (ρ A_{x}) ln [\frac{tr (ρ A_{x})}{tr (A_{x})}] = S_{A} (ρ) \end{matrix}

This also follows from Corollary 8 because

\frac{tr (ρ A_{x} \circ B_{y})}{tr (A_{x} \circ B_{y})} = \frac{tr (α_{x} B_{y}) tr (ρ A_{x})}{tr (α_{x} B_{y}) tr (A_{x})} = \frac{(ρ A_{x})}{tr (A_{x})}

If A is an observable on H and B is an observable on K we form the tensor product observable

A \otimes B

on

H \otimes K

given by

{(A \otimes B)}_{(x, y)} = A_{x} \otimes B_{y}

where

Ω_{A \otimes B} = Ω_{A} \times Ω_{B}

[12].

Lemma 1.

If

ρ_{1} \in S (H)

,

ρ_{2} \in S (K)

, then

S_{A \circ B} (ρ_{1} \otimes ρ_{2}) = S_{A} (ρ_{1}) + S_{B} (ρ_{2})

Proof.

From the definition of

A \otimes B

we obtain

\begin{matrix} S_{A \otimes B} (ρ_{1} \otimes ρ_{2}) & = - \sum_{x, y} tr (ρ_{1} \otimes ρ_{2} A_{x} \otimes B_{y}) ln [\frac{tr (ρ_{1} \otimes ρ_{2} A_{x} \otimes B_{y})}{tr (A_{x} \otimes B_{y})}] \\ = - \sum_{x, y} tr (ρ_{1} A_{x}) tr (ρ_{2} B_{y}) ln [\frac{tr (ρ_{1} A_{x}) tr (ρ_{2} B_{y})}{tr (A_{x}) tr (B_{y})}] \\ = - \sum_{x, y} tr (ρ_{1} A_{x}) tr (ρ_{2} B_{y}) ln [\frac{tr (ρ_{1} A_{x})}{tr (A_{x})}] \\ - \sum_{x, y} tr (ρ_{1} A_{x}) tr (ρ_{2} B_{y}) ln [\frac{tr (ρ_{2} B_{y})}{tr (B_{y})}] \\ = - \sum_{x} tr (ρ_{1} A_{x}) ln [\frac{tr (ρ_{1} A_{x})}{tr (A_{x})}] - \sum_{y} tr (ρ_{2} B_{y}) ln [\frac{tr (ρ_{2} B_{y})}{tr (B_{y})}] \\ = S_{A} (ρ_{1}) + S_{B} (ρ_{2}) e \end{matrix}

□

We conclude that A gives more information about

ρ_{1}

than A and B give about

ρ_{1} \otimes ρ_{2}

and similarly for B.

A measurement model [2,3,9] is a 5-tuple

M = (H, K, ν, σ, P)

where H is the system Hilbert space, K is the probe Hilbert space,

ν

is the interaction channel,

σ \in S (K)

is the initial probe state and P is the probe observable on K. We interpret

M

as an apparatus that is employed to measure an instrument and hence an observable. In fact,

M

measures the unique instrument

I

on H given by

I_{x} (ρ) = tr_{K} [ν (ρ \otimes σ) (I \otimes P_{x})]

In this way, a state

ρ \in S (H)

is input into the apparatus and combined with the initial state

σ

of the probe system. The channel

ν

interacts the two states and a measurement of the probe P is performed resulting in outcome x. The outcome state is reduced to H by applying the partial trace over K. Now

I

measures an unique observable A on H that satisfies

tr (ρ A_{x}) = tr [I_{x} (ρ)] = tr [ν (ρ \otimes σ) (I \otimes P_{x})]

(4)

The

ρ

-entropy of

I

becomes

S_{I} (ρ) = S_{A} (ρ) = - \sum_{x} tr (ρ A_{x}) ln [\frac{tr (ρ A_{x})}{tr (A_{x})}]

where

tr (ρ A_{x})

is given by (4). Of course,

S_{I} (ρ) = S_{A} (ρ)

gives the amount of information that a measurement by

M

provides about

ρ

. A closely related concept is the observable

I \otimes P

and

S_{I \otimes P} [ν (ρ \otimes σ)]

also provides the amount of information that a measurement

M

provides about

ρ

. It follows from (4) that the distribution of A in the state

ρ

equals the distribution of

I \otimes P

in the state

ν (ρ \otimes σ)

. We now compare

S_{A} (ρ)

and

S_{I \otimes P} [ν (ρ \otimes σ)]

. Applying (4) gives

\begin{matrix} S_{I \otimes P} & [ν (ρ \otimes σ)] \\ = - \sum_{x} tr [ν (ρ \otimes σ) (I \otimes P_{x})] ln \{\frac{tr [ν (ρ \otimes σ) (I \otimes P_{x})]}{tr (I \otimes P_{x})}\} \\ = - \sum_{x} tr (ρ A_{x}) ln [\frac{tr (ρ A_{x})}{n tr (P_{x})}] = - \sum_{x} tr (ρ A_{x}) ln [\frac{tr (A_{x})}{n tr (P_{x})} \frac{tr (ρ A_{x})}{tr (A_{x})}] \\ = - \sum_{x} tr (ρ A_{x}) ln [\frac{tr (ρ A_{x})}{tr (A_{x})}] - \sum tr (ρ A_{x}) ln [\frac{tr (A_{x})}{n tr (P_{x})}] \\ = S_{A} (ρ) - \sum_{x} tr (ρ A_{x}) ln [\frac{tr (A_{x})}{n tr (P_{x})}] \end{matrix}

It follows that

S_{A} (ρ) \leq S_{I \otimes P} [ν (ρ \otimes σ)]

if and only if

\sum_{x} tr (ρ A_{x}) ln [\frac{tr (A_{x})}{n tr (P_{x})}] \leq 0

(5)

Now (5) may or may not hold depending on A,

ρ

and P. In many cases, P is atomic [2,9] and then

ln [\frac{tr (A_{x})}{n tr (P_{x})}] = ln [\frac{tr (A_{x})}{n}] < 0

so

S_{A} (ρ) \leq S_{I \otimes P} [ν (ρ \otimes σ)]

for all

ρ \in S (H)

. Also, (5) holds if P is sharp.

Funding

This research received no external funding.

Conflicts of Interest

The author declare no conflict of interest.

References

Šafránek, D.; Thingna, J. Quantifying information extraction using generalized quantum measurements. arXiv 2022, arXiv:2007.07246. [Google Scholar]
Heinosaari, T.; Ziman, M. The Mathematical Language of Quantum Theory; Cambridge University Press: Cambridge, UK, 2012. [Google Scholar]
Nielson, M.; Chuang, I. Quantum Computation and Quantum Information; Cambridge University Press: Cambridge, UK, 2000. [Google Scholar]
Ohya, M.; Petz, D. Quantum Entropy and It’s Uses; Springer: Berlin/Heidelberg, Germany, 2004. [Google Scholar]
Šafránek, D.; Aguirre, A.; Schindler, J.; Deutsch, J. A brief introduction to observational entropy. Found. Phys. 2021, 51, 101. [Google Scholar] [CrossRef]
Lindblad, G. Completely positive maps and entropy inequalities. Comm. Math. Phys. 1975, 40, 147–151. [Google Scholar] [CrossRef]
von Neumann, J. Mathematical Foundations of Quantum Mechanics; Princeton University Press: Princeton, NJ, USA, 1955. [Google Scholar]
Wehrl, A. General properties of entropy. Rev. Mod. Phys. 1978, 50, 221. [Google Scholar] [CrossRef]
Busch, P.; Lahti, P.; Mittlestaedt, P. The Quantum Theory of Measurement; Springer: Berlin/Heidelberg, Germany, 1996. [Google Scholar]
Kraus, K. States, Effects and Operations; Springer: Berlin/Heidelberg, Germany, 1983. [Google Scholar]
Šafránek, D.; Deutsch, J.; Aguirre, A. Quantum coarse-grained entropy and thermodynamics. Phys. Rev. A 2019, 99, 010101. [Google Scholar] [CrossRef]
Gudder, S. Combinations of quantum observables and instruments. J. Phys. A Math. Theor. 2021, 54, 364002. [Google Scholar] [CrossRef]
Gudder, S. Sequential products of Quantum measurements. arXiv 2021, arXiv:2108.07925. [Google Scholar] [CrossRef]
Lüders, G. Über due Zustandsänderung durch den Messprozess. Ann. Physik 1951, 6, 322–328. [Google Scholar]
Holevo, A. Probabilistic and Statistical Aspects of Quantum Theory; North-Holland: Amsterdam, The Netherlands, 1982. [Google Scholar]
Gudder, S. Coarse-graining of observables. Quant. Rep. 2022, 4, 401–417. [Google Scholar] [CrossRef]

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Gudder, S. Entropy of Quantum Measurements. Entropy 2022, 24, 1686. https://doi.org/10.3390/e24111686

AMA Style

Gudder S. Entropy of Quantum Measurements. Entropy. 2022; 24(11):1686. https://doi.org/10.3390/e24111686

Chicago/Turabian Style

Gudder, Stanley. 2022. "Entropy of Quantum Measurements" Entropy 24, no. 11: 1686. https://doi.org/10.3390/e24111686

APA Style

Gudder, S. (2022). Entropy of Quantum Measurements. Entropy, 24(11), 1686. https://doi.org/10.3390/e24111686

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Entropy of Quantum Measurements

Abstract

1. Introduction

2. Entropy for Effects

3. Entropy of Observables and Instruments

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI