Quantum Probabilities and Maximum Entropy

Schlatter, Andreas

doi:10.3390/e19070304

Open AccessArticle

Quantum Probabilities and Maximum Entropy

by

Andreas Schlatter

Burghaldeweg 2F, 5024 Küttigen, Switzerland

Entropy 2017, 19(7), 304; https://doi.org/10.3390/e19070304

Submission received: 22 May 2017 / Revised: 14 June 2017 / Accepted: 24 June 2017 / Published: 26 June 2017

Download Versions Notes

Abstract

:

Probabilities in quantum physics can be shown to originate from a maximum entropy principle.

Keywords:

quantum measurement; Born-rule; density operator; symmetry

1. Introduction

Ever since quantum physics was discovered at the beginning of the 20th century, there has been a debate about its interpretation. The mathematical formalism does not allow a direct, descriptive interpretation of quantum-objects in space and time and seems to be underdetermined [1] with regard to its ontological structure. As a consequence, there are multiple interpretations/ontologies available today. A particular topic has always been the role and nature of the probabilities in quantum physics. A cornerstone of quantum theory is the fact that the world is empirically and epistemically probabilistic. This means that agents are able to assign probabilities to future events, which are then empirically tested by multiple trials of experiments on identical systems. We also include the term “epistemic”, because of the fact that, although there are today deterministic models of the quantum realm [2], it seems that their values are knowable only modulo randomly distributed initial conditions. The fact that nature shows probabilistic patterns and that agents can theoretically predict and then empirically find them in experiments is by no means self-evident. We will see that once we have defined how physical properties are represented in the theory, we only need two additional, plausible assumptions on how agents empirically gather data and draw conclusions, to uniquely define the theory. The fact that experimental (statistical) frequencies coincide with the probabilities is built into the theory and thence logically no surprise. Again, what is astounding though, is that nature plays the game and allows such a theory in the first place.

There has been a long debate on how to interpret randomness in quantum physics [3]. There are, at first sight, three different kinds of probabilities. The first category consists of the probabilities, which arise from pure quantum states through the Born-rule [4]. These are sometimes considered as the “true” quantum-probabilities. The second category consists of the weights in mixed states and the third one of the frequencies found in multiple trials of an experiment on identical quantum systems. There naturally arises the quest for an underlying principle, common to all categories. Since agents can choose to do a single experiment only, the frequency definition seems to fall short as the common principle. On the other hand and in view of what we said earlier, it also seems a bold standpoint to say that the probabilities are merely subjective [5]. The fact that nature allows a theory, where agents by experiments can test probabilities arising from that theory, does say something about nature itself, of which agents, admittedly, are a part. A further interpretation, which works for the single trial, is the one of propensities [6]. There is a vast literature on the philosophical question of the true nature of probabilities, which we are unable to cover here [7]. We will add a specific unifying view and argue in this paper that the probabilities in quantum theory can be understood as the result of symmetry under permutations in combination with Laplace’s law of indifference, i.e., a maximum entropy principle.

2. Quantum Physics

The ansatz for the mathematical theory of quantum physics is to represent a physical quantity as a self-adjoint operator

A \in L (H)

in the space of linear operators over a state space

H

, which carries the structure of a complex Hilbert space of some dimension

d \in ℕ

. The values, which this quantity can assume in an experiment, are the corresponding eigenvalues

λ_{k} \in ℝ, k \leq d,

of

A

. We have to find a way to assign probabilities to these eigenvalues, a task which is equivalent to assigning probabilities to the orthogonal projection operators

{Π_{k}}_{k \leq d} \in L (H)

, which project states in

H

to eigenstates corresponding to the

λ_{k}

. If agents assign a probability

p_{0}

to a projection operator

Π_{0}

, which is common to two families of orthogonal projectors,

{Π_{k}}_{k \leq d}, {Γ_{k}}_{k \leq d},

and do corresponding experiments, they would like to be sure that, if they find the frequency

p_{0}

, the results describe the same event

Π_{0}

. This is a non-contextuality condition (note that the set of (conditional) probabilities over realized values is contextual, as a theorem by Kochen–Specker [8] and an example by Hardy [9] show). A famous theorem from Gleason [10] says that to any non-contextual measure

μ

on the sets of projectors

{Π}_{H}

over a Hilbert space

H

of dimension

d \geq 3

, there exists a unique positive semi-definite, self-adjoint operator

ϱ

of trace class one, called density operator, such that

μ (Π) = t r (ϱ Π)

for all

Π

. This is the Born-rule. The theorem defines the appropriate measures as well as the quantum states, which are identified with the density-operators

ϱ

. There is a special class

S (H)

of density-operators, called pure states. Every vector,

| ψ 〉 \in H,

defines a corresponding pure state

ϱ = | ψ 〉 〈 ψ |

, which is the projection operator onto

| ψ 〉 .

The set of density operators,

D (H) \subset L (H)

, is the set of all convex combinations of pure states

S (H) \subset D (H)

. The weights corresponding to the convex combinations are the second category of probabilities mentioned above. As important as Gleason’s theorem of course is, because it technically defines the right quantum-probabilities, it does not say much about their nature/interpretation.

2.1. Probabilities of Mixed States

The simplest case is the second category, namely the probability weights of mixed states. In the mixed case, there are ex-ante probabilities

p_{a} \geq 0, a \leq M, \sum_{a \leq M} p_{a} = 1

, which generate states of the form

ϱ = \sum_{a \leq M} p_{a} ϱ_{a},

(1)

where the

{ϱ_{a} = | ψ_{a} 〉 〈 ψ_{a} |}_{a \leq M}

are (non-necessarily orthogonal) pure states. These probabilities are considered to be of classical type, i.e., uncertainties about a possible set of preparations. Since the rational numbers

ℚ

are dense in

ℝ

, we may assume that, with an arbitrarily small error,

p_{a} = r_{a} / q_{a}, r, q \in ℕ_{+} .

Let

Q = \prod_{a \leq M} q_{a}

and

Q_{a} = \prod_{a^{'} \leq M, a^{'} \neq a} q_{a}

, respectively. We can then set

N = Q

and

m_{a} = Q_{a} r_{a}

to get a number of

N

states

{ϱ_{a_{k}} ≔ ϱ_{a}}_{a \leq M, k \leq m_{a}}

with probabilities

p_{a_{k}} = 1 / N

and aggregate probabilities

p_{a} = m_{a} / N

. This way, the probabilities

p_{a}

clearly reflect the indifference principle resulting from the permutation-symmetry of (1).

2.2. Probabilities of Pure States

We now consider a system

S

represented by a pure state

| ψ 〉 \in H

with resolution in the eigenbasis

| e_{a} 〉, 1 \leq a \leq M,

of a self-adjoint operator

A \in L (H)

,

| ψ 〉 = \sum_{a \leq M} ψ_{a} | e_{a} 〉, ψ_{a} \in ℂ .

We can form the corresponding pure state

ϱ_{ψ} = | ψ 〉 〈 ψ |

with matrix-entries

{(ϱ_{ψ})}_{a a'} = ψ_{a} ψ_{a'}^{*}

. The Born rule then assigns probabilities

p_{a} = \frac{{| ψ_{a} |}^{2}}{{‖ ψ ‖}^{2}},

(2)

to the projectors

ϱ_{a} = | e_{a} 〉 〈 e_{a} |, a \leq M

. Assume that there is an additional system

ℰ

with orthonormal basis states

{| n 〉}_{n \leq N},

which is initially in the base state

ϱ_{0} = | 0 〉 〈 0 |

. A measurement of some state

ϱ

by the probe

ℰ

is an operation

U

on the joint system

ϱ_{j o i n t} = | 0 〉 〈 0 | \otimes ϱ

U (| 0 〉 〈 0 | \otimes ϱ) U^{*},

(3)

where

U

is unitary

U U^{*} = 𝕝

(this follows from the fact that a general interaction evolution

U (t) = e^{- (i / ℏ) H t}

is unitary). A general unitary transformation on a tensor-product, expressed in the respective bases, can be written as a matrix

U = Σ_{a n, a^{'} n^{'}} u_{a n, a^{'} n^{'}} | a 〉 | n 〉 〈 a' | 〈 n' | = Σ_{n n'} A_{n n'} \otimes | n 〉 〈 n' |,

(4)

where the operators

A_{n n^{'}}

are given by

A_{n n^{'}} = \sum_{a a^{'}} u_{n a, n^{'} a^{'}} | a 〉 〈 a^{'} |

. We denote the diagonal sub-block

A_{n 0}

simply by

A_{n}

. Since

U

is unitary, we have

〈 0 | U U^{*} | 0 〉 = Σ_{n} A_{n} A_{n}^{*} = 𝕝 .

(5)

Conversely, we can choose any set of operators

A_{n}

satisfying the resolution of the identity-condition (5) to define a measurement on an initial joint state

ϱ_{j o i n t} = | 0 〉 〈 0 | ⨂ ϱ

. We now have the necessary elements in place to give the main argument.

Assume there is a second system

ℰ

with basis

{| n}_{n ≦ N}

and an observer who would like to know in what state

ϱ_{a} = | e_{a} 〉 〈 e_{a} |

the system

S

is in, by making an appropriate measurement

U

on the joint system

ϱ_{j o i n t} = | 0 〉 〈 0 | ⨂ ϱ_{ψ}

. If that is possible in the first place, then, having no additional knowledge, the observer does not, a priori, know in what state

| n 〉, n \leq N

, the probe will be after the measurement and before observation, leading to permutation-symmetry. Let the underlying pure state

| ψ 〉 \in H

have coefficients

ψ_{a} = \sqrt{m_{a}} e^{i φ_{a}}, m_{a} \in ℕ, φ_{a} \in ℝ

(since the rational numbers

ℚ

are dense in

ℝ

, the choice of

m_{a} \in ℕ

is general enough). The probe

ℰ

can be chosen appropriately coarse-grained (this coarse-graining is first introduced in [11] in the context of many-worlds) such that

N = \sum_{a \leq M} m_{a}

. The observer is after the measurement and before observation in a situation where, by lack of further information, she will by Laplace’s principle of indifference a priori attribute to each outcome

〈 n | U (| 0 〉 〈 0 | ⨂ ϱ_{ψ}) U^{*} | n 〉

equal probability

p_{n} = 1 / N, n \leq N

. This attribution is equivalent to maximizing the entropy function

H (p) = - \sum_{n = 1}^{N} p_{n} \log (p_{n})

. The observer can therefore write down in the spirit of (1) an average of outcomes

\tilde{ϱ} = Σ_{n \leq N} \frac{1}{N} (〈 n | U (| 0 〉 〈 0 | ⨂ ϱ_{ψ}) U^{*} | n 〉) = Σ_{n \leq N} \frac{1}{N} (A_{n} ϱ_{ψ} A_{n}^{*}) .

(6)

For our purpose, we now chose the operators

A_{n}

to be the scaled projectors

{{\tilde{P}}_{a_{k}} ≔ (1 / \sqrt{m_{a}}) P_{a}}_{a \leq M, k \leq m_{a}}

to the basis-states

| e_{a} 〉, a \leq M

. Note that we have replaced the simple-index

n

by the double-index

a_{k}

. This choice is consistent with the demands of a measurement, since the

{\tilde{P}}_{a_{k}}

satisfy (5)

Σ_{n \leq N} {\tilde{P}}_{n}^{*} {\tilde{P}}_{n} = Σ_{a \leq M} Σ_{k \leq m_{a}} {\tilde{P}}_{a_{k}}^{*} {\tilde{P}}_{a_{k}} = Σ_{a \leq M} P_{a}^{*} P_{a} = 𝕝 .

(7)

Therefore, we can write (6) in the following form

\tilde{ϱ} = Σ_{n \leq N} \frac{1}{N} ({\tilde{P}}_{n} ϱ_{ψ} {\tilde{P}}_{n}^{*}) = Σ_{a \leq M} Σ_{k \leq m_{a}} \frac{1}{N} ({\tilde{P}}_{a_{k}}^{*} ϱ_{ψ} {\tilde{P}}_{a_{k}})

= Σ_{a \leq M} \frac{1}{N} (P_{a} ϱ_{ψ} P_{a}^{*}) = Σ_{a \leq M} \frac{m_{a}}{N} ϱ_{a} .

(8)

Comparing Equation (8) with Equation (1), we see that

\tilde{ϱ}

can be viewed as a mixed state with probabilities

p_{a} = \frac{m_{a}}{N} = \frac{{| ψ_{a} |}^{2}}{{‖ ψ ‖}^{2}},

(9)

which is the Born-rule.

Before we turn to consider the frequencies, let’s have a look at composite systems

ϱ_{12} \in D (H \otimes H) .

To show the principle it is sufficient to look at binary systems.) The state

ϱ_{12}

may be mixed or pure and we can apply the findings in a straightforward way. The single components are given by the partial trace

ϱ_{1 / 2} = t r_{2 / 1} (ϱ_{12})

. If the state has the form

ϱ_{12} = ϱ_{1} \otimes ϱ_{2}

, then we are in the separable case and can apply the results in 2.1 and 2.2 to each individual component, which can be pure or mixed. In case

ϱ_{12}

is entangled, then the partial trace always produces a mixed state. When we now consider frequencies, then the individual quantum systems might be single or composite, what is important is that they are temporally separable in order to allow for statistics.

2.3. Frequencies

The theory so far does only cover single trials. Assume there is a density-operator

ϱ \in D

and a complete set of projectors

{Π_{k}}_{k \leq M}

. To find probabilities for a sequence of different outcomes

k_{1}, \dots, k_{N},

of

N

experiments on

ϱ

(this is done on

N

identically prepared systems) we can apply Gleason’s theorem to the tensor product [5]

ϱ^{N} = ϱ \otimes \dots \otimes ϱ,

(10)

to get

p (k_{1}, \dots, k_{N}) = t r (ϱ^{N} Π_{k_{1}} \otimes ... \otimes Π_{k_{N}}) = p_{k_{1}}, \dots, p_{k_{N}},

(11)

with

p_{k} = t r (ϱ Π_{k}) .

(12)

So the outcomes of repeated measurements are identically and independently distributed (i.i.d.). The probability for outcome

k

to occur

n_{k}

times,

k \leq M

,

\sum_{k} n_{k} = N

, is given by the multinomial distribution

p (n_{1}, \dots, n_{M}) = (N! / n_{1}!, \dots, n_{M}!) p_{1}^{n_{1}}, \dots, p_{M}^{n_{M}} .

(13)

The individual counting functions

n_{k}

are binomially distributed and hence

E (n_{k}) = N p_{k}

. For large

N

the averages,

{\bar{n}}_{k}

, of the statistical counting functions approach the expectation values and therefore

{\bar{n}}_{k} / N \approx p_{k} .

(14)

The fact that

{\bar{n}}_{k} \to E (n_{k}), N \to \infty,

is due to the law of large numbers. The frequencies with their implied principle of indifference (14) indeed replicate the probabilities. This is achieved by a strong assumption in the theory, reflected in Equations (11), (13), and (14). It is the independence condition for the multi-trial states

ϱ^{N}, N \in ℕ

. Actually, it is itself a consequence of the assumption that agents have maximal information about a system of

N

copies of a quantum state [5]. Independence implies serial permutation-symmetry, i.e., the fact that it does not count in which sequence the results occur. So in the case of multiple-trials the theory uses a stronger assumption than permutation-symmetry to obtain the compatible frequency-probabilities (14). Can we weaken the assumption?

It is remarkable that, due to the (infinite) de-Finetti theorem [12], the assumption of independence can be weakened to the one of exchangeability, to still allow reasonable statistics. Exchangeability stands for permutation-symmetry of the joint distribution of

N

trials

X_{n}, n \leq N,

ϱ_{N} (X_{1}, \dots, X_{N}) = ϱ_{N} (X_{π (1)}, \dots, X_{π (N)}), π \in P e r (N),

and for consistency from step

N

to

N - 1

,

ϱ_{N - 1} = t r ϱ_{N}

. If satisfied, it can uniquely represent an

N

-trial state

ϱ_{N}

by an integral over product states of form (10) by means of a measure

μ

on

S (H)

ϱ_{N} = \int_{S (H)} μ (ϱ) ϱ^{N} .

(15)

(The measure

μ

belongs to the second category of probabilities.) For states

ϱ_{N}

of form (15) the statistical approach (14) works with some suitable adjustments (while the distributions are directly integral averages over the product state-distributions, there holds a law of large numbers only conditional to a suitable

σ

-algebra [13]). Whether we work with states of maximal information (10) or states of form (15), in any case permutation-symmetry and the principle of indifference are key features of the frequencies (14), derived in multiple-trials.

3. Conclusions

We have, in the exposition, not made use of any specific interpretation of quantum mechanics, but relied on the original formalism only. We have seen that it is possible to interpret all the probabilities in quantum physics as arising from permutation-symmetry and the principle of indifference, which amounts to maximum entropy. For the single trial, this simply means that at any single point in time we have a number of equiprobable states which can occur. In the case of multiple trials, any single state can occur equiprobably at a number of different points in time. Since permutation-symmetry is very natural and inherent in statistics, and since Laplace’s principle is a basic rational intuition, which underlies elementary combinatorics, we feel that both together are acceptable principles on which to base a theory of nature. So, given the projectors on Hilbert space as the model-structure, the assumptions of non-contextuality and independence/exchangeability lead together with the principle of indifference directly to both a formal and interpretative specification of the probabilities in quantum physics. The thus specified probabilities are real in as much, as they belong to a theory, in which agents and systems enter into a testable relationship. They are hence as much features of agents as they are of the physical systems.

Conflicts of Interest

The author declares no conflict of interest.

References

Lewis, P.J. Phenomena and Theory; Oxford University Press: Oxford, UK, 2016; pp. 22–24. [Google Scholar]
Bohm, D. A suggested interpretation of the quantum theory in terms of “hidden” variables. Phys. Rev. 1952, 85, 166–180. [Google Scholar] [CrossRef]
Lewis, P.J. Indeterminacy; Oxford University Press: Oxford, UK, 2016; pp. 72–105. [Google Scholar]
Born, M. Quantenmechanik der Stoßvorgänge. Z. Phys. 1926, 37, 863–867. (In German) [Google Scholar] [CrossRef]
Caves, M.C.; Fuchs, C.A.; Schack, R. Quantum probabilities as Baysien probabilities. Phys. Rev. A 2002, 65, 022305. [Google Scholar] [CrossRef]
Popper, K.R. The Propensity Interpretation of Probability. Br. J. Philos. Sci. 1959, 10, 25–42. [Google Scholar] [CrossRef]
Childers, T. Philosophy and Probability; Oxford University Press: Oxford, UK, 2013. [Google Scholar]
Kochen, S.; Specker, E. The Problem of Hidden Variables in Quantum Mechanics. J. Math. Mech. 1967, 17, 59–87. [Google Scholar] [CrossRef]
Mermin, N.D. What is quantum mechanics trying to tell us? Am. J. Phys. 1998, 66, 753–767. [Google Scholar] [CrossRef]
Gleason, A.M. Measures on the closed subspaces of a Hilbert space. J. Math. Mech. 1957, 6, 885–893. [Google Scholar] [CrossRef]
Zurek, W.H. Environment-assisted invariance, entanglement and probabilities in quantum physics. Phys. Rev. Lett. 2003, 90, 120404. [Google Scholar] [CrossRef] [PubMed]
Caves, M.C.; Fuchs, C.A.; Schack, R. Unknown Quantum States: The Quantum de Finetti Representation. J. Math. Phys. 2002, 49, 4537–4559. [Google Scholar] [CrossRef]
Kallenberg, O. Probabilistic Symmetries and Invariance Principles, 1st ed.; Springer: New York, NY, USA, 2005; p. 510. [Google Scholar]

© 2017 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Schlatter, A. Quantum Probabilities and Maximum Entropy. Entropy 2017, 19, 304. https://doi.org/10.3390/e19070304

AMA Style

Schlatter A. Quantum Probabilities and Maximum Entropy. Entropy. 2017; 19(7):304. https://doi.org/10.3390/e19070304

Chicago/Turabian Style

Schlatter, Andreas. 2017. "Quantum Probabilities and Maximum Entropy" Entropy 19, no. 7: 304. https://doi.org/10.3390/e19070304

APA Style

Schlatter, A. (2017). Quantum Probabilities and Maximum Entropy. Entropy, 19(7), 304. https://doi.org/10.3390/e19070304

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Quantum Probabilities and Maximum Entropy

Abstract

1. Introduction

2. Quantum Physics

2.1. Probabilities of Mixed States

2.2. Probabilities of Pure States

2.3. Frequencies

3. Conclusions

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI