1. Introduction
A measurement is a physical process like any other and, therefore, its analysis should cause no difficulties once a proper understanding of the relevant dynamics has been achieved [
1]. Nevertheless, the problem of quantum measurement has historically been a source of endless controversy. It is intimately associated with most of those features of quantum mechanics (QM) that make it so strange and fascinating (see, e.g., [
2,
3,
4,
5]). Does the quantum state reflect incomplete information or is it something real and ontic ? If the latter, can wave functions undergo a physical collapse during measurement? Alternatively, if no collapses ever occur, and wave functions always obey the linear Schrödinger equation, how could quantum measurements ever yield definite outcomes? How does one negotiate the interface between the microscopic quantum world and the macroscopic classical world of the measuring device? Do at least some privileged variables represent something real with definite values at all times? Or, alternatively, are the values of all observables created during the act of measurement? If so, how can one ever say that anything real exists when nobody is looking?
Our first goal here is to address the problem of measurement from the perspective of entropic dynamics (ED) [
6,
7]. The ED approach to QM is ideally suited to tackle the questions above because it is based on entropic and Bayesian methods of inference that have been designed to process information and data. (For a more detailed presentation, see [
8].) The success of the ED approach hinges on a clear ontological and epistemic commitment. In ED, the positions of particles enjoy the privileged role of being the only ontic variables, and all measurements are ultimately position measurements. In contrast, probabilities and wave functions are fully epistemic in nature.
Indeed, while the position of a particle can be measured directly, other generic observables are treated very differently; they are never actually “measured”. The system of interest is coupled to an ancillary system, and the “value” of a generic observable is only indirectly inferred from the observed position of the ancilla’s “pointer” variable. Thus, positions reflect real properties with definite values that are not created by the act of measurement and, as we shall see, all other observables turn out to be epistemic because they reflect the properties of the wave function [
9,
10]. This explains how it is that their values are “created” by the act of measurement.
Models such as ED that invoke ontic variables while the wave function remains an epistemic object are described as “realist -epistemic models”. In contrast, the various descendants of Bohr’s Copenhagen interpretation that deny a definite quantum reality are dubbed “anti-realist -epistemic models”, while models such as the de Broglie–Bohm and many-worlds models are called “realist -ontic models”.
A number of powerful no-go theorems exist—the so-called
-ontology theorems [
5]—that rule out large families of
-epistemic “ontological” models [
11,
12,
13] because they disagree with QM (e.g., [
11,
14,
15,
16,
17,
18,
19,
20,
21,
22,
23,
24,
25]. (The term ‘ontological models’ has been proposed as an improved way to refer to the old ‘hidden-variable models’. The new term recognizes that some “hidden” variables, such as positions, are observable and, therefore, not at all hidden.) These no-go theorems have been interpreted as strong evidence in favor of “realist
-ontic models”. However,
-epistemic models remain highly appealing, not least because they trivially explain the infamous wave function collapse as a mere updating of probabilities in the light of new data. Remarkably, fully developed realist
-epistemic models such as ED are scarce [
5]. To my knowledge, ED is the only such model that provides a detailed reconstruction of the formalism of QM and claims to reproduce not just a fragment of quantum phenomena but QM in its totality.
A second goal of this paper is to analyze how ED evades the consequences of -ontology theorems. Ontological models assume the existence of ontic variables. We shall argue that they also implicitly assume the existence of some ontic dynamics at the subquantum level. (The details of the dynamics need not be specified, and therein lies the power and generality of the -ontology theorems.) ED, on the other hand, makes a commitment to ontic variables while denying them ontic dynamics; ED is a purely epistemic dynamics of probabilities. There is no implication that particles move as they do because they are pushed around by other particles or guided by an ontic pilot wave. Wave functions guide our expectations about where particles might be found, but there is no mechanism that accounts for any causal influence on the particles themselves. ED is a mechanics without a mechanism.
Section 2 contains a brief overview of ED. In
Section 3, I discuss the direct measurement of microscopic positions, including their amplification to achieve observability at the macroscopic level [
9]. Then, in
Section 4, I discuss how the use of more elaborate devices allows us to define other non-position observables. I show how their “measurement” is ultimately reduced to the direct measurement of positions, and I derive the associated Born rule [
9,
26]. The special case of the von Neumann measurements provides an interesting extension [
10]. Thus far, these sections have reviewed our previous work on this subject. In
Section 5, I present new material that addresses the question of how ED manages to evade the various
-ontology theorems.
2. Brief Review of Entropic Dynamics
To set the context for the rest of the paper, we review the main ideas that form the foundation of entropic dynamics. For a detailed account, see [
6,
7,
8]. Here is a brief summary:
Ontological clarity: Particles have definite but unknown positions collectively denoted by x. These are the ontic microstates. (A is a composite index, , where n labels the particles and the three spatial coordinates.) The particles follow continuous trajectories and the goal is to predict the probability of the positions x on the basis of some limited information.
ED is a dynamics of probabilities: The probability of a step from x to a neighboring , , is found by maximizing its entropy relative to a prior that enforces short steps and subject to appropriate constraints that introduce directionality and correlations. The main constraint involves a function that plays three separate roles: first, it is related to a constraint in the maximization of entropy; second, if the probabilities are considered as generalized coordinates, then is the momentum that is canonically conjugate to them; and third, is the phase of the quantum wave function, .
Entropic time: The epistemic dynamics of probabilities inevitably lead to an epistemic notion of time. The construction of time involves the introduction of the concept of an instant, the notion that the instants are suitably ordered, and a convenient definition of duration. By its very construction, there is a natural arrow of entropic time.
The evolution of probabilities is found by the accumulation of the short steps described by
. This results in a continuity equation that is local in configuration space but leads to non-local correlations in physical space,
Notation: ; is the mass tensor, are the particle masses, and is the inverse mass tensor.)
Symplectic structure: For a suitable choice of a functional
, the continuity Equation (
1) can be written in Hamiltonian form,
which suggests choosing
as a pair of canonically conjugate variables. The epistemic phase space (or e-phase space)
has a natural symplectic structure with symplectic two-form
.
Information geometry: The e-phase space is assigned a metric structure with metric tensor
G based on the information metric of the statistical manifold
of probabilities
. The joint presence of symplectic and metric structures implies the existence of a complex structure and suggests the introduction of wave functions
as complex coordinates. (For a discussion of the subtleties concerning the correct choice of the spaces that are cotangent to the manifold
and of the metric structure associated with e-phase space
, see [
7].)
The epistemic dynamics that preserve the symplectic structure in the sense of vanishing Lie derivative,
, obey Hamilton’s equations,
If we further require the preservation of the metric structure,
, and of the normalization of probabilities we find that
is constrained to be bilinear in
and
,
which implies that (
3) can be rewritten as a
linear Schrödinger equation,
The particular form of the Hamiltonian kernel
is determined by requiring that it reproduce the ED evolution of probabilities, Equation (
1). In standard notation we find
Entropic dynamics are the purely epistemic dynamics of or, equivalently, of ; there is no underlying ontic dynamics of x. Compared to other models of QM, ED is fairly conservative in that it confers ontic status to configurational variables such as position and a clear epistemic status to probabilities, phases, and wave functions. However, ED is radically non-classical in that it denies the ontic status of dynamics and of all observables (energy, momentum, etc.) except position.
Hilbert space: To conclude the reconstruction of QM, we can take full advantage of the linearity of (
6) and introduce Hilbert spaces and the Dirac notation:
3. Measuring Position: Amplification
All measurements are position measurements. The measurement of the position of a microscopic particle is conceptually straightforward because the particle already has a definite position x. The issue of inferring x is not different from the way data information is handled in any other Bayesian inference problem. There is, however, the technical problem of amplifying microscopic details so they can become macroscopically observable. This is usually handled with a detection device set up in an initial unstable equilibrium. For example, QM allows us to calculate the probability that a particle at x will ionize a neighboring atom located at a. In a bubble chamber, the ionized atom will trigger the formation of a bubble centered at a; in a photographic emulsion, the ion will trigger the formation of a silver crystallite centered at a. More generally, the particle activates the amplifying system by inducing a cascade reaction that leaves the amplifier in a definite macroscopic final state described by some “pointer” variable a.
The goal of the amplification process is to allow us to infer the microscopic position
x from the observed macroscopic position
a of the pointer variable. Incidentally, the latter is just a classical variable in the sense that its dynamics can, to an excellent approximation, be described by the classical limit of quantum theory [
27]. Once the likelihood function
is given, the value
x can be inferred following a standard application of Bayes rule,
In practice, life is more complicated, and the likelihood function will be distorted and smeared by spurious correlations and noise. Successful measurement always involves, of course, a skilled experimentalist who will design the device so that those unwanted effects will be minimized and controlled.
The point of these considerations is to emphasize that the measurement of position is conceptually straightforward and that there is nothing intrinsically quantum mechanical about the amplification process.
4. “Measuring” Other Observables and the Born Rule
Position is easy because it is an ontic quantity. Next, we tackle observables other than position: how they are defined and how they are measured [
9,
26]. For notational convenience, we initially consider the case of a single particle that lives on a lattice; the measurement of its position leads to a discrete set
of possible outcomes. The translation from continuous to discrete positions is straightforward,
and
If the state is
Since position is the only ontic quantity, it is not strictly necessary to define other observables except that they turn out to be convenient to discuss more complex experiments in which the particle is subjected to additional interactions, such as magnetic fields or diffraction gratings, before it reaches the position detectors.
The fact that measurements are dynamical processes means that the interactions within a complex measurement device
are described by a linear and unitary evolution
given by a Hamiltonian
. The particle will be detected with certainty at position
provided it was initially in a state
such that
Since the set
is orthonormal and complete, the corresponding set
is also orthonormal and complete,
To find the effect of the complex device
on a generic (normalized) initial state
, we express it in the
basis,
where
. The state
evolves through
according to
so that when it reaches the actual position detectors the new state is
According to the Born rule for position measurements, Equation (
11), the probability of finding the particle at the position
is
In words: The probability that the particle in the initial epistemic state
is later found at position
is
, and the (suitably amplified) position
plays the role of a discrete pointer variable. (The generalization to continuous variables spaces is straightforward [
26]). The argument above illustrates the main idea, but it also raises the inevitable question:
What, if anything, has been “measured” here? [
1]. The answer is tricky.
Note that the particle in the initial epistemic state has been detected in an ontic state as if it had earlier been in the epistemic state. This process is usually described in a slightly different language that unfortunately obscures the distinction between the ontic nature of and the epistemic nature of (or, more generally, of the amplitude ). It is said that the particle is detected in stateas if it had earlier been in the state.One can further obscure the language by de-emphasizing the inner workings of the complex device, forgetting the dynamics and treating the detector as a black box. The result is a more concise but more misleading statement: the particle has been “detected” in the state. Continuing along the same lines leads us to adopt the language that is standard in QM textbooks: the probability that the particle in stateis “detected” in stateis, which reproduces Born’s rule for a generic measurement device . However, by now, the real meaning of what has been ‘detected’ lies buried deep underground.
The same complex detector
can be used to “measure” all operators of the form
where the eigenvalues
are arbitrary (possibly complex!) scalars. Note that when we say we have detected the particle at
as if it had earlier been in state
, we are absolutely not implying that the particle
was in the particular epistemic state
—this is just a figure of speech. The actual epistemic state was
not
. When the system is “detected in
” the standard language is that the outcome of the measurement is the eigenvalue
, which establishes the eigenvector-eigenvalue connection. It is then clear that the “outcome”
was not a pre-existing value, and it is in this sense that one says that the value
was “created by the act of measurement”.
This point deserves to be stated more explicitly: sentences such as “the particle has momentum ” or “the particle has energy E” are to be recognized as mere linguistic shortcuts that convey information about various components of the wave function before the particle enters the complex detector. Therefore, strictly speaking, there is no such thing as the momentum or the energy of the particle: the momentum and the energy are not properties of the particle but properties of special epistemic states.
In the standard language one refers to the operator
as representing an “observable” and it is common to attribute to its eigenvalues and eigenvectors the status of being ontic—that is, of having actual physical properties. This is not mere abuse of language; in the ED framework, it is just plain wrong. Since what one is actually doing is inferring properties of the wave function from measurements of position, a more appropriate terminology would be to refer to
as an “inferable” [
10].
To summarize: In the standard interpretation of quantum mechanics, Born’s rule for generic measurements is a postulate. In ED, it is the natural consequence of unitary time evolution and the hypothesis that all measurements are ultimately position measurements.
An Illustration: Von Neumann Measurements
So far, we have just discussed measurements that rely on the direct detection of the position of the particle (and its subsequent amplification). One can substantially enlarge the class of useful experiments by considering complex setups in which one infers properties of one system indirectly by measuring the position of another system—the pointer variable—with which the system has interacted. Nothing in this section is original material; it is included merely as a purely pedagogical illustration of the fact that all measurements are position measurements.
The system of interest is composed of one or many particles; its ontic state is
, and its epistemic state is
. The pointer device is also a particle; its ontic state is
X, and its epistemic state is
. The interaction between the system and the pointer is modelled by the Hamiltonian
where
is the operator to be “measured”, and
is the operator that generates translations of the pointer states,
The function
measures the strength of the interaction. We make the usual assumptions: (a) that
; (b) that
vanishes before and after the measurement; and (c) that while the measurement lasts,
is large enough that
is a good approximation to the full Hamiltonian.
The pointer is set to its initial “ready” position near
with some uncertainty
,
where
is a normalization constant. The initial state of the system is
. As a result of the interaction, the system and pointer evolve according to
and become entangled. Using (
18)–(
21), we find
which shows that the probability of the pointer position
X has been shifted from an initial Gaussian centered at
to a final mixture of Gaussians centered at
,
When
is small and the Gaussian distributions are neatly resolved, we have a “strong” or “von Neumann” measurement. The conclusion is that measuring the final pointer position
allows us to infer the eigenvalue
. (See the
Appendix A for more details.)
When the Gaussian distributions overlap significantly, the measurement is said to be “weak”. Such weak measurements do not allow us to infer the eigenvalues
, but they can nevertheless still be useful because they allow us to infer other quantities such as the phase or even the wave function itself (for more on this see [
10] and references therein).
5. Evading the No-Go Theorems
The no-go theorems that rule out large families of realistic
-epistemic models are formulated in a framework of “ontological models” that originates with Bell [
11]. The idea is that prior to an actual measurement, the system undergoes some sort of preparation procedure
, and the result of the measurement
is to yield outcomes labeled
k (inferred from either
or
in the previous section). The goal is to find the probability
of an outcome
k for given
and
.
In realist models, the ontic state of the system is denoted by variables
. In ED, for example,
consists of the particle and pointer positions
x and
X, while in the de Broglie–Bohm model,
consists of both
x,
X, and
. It is assumed that the preparation procedure
may determine
completely, but it need not yield complete control over
—
, it only determines the probability distribution,
. Thus, as the system enters the measuring device
, we are not only uncertain about the future outcome
k but also of the initial values
just
before . This means that the relevant probability to be discussed is the
joint distribution
, and the probability of the outcome
k is given by marginalizing over
,
So far, we have just used the rules of probability theory, which, being of universal applicability, also apply to QM. The desired goal is to find realist models such that the distribution on the right of (24) matches the predictions of QM such as Equation (
16).
To proceed further, we write (
24) as
and consider the two factors on the left separately. Concerning the first factor,
, we shall assume that the distribution of
is settled by the earlier choice of preparation
and is independent of whatever choice one might
later make for the measurement device
,
This is a statement of causality [
14]: conditional on
,
is independent of the later choice of
.
The crucial assumption that defines what [
12,
13] call an “ontological model” concerns the second factor, the response function
. The assumption is that the distribution of outcomes depends only on the ontic state
after the preparation
but
before the device
and on the actual measurement performed,
This assumption does not necessarily violate QM. For example, in the de Broglie–Bohm model,
,
which states that, conditional on the information about
that is codified into
, all
other details about
not already conveyed by
are irrelevant.
However, in a
-epistemic ontological model
is not included in
. The assumption there is that conditional on
, for any choice of
, the outcome
k is
completely independent of all details about , including any information that might be codified into the epistemic
.
It is these -epistemic ontological models that are shown by all the no-go theorems [
5,
11,
12,
13,
14,
15,
16,
17,
18,
19,
20,
21,
22,
23,
24,
25]
to disagree with QM.
ED satisfies the causality assumption (
26) but violates (
27), and, therefore,
ED is not an ontological model in the sense of [
11,
12,
13], which makes ED immune to all the no-go theorems. More explicitly, the situation with ED is the following: the ontic variables are positions of particles and/or pointers,
, and the information about the preparation procedure
is fully conveyed by the wave function
. Then, the causality assumption, Equation (
26), reads
where
are position values
before entering the device
. Next, since ED is a purely epistemic dynamics,
conditional on the epistemic wave function the distribution of k is independent of the ontic variables —the latter have no causal influence on the future outcome
k. Thus, instead of (
27), in ED the response function
is
Substituting (
29) and (
30) into (
25) yields
which agrees with the probability predicted by quantum mechanics. (See the
Appendix A for further details.)
It is worth emphasizing the difference between response functions in ontological models, Equation (
27), and in ED, Equation (30): ontological models assume that all the information about the preparation that is relevant for the measurement is carried by the ontic variables
. This means that lurking in the background there is an implicit assumption that ontic dynamics exist that relate the
earlier values of
as the system enters
to the
later values
that result in the outcome
k. The power of the no-go theorems lies in the fact that the details of the ontic dynamics remain unspecified—the ontic dynamics could be deterministic or stochastic, they could be local or non-local, and so on—but they assume that some ontic dynamics must exist.
ED, on the other hand, makes a commitment to ontic states without making the associated commitment to an ontic dynamics; only the epistemic variables
appear on the right hand side of (
30). In ED, the epistemic wave functions do not guide the particles; they only guide our expectations about where particles might be found.
ED represents epistemic mechanics without an ontic mechanism.
Remark 1. Beyond the general framework of ontological models, the various no-go theorems depend on additional assumptions. It is these additional assumptions that have been offered as possible loopholes (see, e.g., [5]). We emphasize that ED by-passes the issue of additional assumptions and evades the ontological models framework at the deeper level of the epistemic vs. ontic nature of the dynamics. Remark 2. In [13], ψ-epistemic models are defined as those that fail to be ψ-ontic. This definition would in principle allow ψ-epistemic models based on exotic probability theories [5]. The ED approach shows that such generalizations are not necessary; there is no need for exotic or even quantum probabilities, for quantum logic, for excess ontological baggage, or for retrocausality. ED is ψ-epistemic in that is directly related to the probabilities , that is, to knowledge and beliefs, and to the conjugate momenta that codify the information that updates those probabilities. Remark 3. The ideas above could have been formulated in a more general setting of preparations that only determine a density matrix and measurements that are described by positive operator valued measures (POVMs). However, such increased generality has no effect on the conclusions and would only serve to obscure the main ideas.