Experimental Test of the “Special State” Theory of Quantum Measurement

Schulman, Lawrence S.

doi:10.3390/e14040665

Open AccessArticle

Experimental Test of the “Special State” Theory of Quantum Measurement

by

Lawrence S. Schulman

Physics Department, Clarkson University, Potsdam, New York, NY 13699, USA

Entropy 2012, 14(4), 665-686; https://doi.org/10.3390/e14040665

Submission received: 25 December 2011 / Revised: 21 March 2012 / Accepted: 26 March 2012 / Published: 2 April 2012

(This article belongs to the Special Issue Arrow of Time)

Download

Browse Figures

Versions Notes

Abstract

:

An experimental test of the “special state” theory of quantum measurement is proposed. It should be feasible with present-day laboratory equipment and involves a slightly elaborated Stern–Gerlach setup. The “special state” theory is conservative with respect to quantum mechanics, but radical with respect to statistical mechanics, in particular regarding the arrow of time. In this article background material is given on both quantum measurement and statistical mechanics aspects. For example, it is shown that future boundary conditions would not contradict experience, indicating that the fundamental equal-a-priori-probability assumption at the foundations of statistical mechanics is far too strong (since future conditioning reduces the class of allowed states). The test is based on a feature of this theory that was found necessary in order to recover standard (Born) probabilities in quantum measurements. Specifically, certain systems should have “noise” whose amplitude follows the long-tailed Cauchy distribution. This distribution is marked by the occasional occurrence of extremely large signals as well as a non-self-averaging property. The proposed test is a variant of the Stern–Gerlach experiment in which protocols are devised, some of which will require the presence of this noise, some of which will not. The likely observational schemes would involve the distinction between detection and non-detection of that “noise”. The signal to be detected (or not) would be either single photons or electric fields (and related excitations) in the neighborhood of the ends of the magnets.

Keywords:

special states; tests of quantum mechanics; retrocausality; Cauchy distribution; arrow of time

PACS Classification:

03.65.Ta; 05.40.Fb; 05.90.+m; 42.50.Xa

1. Introduction

This article has three components. The first two are background for the third, which proposes—in some level of detail—an experimental test for the ideas propounded in the earlier sections. The components are

A theory of quantum measurement that is conservative: there is only unitary time evolution. There is no wave function collapse, there is no need for “many world” concepts, and the wave function is not merely a construct for calculating probabilities.
A modification of statistical mechanics that is radical but which contradicts no experience or experiment. Because of future conditioning, many initial conditions are excluded, contrary to standard statistical mechanics. In particular there is a form of conditioning that can be used to motivate the quantum ideas. A rationale for this future conditioning introduces cosmological considerations.
A modified Stern–Gerlach experiment in which physical phenomena not predicted by the Copenhagen interpretation would occur. In particular the test would be conducted using two protocols, in one of which there would be an observable signal, in the other there would not. That signal could be the emission of photons in the eV range or the appearance of electric fields (and related effects) near the ends of the magnets.

With respect to the background material, there will be frequent reference to [1] and indirectly to the many citations therein. After publishing [1], I did not much work on this problem. Although there are many open theoretical issues, I felt that there could only be progress if these ideas were tested experimentally. If they passed, there would be no shortage of thought devoted to them.

What has rejuvenated my interest is the possibility of an experiment that could confirm features of this theory. The trail to the practical suggestion contained in the present article began with the implementation in [2] of the Wheeler delayed-choice experiment [3,4]. Although Wheeler made his prediction entirely within the framework of the Copenhagen interpretation, there is an apparent inversion of causality that suggested the kind of tightly interconnected future and past that characterizes my own work. In conference presentations in which I reported preliminary ideas on this subject [5,6] I focused on that experiment; but I later realized that the essential physical feature that would allow the experimental test did not depend on the “delayed” part of the story. This makes the experiment much easier.

The sections of this article follow the enumeration given above, quantum mechanics (Section 2), statistical mechanics (Section 3), experiment (Section 4). Section 5 is a discussion.

2. Quantum Mechanics

This is a brief and selective summary of my quantum measurement ideas, based on the central notion of “special states” (henceforth mostly sans quotation marks).

Consider the Schrödinger cat. For this unfortunate feline, if the trigger of the device aimed at it depended on, say, an atomic decay, the probability of a living cat would be the non-decay probability, say 1/2, for the time interval set for the “experiment”. I will now give an example where—if you could prepare the microscopic state of the apparatus—you could keep the cat alive.

First consider the formalism for ordinary decay. A single level decays, emitting a photon. For a finite-time context there will be a band of energies into which it can decay, and this is modeled as a finite number, N, of narrowly spaced levels (so

N ≫ 1

). A Hamiltonian for this system is

H = (\begin{matrix} ω & ϕ \\ ϕ^{†} & Ω \end{matrix})

(1)

where

ω \in R

,

ϕ \in C^{N}

,

N ≫ 1

, and Ω is a real, diagonal

N \times N

matrix. The wave function, ψ, is an

(N + 1)

-row column vector and initially has 1 in its first entry, zeros elsewhere. The survival probability is

S (t) \equiv {| 〈 ψ (0) | exp (- i H t / ℏ) | ψ (0) 〉 |}^{2}

. A numerical calculation of this quantity provides the graphs of Figure 1. The semilog plot shows that the decay is close to exponential until

t \approx 300

, at which point a (quantum) Poincaré recurrence sets in, due to the finiteness of N (100 for this calculation). I also show the early-time quantum Zeno effect, manifested as initial non-exponential decay. The time interval during which this is significant matches well to the “Zeno time” that I proposed in [7,8,9].

Figure 1. Normal decay. “N” (the size of Ω in Equation (1)) is 100, and at about time-300 there is a Poincarè recurrence due to this finite dimension. The semilog plot shows excellent exponential decay up until then. On the right is early-time non-exponential decay (note the shorter times plotted), related to the so-called quantum Zeno effect. The calculated “Zeno time”,

τ_{Zeno} \equiv \frac{ℏ}{\sqrt{〈 ψ | H^{2} | ψ 〉 - {〈 ψ | H | ψ 〉}^{2}}}

, is about 7.

Figure 1. Normal decay. “N” (the size of Ω in Equation (1)) is 100, and at about time-300 there is a Poincarè recurrence due to this finite dimension. The semilog plot shows excellent exponential decay up until then. On the right is early-time non-exponential decay (note the shorter times plotted), related to the so-called quantum Zeno effect. The calculated “Zeno time”,

τ_{Zeno} \equiv \frac{ℏ}{\sqrt{〈 ψ | H^{2} | ψ 〉 - {〈 ψ | H | ψ 〉}^{2}}}

, is about 7.

We next suppose that the decaying atom is one of many, all of which have essentially the same matrix elements for decay with photon emission. The number atoms (and the number of associated levels) is n, and we assume

N ≫ n ≫ 1

. This more general Hamiltonian is again given by Equation (1), but the meaning of the symbols has changed. Now ω is an

n \times n

matrix, constant (all the atoms are the same), and diagonal. The coupling, ϕ, is now a rectangular

n \times N

complex (in general) matrix, while Ω is as before.

The atoms are assumed close enough and steady enough to interact coherently and their net excitation number is one; hence the wave function has

N + n

components, and the initial condition (non-decay) requires that all non-zero elements of the initial wave function lie in the first n entries. The resulting decay [10] is remarkable and is shown in Figure 2. The average decay is shown in the solid (essentially) straight line (black in color). This is relatively normal, although the linearity (as opposed to exponential decay) is due to particular circumstances. But what are not normal are the blue (dashed) and red (dash-dot) curves.

Figure 2. Decay from a collection of n similar levels. The non-solid lines show the special states, which take values close to 0 and 1 at the selected time, 16.

Those curves require explanation. First a time,

t_{0}

, is picked. In this case, it is 16 (shown as a vertical green line in the figure). Then, by a method described in Appendix A, I find two special classes of states: those for which

S (t_{0}) \approx 0

and those for which

S (t_{0}) \approx 1

. That appendix tells you how to find the initial conditions, the

ψ (0)

’s, that lead to the all-decayed or all-not-decayed states at time-

t_{0}

. (That there are such states is equivalent to my demand for special states, as we shall see.) The blue (dash-dot) curve in the figure is the time-dependence of an all-not-decayed-at-

t_{0}

state. At

t_{0}

it is essentially still in the initial subspace of undecayed states. The red (dash) curve is the time dependence of one of the other class of states, those that are nearly fully decayed at time-

t_{0}

. For both, after

t_{0}

there is no 0-1 requirement, although by continuity they do not change radically. (Regarding imperfect attainment of 0 or 1, see [1].)

2.1. Use of the Special State

We return to the Schrödinger cat. Suppose the gun fires in response to a system of atoms of the sort described above. Let the full wave function at time zero be Ψ; this includes the cat, the cage, the weapon, the atom that triggers the weapon—everything! Let the Hamiltonian for all this be

H_{total}

and let the time at which we “look to see if the cat’s alive” be

t_{0}

. In some reasonable approximation, the wave function can be written

Ψ = Θ A Φ

with Φ the cat wave function, A the wave function of the atoms (and their decay products) and Θ referring to everything else. We suppose that the time

t_{0}

is such that there is a 50-50 chance that the cat is alive. Letting

U (t_{0}) \equiv exp (- i H_{total} t_{0} / ℏ)

, this situation can be schematically written

Ψ (t_{0}) = U (t_{0}) Ψ (0) = α Θ_{ℓ} (t_{0}) A_{ℓ} (t_{0}) Φ_{ℓ} (t_{0}) + β Θ_{d} (t_{0}) A_{d} (t_{0}) Φ_{d} (t_{0})

(2)

(ℓ and d are “living” and “dead”) with

{| α |}^{2} \approx {| β |}^{2} \approx 1 / 2

(the Θ’s A’s and Φ’s are normalized to 1). This state is what Griffiths [11] has called “grotesque”, a superposition of macroscopically different states. Given the decay example just discussed, it is clear how to keep the cat—definitely—alive. Start the atom wave function in one of the blue, non-decaying, states shown in Figure 2, call it

A^{'} (0)

. This means that there is a state of the whole thing, call it

Ψ^{'} (0)

, such that

Ψ (t_{0}) = U (t_{0}) Ψ^{'} (0) = U \{Θ (0) A^{'} (0) Φ (0)\} = Θ_{ℓ} (t_{0}) A_{ℓ} (t_{0}) Φ_{ℓ} (t_{0})

(3)

Similarly, there are red—full decay—states (call them

A^{''}

) with corresponding

Ψ^{''}

such that

Ψ (t_{0}) = U (t_{0}) Ψ^{''} (0) = U \{Θ (0) A^{''} (0) Φ (0)\} = Θ_{d} (t_{0}) A_{d} (t_{0}) Φ_{d} (t_{0})

(4)

We have thus obtained definite, non-grotesque, states without any black magic of “measurement”; the only thing that happens is pure, unitary time evolution.

2.2. The Assumption Concerning Special States in Nature

The major assumption concerning special states is that in every situation in which there might emerge grotesque states (and this goes beyond human laboratory experiments) the initial conditions are special.

This assumption implicitly makes two claims. The first is less radical: there are enough special states to do the job. Within any apparatus capable of creating grotesqueness there are enough degrees of freedom for certain rare states to give definite (non-grotesque) outcomes. I cannot verify this in general but I have explored many models of apparatus and found special states for all of them. Interestingly, circa 1990—after 60 years of debate—there was not a single apparatus model that I could find [12] that was realistic enough to be used to address this question. It was for this reason that Bernard Gaveau and I developed a quantum apparatus model [13] in which I could subsequently find special states.

The other claim is truly radical. It places a restriction on initial conditions. The fundamental axiom of statistical mechanics states that, given a macroscopic description of a system, the microscopic states associated with it are all those consistent with the macroscopic description.

I say, no, you do not take all states, you only select very particular ones, those that are what I call special, namely those that do not lead to grotesqueness.

To provide perspective on this claim I will discuss the arrow of time, since connected to that notion there is also a tremendous elimination of microscopic states. From that discussion will emerge a context in which justification of my selection claim can be imagined.

However, this does not exhaust the tasks imposed for the recovery of the standard results of quantum measurements. In particular, the existence of special states does not by itself give the Born probabilities. It is this requirement—on which we next focus—that leads to the experimental test proposed in this article.

2.3. Recovering Probabilities

In classical mechanics if you know an initial phase space point, the outcome at any later time is certain. You use probability when there are many initial points consistent with the information you have (so probabilities other than zero or one mean your information is incomplete) [14]. In this case the probability of any particular outcome is proportional to the volume of phase space that leads to that outcome. This can be considered a corollary of the arrow of time definition given earlier. (For working backward—retrodiction—Bayesian rules enter, but that is another story [15].) The quantum version of this replaces volumes of phase space by the dimensions of subspaces of Hilbert space and also—it is said—introduces another kind of probability, one that supposedly is intrinsic.

For the ideas expressed earlier using special states there is no additional layer of probability. The collection of special states for any particular outcome forms a vector space, and my postulate is that the probability of a given outcome is proportional to the dimension of the associated vector space of special states. This is a bold postulate, since the usual Born probabilities depend little on the apparatus and are computed from the wave function of the system being measured. On the other hand, I require identification of the special states of system and apparatus combined and a counting of (vector space dimension of) those special states. I have not managed to check this even in some of the apparatus models where I have succeeded in finding special states. I believe the reason is that the special states I have solved for are atypical; after all, one should not expect solvability to be an attribute of a real measurement apparatus.

However, the idea that I could exhibit and count special states is an optimistic one. Some years ago I took the opposite view and in a fit of pessimism said, suppose I could have any special states I wanted, what constraints on their distribution would I have, and—maybe—those constraints would mean the whole idea was wrong.

In the following discussion I will make the assumption that Nature, the environment, the apparatus, even parts of the system being measured, can provide the rare microstates needed. Moreover, there will be many for each outcome [16]. To see how this assumption is implemented and the constraints it imposes, it will be useful to focus on a particular experiment.

Consider a Stern–Gerlach experiment measuring the z-component of an atom’s spin. For an atom having net spin 1/2, let the prepared wave function be

u_{θ} = e^{i θ σ_{x} / 2} (\binom{1}{0})

(5)

with

σ_{x}

the Pauli spin matrix. Only two outcomes are possible, designated down and up. Their detection involves a hot wire detector downstream from the magnet supplying the inhomogeneous field that induces the measurement (coupling spin and translational degrees of freedom). The standard prediction is that they come with the ratio

{tan}^{2} \frac{θ}{2} = \frac{{sin}^{2} θ / 2}{{cos}^{2} θ / 2}

(6)

Now a special state that will send the atom to the right place for, say, a down measurement, may be rare, but we need to look for the least rare among all those that could do the job. These least-rare states will presumably be unusual states of the environment, but let us consider where, spatially, that rarity will be manifested. After the atom has passed through the magnet it would be necessary to coherently recombine the spatially separate portions of the wave function, while if a rare environmental state were available prior to the atom’s deflection by the magnetic field it would only have to rotate the spin by, say,

\frac{π}{2} - θ

. So I will assume that the least unlikely states are those that act on the spin wave function in the following way:

u_{θ} = e^{i θ σ_{x} / 2} (\binom{1}{0}) \to e^{i ψ σ_{x} / 2} e^{i θ σ_{x} / 2} (\binom{1}{0}) = e^{i (ψ + θ) σ_{x} / 2} (\binom{1}{0})

(7)

I am about to make a slight shift in perspective. Instead of counting actual microstates of the environment I will sort them by their effect, namely by the size, ψ, of the rotation they can induce on the wave function. Suppose that there are

f (ψ)

states that can rotate by angle ψ. Without loss of generality for what follows we can normalize the function f so its integral over ψ is 1. In any given experiment the net result of all these special states will be a rotation by

\sum_{α = 1}^{N} ψ_{α}

if the spin is subject to N such rotations/kicks/special states along its path.

Dealing with this observation almost led me to abandon my quantum measurement ideas. If you imagine that a large number of “kicks” (rotations of the sort discussed) are necessary then one would expect to be able to use the central limit theorem, in which case the relative ratios for getting up or down would be a ratio of Gaussians, not the tangent-squared function given earlier.

It turns out that this problem has a solution and its solution is a key to the experimental test that I here propose. First, drop the assumption that you can use the central limit theorem, i.e., we do not assume that the function f has a second moment. This leads us into the world of the Lévy distributions, with many peculiar properties, as we shall see. Let us assume that whatever happens to our spin must happen in a single “kick”. It follows that the function f must satisfy

{tan}^{2} \frac{θ}{2} = \frac{F (θ + π)}{F (θ)}

(8)

where

F (θ) \equiv \sum_{k = - \infty}^{\infty} f (θ + 2 k π)

. (Note that

F (θ)

gives all ways of getting

u_{θ}

to become up, and

F (π - θ)

gives the ways to become down. The solution to this functional equation is

f (ψ) = C_{a} (ψ), with C_{a} (x) = \frac{a / π}{x^{2} + a^{2}}

(9)

for a small. So you can do it! (And, this distribution has the property that if the sum of n samples drawn from it is far larger than

n a

, the least unlikely way to do this is a single large kick, all the others much smaller. For the Gaussian they would all be about the same size.)

For further discussion of the function,

C_{a}

, the Cauchy distribution, see [1], as well as many books on probability theory, e.g., [17,18].

As to the parameter a, if it is too large, deviations from standard probabilities will be observed, but since I do not know where this noise is coming from I cannot use this for experimental predictions. I also mention that the demonstration above can be extended to many dimensional choices, not just spin-1/2 and not just spin. See [1].

Finally, I remind the reader that the arguments for the Cauchy distribution were based on a “fit of pessimism”. What this means is that this provides a way to disprove the theory, rather than prove it. I despair of counting special states in all possible models, and only try to establish statistics on the effects of those states, what I call “kicks”. If these are not found, that’s that. But not to put too negative a light on this work, it should also be pointed out that nothing in the Copenhagen corpus has any hint of this peculiar noise distribution, so a positive experimental result would, at least to my mind, have more content than simply the lack of disproof.

3. Statistical Mechanics

3.1. The Arrow of Time

The usual statement of the thermodynamic arrow of time is that entropy increases, that is, it increases in one time direction, not the other. Alternative ways of saying this exist, for example the impossibility of converting heat to work. I will give another formulation, one that focuses on assumptions on microscopic states. How does one predict? If you isolate a glass of water containing ice cubes at 2 p.m., your prediction on its form at 3 p.m. is based on assuming that all microscopic states consistent with what you see at 2 p.m. are equally likely. In principle you evolve these forward in time and average, with the vast majority of microstates giving smaller ice cubes, colder water. If this system had been isolated since 1 p.m. your estimate of its 1 p.m. state would be based on an entirely different method. Using your 2 p.m. information, you make a guess about what it might have been at 1 p.m. and evolve that forward (as you did from 2 to 3 p.m.). If it fits what you see at 2 p.m., then it’s a possible 1 p.m. state. (Had you propagated back all the 2 p.m. microstates, you would find smaller ice cubes, colder water at 1 p.m., which contradicts experience.) These different rules are an alternative statement of the arrow of time.

Now consider what you have implied about the 2 p.m. microstates. If you view them as initial conditions, everything goes, all of them are OK. But if you view them as having evolved from an earlier condition, macroscopically specified, then almost all of them are rejected. How do I know this? I can appeal to the usual formulation, the increase of entropy. The number of microstates is given by

exp (S / k_{B})

with S the entropy. Lower entropy at the earlier time means fewer states, and if I plug in plausible numbers for water and ice in a normal size glass, you will find that the rarity of the 2 p.m. microstates, considered as final states, is astounding, numbers like one in

10^{10^{24}}

.

3.2. The Cat Map

Another example illustrates how a selection of initial microstates can take place. Consider the “cat map”, an area-preserving transformation of the unit square that has served as a model of equilibration [19]. The mapping is

\begin{matrix} x^{'} & \equiv x + y, & mod 1 \end{matrix}

(10)

\begin{matrix} y^{'} & \equiv x + 2 y, & mod 1 \end{matrix}

(11)

A collection of points (thought of as ideal gas particles) starting out in a small region of the square will rapidly spread throughout. The equilibration can be quantified by coarse graining the unit square and replacing a microscopic specification (giving the exact position of each point) by simply listing the number of points in each grain. The entropy is then defined as

S \equiv - \sum p_{α} log p_{α}

with α labeling the equal area (by construction) grains and

p_{α} = n_{α} / n

with

n_{α}

the number of points in grain-α and n the total number. The expansion of such a gas is illustrated in Figure 3 and the associated entropy increase shown in Figure 4. Next I show the continuation of the time evolution. Figure 5 shows the evolution of the same points for later times, and the associated entropy dependence appears in Figure 6.

Figure 3. Times 0, 1, 2, 4, 5, 8 in the evolution of a gas of 250 particles under cat-map dynamics.

Figure 4. Entropy as a function of time for the expanding gas of Figure 3.

Figure 5. Times 11, 14, 15, 17, 18, 19 in the evolution of a gas of 250 particles under cat-map dynamics.

As you can see, something funny is going on. Instead of having the particle locations continue to fluctuate they come back together and the entropy decreases. This is not the result of a lucky Poincarè recurrence, which would only occur after the order of

50^{250}

time steps. Rather, I solved a two-time boundary value problem, finding points, all of which were gathered in a single box, at times separated by 19 time steps. This means that the point locations at time-0 were not at all random, even though they appear to be. They have a cryptic constraint. This constraint is mild by earlier standards, ruling out a mere 98% of all points rather than

1 - 1 / 10^{10^{24}}

. Another important point, illustrated in Figure 7, is that the initial behavior of the macroscopic quantity, entropy, is the same with or without the cryptic constraint (that figure is for another simulation in which 100 coarse grains were used).

Figure 6. Continuation of entropy as a function of time, including the contracting segment, as shown in Figure 5.

Figure 7. Entropy as a function of time for a cat-map simulation with a cryptic constraint at time 19 and for a simulation with no cryptic constraint. 100 coarse grains are used. Note that there is essentially no difference in the initial behavior.

The conclusions I draw from this example are as follows. With future boundary conditions you can restrict the set of initial conditions. Moreover, you cannot tell the difference. The implication is that the usual axiom of statistical mechanics, equal probability for all microstates, is far stronger than is justified by experience.

3.3. Special States and Determinism

An aspect that I will not dwell on is the total interconnectedness and determinism of the universe. The experiment you plan to do is not arbitrary, but is built into the initial conditions, initial conditions not only within the range of your personal perception but of the universe as a whole. People with different philosophical preferences may view this as extremely negative or extremely positive. I personally am in the latter camp, but Nature does not always respect preferences.

3.4. Requiring Special States

The last topic in this section is why there should be the particular restriction on initial states that selects the “special” ones. A partial answer is that if at some point in the distant future there is no substantial grotesqueness, then that will impose the special state restriction for all times. For suppose that there is such a future condition. I claim that the least unlikely way for it to happen is to have no grotesqueness for all prior times. This is because once you allow a level of macroscopic superposition it is extremely difficult to undo. The living Schrödinger cat may become an experimental animal sent to the moon, and the dead one buried outside the perpetrator’s lab (the “worlds” have split). Getting them back together coherently is not an option. For this reason, satisfying the non-grotesqueness condition for all times is less unlikely. And the only way to do this by unitary evolution is by means of special states (which is essentially the definition of special state). I should remark that although for many individual kinds measurements I have shown that special states exist, it is a serious question whether there are enough so that every final condition has enough richness to be the special initial condition for the next thing that is going to happen. (My personal expectation is that the identity of particles—e.g., all electrons are the same electron—makes satisfying this condition less formidable, but that question is one that I did not pursue quantitatively, pending experimental testing of the ideas.)

I call the boundary condition just discussed a “partial” explanation because it only leads to another question: why this future boundary condition? Here my response is speculative and may well reflect limitations of my own imagination as well as contemporary scientific ignorance (cf. Boltzmann’s explanation the arrow of time [20] as a fluctuation in an enormously long-lived universe). In the usual many world discussions the image is of steady branching to more and more “worlds”. With this picture it is not absurd (but also not necessarily implied) that long ago there were fewer such worlds, perhaps at some early stage just one initial wave function that had no macroscopically different superpositions. (Some would call this a quantum arrow of time.) Now let us imagine a cosmology in which there is an eventual contraction. This does not seem a likely scenario in view of the discovery of accelerated expansion, but in the many speculations on the implications of that discovery, contraction, or even a big crunch, is far from having been ruled out. Under these circumstances it is plausible to argue that the arrow of time is a consequence of space-time geometry, so that the end and the beginning should have roughly the same state, which would be non-grotesque. This is admittedly a lot to swallow. But I would refer to the many revolutions that cosmology has undergone, even since the 1930’s discovery of expansion [21]. Or, this condition on states may obtain for reasons that I am totally unable to imagine, just as limited knowledge of cosmology in Boltzmann’s day made some of his views on the arrow of time untenable.

4. Experiment

4.1. Properties of the Kick

In Section 2.3 we looked at a two level spin system passing through a Stern–Gerlach (SG) apparatus. Our purpose was to establish minimal requirements for any kind of special state. Now however, we really want to consider the true physical system, the Stern–Gerlach experiment.

As emphasized, our usual perspective is not to focus on the dynamics of this system alone, but rather on that of the entire environment necessary for a full description. The richness of the environment is what supports the existence of special states. However, in the present section and in the analysis of Section 2.3, a different viewpoint is taken, closer to the way most quantum calculations are done. The environment is in whatever special state it is in, but because this state may be rare, its action on the particle or spin of interest will also be unusual. We focus on that action alone and treat the environment’s rare action through an effective Hamiltonian. This Hamiltonian provides the time evolution of the wave function through left multiplication by

exp (- i H_{eff} t / ℏ)

.

The system is prepared by passing a beam of atoms through another Stern–Gerlach apparatus and only that part of the beam having a particular value of angular momentum, say

+ ℏ / 2

along a particular direction, is selected and sent on to the next SG apparatus. The second SG apparatus is not (necessarily) oriented in the same direction. Let the direction of motion (aside from the eventual deflection) be in the positive y direction and the gradient of the second SG apparatus be in the z direction. Let

i, j

, and

k

be unit vectors along the

x, y

, and z axes, respectively. We assume that when entering the second apparatus—which is the one on which we focus—the atom’s spin is along the direction

n = k cos θ - j sin θ

, for some angle θ. As in Equation (5), the initial wave function of an atom, when exiting the first SG apparatus, can be taken to be

u_{θ} = e^{i θ σ_{x} / 2} (\begin{matrix} 1 \\ 0 \end{matrix}) = (\begin{matrix} cos \frac{θ}{2} \\ i sin \frac{θ}{2} \end{matrix})

(12)

consistent with the preparation just specified. It is possible to multiply

u_{θ}

by an arbitrary overall phase or to use density matrices, but this does not affect our conclusions. For the SG experiment the final state should have

| u_{f} (1) | = 1

(“up”) or

| u_{f} (2) | = 1

(“down”), which requires that the angle in Equation (12) be rotated to become an integer multiple of π. Thus the overall action of the effective Hamiltonian is to add an angle ϕ to

θ / 2

so as to accomplish this goal [22]. We refer to this action of the effective Hamiltonian as a “kick”. The kick is thus a left multiplication of the wave function by

e^{- i H_{eff} t / ℏ} = e^{i ϕ σ_{x}}

(13)

bringing it to up or down. As indicated, the effective Hamiltonian in Equation (13) represents the effect of uncontrollable elements of the environment.

As discussed in Section 2.3 (and proved in Section 9.1 of Reference [1] or [23], Section 4.1) recovery of the Born probabilities requires that the kicks be Cauchy distributed, namely that the probability density for a kick of size ϕ should be

C_{a} (ϕ) = \frac{a / π}{a^{2} + ϕ^{2}}

(14)

with a a parameter that is small. Moreover, it is a property of this distribution that the least unlikely way to achieve large (compared to a) total rotation of the spin is through a single kick.

For up we thus require

ϕ = n π - θ / 2

and for down

ϕ = (n + 1 / 2) π - θ / 2

, with

n = 0, \pm 1, \pm 2, \dots

. Define

F_{a} (ψ) \equiv \sum_{n = 0, \pm 1, \pm 2, \dots} \frac{a / π}{a^{2} + {(n π - ψ)}^{2}}

(15)

Then the probability for the two outcomes is

Pr (UP) = \frac{1}{Z} F (\frac{θ}{2}), Pr (DOWN) = \frac{1}{Z} F (\frac{θ - π}{2})

(16)

with Z, the sum of F at the two values, providing normalization. For small a this recovers the standard probabilities. The sums can be done explicitly, but we hold off, since there will be related sums to evaluate and we will do all of them at once.

In searching for evidence of special states, presumably the larger the kick the larger the signal. With this in mind, we calculate the expectation of kick size both conditioned on an outcome and unconditioned. We thus want

\begin{matrix} {〈ϕ〉}_{UP} & = \frac{a}{Z π} \sum_{n = 0, \pm 1, \pm 2, \dots} \frac{(n π - \frac{1}{2} θ)}{a^{2} + {(n π - \frac{θ}{2})}^{2}} \end{matrix}

(17)

\begin{matrix} {〈ϕ〉}_{DOWN} & = \frac{a}{Z π} \sum_{n = 0, \pm 1, \pm 2, \dots} \frac{(n π - \frac{θ - π}{2})}{a^{2} + {(n π - \frac{θ - π}{2})}^{2}} \end{matrix}

(18)

and their sum.

To evaluate Equations (15), (17) and (18) consider the following identity [24]

\frac{1}{tan z} = \sum_{n = - \infty}^{\infty} \frac{1}{z - n π}

(19)

where n runs over the integers. The poles of one over the tangent function occur at multiples of π and the residues are unity. Let

z = θ + i a

. Using elementary relations we write the real and imaginary parts of Equation (19),

\begin{matrix} \frac{tan θ}{{tan}^{2} θ {cosh}^{2} a + {sinh}^{2} a} & = \sum_{n} \frac{θ - n π}{{(θ - n π)}^{2} + a^{2}} \end{matrix}

(20)

\begin{matrix} \frac{tanh a}{{tanh}^{2} a {cos}^{2} θ + {sin}^{2} θ} & = \sum_{n} \frac{a}{{(θ - n π)}^{2} + a^{2}} \end{matrix}

(21)

From Equation (21) we get the following information:

Z = \frac{4 a}{π} \frac{1}{{sin}^{2} θ}

and for sufficiently small a,

Pr (DOWN) / Pr (UP) = {tan}^{2} (θ / 2)

, as it should.

Remark 1: As mentioned in Section 2.3 and explicitly calculated in [1,23], for a not negligible there will be a deviation from standard probabilities. This imposes a restriction on a, but does not provide an experimental test since, in the absence of physical specifics, there is no information on the size of a.

Equation (20) gives the sums used in the expectations of the kick-angles and yields

\begin{matrix} {〈ϕ〉}_{UP} & = - sin \frac{θ}{2} {cos}^{3} \frac{θ}{2} \end{matrix}

(22)

\begin{matrix} {〈ϕ〉}_{DOWN} & = - {sin}^{3} \frac{θ}{2} cos \frac{θ}{2} \end{matrix}

(23)

If

θ \approx 0

there is no specializing, so the expected kick size for those measured as up goes to zero. Surprisingly perhaps for those measured as down the expectation is even smaller. This is because although the kicks (however few) are larger, they are as likely to be positive as negative.

According to Equations (22) and (23) the average kick size is order unity, although given the quirks of the Lévy distributions, this was not a foregone conclusion. Looking at Equations (17) and (18) it is clear that moments higher than the first do not exist (the first moment is borderline), so that it is conceivable that with experimental studies that focus on large kicks other information may be gleaned.

Remark 2: Three of the series that we have considered, Equations (17), (18) and (19), are only conditionally convergent. As Hille [24] remarks in connection with Equation (19), one can add

1 / n

(

n \neq 0

) to each summand to obtain absolute convergence, or what is essentially the same thing, choose to combine positive and negative n terms before summing the infinite series.

Remark 3: If one performs a series of experiments and manages to measure the kick in each of them, the average will not converge to the results of Equation (22) or Equation (23). This is where the “quirks” of the Lévy distribution enter. As remarked, some of our series are not absolutely convergent and the distribution is not self-averaging. In fact the average of many measurements has the same probability distribution as a single measurement. This can be useful for the experimentalist looking for the effect, since even with averaging there is no suppression of large magnitude kicks. The use of the average might be thought of as the setting of the scale but in fact the only scale is a, which is taken to be small. By conditioning on large events, a disappears and there is really no scale.

Depending on experimental setup, it is possible to optimize the angle for maximum signal. For example, suppose one is able to sort particles according to outcome. Then to optimize as a function of θ, one would consider the strength of the field needed for (say) up, times the probability of up. This is proportional to

F (θ) \equiv {cos}^{2} (θ / 2) {〈 ϕ 〉}_{UP}

. The derivative of this function,

F^{'} = (1 / 2) {cos}^{4} (θ / 2) ({cos}^{2} (θ / 2) - 5 {sin}^{2} (θ / 2))

, vanishes for

θ = π

and

θ = 2 {tan}^{- 1} (1 / \sqrt{5}) \approx 48^{\circ}

. Both are stationary points, but the maximum is the second value,

48^{\circ}

. On the other hand, one may send in a large number of particles and simply want to maximize the (absolute value of) the total,

| {cos}^{2} (θ / 2) {〈 ϕ 〉}_{UP} + {sin}^{2} (θ / 2) {〈 ϕ 〉}_{DOWN} |

. This gives

F \equiv (1 / 2) (sin θ) [1 - (1 / 2) {sin}^{2} θ]

which has a shallow minimum at

90^{\circ}

and maxima symmetric about this minimum, one of them being at

θ = {sin}^{- 1} (\sqrt{2 / 3}) \approx 55^{\circ}

.

It follows that there is not a lot of profit in fine tuning the optimization. However, what is more significant is that there is a definite θ dependence. Thus if θ is varied between 0 and

π / 2

one could compare a

θ \approx 0

no-signal situation (no special state is needed) with a positive signal situation, say at

θ \approx 50^{\circ}

.

4.1.1. Strength of the Field Inducing the Kick

For a spin about to enter a Stern–Gerlach apparatus, the effective part of

H_{eff}

of Equation (13) involves a magnetic field,

B

. For the kick angle ϕ to have characteristic size unity we require

|\frac{H_{eff} Δ t}{ℏ}| = |\frac{μ \cdot B Δ t}{ℏ}| = \frac{1}{2} |ϕ| \sim 1

(24)

where

Δ t

is the duration of the field’s interaction with the spin. The quantity μ is essentially the electron magnetic moment; taking its magnitude to be the Bohr magneton, implies

B Δ t \sim 10^{- 11} Ts

(25)

To evaluate B requires an estimate of

Δ t

, in turn requiring some picture of the nature of the interaction. At this stage, two possibilities present themselves. The field may be connected to the strong magnetic field the atom experiences in approaching and passing through the magnets. Or the field could be something separate, carried perhaps by an externally arriving photon.

We first consider a possible association with the SG field. A conservative estimate would be interaction durations of a few ms, in which case field strengths would be about

10^{- 8}

T, which is well within the range of macroscopic measurement. However, this is probably too conservative. In a typical SG experiment the Ag or K atoms are moving at about 1 km/s. If the kick takes place within about 10cm, then

Δ t \sim 1 μ

s and the field strength would be on the order of 0.1 G, something your compass needle could discern.

As far as an electric field generated by this transient field, Maxwell’s equations suggest

E \sim \frac{L B}{T}

, where L is the characteristic scale for the spatial variation of E and T the time scale for variation of B. If

L \sim 10^{- 1}

m and

T \sim 1 μ

s, we find an electric field on the order of

1

V/m, also easily measurable. Another estimate in this connection uses

L / Δ t \sim v = 1

km/s. Thus

E \sim \frac{L B}{Δ t} \sim \frac{L B Δ t}{{(Δ t)}^{2}} = \frac{v B Δ t}{L} = \frac{10^{10} 10^{- 11}}{L} \sim \frac{10^{- 1}}{L}

.

Now consider an outside photon, not necessarily related to the magnetic fields of the SG apparatus. An estimate of this photon’s energy can be made in terms of the time of interaction: since

μ \cdot B

is an energy, by Equation (24) that energy should be roughly

ℏ / Δ t

. If

Δ t

is a characteristic electromagnetic interaction time,

10^{- 16}

s, this gives an energy on the order of 5 eV.

4.1.2. Magnetic Fields along the Particle Path

A convenient way to study the field in the Stern–Gerlach apparatus [25] is to replace the magnets (for purposes of calculation) by a pair of infinite parallel wires with currents flowing in opposite directions. The magnitude of the field is then constant on (circular) cylindrical surfaces for distances large in comparison to the wire separation. This matches the field seen by the passing particle if the pole pieces have the shape of those cylinders. As desired, this magnetic field has a steep gradient perpendicular to the cylindrical surfaces.

Our interest is not so much in the field within the magnet as the field seen by the atom as it approaches the magnet, moving in the positive y direction. This will certainly depend on the specifics of the magnet, but to get a handle on those fields and to go beyond dimensional analysis, we study the finite length magnetic field by simulating the actual field by one generated by a current loop that consists of two wires, but now they are finite. They extend for the length of the magnet and are joined at each end by a semicircular loop (completing the circuit). Figure 8 illustrates the following geometry: The circuit is in the x-y plane (

z = 0

). The straight-wire portions run from

y = - L / 2

to

+ L / 2

, the upper portion at

x = + s

, the lower one at

x = - s

. The semicircles at each end (also in the x-y plane with

z = 0

) are of radius s. The particle trajectory is in the direction of increasing y and parallel to the y-axis. It has

x = 0

and a value of z large enough so that in its neighborhood the contour lines of the field are essentially circles in the x-z plane. The field at a point

R = y j + z k

is given by the following integral:

B (R) = \frac{μ_{0} I}{4 π} \oint_{Γ} d r \times \frac{(r - R)}{{| r - R |}^{3 / 2}}

(26)

where I is the current and SI units are used. Now the particle is deflected in the positive or negative z direction (that’s the point of the experiment). But there will also be some spread of the beam in the x direction whose consequences for the field we will evaluate to lowest order. The contour, Γ, consists of four parts, the top (“T”) portion of the wire parallel to the y axis, the bottom (”B”) portion, the right semicircle (“R”,

y = L / 2

) and the left semicircle (“L”,

y = - L / 2

). For

R = y j + z k

(which is the plane

x = 0

), the straight wire portions can be fully integrated and give

\begin{matrix} \begin{matrix} B_{T & B} (R) & = \frac{μ_{0}}{4 π} I \int_{- L / 2}^{L / 2} d η j \times \frac{s i + (η - y) j - z k}{{[s^{2} + z^{2} + {(η - y)}^{2}]}^{3 / 2}} + \{I \to - I & s \to - s\} \\ = \frac{μ_{0}}{4 π} (\frac{- 2 I s k}{s^{2} + z^{2}}) [sin θ_{2} - sin θ_{1}] \end{matrix} \end{matrix}

(27)

where

tan θ_{(\binom{2}{1})} = (- y \pm L / 2) / \sqrt{z^{2} + s^{2}}

. We also present the first order correction for small x, i.e., the observation point

R

becomes

x i + y j + z k

. The additional term is of the form

{x \partial B / \partial x |}_{x = 0}

. After a bit of calculation one obtains

{\frac{\partial B_{T & B} (R)}{\partial x}|}_{x = 0} = - \frac{μ_{0} I}{4 π} 2 s z i [A_{+} - A_{-}]

(28)

where

A = λ \frac{(2 λ^{2} + 3 b^{2})}{b^{4} {(λ^{2} + b^{2})}^{3 / 2}}

(± implicit on A and λ), with

λ_{\pm} = \pm \frac{L}{2} - y

and

b^{2} = s^{2} + z^{2}

. Because of the z dependence, vertical (x) spread in the beam will cause (unwanted) blurring of the spin-induced splitting.

Figure 8. Geometrical configuration. The separation of the wires is 2s. The particle moves in the positive y direction in the plane

x = 0

and at a positive, essentially constant z value that is larger than s. Looking at the circuit from positive z, Equation (27) corresponds to a current moving in the clockwise direction.

Figure 8. Geometrical configuration. The separation of the wires is 2s. The particle moves in the positive y direction in the plane

x = 0

and at a positive, essentially constant z value that is larger than s. Looking at the circuit from positive z, Equation (27) corresponds to a current moving in the clockwise direction.

The field from the two semicircular portions does not have a general closed form solution but analytic information can still be obtained. The length of the path within the magnet, L, will be assumed long enough so that we need consider only one semicircle at a time. Moreover, with respect to the SG apparatus on which we focus (the second) the field on exit is irrelevant, since at that stage only location is measured, not spin. Nevertheless, the exit field will play a role for the first apparatus, because it can change what we assume is the incoming state. Qualitatively though, the possible effects will be the same.

A point on the semicircular portion of the wire near

- L / 2

is given by

r = - \frac{L}{2} j - s (j sin ψ + i cos ψ)

, with ψ running from 0 to π. For clockwise circulating current (as viewed from positive z)

d ψ

is in the direction of the current. After a bit of calculation we obtain an expression for the left semicircular (“L”) contribution

B_{L} (R) = \frac{μ_{0} I}{4 π} \int_{0}^{π} s d ψ \frac{z (i cos ψ + j sin ψ) - k (\bar{y} sin ψ + x cos ψ + s)}{{[x^{2} + {\bar{y}}^{2} + s^{2} + 2 s (\bar{y} sin ψ + x cos ψ)]}^{3 / 2}}

(29)

where

\bar{y} \equiv y + \frac{L}{2}

. For purposes of studying the effective Hamiltonian, Equation (13), we are only interested in the x-component of this field. Specializing to

x = 0

, the integral can be performed, yielding

B_{L x} (R) = \frac{μ_{0} I}{π} [\frac{1}{\sqrt{{\bar{y}}^{2} + s^{2} + z^{2}}} - \frac{1}{\sqrt{{(\bar{y} + s)}^{2} + z^{2}}}]

(30)

As the atom approaches the magnet, this field rotates the spin one way and then the other. The magnitude of this field is substantial. Rewrite the field as

B_{L x} (R) = \frac{μ_{0} I}{4 π} \frac{1}{z} [\frac{4}{\sqrt{{(\frac{\bar{y}}{z})}^{2} + {(\frac{s}{z})}^{2} + 1}} - \frac{4}{\sqrt{{(\frac{\bar{y} + s}{z})}^{2} + 1}}]

(31)

The dimensionless quantity in the square brackets has a maximum of about

1 / 2

for

s / z \sim 0.75

, which is approximately the value in the experiment of Reference [25]. Comparing Equation (27) and Equation (31) it is seen that the external field reaches almost half the field value inside the magnets.

4.2. Detection Scenarios

The general strategy is to send in atoms with spins at (say) 50° relative to the z-axis (tilted along the y-axis) and to send them in at 0° [26]. Comparison of the two cases should show additional “random” activity—noise—when they are at the non-zero angle. At 0° no kicks are necessary to drive the spins into a single beam for the SG experiment. At 50° they will all need to be sent one way or the other. The actual rotating of the spins would not itself be visible, but related and additional fields should be present. The idea is that there should be “collateral damage”, by which is meant that the photon or field fluctuation is not perfectly matched to accomplish its rotational task and nothing more. As discussed at length in Reference [1], in generating a special state one seeks the least unlikely of them. A fundamental assumption in the present proposal is that a perfect match is less likely than an imperfect one. In addition, by virtue of Maxwell’s equations, there are compulsory electric fields alongside the magnetic fields that rotate the spin.

Ways to fine-tune the strategy above may certainly exist. For example, if the signal of a kick can be correlated with a particular atom (which goes either up or down), differences in signal rates for different angles can be further exploited.

4.2.1. Scenario when the Fields Are Generated by the SG Magnets

One issue is the stability of the fields. The fields needed for rotating the spins are on the order of 1G, while the magnet is maintaining a field of roughly 5000 G. One thus needs field measurements with better than 0.1% accuracy. It should also be recalled that the preparation of the spin at some particular angle is accomplished by means of a earlier SG setup. Kicks can occur in the first as well as the second magnet. The rotating fields for the magnets (meaning, for the example studied, fields in the x-direction) are also different for different atoms because of finite beam width (cf. Equation (31) where there is z-dependence in the field).

Furthermore, the magnetic fields that can rotate the spin are necessarily accompanied by electric fields since the variety of rotation directions through the magnet (for

θ \neq 0

) demands time-dependent variation of

B

. With an atomic velocity of 1 km/s, a conservative estimate puts these fields on the order of 1 or more V/m.

For an atomic beam, there may be additional effects. Many atoms pass through the magnet at roughly the same time. Not all of them are rotated the same way, so that rapid variation of the magnetic field would be required (along with the electric fields just discussed). In addition, the “least unlikely” principle suggests that there would be a tendency for bunching in the output, that is there would be short-time correlations in up or down outcomes. The rationale is that a single large fluctuation is more likely than two independent ones.

4.2.2. Scenario when the Fields Are Generated by External Photons

Our rough estimate for photon energy was in the eV range, visible or UV light when the kick drives the spin around many times (as is occasionally expected, given the Cauchy distribution). Individual photons in this energy range should be easy to detect.

It should be pointed out though that the estimates of Section 4.1.1 are only that—estimates. A general scale is established. However, the properties of the Cauchy distribution imply that this scale will often be vastly exceeded. For this reason I do not go beyond the semiclassical assumption, implicit in that calculation, that the field acts on the atom, but not vice versa. For atom-photon scattering one should in principle work in a QED context. My assumption is that both incoming photon and outgoing photon will all be on the scale of the estimate.

5. Discussion

There are three issues to be taken up in this discussion: (1) Comments on the plausibility of the overall theory; (2) Review of the nature and assumptions in the experimental test; (3) The possibility of other tests.

Concerning the special state theory, I think that Bohr’s criterion of being “crazy enough” is satisfied [27]. Personally I have no problem with the restriction on initial states, nor on the idea of what is sometimes called a “block universe”, one in which past and future are all part of a unified space-time (and maybe more) history. Where my credibility is stretched is the possibility that there are so many microstates that specialness is possible again and again and again. On the other hand, I am sufficiently unhappy with other quantum measurement ideas, either giving up unitarity or having many worlds or giving up the idea that the wave function is any more than a computational tool, that I am prepared to entertain this “crazy enough” idea.

The proposed experiment would involve two sets of Stern–Gerlach apparatus, one for preparation, one for measurement. The calculations in this article leave open two possibilities for the detection of a signal accompanying the rotation of the atom’s spin. In one case, there would need to be high quality light sensors along the path between them (which the experimentalist must therefore maintain in darkness). In the other, precise measurements of the magnetic field (as well as stability of that field) would be necessary. Alternatively electric fields could be measured close to the entry to the magnets. It is also possible that bunching effects would be detected in measurements of atom positions.

Our proposals are based on a number of assumptions. For photon measurements, we expect the energy of the emitted photon to be in the eV range. This is based on no more than the fact that the usual time scale for electromagnetic interactions is

10^{- 16}

s. I can easily imagine an order of magnitude correction in either direction. However, the range of “kick” sizes is also great, so that even if, say, the bulk of the photons landed in the infrared, some would be visible. Moreover, there are sensors for these other energy ranges. Another assumption is the concept of what I have called “collateral damage”. Namely, if the spin is to be rotated by a specific amount, it is likely that the field or photon doing the job is not exactly tailored to do only that, but rather would have some other energy value and would carry away the excess. Moreover, since the strength of the needed kick has a long-tail distribution, the excesses, presumably on the same scale, would have the same distribution. In addition, if the rotating field is that of the magnet, even if there is little or no excess in the magnetization field, a significant electric field (demanded by Maxwell’s equations) would still appear. The nature of the demand for the electric field implies that it too be Cauchy distributed. There are of course other assumptions, such as identifying the location of the kick as the atom’s path before being well into the second SG magnet, but they seem to me more secure hypotheses.

One might also ask, does the measurement of the “kick” on the path of the particle already fix the outcome, in the same way that checking which slit a particle goes through can destroy the interference pattern in a two-slit experiment. Analyzing this question requires determining whether the upstream (i.e., before entering the second SG magnet) measurement can actually predict the outcome, which in turn requires a more quantitative estimate of the expected signal. However, from the standpoint of confirming the theory described above, there are two aspects of the suggested tests that are significant even if predictive information could be deduced from the upstream measurement. First, the contrast between 0° and 50° entry beams (the angles are the orientation of the atoms relative to the z axis of the second SG magnet) would exist whether or not the spin localization (“space quantization”) were observed. At 0° there would be no signal, not photons, not electric fields; at 50° there would be such a signal whether or not the usual SG splitting were observed. Secondly the observation of a Cauchy distribution in the noise would also be support for this theory, since nothing in the Copenhagen interpretation involves long tailed distributions.

The last issue concerns other possible two- (or more) state observations. The beam splitters and polarizers, used for example in [2] and working with photons instead of atoms, do jobs similar to that of the Stern–Gerlach apparatus and may be simpler to set up. I have not analyzed such experiments because I have less confidence in being able to identify where the least unlikely changes in the photon would take place. Partly this is my own ignorance and partly this reflects the greater complexity in, say, rotating polarization, involving as it does a medium. As suggested in [1] (§10.3) the field of quantum computation, focusing as it does on the control of individual qubits, also presents opportunities for observing the signs of “specializing” (although indications of Cauchy noise in quantum computation were not anticipated there, and were suggested by a referee). A mesoscopic version of the SG experiment involving electron spins is given in [28], and may well provide a more conveniently performed experiment than that suggested in the present article. In any case, in principle Cauchy distributed noise should appear whenever a selection of macroscopic states is demanded. If this can conveniently be matched with cases where no selection is needed (as in sending in beams in the SG experiment oriented at 0° and 50°) then the comparison should show the differences the special state theory predicts.

Acknowledgments

The list of those to whom I am grateful for participation in the formation of these ideas is long and can be found in [1]. In developing the new material in this article I have received helpful advice from Bernard Gaveau, Karel Polak, Carlo Rizzo, Marco Roncadelli and Dipanker Roy. I must, however, add to these thanks that any errors, especially when experimental details are under discussion, are my own. I am also grateful for the hospitality of the Max Planck Institute for the Physics of Complex Systems in Dresden, where some of the work on this article took place.

References and Notes

Schulman, L.S. Times Arrows and Quantum Measurement; Cambridge University Press: New York, NY, USA, 1997. [Google Scholar]
Jacques, V.; Wu, E.; Grosshans, F.; Treussart, F.; Grangier, P.; Aspect, A.; Roch, J.-F. Experimental realization of wheelers delayed-choice gedanken experiment. Science 2007, 315, 966–968. [Google Scholar] [CrossRef] [PubMed]
Wheeler, J.A. Frontiers of Time, in Problems in the Foundations of Physics; di Francia, G.T., Ed.; North Holland: Amsterdam, The Netherlands, 1979; pp. 395–497. [Google Scholar]
Wheeler, J.A. Delayed-Choice Experiments and the Bohr-Einstein Dialog; Rhoads, J.E., Ed.; Amerrican Philosophical Society: Philadelphia, PA, USA, 1981; pp. 9–40. [Google Scholar]
Schulman, L.S. Delayed choice experiments, the arrow of time, and quantum measurement. AIP Conf. Proc. 2011, 1408, 153–167. [Google Scholar]
The material in [5] was also presented in a lecture at the 20^th anniversary celebration of the Center for Theoretical Studies, Prague, Czech Republic, November 2010.
Schulman, L.S. Observational line broadening and the duration of a quantum jump. J. Phys. A 1997, 30, L293–L299. [Google Scholar] [CrossRef]
Schulman, L.S. Continuous and pulsed observations in the quantum Zeno effect. Phys. Rev. A 1998, 57, 1509–1515. [Google Scholar] [CrossRef]
Schulman, L.S. Jump time and passage time: The duration of a quantum transition. In Time in Quantum Mechanics, 2nd ed.; Muga, J.G., Mayato, R.S., Egusquiza, I.L., Eds.; Springer-Verlag: Berlin, Heidelberg, Germany, 2008; pp. 99–120. [Google Scholar]
Schulman, L.S.; Doering, C.R.; Gaveau, B. Linear decay in multi-level quantum systems. J. Phys. A 1991, 24, 2053–2060. [Google Scholar] [CrossRef]
Griffiths, R.B. Consistent histories and the interpretation of quantum mechanics. J. Stat. Phys. 1984, 36, 219–272. [Google Scholar] [CrossRef]
A solvable model of particle detection in quantum theory. Acta Fac. Rerum Nat. Univ. Comen. Phys. 1980, 20, 65–94.
Gaveau, B.; Schulman, L.S. Model apparatus for quantum measurements. J. Stat. Phys. 1990, 58, 1209–1230. [Google Scholar] [CrossRef]
We also use probability when—even if the initial point is known—it is impractical to calculate the later phase space point, for example for chaotic dynamics. Taking this into account would lead to a slight restatement of the assertions in the text, but the quantum issues, which are the point of the discussion, are the same.
Schulman, L.S.; Newton, R.G.; Shtokhamer, R. Model of implication in statistical mechanics. Philos. Sci. 1975, 42, 503–511. [Google Scholar] [CrossRef]
Something rare can still be abundant. Here is an example that will make this utterance sound less mystical. Consider the melting ice story from the arrow of time discussion. The 2 p.m. ice-plus-cold-water into which the 1 p.m. ice cube has melted has far more microstates than just those coming from an ice cube of one particular size. Therefore, as discussed, when thinking of the 2 p.m. state as the image of an earlier state there is an order 1 in $10^{10^{24}}$ restriction on its microstates. However, there are many possible forms the 1 p.m. piece of ice could take—a small and a large cube, a cube and 12 chips, an ice sculpture of a polar bear, of a tulip, etc., etc. So final state microstates are relatively rare, but still have $10^{10^{24}}$ ’s in their abundances. Well, maybe it’s only $10^{10^{23.99}}$ .
Adler, R.J.; Feldman, R.E.; Taqqu, M.S. A Practical Guide to Heavy Tails: Statistical Techniques and Applications; Birkhäuser: Boston, MA, USA, 1998. [Google Scholar]
Samorodnitsky, G.; Taqqu, M.S. Stable Non-Gaussian Random Processes: Stochastic Models with Infinite Variance; Chapman and Hall: New York, NY, USA, 1994. [Google Scholar]
Arnold, V.I.; Avez, A. Ergodic Problems of Classical Mechanics; Benjamin: New York, NY, USA, 1968. [Google Scholar]
Boltzmann, L. Lectures on Gas Theory; Dover Publications: New York, NY, USA, 1995; Section 90; pp. 446–448. [Google Scholar]
L. Landau is said to have characterized cosmologists as, “Often in error; never in doubt”.
ϕ in this section corresponds to the 2ψ of Equation (7). The definition of a also changes by a factor 2.
Schulman, L.S. Definite quantum measurements. Ann. Phys. 1991, 212, 315–370. [Google Scholar] [CrossRef]
Hille, E. Analytic Function Theory, Volume I; Ginn and Company: New York, NY, USA, 1959; Section 9.3, Equation 9.3.14. [Google Scholar]
MIT Dept. Physics, Junior lab. The Stern-Gerlach experiment: Quantization of angular momentum. 2003. Available online: web.mit.edu/8.13/JLExperiments/JLExp_18_rev1.pdf (accessed on 27 March 2012).
They could as well tilt along the x-axis, although details of our calculation would be slightly different. Note too, that for, say 50^∘, one would select from the output of the first SG setup. However, for the 0^∘ experiment this is not necessary, since there would still be an absence of “noise” even if both orientations are incoherently recombined and sent through the second SG setup.
See the following link for what may be the source of this expression. Available online: http://en.wikiquote.org/wiki/Niels_Bohr (accessed on 21 March 2012).
Ionicioiu, R.; DAmico, I. Mesoscopic Stern-Gerlach device to polarize spin currents. Phys. Rev. B 2003, 67, 041307:1–041307:4. [Google Scholar] [CrossRef]

Appendix

A. Search Technique for Special States

I use the notation of the decay narrative in Section 2. Let the projection operator for the n-dimensional subspace of the initially excited atoms be called P. Let the propagator for the full

(N + n)

-dimensional Hamiltonian (H) be called U, so that

U = exp (- i H t_{0} / ℏ)

, where

t_{0}

is the particular time at which the state must be non-grotesque. If the initial state,

ψ_{0}

, is undecayed then it satisfies

P ψ_{0} = ψ_{0}

. The probability that at time-

t_{0}

it is still undecayed is

S (t_{0}) = {||P U ψ_{0}〉|}^{2}

, the “survival probability”. This can be rewritten as

S (t_{0}) = 〈ψ_{0} |C^{†} C| ψ_{0}〉

, with

C \equiv P U P

. The problem of finding states that decay entirely or do not decay at all becomes the problem of finding eigenvectors of

C^{†} C

with eigenvalues near 0 or 1. In general for large enough systems (thinking beyond the particular decay model of the Hamiltonian Equation (1)) there will be many eigenvalues quite close to both limits. For the case at hand (and this is related to the straightness of the line in Figure 2) almost all the eigenvalues cluster around zero and one [10]. The latter property holds when the coupling matrices ϕ are essentially constant.

The figures shown in Section 2 are based on numerical calculations.

© 2012 by the author; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/.)

Share and Cite

MDPI and ACS Style

Schulman, L.S. Experimental Test of the “Special State” Theory of Quantum Measurement. Entropy 2012, 14, 665-686. https://doi.org/10.3390/e14040665

AMA Style

Schulman LS. Experimental Test of the “Special State” Theory of Quantum Measurement. Entropy. 2012; 14(4):665-686. https://doi.org/10.3390/e14040665

Chicago/Turabian Style

Schulman, Lawrence S. 2012. "Experimental Test of the “Special State” Theory of Quantum Measurement" Entropy 14, no. 4: 665-686. https://doi.org/10.3390/e14040665

APA Style

Schulman, L. S. (2012). Experimental Test of the “Special State” Theory of Quantum Measurement. Entropy, 14(4), 665-686. https://doi.org/10.3390/e14040665

Article Menu

Experimental Test of the “Special State” Theory of Quantum Measurement

Abstract

1. Introduction

2. Quantum Mechanics

2.1. Use of the Special State

2.2. The Assumption Concerning Special States in Nature

2.3. Recovering Probabilities

3. Statistical Mechanics

3.1. The Arrow of Time

3.2. The Cat Map

3.3. Special States and Determinism

3.4. Requiring Special States

4. Experiment

4.1. Properties of the Kick

4.1.1. Strength of the Field Inducing the Kick

4.1.2. Magnetic Fields along the Particle Path

4.2. Detection Scenarios

4.2.1. Scenario when the Fields Are Generated by the SG Magnets

4.2.2. Scenario when the Fields Are Generated by External Photons

5. Discussion

Acknowledgments

References and Notes

Appendix

A. Search Technique for Special States

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI