I am finally in position to explore these theorems in detail. I start with an evaluation of Brukner’s work.
3.1. Evaluation of the No-Go Theorem for Observer-Independent Facts
The initial observation I make is that the discussion in [
6] of the key ‘Observer-independent facts’ assumption is somehow ambiguous and allows for two different interpretations. The issue is that it is not clear whether what Brukner calls the observational statement
(“The pointer of Wigner’s friend’s apparatus points to result
”), refers to an observational statement uttered by Charlie, or by Alice when she performs a Z measurement. This is of relevance because these different possible readings lead to different contents for the ‘Observer-independent facts’ assumption.
One option is to take
as a statement uttered by Alice. In that case, since the ‘Observer-independent facts’ assumption requires “an assignment of truth values to statements
and
independently of which measurement Wigner performs”, then the assumption would demand, not only for all performed experiments to have well-defined results, but also for (at least some) unperformed experiments to do so. This seems to be the interpretation of the ‘Observer-independent facts’ assumption offered in [
7] where, as we saw, it is taken as demanding propositions about all observables
that might be measured to be assigned a truth value, independently of whether they were actually performed or not (Ref. [
10] contains the same interpretation). That is, they read Brukner’s assumption as implying that, even when Alice performs a Z measurement, a statement regarding the result of her
unperformed X measurement has to have a well-defined value. This is why, in [
7], it is concluded that the ‘Observer-independent facts’ assumption is equivalent to KSNC.
However, even with this reading of Brukner’s assumption, it seems to me that this identification is illegitimate. KSNC is an assumption demanding for the results of all experiments to be independent of the
context of the measurement and, in particular, independent of which other measurements were performed simultaneously. I fail to see how, by reading
as a statement by Alice, the ‘Observer-independent facts’ is equivalent to demanding non-contextuality. (That is, KSNC would demand, for instance, for the result of Alice’s experiment to be independent of what Bob decides to measure but, by reading
as a statement by Alice, the ‘Observer-independent facts’ demands, say, for the result of Alice’s
measurement to be well-defined, even if such a measurement is not performed and Alice measures
. Moreover, the KSNC is a demand for all results, of all possible experiments. It is not clear to me that the particular demand by Brukner, made for a specific experiment, could straightforwardly be read as setting such a strong demand for all possible observables, in all possible experiments. In any case, it is important to clearly distinguish between a demand for experiments to yield observer-independent, absolute results—the motivation behind ‘Observer-independent facts’—and a demand for all measurements to passively, context-independently, reveal previously possessed well-defined values—the motivation behind KSNC. Of course, the latter is a much stronger requirement, already ruled out by the Kochen–Specker theorem). At any rate, I agree with [
7] that, by reading the ‘Observer-independent facts’ assumption this way, Brukner’s no-go theorem fails to set relevant limits on the objectivity of facts. The point is that, by interpreting
as a statement by Alice (and
as a statement by Bob), Brukner, at best, sets constraints on frameworks demanding objectivity, not only of performed experiments, but also of unperformed ones. It seems clear to me that such a demand goes well beyond what is usually required of a framework which seeks to maintain objectivity, so letting go of such a strong demand does not seem problematic at all.
What about opting for the other interpretation of , namely, taking as a statement uttered by Charlie? This is by far the most natural reading of Brukner’s ‘Observer-independent facts’ assumption and the one in line with Brukner’s own conclusions. As we saw, he takes the theorem to show that there can be no framework where the results of Wigner and his friend can jointly be considered objective facts of the world. Therefore, it seems clear that what he has in mind is that the observational statement made by Charlie has a well-defined truth value, even if Alice decides to perform a Z measurement. Let’s explore the import of Brukner’s theorem, given this interpretation of .
The starting point in the construction of the theorem is the observation that the ‘Observer-independent facts’ assumption implies the existence of the joint distribution . It is important to note, though, that, given the interpretation of (and ) under consideration, this would mean the existence of a joint distribution for the results of all four observers, Charlie, Alice, Debbie, and Bob, in a run in which Alice and Bob decide to perform an X measurement—as opposed to a distribution for two possible results of Alice and two of Bob. That is, the ‘Observer-independent facts’ assumption implies the reality of those particular four results of all four observers, so there must be a joint distribution for them. Since this is so, the marginals, expectation values and the CHSH inequality constructed out of must also refer to these four particular measurements. The problem, though, is that Brukner proceeds to compare such results with the quantum predictions, not for those four measurements, but for the two possible results of Alice and the two possible results of Bob. That is, even though, at the beginning, he interprets and as observational statements made by Charlie and Debbie, in the end, he takes and to be the results of Alice and Bob, when they chose to perform a Z measurement.
There is, then, a
hidden assumption behind the derivation of the theorem in [
6], namely, that the results of Alice and Bob, when they perform a Z measurement, necessarily coincide with the results previously obtained by Charlie and Debbie, respectively. In fact, for the construction of the theorem in [
7], such an assumption is made explicit (see items ii and iii of the AOE assumption). Therefore, I postpone a detailed discussion of the issue until I explore the LF theorem. For now, I just note the presence of this implicit, independent assumption in [
6], without which the theorem cannot be derived.
The next observation I make is related to the last step in the derivation of the theorem, namely, the comparison with quantum predictions. According to [
6], there are entangled states to be distributed among the two labs, such as that in Equation (
1), which lead to quantum predictions that violate the CHSH inequality. While this, of course, is the case for a standard Bell-type experiment, this does not automatically mean that this is also the case for the novel EWFS considered by Brukner, containing four agents and, in particular, two superobservers performing measurements over whole labs. One must be careful, then, to have in mind what is the correct
physical interpretation of the expectation values in a given CHSH inequality—and not because the quantum predictions violate such an inequality for a given experimental scenario, it means that they do so for others.
So what about the quantum predictions for Brukner’s scenario? The problem is that, since it involves intermediate measurements by Charlie and Debbie, such predictions crucially depend on what exactly happens during measurements. The issue, of course, is that the standard (textbook) interpretation of quantum mechanics is hopelessly vague in that respect [
11]. The upshot is that
standard quantum mechanics is simply unable to make concrete predictions for the scenario considered—the measurement problem gets in the way. That is, the Wigner’s friend scenarios under consideration are precisely the type of settings for which one cannot get away with pressing on, adopting an operational stance and ignoring the conceptual limitations of the standard framework.
It is often (implicitly or explicitly) assumed that a correct application of standard quantum mechanics to a Wigner’s friend scenario means that Wigner must describe the laboratory, and all of its contents, as evolving unitarily—allegedly in accordance with the quantum rule for the evolution of isolated systems. This, presumably, is what Brukner has in mind with his ‘Universal validity of quantum theory’ assumption. It is not clear to me, though, that standard quantum mechanics contains such a rule and, if so, that this would be a correct application of it. The issue is that standard quantum mechanics also contains the rule that systems collapse upon measurements by observers so, in a Wigner’s friend scenario, the unitary evolution of isolated systems and the collapse postulate collide.
In any case, one could simply
stipulate that the laboratory and all of its contents evolve unitarily, even during the measurements of the friends. The issue is that ref. [
6] also wants to assume that measurements yield objective results, but we know from [
12] that those two assumptions, unitarity and objective outcomes, are incompatible with another assumption inherent to the standard framework: that the physical description given by the quantum state is complete. Therefore, on pain of inconsistency, it is simply impossible to assume unitarity and objective outcomes, and to employ the standard framework to make predictions. One can, of course, employ a framework which solves the measurement problem to make predictions. The issue is that different frameworks—such as objective collapse models, pilot-wave theories, or Everettian scenarios—lead to different predictions for the scenario in question. For instance, while pilot-wave does predict a violation of the CHSH inequality in Brukner’s setting, objective collapse models do not (see [
5] for details).
Taking stock, the “no-go theorem for observer-independent facts” in [
6] depends on an independent, hidden assumption demanding for the Z measurement results of Alice and Bob to coincide with those of Charlie and Debbie. Moreover, there is no such thing as the correct quantum predictions for the proposed experiment, so there is no quantum benchmark with which to compare the predictions of models satisfying the imposed constrains—namely, the four assumptions in [
6] plus the hidden one mentioned above. That is, the constraints imposed on frameworks satisfying these assumptions are not really in conflict with quantum predictions. (A final comment regarding the conclusions in [
6]. We saw that the theorem is interpreted as showing that there can be no theoretical framework where the results of different observers can jointly be considered objective facts of the world. However, even ignoring all the problems described above, and taking the result as valid, it is not clear why one would be forced to discard the ‘Observer-independent facts’ assumption, as Brukner suggests. At best, the theorem would demand for (at least) one of its assumptions to be abandoned, without pointing to any one of them in particular). Below, I will show that the theorem in [
7] suffers from quite similar issues. I delay a full assessment of the impact of these limitations on the validity, significance and strength of these theorems until after the detailed evaluation of the “Local Friendliness no-go theorem”, to which we turn next.
3.2. Evaluation of the Local Friendliness No-Go Theorem
I start with a simple, intuitive proof of the LF inequality, for the particular case when Alice and Bob only have two binary-outcome measurement options—in which case the LF inequality coincides with the CHSH inequality. Consider the LF setting and an ensemble of runs for which Alice and Bob choose
. By AOE, there is a joint distribution for the results of all observers and, by Fine’s theorem [
13], the expectation values of products of results, calculated with the marginals of such a joint distribution, satisfy the CHSH inequality
with, e.g.,
, the expectation value of the product of the results of Charlie and Bob, when
.
Next, we employ the LF assumptions to transform this inequality, involving results of all four observers, into a CHSH inequality for the results of Alice and Bob. In particular, we note that
and that
That is, we see that NSD and L allow the identification of certain expectation values with the same observers, but different settings, and that AOE and, in particular, conditions ii and iii, allow us to substitute, under the right circumstances, Alice for Charlie and Bob for Debbie. Putting everything together,
that is, the CHSH inequality. (While standard proofs of Bell’s theorem explicitly allow for the probabilities to change from run to run (through their dependence on what is usually denoted by
), Ref. [
7] does not allow for such a change—they explicitly take the results of the friends to play the role of hidden variables, but do not allow for additional variables that could alter the joint probability distribution from run to run. One can fix the problem by adding a dependency of the joint distributions on
and averaging over it. For this to work, one must assume, on top of the LF assumptions, the so-called ‘Settings Independence’ assumption:
. The upshot is that, given settings independence, it is enough for the expectation values calculated with the joint distribution of each possible value of
to satisfy the inequality, for the average of them, over
, to do so).
Above we saw that the starting point of the LF theorem is a critique of Brukner’s ‘Observer-independent facts’ assumption, which leads to its substitution by LF’s AOE. Such an assumption demands for all performed experiments to have observer-independent results which, for the experiment under consideration, implies the existence of a joint distribution, . Moreover, such a joint distribution is demanded to ensure consistency between the outcomes of friends and superobservers when .
I maintain, however, that the critique in [
7] seems to depend on a particular, arguably not very natural, reading of an ambiguous element in Brukner’s presentation. In particular, they read [
6] as demanding all measurements that might be performed to have well-defined values, but it is more natural to read it as demanding all performed experiments to have well-defined values, which seems equivalent to what AOE demands. We also saw that Brukner’s derivation of the theorem depends on a hidden assumption, to the effect that the results of Alice and Bob, when they perform a Z measurement, necessarily coincide with the results previously obtained by Charlie and Debbie. It seems, then, that this hidden assumption is identical to the demand of consistency between friends and superobservers imposed by conditions ii and iii in AOE. I conclude that, by adopting the most natural interpretation of Brukner’s claims, AOE is exactly equivalent to the conjunction of Brukner’s ‘Observer-independent facts’ with his hidden assumption.
All this makes it clear that the AOE assumption contains two logically
independent parts: (1) the claim that there is a joint distribution for all results and (2) conditions ii and iii, which have to do with the particular case in which Wigner measures in the same basis as his friend. Moreover, it could be argued that, while ref. [
7] take both parts as demands in order to ensure for observed events to be considered observer-independent or absolute, strictly speaking, it is only the first part that is required to enforce absoluteness of observed events. The issue is that, even when Wigner and his friend measure “in the same basis” (e.g.,
), it is never the case that what they are comparing is the result of the exact same measurement (I use the quotations to emphasize the fact that friends and superobservers perform measurements on different systems so, strictly speaking, the bases cannot be the same). As a result, a demand for absoluteness of observed events does not necessarily imply a demand for these two measurements to yield the exact same result.
The issue is that there are alternative ways to describe a measurement by Wigner, when he measures “in the same basis” as his friend. For instance, according to [
6], Alice and Bob might perform a “Z measurement” in basis
with
the state of whole lab
i after the friend measures the spin along
z and finds the result ±. However, according to [
7], if
, Alice opens Charlie’s vault, asks him what he observed and sets her result equal to that of Charlie. Note also that, when the measurements by Alice and Bob are described in [
7], they are treated differently, depending on whether they measure “in the same basis” than the friend or not. As we saw, when they do, the measurements are described as them simply opening the door. However, when they measure in a different basis, it is argued that the measurement can be thought of as one in which Alice and Bob undo the measurement of the friend, and measure the particle again directly. It is clear that this description is available only by assuming that the measurements are
reversible, which might not be the case for some models.
In summary, the AOE assumption contains two independent parts, one demanding for there to be a joint distribution for all results and another demanding consistency between the results of Wigner and his friend when they both measure “in the same basis”. While it might seem strange to forgo the second, such might be a necessary consequence of certain models, i.e., contextual ones (a feature all hidden variable theories must possess). In any case, these two parts are logically independent and it serves us best to make that explicit. Moreover, it might be argued that, while the former does seem to be a necessary demand to make in order to attain objectivity of observed results, the second might be relaxed. (Ref. [
14] does exactly that).
As I mentioned in
Section 2.2, an important, far-reaching claim in [
7] is that their theorem “places strictly stronger constraints on physical reality than Bell’s theorem”. To arrive at this claim, first it is argued that, although usually left implicit, the AOE assumption is necessary for the derivation of Bell’s theorem. Then, it is pointed out that, while the derivation of the LF inequalities require AOE, NSD, and L, the derivation of the Bell inequalities requires AOE, NSD, L and ‘Outcome independence’. Since, according to this reasoning, the LF assumptions are a subset of Bell’s assumptions, it is concluded that it is logically impossible to construct a model that allows violation of the LF inequality, but does not allow violation of Bell’s inequalities.
I contend, though, that, strictly speaking, the AOE assumption is not necessary for the derivation of Bell’s theorem. Therefore, it is not the case that the LF theorem is strictly stronger than Bell’s theorem. As we saw, the AOE assumption involves two independent parts and, while it seems true that demanding for results to be objective is, in fact, a necessary (implicit) supposition behind Bell’s result, a demand of consistency between the results of friends and superobservers, is not required for the derivation of Bell’s theorem. In fact, Bell’s experimental scenario does not even involve friends and superobservers, so any condition constraining the relation between the results of friends and superobservers, seems fully irrelevant.
The last issue I would like to analyze has to do with the wording of the LF theorem. As we saw in
Section 2.2, the construction of the theorem has two parts. First, it is shown that models obeying the LF assumptions satisfy the LF inequalities. Second, it is argued for the following conditional: if the quantum operations required by the proposed experiment can, in principle, be performed, then the quantum predictions violate the LF inequalities. Putting everything together, by defining a ‘physical theory’ as “any theory that correctly predicts the correlations between the outcomes observed by the superobservers Alice and Bob”, the LF theorem is stated as a conditional: if a superobserver can perform arbitrary quantum operations on an observer and its environment, then no physical theory can satisfy the LF assumptions.
I start by inspecting the definition of ‘physical theory’. As we just saw, in [
7], that notion is reserved for any theory that “correctly predicts” the correlations between Alice and Bob. However, what are these “correct predictions”? Since, as already was established, standard quantum mechanics cannot produce actual predictions for the experiment in question, these “correct predictions” cannot be the quantum predictions. Therefore, the “correct predictions” must mean those that coincide with the actual, experimentally observed correlations, whatever they turn out to be. Of course, for all we know, these actual correlations can be anything. Therefore, in order to constrain them, Ref. [
7] employs a conditional to the effect that, if a superobserver can perform arbitrary quantum operations on an observer and its environment, then the quantum predictions violate the LF inequalities. The idea being that, while, in general, it is true that there are no correct quantum predictions for the proposed experiment, if the antecedent of the conditional is fulfilled, i.e., if a superobserver can, in fact, perform arbitrary quantum operations on an observer and its environment, then unambiguous predictions can be produced. Moreover, the idea is that, in that case, such predictions violate the LF inequalities. The problem is that such a conditional is false.
To begin with, I find the operational tone of the antecedent of the conditional vague and inadequate. The antecedent in question is formulated in [
7] in a couple of different ways, one general and one particular to the EWFS in question. The general form, as we saw, asks for superobservers to be able to perform arbitrary quantum operations on an observer and its environment—that is, for quantum evolution (including quantum measurement) to be controllable on such a scale. The particular formulation demands for the quantum operations required by the proposed experiment to be performable, at least in principle. Either way, what is it exactly that the condition demands? Presumably, the idea is something like this. If, for instance, the measurement of one of the friends objectively breaks unitarity, then the superobserver would not be able to perform a quantum operation that maintains the friend (and her environment) in an entangled superposition corresponding to different observational states. That is, the fact that unitary is broken, is supposed to restrict the operations available to the superobserver.
It seems to me, however, that the fact that a measurement of a friend breaks unitarity is better understood as a feature of the internal dynamics of the model in question, and not really as a constraint on what operations the superobservers can or cannot perform. In other words, even if the measurements of the friends break unitarity, the superobservers are perfectly capable of performing the quantum operations required by the proposed experiment, namely, certain measurements on the corresponding labs—i.e., the breakdown of unitarity does not restrict the measurements available to the superobservers. It seems, then, that what the antecedent of the conditional needs to capture is a constraint on how macroscopic systems evolve during measurements. Even though ref. [
7] shies away from using these terms, what it seems to be actually demanded is for the evolution during those measurements to be purely unitary.
More importantly, even if the measurements of the friends involve purely unitary evolution, and the superobservers are able to perform quantum operations that maintain the friends in coherent superpositions, that does not mean that the correlations predicted are going to break the LF inequalities. That is, even if the states of the labs, after the friends measure, are given by what quantum mechanics calls a coherent superposition of different observational states, it is not necessarily the case that the LF inequalities would be broken. As we saw already, standard quantum mechanics cannot make predictions for the EWFS in question. Therefore, to make predictions for them, it is necessary to come up with an alternative, non-standard framework, capable of doing so. The point I want to make is that the fact that one of these frameworks stipulates purely unitary evolution during measurements in no way implies that such a model predicts violations of the LF inequalities. As I said, these models must be non-standard alternatives to the standard framework and, as such, they can possess all sorts of non-standard features, including predictions that do not break the LF inequalities, even if superobservers are able to fully control the quantum states of the friends. We must not forget that the assumption that results are objective, together with the assumption of purely unitary evolution, implies the existence of additional variables [
12]. These additional variables can play a non-trivial role in the calculation of predictions, allowing for predictions that do not break the LF inequalities.
In summary, Ref. [
7] looks for a condition that would guarantee predictions that break the LF inequality and it tries to formulate it in terms of the operations the superobservers are able to perform. The problem is that such a proposal does not work and, more generally, there seems to be no simple condition that would achieve what they are looking for. The truth is that, to make predictions for these scenarios, alternatives to the standard framework are required and different models will make different predictions, without there being a single condition that could determine on which side of the LF inequalities the predictions of a model would land. Of course, what ref. [
7] does show is that models satisfying the LF assumptions do not break the inequalities, but what is lacking is a way to stipulate what models are able to break them.
We can contrast all this with Bell’s theorem. In that case, one has (1) that models satisfying Bell’s assumptions satisfy Bell’s inequality and (2) that standard quantum mechanics unambiguously predicts violations of the inequality. The theorem can then be paraphrased as “the predictions of local models are incompatible with those of quantum mechanics”. In the LF case, in contrast, one shows that models satisfying the LF assumptions satisfy the LF inequalities, but one cannot show that quantum mechanics predicts violations of the inequality—and it is not even clear how to characterize models that violate it. One is then only left with the connection between the LF assumptions and the LF inequality.
Now, it is of course true that, once Bell experiments have been performed, and clear violations of the inequalities have been observed empirically, the fact that standard quantum mechanics violates Bell’s inequality becomes almost irrelevant. With experimental violations of the inequality, what we now have is a much stronger claim, namely, that “the predictions of local models are incompatible with actual experiments”—a claim which is completely independent of quantum mechanics. Could one not do the same with the LF inequalities? The issue is that, unlike Bell’s experiment, the one required to experimentally probe the LF inequalities, while presumably doable in principle, is unrealizable in practice (and will be so for the foreseeable future). It is true, then, that such experiments would set constraints on empirically viable models, but the fact that we are unable to actually perform them greatly diminishes the force behind the LF result.
Summing up, in [
7] it is shown that models satisfying the LF assumptions satisfy the LF inequalities. However, contrary to what it is argued there, it is not the case that such assumptions are strictly weaker that those of Bell. Moreover, since quantum mechanics is unable to make predictions for the EWFS considered, there is no standard violating the LF inequalities with which to contrast models satisfying the LF assumptions. Finally, the proposed experiment is, at this stage, no more than a gedankenexperiment, so the prospects of experimentally constraining LF models seem, at least for a long time, unattainable.