Abstract
We summarize a recent reconstruction of the quantum theory of qubits from rules constraining an observer’s acquisition of information about physical systems. This review is accessible and fairly self-contained, focusing on the main ideas and results and not the technical details. The reconstruction offers an informational explanation for the architecture of the theory and specifically for its correlation structure. In particular, it explains entanglement, monogamy and non-locality compellingly from limited accessible information and complementarity. As a by-product, it also unravels new ‘conserved informational charges’ from complementarity relations that characterize the unitary group and the set of pure states.
1. Introduction
Why is the physical world described by quantum theory? If we wish to sensibly address this question, we have to step beyond quantum theory and to consider it within a landscape of alternative theories. This, after all, permits us to ponder about how the world could have been different, possibly described by modifications of quantum theory. Such an endeavor forces us to leave the usual textbook formulation of quantum theory, and everything we take for granted about it, behind and to develop a more general language that also applies to alternative theories. Ideally, this language should be operational, encompassing the interactions of some observer with physical systems in a plethora of conceivable, physically-distinct worlds.
If we wish to also provide a possible answer to the above question, we then have to find physical properties of quantum theory that single it out, at least within the given landscape of alternatives. In particular, the goal should be to find an operational justification for the textbook axioms, i.e., ultimately for complex Hilbert spaces, unitary dynamics, tensor product structure for composite systems, Born rule, and so on. The result would be a reconstruction of quantum theory from operational axioms [1,2,3,4,5,6,7,8,9,10] and should ideally yield a better understanding of what quantum theory tells us about Nature; and why it is the way it is.
In this manuscript, we shall review and summarize how the quantum formalism for arbitrarily many qubits can be reconstructed from operational rules restricting an observer’s acquisition of information about a set of observed systems [1,2]. The goal of this summary is to provide a didactical and easily-accessible overview of this reconstruction. Its underlying framework is especially engineered for unraveling the architecture of quantum theory, and so many reconstruction steps are instructive for understanding the origin of quantum properties. As we shall see, this reconstruction provides a transparent, informational explanation for the structure of qubit quantum theory and especially also for its paradigmatic features, such as entanglement, monogamy and non-locality. The approach also produces novel ‘conserved informational charges’, indeed appearing in quantum theory, that turn out to characterize the unitary group and the set of pure states and which might find practical applications in quantum information.
The premise of the summarized approach is to only speak about information that the observer has access to. It is thus purely operational and survives without any ontological commitments. This approach is inspired, in part, by Rovelli’s relational quantum mechanics [11] and the Brukner–Zeilinger informational interpretation of quantum theory [12,13]; this successful reconstruction can be viewed as a completion of these ideas for qubit systems.
2. Overview of a Landscape of Theories
We shall begin with an overview of a landscape of alternative theories, which has been developed in [1,2] to which we also refer for further details.
2.1. From Questions and Answers to Probabilities and States
Our first aim is to define a notion of a state both for a single system and an ensemble of systems.
Consider an observer O who interrogates an ensemble of (identically prepared [1]) systems , coming out of a preparation device, with binary questions from some set . For example, in the case of quantum theory, such a question could read “is the spin of the electron up in x-direction?” This set shall only contain repeatable questions in the sense that O will receive times the same answer whenever asking any m times in immediate succession to a single system . We shall assume any to always give a definite answer if asked some , which moreover is not independent of ’s preparation. Accordingly, can only contain physically-implementable questions, which are ‘answerable’ by the and not arbitrary logically conceivable binary questions. Furthermore, since we assume definite answers, we do not address the measurement problem. The answers to the given by the shall follow a specific statistics for each way of preparing the (for n sufficiently large). The set of all the possible answer statistics for all for all preparations is denoted by Σ.
O, being a good experimenter, has developed, through his experiments, a theoretical model for and Σ which he employs to interpret the outcomes of his interrogations (and to decide whether a question is in or not). This permits O to assign, for the next to be interrogated, a prior probability that ’s answer to will be ‘yes’. Namely, O determines through a belief updating—in a broadly Bayesian spirit—according to his model of Σ, any prior information on the way of preparation and possibly to the frequencies of ‘yes’ answers to questions from , which he may have recorded in previous interrogation runs on systems identically prepared to . (We add “broadly” here as we also consider the typical laboratory situation of an ensemble of systems.) In particular, O may also not have carried out previous interrogations on systems identically prepared to (e.g., if the ensemble contains only the single ) in which case, he will estimate the prior for the single solely according to his model of Σ and any prior information about the preparation (more on this and update rules will be discussed in Section 2.3 and Section 2.4).
While need not necessarily contain all binary measurements that O could, in principle, perform on the , we shall assume that is ‘tomographically complete’ in the sense that the are sufficient to compute the probabilities for all other physically realizable measurements possibly not contained in the , as well. Hence, the encode everything O could possibly say about the future outcomes to arbitrary experiments on the in his laboratory. It will therefore be sufficient to henceforth restrict O to acquire information about the solely through the . It is also natural to identify O’s ‘catalog of knowledge’ about the given , i.e., the collection of , with the state of relative to O. This is a state of information and an element of Σ. Conversely, any element in Σ assigns a probability to all . Thus, we identify Σ with the state space of .
The state is the prior state for the single to be interrogated next, but also coincides with the state O assigns to the ensemble (which may only contain a single member) given that its members are identically prepared [1].
2.2. Time Evolution of O’s “Catalog of Knowledge”
We permit O to subject the to interactions, which cause a state at time to evolve in time to another legitimate state. Any permitted time evolution shall be temporally translation invariant, thus defining a one-parameter map from Σ to itself, which only depends on the time interval , but not on . We denote by the set of all time evolutions to which we allow O to expose the .
Clearly, is a further crucial ingredient of O’s world model; his model for describing his interrogations with the is thus encoded in the triple .
2.3. Convexity and State of No Information
It will be our challenge to unravel what O’s world model is. This requires us to subject the triple to a number of further operational conditions that are ‘natural’ in the context of information acquisition with a broadly Bayesian spirit. Upon imposing the quantum postulates, this will turn out to restrict and to incorporate only a ‘natural’ subset of all possible quantum measurements and time evolutions, namely projective binary measurements and unitaries, respectively (rather than arbitrary positive operator-valued measures (POVMs) and completely positive maps). However, this suffices for our purposes to reconstruct the textbook quantum formalism.
To account for the possibility of randomness in the method of preparation, we assume Σ to be convex. Consider a collection of identical systems (i.e., with identical ) that are not necessarily in identical states and for which O uses a cascade of biased coin tosses to decide which system to interrogate. Then O is enabled to assign a single prior state to this collection, which is a convex combination of their individual states.
Next, we assume the existence of a special method of preparation, which generates even completely random answer statistics over all . This preparation is described by a special state in Σ, namely , , and shall be called the state of no information. This distinguished state is a constraint on the pair . (E.g., in quantum theory, the pair does not satisfy this condition because there exist inherently biased POVMs, while does.) It plays two crucial roles: it defines (1) the prior state of that O will start with in a Bayesian updating when he has no ‘prior information’ about the (except what his model is); and (2) an unambiguous notion of the (in-)dependence of questions (cf. Section 2.4), which otherwise would be state dependent. (E.g., in quantum theory, the questions “Is the spin of Qubit 1 up in x-direction?” and “Is the spin of Qubit 2 up in x-direction?” are independent relative to the completely mixed state, however not relative to a state with entanglement in x-direction.)
2.4. State Updating and (In)Dependence and Compatibility of Questions
There are two kinds of state update rules, one for the state of the ensemble (which coincides with the prior state assigned to the next to be interrogated) and one for the posterior state of a given ensemble member . In a single shot interrogation, O receives a single , assigns a prior state to it according to his prior information (cf. Section 2.1), interrogates it with some questions from (without intermediate re-preparation) and, depending on the answers, updates the prior to a posterior state valid for this specific only. This requires a consistent posterior state update rule, which permits O to update the probabilities for all in a manner that respects the structure of Σ and the repeatability of questions (i.e., an answer ‘yes’ or ‘no’ must have a posterior or 0 as a consequence, respectively). This is also a belief updating, but about the single , and is not the same as in Section 2.1 and Section 2.3. Specifically, the posterior state of may differ significantly from its prior state if O has experienced an information gain on at least some (this will necessarily happen when complementary questions are involved; see below). This is the ‘collapse’ of the state: it is merely O’s update of information about the specific [1].
By contrast, in a multiple shot interrogation, O carries out a single shot interrogation on each member of an entire (identically prepared [1]) ensemble to do ensemble state tomography and estimate the state of the ensemble from his/her prior information about the preparation and the collection of posterior states from the single shot interrogations. With every further interrogated , O updates the ensemble state, which coincides with the prior state of the next system from the ensemble to be interrogated. Accordingly, this requires a prior state update rule. This is the belief updating alluded to in Section 2.1 and Section 2.3 about the ensemble .
It will not be necessary to specify these two update rules in detail; we just assume O uses consistent ones. Specifically, given a posterior state update rule, we shall call
(One can also define partial compatibility similarly [1].) These relations shall be symmetric; e.g., is independent of if and only if is independent of , etc.
| (maximally) independent | if, after having asked to S in the state of no information, the posterior probability . That is, if the answer to relative to the state of no information tells O ‘nothing’ about the answer to . |
| dependent | if, after having asked to S in the state of no information, the posterior probability (if or 1, they are maximally dependent). That is, if the answer to relative to the state of no information gives O at least partial information about the answer to . |
| (maximally) compatible | if O may know the answers to both simultaneously, i.e., if there exists a state in Σ such that can be simultaneously zero or one. |
| (maximally) complementary | if every state in Σ, which features , necessarily implies . Notice that complementarity implies independence (but not vice versa). |
We impose a final condition on the posterior state update rule: if are maximally compatible and independent, then asking shall not change , i.e., O’s information about .
2.5. Informational Completeness
The fundamental building blocks of the theories in the landscape that we are constructing are to be sets of pairwise independent questions. This will help to render the convoluted parametrization of a state by more economical. Consider a set of pairwise independent questions ; it is called maximal if no question from can be added to without destroying the pairwise independence of its elements. We shall assume that any maximal is informationally complete in the sense that all can be computed from the corresponding probabilities for all states in Σ. Any such features D elements [1] such that Σ becomes a D-dimensional convex set and states become vectors:
2.6. Information Measure
Our focus is O’s acquisition of information, so we need to quantify O’s information about the systems. Since is binary, we quantify O’s information about ’s answer to it by a function with bit and bit ⇔ and bit. O’s total information about a must be a function of the state; we make an additive ansatz:
The quantum postulates will single out the specific function α.
Consider a set of mutually (maximally) complementary questions. It is clear that whenever O has maximal information bit about from this set, he must have zero bits of information about all other questions in the set. We require more generally that such a set cannot support more than one bit of information, regardless of the state:
for otherwise O could, for some states, reduce his total information about such a set by asking another question from it. These complementarity inequalities represent informational uncertainty relations that describe how the information gain about one question enforces an information loss about questions complementary to it (see also the state ‘collapse’ in Section 2.4).
2.7. Composite Systems and (Classical) Rules of Inference
O must be able to tell a composite system apart into its constituents purely by means of the information accessible to him through interrogation and thus ultimately by means of the question sets. Let systems have question sets . It is then natural to say that they define a composite system if any is maximally compatible with any and if:
where only contains composite questions, which are iterative compositions, , via some logical connectives , of individual questions about and about . This definition is extended recursively to composite systems with more than two subsystems.
Since O can never test the truthfulness of statements about the logical connectives of complementary questions through interrogations and since all propositions must have operational meaning, we shall permit O to logically connect two (possibly composite) questions directly with some * only if they are compatible. For the same reason, O is allowed to apply classical rules of inference (in terms of Boolean logic) exclusively to sets of mutually-compatible questions.
We stress that this definition of composite systems is distinct from the usual state tensor product rule in generalized probabilistic theories coming from local tomography [3,4,5]. In particular, this composition rule admits non-locally tomographic composites (see Section 4.3).
2.8. Computing Probabilities and Questions as Vectors
Thanks to informational completeness, the probability function that ‘yes’, given the state , exists for all and . As shown in [2], the exhibited structure yields:
where is a question vector encoding and is a vector with each coefficient equal to one in the basis corresponding to . This equation gives rise to (part of) the Born rule.
Suppose were both encoded by the same . Then, by (4), they would be probabilistically indistinguishable, and O must view them as logically equivalent. O is free to remove any such redundancy from his description of upon which every permissible question vector will encode a unique . Finally, for every , there exists a state , which is the updated posterior state of after O received a ‘yes’ answer to the single question Q from in the (prior) state of no information. O had zero bits of information before, and encodes a single independent question answer, so we naturally require that it encodes one independent bit. Hence, for every , there exists with bit, such that . (In quantum theory, the will only turn out to be pure states for a single qubit; e.g., for two qubits and ‘Is the spin of Qubit 1 up in z-direction?’, represented by the rank-two projector , corresponds to the mixed state . Clearly, .)
3. The Quantum Principles as Rules Constraining O’s Information Acquisition
In the sequel, we consider the most elementary of information carriers. Within the introduced landscape of theories, we now establish rules on O’s acquisition of information that single out the quantum theory of a composite system of qubits, modeled in our language by a triple . Effectively, these rules constitute a set of ‘coordinates’ for quantum theory on this landscape. The rules are spelled out first colloquially, then mathematically and are motivated in more detail in [1,2].
Empirically, the information accessible to an experimenter about (characteristic properties of) elementary systems is limited. For example, an experimenter may know one binary proposition about an electron (e.g., its spin in x-direction), but nothing fully independent of it (and similarly for a classical bit). We shall characterize a composition of N elementary systems according to how much information is, in principle, simultaneously available to O.
Rule 1.
(Limited information) “The observer O can acquire maximally independent bits of information about the system at any moment of time.”
There exists a maximal set , , of N mutually maximally independent and compatible questions in .
O can thereby distinguish maximally states of in a single shot interrogation.
However, empirically, elementary systems admit more independent propositions than what, due to the information limit, they are able to answer at a time. This is Bohr’s complementarity. The unanswered properties must be random (and so ‘in superposition’) because the information limit makes it impossible to ascribe definite outcomes to them. For example, an experimenter may also inquire about the spin of the electron in y-direction. Yet doing so is at the total expense of his information about its spin in the x- and z-directions, and subsequent such measurements have random outcomes. For the N elementary systems, we assert the existence of complementarity.
Rule 2.
(Complementarity) “The observer O can always get up to N new independent bits of information about the system . However, whenever O asks a new question, he experiences no net loss in his total amount of information about .”
There exists another maximal set , , of N mutually maximally independent and compatible questions in , such that are maximally complementary and are maximally compatible.
The peculiar mathematical form of Rule 2 becomes intuitive upon recalling that is a composite system, such that complementarity should exist per elementary system [1].
Rules 1 and 2 are conceptually inspired by (non-technical) proposals made by Rovelli [11] and Zeilinger and Brukner [12,13]. These rules say nothing about what happens in-between interrogations. Naturally, we demand O not to gain or lose information without asking questions.
Rule 3.
(Information preservation) “The total amount of information O has about (an otherwise non-interacting) is preserved in-between interrogations.”
is constant in time in-between interrogations for (an otherwise non-interacting) .
Hence, O’s total information is a ‘conserved charge’ of any time evolution .
The more interactions to which O may subject are available, the more ways in which any state may, in principle, change in time and, thus, the more ‘interesting’ O’s world. We therefore demand that any time evolution is physically realizable as long as it is consistent with the other rules (since are interdependent, this is distinct from ‘maximizing the number’ of states).
Rule 4.
(Time evolution) “O’s ‘catalog of knowledge’ about evolves continuously in time in-between interrogations, and every consistent such evolution is physically realizable.”
is the maximal set of transformations on states such that, for any fixed state , is continuous in and compatible with Principles 1–3 (and the structure of the theory landscape).
(If we did not require this ‘maximality’ of , we would still ultimately obtain a linear, unitary evolution, but not necessarily the full unitary group. This is the sole reason for demanding ‘maximality’. Note that Principles 3 and 4 are not equivalent to the axiom of ‘continuous reversibility’ of generalized probabilistic theories [3,4,5].)
We shall also allow O to ask any question to which ‘makes (probabilistic) sense’.
Rule 5.
(Question unrestrictedness) “Every question that yields legitimate probabilities for every way of preparing is physically realizable by O.”
Every question vector that satisfies and for which there exists with bit, such that corresponds to a .
(Without Principle 5, we would still obtain the structure of an informationally complete set , finding that it encodes a basis of projective Pauli operator measurements [2]; Principle 5 legalizes all such measurements.)
These five rules turn out to leave two solutions for the triple . Remarkably, they cannot distinguish between complex and real numbers. Namely, the two solutions are qubit and rebit quantum theory, i.e., two-level systems over real Hilbert spaces [1,2]. Since the latter is both mathematically and physically a subcase of the former, these five rules can be regarded as sufficient. However, if one also wishes to discriminate rebits operationally, then an extra rule, adapted from [3,4,5] and imposed solely for this purpose (it is partially redundant), succeeds.
Rule 6.
(Tomographic locality) “O can determine the state of the composite system by interrogating only its subsystems.”
As shown in [1,2], Rules 1–6 are equivalent to the textbook axioms. More precisely:
Claim.
The only solution to Rules 1–6 is qubit quantum theory where:
- is the space of density matrices over ,
- states evolve unitarily according to and the equation describing the state dynamics is (equivalent to) the von Neumann evolution equation,
- is (isomorphic to) the set of projective measurements onto the eigenspaces of N-qubit Pauli operators (a Hermitian operator on is a Pauli operator iff it has two eigenvalues of equal multiplicity), and the probability for to be answered with ‘yes’ in some state is given by the Born rule for projective measurements.
4. Synopsis of the Reconstruction Steps and Key Results
Since this gives rise to a constructive derivation of the explicit architecture of qubit quantum theory, it involves a large number of individual steps compared to the rather abstract reconstructions [3,4,5,6,7,8,9,10]. However, this is also rewarding as it offers novel informational explanations for typical features of quantum theory, and so many reconstruction steps are actually quite instructive. We now provide a summary of key results and reconstruction steps from [1,2] (to which we refer for technical details) needed for proving the claim of the previous section.
4.1. Logical Connectives for Building Informationally Complete Sets
The first task is to build informationally complete sets [1]. The conjunction of Rules 1 and 2 implies that for a single elementary system must be a maximal mutually complementary set with . We changed notation slightly compared to rules 1 and 2, labeling complementary questions by numbers, not primes. Of course, in quantum theory, ; the more involved case will entail this. The structure (3) of a composite system implies that should contain individual questions about its subsystems. Continuing with a slight change of notation, we denote for System 1 by and for System 2 with a prime by . Apart from these individual questions, should contain composite questions for some connective *. Pairwise independence of enforces that * must satisfy the following truth table, where ‘yes’ and ‘no’ ( are compatible) [1]:
Hence, * is either the XNOR ↔ (for , ) or its negation, the XOR ⊕ (for , ). Up to an overall negation ¬, the two connectives are logically equivalent, and so, we henceforth make the convention to only build up composite questions (for informationally complete sets) using the XNOR. The composite question is a ‘correlation question’, representing “are the answers to the same?.” Ultimately, in quantum theory, ↔ will turn out to correspond to the tensor product ⊗ in where is a Pauli matrix; will then correspond to “are the spins of Qubit 1 in the i- and of Qubit 2 in the j-direction correlated?.”
4.2. Question Graphs, Independence and Compatibility for and Entanglement
It is convenient to represent questions graphically: individual questions are represented as vertices and bipartite correlation questions as edges between them. For instance, we may have:
Since O is only allowed to connect compatible questions logically, there can be no edge between individual questions of the same system.
Since O is only allowed to connect compatible questions logically, there can be no edge between individual questions of the same system.Using only Rules 1 and 2 and logical arguments, the following result is proven in [1]:
Lemma 1.
are pairwise independent for all and will thus be part of an informationally complete set . Furthermore:
- (i)
- is compatible with , and complementary to , and . That is, graphically, an individual question is compatible with a correlation question if and only if its corresponding vertex is a vertex of the edge corresponding to . By symmetry, the analogous result holds for .
- (ii)
- and are compatible if and only if and . That is, graphically, and are compatible if their corresponding edges do not intersect in a vertex and complementary if they intersect in one vertex.
For example, in the third question graph above is compatible with and complementary to , while and are compatible and and are complementary.
This lemma has a striking consequence: it implies entanglement. Indeed, since, e.g., and are independent and compatible, O may spend his maximally accessible amount of independent bits of information (Rule 1) over correlation questions only. Since non-intersecting edges do not share a common vertex, the lemma implies that no individual question is simultaneously compatible with two correlation questions that are compatible. Hence, when knowing the answers to , O will be entirely ignorant about the individual questions; O has then maximal information about , but purely composite information. This is entanglement in the very sense of Schrödinger (“...the best possible knowledge of a whole does not necessarily include the best possible knowledge of all its parts...” [14]). For example, in quantum theory, a state with ‘yes’ will coincide with a Bell state having the spins of Qubits 1 and 2 correlated in x- and y-direction (and anti-correlated in z-direction). Of course, there is nothing special about , and the argument works similarly for other composite question pairs and can be extended also to states with non-maximal entanglement (see [1] for details).
For systems with limited information content, entanglement is therefore a direct consequence of complementarity; without it there would be no independent and compatible composite questions sufficient to saturate the information limit [1]. For instance, two classical bits satisfy Rule 1, as well, but admit no complementarity so that and the maximum amount of independent bits cannot be spent on composite questions only.


We also note that Rules 1 and 2 offer a simple, intuitive explanation for monogamy of entanglement. Consider, for a moment, elementary systems , and suppose and are maximally entangled (say, because O received the answer ‘yes’ from ). Noting that is a composite bipartite system inside the tripartite , O has then already spent his maximal amount of information of independent bits, which he may know about and can therefore not know anything else that is independent, including non-trivial correlations with , about the pair. To saturate the independent bit limit for the tripartite system , he may then only inquire about individual information about . This is monogamy in its extreme form: the maximally entangled pair cannot be entangled with any other system . This heuristic argument can be made rigorous in terms of the compatibility and independence structure of questions for and can be extended to the non-extremal case using informational monogamy inequalities [1].
4.3. A Logical Explanation for the Three-Dimensionality of the Bloch Ball
A key result of the reconstruction, proven in [1] is the following. Since its proof is instructive and representative for this approach, we shall rephrase it here.
Theorem 1.
or 3.
Proof.
Consider the case. Lemma 1 implies that any maximal set of pairwise compatible correlation questions has elements. Indeed, there are maximally non-intersecting edges between the vertices of System 1 and the vertices of System 2; e.g., the ‘diagonal’ :
are pairwise independent and compatible. The constraints on the posterior state update rule in Section 2.4 entail that they are also mutually compatible (Specker’s principle) [1] such that O may simultaneously know the answers to all . Since O may not know more than independent bits (Rule 1), the cannot be mutually independent if . Thus, assuming the are of equivalent status, the answers to any pair of them, say , must imply the answers to all others, say , . Hence, , , for a connective * that preserves pairwise independence of . Reasoning as in (5) implies that either:
so that for , could not be pairwise independent. Arguing identically for all other sets of pairwise independent and compatible , we conclude that . ☐
are pairwise independent and compatible. The constraints on the posterior state update rule in Section 2.4 entail that they are also mutually compatible (Specker’s principle) [1] such that O may simultaneously know the answers to all . Since O may not know more than independent bits (Rule 1), the cannot be mutually independent if . Thus, assuming the are of equivalent status, the answers to any pair of them, say , must imply the answers to all others, say , . Hence, , , for a connective * that preserves pairwise independence of . Reasoning as in (5) implies that either:
This theorem has several crucial repercussions. We may already suggestively call and the ‘rebit’ (two-level systems over real Hilbert spaces) and ‘qubit’ case, respectively. Reasoning as in (6) shows that the are logically closed under ↔; as demonstrated in [1]:
Theorem 2.
If , then is logically closed under ↔ and, thus, constitutes an informationally complete set for with .
If , then is logically closed under ↔ and, thus, constitutes an informationally complete set for with . Furthermore, is complementary to the individual questions , .
Indeed, are the correct numbers of degrees of freedom for rebits and qubits, respectively. However, since the composite question is complementary to all individual questions in the rebit case (this is not true in the qubit case!), it is impossible for O to do ensemble state tomography by asking only individual questions , thereby violating Rule 6. We are left with the qubit case and shall henceforth ignore rebits (for rebits see [1]).
4.4. Ruling out Local Hidden Variables and the Correlation Structure for
Using (6) and repeating the argument leading to it for ‘non-diagonal’ show that either:
The first case (without relative negation) is the case of classical logic and compatible with local hidden variables for the individual questions . Namely, note that can be rewritten in terms of the individuals as:
Suppose for a moment that had simultaneous definite values (although not accessible to O). It is easy to convince oneself that any distribution of simultaneous truth values over the satisfies (8) [1]. In fact, (8) is a classical logical identity and can be argued to follow from classical rules of inference [1]. However, it involves complementary individual questions, thereby violating our premise from Section 2.7 that O may apply classical rules of inference exclusively to mutually compatible questions. This classical case is thus ruled out.
One can check that the second case, , does not admit a local hidden variable interpretation, but is consistent with the structure of the theory landscape and rules [1]. Since one of the two cases (7) must be true, we conclude that this second case holds. In fact, for any complementary pairs and such that both Q and are compatible with both , one finds similarly [1]:
This precludes to reason classically about the distribution of truth values over O’s questions.
Equation (9) permits us to unravel the complete correlation structure for . In fact, it turns out that there are two distinct representations of this correlation structure: one corresponding to quantum theory in its standard representation, the other to its ‘mirror’ representation, related by a passive (not a physical) transformation, reassigning (in quantum theory tantamount to a partial transpose on qubit 1) [1]. The two distinct representations turn out to be physically equivalent, and so, a convention has to be made. Choosing the ‘standard’ case and using (9), one finds that the compatibility and correlation structure of can be represented graphically as in Figure 1. For compatible, we shall henceforth distinguish between:
| even correlation: | if and |
| odd correlation: | if . |
Figure 1.
The compatibility and correlation structure of the informationally complete set for the qubit case. Two questions are compatible if connected by a triangle edge and complementary otherwise. Red and green triangles denote odd and even correlation, respectively; e.g., . (Taken from [1].)
One can easily check that quantum theory satisfies this correlation structure for projective spin measurements if one replaces by . For instance, ‘yes’ implies, by Figure 1, the dependent ‘no’. In quantum theory, this corresponds to the (unnormalized) Bell state with spin correlation in the x- and y-direction and anti-correlated spins in the z-direction:
4.5. Compatibility, Independence and Informational Completeness for Arbitrary N
Consider N elementary systems in the ‘qubit’ () case and the XNOR conjunction:
of individual questions, where and ‘yes’. The conjunction yields ‘yes’ and ‘no’ if an even and odd number of ‘no’, respectively, and thus, does not represent “are the answers to all the same?.” As shown in [1], these conjunctions are informationally complete:
Theorem 3.
(Qubits) The questions , (we deduct the trivial question ), are pairwise independent and logically closed under ↔ and, thus, form an informationally complete set with . Moreover, and are compatible if they differ by an even number (including zero) of non-zero indices and complementary otherwise.
We note that an N-qubit density matrix has precisely degrees of freedom.
4.6. Linear, Reversible Time Evolution and a Quadratic Information Measure
Thus far, the summarized results invoked only Rules 1 and 2 (and in one instance, Rule 6). Rules 3 and 4, on the other hand, can be demonstrated to entail a linear and reversible evolution of the generalized Bloch vector that already appeared in (4),
where defines a one-parameter matrix group [1]. Suppose correspond to two distinct interactions to which O may subject . By Rule 4, must likewise be contained in , and since both are invertible, also the entire set must be a group. We shall henceforth often represent states with Bloch vectors .
Rules 3 and 4, together with elementary operational conditions on the information measure, enforce it to be quadratic so that O’s total information (1):
is simply the square norm of the Bloch vector [1]. Interestingly, this derivation would not work without the continuity of time evolution (Rule 4). Crucially, (12) is not the Shannon entropy (see [1] for a discussion about why the Shannon entropy is also conceptually not suitable for quantifying O’s information). This reconstruction thereby corroborates an earlier proposal for a quadratic information measure for quantum theory by Brukner and Zeilinger [13,15,16].
This quadratic information measure becomes key for the remaining steps of the reconstruction. Given that (12) is a ‘conserved charge’ of time evolution (rule 3), we can already infer that (4 −1) because time evolution must be connected to the identity.
4.7. Pure and Mixed States
Suppose O knows ’s answers to N mutually compatible questions from , thereby saturating the information limit of N independent bits (Rule 1). He will then also know the answers to each of their bipartite, tripartite, ..., and N-partite XNOR conjunctions which, by Theorem 3, are also in (and compatible). In total, he then knows the answers to:
questions from . Thus, O’s total information (12) is bits in this case. It contains dependent bits of information because the questions in are pairwise, but not all mutually independent. Thanks to Rule 3, this is invariant under time evolution.
This allows us to distinguish two kinds of states [1]; is called a:
- pure state:
- if it is a state of maximal information and, hence, of maximal length:
- mixed state:
- if it is a state of non-maximal information,
As can be easily checked, quantum theory satisfies this characterization. In particular, an N-qubit density matrix, corresponding to a pure state, has a Bloch vector with square norm equal to . This peculiar mathematical fact now has a clear informational interpretation.
4.8. The Bloch Ball and Unitary Group for a Single Qubit from a Conserved Informational Charge
Since (cf. Section 4.3), we have that is a maximal set of mutually complementary questions, i.e., no further can be added to without destroying mutual complementarity in the set (cf. Section 4.1). According to (13), a pure state satisfies:
For later, we thus observe: for pure states, the maximal mutually complementary set carries exactly 1 bit of information, and this is a conserved charge of time evolution (Rule 3).
Rule 1 implies that, e.g., the pure state exists in , and we know . However, it is clear that applying any to , according to (11), yields only states that are also compatible with all Rules 1–3 (and the landscape). Hence, by Rule 4, we must actually have . Clearly, then generates all quantum pure states from , i.e., it yields the entire Bloch sphere (the image of any legal state under a legal time evolution is also a legal state). Recalling that is convex, we obtain that is the entire unit Bloch ball with mixed states (14) lying inside; the completely mixed state equals the state of no information at the center. coincide exactly with the set of density matrices and the set of unitary transformations , , respectively, for a single qubit in its adjoint (i.e., Bloch vector) representation, where is the vector of Pauli matrices. Finally, from the assumptions in Section 2.8 and Rule 5, it is also clear that . This coincides with the set of projectors onto the eigenspaces of the Pauli operators . Noting that:
we also recover that (4) yields the Born rule for projective measurements. We thus have the claim of Section 3 for (for details see [1,2]).
4.9. Unitary Group and Density Matrices for Two Qubits from Conserved Informational Charges
Also for , it is rewarding to consider maximal mutually complementary sets within . Using Lemma 1, one can check that there are exactly six maximal complementarity sets containing five questions and twenty containing three [2]; e.g., two graphical representatives are:
The six maximal complementarity sets of five elements can be represented as a lattice of pentagons; see Figure 2 (which also contains four green triangles, each representing one of the twenty maximal complementarity sets of three questions) [2].
The six maximal complementarity sets of five elements can be represented as a lattice of pentagons; see Figure 2 (which also contains four green triangles, each representing one of the twenty maximal complementarity sets of three questions) [2].
Figure 2.
The six maximal complementarity sets represented as pentagons. Two questions are complementary if they share a pentagon or are connected by an edge and compatible otherwise. Every pentagon is connected to all of the other five because any is contained in precisely two pentagons. The red arrows represent the information swap (21) between Pentagons 1 and 2 that preserves all pentagon equalities (18) and defines the time evolution generator (22). (Figure adapted from [2]. Reprinted with permission from [P. Höhn and C. Wever, Phys. Rev. A95, 012102 2017.] Copyright (2017) by the American Physical Society.)
Each of these sets has to satisfy the complementarity inequalities (2); specifically for the information carried by the five questions in pentagon a. Since any is contained in precisely two pentagons (cf. Figure 2), we find:
Noting that for pure states bits thus produces the pentagon equalities [2]:
Any pure state must satisfy (18), and evolves pure states to pure states (Rule 3). Hence, in analogy to : for pure states, these six maximal mutually complementary sets carry exactly one bit of information, and these are six conserved charges of time evolution. There are further interesting constraints on the distribution of O’s information over [2].
It can be straightforwardly checked that quantum theory actually satisfies (18). Indeed, in the case of quantum theory, the identity for reads in more familiar language (pure states):
etc. Remarkably, these identities of quantum theory seem not to have been reported before in the literature. These novel conserved informational charges are a prediction of our reconstruction, underscoring the benefits of taking this informational approach. Additionally, these informational charges are indispensable for deriving the unitary group and the state space, as we shall now see.
Using that is conserved under entails (with new index ):
where for [2]. The correlation structure of Figure 1 enforces [2]:
Each of the 15 is complementary to eight others, and since , there could be maximally 60 linearly independent of .
These are constructed as follows. For every pair of pentagons, there is a unique information swap transformation that preserves (18). For instance, the red arrows in Figure 2 represent the complete information swap between pentagons and (⟷ is not the XNOR):
that keeps all other components fixed. (18) are preserved because every swap in (21) occurs within a pentagon. The correlation structure of Figure 1 fixes the corresponding generator to [2]:
One can repeat the argument for all 15 pentagon pairs, producing 15 linearly independent generators [2]. Remarkably, they turn out to coincide exactly with the adjoint representation of the 15 fundamental generators of [2]. In particular, (22) is the generator of entangling unitaries leaving invariant. The other 45 independent generators satisfying (20) are ruled out by the correlation structure so that cannot be generated by anything else than these 15 pentagon swaps [2]. One can show that the exponentiation of (linear combinations of) these 15 pentagon swaps generates and that this group abides by all rules and forms a maximal subgroup of [2]. Rule 4 then implies , which is the correct set of unitary transformations , , for two qubits.
It turns out that the set of Bloch vectors satisfying all six pentagon equalities (18) and the conservation equations (19) for the 15 pentagon swaps splits into two sets on each of which acts transitively [2]. These two sets correspond precisely to the two possible conventions of building up composite questions either using the XNOR or XOR (cf. Section 4.1) and are therefore physically equivalent. Adhering to the XNOR convention, we conclude that the surviving set of Bloch vectors solving (18) and (19) is the set of states admitted by the rules. Indeed, it coincides exactly with the set of quantum pure states, which forms a of which is the isometry group [2]. Employing convexity of , one finally finds:
which is exactly the set of normalized density matrices over .
Concluding, the new conserved informational charges (18), in analogy to (15) for , define both the unitary group and the set of states for two qubits (for neglected details, see [2]).
4.10. Unitaries and States for Elementary Systems
According to Theorem 3, is (4 −1)-dimensional and (4 −1) (cf. Section 4.6). The reconstruction of the unitary group uses a universality result from quantum computation: two-qubit unitaries (between any pair) and single-qubit unitaries generate the full projective unitary group (2) for N qubits [17,18]. Given that is a composite system, all of these bipartite and local unitaries must be in . One can check that (2) again abides by all rules and constitutes a maximal subgroup of (4 − 1) [2]. Thanks to Rule 4, this yields (2), which coincides with the set of unitary transformations on N-qubit density matrices. In analogy to the previous case, one obtains as the state space:
which agrees with the set of normalized N-qubit density matrices (for details, see [2]).
4.11. Questions as Projective Measurements and the Born Rule
The assumptions in Section 2.8 and Rule 5 yield the following question set characterization [2]:
As shown in [2], this set is isomorphic to the set of projectors onto the eigenspaces of the Pauli operators , where and . Noting that corresponds to (10) reveals that the XNOR at the question level corresponds to the tensor product ⊗ at the operator level. One also finds that (16) again holds, such that (4) yields the Born rule for projective measurements for arbitrary N (for the neglected details and many further interesting properties of , we refer to [2]).
4.12. The von Neumann Evolution Equation
We thus obtain qubit quantum theory in its adjoint (i.e., Bloch vector) representation. Lastly, we note that with (2) is equivalent to the adjoint action:
of for some Hermitian operator H on , where [2]. (24), in turn, is equivalent to solving the von Neumann evolution equation:
We have therefore also recovered the correct time evolution equation for quantum states.
5. Conclusions
We have reviewed and summarized the key steps from [1,2] necessary to prove the claim of Section 3. This yields a reconstruction of the explicit formalism of qubit quantum theory from rules constraining an observer’s acquisition of information about a system [1,2]. The derivation corroborates the consistency of interpreting the state as the observer’s ‘catalog of knowledge’ and shows that it is sufficient to speak only about the information accessible to him for reproducing quantum theory. In fact, for qubits, this derivation accomplishes an informational reconstruction of the type proposed in Rovelli’s relational quantum mechanics [11] and in the Brukner-Zeilinger informational interpretation of quantum theory [12,13].
As a key benefit, this reconstruction also provides a novel informational explanation for the architecture of qubit quantum theory. In particular, it explains the logical structure of a basis of spin measurements, the dimensionality and structure of quantum state spaces, the correlation structure and the unitarity of time evolution from the perspective of information acquisition. This unravels previously unknown structural properties: conserved ‘informational charges’ from complementarity relations define and explain the unitary group and the set of pure states.
Acknowledgments
The author thanks Christopher S. P. Wever for an enjoyable collaboration on [2]. The project leading to this publication has received funding from the European Union’s Horizon 2020 research and innovation program under the Marie Sklodowska-Curie Grant Agreement No. 657661.
Conflicts of Interest
The author declares no conflict of interest.
References
- Höhn, P.A. Toolbox for reconstructing quantum theory from rules on information acquisition. arXiv, 2014; arXiv:1412.8323. [Google Scholar]
- Höhn, P.A.; Wever, C.S.P. Quantum theory from questions. Phys. Rev. A 2017, 95, 012102. [Google Scholar] [CrossRef]
- Hardy, L. Quantum Theory From Five Reasonable Axioms. arXiv, 2001; arXiv:quant-ph/0101012. [Google Scholar]
- Dakic, B.; Brukner, C. Quantum Theory and Beyond: Is Entanglement Special? In Deep Beauty; Halvorson, H., Ed.; Cambridge University Press: Cambridge, UK, 2011; p. 365. [Google Scholar]
- Masanes, L.; Müller, M.P. A derivation of quantum theory from physical requirements. New J. Phys. 2011, 13, 063001. [Google Scholar] [CrossRef]
- Chiribella, G.; D’Ariano, G.M.; Perinotti, P. Informational derivation of quantum theory. Phys. Rev. A 2011, 84, 012311. [Google Scholar] [CrossRef]
- Barnum, H.; Müller, M.P.; Ududec, C. Higher-order interference and single-system postulates characterizing quantum theory. New J. Phys. 2014, 16, 123029. [Google Scholar] [CrossRef]
- De la Torre, G.; Masanes, L.; Short, A.J.; Müller, M.P. Deriving Quantum Theory from Its Local Structure and Reversibility. Phys. Rev. Lett. 2012, 109, 090403. [Google Scholar] [CrossRef] [PubMed]
- Goyal, P. From information geometry to quantum theory. New J. Phys. 2010, 12, 023012. [Google Scholar] [CrossRef]
- Appleby, M.; Fuchs, C.A.; Stacey, B.C.; Zhu, H. Introducing the Qplex: A Novel Arena for Quantum Theory. arXiv, 2016; arXiv:1612.03234. [Google Scholar]
- Rovelli, C. Relational quantum mechanics. Int. J. Theor. Phys. 1996, 35, 1637–1678. [Google Scholar] [CrossRef]
- Zeilinger, A. A Foundational Principle for Quantum Mechanics. Found. Phys. 1999, 29, 631–643. [Google Scholar] [CrossRef]
- Brukner, C.; Zeilinger, A. Information and fundamental elements of the structure of quantum theory. In Time, Quantum and Information; Castell, L., Ischebeck, O., Eds.; Springer: Berlin/Heidelberg, Germany, 2003. [Google Scholar]
- Schrödinger, E. Discussion of Probability Relations between Separated Systems. Math. Proc. Camb. Philos. Soc. 1935, 31, 555–563. [Google Scholar] [CrossRef]
- Brukner, C.; Zeilinger, A. Operationally Invariant Information in Quantum Measurements. Phys. Rev. Lett. 1999, 83, 3354. [Google Scholar] [CrossRef]
- Brukner, C.; Zeilinger, A. Conceptual inadequacy of the Shannon information in quantum measurements. Phys. Rev. A 2001, 63, 022113. [Google Scholar] [CrossRef]
- Bremner, M.J.; Dawson, C.M.; Dodd, J.L.; Gilchrist, A.; Harrow, A.W.; Mortimer, D.; Nielsen, M.A.; Osborne, T.J. Practical Scheme for Quantum Computation with Any Two-Qubit Entangling Gate. Phys. Rev. Lett. 2002, 89, 247902. [Google Scholar] [CrossRef] [PubMed]
- Harrow, A.W. Exact universality from any entangling gate without inverses. Quant. Inf. Comput. 2009, 9, 773–777. [Google Scholar]
© 2017 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license ( http://creativecommons.org/licenses/by/4.0/).