Open Access
This article is

- freely available
- re-usable

*Entropy*
**2017**,
*19*(6),
253;
https://doi.org/10.3390/e19060253

Article

Ruling out Higher-Order Interference from Purity Principles

^{1}

Centre for the Mathematics of Quantum Theory (QMATH), Department of Mathematical Sciences, University of Copenhagen, DK-2100 Copenhagen, Denmark

^{2}

Department of Physics and Astronomy, University of New Mexico, Albuquerque, NM 87131, USA

^{3}

Department of Physics, University College London, London WC1E 6BT, UK

^{4}

Department of Computer Science, University of Oxford, Oxford OX1 3QD, UK

^{5}

Department of Physics, Imperial College London, London SW7 2AZ, UK

^{*}

Authors to whom correspondence should be addressed.

Academic Editors:
Giacomo Mauro D’Ariano
and
Paolo Perinotti

Received: 21 April 2017 / Accepted: 22 May 2017 / Published: 1 June 2017

## Abstract

**:**

As first noted by Rafael Sorkin, there is a limit to quantum interference. The interference pattern formed in a multi-slit experiment is a function of the interference patterns formed between pairs of slits; there are no genuinely new features resulting from considering three slits instead of two. Sorkin has introduced a hierarchy of mathematically conceivable higher-order interference behaviours, where classical theory lies at the first level of this hierarchy and quantum theory theory at the second. Informally, the order in this hierarchy corresponds to the number of slits on which the interference pattern has an irreducible dependence. Many authors have wondered why quantum interference is limited to the second level of this hierarchy. Does the existence of higher-order interference violate some natural physical principle that we believe should be fundamental? In the current work we show that such principles can be found which limit interference behaviour to second-order, or “quantum-like”, interference, but that do not restrict us to the entire quantum formalism. We work within the operational framework of generalised probabilistic theories, and prove that any theory satisfying Causality, Purity Preservation, Pure Sharpness, and Purification—four principles that formalise the fundamental character of purity in nature—exhibits at most second-order interference. Hence these theories are, at least conceptually, very “close” to quantum theory. Along the way we show that systems in such theories correspond to Euclidean Jordan algebras. Hence, they are self-dual and, moreover, multi-slit experiments in such theories are described by pure projectors.

Keywords:

higher-order interference; generalised probabilistic theories; Euclidean Jordan algebras## 1. Introduction

Described by Feynman as “impossible, absolutely impossible, to explain in any classical way” [1] (volume 1, chapter 37), quantum interference is a distinctive signature of non-classicality. However, as first noted by Rafael Sorkin [2,3], there is a limit to this interference; in contrast to the case of two slits, the interference pattern formed in a three slit experiment can be written as a linear combination of two and one slit patterns. Sorkin has introduced a hierarchy of mathematically conceivable higher-order interference behaviours, where classical theory lies at the first level of this hierarchy and quantum theory theory at the second. Informally, the order in this hierarchy corresponds to the number of slits on which the interference pattern has an irreducible dependence.

Many authors have wondered why quantum interference is limited to the second level of this hierarchy [2,4,5,6,7,8,9,10,11,12,13]. Does the existence of higher-order interference violate some natural physical principle that we believe should be fundamental [14]? In the current work we show that such natural principles can be found which limit interference behaviour to second-order, or “quantum-like”, interference, but that do not restrict us to the entire quantum formalism.

We work in the framework of general probabilistic theories [15,16,17,18,19,20,21,22,23,24,25,26,27,28]. This framework is general enough to accommodate essentially arbitrary operational theories, where an operational theory specifies a set of laboratory devices which can be connected together in different ways, and assigns probabilities to different experimental outcomes. Investigating how the structural and information-theoretic features of a given theory in this framework depend on different physical principles deepens our physical and intuitive understanding of such features. Indeed, many authors [20,22,23,28,29] have derived the entire structure of finite-dimensional quantum theory from simple information-theoretic axioms—reminiscent of Einstein’s derivation of special relativity from two simple physical principles. So far, ruling out higher-order interference has required thermodynamic arguments. Indeed, by combining the results and axioms of Refs. [30,31], higher-order interference could be ruled out in theories satisfying the combined axioms. In this paper we show that we can prove this in a more direct way from first principles, using only the axioms of Ref. [30].

Many experimental investigations have searched for divergences from quantum theory by looking for higher-order interference [32,33,34,35,36]. These experiments involved passing a particle through a physical barrier with multiple slits and comparing the interference patterns formed on a screen behind the barrier when different subsets of slits are closed. Given this set-up, one would expect that the physical theory being tested should possess transformations that correspond to the action of blocking certain subsets of slits. Moreover, blocking all but two subsets of slits should not affect states which can pass through either slit. This intuition suggests that these transformations should correspond to projectors.

Many operational probabilistic theories do not possess such a natural mathematical interpretation of multi-slit experiments; indeed many theories do not admit well-defined projectors [9]. Here, we show that there exist natural information-theoretic principles that both imply the existence of the projector structure, and rule out third-, and higher-, order interference. The principles that ensure this structure are Causality, Purity Preservation, Pure Sharpness, and Purification. These formalise intuitive ideas about the fundamental role of purity in nature. More formally, we show that such theories possess a self-dualising inner product, and that there exist pure projectors which represent the opening and closing of slits in a multi-slit experiment. Barnum, Müller and Ududec have shown that in any self-dual theory in which such projectors exist for every face, if projectors map pure states to pure states, then there can be at most second-order interference [4] (Proposition 29). The conjunction of our new results and the principle of Purity Preservation implies the conditions of Barnum et al.’s proposition. Hence sharp theories with purification do not exhibit higher-order interference. In fact we prove a stronger result, that the systems in such theories are Euclidean Jordan algebras which have been studied in quantum foundations [4,13,37].

This paper is organised as follows. In Section 2 we review the basics of the operational probabilistic theory framework. In Section 3 we formally define higher-order interference. In Section 4 we define sharp theories with purification and review relevant known results. In Section 5 we present and prove our new results. Finally, in Section 6, we offer some suggestions on how new experiments might be devised to observe higher-order interference.

## 2. Framework

We will describe theories in the framework of operational-probabilistic theories (OPTs) [19,20,24,29,38,39,40], arising from the marriage of category theory [41,42,43,44,45,46] with probabilities. The foundation of this framework is the idea that any successful physical theory must provide an account of experimental data. Hence, such theories should have an operational description in terms of such experiments.

The OPT framework is based on the graphical language of circuits, describing experiments that can be performed in a laboratory with physical systems connecting together physical processes, which are denoted as wires and boxes respectively. The systems/wires are labelled with a type denoted $\mathrm{A}$, $\mathrm{B}$, $\mathrm{C}$, …. For example, the type given to a quantum system is the dimension of the Hilbert space describing the system. The processes/boxes are then viewed as transformations with some input and output systems/wires. For instance, in quantum theory these correspond to quantum instruments. We now give a brief introduction to the important concepts in this formalism.

#### 2.1. States, Transformations, and Effects

A fundamental tenant of the OPT framework is composition of systems and physical processes. Given two systems $\mathrm{A}$ and $\mathrm{B}$, they can be combined into a composite system, denoted by $\mathrm{A}\otimes \mathrm{B}$. Physical processes can be composed to build circuits, such as

Processes with no inputs (such as $\rho $ in the above diagram) are called states, those with no outputs (such as a and b) are called effects and, those with both inputs and outputs (such as $\mathcal{A}$, ${\mathcal{A}}^{\prime}$, $\mathcal{B}$) are called transformations. We define:

- $\mathsf{St}\left(\mathrm{A}\right)$ as the set of states of system $\mathrm{A}$,
- $\mathsf{Eff}\left(\mathrm{A}\right)$ as the set of effects on $\mathrm{A}$,
- $\mathsf{Transf}\left(\mathrm{A},\mathrm{B}\right)$ as the set of transformations from $\mathrm{A}$ to $\mathrm{B}$, and $\mathsf{Transf}\left(\mathrm{A}\right)$ as the set of transformations from $\mathrm{A}$ to $\mathrm{A}$,
- $\mathcal{B}\circ \mathcal{A}$ (or $\mathcal{B}\mathcal{A}$, for short) as the sequential composition of two transformations $\mathcal{A}$ and $\mathcal{B}$, with the input of $\mathcal{B}$ matching the output of $\mathcal{A}$,
- $\mathcal{A}\otimes \mathcal{B}$ as the parallel composition (or tensor product) of the transformations $\mathcal{A}$ and $\mathcal{B}$.

OPTs include a particular system, the trivial system $\mathrm{I}$, representing the lack of input or output for a particular device.

Hence, states (resp. effects) are transformations with the trivial system as input (resp. output). Circuits with no external wires, like the circuit in Equation (1), are called scalars and are associated with probabilities. We will often use the notation $\left(a|\rho \right)$ to denote the circuit
and of the notation $\left(a\left|\mathcal{C}\right|\rho \right)$ to denote the circuit

The fact that scalars are probabilities and so are real numbers induces a notion of a sum of transformations, so that the sets $\mathsf{St}\left(\mathrm{A}\right)$, $\mathsf{Transf}\left(\mathrm{A},\mathrm{B}\right)$, and $\mathsf{Eff}\left(\mathrm{A}\right)$ become spanning sets of real vector spaces, denoted by ${\mathsf{St}}_{\mathbb{R}}\left(\mathrm{A}\right)$, ${\mathsf{Transf}}_{\mathbb{R}}\left(\mathrm{A},\mathrm{B}\right)$, and ${\mathsf{Eff}}_{\mathbb{R}}\left(\mathrm{A}\right)$. In this work we will restrict our attention to finite systems, i.e., systems for which the vector space spanned by states is finite-dimensional for all systems. Operationally this assumption means that one need not perform an infinite number of distinct experiments to fully characterise a state. Restricting ourselves to non-negative real numbers, we have the convex cone of states and of effects, denoted by ${\mathsf{St}}_{+}\left(\mathrm{A}\right)$ and ${\mathsf{Eff}}_{+}\left(\mathrm{A}\right)$ respectively. We moreover make the assumption that the set of states is close. Operationally this is justified by the fact that up to any experimental error a state space is indistinguishable from its closure.

The composition of states and effects leads naturally to a norm. This is defined, for states $\rho $ as $\u2225\rho \u2225:={sup}_{a\in \mathsf{Eff}\left(\mathrm{A}\right)}\left(a|\rho \right)$, and similarly for effects a as $\u2225a\u2225:={sup}_{\rho \in \mathsf{St}\left(\mathrm{A}\right)}\left(a|\rho \right)$. The set of normalised states (resp. effects) of system $\mathrm{A}$ is denoted by ${\mathsf{St}}_{1}\left(\mathrm{A}\right)$ (resp. ${\mathsf{Eff}}_{1}\left(\mathrm{A}\right)$).

Transformations are characterised by their action on states of composite systems: if $\mathcal{A},{\mathcal{A}}^{\prime}\in \mathsf{Transf}\left(\mathrm{A},\mathrm{B}\right)$, we have that $\mathcal{A}={\mathcal{A}}^{\prime}$ if and only if
for every system $\mathrm{S}$ and every state $\rho \in \mathsf{St}\left(\mathrm{A}\otimes \mathrm{S}\right)$. However it follows that [19] effects (resp. states) are completely defined by their action on states (resp. effects) of a single system.

Equality on states of the single system $\mathrm{A}$ is, in general, not enough to discriminate between $\mathcal{A}$ and ${\mathcal{A}}^{\prime}$, as is the case for quantum theory over real Hilbert spaces [47]. However, for the scope of the present article, which focuses on single-system properties, we often concern ourselves with equality on single system.

**Definition**

**1.**

Two transformations $\mathcal{A},{\mathcal{A}}^{\prime}\in \mathsf{Transf}\left(\mathrm{A},\mathrm{B}\right)$ are equal on single system, denoted by $\mathcal{A}\doteq {\mathcal{A}}^{\prime}$, if $\mathcal{A}\rho ={\mathcal{A}}^{\prime}\rho $ for all states $\rho \in \mathsf{St}\left(\mathrm{A}\right)$.

#### 2.2. Tests and Channels

In general, the boxes corresponding to physical processes come equipped with classical pointers. When used in an experiment, the final position of the a given pointer indicates the particular process which occurred for that box in that run. In general, this procedure can be non-deterministic. These non-deterministic processes are described by tests [19,39]: a test from $\mathrm{A}$ to $\mathrm{B}$ is a collection of transformations ${\left\{{\mathcal{C}}_{i}\right\}}_{i\in \mathsf{X}}$ from $\mathrm{A}$ to $\mathrm{B}$, where $\mathsf{X}$ is the set of outcomes. If $\mathrm{A}$ (resp. $\mathrm{B}$) is the trivial system, the test is called a preparation-test (resp. observation-test). If the set of outcomes $\mathsf{X}$ has a single element, we say that the test is deterministic, because only one transformation can occur. Deterministic transformations will be called channels.

A channel $\mathcal{U}$ from $\mathrm{A}$ to $\mathrm{B}$ is reversible if there exists another channel ${\mathcal{U}}^{-1}$ from $\mathrm{B}$ to $\mathrm{A}$ such that ${\mathcal{U}}^{-1}\mathcal{U}={\mathcal{I}}_{\mathrm{A}}$ and $\mathcal{U}{\mathcal{U}}^{-1}={\mathcal{I}}_{\mathrm{B}}$, where ${\mathcal{I}}_{\mathrm{S}}$ is the identity transformation on system $\mathrm{S}$. If there exists a reversible channel transforming $\mathrm{A}$ into $\mathrm{B}$, we say that $\mathrm{A}$ and $\mathrm{B}$ are operationally equivalent, denoted as $\mathrm{A}\simeq \mathrm{B}$. The composition of systems is required to be symmetric, meaning that $\mathrm{A}\otimes \mathrm{B}\simeq \mathrm{B}\otimes \mathrm{A}$. Physically, this means that for every pair of systems there exists a reversible channel swapping them. A state $\chi $ is called invariant if $\mathcal{U}\chi =\chi $ for all reversible channels $\mathcal{U}$.

A particularly useful class of observation-tests allows for the following.

**Definition**

**2.**

The states ${\left\{{\rho}_{i}\right\}}_{i\in \mathsf{X}}$ are called perfectly distinguishable if there exists an observation-test ${\left\{{a}_{i}\right\}}_{i\in \mathsf{X}}$ such that $\left({a}_{i}|{\rho}_{j}\right)={\delta}_{ij}$ for all $i,j\in \mathsf{X}$.

Moreover, if there is no other state ${\rho}_{0}$ such that the states ${\left\{{\rho}_{i}\right\}}_{i\in \mathsf{X}}\cup \left\{{\rho}_{0}\right\}$ are perfectly distinguishable, the set ${\left\{{\rho}_{i}\right\}}_{i\in \mathsf{X}}$ is said maximal .

#### 2.3. Pure Transformations

There are various different ways to define pure transformations, for example in terms of resources [30,48,49,50,51] or “side information” [39,52]. Informally pure transformations correspond to an experimenter having maximal control of or information about a process. Here, we formalise this notion by defining the notion of a coarse-graining [19]. Coarse-graining is the operation of joining two or more outcomes of a test into a single outcome. More precisely, a test ${\left\{{\mathcal{C}}_{i}\right\}}_{i\in \mathsf{X}}$ is a coarse-graining of the test ${\left\{{\mathcal{D}}_{j}\right\}}_{j\in \mathsf{Y}}$ if there is a partition ${\left\{{\mathsf{Y}}_{i}\right\}}_{i\in \mathsf{X}}$ of $\mathsf{Y}$ such that, for all $i\in \mathsf{X}$

$${\mathcal{C}}_{i}=\sum _{j\in {\mathsf{Y}}_{i}}{\mathcal{D}}_{j}$$

In this case, we say that the test ${\left\{{\mathcal{D}}_{j}\right\}}_{j\in \mathsf{Y}}$ is a refinement of the test ${\left\{{\mathcal{C}}_{i}\right\}}_{i\in \mathsf{X}}$, and that the transformations ${\left\{{\mathcal{D}}_{j}\right\}}_{j\in {\mathsf{Y}}_{i}}$ are a refinement of the transformation ${\mathcal{C}}_{i}$. A transformation $\mathcal{C}\in \mathsf{Transf}\left(\mathrm{A},\mathrm{B}\right)$ is pure if it has only trivial refinements, namely refinements $\left\{{\mathcal{D}}_{j}\right\}$ of the form ${\mathcal{D}}_{j}={p}_{j}\mathcal{C}$, where $\left\{{p}_{j}\right\}$ is a probability distribution. We denote the sets of pure transformations, pure states, and pure effects as $\mathsf{PurTransf}\left(\mathrm{A},\mathrm{B}\right)$, $\mathsf{PurSt}\left(\mathrm{A}\right)$, and $\mathsf{PurEff}\left(\mathrm{A}\right)$ respectively. Similarly, ${\mathsf{PurSt}}_{1}\left(\mathrm{A}\right)$, and ${\mathsf{PurEff}}_{1}\left(\mathrm{A}\right)$ denote normalised pure states and effects respectively. Non-pure states are called mixed.

**Definition**

**3.**

Let $\rho \in {\mathsf{St}}_{1}\left(\mathrm{A}\right)$. A normalised state σ is contained in ρ if we can write $\rho =p\sigma +\left(1-p\right)\tau $, where $p\in \left(0,1\right]$ and τ is another normalised state.

Clearly, no states are contained in a pure state. On the other edge of the spectrum we have complete states.

**Definition**

**4.**

A state $\omega \in {\mathsf{St}}_{1}\left(\mathrm{A}\right)$ is complete if every state is contained in it.

**Definition**

**5.**

We say that two transformations $\mathcal{A},{\mathcal{A}}^{\prime}\in \mathsf{Transf}\left(\mathrm{A},\mathrm{B}\right)$ are equal upon input of the state $\rho \in {\mathsf{St}}_{1}\left(\mathrm{A}\right)$ if $\mathcal{A}\sigma ={\mathcal{A}}^{\prime}\sigma $ for every state σ contained in ρ. In this case we will write $\mathcal{A}=\rho {\mathcal{A}}^{\prime}$.

#### 2.4. Causality

A natural requirement of a physical theory is that it is causal, that is, no signals can be sent from the future to the past. In the OPT framework this is formalised as follows:

**Axiom**

**1**

Causality is equivalent to the requirement that, for every system $\mathrm{A}$, there exists a unique deterministic effect ${u}_{\mathrm{A}}$ on $\mathrm{A}$ (or simply u, when no ambiguity can arise) [19]. Owing to the uniqueness of the deterministic effect, the marginals of a bipartite state can be uniquely defined as:

Moreover, this uniqueness forbids the ability to signal [19,53]. We will denote by ${\mathrm{Tr}}_{\mathrm{B}}{\rho}_{\mathrm{AB}}$ the marginal on system $\mathrm{A}$, in analogy with the notation used in the quantum case. We will stick to the notation $\mathrm{Tr}$ in formulas where the deterministic effect is applied directly to a state, e.g., $\mathrm{Tr}\phantom{\rule{0.222222em}{0ex}}\rho :=\left(u|\rho \right)$.

In a causal theory it is easy to see that the norm of a state takes the form $\u2225\rho \u2225=\mathrm{Tr}\phantom{\rule{0.222222em}{0ex}}\rho $, and that a state can be prepared deterministically if and only if it is normalised.

## 3. Higher-Order Interference

The definition of higher-order interference we shall present in this section takes its motivation from the set-up of multi-slit interference experiments. In such experiments a particle passes through slits in a physical barrier and is detected at a screen. By repeating the experiment many times, one builds up a pattern on the screen. To determine if this experiment exhibits interference one compares this pattern to those produced when certain subsets of the slits are blocked. In quantum theory, for example, the two-slit experiment exhibits interference as the pattern formed with both slits open is not equal to the sum of the one-slit patterns.

Consider the state of the particle just before it passes through the slits. For every slit, there should exist states such that the particle is definitely found at that slit, if measured. Mathematically, this means that there is a face [4] of the state space, such that all states in this face give unit probability for the “yes” outcome of the two-outcome measurement “is the particle at this slit?”. Recall that a face is a convex set with the property that if $px+\left(1-p\right)y$, for $0\le p\le 1$, is an element then x and y are also elements. These faces will be labelled ${F}_{i}$, one for each of the n slits $i\in \left\{1,\dots ,n\right\}$. As the slits should be perfectly distinguishable, the faces associated with each slit should be perfectly distinguishable, or orthogonal. One can additionally ask coarse-grained questions of the form “Is the particle found among a certain subset of slits, rather than somewhere else?”. The set of states that give outcome “yes” with probability one must contain all the faces associated with each slit in the subset. Hence the face associated with the subset of slits $\mathsf{I}\subseteq \left\{1,\dots ,n\right\}$ is the smallest face containing each face in this subset ${F}_{\mathsf{I}}:={\bigvee}_{i\in \mathsf{I}}{F}_{i}$, where the operation ⋁ is the least upper bound of the lattice of faces where the ordering is provided by subset inclusion of one face within another. The face ${F}_{\mathsf{I}}$ contains all those states which can be found among the slits contained in $\mathsf{I}$. The experiment is “complete” if all states in the state space (of a given system $\mathrm{A}$) can be found among some subset of slits. That is, if ${F}_{12\cdots n}=\mathsf{St}\left(\mathrm{A}\right)$.

An n-slit experiment requires a system that has n orthogonal faces ${F}_{i}$, with $i\in \left\{1,\dots ,n\right\}$. Consider an effect E associated with finding a particle at a particular point on the screen. We now formally define an n-slit experiment.

**Definition**

**6.**

An n-slit experiment is a collection of effects ${e}_{\mathsf{I}}$, where $\mathsf{I}\subseteq \left\{1,\dots ,n\right\}$, such that

$$\begin{array}{cc}\hfill \left({e}_{\mathsf{I}}|\rho \right)& =\left(E|\rho \right),\phantom{\rule{8.53581pt}{0ex}}\forall \rho \in {F}_{\mathsf{I}},\phantom{\rule{4.pt}{0ex}}and\hfill \\ \hfill \left({e}_{\mathsf{I}}|\rho \right)& =0,\phantom{\rule{8.53581pt}{0ex}}\forall \rho where\rho \perp {F}_{\mathsf{I}}.\hfill \end{array}$$

The effects introduced in the above definition arise from the conjunction of blocking off the slits $\left\{1,\dots ,n\right\}\backslash \mathsf{I}$ and applying the effect E. If the particle was prepared in a state such that it would be unaffected by the blocking of the slits (i.e., $\rho \in {F}_{\mathsf{I}}$) then we should have $\left({e}_{\mathsf{I}}|\rho \right)=\left(E|\rho \right)$. If instead the particle is prepared in a state which is guaranteed to be blocked (i.e., ${\rho}^{\prime}\perp {F}_{\mathsf{I}}$) then the particle should have no probability of being detected at the screen, i.e., $\left({e}_{\mathsf{I}}|{\rho}^{\prime}\right)=0$.

The relevant quantities for the existence of various orders of interference are [2,9,13,15]:
for some state $\rho $, and defining ${e}_{\left\{1,\dots ,n\right\}}:=E$.

$$\begin{array}{c}{I}_{1}:=\left(E|\rho \right),\hfill \end{array}$$

$$\begin{array}{c}{I}_{2}:=\left(E|\rho \right)-\left({e}_{1}|\rho \right)-\left({e}_{2}|\rho \right),\hfill \end{array}$$

$$\begin{array}{c}{I}_{3}:=\left(E|\rho \right)-\left({e}_{12}|\rho \right)-\left({e}_{23}|\rho \right)-\left({e}_{31}|\rho \right)+\left({e}_{1}|\rho \right)+\left({e}_{2}|\rho \right)+\left({e}_{3}|\rho \right),\hfill \end{array}$$

$$\begin{array}{c}{I}_{n}:=\sum _{\mathsf{\xd8}\ne \mathsf{I}\subseteq \left\{1,\dots ,n\right\}}{\left(-1\right)}^{n-\left|\mathsf{I}\right|}\left({e}_{\mathsf{I}}|\rho \right),\hfill \end{array}$$

**Definition**

**7.**

A theory has n-th order interference if there exists a state ρ and an effect E such that ${I}_{n}\ne 0$.

In a slightly different formal setting, it was shown in [2] that ${I}_{n}=0\Rightarrow {I}_{n+1}=0$, so if there is no nth order interference, there will be no $\left(n+1\right)$th order interference; the argument of [2] applies here.

It should be noted that there appears to be a lot of freedom in choosing a set of effects $\left\{{e}_{\mathsf{I}}\right\}$ to test for the existence of higher-order interference. Indeed, in arbitrary generalised theories this appears to be the case [9]. However, it is natural to ask whether there exists physical transformations ${T}_{\mathsf{I}}$ in the theory which correspond to leaving the subset of slits $\mathsf{I}$ open and blocking the rest. Hence a unique ${e}_{\mathsf{I}}$ is assigned to each fixed E defined as ${e}_{\mathsf{I}}=E{T}_{\mathsf{I}}$. Ruling out the existence of higher-order interference then reduces to proving certain properties of the ${T}_{\mathsf{I}}$. This will turn out to be the case in sharp theories with purification.

## 4. Sharp Theories with Purification

In this section we present the definition and important properties of sharp theories with purification. They were originally introduced in [30,49,54] for the analysis of the foundations of thermodynamics and statistical mechanics.

Sharp theories with purification are causal theories defined by three axioms. The first axiom—Purity Preservation—states that no information can leak when two pure transformations are composed:

**Axiom**

**2**

(Purity Preservation [55]). Sequential and parallel compositions of pure transformations yield pure transformations.

The second axiom—Pure Sharpness—guarantees that every system possesses at least one elementary property.

**Axiom**

**3**

(Pure Sharpness [54]). For every system there exists at least one pure effect occurring with unit probability on some state.

These axioms are satisfied by both classical and quantum theory. Our third axiom—Purification—signals the departure from classicality, and characterises when a physical theory admits a level of description where all deterministic processes are pure and reversible.

Given a normalised state ${\rho}_{\mathrm{A}}\in {\mathsf{St}}_{1}\left(\mathrm{A}\right)$, a normalised pure state $\mathsf{\Psi}\in {\mathsf{PurSt}}_{1}\left(\mathrm{A}\otimes \mathrm{B}\right)$ is a purification of ${\rho}_{\mathrm{A}}$ if
in this case $\mathrm{B}$ is called the purifying system. We say that a pure state $\mathsf{\Psi}\in \mathsf{PurSt}\left(\mathrm{A}\otimes \mathrm{B}\right)$ is an essentially unique purification of its marginal ${\rho}_{\mathrm{A}}$ [39] if every other pure state ${\mathsf{\Psi}}^{\prime}\in \mathsf{PurSt}\left(\mathrm{A}\otimes \mathrm{B}\right)$ satisfying the purification condition must be of the form
for some reversible channel $\mathcal{U}$.

**Axiom**

**4**

Quantum theory, both on complex and real Hilbert spaces, satisfies Purification, and also Spekkens’ toy model [56]. Examples of sharp theories with purification besides quantum theory include fermionic quantum theory [57,58], a superselected version of quantum theory known as doubled quantum theory [49], and a recent extension of classical theory with the theory of codits [30].

#### Properties of Sharp Theories With Purifications

Sharp theories with purifications enjoy some nice properties, which were mainly derived in Refs. [30,54]. The first property is that every non-trivial system admits perfectly distinguishable states [54], and that all maximal sets of pure states have the same cardinality [30].

**Proposition**

**1.**

For every system $\mathrm{A}$ there is a positive integer ${d}_{\mathrm{A}}$, called the dimension of $\mathrm{A}$, such that all maximal sets of pure states have ${d}_{\mathrm{A}}$ elements.

Note that we will omit the subscript $\mathrm{A}$ when the context is clear.

In sharp theories with purification every state can be diagonalised, i.e., written as a convex combination of perfectly distinguishable pure states (cf. Refs. [30,54]).

**Theorem**

**5.**

Every normalised state $\rho \in {\mathsf{St}}_{1}\left(\mathrm{A}\right)$ of a non-trivial system can be decomposed as
where ${\left\{{p}_{i}\right\}}_{i=1}^{d}$ is a probability distribution, and ${\left\{{\alpha}_{i}\right\}}_{i=1}^{d}$ is a pure maximal set. Moreover, given ρ, ${\left\{{p}_{i}\right\}}_{i=1}^{d}$ is unique up to rearrangements.

$$\rho =\sum _{i=1}^{d}{p}_{i}{\alpha}_{i},$$

Such a decomposition is called a diagonalisation of $\rho $, the ${p}_{i}$’s are the eigenvalues of $\rho $, and the ${\alpha}_{i}$’s are the eigenstates. Theorem 5 implies that the eigenvalues of a state are unique, and independent of its diagonalisation. Sharp theories with purification have a unique invariant state $\chi $ [19], which can be diagonalised as $\chi =\frac{1}{d}{\sum}_{i=1}^{d}{\alpha}_{i}$, where ${\left\{{\alpha}_{i}\right\}}_{i=1}^{d}$ is any pure maximal set [30]. Furthermore, the diagonalisation result of Theorem 5 can be extended to every vector in ${\mathsf{St}}_{\mathbb{R}}\left(\mathrm{A}\right)$, but here the eigenvalues will be generally real numbers [30].

One of the most important consequences for this paper of the axioms defining sharp theories with purification is a duality between normalised pure states and normalised pure effects.

**Theorem**

**6**

(States-effects duality [30,54]) For every system $\mathrm{A}$, there is a bijective correspondence †: ${\mathsf{PurSt}}_{1}\left(\mathrm{A}\right)\to {\mathsf{PurEff}}_{1}\left(\mathrm{A}\right)$ such that if $\alpha \in {\mathsf{PurSt}}_{1}\left(\mathrm{A}\right)$, ${\alpha}^{\u2020}$ is the unique normalised pure effect such that $\left({\alpha}^{\u2020}|\alpha \right)=1$. Furthermore this bijection can be extended by linearity to an isomorphism between the vector spaces ${\mathsf{St}}_{\mathbb{R}}\left(\mathrm{A}\right)$ and ${\mathsf{Eff}}_{\mathbb{R}}\left(\mathrm{A}\right)$.

With a little abuse of notation we will use † also to denote the inverse map ${\mathsf{PurEff}}_{1}\left(\mathrm{A}\right)\to {\mathsf{PurSt}}_{1}\left(\mathrm{A}\right)$, by which, if $a\in {\mathsf{PurEff}}_{1}\left(\mathrm{A}\right)$, ${a}^{\u2020}$ is the unique pure state such that $\left(a|{a}^{\u2020}\right)=1$. Pure maximal sets ${\left\{{\alpha}_{i}\right\}}_{i=1}^{d}$ have the property that ${\sum}_{i=1}^{d}{\alpha}_{i}^{\u2020}=u$ [30].

A diagonalisation result holds for vectors of ${\mathsf{Eff}}_{\mathbb{R}}\left(\mathrm{A}\right)$ as well [30]: they can be written as $X\phantom{\rule{3.33333pt}{0ex}}=\phantom{\rule{3.33333pt}{0ex}}{\sum}_{i=1}^{d}{\lambda}_{i}{\alpha}_{i}^{\u2020}$, where ${\left\{{\alpha}_{i}\right\}}_{i=1}^{d}$ is a pure maximal set. Again, the ${\lambda}_{i}$’s are uniquely defined given X.

Another result that will be made use of in the following sections is the following. It was shown to hold in Ref. [30], and expresses the possibility of constructing non-disturbing measurements [20,59,60].

**Proposition**

**2.**

Given a system $\mathrm{A}$, let $a\in \mathsf{Eff}\left(\mathrm{A}\right)$ be an effect such that $\left(a|\rho \right)=1$, for some $\rho \in {\mathsf{St}}_{1}\left(\mathrm{A}\right)$. Then there exists a pure transformation $\mathcal{T}\in \mathsf{PurTransf}\left(\mathrm{A}\right)$ such that $\mathcal{T}=\rho \mathcal{I}$, with $\left(u\left|\mathcal{T}\right|\sigma \right)\le \left(a|\sigma \right)$, for every state $\sigma \in {\mathsf{St}}_{1}\left(\mathrm{A}\right)$.

Note that the pure transformation $\mathcal{T}$ is non-disturbing on $\rho $ because it acts as the identity on $\rho $ and on all states contained in it. In other words, whenever we have an effect occurring with unit probability on some state $\rho $, we can always find a transformation that does not disturb $\rho $ (i.e., a non-disturbing, non-demolition measurement) [30].

Finally, a property that we will use often is a sort of no-restriction hypothesis for tests, derived in [20] (Corollary 4).

**Proposition**

**3.**

A collection of transformations ${\left\{{\mathcal{A}}_{i}\right\}}_{i\in \mathsf{X}}$ is a valid test if and only if ${\sum}_{i\in \mathsf{X}}u{\mathcal{A}}_{i}=u$.

A collection of effects ${\left\{{a}_{i}\right\}}_{i\in \mathsf{X}}$ is a valid observation-test if and only if ${\sum}_{i\in \mathsf{X}}{a}_{i}=u$.

## 5. Sharp Theories with Purification Have No Higher-Order Interference

Here we will show that sharp theories with purification do not exhibit higher-order interference. Our proof strategy will be to show that results of [4], which rule out the existence of higher-order interference from certain assumptions, hold in sharp theories with purification. To this end, we will first prove that these theories are self-dual, and that they admit pure orthogonal projectors which satisfy certain properties, compatible with the setting presented in Section 3.

#### 5.1. Self-Duality

Now we will prove that sharp theories with purification are self-dual. Recall that a theory is self-dual if for every system $\mathrm{A}$ there is an inner product $\u2329\u2022,\u2022\u232a$ on ${\mathsf{St}}_{\mathbb{R}}\left(\mathrm{A}\right)$ such that $\xi \in {\mathsf{St}}_{+}\left(\mathrm{A}\right)$ if and only if $\u2329\xi ,\eta \u232a\ge 0$ for every $\eta \in {\mathsf{St}}_{+}\left(\mathrm{A}\right)$. To show that, we need to find a self-dualising inner product on ${\mathsf{St}}_{\mathbb{R}}\left(\mathrm{A}\right)$ for every system $\mathrm{A}$. The dagger will provide us with a good candidate. First we need the following lemma.

**Lemma**

**1.**

Let $a\in {\mathsf{Eff}}_{1}\left(\mathrm{A}\right)$ be a normalised effect. Then a is of the form $a={\sum}_{i=1}^{r}{\alpha}_{i}^{\u2020}$, with $r\le d$, and the pure states ${\left\{{\alpha}_{i}\right\}}_{i=1}^{r}$ are perfectly distinguishable.

**Proof.**

We know that every effect a can be written as $a={\sum}_{i=1}^{r}{\lambda}_{i}{\alpha}_{i}^{\u2020}$, where $r\le d$, the pure states ${\left\{{\alpha}_{i}\right\}}_{i=1}^{r}$ are perfectly distinguishable, and for every $i\in \left\{1,\dots ,r\right\}$, ${\lambda}_{i}\in \left(0,1\right]$. Since the state space is closed, and a is normalised, then there exists a (normalised) state $\rho $ such that $\left(a|\rho \right)=1$. One has
Now, $\left({\alpha}_{i}^{\u2020}|\rho \right)\ge 0$, and ${\sum}_{i=1}^{r}\left({\alpha}_{i}^{\u2020}|\rho \right)\le 1$ because
where we have used the fact that ${\sum}_{i=1}^{d}{\alpha}_{i}^{\u2020}=u$. Then ${\sum}_{i=1}^{r}{\lambda}_{i}\left({\alpha}_{i}^{\u2020}|\rho \right)\le {\lambda}_{\mathrm{max}}$, where ${\lambda}_{\mathrm{max}}$ is the maximum of the ${\lambda}_{i}$’s. Therefore, ${\lambda}_{\mathrm{max}}\ge 1$, which implies ${\lambda}_{\mathrm{max}}=1$. Now, the condition
means that ${\lambda}_{i}={\lambda}_{\mathrm{max}}=1$ for all $i\in \left\{1,\dots ,r\right\}$. ☐

$$1=\left(a|\rho \right)=\sum _{i=1}^{r}{\lambda}_{i}\left({\alpha}_{i}^{\u2020}|\rho \right).$$

$$\sum _{i=1}^{r}\left({\alpha}_{i}^{\u2020}|\rho \right)\le \sum _{i=1}^{d}\left({\alpha}_{i}^{\u2020}|\rho \right)=\mathrm{Tr}\phantom{\rule{0.222222em}{0ex}}\rho =1,$$

$$\sum _{i=1}^{r}{\lambda}_{i}\left({\alpha}_{i}^{\u2020}|\rho \right)={\lambda}_{\mathrm{max}}$$

In the above, we call r the rank of the normalised effect. We can use this result to prove the following.

**Lemma**

**2.**

For every system $\mathrm{A}$, the map
for every $\xi ,\eta \in {\mathsf{St}}_{\mathbb{R}}\left(\mathrm{A}\right)$ is an inner product on ${\mathsf{St}}_{\mathbb{R}}\left(\mathrm{A}\right)$.

$$\u2329\xi ,\eta \u232a:=\left({\xi}^{\u2020}|\eta \right),$$

**Proof.**

The map $\u2329\u2022,\u2022\u232a$ is clearly bilinear by construction, because the dagger is also linear. Let us show that it is positive-definite. Take a non-null vector $\xi \in {\mathsf{St}}_{\mathbb{R}}\left(\mathrm{A}\right)$, and diagonalise it as $\xi ={\sum}_{i=1}^{d}{x}_{i}{\alpha}_{i}$. Then
where we have used the fact that for perfectly distinguishable pure states $\left({\alpha}_{i}^{\u2020}|{\alpha}_{j}\right)={\delta}_{ij}$ [30].

$$\u2329\xi ,\xi \u232a=\left({\xi}^{\u2020}|\xi \right)=\sum _{i,j=1}^{d}{x}_{i}{x}_{j}\left({\alpha}_{i}^{\u2020}|{\alpha}_{j}\right)=\sum _{i=1}^{d}{x}_{i}^{2}>0,$$

The hard part is to prove that this bilinear map is symmetric, namely $\u2329\xi ,\eta \u232a=\u2329\eta ,\xi \u232a$, for every $\xi ,\eta \in {\mathsf{St}}_{\mathbb{R}}\left(\mathrm{A}\right)$. Let us define a new (double) dagger ‡. The double dagger of a normalised state $\rho $ is an effect ${\rho}^{\u2021}$ whose action on normalised states $\sigma $ is defined as
where † is the dagger of Theorem 6. Note that Equation (7) is enough to characterise ${\rho}^{\u2021}$ completely, and it guarantees that ${\rho}^{\u2021}$ is a mathematically well-defined effect, because it is linear and $\left({\sigma}^{\u2020}|\rho \right)\in \left[0,1\right]$. Consider now $\rho $ and $\sigma $ to be a normalised pure state $\psi $. Then $\left({\psi}^{\u2021}|\psi \right)=\left({\psi}^{\u2020}|\psi \right)=1$, this means that ${\alpha}^{\u2021}$ is normalised. If we manage to show that ${\psi}^{\u2021}$ is pure, then by Theorem 6 we can conclude that ${\psi}^{\u2021}={\psi}^{\u2020}$. By Lemma 1, ${\psi}^{\u2021}$ is of the form ${\psi}^{\u2021}={\sum}_{i=1}^{r}{\alpha}_{i}^{\u2020}$, with $r\le d$, and the pure states ${\left\{{\alpha}_{i}\right\}}_{i=1}^{r}$ are perfectly distinguishable. Clearly ${\psi}^{\u2021}$ is pure if and only if $r=1$. To prove it, first let us evaluate ${\psi}^{\u2021}$ on $\chi $:
as prescribed by Equation (7). Now, since ${\psi}^{\u2021}={\sum}_{i=1}^{r}{\alpha}_{i}^{\u2020}$, we have
because $\left({\alpha}_{i}^{\u2020}|\chi \right)=\frac{1}{d}$ for every i [30]. A comparison between Equations (8) and (9), shows that $r=1$. This means that ${\psi}^{\u2021}$ is pure, whence ${\psi}^{\u2021}={\psi}^{\u2020}$. Now we can show that the double dagger ‡ actually coincides with the dagger of Theorem 6. Indeed, given a state $\rho $, diagonalise it as $\rho ={\sum}_{i=1}^{d}{p}_{i}{\alpha}_{i}$. One can easily show that the double dagger of Equation (7) is linear, so we have ${\rho}^{\u2021}={\sum}_{i=1}^{d}{p}_{i}{\alpha}_{i}^{\u2021}$, but we have just proved that ${\alpha}_{i}^{\u2021}={\alpha}_{i}^{\u2020}$ for pure states, so ${\rho}^{\u2021}={\sum}_{i=1}^{d}{p}_{i}{\alpha}_{i}^{\u2020}={\rho}^{\u2020}$. This means that $\u2021=\u2020$, and that Equation (7) is nothing but a redefinition of the usual dagger. This means for every normalised states we have
and this extends linearly to all vectors $\xi ,\eta \in {\mathsf{St}}_{\mathbb{R}}\left(\mathrm{A}\right)$. We have proved that $\u2329\u2022,\u2022\u232a$ is symmetric, and this concludes the proof. ☐

$$\left({\rho}^{\u2021}|\sigma \right):=\left({\sigma}^{\u2020}|\rho \right),$$

$$\left({\psi}^{\u2021}|\chi \right)=\left({\chi}^{\u2020}|\psi \right)=\frac{1}{d}\mathrm{Tr}\phantom{\rule{0.222222em}{0ex}}\psi =\frac{1}{d},$$

$$\left({\psi}^{\u2021}|\chi \right)=\sum _{i=1}^{r}\left({\alpha}_{i}^{\u2020}|\chi \right)=\frac{r}{d},$$

$$\left({\rho}^{\u2020}|\sigma \right)=\left({\sigma}^{\u2020}|\rho \right),$$

Note that the above result immediately yields the “symmetry of transition probabilities” as defined in Ref. [61,62].

Now we prove that this inner product is invariant under reversible transformations.

**Proposition**

**4.**

For every $\xi ,\eta \in {\mathsf{St}}_{\mathbb{R}}\left(\mathrm{A}\right)$ and every reversible channel $\mathcal{U}$ one has

$$\u2329\mathcal{U}\xi ,\mathcal{U}\eta \u232a=\u2329\xi ,\eta \u232a.$$

**Proof.**

To prove the statement, let us first prove that for a normalised pure state $\alpha $ one has ${\left(\mathcal{U}\alpha \right)}^{\u2020}={\alpha}^{\u2020}{\mathcal{U}}^{-1}$, for every reversible channel $\mathcal{U}$. ${\alpha}^{\u2020}{\mathcal{U}}^{-1}$ is a pure effect and one has $\left({\alpha}^{\u2020}{\mathcal{U}}^{-1}|\mathcal{U}\alpha \right)=\left({\alpha}^{\u2020}|\alpha \right)=1$. By the uniqueness of the dagger for normalised pure states, ${\alpha}^{\u2020}{\mathcal{U}}^{-1}={\left(\mathcal{U}\alpha \right)}^{\u2020}$. This can be extended by linearity to all vectors $\xi $ in ${\mathsf{St}}_{\mathbb{R}}\left(\mathrm{A}\right)$, so ${\left(\mathcal{U}\xi \right)}^{\u2020}={\xi}^{\u2020}{\mathcal{U}}^{-1}$. Therefore, when we compute $\u2329\mathcal{U}\xi ,\mathcal{U}\eta \u232a$, we have
☐

$$\u2329\mathcal{U}\xi ,\mathcal{U}\eta \u232a=\left({\xi}^{\u2020}\left|{\mathcal{U}}^{-1}\mathcal{U}\right|\eta \right)=\left({\xi}^{\u2020}|\eta \right)=\u2329\xi ,\eta \u232a.$$

The fact that $\u2329\u2022,\u2022\u232a$ is an inner product allows us to define an additional norm in sharp theories with purification: if $\xi \in {\mathsf{St}}_{\mathbb{R}}\left(\mathrm{A}\right)$, define the dagger norm as

$${\u2225\xi \u2225}_{\u2020}:=\sqrt{\u2329\xi ,\xi \u232a}.$$

See Appendix A.1 for an extended discussion on the properties of this norm.

Now we are ready to state the core of this subsection.

**Proposition**

**5.**

Sharp theories with purification are self-dual.

**Proof.**

Given a system $\mathrm{A}$, we need to prove that $\xi \in {\mathsf{St}}_{\mathbb{R}}\left(\mathrm{A}\right)$ is in ${\mathsf{St}}_{+}\left(\mathrm{A}\right)$ if and only if $\u2329\xi ,\eta \u232a\ge 0$ for all $\eta \in {\mathsf{St}}_{+}\left(\mathrm{A}\right)$. Note that $\xi \in {\mathsf{St}}_{+}\left(\mathrm{A}\right)$ if and only if it can be diagonalised as $\xi ={\sum}_{i=1}^{d}{x}_{i}{\alpha}_{i}$, where the ${x}_{i}$’s are all non-negative.

Necessity. Suppose $\xi \in {\mathsf{St}}_{+}\left(\mathrm{A}\right)$, and take any $\eta \in {\mathsf{St}}_{+}\left(\mathrm{A}\right)$, diagonalised as $\eta ={\sum}_{i=1}^{d}{y}_{i}{\beta}_{i}$. Then we have
because all the terms ${x}_{i}$, ${y}_{j}$, and $\left({\alpha}_{i}^{\u2020}|{\beta}_{j}\right)$ are non-negative.

$$\u2329\xi ,\eta \u232a=\sum _{i,j=1}^{d}{x}_{i}{y}_{j}\left({\alpha}_{i}^{\u2020}|{\beta}_{j}\right)\ge 0$$

Sufficiency. Take $\xi \in {\mathsf{St}}_{\mathbb{R}}\left(\mathrm{A}\right)$, and assume that $\u2329\xi ,\eta \u232a\ge 0$ for all $\eta \in {\mathsf{St}}_{+}\left(\mathrm{A}\right)$. Assume $\xi $ is diagonalised as $\xi ={\sum}_{i=1}^{d}{x}_{i}{\alpha}_{i}$, where the ${x}_{i}$’s are generic real numbers. We wish to prove that all the ${x}_{i}$’s are non-negative. Then

$$\u2329\xi ,\eta \u232a=\sum _{i,j=1}^{d}{x}_{i}\left({\alpha}_{i}^{\u2020}|\eta \right)\ge 0.$$

Recalling that for perfectly distinguishable pure states one has $\left({\alpha}_{i}^{\u2020}|{\alpha}_{j}\right)={\delta}_{ij}$ [30], it is enough to take $\eta $ to be one of the states ${\left\{{\alpha}_{i}\right\}}_{i=1}^{d}$ to conclude that ${x}_{i}\ge 0$ for every $i\in \left\{1,\dots ,d\right\}$, meaning that $\xi \in {\mathsf{St}}_{+}\left(\mathrm{A}\right)$. ☐

The self-dualising inner product, besides being a nice mathematical tool, has some operational meaning, because it provides a measure of the distinguishability of states, as explained in Appendix A.2. Moreover, it is the starting point for extending the dagger to all transformations. This is done in Appendix B.

#### 5.2. Existence of Pure Orthogonal Projectors

Now we show that we have orthogonal projectors on every face of the state space. A consequence of diagonalisation is that all faces are generated by perfectly distinguishable pure states. Indeed, every face F is generated by a state $\omega $ in its relative interior. $\omega $ can be diagonalised as $\omega ={\sum}_{i=1}^{r}{p}_{i}{\alpha}_{i}$, where $r\le d$, and ${p}_{i}>0$ for $i\in \left\{1,\dots ,r\right\}$. By definition of face, this means that the states ${\left\{{\alpha}_{i}\right\}}_{i=1}^{r}$ are in F, and therefore generate F. Consequently, there is an effect a that picks out the whole face as the set of states $\rho $ such that $\left(a|\rho \right)=1$. In the specific case considered above, it is $a={\sum}_{i=1}^{r}{\alpha}_{i}^{\u2020}$. Such faces are called exposed.

Therefore the study of faces of sharp theories with purification reduces to the study of normalised effects. Thanks to Lemma 1, it is enough to consider subsets of pure maximal sets. Pick a pure maximal set ${\left\{{\alpha}_{i}\right\}}_{i=1}^{d}$, and consider a subset $\mathsf{I}$ of $\left\{1,\dots ,d\right\}$. The subset $\mathsf{I}$ flags the slits that are open in the experiment. Setting ${a}_{\mathsf{I}}:={\sum}_{i\in \mathsf{I}}{\alpha}_{i}^{\u2020}$, we can define the two faces

- ${F}_{\mathsf{I}}:=\left\{\rho \in {\mathsf{St}}_{1}\left(\mathrm{A}\right):\left({a}_{\mathsf{I}}|\rho \right)=1\right\}$;
- ${F}_{\mathsf{I}}^{\perp}:=\left\{\rho \in {\mathsf{St}}_{1}\left(\mathrm{A}\right):\left({a}_{\mathsf{I}}|\rho \right)=0\right\}$,

**Definition**

**8.**

An orthogonal projector (in the sense of [20]) on the face ${F}_{\mathsf{I}}$ is a transformation ${P}_{\mathsf{I}}\in \mathsf{Transf}\left(\mathrm{A}\right)$ such that

- if $\rho \in {F}_{\mathsf{I}}$, then ${P}_{\mathsf{I}}\rho =\rho $;
- if $\rho \in {F}_{\mathsf{I}}^{\perp}$, then ${P}_{\mathsf{I}}\rho =0$.

We can prove the existence of a projector at least in one case, when $\mathsf{I}=\left\{1,\dots ,d\right\}$. In this case ${a}_{\mathsf{I}}=u$, so ${F}_{\mathsf{I}}={\mathsf{St}}_{1}\left(\mathrm{A}\right)$, and ${F}_{\mathsf{I}}^{\perp}=\mathsf{\xd8}$. Then it is enough to take ${P}_{\mathsf{I}}\doteq \mathcal{I}$. However, sharp theories with purification admit projectors on every face.

**Proposition**

**6.**

Sharp theories with purification have pure projectors on every face ${F}_{\mathsf{I}}$. Furthermore one has $u{P}_{\mathsf{I}}={a}_{\mathsf{I}}$.

**Proof.**

Suppose $\rho $ is any state in ${F}_{\mathsf{I}}$, then $\left({a}_{\mathsf{I}}|\rho \right)=1$. By Proposition 2 we know that there is a pure transformation ${P}_{\mathsf{I}}$ such that ${P}_{\mathsf{I}}\rho =\rho $ for every $\rho \in {F}_{\mathsf{I}}$. We also have $\left(u|{P}_{\mathsf{I}}|\sigma \right)\le \left({a}_{\mathsf{I}}|\sigma \right)$, so if $\sigma \in {F}_{\mathsf{I}}^{\perp}$, we have $\left(u|{P}_{\mathsf{I}}|\sigma \right)=0$, whence ${P}_{\mathsf{I}}\sigma =0$.

To prove that $u{P}_{\mathsf{I}}={a}_{\mathsf{I}}$, first note that ${\psi}^{\u2020}{P}_{\mathsf{I}}={\psi}^{\u2020}$ for every pure state $\psi \in {F}_{\mathsf{I}}$. Indeed ${\psi}^{\u2020}{P}_{\mathsf{I}}$ is pure by Purity Preservation, and we have $\left({\psi}^{\u2020}\left|{P}_{\mathsf{I}}\right|\psi \right)=\left({\psi}^{\u2020}|\psi \right)=1$ because ${P}_{\mathsf{I}}\psi =\psi $ by definition. By Theorem 6, we have ${\psi}^{\u2020}{P}_{\mathsf{I}}={\psi}^{\u2020}$. Furthermore, ${\phi}^{\u2020}{P}_{\mathsf{I}}=0$ for a pure state $\phi \in {F}_{\mathsf{I}}^{\perp}$. Indeed, consider

$$\left({\phi}^{\u2020}\left|{P}_{\mathsf{I}}\right|\chi \right)=\frac{1}{d}\sum _{i\in \mathsf{I}}\left({\phi}^{\u2020}\left|{P}_{\mathsf{I}}\right|{\alpha}_{i}\right)+\frac{1}{d}\sum _{i\notin \mathsf{I}}\left({\phi}^{\u2020}\left|{P}_{\mathsf{I}}\right|{\alpha}_{i}\right).$$

The second term vanishes because ${\alpha}_{i}\in {F}_{\mathsf{I}}^{\perp}$ for $i\notin \mathsf{I}$. The first term vanishes because ${P}_{\mathsf{I}}{\alpha}_{i}={\alpha}_{i}$ for $i\in \mathsf{I}$, and $\phi $ is perfectly distinguishable from any of the ${\alpha}_{i}$’s for $i\in \mathsf{I}$ by means of the observation-test $\left\{u-{a}_{\mathsf{I}},{a}_{\mathsf{I}}\right\}$, implying $\left({\phi}^{\u2020}|{\alpha}_{i}\right)=0$ [30]. This means that ${\phi}^{\u2020}{P}_{\mathsf{I}}$ occurs with zero probability on all states contained in $\chi $, and since $\chi $ is complete [19], ${\phi}^{\u2020}{P}_{\mathsf{I}}=0$. Now, when we calculate $u{P}_{\mathsf{I}}$, we separate the contribution arising from states in orthogonal faces:
This concludes the proof. ☐

$$u{P}_{\mathsf{I}}=\sum _{i\in \mathsf{I}}{\alpha}_{i}^{\u2020}{P}_{\mathsf{I}}+\sum _{i\notin \mathsf{I}}{\alpha}_{i}^{\u2020}{P}_{\mathsf{I}}=\sum _{i\in \mathsf{I}}{\alpha}_{i}^{\u2020}={a}_{\mathsf{I}}$$

In other words, ${P}_{\mathsf{I}}$ occurs with the same probability as ${a}_{\mathsf{I}}$, thus satisfying one of the desiderata of Section 3. Moreover, extending some of the results in the Proof 6 by linearity, we obtain the dual statements of Definition 8, namely

- ${\rho}^{\u2020}{P}_{\mathsf{I}}={\rho}^{\u2020}$ if $\rho \in {F}_{\mathsf{I}}$
- ${\rho}^{\u2020}{P}_{\mathsf{I}}=0$ if $\rho \in {F}_{\mathsf{I}}^{\perp}$

Another consequence of Proposition 6 is that projectors actually project on their associated face, viz. for every normalised state $\rho $, ${P}_{\mathsf{I}}\rho =\lambda \sigma $, where $\sigma $ is in ${F}_{\mathsf{I}}$, and $\lambda =\left({a}_{\mathsf{I}}|\rho \right)$. Indeed, $\lambda =\left(u|{P}_{\mathsf{I}}|\rho \right)=\left({a}_{\mathsf{I}}|\rho \right)$. If $\lambda \ne 0$, which means $\rho \notin {F}_{\mathsf{I}}^{\perp}$, then and $\left({a}_{\mathsf{I}}|\sigma \right)=\frac{1}{\lambda}\left({a}_{\mathsf{I}}\left|{P}_{\mathsf{I}}\right|\rho \right)$. However, we know that ${a}_{\mathsf{I}}{P}_{\mathsf{I}}={a}_{\mathsf{I}}$, so $\left({a}_{\mathsf{I}}|\sigma \right)=1$, showing that $\sigma \in {F}_{\mathsf{I}}$.

Furthermore, we can show that every projector ${P}_{\mathsf{I}}$ has a complement ${P}_{\mathsf{I}}^{\perp}$, which is the projector associated with the effect ${a}_{\mathsf{I}}^{\perp}={\sum}_{i\notin \mathsf{I}}{\alpha}_{i}^{\u2020}$, which defines the orthogonal face ${F}_{\mathsf{I}}^{\perp}$. Clearly ${P}_{\mathsf{I}}^{\perp}\rho =\left({a}_{\mathsf{I}}^{\perp}|\rho \right)\sigma $, with $\sigma \in {F}_{\mathsf{I}}^{\perp}$. In particular, ${P}_{\mathsf{I}}^{\perp}\rho $ vanishes if and only if $\rho \in {F}_{\mathsf{I}}$.

These properties are the starting point for proving the idempotence of projectors.

**Proposition**

**7.**

Given a fixed pure maximal set ${\left\{{\alpha}_{i}\right\}}_{i=1}^{d}$ and $\mathsf{I}\subseteq \left\{1,\dots ,d\right\}$, one has ${P}_{\mathsf{I}}^{2}\doteq {P}_{\mathsf{I}}$. Moreover, if $\mathsf{J}$ is another subset of $\left\{1,\dots ,d\right\}$ disjoint from $\mathsf{I}$, then ${P}_{\mathsf{I}}{P}_{\mathsf{J}}\doteq 0$.

**Proof.**

Recall that for every state $\rho $, ${P}_{\mathsf{I}}\rho =\lambda \sigma $, where $\sigma $ is in ${F}_{\mathsf{I}}$. Now, ${P}_{\mathsf{I}}$ leaves $\sigma $ invariant by definition, so
so ${P}_{\mathsf{I}}^{2}\doteq {P}_{\mathsf{I}}$. To prove the other property, note that if $\mathsf{I}$ and $\mathsf{J}$ are disjoint, they define orthogonal faces. Indeed, suppose $\rho \in {F}_{\mathsf{I}}$, then
which implies $\left({a}_{\mathsf{J}}|\rho \right)=0$ because $\left({a}_{\mathsf{I}}|\rho \right)=1$. Hence $\rho \in {F}_{\mathsf{J}}^{\perp}$. Now, given any normalised state $\rho $, ${P}_{\mathsf{I}}{P}_{\mathsf{J}}\rho =0$ because ${P}_{\mathsf{J}}\rho $ is proportional to a state in ${F}_{\mathsf{I}}^{\perp}$. This proves that ${P}_{\mathsf{I}}{P}_{\mathsf{J}}\doteq 0$. ☐

$${P}_{\mathsf{I}}^{2}\rho =\lambda {P}_{\mathsf{I}}\sigma =\lambda \sigma ,$$

$$1=\mathrm{Tr}\phantom{\rule{0.222222em}{0ex}}\rho =\left({a}_{\mathsf{I}}|\rho \right)+\left({a}_{\mathsf{J}}|\rho \right)+\sum _{i\notin \mathsf{I}\cup \mathsf{J}}\left({\alpha}_{i}^{\u2020}|\rho \right),$$

This result shows that, once a pure maximal set ${\left\{{\alpha}_{i}\right\}}_{i=1}^{d}$ is fixed, whenever we have a partition $\left\{{\mathsf{I}}_{j}\right\}$ of $\left\{1,\dots ,d\right\}$, the test $\left\{{P}_{{\mathsf{I}}_{j}}\right\}$ is a von Neumann measurement. The only thing left to check is that ${\sum}_{j}u{P}_{{\mathsf{I}}_{j}}=u$, which is a sufficient condition for a set of transformations to be a test in sharp theories with purification. This is satisfied because, recalling Proposition 6,

$$\sum _{j}u{P}_{{\mathsf{I}}_{j}}=\sum _{j}{a}_{{\mathsf{I}}_{j}}=\sum _{i=1}^{d}{\alpha}_{i}^{\u2020}=u.$$

Because of the properties proved above, von Neumann measurements are repeatable and minimally disturbing measurements in the sense of Refs. [59,63]. Indeed, ${a}_{{\mathsf{I}}_{j}}{P}_{{\mathsf{I}}_{j}}={a}_{{\mathsf{I}}_{j}}$, and
because for $k\ne j$ the ${P}_{{\mathsf{I}}_{k}}$’s project on faces orthogonal to ${F}_{{\mathsf{I}}_{j}}$.

$${a}_{{\mathsf{I}}_{j}}\sum _{k}{P}_{{\mathsf{I}}_{k}}={a}_{{\mathsf{I}}_{j}}{P}_{{\mathsf{I}}_{j}}+\sum _{k\ne j}{a}_{{\mathsf{I}}_{j}}{P}_{{\mathsf{I}}_{k}}={a}_{{\mathsf{I}}_{j}},$$

The next proposition concerns the interplay between orthogonal projectors and the dagger.

**Proposition**

**8.**

For every normalised state ρ, and for every projector ${P}_{\mathsf{I}}$ on a face ${F}_{\mathsf{I}}$, one has ${\left({P}_{\mathsf{I}}\rho \right)}^{\u2020}={\rho}^{\u2020}{P}_{\mathsf{I}}$.

**Proof.**

First of all, note that $0\le \u2225{P}_{\mathsf{I}}\rho \u2225\le 1$, and it vanishes if and only if $\rho \in {F}_{\mathsf{I}}^{\perp}$. If $\rho \in {F}_{\mathsf{I}}^{\perp}$, then ${\rho}^{\u2020}{P}_{\mathsf{I}}=0$, so the statement is trivially true. Now suppose $\u2225{P}_{\mathsf{I}}\rho \u2225>0$. We will first prove the statement for normalised pure states $\psi $, then it is sufficient to extend it by linearity to all states. We will make use of the uniqueness of the dagger for normalised pure states. Then the statement is equivalent to proving
Noting that the term in brackets is a normalised pure state (by Purity Preservation), and that the RHS is a pure effect (again by Purity Preservation), by the uniqueness of the dagger for normalised pure states (cf. Theorem 6), it is enough to prove that
in other words that $\left({\psi}^{\u2020}{P}_{\mathsf{I}}|{P}_{\mathsf{I}}\psi \right)={\u2225{P}_{\mathsf{I}}\psi \u2225}^{2}$. Recall that ${P}_{\mathsf{I}}^{2}\doteq {P}_{\mathsf{I}}$ (Proposition 7, so $\left({\psi}^{\u2020}{P}_{\mathsf{I}}|{P}_{\mathsf{I}}\psi \right)=\left({\psi}^{\u2020}\left|{P}_{\mathsf{I}}\right|\psi \right)$. Now, ${P}_{\mathsf{I}}\psi =\u2225{P}_{\mathsf{I}}\psi \u2225{\psi}^{\prime}$, where ${\psi}^{\prime}$ is a pure state in ${F}_{\mathsf{I}}$. We have $\left({\psi}^{\u2020}{P}_{\mathsf{I}}|{P}_{\mathsf{I}}\psi \right)=\u2225{P}_{\mathsf{I}}\psi \u2225\left({\psi}^{\u2020}|{\psi}^{\prime}\right)$. We only need to prove that $\left({\psi}^{\u2020}|{\psi}^{\prime}\right)=\u2225{P}_{\mathsf{I}}\psi \u2225$. Recall that $\left({\psi}^{\u2020}|{\psi}^{\prime}\right)=\left({\psi}^{\prime \u2020}|\psi \right)$ by Lemma 2, and that ${\psi}^{\prime \u2020}{P}_{\mathsf{I}}={\psi}^{\prime \u2020}$ as ${\psi}^{\prime}\in {F}_{\mathsf{I}}$, thus
By the uniqueness of the dagger for normalised pure states we conclude that ${\left(\frac{{P}_{\mathsf{I}}\psi}{\u2225{P}_{\mathsf{I}}\psi \u2225}\right)}^{\u2020}=\frac{{\psi}^{\u2020}{P}_{\mathsf{I}}}{\u2225{P}_{\mathsf{I}}\psi \u2225}$, namely ${\left({P}_{\mathsf{I}}\psi \right)}^{\u2020}={\psi}^{\u2020}{P}_{\mathsf{I}}$. ☐

$${\left(\frac{{P}_{\mathsf{I}}\psi}{\u2225{P}_{\mathsf{I}}\psi \u2225}\right)}^{\u2020}=\frac{{\psi}^{\u2020}{P}_{\mathsf{I}}}{\u2225{P}_{\mathsf{I}}\psi \u2225},$$

$$\frac{\left({\psi}^{\u2020}{P}_{\mathsf{I}}|{P}_{\mathsf{I}}\psi \right)}{{\u2225{P}_{\mathsf{I}}\psi \u2225}^{2}}=1;$$

$$\left({\psi}^{\u2020}|{\psi}^{\prime}\right)=\left({\psi}^{\prime \u2020}\left|{P}_{\mathsf{I}}\right|\psi \right)=\u2225{P}_{\mathsf{I}}\psi \u2225\left({\psi}^{{}^{\prime}\u2020}|{\psi}^{\prime}\right)=\u2225{P}_{\mathsf{I}}\psi \u2225.$$

A consequence of this proposition is that orthogonal projectors play nicely with the inner product of Lemma 2, namely for every $\xi ,\eta \in {\mathsf{St}}_{\mathbb{R}}\left(\mathrm{A}\right)$ one has
In other words, projections are symmetric with respect to the inner product.

$$\u2329{P}_{\mathsf{I}}\xi ,\eta \u232a=\u2329\xi ,{P}_{\mathsf{I}}\eta \u232a.$$

The last property we need is a generalisation of the results of Proposition 7.

**Proposition**

**9.**

Fixing a pure maximal set ${\left\{{\alpha}_{i}\right\}}_{i=1}^{d}$, and considering $\mathsf{I},\mathsf{J}\subseteq \left\{1,\dots ,d\right\}$, we have ${P}_{\mathsf{I}}{P}_{\mathsf{J}}\doteq {P}_{\mathsf{I}\cap \mathsf{J}}$.

**Proof.**

First let us prove that
for every normalised state $\rho $, where ${\rho}^{\prime}\in {F}_{\mathsf{I}\cap \mathsf{J}}$. Let us show that $\u2225{P}_{\mathsf{I}}{P}_{\mathsf{J}}\rho \u2225=\left({a}_{\mathsf{I}\cap \mathsf{J}}|\rho \right)$. By Proposition 6, $\left(u|{P}_{\mathsf{I}}{P}_{\mathsf{J}}|\rho \right)=\left({a}_{\mathsf{I}}\left|{P}_{\mathsf{J}}\right|\rho \right)$. Now, recalling that ${a}_{\mathsf{I}}={\sum}_{i\in \mathsf{I}}{\alpha}_{i}^{\u2020}$,
where we have used the fact that ${\alpha}_{i}^{\u2020}{P}_{\mathsf{J}}={\alpha}_{i}^{\u2020}$ if $i\in \mathsf{J}$, and ${\alpha}_{i}^{\u2020}{P}_{\mathsf{J}}=0$ if $i\notin \mathsf{J}$. If $\rho \in {F}_{\mathsf{I}\cap \mathsf{J}}^{\perp}$, both the LHS and the RHS of Equation (12) vanish, and the statement is trivially satisfied. Now, let us assume $\rho \notin {F}_{\mathsf{I}\cap \mathsf{J}}^{\perp}$, in this case $\left({a}_{\mathsf{I}\cap \mathsf{J}}|\rho \right)>0$. We wish to prove that $\left({a}_{\mathsf{I}\cap \mathsf{J}}\left|{P}_{\mathsf{I}}{P}_{\mathsf{J}}\right|\rho \right)=\left({a}_{\mathsf{I}\cap \mathsf{J}}|\rho \right)$. Recalling the expression of ${a}_{\mathsf{I}\cap \mathsf{J}}$, we have
again by the properties of ${P}_{\mathsf{I}}$ and ${P}_{\mathsf{J}}$. This means that ${P}_{\mathsf{I}}{P}_{\mathsf{J}}$ maps every normalised state to a state of ${F}_{\mathsf{I}\cap \mathsf{J}}$, up to normalisation.

$${P}_{\mathsf{I}}{P}_{\mathsf{J}}\rho =\u2225{P}_{\mathsf{I}}{P}_{\mathsf{J}}\rho \u2225{\rho}^{\prime}$$

$$\left({a}_{\mathsf{I}}\left|{P}_{\mathsf{J}}\right|\rho \right)=\sum _{i\in \mathsf{I}\cap \mathsf{J}}\left({\alpha}_{i}^{\u2020}\left|{P}_{\mathsf{J}}\right|\rho \right)+\sum _{i\in \mathsf{I}\backslash \mathsf{J}}\left({\alpha}_{i}^{\u2020}\left|{P}_{\mathsf{J}}\right|\rho \right)=\sum _{i\in \mathsf{I}\cap \mathsf{J}}\left({\alpha}_{i}^{\u2020}|\rho \right)=\left({a}_{\mathsf{I}\cap \mathsf{J}}|\rho \right),$$

$$\sum _{i\in \mathsf{I}\cap \mathsf{J}}\left({\alpha}_{i}^{\u2020}\left|{P}_{\mathsf{I}}{P}_{\mathsf{J}}\right|\rho \right)=\sum _{i\in \mathsf{I}\cap \mathsf{J}}\left({\alpha}_{i}^{\u2020}\left|{P}_{\mathsf{J}}\right|\rho \right)=\sum _{i\in \mathsf{I}\cap \mathsf{J}}\left({\alpha}_{i}^{\u2020}|\rho \right)=\left({a}_{\mathsf{I}\cap \mathsf{J}}|\rho \right),$$

Now let us prove that ${\left({P}_{\mathsf{I}}{P}_{\mathsf{J}}\right)}^{2}\doteq {P}_{\mathsf{I}}{P}_{\mathsf{J}}$. First note that ${F}_{\mathsf{I}\cap \mathsf{J}}\subseteq {F}_{\mathsf{I}}$. Indeed, suppose $\rho \in {F}_{\mathsf{I}\cap \mathsf{J}}$, then
where we have used the fact that $\left({\alpha}_{i}^{\u2020}|\rho \right)=0$ if $i\notin \mathsf{I}\cap \mathsf{J}$. By a similar argument, ${F}_{\mathsf{I}\cap \mathsf{J}}\subseteq {F}_{\mathsf{J}}$. Now, ${P}_{\mathsf{I}}{P}_{\mathsf{J}}\rho =\u2225{P}_{\mathsf{I}}{P}_{\mathsf{J}}\rho \u2225{\rho}^{\prime}$, with ${\rho}^{\prime}\in {F}_{\mathsf{I}\cap \mathsf{J}}$. Then ${\left({P}_{\mathsf{I}}{P}_{\mathsf{J}}\right)}^{2}\rho =\u2225{P}_{\mathsf{I}}{P}_{\mathsf{J}}\rho \u2225{P}_{\mathsf{I}}{P}_{\mathsf{J}}{\rho}^{\prime}$. However, ${\rho}^{\prime}\in {F}_{\mathsf{J}}$, so ${P}_{\mathsf{J}}{\rho}^{\prime}={\rho}^{\prime}$, and, similarly, ${\rho}^{\prime}\in {F}_{\mathsf{I}}$, so ${P}_{\mathsf{I}}{\rho}^{\prime}={\rho}^{\prime}$. Consequently,
proving that ${\left({P}_{\mathsf{I}}{P}_{\mathsf{J}}\right)}^{2}\doteq {P}_{\mathsf{I}}{P}_{\mathsf{J}}$.

$$\left({a}_{\mathsf{I}}|\rho \right)=\sum _{i\in \mathsf{I}\cap \mathsf{J}}\left({\alpha}_{i}^{\u2020}|\rho \right)+\sum _{i\in \mathsf{I}\backslash \mathsf{J}}\left({\alpha}_{i}^{\u2020}|\rho \right)=\left({a}_{\mathsf{I}\cap \mathsf{J}}|\rho \right)=1,$$

$${\left({P}_{\mathsf{I}}{P}_{\mathsf{J}}\right)}^{2}\rho =\u2225{P}_{\mathsf{I}}{P}_{\mathsf{J}}\rho \u2225{\rho}^{\prime}={P}_{\mathsf{I}}{P}_{\mathsf{J}}\rho ,$$

Now let us prove that for every $\xi \in {\mathsf{St}}_{\mathbb{R}}\left(\mathrm{A}\right)$, we have ${\left({P}_{\mathsf{I}}{P}_{\mathsf{J}}\xi \right)}^{\u2020}={\xi}^{\u2020}{P}_{\mathsf{I}}{P}_{\mathsf{J}}$. Following the lines of proof of Proposition 8, let us show that this is true when $\xi $ is a normalised pure state $\psi $. This boils down to showing that

$$\left({\psi}^{\u2020}{P}_{\mathsf{I}}{P}_{\mathsf{J}}|{P}_{\mathsf{I}}{P}_{\mathsf{J}}\psi \right)={\u2225{P}_{\mathsf{I}}{P}_{\mathsf{J}}\psi \u2225}^{2}.$$

The proof goes on as for Proposition 8, noting that if ${\psi}^{\prime}\in {F}_{\mathsf{I}\cap \mathsf{J}}$, then ${\psi}^{\prime \u2020}{P}_{\mathsf{I}}{P}_{\mathsf{J}}={\psi}^{\prime \u2020}$ because ${\psi}^{\prime \u2020}{P}_{\mathsf{I}}={\psi}^{\prime \u2020}$ as ${\psi}^{\prime}\in {F}_{\mathsf{I}}$, and, similarly, ${\psi}^{\prime \u2020}{P}_{\mathsf{J}}={\psi}^{\prime \u2020}$ as ${\psi}^{\prime}\in {F}_{\mathsf{J}}$. Eventually we find that for pure states ${\left({P}_{\mathsf{I}}{P}_{\mathsf{J}}\psi \right)}^{\u2020}={\psi}^{\u2020}{P}_{\mathsf{I}}{P}_{\mathsf{J}}$, and by linearity this means that ${\left({P}_{\mathsf{I}}{P}_{\mathsf{J}}\xi \right)}^{\u2020}={\xi}^{\u2020}{P}_{\mathsf{I}}{P}_{\mathsf{J}}$.

A consequence of this property is that $\u2329{P}_{\mathsf{I}}{P}_{\mathsf{J}}\xi ,\eta \u232a=\u2329\xi ,{P}_{\mathsf{I}}{P}_{\mathsf{J}}\eta \u232a$, for all $\xi ,\eta \in {\mathsf{St}}_{\mathbb{R}}\left(\mathrm{A}\right)$. These linear maps on ${\mathsf{St}}_{\mathbb{R}}\left(\mathrm{A}\right)$ are such that ${\mathsf{St}}_{\mathbb{R}}\left(\mathrm{A}\right)=\mathrm{im}\phantom{\rule{0.222222em}{0ex}}{P}_{\mathsf{I}}{P}_{\mathsf{J}}\oplus ker{P}_{\mathsf{I}}{P}_{\mathsf{J}}$, and $ker{P}_{\mathsf{I}}{P}_{\mathsf{J}}$ is the orthogonal subspace to $\mathrm{im}\phantom{\rule{0.222222em}{0ex}}{P}_{\mathsf{I}}{P}_{\mathsf{J}}$, hence it is uniquely defined once $\mathrm{im}\phantom{\rule{0.222222em}{0ex}}{P}_{\mathsf{I}}{P}_{\mathsf{J}}$ is fixed. Note that for any projector ${P}_{\mathsf{I}}$ we have $\mathrm{im}\phantom{\rule{0.222222em}{0ex}}{P}_{\mathsf{I}}=\mathrm{span}\phantom{\rule{0.222222em}{0ex}}{F}_{\mathsf{I}}$, and we have just proved that $\mathrm{im}\phantom{\rule{0.222222em}{0ex}}{P}_{\mathsf{I}}{P}_{\mathsf{J}}=\mathrm{span}\phantom{\rule{0.222222em}{0ex}}{F}_{\mathsf{I}\cap \mathsf{J}}=\mathrm{im}\phantom{\rule{0.222222em}{0ex}}{P}_{\mathsf{I}\cap \mathsf{J}}$. Having the same image, and consequently the same kernel, ${P}_{\mathsf{I}}{P}_{\mathsf{J}}$ and ${P}_{\mathsf{I}\cap \mathsf{J}}$ agree on a basis of ${\mathsf{St}}_{\mathbb{R}}\left(\mathrm{A}\right)$, therefore they agree also on all states of $\mathrm{A}$, meaning that ${P}_{\mathsf{I}}{P}_{\mathsf{J}}\doteq {P}_{\mathsf{I}\cap \mathsf{J}}$. ☐

#### 5.3. Main Result

Proposition 29 of [4] asserts that theories satisfying two postulates, Strong Symmetry and Projectivity, have higher-order interference if and only if their projectors (in our terminology here) preserve purity. A close examination of its proof, and those of all lemmas and propositions used in its proof—notably Lemma 22 and Propositions 18, 25, 26, and 28 of [4]—reveals that only premises weaker than the conjunction of Strong Symmetry and Projectivity are used: self-duality, the “spectral-like decomposition” of effects as in Lemma 1 above, the fact that faces are determined by subsets of maximal distinguishable sets of states as in Section 5.2 above, the existence of projectors onto each face in the sense of Definition 8 above, and the fact that these are symmetric with respect to the self-dualising inner product (i.e., orthogonal projectors), and satisfy Proposition 9 above. We have established these weaker premises for sharp theories with purification, and moreover, we have established in Proposition 6 that their projectors preserve purity, so we have proved:

**Theorem**

**7.**

In any sharp theory with purification there can be no nth order interference for $n\ge 3$.

#### 5.4. Jordan-Algebraic Structure

Our results also imply that systems, and therefore also the “subsystems” associated with their faces, are operationally equivalent to finite-dimensional Jordan-algebraic systems. These are systems $\mathrm{A}$ for which ${\mathsf{St}}_{+}\left(\mathrm{A}\right)$ is the cone of squares in a finite-dimensional Euclidean Jordan algebra (EJA) and ${\mathsf{Eff}}_{+}\left(\mathrm{A}\right)$ is identified with the same cone, with evaluation of effects on states given by the inner product and the Jordan unit as the deterministic effect. (See [37] for more on Jordan algebraic operational systems, and [61] for a mathematical treatment.)

**Theorem**

**8.**

In a sharp theory with purification, every system A has both ${\mathsf{St}}_{+}\left(\mathrm{A}\right)$ and ${\mathsf{Eff}}_{+}\left(\mathrm{A}\right)$ isomorphic to the cone of squares in a Euclidean Jordan algebra (EJA) via isomorphisms S and T such that $\left(a|\rho \right)=\u2329Ta,S\rho \u232a$, where $\u2329\u2022,\u2022\u232a$ is the canonical inner product on the EJA, and T takes the deterministic effect to the Jordan unit.

**Proof.**

The proof uses results of Alfsen and Shultz [64], for which we refer to [61]. Theorem 9.33 in [61] implies that finite-dimensional systems with symmetry of transition probabilities (STP), a type of projection operator they call “compression” associated with every face, and whose compressions preserve purity, have state spaces affinely isomorphic to the state spaces of Euclidean Jordan algebras. Sharp theories with purification satisfy STP, as noted following Lemma 2 above. Our projectors are easily shown to be examples of compressions by the same argument as in Theorem 17 of [4]; this argument uses only properties satisfied by our projectors (the same ones needed in the proof of Theorem 7, except for Purity Preservation) and does not need Strong Symmetry. As shown above, our projectors also preserve purity. ☐

Since faces of Jordan-algebraic systems are also Jordan-algebraic (to see this, combine a result of Iochum [65] (Theorem 5.32 in [61]), whose finite dimensional case is that all faces of EJAs are the positive part of the images of compressions, with the facts (cf. pp. 22–26 of [61]) that every face of the cone of squares is the image of such a compression P ([61], Lemma 1.39), and also a Jordan subalgebra whose unit is the image of the order unit under P ([61], Proposition 1.43).), so are the faces of state spaces in sharp theories with purification. However, it is not the case that in sharp theories with purification, each face of a system is necessarily isomorphic to a stand-alone system of the theory (an object of the category, in the categorical formulation), but, it is always possible to extend the theory such that they are. Every category has a Cauchy completion: this is a minimal extension of the category such that every idempotent morphism $\pi :\mathrm{A}\to \mathrm{A}$ can be written as a retraction-section pair, i.e., as the composition $\pi =\sigma \circ \rho $, with $\rho :\mathrm{A}\to \mathrm{B}$ and $\sigma :\mathrm{B}\to \mathrm{A}$, such that the reverse composition $\rho \circ \sigma $ is the identity morphism on B. When the idempotents are projectors P like the ones we consider here, B will be a system isomorphic to the face ${\mathrm{im}}_{+}\left(P\right)$. Of course, since there may be idempotents beyond the projectors onto faces (for example, decoherence of a set of orthogonal subspaces, or damping to a fixed state, in quantum theory), Cauchy completion of an operational theory T may add many objects in addition to ones isomorphic to faces of systems of T; indeed, for many operational theories (e.g., ones possessing idempotent decoherence maps) this will add some classical systems. This is indeed the case for quantum theory where the Cauchy completion leads to the category of finite-dimensional C*-algebras and completely positive maps [66]. The Cauchy completion can be thought of as adding in all operationally accessible systems that can be simulated on the physical system via a consistent restriction on the allowed states, effects and transformations. The Cauchy completion of a sharp theory with purification will likely satisfy the Ideal Compression postulate by virtue of containing the faces that are images of orthogonal projectors; but there are also non-Cauchy complete theories that satisfy it, e.g., the category CPM of finite-dimensional quantum systems and CP maps, in which all systems, and also all images of orthogonal projectors as defined above, are fully coherent quantum systems, but there are no classical systems.

In [37], some categories, including dagger-compact-closed categories, of Jordan algebraic systems were constructed; these categories are equivalent to operational theories as we use the term here. Although sharp theories with purification also have Jordan algebraic state and effect spaces, it is interesting to note that some of the explicit examples in [30,49] involve composites different from those that would be obtained in the categories considered in [37] for systems with the same state spaces. On the other hand, the category combining real and quaternionic systems in [37] does not satisfy Purity Preservation by parallel composition and hence falls outside the class of sharp theories with purification, although its filters do preserve purity. Of course, the failure of Purity Preservation by parallel composition seems likely to allow phenomena like the nonextensiveness of entropy when products of states are taken, which could warrant focusing on sharp theories with purification in thermodynamically motivated work such as [30].

That Jordan-algebraic systems lack higher-order interference was shown by Barnum and Ududec ([12]; announced in [67]) and by Niestegge [68]; combining this with Theorem 8 gives another way to see that our results on sharp theories with purification imply the absence of higher-order interference. Moreover, as not all EJAs satisfy our postulates, it is clear that our postulates are sufficient but not necessary conditions for ruling out higher-order interfence.

## 6. Discussion and Conclusions

We proved that in sharp theories with purification multi-slit experiments must have a pure projector structure and, moreover, such theories exhibit at most second-order interference. Hence these theories are, at least conceptually, very “close” to quantum theory. Moreover, recent work has shown that sharp theories with purification are close to quantum theory in terms of other physical and information processing features. Indeed, such theories possess quantum-like contextuality behaviour [59,63], quantum-like computation [7,8], and quantum-like thermodynamic Properties [30,49,54]. Recall from Section 4 that quantum theory is not the only example of a generalised probabilistic theory satisfying these principles. Hence Causality, Purity Preservation, Pure Sharpness, and Purification do not recover the entire quantum formalism.

However, if one were to introduce the Ideal Compression and Local Discriminability principles of the reconstruction of quantum theory due to Chiribella, D’Ariano, and Perinotti [20], one would indeed regain the entire quantum formalism. Indeed, both additional principles are necessary: Local Discriminability to preclude real quantum theory and Ideal Compression to preclude the contrived—yet admissible—example of the theory in which all systems are composites of qubits. Sharp theories with purification thus serve as a fertile test-bed for physics that is conceptually quite close to that predicted by the quantum world, but which may diverge from it in certain small, yet interesting, ways.

#### Finding Higher Order Interference

To date there has been no experiment that has found higher-order interference, at least, none that cannot be explained by taking into account the fact that the “sets of histories are not mutually exclusive” [2,35]. However, this might be due to the specific experimental set-up employed, rather than a fundamental preclusion of higher-order interference in nature. We show here that many of the properties needed to rule out observing higher-order interference are in fact quite natural assumptions which appear to be suggested by the experimental set-up employed. This suggests that the experimental set-up itself may implicitly rule out observing higher-order interference from the outset.

The main result of the current work is that sharp theories with purification can never exhibit higher-order interference in any experiment. However, in a wider class of theories, we still will not observe higher-order interference in a particular experiment if the following three conditions are met; hence, to have any chance of observing higher-order interference, experiments must be designed in order to try to violate these conditions.

- The transformations corresponding to blocking slits satisfy: ${T}_{\mathsf{I}}{T}_{\mathsf{J}}={T}_{\mathsf{I}\cap \mathsf{J}}$. By this we mean that they share several properties with the projectors ${P}_{\mathsf{I}}$ of Section 5: if we define the effects ${a}_{\mathsf{I}}=u{T}_{\mathsf{I}}$ and the faces ${F}_{\mathsf{I}}$ and ${F}_{\mathsf{I}}^{\perp}$ as in Section 5.2, i.e., as the 1-set and 0-set of ${a}_{\mathsf{I}}$, then the ${T}_{\mathsf{I}}$ are assumed to be orthogonal projectors in the sense of Definition 8, and to be both idempotent and “orthogonal” (${T}_{\mathsf{I}}{T}_{\mathsf{J}}=0$) if $\mathsf{I}$ and $\mathsf{J}$ are disjoint (as in Proposition 7).
- The ${T}_{\mathsf{I}}$’s map pure states to pure states
- The ${T}_{\mathsf{I}}$’s are self-adjoint.

The first of these is generally expected as only those slits belonging to both $\mathsf{I}$ and $\mathsf{J}$ will not be blocked by either ${T}_{\mathsf{I}}$ or ${T}_{\mathsf{J}}$, and so should hold in this experimental set-up for any theory that can describe it.

The second assumption, which is also natural given the multi-slit set-up, is that, in an idealised scenario, the slits should not introduce fundamental noise. That is, if an input state $\rho $ is pure, i.e., has no classical noise associated with it, then ${T}_{\mathsf{I}}\rho $ should also be pure. Hence it appears natural to assume that ${T}_{\mathsf{I}}$ maps pure states to pure states. Violating this principle by just adding noise to the experiment does not seem likely to demonstrate higher-order interference. A more plausible way to violate this however would be if the particle passing through the slits were to become entangled with some degree of freedom associated with them, if we do not have access to this degree of freedom then this would send a pure input to a mixed state.

The final assumption is far less general than the others, as it places a constraint on the theory. That is, to even discuss whether a transformation is self-adjoint (cf. also Appendix B), one requires that the theory itself be self-dual. To fully understand what this assumption entails, one needs an operational or physical interpretation of the self-dualising inner product (see [69] for an example of such an interpretation). However, intuitively this notion reflects the inherent symmetry of the experimental set-up. Here one could consider propagation from the source to the effect or from the effect to the source as being “dual” to one another and, moreover, that the physical blocking of slits has an equivalent effect in either situation. That is, the assumption of self-adjointness corresponds to the statement that the projector has an equivalent action on the effects associated with a particular slit as it does on the states which can pass through them.

If an experiment satisfies these assumptions then for any self-dual theory it was shown in [4] (Proposition 29) that we will not see higher-order interference in this experiment. Hence any set of physical principles which ensure these assumptions hold will rule out higher-order interference. Because the mathematical assumptions involved in formalising a multi-slit experiment are so natural when interpreted operationally, perhaps one should search for higher-order interference in set-ups that don’t seem to preclude it from the outset. This could involve “asymmetric” multi-slit set-ups that are not obviously time-symmetric in an arbitrary generalised probabilistic theory. One could also consider experiments that search for higher-order phases [8], a reformulation of higher-order interference that makes no reference to projectors and hence does not preclude certain generalised theories from the outset. The assumption that nature is self-dual could also be rejected; this poses the question as to whether it is possible to find a direct experimental test of this principle.

## Acknowledgments

The authors thank J. Barrett for useful discussions and J. J. Barry for encouragement while writing the current paper. This work was supported by EPSRC grants through the Controlled Quantum Dynamics Centre for Doctoral Training, the UCL Doctoral Prize Fellowship (project number 534936), and an Oxford doctoral training scholarship, and also by Oxford-Google DeepMind Graduate Scholarship. We also acknowledge financial support from the European Research Council (ERC Grant Agreement No. 337603), the Danish Council for Independent Research (Sapere Aude) and VILLUM FONDEN via the QMATH Centre of Excellence (Grant No. 10059). This work began while the authors were attending the “Formulating and Finding Higher-order Interference” workshop at the Perimeter Institute. Research at Perimeter Institute is supported by the Government of Canada through the Department of Innovation, Science and Economic Development Canada and by the Province of Ontario through the Ministry of Research, Innovation and Science.

## Author Contributions

All authors contributed equally to the present work.

## Conflicts of Interest

The authors declare no conflict of interest.

## Appendix A. Norms and Fidelity

#### Appendix A.1. Operational Norm and Dagger Norm

In Ref. [19] the operational norm for every vector $\xi \in {\mathsf{St}}_{\mathbb{R}}\left(\mathrm{A}\right)$ was introduced:

$$\u2225\xi \u2225:=\underset{a\in \mathsf{Eff}\left(\mathrm{A}\right)}{sup}\left(a|\xi \right)-\underset{a\in \mathsf{Eff}\left(\mathrm{A}\right)}{inf}\left(a|\xi \right)$$

As pointed out in [19], in quantum theory the operational norm coincides with the trace norm. The analogy is apparent also in sharp theories with purification.

**Proposition**

**A1.**

Let $\xi \in {\mathsf{St}}_{\mathbb{R}}\left(\mathrm{A}\right)$ be diagonalised as $\xi ={\sum}_{i=1}^{d}{x}_{i}{\alpha}_{i}$. Then $\u2225\xi \u2225={\sum}_{i=1}^{d}\left|{x}_{i}\right|$.

**Proof.**

Let us separate the terms with non-negative eigenvalues from the terms with negative eigenvalues, so that we can write $\xi ={\xi}_{+}-{\xi}_{-}$, where ${\xi}_{+}:={\sum}_{{x}_{i}\ge 0}{x}_{i}{\alpha}_{i}$, and ${\xi}_{-}={\sum}_{{x}_{i}<0}\left(-{x}_{i}\right){\alpha}_{i}$. Clearly, ${\xi}_{+},{\xi}_{-}\in {\mathsf{St}}_{+}\left(\mathrm{A}\right)$. In order to achieve the supremum of $\left(a|\xi \right)$ we must have $\left(a|{\xi}_{-}\right)=0$. Moreover,
since $\left(a|{\alpha}_{i}\right)\le 1$ for every i. The supremum of $\left(a|{\xi}_{+}\right)$ is achieved by $a={\sum}_{{x}_{i}\ge 0}{\alpha}_{i}^{\u2020}$. Hence ${sup}_{a}\left(a|\xi \right)={\sum}_{{x}_{i}\ge 0}{x}_{i}$. By a similar argument, one shows that ${inf}_{a}\left(a|\xi \right)={\sum}_{{x}_{i}<0}{x}_{i}$. Therefore
☐

$$\left(a|{\xi}_{+}\right)=\sum _{{x}_{i}\ge 0}{x}_{i}\left(a|{\alpha}_{i}\right)\le \sum _{{x}_{i}\ge 0}{x}_{i}$$

$$\u2225\xi \u2225=\sum _{{x}_{i}\ge 0}{x}_{i}+\sum _{{x}_{i}<0}\left(-{x}_{i}\right)=\sum _{i=1}^{d}\left|{x}_{i}\right|.$$

For $p\ge 1$, the p-norm of a vector $\mathbf{x}\in {\mathbb{R}}^{d}$ is defined as ${\u2225\mathbf{x}\u2225}_{p}:={\left({\sum}_{i=1}^{d}{\left|{x}_{i}\right|}^{p}\right)}^{\frac{1}{p}}$, thus we have $\u2225\xi \u2225={\u2225\mathbf{x}\u2225}_{1}$, where $\mathbf{x}$ is the spectrum of $\xi $.

In sharp theories with purification we have an additional norm, the dagger norm, defined in Section 5.1. The dagger norm of a vector $\xi \in {\mathsf{St}}_{\mathbb{R}}\left(\mathrm{A}\right)$ is ${\u2225\xi \u2225}_{\u2020}=\sqrt{{\sum}_{i=1}^{d}{x}_{i}^{2}}$, where the ${x}_{i}$’s are the eigenvalues of $\xi $. It is obvious from the very definition that ${\u2225\xi \u2225}_{\u2020}={\u2225\mathbf{x}\u2225}_{2}$. Thanks to these results following from diagonalisation, we can derive the standard bounds between the two norms, by making use of the well-known bounds ${\u2225\mathbf{x}\u2225}_{2}\le {\u2225\mathbf{x}\u2225}_{1}\le \sqrt{d}{\u2225\mathbf{x}\u2225}_{2}$, which imply
Note that, unlike Ref. [70], here the bounds are derived without assuming Bit Symmetry [4,71].

$${\u2225\xi \u2225}_{\u2020}\le \u2225\xi \u2225\le \sqrt{d}{\u2225\xi \u2225}_{\u2020}.$$

If we take $\xi $ to be a normalised state $\rho $, its eigenvalues form a probability distribution, and we have ${\u2225\rho \u2225}_{\u2020}\le 1$, with equality if and only if $\rho $ is pure. Note that ${\u2225\rho \u2225}_{\u2020}$ is a Schur-convex function [72] of the eigenvalues of $\rho $, so it is a purity monotone [30]. As such, it attains its minimum on the invariant state, which is ${\u2225\chi \u2225}_{\u2020}=\frac{1}{\sqrt{d}}$, so for every normalised state one has
consistently with the bounds (A1). The square of the dagger norm, still a Schur-convex function, was called purity in Refs. [70,73]. Consequently $1-{\u2225\rho \u2225}_{\u2020}^{2}$ is a measure of mixedness, sometimes called the impurity $I\left(\rho \right)$ of $\rho $. The impurity can be extended to subnormalised states by defining it as $I\left(\rho \right):={\left(\mathrm{Tr}\phantom{\rule{0.222222em}{0ex}}\rho \right)}^{2}-{\u2225\rho \u2225}_{\u2020}^{2}$ [4].

$$\frac{1}{\sqrt{d}}\le {\u2225\rho \u2225}_{\u2020}\le 1,$$

The two norms behave differently under channels applied to states. In Ref. [19] it was shown that in causal theories the operational norm of a state $\rho $ is preserved by channels: $\u2225\mathcal{C}\rho \u2225=\u2225\rho \u2225$ for every channel $\mathcal{C}$, because channels are such that $u\mathcal{C}=u$.

Instead the dagger norm shows a different behaviour. To describe it, it is useful to divide channels into two classes: unital and non-unital channels [49].

**Definition**

**A1.**

A channel $\mathcal{D}\in \mathsf{Transf}\left(\mathrm{A},\mathrm{B}\right)$ is unital if $\mathcal{D}{\chi}_{\mathrm{A}}={\chi}_{\mathrm{B}}$.

Unital channels do not increase the dagger norm of states.

**Proposition**

**A2.**

If $\mathcal{D}$ is a unital channel, then ${\u2225\mathcal{D}\rho \u2225}_{\u2020}\le {\u2225\rho \u2225}_{\u2020}$, for every normalised state ρ.

**Proof.**

Unital channels can be chosen as free operations for the resource theory of purity [49]. In Ref. [49] it was shown that the spectrum of $\mathcal{D}\rho $ is majorised by the spectrum of $\rho $ (see Ref. [72] for a definition of majorisation and Schur-convex functions). Since the dagger norm is a Schur-convex function, we have ${\u2225\mathcal{D}\rho \u2225}_{\u2020}\le {\u2225\rho \u2225}_{\u2020}$. ☐

Clearly if $\mathcal{D}$ is reversible, the dagger norm is preserved, by Proposition 4.

For non-unital channels there is at least one state—the invariant state $\chi $—for which the dagger norm increases. Indeed, if $\mathcal{C}$ is non-unital, $\chi $ is majorised by $\mathcal{C}\chi $, whence ${\u2225\chi \u2225}_{\u2020}\le {\u2225\mathcal{C}\chi \u2225}_{\u2020}$. Is it true, then, that non-unital channels increase the dagger norm of all states? The answer is clearly negative. Consider the non-unital channel mapping all states to a fixed mixed state ${\rho}_{0}\ne \chi $. For some states, e.g., the invariant state, the dagger norm will increase, for others, e.g., pure states, the dagger norm will decrease because it is a purity monotone. In short, for non-unital channels there is no uniform behaviour of the dagger norm.

#### Appendix A.2. Dagger Fidelity

The inner product defined in Section 5.1 allows us to define a fidelity-like quantity, called the dagger fidelity.

**Definition**

**A2.**

Given two normalised states ρ and σ, the dagger fidelity is defined as

$${F}_{\u2020}\left(\rho ,\sigma \right)=\frac{\u2329\rho ,\sigma \u232a}{{\u2225\rho \u2225}_{\u2020}{\u2225\sigma \u2225}_{\u2020}}.$$

The dagger fidelity measures the overlap between two states. It shares some properties with the fidelity in quantum theory (cf. for instance Ref. [74]), despite not coinciding with it. The first, obvious one, is that ${F}_{\u2020}\left(\rho ,\sigma \right)={F}_{\u2020}\left(\sigma ,\rho \right)$.

To prove the other properties we need the following lemma, generalising one of the results of Ref. [30].

**Lemma**

**A1.**

Let ${\left\{{\rho}_{i}\right\}}_{i=1}^{n}$ be perfectly distinguishable states. Then $\left({\rho}_{i}^{\u2020}|{\rho}_{j}\right)={\u2225{\rho}_{i}\u2225}_{\u2020}^{2}{\delta}_{ij}$.

**Proof.**

Clearly what we need to prove is that $\left({\rho}_{i}^{\u2020}|{\rho}_{j}\right)=0$ if $i\ne j$. Let ${\left\{{a}_{i}\right\}}_{i=1}^{n}$ be the perfectly distinguishing test, and let ${\rho}_{i}$ be diagonalised as ${\rho}_{i}={\sum}_{k=1}^{{r}_{i}}{p}_{k,i}{\alpha}_{k,i}$, where ${p}_{k,i}>0$ for all $k=1,\dots ,r$. We have $\left({a}_{i}|{\rho}_{i}\right)=1$, hence by Proposition 2 there exists a non-disturbing pure transformation ${\mathcal{T}}_{i}$ such that ${\mathcal{T}}_{i}{=}_{{\rho}_{i}}\mathcal{I}$. Specifically, we have that ${\mathcal{T}}_{i}{\alpha}_{k,i}={\alpha}_{k,i}$. Moreover if $i\ne j$, we have $\left(u|{\mathcal{T}}_{i}|{\rho}_{j}\right)\le \left({a}_{i}|{\rho}_{j}\right)=0$, whence $\left(u|{\mathcal{T}}_{i}|{\rho}_{j}\right)=0$. This means that ${\mathcal{T}}_{i}{\rho}_{j}=0$ for all $j\ne i$.

Now, consider
where we have used the fact that ${\mathcal{T}}_{i}{\alpha}_{k,i}={\alpha}_{k,i}$. Since ${\alpha}_{k,i}^{\u2020}{\mathcal{T}}_{i}$ is a pure effect, it must be ${\alpha}_{k,i}^{\u2020}{\mathcal{T}}_{i}={\alpha}_{k,i}^{\u2020}$ by Theorem 6. By linearity we have ${\rho}_{i}^{\u2020}{\mathcal{T}}_{i}={\rho}_{i}^{\u2020}$. Now, using this fact, for all $j\ne i$
because ${\mathcal{T}}_{i}{\rho}_{j}=0$. ☐

$$\left({\alpha}_{k,i}^{\u2020}\left|{\mathcal{T}}_{i}\right|{\alpha}_{k,i}\right)=\left({\alpha}_{k,i}^{\u2020}|{\alpha}_{k,i}\right)=1,$$

$$\left({\rho}_{i}^{\u2020}|{\rho}_{j}\right)=\left({\rho}_{i}^{\u2020}\left|{\mathcal{T}}_{i}\right|{\rho}_{j}\right)=0,$$

Recalling that $\left({\rho}^{\u2020}|\sigma \right)=\u2329\rho ,\sigma \u232a$, this lemma means that perfectly distinguishable states form an orthogonal set. Specifically, if the states are pure, the set is orthonormal.

The following proposition extends and generalises the properties of the self-dualising inner product of Ref. [71].

**Proposition**

**A3.**

The dagger fidelity has the following properties, for all normalised states ρ and σ.

- $0\le {F}_{\u2020}\left(\rho ,\sigma \right)\le 1$;
- ${F}_{\u2020}\left(\rho ,\sigma \right)=0$ if and only if ρ and σ are perfectly distinguishable;
- ${F}_{\u2020}\left(\rho ,\sigma \right)=1$ if and only if $\rho =\sigma $;
- ${F}_{\u2020}\left(\mathcal{U}\rho ,\mathcal{U}\sigma \right)={F}_{\u2020}\left(\rho ,\sigma \right)$, for every reversible channel $\mathcal{U}$.

**Proof.**

Let us prove the various properties.

- Recall that $\u2329\rho ,\sigma \u232a=\left({\rho}^{\u2020}|\sigma \right)\ge 0$, whence ${F}_{\u2020}\left(\rho ,\sigma \right)\ge 0$. Moreover, by Schwarz inequality, $\u2329\rho ,\sigma \u232a\le {\u2225\rho \u2225}_{\u2020}{\u2225\sigma \u2225}_{\u2020}$, so ${F}_{\u2020}\left(\rho ,\sigma \right)\le 1$.
- Suppose $\rho $ and $\sigma $ are perfectly distinguishable, then by Lemma A1 $\u2329\rho ,\sigma \u232a=0$, implying ${F}_{\u2020}\left(\rho ,\sigma \right)=0$. Now suppose ${F}_{\u2020}\left(\rho ,\sigma \right)=0$; then $\u2329\rho ,\sigma \u232a=0$. Let $\rho ={\sum}_{i=1}^{r}{p}_{i}{\alpha}_{i}$ be a diagonalisation of $\rho $, with ${p}_{i}>0$, for all $i=1,\dots ,r$, and $r\le d$. We have ${\sum}_{i=1}^{r}{p}_{i}\left({\alpha}_{i}^{\u2020}|\sigma \right)=0$, which means that $\left({\alpha}_{i}^{\u2020}|\sigma \right)=0$ for $i=1,\dots ,r$. This means that we can build an observation-test that distinguishes $\rho $ and $\sigma $ perfectly by taking $\left\{a,u-a\right\}$, where $a={\sum}_{i=1}^{r}{\alpha}_{i}^{\u2020}$.
- Clearly, if $\rho =\sigma $, $\u2329\rho ,\sigma \u232a={\u2225\rho \u2225}_{\u2020}^{2}$, whence ${F}_{\u2020}\left(\rho ,\sigma \right)=1$. Conversely, suppose ${F}_{\u2020}\left(\rho ,\sigma \right)=1$. This means that $\u2329\rho ,\sigma \u232a={\u2225\rho \u2225}_{\u2020}{\u2225\sigma \u2225}_{\u2020}$. By Schwarz inequality, this is true if and only if $\rho =\lambda \sigma $, for some $\lambda \in \mathbb{R}$. Since both states are normalised, $\lambda =1$, yielding $\rho =\sigma $.
- This property follows by Proposition 4, because the inner product and the dagger norm are invariant under reversible channels.

Note that Property 3 captures the sharpness of the dagger for all normalised states [69].

A property involving tensor product of states is the following.

**Proposition**

**A4.**

For all normalised states ${\rho}_{1}$, ${\rho}_{2}$, ${\sigma}_{1}$, ${\sigma}_{2}$ one has

$${F}_{\u2020}\left({\rho}_{1}\otimes {\rho}_{2},{\sigma}_{1}\otimes {\sigma}_{2}\right)={F}_{\u2020}\left({\rho}_{1},{\sigma}_{1}\right){F}_{\u2020}\left({\rho}_{2},{\sigma}_{2}\right)$$

The proof needs the following easy lemma.

**Lemma**

**A2.**

Let $\rho ,\sigma \in {\mathsf{St}}_{1}\left(\mathrm{A}\right)$, then ${\left(\rho \otimes \sigma \right)}^{\u2020}={\rho}^{\u2020}\otimes {\sigma}^{\u2020}$.

**Proof.**

Let us prove the result for $\rho $ and $\sigma $ pure, the general result will follow by linearity. By Purity Preservation, $\rho \otimes \sigma $ and ${\rho}^{\u2020}\otimes {\sigma}^{\u2020}$ are pure, and one has $\left({\rho}^{\u2020}\otimes {\sigma}^{\u2020}|\rho \otimes \sigma \right)=1$. By Theorem 6, ${\left(\rho \otimes \sigma \right)}^{\u2020}={\rho}^{\u2020}\otimes {\sigma}^{\u2020}$. ☐

Now comes the actual proof.

**Proof**

**of**

**Proposition**

**A4**

We have
Now, by Lemma A2,
Furthermore,
Putting everything together,
☐

$${F}_{\u2020}\left({\rho}_{1}\otimes {\rho}_{2},{\sigma}_{1}\otimes {\sigma}_{2}\right)=\frac{\u2329{\rho}_{1}\otimes {\rho}_{2},{\sigma}_{1}\otimes {\sigma}_{2}\u232a}{{\u2225{\rho}_{1}\otimes {\rho}_{2}\u2225}_{\u2020}{\u2225{\sigma}_{1}\otimes {\sigma}_{2}\u2225}_{\u2020}}.$$

$$\u2329{\rho}_{1}\otimes {\rho}_{2},{\sigma}_{1}\otimes {\sigma}_{2}\u232a=\left({\rho}_{1}^{\u2020}\otimes {\rho}_{2}^{\u2020}|{\sigma}_{1}\otimes {\sigma}_{2}\right)=\left({\rho}_{1}^{\u2020}|{\sigma}_{1}\right)\left({\rho}_{2}^{\u2020}|{\sigma}_{2}\right)=\u2329{\rho}_{1},{\sigma}_{1}\u232a\u2329{\rho}_{2},{\sigma}_{2}\u232a.$$

$${\u2225{\rho}_{1}\otimes {\rho}_{2}\u2225}_{\u2020}=\sqrt{\u2329{\rho}_{1}\otimes {\rho}_{2},{\rho}_{1}\otimes {\rho}_{2}\u232a}=\sqrt{\u2329{\rho}_{1},{\rho}_{1}\u232a\u2329{\rho}_{2},{\rho}_{2}\u232a}={\u2225{\rho}_{1}\u2225}_{\u2020}{\u2225{\rho}_{2}\u2225}_{\u2020}.$$

$${F}_{\u2020}\left({\rho}_{1}\otimes {\rho}_{2},{\sigma}_{1}\otimes {\sigma}_{2}\right)=\frac{\u2329{\rho}_{1},{\sigma}_{1}\u232a}{{\u2225{\rho}_{1}\u2225}_{\u2020}{\u2225{\sigma}_{1}\u2225}_{\u2020}}\xb7\frac{\u2329{\rho}_{2},{\sigma}_{2}\u232a}{{\u2225{\rho}_{2}\u2225}_{\u2020}{\u2225{\sigma}_{2}\u2225}_{\u2020}}={F}_{\u2020}\left({\rho}_{1},{\sigma}_{1}\right){F}_{\u2020}\left({\rho}_{2},{\sigma}_{2}\right).$$

## Appendix B. Dagger of All Transformations

Inspired by the results of Lemma 2, in sharp theories with purification, we can extend the dagger to all transformations, a feature often present in process theories [44,45,69,75].

**Definition**

**A3.**

Given the transformation $\mathcal{A}\in \mathsf{Transf}\left(\mathrm{A},\mathrm{B}\right)$, its dagger (or adjoint) is a linear transformation ${\mathcal{A}}^{\u2020}$ from $\mathrm{B}$ to $\mathrm{A}$ defined as
for every system $\mathrm{S}$, and every state $\rho \in {\mathsf{St}}_{1}\left(\mathrm{B}\otimes \mathrm{S}\right)$.

This definition specifies the dagger of a transformation completely, thanks to Equation (2). Note that Lemma 2 allows us to formulate Equation (10) in term of effects and their dagger:

$$\left(a|{b}^{\u2020}\right)=\left(b|{a}^{\u2020}\right)$$

for all effects a, and b. In this way, Definition A3 can be recast in equivalent terms by taking b as the term in round brackets in the RHS of Equation (A2). This yields
for every system $\mathrm{S}$, every state $\rho \in {\mathsf{St}}_{1}\left(\mathrm{B}\otimes \mathrm{S}\right)$, and every effect $E\in \mathsf{Eff}\left(\mathrm{A}\otimes \mathrm{S}\right)$.

The dagger of a transformation may not be a physical transformation, i.e., it may send physical states to non-physical ones. Indeed, the action of ${\mathcal{A}}^{\u2020}\otimes \mathcal{I}$ on a generic state (the LHS of Equation (A2)) is defined as the dagger of an effect. However, not all daggers of effects are physical states. For instance, take the deterministic effect $u={\sum}_{i=1}^{d}{\alpha}_{i}^{\u2020}$, where ${\left\{{\alpha}_{i}\right\}}_{i=1}^{d}$ is a pure maximal set. Its dagger is ${u}^{\u2020}={\sum}_{i=1}^{d}{\alpha}_{i}=d\chi $, which is a supernormalised (and hence non-physical) state.

For channels, we can give a necessary condition for the existence of a physical dagger of the channel.

**Proposition**

**A5.**

Let $\mathcal{C}\in \mathsf{Transf}\left(\mathrm{A},\mathrm{B}\right)$ be a channel. If ${\mathcal{C}}^{\u2020}$ is a physical transformation, then $\mathcal{C}$ is unital, and ${\mathcal{C}}^{\u2020}$ itself is a unital channel.

**Proof.**

If ${\mathcal{C}}^{\u2020}$ is a physical transformation, then, for every normalised state $\rho \in {\mathsf{St}}_{1}\left(\mathrm{B}\right)$, we have $\u2225{\mathcal{C}}^{\u2020}\rho \u2225\le 1$, or in other words, $\left(u|{\mathcal{C}}^{\u2020}|\rho \right)\le 1$. By Equation (A3), $\left(u|{\mathcal{C}}^{\u2020}|\rho \right)=\left({\rho}^{\u2020}\left|\mathcal{C}\right|{u}^{\u2020}\right)$, so the condition $\u2225{\mathcal{C}}^{\u2020}\rho \u2225\le 1$ is equivalent to
with equality if and only if ${\mathcal{C}}^{\u2020}$ is a channel. Suppose by contradiction that $\mathcal{C}$ is not unital, then $\mathcal{C}\chi ={\rho}_{0}\ne \chi $. Diagonalise ${\rho}_{0}$ as ${\rho}_{0}={\sum}_{i=1}^{d}{p}_{i}{\alpha}_{i}$, where ${p}_{1}\ge {p}_{2}\ge \dots \ge {p}_{d}\ge 0$, and ${p}_{1}>\frac{1}{d}$. Then taking $\rho $ to be ${\alpha}_{1}$ in $\left({\rho}^{\u2020}\left|\mathcal{C}\right|\chi \right)$ yields ${p}_{1}$, but ${p}_{1}>\frac{1}{d}$, contradicting Equation (A4).

$$\left({\rho}^{\u2020}\left|\mathcal{C}\right|\chi \right)=\frac{1}{d},$$

Being $\mathcal{C}$ unital, we have that
showing that ${\mathcal{C}}^{\u2020}$ is itself a channel. Let us prove it is unital. The action of ${\mathcal{C}}^{\u2020}$ on $\chi $ is defined in Equation (A2), so
where we have used the fact that $\mathcal{C}$ is a channel, so $u\mathcal{C}=u$. This proves that ${\mathcal{C}}^{\u2020}$ is unital. ☐

$$\left({\rho}^{\u2020}\left|\mathcal{C}\right|\chi \right)=\left({\rho}^{\u2020}|\chi \right)=\frac{1}{d}\mathrm{Tr}\phantom{\rule{0.222222em}{0ex}}\rho =\frac{1}{d},$$

$${\mathcal{C}}^{\u2020}\chi ={\left({\chi}^{\u2020}\mathcal{C}\right)}^{\u2020}=\frac{1}{d}{\left(u\mathcal{C}\right)}^{\u2020}=\frac{1}{d}{u}^{\u2020}=\chi ,$$

We can prove that the dagger of a transformation has some nice properties.

**Proposition**

**A6.**

For every transformation $\mathcal{A}\in \mathsf{Transf}\left(\mathrm{A},\mathrm{B}\right)$, one has ${\left({\mathcal{A}}^{\u2020}\right)}^{\u2020}=\mathcal{A}$.

**Proof.**

By Equation (A3) given any system $\mathrm{S}$, any state $\rho \in {\mathsf{St}}_{1}\left(\mathrm{A}\otimes \mathrm{S}\right)$, and any effect $E\in \mathsf{Eff}\left(\mathrm{B}\otimes \mathrm{S}\right)$, we have
A linear extension of Equation (A3) to cover the case when ${E}^{\u2020}$ is not a physical state, applied to the RHS of Equation (A5) yields
Comparing this with Equation (A5), we get the thesis. ☐

We can give a characterisation of the dagger of reversible channels, which are unital channels.

**Proposition**

**A7.**

If $\mathcal{U}\in \mathsf{Transf}\left(\mathrm{A},\mathrm{B}\right)$ is a reversible channel, ${\mathcal{U}}^{\u2020}={\mathcal{U}}^{-1}$.

**Proof.**

We have
for any $\mathrm{S}$, $\rho $, E. Recalling Lemma 2, the RHS is $\u2329\rho ,\left(\mathcal{U}\otimes \mathcal{I}\right){E}^{\u2020}\u232a$. By Proposition 4 $\u2329\rho ,\left(\mathcal{U}\otimes \mathcal{I}\right){E}^{\u2020}\u232a=\u2329\left({\mathcal{U}}^{-1}\otimes \mathcal{I}\right)\rho ,{E}^{\u2020}\u232a,$ and by symmetry of the inner product we have that
whence the thesis follows. ☐

In particular we have that the dagger of the $\mathtt{SWAP}$ channel between two systems is the $\mathtt{SWAP}$ with the input and output systems reversed.

The orthogonal projectors of Section 5.2, on the other hand, are self-adjoint on single system.

**Proposition**

**A8.**

Given the orthogonal projector ${P}_{\mathsf{I}}$ on a face ${F}_{\mathsf{I}}$, we have ${P}_{\mathsf{I}}^{\u2020}\doteq {P}_{\mathsf{I}}$.

**Proof.**

For every $\rho $ and E, we have $\left(E|{P}_{\mathsf{I}}^{\u2020}|\rho \right)=\left({\rho}^{\u2020}\left|{P}_{\mathsf{I}}\right|{E}^{\u2020}\right)$. The RHS is $\u2329\rho ,{P}_{\mathsf{I}}{E}^{\u2020}\u232a$. By the properties of projectors,
This shows that ${P}_{\mathsf{I}}^{\u2020}\doteq {P}_{\mathsf{I}}$. ☐

$$\u2329\rho ,{P}_{\mathsf{I}}{E}^{\u2020}\u232a=\u2329{P}_{\mathsf{I}}\rho ,{E}^{\u2020}\u232a=\u2329{E}^{\u2020},{P}_{\mathsf{I}}\rho \u232a=\left(E|{P}_{\mathsf{I}}|\rho \right).$$

Finally we prove some properties of the dagger with respect to compositions. We need an easy lemma first.

**Lemma**

**A3.**

For every $\mathcal{A}\in \mathsf{Transf}\left(\mathrm{A},\mathrm{B}\right)$, every system $\mathrm{S},$ and every vector $\xi \in {\mathsf{St}}_{\mathbb{R}}\left(\mathrm{A}\otimes \mathrm{S}\right)$ we have

**Proof.**

Recall that $\mathcal{A}={\left({\mathcal{A}}^{\u2020}\right)}^{\u2020}$; by Definition A3 we have${\left({\mathcal{A}}^{\u2020}\right)}^{\u2020}\xi ={\left({\xi}^{\u2020}{\mathcal{A}}^{\u2020}\right)}^{\u2020}$
Taking the dagger of this equation yields the desired result. ☐

Now we can state the main results. The first concerns sequential composition.

**Proposition**

**A9.**

For all transformations $\mathcal{A}\in \mathsf{Transf}\left(\mathrm{A},\mathrm{B}\right)$, $\mathcal{B}\in \mathsf{Transf}\left(\mathrm{B},\mathrm{C}\right)$, one has ${\left(\mathcal{B}\mathcal{A}\right)}^{\u2020}={\mathcal{A}}^{\u2020}{\mathcal{B}}^{\u2020}$.

**Proof.**

Take any system $\mathrm{S}$, any state $\rho \in {\mathsf{St}}_{1}\left(\mathrm{C}\otimes \mathrm{S}\right)$, and any effect $E\in \mathsf{Eff}\left(\mathrm{A}\otimes \mathrm{S}\right)$. By Equation (A3) we have
Define $\xi $ as $\xi :=\left(\mathcal{A}\otimes \mathcal{I}\right){E}^{\u2020}$, so
By Lemma A3 ${\xi}^{\u2020}={\left[\left(\mathcal{A}\otimes \mathcal{I}\right){E}^{\u2020}\right]}^{\u2020}=E\left({\mathcal{A}}^{\u2020}\otimes \mathcal{I}\right)$, then
therefore ${\left(\mathcal{B}\mathcal{A}\right)}^{\u2020}={\mathcal{A}}^{\u2020}{\mathcal{B}}^{\u2020}$. ☐

Finally the dagger respects parallel composition. Again we need a lemma.

**Lemma**

**A4.**

For every $\mathcal{A}\in \mathsf{Transf}\left(\mathrm{A},\mathrm{B}\right)$, every systems $\mathrm{S}$ and ${\mathrm{S}}^{\prime}$, we have ${\left({\mathcal{I}}_{\mathrm{S}}\otimes \mathcal{A}\otimes {\mathcal{I}}_{{\mathrm{S}}^{\prime}}\right)}^{\u2020}={\mathcal{I}}_{\mathrm{S}}\otimes {\mathcal{A}}^{\u2020}\otimes {\mathcal{I}}_{{\mathrm{S}}^{\prime}}$.

**Proof.**

As a first step, let us prove that, for every system $\mathrm{S}$, we have ${\left(\mathcal{A}\otimes {\mathcal{I}}_{\mathrm{S}}\right)}^{\u2020}={\mathcal{A}}^{\u2020}\otimes {\mathcal{I}}_{\mathrm{S}}$. Take any system ${\mathrm{S}}^{\prime}$, any state $\rho \in {\mathsf{St}}_{1}\left(\mathrm{B}\otimes \mathrm{S}\otimes {\mathrm{S}}^{\prime}\right)$, and any effect $E\in \mathsf{Eff}\left(\mathrm{A}\otimes \mathrm{S}\otimes {\mathrm{S}}^{\prime}\right)$, Equation (A3) yields
Specialising Equation (A3) to the case of a composite system, we have
whence we conclude that ${\left(\mathcal{A}\otimes {\mathcal{I}}_{\mathrm{S}}\right)}^{\u2020}={\mathcal{A}}^{\u2020}\otimes {\mathcal{I}}_{\mathrm{S}}$.

Now let us prove that, for every system $\mathrm{S}$, ${\left({\mathcal{I}}_{\mathrm{S}}\otimes \mathcal{A}\right)}^{\u2020}={\mathcal{I}}_{\mathrm{S}}\otimes {\mathcal{A}}^{\u2020}$. Note that
By Proposition A9, and recalling what we have just proved, we have

To get the thesis, note that ${\left({\mathcal{I}}_{\mathrm{S}}\otimes \mathcal{A}\otimes {\mathcal{I}}_{{\mathrm{S}}^{\prime}}\right)}^{\u2020}={\left[\left({\mathcal{I}}_{\mathrm{S}}\otimes \mathcal{A}\right)\otimes {\mathcal{I}}_{{\mathrm{S}}^{\prime}}\right]}^{\u2020}$. We have just proved that
and that ${\left({\mathcal{I}}_{\mathrm{S}}\otimes \mathcal{A}\right)}^{\u2020}={\mathcal{I}}_{\mathrm{S}}\otimes {\mathcal{A}}^{\u2020}$, therefore we conclude that ${\left({\mathcal{I}}_{\mathrm{S}}\otimes \mathcal{A}\otimes {\mathcal{I}}_{{\mathrm{S}}^{\prime}}\right)}^{\u2020}={\mathcal{I}}_{\mathrm{S}}\otimes {\mathcal{A}}^{\u2020}\otimes {\mathcal{I}}_{{\mathrm{S}}^{\prime}}$. ☐

$${\left[\left({\mathcal{I}}_{\mathrm{S}}\otimes \mathcal{A}\right)\otimes {\mathcal{I}}_{{\mathrm{S}}^{\prime}}\right]}^{\u2020}={\left({\mathcal{I}}_{\mathrm{S}}\otimes \mathcal{A}\right)}^{\u2020}\otimes {\mathcal{I}}_{{\mathrm{S}}^{\prime}},$$

**Proposition**

**A10.**

Let $\mathcal{A}\in \mathsf{Transf}\left(\mathrm{A},\mathrm{B}\right)$, and $\mathcal{B}\in \mathsf{Transf}\left(\mathrm{C},\mathrm{D}\right)$. We have ${\left(\mathcal{A}\otimes \mathcal{B}\right)}^{\u2020}={\mathcal{A}}^{\u2020}\otimes {\mathcal{B}}^{\u2020}$.

**Proof.**

Take any system $\mathrm{S}$, any state $\rho \in {\mathsf{St}}_{1}\left(\mathrm{B}\otimes \mathrm{D}\otimes \mathrm{S}\right)$, and any effect $E\in \mathsf{Eff}\left(\mathrm{A}\otimes \mathrm{C}\otimes \mathrm{S}\right)$, we have
Now define $\xi :=\left({\mathcal{I}}_{\mathrm{A}}\otimes \mathcal{B}\otimes {\mathcal{I}}_{\mathrm{S}}\right){E}^{\u2020}$, hence
By Lemmas A3 and A4, we have that ${\xi}^{\u2020}=E\left({\mathcal{I}}_{\mathrm{A}}\otimes {\mathcal{B}}^{\u2020}\otimes {\mathcal{I}}_{\mathrm{S}}\right)$, so
whence the thesis. ☐

This means that the dagger respects the composition of diagrams, and corresponds to the action of flipping a diagram with respect to a vertical axis.

## References

- Feynman, R.P.; Leighton, R.; Sands, M. The Feynman Lectures on Physics. The Definitive and Extended Edition; Addison Wesley: Boston, MA, USA, 2005. [Google Scholar]
- Sorkin, R.D. Quantum mechanics as quantum measure theory. Mod. Phys. Lett. A
**1994**, 9, 3119–3127. [Google Scholar] [CrossRef] - Sorkin, R.D. Quantum Classical Correspondence: The 4th Drexel Symposium on Quantum Nonintegrability; Chapter Quantum Measure Theory and Its Interpretation; International Press: Boston, MA, USA, 1997; pp. 229–251. [Google Scholar]
- Barnum, H.; Müller, M.P.; Ududec, C. Higher-order interference and single-system postulates characterizing quantum theory. New J. Phys.
**2014**, 16, 123029. [Google Scholar] [CrossRef] - Bolotin, A. On the ongoing experiments looking for higher-order interference: What are they really testing? arXiv
**2016**. [Google Scholar] - Dakić, B.; Paterek, T.; Brukner, Č. Density cubes and higher-order interference theories. New J. Phys.
**2014**, 16, 023028. [Google Scholar] [CrossRef] - Lee, C.M.; Selby, J.H. Deriving grover’s lower bound from simple physical principles. New J. Phys.
**2016**, 18, 093047. [Google Scholar] [CrossRef] - Lee, C.M.; Selby, J.H. Generalised phase kick-back: The structure of computational algorithms from physical principles. New J. Phys.
**2016**, 18, 033023. [Google Scholar] [CrossRef] - Lee, C.M.; Selby, J.H. Higher-order interference in extensions of quantum theory. Found. Phys.
**2017**, 47, 89–112. [Google Scholar] [CrossRef] - Niestegge, G. Three-slit experiments and quantum nonlocality. Found. Phys.
**2013**, 43, 805–812. [Google Scholar] [CrossRef] - Ududec, C. Perspectives on the Formalism of Quantum Theory. Ph.D. Thesis, University of Waterloo, Waterloo, ON, Canada, 2012. [Google Scholar]
- Ududec, C.; Barnum, H.; Emerson, J. Probabilistic Interference in Operational Models. 2009; in preparation. [Google Scholar]
- Ududec, C.; Barnum, H.; Emerson, J. Three slit experiments and the structure of quantum theory. Found. Phys.
**2011**, 41, 396–405. [Google Scholar] [CrossRef] - Lee, C.M.; Selby, J.H. A no-go theorem for theories that decohere to quantum mechanics. arXiv
**2017**. [Google Scholar] - Barnum, H.; Barrett, J.; Leifer, M.; Wilce, A. Generalized no-broadcasting theorem. Phys. Rev. Lett.
**2007**, 99, 240501. [Google Scholar] [CrossRef] [PubMed] - Barnum, H.; Wilce, A. Information processing in convex operational theories. Electron. Notes Theor. Comput. Sci.
**2011**, 270, 3–15. [Google Scholar] [CrossRef] - Barrett, J. Information processing in generalized probabilistic theories. Phys. Rev. A
**2007**, 75, 032304. [Google Scholar] [CrossRef] - Barrett, J.; de Beaudrap, N.; Hoban, M.J.; Lee, C.M. The computational landscape of general physical theories. arXiv
**2017**. [Google Scholar] - Chiribella, G.; D’Ariano, G.M.; Perinotti, P. Probabilistic theories with purification. Phys. Rev. A
**2010**, 81, 062348. [Google Scholar] [CrossRef] - Chiribella, G.; D’Ariano, G.M.; Perinotti, P. Informational derivation of quantum theory. Phys. Rev. A
**2011**, 84, 012311. [Google Scholar] [CrossRef] - Chiribella, G.; Spekkens, R.W. (Eds.) Quantum Theory: Informational Foundations and Foils; Fundamental Theories of Physics; Springer: Dordrecht, The Netherlands, 2016; Volume 181. [Google Scholar]
- Dakić, B.; Brukner, Č. Quantum Theory and Beyond: Is Entanglement Special; Cambridge University Press: Cambridge, UK, 2011; pp. 365–392. [Google Scholar]
- Hardy, L. Quantum Theory From Five Reasonable Axioms. arXiv
**2001**. [Google Scholar] - Hardy, L. Foliable Operational Structures for General Probabilistic Theories; Cambridge University Press: Cambridge, UK, 2011; pp. 409–442. [Google Scholar]
- Lee, C.M.; Barrett, J. Computation in generalised probabilistic theories. New J. Phys.
**2015**, 17, 083001. [Google Scholar] [CrossRef] - Lee, C.M.; Hoban, M.J. Bounds on the power of proofs and advice in general physical theories. Proc. R. Soc. A
**2016**, 472, 20160076. [Google Scholar] [CrossRef] [PubMed] - Lee, C.M.; Hoban, M.J. The information content of systems in general physical theories. In Proceedings of the 7th International Workshop on Physics and Computation, Manchester, UK, 14 July 2016; Volume 214, pp. 22–28. [Google Scholar]
- Masanes, L.; Müller, M.P. A derivation of quantum theory from physical requirements. New J. Phys.
**2011**, 13, 063001. [Google Scholar] [CrossRef] - Hardy, L. Reformulating and reconstructing quantum theory. arXiv
**2011**. [Google Scholar] - Chiribella, G.; Scandolo, C.M. Entanglement as an axiomatic foundation for statistical mechanics. arXiv
**2016**. [Google Scholar] - Krumm, M.; Barnum, H.; Barrett, J.; Müller, M.P. Thermodynamics and the structure of quantum theory. New J. Phys.
**2017**, 19, 043025. [Google Scholar] [CrossRef] - Jin, F.; Liu, Y.; Geng, J.; Huang, P.; Ma, W.; Shi, M.; Duan, C.; Shi, F.; Rong, X.; Du, J. Experimental test of born’s rule by inspecting third-order quantum interference on a single spin in solids. Phys. Rev. A
**2017**, 95, 012107. [Google Scholar] [CrossRef] - Kauten, T.; Keil, R.; Kaufmann, T.; Pressl, B.; Brukner, Č.; Weihs, G. Obtaining tight bounds on higher-order interferences with a 5-path interferometer. New J. Phys.
**2017**, 19, 033017. [Google Scholar] [CrossRef] - Park, D.K.; Moussa, O.; Laflamme, R. Three path interference using nuclear magnetic resonance: A test of the consistency of born’s rule. New J. Phys.
**2012**, 14, 113025. [Google Scholar] [CrossRef] - Sinha, A.; Vijay, A.H.; Sinha, U. On the superposition principle in interference experiments. Sci. Rep.
**2015**, 5, 10304. [Google Scholar] [CrossRef] [PubMed] - Sinha, U.; Couteau, C.; Jennewein, T.; Laflamme, R.; Weihs, G. Ruling out multi-order interference in quantum mechanics. Science
**2010**, 329, 418–421. [Google Scholar] [CrossRef] [PubMed] - Barnum, H.; Graydon, M.; Wilce, A. Composites and categories of Euclidean Jordan algebras. arXiv
**2016**. [Google Scholar] - Chiribella, G. Dilation of states and processes in operational-probabilistic theories. In Proceedings of the 11th workshop on Quantum Physics and Logic, Kyoto, Japan, 4–6 June 2014; Volume 172, pp. 1–14. [Google Scholar]
- Chiribella, G.; D’Ariano, G.M.; Perinotti, P. Quantum Theory: Informational Foundations and Foils; Chapter Quantum from Principles; Springer: Dordrecht, The Netherlands, 2016; pp. 171–221. [Google Scholar]
- Hardy, L. Quantum Theory: Informational Foundations and Foils; Chapter Reconstructing Quantum Theory; Springer: Dordrecht, The Netherlands, 2016; pp. 223–248. [Google Scholar]
- Abramsky, S.; Coecke, B. A categorical semantics of quantum protocols. In Proceedings of the 19th Annual IEEE Symposium on Logic in Computer Science, Turku, Finland, 13–17 July 2004; pp. 415–425. [Google Scholar]
- Coecke, B. Kindergarten quantum mechanics: Lecture notes. AIP Conf. Proc.
**2006**, 810, 81–98. [Google Scholar] - Coecke, B. Quantum picturalism. Contemp. Phys.
**2010**, 51, 59. [Google Scholar] [CrossRef] - Coecke, B.; Duncan, R.; Kissinger, A.; Wang, Q. Quantum Theory: Informational Foundations and Foils; Chapter Generalised Compositional Theories and Diagrammatic Reasoning; Springer: Dordrecht, The Netherlands, 2016; pp. 309–366. [Google Scholar]
- Coecke, B.; Kissinger, A. Picturing Quantum Processes: A First Course in Quantum Theory and Diagrammatic Reasoning; Cambridge University Press: Cambridge, UK, 2017. [Google Scholar]
- Selinger, P. A survey of graphical languages for monoidal categories. In New Structures for Physics; Coecke, B., Ed.; Springer: Berlin, Germany, 2011; pp. 289–356. [Google Scholar]
- Wootters, W.K. Local accessibility of quantum states. In Complexity, Entropy and the Physics of Information; Zurek, W.H., Ed.; Westview Press: Boulder, CO, USA, 1990; pp. 39–46. [Google Scholar]
- Chiribella, G.; Scandolo, C.M. Entanglement and thermodynamics in general probabilistic theories. New J. Phys.
**2015**, 17, 103027. [Google Scholar] [CrossRef] - Chiribella, G.; Scandolo, C.M. Purity in microcanonical thermodynamics: A tale of three resource theories. arXiv
**2016**. [Google Scholar] - Gour, G.; Müller, M.P.; Narasimhachar, V.; Spekkens, R.W.; Yunger Halpern, N. The resource theory of informational nonequilibrium in thermodynamics. Phys. Rep.
**2015**, 583, 1–58. [Google Scholar] [CrossRef] - Horodecki, M.; Horodecki, P.; Oppenheim, J. Reversible transformations from pure to mixed states and the unique measure of information. Phys. Rev. A
**2003**, 67, 062104. [Google Scholar] [CrossRef] - Selby, J.H.; Coecke, B. Leaks: Quantum, classical, intermediate, and more. Entropy
**2017**, 19, 174. [Google Scholar] [CrossRef] - Coecke, B. Terminality implies non-signalling. In Proceedings of the 11th workshop on Quantum Physics and Logic, Kyoto, Japan, 4–6 June 2014; Volume 172, pp. 27–35. [Google Scholar]
- Chiribella, G.; Scandolo, C.M. Operational axioms for diagonalizing states. In Proceedings of the 12th International Workshop on Quantum Physics and Logic, Oxford, UK, 15–17 July 2015; Volume 195, pp. 96–115. [Google Scholar]
- Chiribella, G.; Scandolo, C.M. Conservation of information and the foundations of quantum mechanics. EPJ Web Conf.
**2015**, 95, 03003. [Google Scholar] [CrossRef] - Disilvestro, L.; Markham, D. Quantum protocols within Spekkens’ toy model. Phys. Rev. A
**2017**, 95, 052324. [Google Scholar] [CrossRef] - D’Ariano, G.M.; Manessi, F.; Perinotti, P.; Tosini, A. Fermionic computation is non-local tomographic and violates monogamy of entanglement. Europhys. Lett.
**2014**, 107, 20009. [Google Scholar] [CrossRef] - D’Ariano, G.M.; Manessi, F.; Perinotti, P.; Tosini, A. The Feynman problem and fermionic entanglement: Fermionic theory versus qubit theory. Int. J. Mod. Phys. A
**2014**, 29, 1430025. [Google Scholar] [CrossRef] - Chiribella, G.; Yuan, X. Bridging the gap between general probabilistic theories and the device-independent framework for nonlocality and contextuality. Inf. Comput.
**2016**, 250, 15–49. [Google Scholar] [CrossRef] - Pfister, C.; Wehner, S. An information-theoretic principle implies that any discrete physical theory is classical. Nat. Commun.
**2013**, 4, 1851. [Google Scholar] [CrossRef] [PubMed] - Alfsen, E.M.; Shultz, F.W. Geometry of State Spaces of Operator Algebras; Mathematics Theory & Applications; Birkhäuser: Basel, Switzerland, 2003. [Google Scholar]
- Barnum, H.; Barrett, J.; Krumm, M.; Müller, M.P. Entropy, majorization and thermodynamics in general probabilistic theories. In Proceedings of the 12th International Workshop on Quantum Physics and Logic, Oxford, UK, 15–17 July 2015; Volume 195, pp. 43–58. [Google Scholar]
- Chiribella, G.; Yuan, X. Measurement sharpness cuts nonlocality and contextuality in every physical theory. arXiv
**2014**. [Google Scholar] - Alfsen, E.M.; Shultz, F.W. State spaces of Jordan algebras. Acta Math.
**1978**, 140, 155–190. [Google Scholar] [CrossRef] - Iochum, B. Cônes Autopolaires et Algèbres de Jordan; Lecture Notes in Mathematics; Springer: Berlin/Heidelberg, Germany, 1358; Volume 1049, (In French). [Google Scholar] [CrossRef]
- Coecke, B.; Selby, J.; Tull, S. Two roads to classicality. arXiv
**2017**. [Google Scholar] - Barnum, H. Spectrality as a Tool for Quantum Reconstruction: Higher-Order Interference, Jordan State Space Characterizations, Aug. 2009. Talk Given at the Conference “Reconstructing Quantum Theory”, August 9–11, Perimeter Institute for Theoretical Physics. Available online: http://pirsa.org/09080016/ (accessed on 26 May 2017).
- Niestegge, G. Conditional probability, three-slit experiments, and the jordan algebra structure of quantum mechanics. Adv. Math. Phys.
**2012**, 2012, 156573. [Google Scholar] [CrossRef] - Selby, J.H.; Coecke, B. Process-theoretic characterisation of the hermitian adjoint. arXiv
**2016**. [Google Scholar] - Müller, M.P.; Oppenheim, J.; Dahlsten, O.C.O. The black hole information problem beyond quantum theory. J. High Energy Phys.
**2012**, 2012, 9. [Google Scholar] [CrossRef] - Müller, M.P.; Ududec, C. Structure of reversible computation determines the self-duality of quantum theory. Phys. Rev. Lett.
**2012**, 108, 130401. [Google Scholar] [CrossRef] [PubMed] - Marshall, A.W.; Olkin, I.; Arnold, B.C. Inequalities: Theory of Majorization and Its Applications; Springer Series in Statistics; Springer: New York, NY, USA, 2011. [Google Scholar]
- Müller, M.P.; Dahlsten, O.C.O.; Vedral, V. Unifying typical entanglement and coin tossing: On randomization in probabilistic theories. Commun. Math. Phys.
**2012**, 316, 441–487. [Google Scholar] [CrossRef] - Wilde, M.M. Quantum Information Theory, 2nd ed.; Cambridge University Press: Cambridge, UK, 2017. [Google Scholar]
- Selinger, P. Dagger compact closed categories and completely positive maps. Electron. Notes Theor. Comput. Sci.
**2007**, 170, 139–163. [Google Scholar] [CrossRef]

© 2017 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).