1. Introduction
Could there be a quantum superposition of conscious states? This question was raised by Eugene Wigner in the thought experiment now known as Wigner’s Friend. He imagined his friend, in a sealed laboratory, performing a quantum measurement while Wigner himself remains ignorant of the outcome. According to the standard quantum description, the laboratory would then be represented by a superposition of different measurement results, which would appear to imply that his friend was in a superposition of experiencing those results. Wigner argued that this was “absurd because it implies that my friend was in a state of suspended animation” and concluded that “consciousness must have a different role in quantum mechanics than the inanimate measuring device” (p. 180, [
1]). This led him to suggest that consciousness itself might be responsible for wave-function collapse.
Since Wigner’s proposal, there has been extensive debate over whether conscious states could be superposed and how such scenarios should be interpreted. Extensions of the Wigner’s Friend experiment have sharpened the epistemological and consistency issues involved [
2,
3,
4,
5,
6], and many-worlds and many-minds interpretations have offered accounts of how superpositions of experiences might be understood [
7,
8,
9,
10,
11,
12,
13,
14,
15]. However, much of this discussion proceeds without a precise physical criterion for which states are conscious and how different conscious states are to be compared. Without such a criterion, the question of whether conscious states can be superposed and what dynamical consequences this would have remains underdetermined.
Recent neuroscience has seen the development of quantitative theories that aim to relate physical systems to conscious experience. Among these, integrated information theory (IIT) provides an especially explicit mathematical framework, assigning to a system both a quantitative amount of consciousness, denoted
, and a structural description of its conscious state (its “
-structure”) [
16,
17,
18,
19,
20]. The neuroscience of consciousness remains a highly contested field with several competing approaches; for a broad overview see [
21]. IIT has been argued to be consistent with a number of experimental findings [
22,
23,
24,
25,
26,
27,
28]. At the same time, substantial criticisms of IIT have been raised [
29,
30,
31,
32,
33,
34,
35]. Our use of IIT does not depend on its ultimate correctness. Rather, its importance here is methodological: it provides a mathematically precise mapping between physical states and conscious states, making it possible, in principle, to formulate and analyze physical models whose dynamics depend on consciousness.
Once a mathematically explicit characterization of conscious states is available, Wigner’s suggestion can be formulated as a concrete physical proposal. In particular, models have been developed in which wave-function collapse is governed by measures of integrated information associated with a system [
36,
37]. The model of [
37], which we analyze here, specifies collapse rates as a function of differences between the conscious states associated with the superposed physical states. Because scalable quantum computations would involve superpositions of distinct conscious states in this sense, the model predicts a breakdown of universal quantum computing and thus bears on recent no-go results in quantum foundations, especially the Local Friendliness theorem [
6].
A key motivation for such proposals is experimental testability. In comparison to standard collapse models [
38], it has been argued that IIT-based consciousness-collapse models may be experimentally testable with comparatively small systems. In particular, Chalmers and McQueen suggest that suitably designed quantum computers could enter superpositions of conscious states and thereby provide an accessible probe of collapse dynamics [
37]. Central to this proposal is a specific dynamical requirement: collapse rates should depend on how different the associated conscious states are. Superpositions corresponding to very different experiences should collapse rapidly, whereas “small” superpositions, involving only minor experiential differences, should persist longer. This requirement is not merely aesthetic. Collapse theories cannot simply impose rapid collapse universally: empirical considerations already pressure such models to allow persistent small superpositions, as reflected in issues such as the tails problem [
39] and tensions with the quantum Zeno effect [
40]. Making collapse rates track differences between conscious states is intended to provide a principled way to satisfy these constraints, and it is this requirement that we analyze in the present paper.
We propose a simple quantum circuit which, if implemented, would place a minimal physical system into a superposition of conscious states, according to the version of IIT used in [
37]. This controlled setting allows us to analyze, with a very simple case, what structure a consciousness-based collapse model must incorporate if collapse rates are to track differences between conscious states. In particular, it enables us to assess whether the dynamical complexity required to implement this dependence is compatible with the claim that such models are especially simple or experimentally tractable [
41].
To make matters concrete, we follow [
37] and consider a feedback dyad. Classically, the dyad has four possible states:
,
,
, and
. Each state is predicted by IIT to possess a small but non-zero amount of consciousness, and this feature is stable across successive IIT formalisms. In IIT2.0 each state has
, in IIT3.0 each has
[
42], and in
Appendix B we show that in IIT4.0 each has
. Although the quantitative value varies, the prediction that the dyad realizes non-zero consciousness is robust, making it a natural minimal test case.
Although these states have the same
amount of consciousness, the version of IIT employed in [
37] assigns them different
states of consciousness, as reviewed in
Section 3. A quantum superposition of two dyad states therefore constitutes, within that model, a superposition of conscious states. We refer to this system as
Schrödinger’s dyad and present a simple quantum circuit that serves as a blueprint for its realization. In
Section 4 we describe a possible physical implementation of this circuit, in which two photons enter a feedback loop inside an optical cable.
In
Section 5 we analyze collapse dynamics satisfying the requirement that collapse rates depend on differences between the conscious states associated with the superposed basis states. Using the dyad’s conscious-state structure defined in
Section 3, we construct a corresponding Lindblad-type collapse model and study its dynamics. Our main result is a structural constraint on such models. If collapse is implemented with too few collapse operators—in particular, with a single operator—then collapse rates cannot, in general, be determined solely by the pairwise qualitative differences between conscious states. The obstruction follows from basic features of quantum dynamics: as we will show, the decay rates depend on the Euclidean distances between tuples of eigenvalues of the
N collapse operators, and these distances can reflect the qualitative differences of conscious experience only if
N is large enough.
We then show how this limitation can be evaded, and at what cost. Introducing sufficiently many commuting collapse operators can, in principle, reproduce the desired dependence of collapse rates on qualitative differences between conscious states. However, doing so leads to a rapid proliferation of operators even for very simple systems. This bears directly on claims that IIT-based consciousness-collapse models are simple or experimentally tractable, since the complexity of the collapse dynamics is itself part of what determines what an experimental test would involve.
Finally, we show that this complexity does not depend on the particular version of IIT employed in [
37], which uses the IIT3.0 formalism [
18] and the representation of conscious states by “Q-shapes”. The recently proposed IIT4.0 framework [
20] instead characterizes conscious states in terms of
-structures. In this formalism the four classical states of the dyad correspond to the same
-structure and hence to a single conscious state. Nevertheless, as shown in
Section 6, collapse models based on IIT4.0 encounter an analogous difficulty. The reason is that
-structures include
relations linking different distinctions in experience, and the number of such relations grows rapidly with system size. For example, the three-node system in Figure 8A of [
20] contains 60 relations, while the five-node system in Figure 7A contains more than 13,000. A collapse model whose rates track conscious structure must therefore incorporate a corresponding number of dynamical terms, again leading to operator proliferation. We further show that proposed quantum extensions of IIT [
43,
44,
45] do not remove this constraint and in fact substantially increase the required complexity.
The operator proliferation arises in IIT-based collapse models regardless of which IIT formalism is used. As we will show, this is because conscious states in IIT are individuated by counterfactual causal structure encoded in the system’s transition probability matrix (TPM). The significance of this result may extend beyond IIT. Any theory that individuates experiences by more than the instantaneous state of a physical substrate—for example, by the organization or connectivity of the underlying system in addition to its momentary activation—will introduce multiple independent features that distinguish one experience from another. A collapse dynamics that tracks such experiential differences must therefore contain correspondingly many independent collapse operators. The proliferation we identify may thus reflect a general constraint on consciousness-based collapse proposals.
2. The Feedback Dyad
The classical dyad is a simple system consisting of two elements or channels, A and B, that simply swap their states from one time step to the next. That is, if at some time,
, A is in state 1 and B is in state 0, then at the next time step,
, A is in state 0 and B is in state 1. The action on these channels is equivalent to a logical SWAP gate, which is given a simple diagrammatic representation in
Figure 1.
The figure makes it clear that there are three distinct levels of description to the dyad: channels, channel values, and channel relationships. A and B are the channels that are related via the logical SWAP gate in such a way as to exchange their values. In the language of quantum information, the channels are systems, the channel values are states, and the channel relationships are transformations. The SWAP gate is a transformation of the states of systems A and B. The gate itself is never “in a state” on its own.
This distinction matters because in some presentations of IIT—especially in IIT3.0—elements are described in a way that blurs the line between being in a state and implementing a transformation (for example, the abstract of [
18] states that the axioms of IIT are “formalized into postulates that prescribe how physical mechanisms, such as neurons or logic gates, must be configured to generate experience,” and the first postulate asserts that “mechanisms in a state exist,” which can be read as treating logic gates themselves as being in states). In the IIT3.0 formalism, nodes are assigned binary values while also being treated as mechanisms that map the states of other nodes to future states. When the elements are understood as neurons, this dual description is often unproblematic: an “active” or “inactive” neuron can be naturally interpreted as one that is, or is not, transmitting a signal along a channel [
46]. At this level of abstraction, it is convenient to treat neurons as both state-bearers and causal mechanisms. This representational practice has clear precedents in the Boolean network tradition, where nodes are routinely assigned states while also implementing logical update rules [
47,
48]. However, once one moves to a quantum setting, this ambiguity can no longer be ignored. In quantum theory, superposition applies to the states of physical systems, not to transformations or update rules themselves. Gates implement dynamical relations between systems, but are not entities that can meaningfully be said to be in superposition. For the purposes of the present analysis, we therefore treat the channels (or systems) and their states as quantum, while the SWAP operation is taken to be a fixed transformation acting on those states.
Figure 1 also highlights a fundamental causal dependence in the dyad. The output of channel A causally depends on the input to channel B and vice-versa. In order to emphasize this point, we use capital letters to identify the channels or systems themselves and lowercase letters to identify the values the channels can attain, i.e., their states. One could think of the SWAP gate as a black box with the channels simply identifying the locations of the inputs and outputs of the box. Values are fed into the inputs and then produced by the outputs.
To develop a feedback system with this SWAP gate we simply feed the outputs directly back into the inputs. For simplicity we can represent this system over a series of time steps in the manner shown in
Figure 2.
The output at a given time step is given as in
Figure 1. For example, if the system at a given time step is given by
with
where the first element of the set is the state of channel A and the second is the state of channel B, if the inputs were
, the evolution of the system state over time is just
.
Creating Schrödinger’s dyad then requires that we treat the channels as quantum and represent their states as such. That is, a classical state
is equivalent to a pure quantum state in the so-called computational basis
. A superposition of the
and
states can be achieved by feeding the superposition state
into channel
B at
. For example, if the input state to the dyad as a whole at
is
then it evolves into the following state at
:
This is not a superposition of
values, since all four possible classical states of the dyad have the same
value. It
is, however, a superposition of distinct states of consciousness according to the version of IIT used in [
37], as we show in
Section 3. The state at time
of Equation (
3) is therefore described as a superposition of qualitatively distinct states of consciousness. To understand this distinction between levels and states of consciousness intuitively, compare yourself experiencing a green screen with experiencing a blue screen. It might be that these two experiences do not correspond to any difference in
(why would changing only the color change the amount of consciousness?). Now imagine that we put a subject into a superposition of experiencing a blue screen and experiencing a green screen. By assumption this is not a
superposition, but it is clearly a superposition of distinct conscious experiences. One might doubt that distinct human states of consciousness would in practice have exactly identical
values [
49], but IIT allows for this in AI (especially in neuromorphic computing), and IIT3.0 and IIT4.0 predict that this is indeed the case for many very simple systems.
Note that no classical IIT formalism (whether IIT3.0 or IIT4.0) by itself entails this result, since these formalisms describe conscious structure for classical systems and do not specify how such systems behave when placed in quantum superposition. However, classical IIT does aspire to say something about the consciousness of actually existing systems, in particular those that are well described by classical probability theory, such as neural networks or classical circuits. IIT then inevitably claims non-zero for some very small or simple actually existing systems. Some of these systems are small and simple enough to be put in superposition states. This will result in superpositions of states that IIT claims are (differently) conscious.
The reasoning here is similar to Schrödinger’s in the Schrödinger’s cat thought experiment and to Wigner’s in the Wigner’s friend thought experiment. But we make no claim about how the superposition of states should be
interpreted. As an analogy, if we know classically what it means for a spin-1/2 particle to have spin-up or spin-down in the
z-direction, we can talk about the “superposition of up and down”. Using these words does not make any claims about how to interpret the resulting quantum state. We need more input from quantum theory (say, group representation theory) to determine that the resulting state actually describes spin pointing up in the
x-direction. Similarly, perhaps a more general IIT, once properly developed, might say more about the superposition in our case, but that is a question for another day. We now turn to the formal description of the conscious states used in the collapse model in [
37], using the dyad as a simple illustration.
3. Calculating the States of Consciousness (Q-Shapes) of the Feedback Dyad
In IIT, the amount of consciousness of a system is quantified by the scalar value
, but the theory also associates each physical state with a structural description of its conscious content. In the collapse model of [
37], this qualitative structure is represented by a state’s “Q-shape”, a notion inspired by the IIT3.0 formalism (IIT’s mathematical description of conscious states has appeared under several labels. In IIT3.0 [
18] it is primarily called a “maximally irreducible conceptual structure” (MICS), and is sometimes described as a “shape in qualia space”, motivating the simpler term “Q-shape” in [
37]. In IIT4.0 [
20] a related role is played by a “
-structure”, which we discuss in
Section 6. We note that IIT3.0’s formalism admits different interpretive stances regarding whether such structures are defined only up to relabeling symmetries (an intrinsic reading) or whether their explicit coordinate representations can be taken as individuating qualitative states. The collapse model of [
37] adopts the latter reading, and our use of “Q-shape” follows that proposal. Our main results do not depend on this reading, as shown in
Section 6. The role of Q-shapes in the present paper is methodological: they provide an explicit way of individuating conscious states and comparing them. In this section we construct the Q-shapes for all four classical states of the dyad and show that they are distinct. Within the collapse model, these differences correspond to different conscious states.
The dyad states
and
are assigned the same value
in IIT3.0 (see [
37,
42] for explicit calculations), yet in the collapse model in [
37] they are treated as qualitatively different. The difference arises when the system is partitioned—i.e., when certain causal connections are replaced by noise—and one examines the probability distributions obtained by evolving the partitioned system forward and backward in time. These distributions are not the same for
and
, and the Q-shape records this difference. Thus the Q-shape distinguishes dyad states even when their
values coincide. In more complex systems, the contributions of different subsystems must also be weighted by their integrated information
. In the dyad, however,
, so no additional weighting is required and the qualitative structure can be represented directly by simple matrices.
We begin with part A, when the dyad is in state
. The prescription of
partitioning means that we replace the complement of A (that is, B) by noise, i.e., an equiprobable distribution of 0 and 1, while keeping
A in state 1. Evolving this forward in time, we obtain a probability distribution
, where we have labelled the four states in lexicographical order:
. Computing the corresponding backward (cause) distribution over the previous time step yields exactly the same probability distribution. This gives us the first two rows in the Q-shape matrix
The third and fourth rows are the forward (effect) and backward (cause) probability distributions that we obtain if we consider the subsystem B instead, keeping it in state 0 and replacing A by noise as above. Thus, the Q-shape of a given state (such as
) is a collection of four probability distributions over the four dyad states, represented by the four rows in our representation matrix.
Performing the calculation for the other dyad states (which have identical
), we obtain
Our Q-shapes are not literal geometric “shapes”; they are matrices encoding the forward and backward probability distributions associated with each subsystem. The terminology is motivated by the IIT3.0 visualizations in [
18] (see especially Figures 10–12), where qualitative structure is depicted as a configuration in a high-dimensional space. Rather than working with the geometric picture, we follow [
37] and use an explicit coordinate representation of the same information.
In the dyad, each mechanism (the single-element subsystems
A and
B) contributes a
pair of probability distributions over the four system states: its effect distribution and its cause distribution. Concatenating these two four-component distributions yields a vector in
. For
in Equation (
4), the first two rows (effect and cause for
A) determine the point associated with
A, and the last two rows determine the point associated with
B. The Q-shape of a dyad state can therefore be represented by the pair of such points.
In full IIT3.0, these structures (or “concepts”) are additionally weighted by their integrated-information values . In the dyad, however, for all four classical states, so the weights do not differentiate the Q-shapes. Accordingly, we suppress these uniform weights and represent each dyad Q-shape solely by the associated probability distributions.
IIT predicts that the dyad has a non-zero value of
and hence, within the theory, qualifies as minimally conscious. In the collapse model of [
37], the four dyad states are further treated as corresponding to distinct qualitative states via their Q-shapes. It is therefore natural to ask what, if anything, such a system might experience [
50,
51]. We do not pursue that question here. For comparison, (Figure 19, [
18]) discusses the possible experience of a simple photodiode system with non-zero
. Whether or not such interpretations are ultimately correct is not important for our purposes. The present analysis concerns only the mathematical representation of conscious states and the dynamical constraints that follow if collapse rates depend on differences between them.
Before turning to collapse dynamics, we briefly discuss how a system of the kind just described might be physically instantiated. The next section therefore considers a simple optical implementation of the dyad and how the relevant causal units should be identified.
5. A Simple Consciousness-Collapse Model
In [
37], a dynamical collapse model is proposed in which superpositions of distinct Q-shapes (qualitative conscious states) are dynamically unstable and tend to collapse. The goal was to formulate a consciousness-collapse model that is at least as rigorous as standard collapse models [
38]. The authors conclude that their resulting model “is not as simple or powerful as some of the leading interpretations […] Nevertheless, it at least serves as an existence proof for a relatively precise consciousness-collapse model” (p. 50, [
37]). The model has since been discussed as a potential response to the Local Friendliness theorem [
6], which forces the introduction of a new assumption that the model denies, an assumption Wiseman et al. call “Universal Quantum Computing” [
6]. The model also shows how the IIT formalism can be extended to different types of physical theories, in this case to a theory with collapse.
The claim that IIT-based consciousness-collapse models are comparatively simple or experimentally tractable deserves careful scrutiny. In this section we identify a structural constraint on a broad class of Lindblad-form collapse models: if collapse is governed by too few collapse operators—most notably, a single operator—then the pairwise collapse rates between distinct pairs of conscious states cannot, in general, be made to depend only on their pairwise qualitative differences. Conversely, if one introduces enough collapse operators to recover the desired dependence on qualitative differences, one obtains a rapid proliferation of operators even for very simple systems such as the dyad. This yields a dilemma:
Either the model uses few collapse operators, in which case collapse rates fail to track conscious structure in the intended pairwise way; or it uses many collapse operators, in which case the dynamics becomes surprisingly complex even for small systems.
To establish the first horn of the dilemma, we define a simple collapse model which has just one collapse operator, whose eigenstates correspond to total states of consciousness, i.e., Q-shapes. For such a model, we prove that it is impossible to choose the eigenvalues of a single collapse operator so that, for every pair of basis states, the collapse rate of their superposition is large if and only if the corresponding states of consciousness are qualitatively very different. While our formal result concerns the case of a single collapse operator, the underlying source of the problem—the mismatch between the number of collapse parameters and the number of pairwise qualitative distances—suggests that similar constraints will arise whenever collapse is governed by too few operators. As we show below, this difficulty can be avoided by introducing additional collapse operators, but at the cost of a rapid increase in dynamical complexity, even for the simple dyad.
The model in [
37] uses the following general form for continuous collapse models (p. 27, [
52]):
The first term on the right-hand side of the equation represents Schrödinger evolution, while the remaining two terms represent the collapse evolution. Here,
is the Hamiltonian of the system,
is a real-valued parameter governing the collapse rate,
is a collapse operator whose eigenstates the system collapses towards,
is its expected value at time
t, and
is a noise process which ensures that collapse happens stochastically at a rate determined by a measure of difference between the superposed
eigenstates.
The pure state
therefore evolves stochastically. The totality of all predictions that quantum physics makes for measurement outcomes in experiments is given by the expectation values
, where
is any observable and
is the density operator that corresponds to
. However, we have no access to the actual realization of the noise
and the resulting pure-state trajectory. Hence, the best possible prediction of measurement outcomes (and, equivalently, the statistics we will record on repeating the corresponding experiment a large number of times) will correspond to the expectation values
, where
is the density operator averaged over all realizations of the noise process. As a consequence of (
6), this resulting state evolves according to the Lindblad equation
(for the derivation see, e.g., [
52]). The system can evolve via Schrödinger dynamics, via collapse, or via some combination of the two. To understand the collapse term we can ignore the Schrödinger dynamics term by setting its Hamiltonian to zero,
. The collapse term only has an effect when the system is in a superposition of eigenstates of
. In this situation, the double commutator will be non-zero and the state will evolve. The “speed” at which it evolves is a function of the eigenvalues
of
. This is because the
th matrix entry of the double commutator in
’s eigenbasis is
The dampening of the off-diagonal elements of occurs at a rate that grows with where if the system is in a superposition. We see that the eigenbasis of determines the collapse basis, i.e., the basis in which the state becomes “classical”, while its eigenvalues tell us which superpositions of pairs of such states are removed more quickly (namely, those with large ). Let us now use this prescription to construct a very simple consciousness-collapse model for the dyad, involving only a single collapse operator.
The four states of the dyad correspond to mutually distinct states of consciousness and span the Hilbert space of the system. A consciousness-collapse model should therefore drive the state, at large times
t, to one that is diagonal in this basis (this is the preferred basis for the purpose of our simple example. Note that all collapse models, including standard collapse models, postulate a preferred basis. For GRW, it is one whose basis states are definite positions states. For CSL, it is one whose basis states are definite mass-density states. For consciousness-collapse models, it is one whose basis states are definite states of consciousness. For all such models, it is one whose basis states the state vector collapses to. For the purpose of modeling the dyad, we assume that the computational basis yields definite states of consciousness. But for consciousness-collapse models generally, the preferred basis need not correspond exactly to a “classical basis”). Therefore, our collapse operator
will have the form
with four eigenvalues
. Any consciousness-collapse model should arguably imply the following principle for the choice of those eigenvalues:
Superpositions of two dyad states (say, and ) should vanish very quickly if and only if they are qualitatively very different states of consciousness.
That is, it is natural to allow superpositions of “qualitatively similar” states to persist for longer, while qualitatively different states must decohere quickly. As mentioned in the introduction, this principle is motivated by familiar constraints on collapse dynamics—such as the need to avoid tails problems and pathological Zeno-type behavior—which push against indiscriminate or uniformly rapid collapse
For a quantitative application of this prescription, we need a way to compare states of consciousness, i.e., a distance measure on Q-shapes (Figure 11, [
18]). Let us first begin with a simple ansatz that will help us to get some intuition for what is going on, before coming to a more general and rigorous no-go result further below.
Intuitive ansatz. Since Q-shapes are collections of probability distributions, it is natural to define their distance in terms of distance measures on probability distributions, which is a classical and well-studied topic in information theory. The preferred distance measures on probability distributions in IIT have changed in almost every successive version. IIT2.0 used the well-known Kullback–Leibler divergence. IIT3.0 used Earth Mover’s distance [
53]. IIT4.0 uses the intrinsic difference measure described in
Appendix B.
A natural choice is to define the distance of two Q-shapes
(i.e., with rows
) and
as
where
is some choice of distance measure on the set of probability distributions. That is, the distance of two Q-shapes is the sum of the distances of their probability distributions. (This is very similar to the form of IIT3.0’s extended Earth mover’s distance measure.)
Now we have a large choice of possible distance measures
at our disposal. However, note that the four Q-shapes of the dyad (Equations (
4) and (
5)) consist of a small variety of very simple probability distributions only: all entries are 0 or
, and any two rows are either equal, or they differ in all four entries. Two identical rows must have distance zero. Furthermore, it is natural to demand that every two probability distributions arising as rows in these Q-shapes that differ in all four places all have the same distance, which we can set to unity by a choice of scaling factor. For example,
We can then determine the distances between all pairs of Q-shapes of the dyad and obtain the following values, writing
as the
-entry of a table:
| | Q(0,0) | Q(0,1) | Q(1,0) | Q(1,1) |
| Q(0,0) | 0 | 2 | 2 | 4 |
| Q(0,1) | 2 | 0 | 4 | 2 |
| Q(1,0) | 2 | 4 | 0 | 2 |
| Q(1,1) | 4 | 2 | 2 | 0 |
Let us now return to our consciousness-collapse principle, and let us focus only on one implication of its if-and-only-if statement. It reads: If the distance between two Q-shapes and is large, then the distance between the eigenvalues and of the collapse operator must also be large.
This desideratum could always be satisfied by the arbitrary prescription to make all eigenvalues extremely large and distant from each other. However, this would typically induce almost-instantaneous collapse, a behavior that we do not expect for simple systems such as the dyad. Thus, we are searching for a choice of eigenvalues that is as tame as possible while still satisfying the above postulate.
There are many different ways to formulate the requirement “the four eigenvalues
are not very large” as a precise mathematical condition. Since we are still at the stage of developing some intuition (and not yet proving a rigorous no-go result, which will follow below), let us choose one possible formulation arbitrarily which will turn out to lead to a tractable solution: that the sum of all the
is small. This leads us to define the eigenvalues in terms of an optimization problem:
This prescription keeps the collapse behavior “tame” by demanding that the eigenvalues are not arbitrarily large, but only as large as they need to be (in their total sum) to satisfy our principle for all pairs of Q-shapes. Note that the total time scale of the collapse is not determined by
and its eigenvalues, which do not have any physical units. Instead, it is determined by the noise term of (
6), i.e., the parameter
in (
7). This will remain a parameter of the collapse model that needs to be determined experimentally. The above considerations tell us only the
relative speed at which superpositions between distinct Q-shapes are suppressed, whereas the
total speed would depend on
and hence on further considerations as to which states of consciousness are implausible to remain in superposition for significant amounts of time because of, say, human experience.
As we show in
Appendix A, this optimization problem has eight solutions:
This degeneracy can be understood as a consequence of the symmetry of the problem: for example, the table of pairwise distances does not change if we exchange
and
, or
and
.
We can simply pick one of the six solutions and use it to define our collapse operator. For the sake of argument, let us pick (though the choice does not matter for the following discussion).
Let us interpret the result by looking at some example collapse rates. We have which is small, and which is also small (and, indeed, identical). Superpositions of the two dyad states and can thus remain stable for a relatively long time. On the other hand, is large, and so is . Hence, superpositions between the dyad states and will be killed off more quickly.
However, consider the two dyad states and . Their distance, , is small and our principle demands that the corresponding difference of eigenvalues (i.e., the associated collapse rate) is at least as large as that. However, it is actually , which is much larger than required. Thus, any superposition of these two dyad states would fall off much faster than what would be expected by considering the difference between their Q-shapes alone.
A rigorous no-go result. One might object that our calculation above is not fully general. For example, we might have a distance measure that is not of the form (
10) (for example the Euclidean distance), and the eigenvalue differences might not be directly related to the distance
between Q-shapes
Q and
, but to some
function thereof. Furthermore, minimizing the sum of the
was a pretty arbitrary choice within the mathematical model. Let us therefore show a more rigorous and general impossibility result. A larger class of natural distance measures
consists of those that are sensitive (only) to the number of entries on which our
and
differ, giving larger values if more entries are different. In this case, there are two positive numbers
a and
b with
such that
This includes the cases discussed above, where
and
, but it also includes the Euclidean distance in
, where
and
, or the
p-norm distance for
, given by
, yielding
and
, reducing to the Euclidean distance for
.
A natural desideratum for our collapse model would be that there is some distance measure such that the collapse rates, determined by , are larger iff the corresponding pair of Q-shapes has larger .
We will now show that this is impossible for the natural class of distance measures that we allow. To satisfy the requirement, we would have a strictly increasing function
with
such that
. Since only the
differences of the eigenvalues matter for the collapse rates, we can without loss of generality assume that one of the
is zero. For example, suppose that
. Then, on the one hand,
and on the other hand,
, which is a contradiction. We obtain similar contradictions in the other three cases of
.
Thus, for our case of a single collapse operator , every value of must depend on more than just the number . If our principle is satisfied, then a large value of the latter implies a large value of the former, but the converse is not in general true. We must hence have pairs of Q-shapes whose superposition must collapse more quickly than what their mere qualitative distance as states of consciousness would suggest. Superposition resistance hence cannot only reflect the structure of conscious experience, but is also additionally constrained by the general structure of quantum mechanics.
This obstruction can be avoided by having a very large number of collapse operators—essentially, one per independent component of the Q-shapes: Lindblad-form collapse dynamics with multiple commuting collapse operators
suppress off-diagonal terms according to
replacing the right-hand side of (
7), where
is a vector containing the
ith eigenvalue of all
M collapse operators
. Consider all relevant Q-shapes of a given system, and the affine-linear space that is spanned by them, and let
M be the dimension of this space. In a nutshell,
M counts the number of (affinely) independent components of the Q-shapes. For example, in our dyad example, the Q-shapes
are elements of
(or, equivalently, the space of
matrices), but they form the four vertices of a tetrahedron, which spans a three-dimensional affine subspace. If
denotes the location of the
kth Q-shape
in this space, then
is the squared Euclidean distance between the Q-shapes
and
. Hence, by choosing the
as diagonal operators where the
ith eigenvalue is given by
, we obtain a collapse model where the collapse rates are exactly identical (up to a global scaling factor) to the distances between the Q-shapes.
This result, however, relies crucially on the fact that one is fitting distances on a finite set of four conscious states. Nothing in this construction singles out the collapse operators in a principled, system-independent way, nor does it generalize to larger systems whose qualitative structure varies along many independent dimensions. As the number of distinguishable conscious states increases, the dimension required to realize the full pairwise distance structure grows rapidly, and no fixed small set of collapse operators can be expected to suffice in general.
To obtain a collapse model that faithfully tracks qualitative differences across all possible conscious states, one must instead associate collapse operators directly with the independent components of a Q-shape. In this case, each collapse operator corresponds to a particular Q-shape component, and the collapse rate between two states is given by the squared Euclidean distance between their Q-shape representations, as made explicit by Equation (
11). This guarantees that superpositions collapse quickly if and only if the corresponding conscious states are qualitatively very different, something we have shown to be impossible with a single collapse operator.
The cost of this fidelity is substantial dynamical complexity. For a classical IIT system with
n elements, each having
d possible states, there are
subsystems. Each subsystem is associated with two probability distributions (cause and effect repertoires) over
possible system states, as well as a single
-value. Capturing the full Q-shape structure may therefore require
independent collapse operators, unless one obtains some accidental affine-linear dependencies. For the classical dyad (
), this yields
collapse operators (strictly speaking, if we employ the simplifications employed in
Section 3, we could reduce this to
collapse operators (one for each matrix entry e.g., in Equation (
4)). But these simplifications were possible only because we already knew that for the dyad, the composite subsystem AB has zero
(so can be ignored) while the two elements have the same
no matter the dyad’s state (so can also be ignored). A model for a more general class of systems cannot adopt these simplifications). Even modest increases in system size lead to rapid growth: for
, one already requires 385 operators.
Importantly, this complexity is not an artifact of restricting attention to classical IIT. In the consciousness-collapse model of [
37], the Q-shapes are defined using quantum integrated information theory [
43,
44], where each element is a qubit with a continuum of possible pure states. In this case, each subsystem is associated with two density matrices rather than two classical probability distributions. Since a density matrix on
dimensions has
independent real parameters, the number of collapse operators required becomes
For the quantum dyad, this already yields
collapse operators.
The source of the complexity identified here ultimately arises from the fact that IIT defines conscious structure in terms of counterfactual causal relations. If collapse rates are required to track differences in Q-shapes, then collapse dynamics must depend not only on the system’s physical actual state, but also on how that system would behave under a wide range of counterfactual interventions. Standard collapse models such as CSL do not face this burden: their collapse operators depend only on actual physical observables (such as mass density), and make no reference to counterfactual state transitions. The resulting operator proliferation in IIT-based collapse models is therefore not an artifact of modeling choices, but a direct consequence of the counterfactual content of the underlying theory of consciousness.
The resulting picture is therefore fourfold. With a single collapse operator, collapse rates cannot track qualitative differences in conscious states. With a small, system-specific set of operators, one can sometimes fit the behavior of particular finite systems, such as the dyad, but without principled generality. With one operator per independent classical Q-shape component, collapse faithfully tracks conscious structure at the cost of extreme complexity. Finally, quantum extensions of IIT only exacerbate this burden. The dilemma identified here is thus robust: the apparent complexity of consciousness-collapse models is not an optional modeling choice, but a structural consequence of requiring collapse dynamics to reflect qualitative differences in conscious experience.
6. Extending the Result to IIT4.0 and Beyond
The previous analysis employed the IIT3.0-inspired notion of a Q-shape because it provides a simple and explicit representation of qualitative differences between conscious states. Since IIT has been reformulated as IIT4.0 [
20], with conscious states described now by
-structures, it is important to determine whether our earlier result depended on special features of the older formalism. We show that it does not. Although IIT3.0 offered a convenient minimal representation, the same difficulty arises in IIT4.0; the operator proliferation problem therefore reflects a structural feature of consciousness-based collapse proposals rather than an artifact of a particular formulation of IIT.
In IIT4.0 the simplest conscious system is a
monad: a single unit that feeds into itself in a self-loop, i.e., whose next state is fully determined by its current state and nothing else. To make this concrete, consider a binary unit
A whose next state is determined solely by its current state as happens when such a unit is sent into a NOT gate. The system is specified by a transition probability matrix (TPM), giving the probabilities of the next state
conditional on the current state
:
| | | |
| 0 | 1 |
| 1 | 0 |
Given the present state, the TPM fixes the probabilities of both past and future states of the unit. IIT4.0 evaluates this by comparing the unit’s actual causal constraints to an unconstrained baseline in which both past and future states are equally likely. Because the real TPM eliminates this uncertainty, the unit specifies its own cause and effect state with maximal irreducibility. This specification is called a
distinction. Thus the monad generates one distinction: the unit determining its own past and future state.
IIT4.0 then introduces a second level of structure. The cause and effect specifications produced by the mechanism are not treated as independent pieces of information; rather, the theory evaluates how they constrain one another. In the monad, the same unit that specifies a cause state also specifies a corresponding effect state, and these two specifications are linked by the system’s TPM. IIT quantifies this linkage as a relation. For the NOT gate, the TPM deterministically pairs each possible past state with a corresponding future state, so the cause and effect specifications form a single irreducible cause–effect mapping relative to the unconstrained baseline. The resulting -structure therefore consists of one distinction and one relation. The monad thus illustrates a central feature of IIT4.0: conscious structure depends on counterfactual causal organization encoded in the TPM.
Consider now the feedback dyad. Although the dyad can occupy four classical states, its causal organization, as captured by the TPM, does not vary across those states in a way that changes the structure described above. In each state the same mechanisms specify their own cause and effect constraints, and the same linkages between those specifications are present. The dyad therefore generates the same collection of distinctions and the same relations in every classical state. (Explicit calculations are provided in
Appendix B). Accordingly, the dyad possesses a single
-structure that is independent of whether the system is in state
,
,
, or
. What changes between these physical states are the node values themselves, not the counterfactual causal organization encoded in the TPM. Since IIT4.0 individuates conscious states by their
-structure, the dyad does not realize different experiences in its different classical states. A quantum superposition of these states would therefore not correspond to a superposition of conscious experiences.
One might think that this avoids our earlier argument. In IIT3.0 the collapse dynamics appeared to depend directly on probability distributions represented in the Q-shape, whereas in IIT4.0 conscious states are described instead by a -structure consisting of distinctions and relations. However, the relevant structure has not disappeared. The distinctions and relations are defined through the cause and effect specifications generated by the system, and these specifications are determined by the system’s transition probability matrix (TPM). Consequently, the -structure still depends on how the system would behave under counterfactual alternatives encoded in the TPM. A collapse model that makes collapse rates depend on conscious structure must therefore be sensitive to this counterfactual causal organization. The dependence is no longer expressed through explicit probability distributions appearing in the description of the conscious state, but it remains implicitly present through the TPM from which the -structure is derived.
This has a direct consequence for the collapse dynamics. Earlier we showed that a single collapse operator cannot in general make the collapse rate depend only on conscious experience. Any such theory must therefore employ multiple operators if it is to track the features that individuate experiences. In IIT4.0, experiences are specified by a collection of distinctions and relations. Changing one relation while leaving the others unchanged constitutes a change in experience according to the theory. A collapse model whose rates depend on conscious structure must therefore be able to respond to these independent features. It follows that separate collapse operators are required for the independent relations that make up the -structure. The number of operators required by such a theory therefore grows with the number of relations present in the system.
The scaling becomes dramatic very quickly. Even very small IIT4.0 systems possess many relations. The three-node system illustrated in Figure 8A of [
20] contains 60 relations, while the five-node system of Figure 7A contains more than 13,000. This proliferation does not arise from unusually complicated wiring. Rather, it follows from how IIT4.0 defines structure. Any subset of nodes can function as a mechanism and generate a distinction, and relations arise when multiple mechanisms jointly constrain the same past or future states. As the number of nodes increases, the number of possible mechanisms grows exponentially, and the number of ways their constraints can overlap grows combinatorially. A collapse model that tracks conscious structure would therefore require at least a corresponding number of independent collapse operators. The resulting dynamics becomes extremely complex even for very small systems, challenging the suggestion that IIT-based collapse models would be especially simple or experimentally tractable.
To see why operator proliferation might make the experimental testing of consciousness-based collapse models hard, consider the following aspects of such tests. An important challenge will be to delineate proper consciousness-induced collapse from other mechanisms such as environment-induced decoherence or collapse due to gravitational interaction. For this distinction, it is important to have concrete predictions about how collapse rates depend on the structure of the system and its associated conscious experience, rather than simply postulating and perhaps observing a certain decoherence time scale. It is reasonable to expect that consciousness-induced collapse is only practically observable in systems that are sufficiently complex. Computing the Q-shapes and their distances for such systems is expected to be computationally very difficult, due to the involved combinatorial complexities. On the one hand, this makes the actual prediction of concrete collapse times difficult; on the other hand, it may suggest that nature rather implements an incomplete correspondence between collapse rates and differences of conscious experience, similar to the one we have seen in the dyad for a single collapse operator. (Indeed, the impossibility to use natural systems for the solution of computationally very complex problems has been suggested as a physical principle [
54], similar to other impossibility principles such as no-cloning or the non-existence of a perpetuum mobile in thermodynamics). But if this is true, then experiments cannot simply test whether experientially more distinct states of consciousness lead to faster decoherence, and it is unclear how proper consciousness-induced collapse should be structurally distinguished from, say, more ordinary environment-induced decoherence.
The robustness of our conclusion is worth emphasizing. We have shown that the operator proliferation problem arises in (a version of) IIT3.0 and persists under the substantially revised formalism of IIT4.0. The difficulty therefore does not depend on the particular way in which conscious structure is represented—whether by Q-shapes or by -structures — but on the more basic requirement that collapse rates track the structure of experience rather than solely on properties of the physical state, such as mass-density distributions.
Indeed, the underlying tension may extend beyond IIT altogether. Any theory of consciousness that individuates experiences in terms of rich internal organization—for example, by appeal to patterns of connectivity, causal dependence, or counterfactual structure in addition to momentary activation states—will generate multiple independent features that distinguish one experience from another. This idea is not specific to IIT and is taken seriously even by some critics of IIT, for example, the claim in [
55] that “merely possible experiences should affect phenomenal character”. A collapse model that attempts to make its dynamics sensitive to such features will therefore require correspondingly many independent dynamical parameters. The proliferation we have identified thus reflects a general structural pressure on Wigner-style consciousness-based collapse proposals, rather than a peculiarity of IIT.