1. Introduction
This paper examines how collective organization shapes individual identity in clonal populations of
Tetrahymena thermophila, a ciliated protozoan that undergoes synchronized binary fission. Using complete single-cell tracking across four successive fission generations, we find that autonomy, informational diversity, and life-history individuality do not precede the formation of a community; rather, they emerge from it. We call this principle the
Community First Theory: collective dynamics serve as the generative substrate from which individuality is progressively differentiated [
1,
2].
The present paper concentrates on
Tetrahymena as a controlled model system; parallel investigations of honeybee societies [
3], ant colonies [
4], and multi-agent large-language-model environments [
5] are reported separately. It has long been recognized that genetically identical cells can exhibit substantial phenotypic heterogeneity through stochastic gene expression [
6,
7], and that such noise-driven diversity can serve adaptive functions including bet-hedging [
8,
9] and division of labor [
10]. Raj and van Oudenaarden [
11] framed this as the interplay of Nature, Nurture, and Chance; the present work adds a fourth factor—
community—arguing that collective interaction actively structures phenotypic diversity beyond what intrinsic noise alone can produce. Prior quantitative studies of
Tetrahymena behavior have established rich single-cell phenotypic diversity [
12] and homeorhetic community dynamics [
13], and single-cell RNA sequencing of the same organism has revealed distinct transcriptional subpopulations within clonal cultures [
14]. Those experiments operated at low cell density where inter-individual interactions are minimal; our confined microchamber setup places all cells in a strongly interacting regime, making community formation rather than individual phenotyping the primary object of study.
To operationalize this hypothesis we use
non-trivial information closure (NTIC) [
15], defined as
where
denotes mutual information,
denotes conditional mutual information,
is the kinetic energy of a focal cell, and
is the mean kinetic energy of its contemporaries [
1,
16]. In [
15], the “environment” is external to the system of interest; here,
is the community of conspecifics—other cells of the same kind. This difference is not incidental but central to Community First Theory: the relevant environment from which individuality emerges is the collective itself. A positive NTIC indicates that the cell’s own past predicts its future beyond what community context can explain, whereas NTIC near zero signals that self-prediction and community-prediction operate on independent (orthogonal) channels of the future state.
A key concept underlying our framework is
information closure: the condition that a system’s future dynamics can be predicted from its own past without requiring additional explanatory variables [
16]. In collective biological settings, however, closure does not imply isolation. An individual component may remain statistically embedded in a correlated community while its temporal predictability becomes effectively self-determined. This
situated autonomy—agency emerging
within collective organization rather than outside it—is precisely what NTIC is designed to detect [
17]. When
yet coupling to the community is non-negligible (
), the cell is information-closed: it has become a coherent individual without having severed its community ties.
NTIC connects naturally to partial information decomposition (PID) [
18,
19,
20], which decomposes the total predictive information about
carried jointly by
and
into four atoms: unique contributions from each source, redundancy shared by both, and synergy accessible only through their combination. NTIC approximates the difference
Positive NTIC (redundancy exceeds synergy) identifies self-sustaining individuals: the cell’s past and community context carry overlapping predictive information, a signature of stable, role-bearing agents. NTIC near zero with non-zero coupling signals that redundancy and synergy are balanced—the hallmark of situated autonomy. Crucially, high synergy (
) should not be interpreted as a more advanced state; within Community First Theory, synergy-dominated regimes correspond to a
pre-specialization phase in which individual roles have not yet stabilized and predictive structure is accessible only through relational combinations. Computing synergy therefore allows us to distinguish cells that are genuinely self-determining from those whose predictability arises from transient contextual dependence.
Our main empirical findings are threefold. First, NTIC tends to be largest early in a cell’s life and declines over the life history, indicating that young cells bear a strong individual-level predictive structure that is gradually shared with the community. Second, fidelity of parent–daughter kinetic-energy distributions was assessed in the quasi-stationary middle phase only—the early phase immediately after division (when cells move slowly) and the late phase immediately before the next division (when cells again slow) are excluded as confounded periods unsuitable for inheritance estimation. Within this middle phase, coupled-regime cells show significantly higher distributional fidelity than information-closed cells (Mann–Whitney , , ), demonstrating that community coupling actively generates, rather than suppresses, behavioral diversity. Third, the connection between NTIC and partial information decomposition reveals that information-closed cells balance redundancy and synergy in a way consistent with situated autonomy: remaining embedded in community context while maintaining effective self-prediction.
In the following sections, we formalize this framework and apply it to fully tracked multicellular-like dynamics in Tetrahymena.
3. Results and Interpretation
3.1. Kinetic-Energy Dynamics Across Generations
We first examined the kinetic-energy (KE) time series obtained from long-term tracking of complete
Tetrahymena lineages.
Figure 2 shows a representative example (series 190308), where all cells from a single founder through to the eight-cell stage were continuously recorded. Each cell exhibits structured fluctuations in motility across successive fission events.
3.2. Distributional Phenotypes and Inheritance Variability
To characterize inter-generational changes in motility patterns, we compared KE distribution shapes across parent–daughter pairs (
Figure 3). Two broad phenotypes were observed across the seven series: (i) a steep-decay type, with KE concentrated near zero and rapidly vanishing tails, and (ii) a broader heavy-tailed type extending to high-KE excursions. Importantly, inheritance was not uniform: within the same generation, sibling daughters could either preserve the parental distribution closely or diverge strongly, indicating that inheritance loss is often a cell-specific transition rather than a global lineage-wide shift.
3.3. Quantifying Inheritance Fidelity with Jensen–Shannon Divergence
To quantify inheritance fidelity systematically, we computed the Jensen–Shannon divergence (JSD) between parent and daughter KE distributions across all lineage series. At the same time, we asked how motility patterns evolve over generations: whether cells converge to a common motility attractor, or whether distinct attractors emerge through interactions among siblings within the developing collective. To address these questions, we extended the JSD analysis to all individuals in the population, comparing KE distributional shapes across cells.
For consistency, distributions were estimated from the central 25 min window (3000 KE steps at stride 5, corresponding to 1500 s) centered on each cell’s midpoint between birth and subsequent division.
Figure 4 reveals distinct inheritance modes: some lineages show consistently low divergence across generations (stable inheritors), whereas others exhibit localized inheritance “crashes” either early or mid-lineage. These results establish that motility attractors can be transmitted with high fidelity in some divisions, but can also undergo abrupt redistribution in others.
3.4. Information-Theoretic Quantification of Coupling and Closure
We next asked whether inheritance patterns relate to the distribution of predictive structure between individual histories and collective context. For cell
k, we defined a sibling mean-field variable,
where
N is the number of cells in the chamber and
is the KE of cell
j, and quantified non-trivial information closure (Equation (
1)) where
X denotes the KE state and
its next-time state.
Cells were classified into three regimes (
Figure 5): (i)
coupled (
,
), (ii)
information-closed (
,
), and (iii)
independent (
).
3.5. Phase-Wise Redistribution of Predictive Structure
Applying the classification scheme across the three life-history phases reveals a clear developmental trajectory.
| Phase | Coupled | Info Closure | Independent |
| Early | 97% | 1% | 2% |
| Middle | 38% | 27% | 28% |
| Last | 75% | 13% | 10% |
Nearly all cells begin strongly coupled in the early phase. Information closure peaks in the middle phase, reflecting the transient emergence of situated autonomy within a correlated collective context. Coupling partially re-emerges prior to the next division (last phase), suggesting a non-monotonic reorganization of predictive structure across the cell cycle.
Categories are assigned using per-cell surrogate thresholds (50 permutations of
with
held fixed).
| Significant? | NTIC > Upper | NTIC < Lower | Category |
| yes | yes | – | coupled |
| yes | no | no | information closure |
| yes | no | yes | synergistic ∗ |
| no | no | no | independent |
| no | yes (either) | anomalous † |
Upper and lower NTIC thresholds are the 95th and 5th percentiles of each cell’s surrogate distribution. significance is assessed against the 95th percentile of the surrogate MI distribution. ∗ Exactly one cell met this criterion (experiment 201002, cell 6, generation 2, middle phase; NTIC bits), consistent with the expected 5% false-positive rate (); we treat this as a type-I error rather than a genuine synergistic regime. † Cells with non-significant but NTIC outside the surrogate bounds; attributed to estimation noise and excluded from subsequent analyses ( in the middle phase, 7.1%).
The classification proceeds as follows. First, is tested: is the real value above that cell’s 95th percentile of the surrogate distribution? If yes, the cell genuinely shares information with siblings (). Second, NTIC is tested: is the real NTIC above that cell’s 95th percentile of the surrogate distribution? If yes, the shared information genuinely contributes to self-prediction. Information closure is assigned when is significant but NTIC is not: the cell is correlated with siblings, yet this correlation does not help predict the cell’s future beyond what one would get with a random C.
Per-cell thresholds are used rather than a single pooled threshold because different cells have different noise levels: a cell with noisier KE will have larger random fluctuations in MI estimates and therefore needs a higher threshold. In the present data, NTIC thresholds ranged from 0.022 to 0.048; a single pooled threshold would be too lenient for some cells and too strict for others.
3.6. Robustness Under Temporal Coarse-Graining
We further tested the scale dependence of NTIC by temporally coarse-graining KE time series into non-overlapping blocks of size
N (
Figure 6). While both mutual information and conditional mutual information varied with coarse-graining scale, the middle-phase NTIC remained close to zero across all block sizes examined (
Table 1), confirming that information closure is not an artifact of the chosen temporal resolution but a robust feature of the middle-phase predictive structure.
3.7. Inheritance Fidelity Depends on Informational Regime
To test whether informational regime predicts inheritance fidelity, we computed
between parent and daughter KE distributions in the middle phase only (
Figure 4), grouped by the daughter’s regime. Coupled cells showed the highest fidelity (
,
), independent cells intermediate (
,
), and information-closed cells the lowest (
,
). A Kruskal–Wallis test confirmed a significant difference across regimes (
,
); the effect was driven by the coupled vs. information-closed contrast (Mann–Whitney
,
,
), while coupled vs. independent and information-closed vs. independent did not reach significance after Bonferroni correction.
Critically, the lower fidelity of information-closed cells cannot be attributed to a generic loss of coupling: independent cells, which also show low
, do not inherit significantly worse than coupled cells. It is specifically the information-closed state—where coupling to siblings persists (
) but NTIC collapses to zero—that predicts the greatest divergence from the parental KE phenotype, a direct dynamical signature of situated autonomy. The per-series breakdown and generation-level trajectory of this divergence are shown in
Figure 7.
3.8. Per-Series Informational Diversity
The per-series breakdown reveals that each experimental community develops its own characteristic informational signature (
Table 2).
Series 190316 and 201002 are information-closure-dominant, with 43% of middle-phase cells reaching the information-closed regime, and these series also show the greatest inheritance divergence (mean and respectively for information-closed cells). By contrast, series 200617 is independent-dominant (57% independent cells), with almost no information-closed cells (7%), and exhibits a qualitatively different fidelity pattern: here it is the independent cells—not information-closed ones—that show elevated inheritance divergence (), suggesting that in this community decoupling from collective context itself drives phenotypic divergence rather than the emergence of situated autonomy. Series 210818 and 190308 are coupled-dominant (57% and 50% respectively) and show the lowest inheritance divergence overall.
Inheritance Fidelity Across Generations
To examine how inheritance fidelity evolves across successive divisions, we computed the mean
between parent and daughter KE distributions separately for each generation transition, restricted to the middle phase (
Figure 7). At the population level, fidelity increases across generations: pooled mean
declines from
at gen0→1, to
at gen1→2, and to
at gen2→3, suggesting that the community progressively stabilizes its phenotypic structure as it matures.
Between-series trajectories reveal two contrasting patterns (
Figure 7 left). Series 210818 and 210824 show extreme early divergence at gen0→1 (
and
respectively), followed by rapid convergence in subsequent generations; the founder cell appears to carry a distinctive phenotype that scatters widely in the first division, after which daughters progressively re-synchronize. Series 190316 exhibits the opposite trajectory: low initial divergence at gen0→1 (
), a sharp spike at gen1→2 (
), and partial recovery thereafter, suggesting that phenotypic diversity erupts in the second generation rather than the first. Series 190308 follows a monotonically convergent trajectory (
) throughout.
Across the seven series, the fraction of information-closed cells at gen3 shows a positive upward tendency with mean inheritance divergence (Pearson
,
;
Figure 7 right). This association is consistent with Community First Theory, but the nature of the diversity requires clarification. The community does not merely produce IC cells: it bifurcates into a mixture of coupled, information-closed, and independent cells. The IC fraction is therefore a proxy for the
degree of categorical differentiation the community has undergone—how far it has partitioned into distinct informational roles. The JSD divergence, by contrast, measures
phenotypic diversity: how much daughters differ from parents in their KE distributions. The positive trend suggests that communities which undergo greater categorical differentiation (more IC cells emerging alongside coupled and independent ones) also tend to exhibit greater phenotypic divergence across generations. These are two complementary signatures of community-generated individuality, and both are predicted by Community First Theory. The trend does not reach conventional significance, which is expected given there are only seven independent series; the direction and magnitude (
) motivate follow-up experiments with larger cohorts.
3.9. Interpretation: Information Closure as the Relocation of
Agency to the Individual Cell
NTIC as Redundancy Between Self-Prediction and Sibling
Influence
A complementary interpretation of NTIC follows from relating it to transfer entropy from the sibling context
C to the focal cell
X [
28]. Transfer entropy is defined as
quantifying the information that the sibling configuration provides about the future state
beyond what is already contained in the cell’s own past
X.
Applying the chain rule for mutual information, NTIC decomposes as
Thus, NTIC measures the
redundancy between two predictive channels: the self-predictive channel
and the sibling channel
. It captures the portion of sibling information about
that overlaps with what the cell already encodes in its own history.
This decomposition yields a clean interpretation of each regime:
Coupled (): Predictive information from C is largely redundant with self-prediction; the individual’s predictive structure is aligned with that of the collective.
Information closure (): Redundancy vanishes and , meaning that sibling information reaches the cell’s future through a channel that does not overlap with self-prediction—the collective contributes, but in a way that is complementary to, rather than redundant with, the cell’s own history. The two predictive channels operate along informationally independent dimensions of the future state.
Synergistic (): Conditioning on X reveals additional predictive information from C, corresponding to genuinely relational prediction. No statistically robust cases of this regime were identified in the present dataset.
A critical subtlety for the notion of agency is that information closure does
not imply that
. Even when
, sibling influence can remain positive, so the cell is not causally isolated from its environment. Rather, closure means that the cell’s intrinsic predictive structure is undisturbed by that influence:
This is precisely the condition for
situated autonomy: an agent is embedded in and receives information from its surroundings (
), yet its core self-dynamics remain informationally self-sufficient. By contrast,
would represent complete causal isolation—the absence of agent–environment interaction, not its mature form. Information closure therefore marks the point at which predictive agency is relocated from the collective to the individual cell, while the cell remains an embedded member of the collective.
4. Discussion
4.1. Community First Theory: Individuality Emerges from
Collective Organization
The results reported here provide initial quantitative empirical support for Community First Theory, which holds that individuality is not a pre-given property of isolated components, but an emergent informational phenomenon generated by collective organization. The central claim of the theory is that forming a community regenerates individuality at a new dynamical level: it is precisely through the process of collective interaction that distinct individual identities arise. The Tetrahymena data instantiate this claim concretely. In the early phase, when the population is small and cells have only recently divided, 97% of cells are strongly coupled—the collective dominates and individual predictive structure is largely redundant with sibling context. By the middle phase, as the eight-cell community consolidates, 27% of cells enter the information-closed regime: they remain embedded in a correlated sibling context () while their future dynamics become effectively self-determined (). Coupling partially re-emerges in the last phase before division as the community begins to dissolve in preparation for the next division, suggesting that the differentiation of individual identities is transient and non-monotonic rather than a one-way transition.
This trajectory is not a gradual weakening of collective cohesion. It is a redistribution of predictive structure: the community generates the conditions under which distinct individual dynamical identities can crystallize. The sibling mean-field variable
in our framework is precisely the kind of collectively computed macroscopic quantity that Flack [
29] identifies as the locus of downward causation: components coarse-grain the collective state and adjust their behavior accordingly. The transition from coupled to information-closed cells can therefore be read as the passage from a regime in which the macro level causally dominates the micro (high effective information at the collective scale, in the sense of Hoel et al. [
30]) to one in which individual-level causal structure re-emerges. Unlike approaches that infer agency from task performance or predefined roles, NTIC provides an explicit information-theoretic criterion for locating this transition—identifying
where predictive structure is borne, absorbed, or relocated as the collective develops.
4.2. Information Closure as Constructed Individual Identity
The inheritance analysis deepens this interpretation. Cells in the information-closed regime showed the greatest divergence from their parental KE attractor (), significantly larger than coupled cells (; Mann–Whitney , ), and this effect was specific to the information-closed state: independent cells, which also show low , did not inherit significantly worse than coupled cells ().
This specificity is theoretically important. A generic loss of coupling (the independent regime) does not destroy inheritance fidelity. What destroys it is the information-closed state, in which
persists but
. Coupled cells are pulled toward the collective attractor, which closely resembles the parental one, and so they inherit faithfully. Information-closed cells, by contrast, have constructed their own self-determined attractor—one that is no longer anchored to the inherited template. In this sense, information closure marks a dynamical bifurcation: the cell remains a member of the collective while simultaneously departing from the lineage attractor. This corresponds closely to Di Paolo’s [
31] notion of adaptive autonomy—rooted in the autopoietic tradition [
32]—where an agent maintains its self-organizing dynamics while remaining coupled to its environment, with the capacity to modulate that coupling. This is precisely what Community First Theory predicts. The community does not merely reorganize existing individual identities; it generates new ones, and these new identities are recognizable precisely by their departure from what was inherited.
4.3. Relation to the Classical Definition of Information Closure
The definition of information closure used here differs in an important respect from the classical formulation. In the original systems-theoretic treatment [
16], closure is defined as the condition
meaning that the collective context provides
no additional information about the cell’s future once its own past is known. Under this strict definition, the cell is causally isolated from its environment: it neither uses nor is influenced by sibling dynamics.
We argue that this classical condition is neither sufficient nor necessary for situated autonomy, and that it conflates three distinct phenomena. The condition is equivalent to , which encompasses: (i) independence, where (no collective relationship); (ii) coupling, where (collective relationship dominates self-prediction); and (iii) information closure, where but (collective relationship exists but does not compromise autonomy). Only the third regime—which violates the classical condition—constitutes genuine situated autonomy.
In other words, achieves closure trivially, through isolation. achieves closure non-trivially, through autonomy within coupling. A genuine agent is not causally sealed off from its environment; it maintains an autonomous self-model while remaining situated within an informative collective context. The Tetrahymena data support exactly this picture: the information-closed cells identified here have positive —they do receive sibling influence—yet their self-dynamics are informationally self-sufficient.
This non-trivial closure is the empirical basis of Community First Theory. The condition with means that the individual’s future is not written inside the individual alone—it is sustained by the relational structure of the collective. Agency, in this sense, does not reside within the component but is granted to it through collective organization. A collective forms first, and from that collective a new form of individual autonomy emerges.
4.4. Informational Diversity Without Genetic Difference
The per-series results (
Table 2,
Section 3.8) demonstrate that the
type of informational differentiation a community undergoes—whether cells partition into coupled vs. information-closed states, or into coupled vs. independent states—has measurable consequences for how faithfully motility phenotypes are transmitted across generations. This between-series variability is a key prediction of Community First Theory: since individuality is constructed through collective interaction rather than read off from a genetic template, the specific identities that emerge depend on the history and dynamics of each particular community. What might be called “personality” in a clonal population—the dynamical and informational differentiation of cells sharing identical genomes—is therefore a direct expression of community-generated individuality, not a genetic artifact. This finding bears on the concept of organismality [
33] and the broader question of what constitutes a biological individual [
34]: our clonal
Tetrahymena population is genetically uniform yet exhibits non-trivial conflicts of informational interest, placing it at an intermediate point on the society–organism continuum. This resonates with the isologous-diversification theory of Kaneko and Yomo [
35], which showed theoretically that interacting identical cells can spontaneously differentiate through dynamical instabilities; our information-theoretic classification provides empirical evidence that such differentiation manifests as distinct informational regimes. From a systems-theoretic perspective, this coexistence of informational role differentiation within a coupled collective parallels Tononi et al.’s [
36] notion of neural complexity, where high complexity arises from the simultaneous presence of functional segregation and functional integration among homogeneous units, and resonates with Integrated Information Theory’s emphasis on irreducible causal structure [
37]. The generation-level trajectory (
Section 3.8) further shows that the timing of phenotypic diversification itself varies across communities, with some series diverging sharply at the first division and others not until the second, suggesting that the collective interaction history shapes both the degree and the tempo of individuation.
4.5. Connection to Partial Information Decomposition
The identity
(Equation (
2)) places our framework within the partial information decomposition (PID) literature [
18], where the total mutual information
is decomposed into redundant, unique, and synergistic components. The specific values of PID components depend on which redundancy measure is adopted, and different PID frameworks yield different decompositions. However, adopting the minimal mutual information as redundancy [
19,
38],
leads to a particularly clean consequence for information closure. If the empirical condition
holds—meaning a cell predicts its own future better from its own past than from its siblings’ current state—then
. Substituting into
, information closure (
) then implies
In other words, under this redundancy measure and condition (
8), information closure
guarantees that genuine synergy is present: the cell’s past and the sibling context jointly predict the cell’s future in a way that neither does alone. The collective does not merely correlate with the cell; it actively participates in constructing the cell’s future predictability.
We verified condition (
8) empirically across all seven series in the middle phase.
exceeded
for all 98 cells (100%), with mean values of
bits and
bits respectively—a nearly 19-fold difference confirming that temporal self-prediction dominates sibling-to-future coupling throughout. Consequently,
holds universally in this dataset.
Because holds cell by cell, the three informational regimes map directly onto qualitatively distinct PID signatures without requiring separate estimation of Rdn and Syn:
Coupled (): —the cell’s past and the community context carry overlapping predictive information.
Information-closed ( with ): —the algebraic consequence of uniquely characterizes this regime. Both X and C contribute to predicting , but their contributions are complementary: redundancy is near zero, so the predictive information carried by the cell’s own past and by the collective context does not overlap. Each source reaches the cell’s future through an independent channel. Because , non-trivial synergy is present: the cell’s past and the sibling context jointly predict the cell’s future in a way that neither source does alone.
Independent (): trivially, because the cell is decoupled from its siblings.
Thus, is the specific informational fingerprint of information closure and does not arise in either the coupled or the independent case.
The adoption of minimal mutual information as redundancy is a specific assumption within the PID framework, and the quantitative values of Rdn and Syn depend on this choice. However, the qualitative conclusion—that information closure implies non-trivial synergy between self-history and collective context—is robust to this choice whenever condition (
8) holds, which it does universally here.
4.6. Synergy and the Limits of the Present Dataset
Although a formal classification allows for a synergistic regime (, where relational combinations carry predictive structure not accessible from individual histories alone), the present data provide no evidence for such a regime.
In the middle phase, five cells (5.1% of 98) showed negative NTIC alongside significant , making them prima facie synergistic candidates. However, each cell’s significance is assessed against its own surrogate distribution (50 permutations of ), giving a cell-specific lower threshold at the fifth percentile. Of the five candidates, four had NTIC values within their individual noise ranges (i.e., NTIC > cell-specific lower threshold) and were therefore classified as information-closed. Only one cell—cell 6 of experiment 201002 (generation 2, NTIC = −0.039 bits)—fell below its cell-specific lower threshold (−0.027 bits) and was formally classified as synergistic. This single case () is consistent with the expected false-positive rate (), and for this cell only marginally exceeds its significance threshold, further supporting a type-I error interpretation. Notably, the overall count of five negative-NTIC cells (5.1%) matches the FPR almost exactly, reinforcing the conclusion that no genuine synergistic regime is present in this dataset.
Within Community First Theory, synergistic regimes would correspond to a pre-specialization phase in which collective roles have not yet stabilized; the present data do not support this interpretation, and we treat information closure as the primary finding.
4.7. Broader Implications and Future Directions
A central implication of Community First Theory is that agency relocation is not tied to a particular biological substrate, but may arise generically in interacting systems when predictive structure is redistributed between individuals and the collective context. Future work should compute NTIC and related multivariate information measures directly in robotic and multi-agent artificial systems, and examine how learning, environmental heterogeneity, and communication constraints shape the emergence of information closure. Networks of LLM-based agents offer a particularly tractable test case, as individual module histories and collective context variables are directly accessible.
The contrast with prior
Tetrahymena studies clarifies the scope of Community First Theory. Jordan et al. [
12] and Chuang et al. [
13] characterized phenotypic diversity and succession dynamics at low cell densities where inter-individual interactions are sparse; in that regime, behavioral diversity is largely an intrinsic property of isolated cells. Our microchamber experiments operate in a qualitatively different, strongly interacting regime: eight cells confined to an 800
m chamber interact continuously throughout the growth cycle, and it is precisely this strong coupling that enables the community formation we observe. The between-series heterogeneity in informational regime (
Table 2) and the generation-level fidelity trajectories (
Figure 7) are therefore not merely reflections of pre-existing individual variation but signatures of community-mediated differentiation inaccessible to low-density experiments. Furthermore, recent single-cell RNA sequencing of clonal
Tetrahymena cultures has independently revealed distinct transcriptional subpopulations sharing identical genomes [
14], demonstrating that community-generated individuality manifests at multiple biological levels—from gene expression to motility dynamics. Community First Theory provides the unifying explanatory framework: individuality in the strongly interacting regime is not inherited by isolated cells but emerges from, and is structured by, collective dynamics.
These findings align with recent views that agency is reorganized—rather than simply suppressed—across the unicellular-to-multicellular transition [
2], and connect to information-theoretic definitions of agents based on spatiotemporal predictive structure [
17]. Community First Theory provides a unifying framework for these perspectives: it is not that multicellularity suppresses cellular agency, but that collective organization generates a new level of individual identity—one that is informationally distinct from both the coupled collective and the independent isolated cell.
Several limitations should be noted. The present study examines a single biological system (
Tetrahymena thermophila) with seven independent communities and one dynamical observable (kinetic energy), each community comprising eight cells at the third generation. Although the sample size is sufficient to establish statistical significance for the main findings, confirming the generality of Community First Theory requires extension to other interacting systems. We are currently applying related information-theoretic frameworks to honeybee colonies of order
individuals [
39] and to
Pristomyrmex punctatus ant societies of order
[
40], where both the number of interacting individuals and the dimensionality of behavioral observables far exceed those of the present study. These ongoing analyses will test whether the coupled-to-closure transition is a general feature of collective individuation or specific to the micro-confinement geometry used here.
5. Conclusions
In this study, we introduced non-trivial information closure (NTIC) as an information-theoretic measure to characterize where predictive and causal structure is effectively localized in interacting systems. By analyzing complete individual-level dynamics in populations of Tetrahymena, we showed that predictive structure can relocate from individual temporal continuity to collective relations as interaction strength and organization increase.
In the coupled regime (), a cell’s future is largely predictable from the collective context alone: the community determines the individual. In the information-closed regime ( with ), the cell remains embedded in the collective yet its future has become predictable from its own past—an autonomous individual has emerged. Crucially, this autonomy is not a pre-existing property; it is constructed through collective interaction. The community first produces individuals whose futures it controls, and from that coupled state, information closure arises: the collective gives rise to an individual that no longer requires the collective for its own prediction.
Remarkably, genetically identical populations spontaneously differentiate into distinct informational regimes—coupled, information-closed, and independent—whose proportions vary across experimental series. This categorical diversity arises without genetic difference, demonstrating that collective organization alone is sufficient to generate functional individuality.
Inheritance fidelity, measured by Jensen–Shannon divergence between parent and daughter kinetic-energy distributions, is significantly lower for information-closed cells than for coupled cells (), and fidelity increases across successive generations in most series. Information-closed cells depart from the inherited parental attractor, constructing new self-determined phenotypes; the community does not merely reorganize existing identities but generates new ones.
The robustness of NTIC under temporal coarse-graining further demonstrates that this measure captures a structural property of predictive organization rather than a scale-dependent artifact.
Together, these results provide empirical support for Community First Theory. The central claim is that agency does not reside inside the individual but in the relations among individuals: a collective forms first, and from that collective a new form of autonomy is granted to the individual. Within the community, each cell retains a self-predictive structure (), and this structure persists even when collective context is accounted for ()—the individual is genuinely autonomous. Yet this autonomy is not intrinsic; it is constituted by the organizational structure of the community. The condition with —non-trivial information closure—is the quantitative signature of this process: the individual’s future is sustained by collective relational structure, yet has become self-determined. Individuality is not a prerequisite for collective behavior but an emergent product of it.
While a synergistic regime (NTIC < 0) was not statistically resolved in the present dataset, it may be obscured by finite-sample noise in this small-N system; we expect such a regime to become accessible in larger self-organizing collectives such as social insect colonies, multi-agent language model systems, and neural populations, where relational predictive structure can dominate over individual histories.