1. Introduction
Understanding information flow in multivariate systems is a central problem in the analysis of complex data. In applications such as neuroscience, climate science, economics, and networked dynamical systems, observed signals arise from several interacting components. Pairwise measures are often insufficient in such settings, because information about a target may be redundant across sources, unique to one source, or available only through joint observation.
Classical multivariate information measures, such as interaction information, are useful algebraic summaries but can take negative values. This is not an error: the sign reflects the alternating-sum structure of the definition and the fact that redundancy and synergy contribute with opposite signs. Nevertheless, signed net quantities can be difficult to interpret when the aim is to identify distinct informational mechanisms.
Partial information decomposition (PID) addresses this issue by decomposing information about a chosen target into redundant, unique, and synergistic components. Early PID approaches introduced important conceptual distinctions but did not fully resolve the problem of obtaining nonnegative atoms in general multivariate settings. Mages, Anastasiadi, and Rohner [1] (see also the references therein) introduced a full-lattice PID based on the Blackwell order and the geometry of binary-input channels. We refer to this construction as MAR-PID. Its central advantage is that the resulting partial information atoms are nonnegative for a broad class of information functionals.
Partial information decomposition was introduced to separate redundant, unique, and synergistic contributions to the information that several sources provide about a target [2,3]. Since then, a number of alternative approaches to redundancy, synergy, and multivariate dependence have been developed, including optimization-based, geometric, pointwise, and dual-decomposition formulations [4]. Related work has also emphasized the role of synergy in information modification and dynamical systems, as well as the limitations of classical Shannon-type summaries for describing higher-order dependence [5] (see also the references therein). The present paper does not aim to compare these PID measures. We use the Blackwell-order construction of Mages, Anastasiadi, and Rohner because it provides nonnegative partial information atoms for a broad class of information functionals and is naturally formulated in terms of binary-input channel geometry [1,6]. The binary-input channel geometry, Blackwell order, zonogon representation, cumulative loss construction, and nonnegativity theorem are inherited from MAR-PID. The new contribution of the present paper is the finite-resolution empirical layer needed when the variables of interest are continuous or high-cardinality rather than already given in a finite binary form. The comparison made here is conceptual rather than benchmark-based: different PID proposals define different decompositions, and a numerical comparison would require separate choices of estimators, test distributions, and error criteria.
The basic idea is to represent variables by binary coordinates obtained from recursive quantile, or median, partitioning. For a continuous target variable, the finite-resolution representation is a vector of binary target components. MAR-PID is applied to each binary target component, and the resulting atomic information quantities are aggregated over target bits. For sources that are also represented by binary coordinates, the bit-level atoms are pushed forward to atoms indexed by the original source variables. Thus the binary representation acts as an intermediate computational device, while the final summaries are reported at the level of the original variables.
The resulting object is a finite-resolution MAR-PID summary induced by the chosen binary representation, not a representation-independent PID of the underlying continuous variable. This distinction is important: the paper extends the empirical use of MAR-PID to finite-resolution representations of continuous or high-cardinality observations, but it does not modify the MAR-PID nonnegativity theorem itself.
This construction should also be distinguished from set-based structural degrees of freedom. If a discretized dynamical system is known explicitly, and if one knows the set of present-time variables influencing each next-time component, then set cardinalities and set-theoretic inclusion-exclusion methods provide direct structural descriptors. In observational time-series analysis, however, the update rule and its influence sets are usually not available. The present paper does not attempt to define degrees of freedom from such observations. Instead, it provides empirical nonnegative MAR-PID atoms that can later serve as the basis for support-level or scale-based summaries.
Contributions
The main contributions of this paper are as follows:
A finite-resolution binary representation of continuous or multilevel variables based on recursive quantile binarization;
A conditional dyadic property showing that, at every fixed finite-tree level, the recursive quantile code produces balanced and conditionally independent binary coordinates under non-atomic conditional laws;
A bit-level empirical procedure that applies MAR-PID to binary target components obtained from the finite-resolution representation;
An aggregation scheme over resolved target bits, producing finite-resolution target-bit summaries for the original target variable;
A source-side pushforward that maps bit-level MAR-PID atoms to summaries indexed by atoms over the original source variables;
Illustrative examples showing how nonnegative MAR-PID atoms represent XOR-type synergy and mixed redundancy–synergy mechanisms.
Although this paper focuses on the empirical construction and aggregation of MAR-PID atoms, the procedure is not tied to one side of the redundancy–synergy distinction. The finite-resolution binarization, target-bit aggregation, and source-side pushforward apply to the resulting atoms whether they are interpreted as redundant, unique, or synergistic contributions.
The present paper deliberately remains at the information decomposition level. PID-based dimensions and support-based degrees-of-freedom summaries require additional choices concerning scale normalization, activity thresholds, and source/target-bit aggregation. These questions are natural continuations of the present construction and are left to a separate treatment.
The structure of the paper reflects this separation. Section 2 recalls only the MAR-PID background needed for the construction. Section 3 defines the finite-resolution binary representation and proves the conditional dyadic property. Section 4 defines the bit-level MAR-PID aggregation and the pushforward to original-variable atoms. Section 5 presents the empirical plug-in computation. Section 6 presents illustrative examples, and Section 7 discusses interpretation, limitations, and subsequent support- or scale-based summaries.
2. MAR-PID Background
We briefly recall the objects needed for the empirical construction. Let $\mathcal{V} = \{X_1, \dots, X_n\}$ be a finite collection of observed variables, and let $T$ denote the target. A source is a nonempty subset $S \subseteq \mathcal{V}$. We write $\mathcal{S}$ for the set of all sources. An atom is an antichain in $\mathcal{S}$; that is, a collection $\alpha \subseteq \mathcal{S}$ such that no source in $\alpha$ is strictly contained in another. The set of atoms is denoted by $\mathcal{A}$.
The MAR-PID construction assigns to each atom
a partial information contribution about the target. These atoms are obtained by applying Möbius inversion to a cumulative loss functional on the synergy/loss lattice. We write
for the cumulative loss and
for its Möbius inverse. When the functional f is fixed, we suppress f in the notation whenever no confusion is possible.
The construction is based on comparing channels in the Blackwell order. For a source
S and a binary target
, the relevant channel is
For binary-input channels, the Blackwell order admits a zonogon representation, which makes joins and meets computable and supplies the geometric inequality behind the nonnegativity of the atoms. The formal lattice and channel definitions are recalled in Appendix A and Appendix B.
We use the synergy/loss orientation of the MAR-PID lattice throughout. Thus
denotes the order on antichains used for cumulative loss and Möbius inversion, as recalled in
Appendix A. The symbol ∨ in
denotes the Blackwell join of the source channels
. For binary-input channels this join is represented and computed through the associated zonogon geometry; the zonogon construction is therefore a geometric representation of the Blackwell join, not a separate source lattice operation.
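The zonogon representation can be illustrated with a minimal sketch. It assumes a channel stored as a 2 × m array whose row t holds the conditional probabilities of the source values given target value t. The function names are hypothetical and are not taken from MAR-PID. The sketch traces the upper zonogon boundary and tests Blackwell dominance through zonogon inclusion, which is the standard characterization for binary-input channels; it does not implement the join or the MAR-PID atoms.

```python
import numpy as np

def zonogon_upper_vertices(channel):
    """Vertices of the upper zonogon boundary of a binary-input channel.

    channel[t, j] = P(S = s_j | T = t) for t in {0, 1}. Column j is the
    vector (channel[0, j], channel[1, j]); sorting columns by decreasing
    steepness and accumulating them traces the boundary from (0, 0) to (1, 1).
    """
    ch = np.asarray(channel, dtype=float)
    x, y = ch[0], ch[1]
    keep = (x > 0) | (y > 0)                   # drop source values that never occur
    x, y = x[keep], y[keep]
    order = np.argsort(-np.arctan2(y, x), kind="stable")
    steps = np.column_stack([x[order], y[order]])
    return np.vstack([[0.0, 0.0], np.cumsum(steps, axis=0)])

def upper_value(channel, x0):
    """Height of the upper boundary at horizontal coordinate x0."""
    pts = zonogon_upper_vertices(channel)
    best = 0.0
    for (xa, ya), (xb, yb) in zip(pts[:-1], pts[1:]):
        if xb <= x0:
            best = yb                          # segment lies entirely left of x0
        elif xa <= x0 < xb:
            best = ya + (yb - ya) * (x0 - xa) / (xb - xa)
    return best

def blackwell_dominates(ch1, ch2, tol=1e-12):
    """True if the zonogon of ch2 is contained in the zonogon of ch1.

    Both boundaries are concave and piecewise linear, and zonogons are
    centrally symmetric, so comparing upper boundaries at all breakpoints suffices.
    """
    xs = np.concatenate([zonogon_upper_vertices(ch1)[:, 0],
                         zonogon_upper_vertices(ch2)[:, 0]])
    return all(upper_value(ch1, x) >= upper_value(ch2, x) - tol for x in xs)
```

For example, a binary symmetric channel with crossover 0.1 dominates one with crossover 0.3, but not conversely, since the noisier channel is a garbling of the cleaner one.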
The functional
denotes the information functional used in MAR-PID for a binary-input channel
and target bias
. The cumulative quantity used below is a loss relative to the full source channel:
Partial information atoms are then obtained by Möbius inversion with respect to
. We assume
f belongs to the admissible class covered by the MAR-PID nonnegativity theorem; the present paper does not modify that theorem or enlarge its scope.
A useful way to view the MAR-PID result is that it provides nonnegative target-relative atoms. In the binary-target case, these atoms satisfy
For finite multilevel targets, MAR-PID can be formulated through pointwise binary-input decompositions over target states. In the empirical construction developed below, we use binary components of a finite-resolution representation of the target. This keeps every computational step inside the binary-input setting while allowing the final summaries to refer to the original target variable after aggregation.
3. Recursive Quantile Binarization
In practice, variables may be continuous or have many possible values. We therefore introduce a finite-resolution binary representation. The construction is based on recursive quantile (median) partitioning. For simplicity of notation, assume first that a variable X takes values in the unit interval $[0,1]$; other one-dimensional variables may be transformed to this case by a monotone distributional transform or by empirical ranks.
Let x be a realization of X.
In empirical work, the medians are estimated from samples. For time-series prediction or conditional analysis, the medians may be conditional on past information or on a conditioning state. The following proposition records the finite-tree-level property used in the construction.
Proposition 1 (Conditional dyadic structure). Fix a conditioning variable $Z$ and write $B_1, B_2, \dots$ for the successive bits of the recursive conditional median code of $X$ given $Z$. Assume that the recursive conditional median binarization of $X$ relative to $Z$ is well defined up to every finite depth. Then, for every $K \ge 1$ and every binary word $(b_1, \dots, b_K) \in \{0,1\}^K$,
$$P(B_1 = b_1, \dots, B_K = b_K \mid Z) = 2^{-K}.$$
Consequently, for every fixed $K$, the variables $B_1, \dots, B_K$ are conditionally independent given $Z$, and each is conditionally Bernoulli$(1/2)$. The proof is given in Appendix C. This statement can be viewed as a finite-depth dyadic analogue of the Rosenblatt transform [7]. The construction produces balanced binary components, which is useful both statistically and computationally.
Proposition 1 is used here as a finite-resolution coding result. Recursive conditional median splitting produces balanced binary coordinates with a dyadic finite-tree structure. This gives each resolved target coordinate a common binary information scale and makes it suitable as a binary target for MAR-PID. The target-bit aggregation and the projection to original-variable atoms are separate construction steps introduced below.
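A minimal sketch of the empirical, unconditional version of the recursive median code may help fix ideas. It assumes a rank-based implementation in which ties are broken by sample order. The function name and the tie rule are illustrative assumptions, not the paper's specification, and conditional medians (given past information or a conditioning state) would require running the same splits within each conditioning cell.

```python
import numpy as np

def recursive_median_bits(x, depth):
    """Return an (n, depth) array of 0/1 codes from recursive empirical median splits.

    Rank-based, so the code depends only on the order of the observations.
    Ties are broken by sample order (an assumed convention).
    """
    x = np.asarray(x, dtype=float)
    n = x.size
    ranks = np.empty(n, dtype=np.int64)
    ranks[np.argsort(x, kind="stable")] = np.arange(n)
    bits = np.zeros((n, depth), dtype=np.int8)
    lo = np.zeros(n, dtype=np.int64)       # inclusive lower rank of the current cell
    hi = np.full(n, n, dtype=np.int64)     # exclusive upper rank of the current cell
    for level in range(depth):
        mid = (lo + hi) // 2               # empirical median rank inside each cell
        upper = ranks >= mid
        bits[:, level] = upper
        lo = np.where(upper, mid, lo)      # descend into the upper half
        hi = np.where(upper, hi, mid)      # or into the lower half
    return bits
```

When the sample size is a multiple of 2^depth, every binary word of length depth receives the same number of observations, which is the empirical counterpart of the balanced-split property. Applying a strictly increasing transformation to x leaves the code unchanged as long as it does not create ties.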
Remark 1 (Representation dependence). The construction is multivariate at the PID level: MAR-PID is applied to a collection of source variables and a target. The recursive quantile binarization step, however, is specified here for scalar variables. A vector-valued observed variable can be included either by treating its coordinates as separate observed variables, or by first choosing an additional finite-resolution encoding of the vector. Different such encodings may lead to different finite-resolution MAR-PID summaries.
For scalar variables, the quantile construction is invariant under strictly monotone transformations at the population level, or under rank-based empirical implementation, because the induced order of observations is unchanged. The finite-depth dyadic property in Proposition 1 requires the conditional median splits used in the construction; unconditional quantiles would not in general yield the same conditional dyadic property. Ties or atoms require a deterministic or randomized tie-breaking convention, and the exact balanced-split statement should be understood under the stated non-atomic conditional-law assumption.
The use of recursive median splits is part of the finite-resolution representation. Median splitting is the choice used in Proposition 1, since it gives balanced binary coordinates and the finite-depth dyadic property. Other quantile splits may be useful for application-specific encodings—for example, to give more resolution to tail events—but they define different finite-resolution summaries and need not satisfy the same dyadic property. For scalar variables, the rank-based median construction is invariant under strictly monotone transformations, apart from tie-handling conventions. For vector-valued variables, or for non-median encodings, the chosen representation should be reported.
4. Bit-Level MAR-PID and Lifting to Original Variables
The binary variables introduced above are computational intermediates. We now describe how MAR-PID is applied at the bit level and how the resulting atoms are aggregated back to finite-resolution summaries for the original variables.
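Let $Y$ be an original target variable, and let
$$Y \mapsto (Y_1, \dots, Y_K)$$
be its finite-resolution binary representation. Similarly, suppose that each original source variable $X_i$, $i = 1, \dots, n$, is represented at a common source depth $k$ by
$$X_i \mapsto (X_{i,1}, \dots, X_{i,k}).$$
The binary source universe is
$$\mathcal{V}^{\mathrm{bit}} = \{X_{i,j} : 1 \le i \le n,\ 1 \le j \le k\}.$$
For simplicity we use a common source depth $k$ and target depth $K$. Allowing source-dependent depths $k_i$ is straightforward, but adds notation without changing the construction. For each binary target component $Y_\ell$, MAR-PID is applied to the source universe $\mathcal{V}^{\mathrm{bit}}$ and target $Y_\ell$. This gives bit-level atoms
$$\Pi_\ell(\alpha), \qquad \ell = 1, \dots, K,$$
where $\alpha$ ranges over the atoms on $\mathcal{V}^{\mathrm{bit}}$. If Shannon information is used, the finite-resolution contribution associated with a bit-level atom $\alpha$ is aggregated over target bits by
$$\Pi(\alpha) = \sum_{\ell=1}^{K} \Pi_\ell(\alpha).$$
The normalized version is
$$\overline{\Pi}(\alpha) = \frac{1}{K} \sum_{\ell=1}^{K} \Pi_\ell(\alpha).$$
Throughout the paper, an overline denotes normalization by the number of resolved target bits.
When the sources are represented by binary coordinates, bit-level atoms can also be projected back to original source variables. Define the projection map on source-bit subsets by
$$\pi(S) = \{X_i : X_{i,j} \in S \text{ for some } j\}.$$
For an atom $\alpha$, the collection $\{\pi(S) : S \in \alpha\}$ may fail to be an antichain, because distinct bit-level sources can project to nested original-variable sources. We therefore define $\pi_*(\alpha)$ to be the antichain obtained by removing every projected set that is strictly contained in another projected set.
For an original-variable atom $a$, define
$$\Pi_\ell^{\mathrm{orig}}(a) = \sum_{\alpha \,:\, \pi_*(\alpha) = a} \Pi_\ell(\alpha).$$
Aggregating over target bits gives
$$\Pi^{\mathrm{orig}}(a) = \sum_{\ell=1}^{K} \Pi_\ell^{\mathrm{orig}}(a),$$
and the normalized version is
$$\overline{\Pi}^{\mathrm{orig}}(a) = \frac{1}{K} \sum_{\ell=1}^{K} \Pi_\ell^{\mathrm{orig}}(a).$$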
Thus all bit-level atoms projecting to the same original-variable atom are summed. The resulting quantities, indexed by original-variable atoms, are pushforward summaries of genuine bit-level MAR-PID atoms. This preserves, for each target component, the total MAR-PID information mass and keeps the projected quantities nonnegative. The projection is also a coarsening step: distinct bit-level mechanisms may project to the same original-variable atom and are then no longer separated at the original-variable level.
Aggregating over target bits gives a finite-resolution target-bit summary for the original target variable. The normalized version is an average information contribution per target bit, not a support or variable-count summary. In this sense, the binary coordinates are computational intermediates: MAR-PID is computed at the binary-coordinate level, while the final summaries are indexed by atoms over the original variables.
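The projection and aggregation steps can be sketched as follows. The data layout is an assumption made only for illustration: a source bit is a pair (variable name, bit index), a bit-level source is a frozenset of such pairs, an atom is a frozenset of sources, and the bit-level atom values are supplied as a dictionary by some MAR-PID routine that is not shown. All function names are hypothetical.

```python
from collections import defaultdict

def project_source(bit_source):
    """Map a set of (variable, bit_index) pairs to the set of variables it touches."""
    return frozenset(var for var, _ in bit_source)

def project_atom(bit_atom):
    """Project every bit-level source, then drop projected sets strictly inside another."""
    projected = {project_source(s) for s in bit_atom}
    return frozenset(s for s in projected if not any(s < t for t in projected))

def pushforward(bit_atoms):
    """Sum bit-level atom values over all atoms sharing the same projected atom."""
    out = defaultdict(float)
    for atom, value in bit_atoms.items():
        out[project_atom(atom)] += value
    return dict(out)

def aggregate(per_bit_summaries):
    """Sum summaries over target bits; also return the average per target bit."""
    total = defaultdict(float)
    for summary in per_bit_summaries:
        for atom, value in summary.items():
            total[atom] += value
    n_bits = len(per_bit_summaries)
    return dict(total), {a: v / n_bits for a, v in total.items()}

# Two bit-level atoms that both project to the joint source {X1, X2}.
a1 = frozenset({frozenset({("X1", 1)}), frozenset({("X1", 2), ("X2", 1)})})
a2 = frozenset({frozenset({("X1", 1), ("X2", 1)})})
summary = pushforward({a1: 0.25, a2: 0.5})
```

Because the pushforward only regroups nonnegative values, the total mass for each target bit is unchanged and every projected value stays nonnegative.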
Remark 2 (Scope of the target-bit aggregation). The source-side projection described above is a pushforward of nonnegative MAR-PID atoms. The subsequent summation over target bits is a finite-resolution target-bit aggregation: it records how MAR-PID information about the resolved target coordinates is distributed over original-variable atoms. This target-bit summary is distinct from a native MAR-PID of the full target block. Cross-target-bit informational mechanisms require a separate target-block analysis or an additional support-level or dimension-level construction.
5. Algorithmic Computation and Estimation
We now summarize the empirical computation. Throughout this section the target is a binary component and the sources are finite-valued variables, either original discrete variables or binary coordinates obtained from the representation above.
5.1. Empirical Meaning and Statistical Scope
In this paper, the term empirical means that the construction starts from a finite observed sample and produces computable finite-resolution MAR-PID summaries after binarization and channel estimation. The resulting quantities are plug-in summaries of the chosen finite-resolution representation.
Their numerical values depend on the available sample, the binarization depths, and the set of source atoms included in the computation. Statistical error control, optimal resolution selection, and finite-sample confidence statements are separate questions and are not addressed here.
For reproducibility, the sample size N, target depth K, source depths, tie-handling convention, and chosen source universe should be reported with any finite-sample computation.
5.2. Channel Estimation
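Given samples
$$(t_i, x_i), \qquad i = 1, \dots, N,$$
from the binary target $T \in \{0,1\}$ and the source universe, estimate the target bias $p = P(T = 1)$ by
$$\hat{p} = \frac{1}{N} \sum_{i=1}^{N} \mathbf{1}\{t_i = 1\}.$$
For each source $S$ and each value $s$ of $S$, estimate
$$\kappa_S(s \mid t) = P(S = s \mid T = t), \qquad t \in \{0,1\}.$$
Let $N_{s,t}$ be the number of observations with $S = s$ and $T = t$, and let $N_t$ be the number of observations with $T = t$. We use the plug-in estimator
$$\hat{\kappa}_S(s \mid t) = \frac{N_{s,t}}{N_t}.$$
These conditional probabilities define the empirical channel $\hat{\kappa}_S$ for each source $S$. In the remainder of this section, all channels and MAR-PID quantities are empirical plug-in quantities unless explicitly stated otherwise; hats are omitted to lighten the notation.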
Regularized channel estimators may be useful in sparse empirical applications, but their choice is an implementation issue and is not part of the finite-resolution MAR-PID construction studied here.
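The plug-in step can be sketched with relative frequencies. The sketch assumes that the target is coded as 0/1 and that each joint source value has already been encoded as a single integer (for a multi-bit source, for example, by reading its bits as a binary number). The function name and return layout are illustrative assumptions.

```python
import numpy as np

def plug_in_channel(t, s):
    """Estimate P(T = 1) and the channel P(S = s | T = t) by relative frequencies.

    t: array of 0/1 target values; s: array of integer-coded source values.
    Returns (p_hat, values, channel) with
    channel[t, j] = #{S = values[j], T = t} / #{T = t}.
    """
    t = np.asarray(t, dtype=np.int64)
    values, codes = np.unique(np.asarray(s), return_inverse=True)
    counts = np.zeros((2, values.size))
    np.add.at(counts, (t, codes), 1.0)         # joint counts of (target, source value)
    n_t = counts.sum(axis=1, keepdims=True)    # number of samples with each target value
    if np.any(n_t == 0):
        raise ValueError("both target values must occur in the sample")
    return float(t.mean()), values, counts / n_t

p_hat, values, channel = plug_in_channel([0, 0, 1, 1, 1, 0, 1, 0],
                                         [0, 1, 1, 1, 0, 0, 1, 0])
```

Each row of the returned array is a conditional distribution over source values, so the rows sum to one. No smoothing is applied, which matches the unregularized estimator described above.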
5.3. Zonogons, Joins, and Cumulative Loss
For each empirical channel
, construct the binary-input zonogon associated with its column vectors. Blackwell joins of channels are computed geometrically at the level of these zonogons. For an atom
, write
Let
denote the channel associated with the full source universe. The empirical cumulative loss is
Möbius inversion on the synergy/loss lattice gives
The atoms are evaluated in an order compatible with
.
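Evaluating atoms in an order compatible with the lattice order amounts to a recursive Möbius inversion. The sketch below is generic: it assumes a finite poset given by a list of elements and a comparison function, and it assumes the convention that the cumulative value at an element is the sum of atom values over all elements below or equal to it. The orientation of the actual synergy/loss lattice is fixed in Appendix A and is not reproduced here; the function name is hypothetical.

```python
def mobius_inverse(elements, leq, cumulative):
    """Solve cumulative[a] = sum of atom[b] over all b <= a for the atom values.

    Elements are visited by increasing size of their down-set, which is a
    linear extension of the partial order, so every atom[b] with b < a is
    already known when a is processed.
    """
    down = {a: [b for b in elements if leq(b, a)] for a in elements}
    atom = {}
    for a in sorted(elements, key=lambda e: len(down[e])):
        atom[a] = cumulative[a] - sum(atom[b] for b in down[a] if b != a)
    return atom

# Toy check on the subsets of {1, 2} ordered by inclusion.
subsets = [frozenset(), frozenset({1}), frozenset({2}), frozenset({1, 2})]
true_atoms = dict(zip(subsets, [0.5, 0.25, 0.125, 1.0]))
cumulative = {a: sum(v for b, v in true_atoms.items() if b <= a) for a in subsets}
recovered = mobius_inverse(subsets, lambda b, a: b <= a, cumulative)
```

The recursion has quadratic cost in the number of lattice elements, which is negligible next to the size of the lattice itself for the small source universes considered here.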
5.4. Aggregation over Target Bits and Projection to Original Variables
Repeat the binary-target computation for
. The empirical block-level contribution is
or, in normalized form,
If source variables were represented by binary coordinates, one may additionally group bit-level atoms by their projected original-variable atom
as in
Section 4. This yields empirical original-variable summaries
5.5. Algorithmic Summary
The computation proceeds as follows.
(1) Choose finite resolutions for source and target variables.
(2) Represent the original target Y by its K binary components.
(3) Represent continuous or multilevel source variables by binary coordinates when required.
(4) For each target bit, estimate the binary-input channels for all sources S under consideration.
(5) Construct the associated zonogons and compute Blackwell joins.
(6) Compute the empirical cumulative losses.
(7) Apply Möbius inversion to obtain the bit-level atoms.
(8) Aggregate over target bits to obtain the block-level contribution or its normalized version.
(9) If source variables were binarized, project bit-level atoms to original-variable atoms and aggregate the corresponding contributions.
6. Illustrative Examples
The examples below are controlled mechanism checks rather than empirical benchmarks. They are chosen so that the redundant, unique, or synergistic structure is known in advance. This makes it possible to check whether the finite-resolution MAR-PID construction assigns nonnegative information contributions to the expected atoms. Real-data applications require further choices, including preprocessing, source selection, resolution choice, and statistical stabilization. These choices are application dependent and are not part of the construction studied here.
We provide three examples illustrating the finite-resolution MAR-PID viewpoint. The first contrasts the negative value of classical interaction information with the nonnegative PID representation of an XOR structure. The second shows that redundant and synergistic mechanisms may coexist in a binary-target distribution. The third illustrates the treatment of a non-binary discrete target.
6.1. XOR Under the Doubling Map
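Consider the doubling map on $[0,1)$,
$$M(x) = 2x \bmod 1.$$
Write the binary expansion, ignoring the null set of dyadic rationals, as
$$x = \sum_{j \ge 1} x_j\, 2^{-j}, \qquad x_j \in \{0,1\}.$$
Then $M$ acts as the left shift on binary digits: $(x_1, x_2, x_3, \dots) \mapsto (x_2, x_3, x_4, \dots)$.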
Let
and
be independent uniformly distributed points in
, with binary expansions
where
and
are mutually independent sequences of i.i.d. Bernoulli$(1/2)$ digits. Define
where ⊕ denotes addition modulo two. Let
Then
digitwise for every
.
At resolution
, let
denote the
k-bit prefixes. Since
digitwise, the classical trivariate interaction information of the two source prefixes and the target prefix equals $-k$ when information is measured in bits. Thus the signed interaction information summary is negative.
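The sign of this summary can be checked numerically. The sketch below assumes the sign convention in which the interaction information equals the sum of the two single-source mutual informations minus the joint mutual information, so that pure synergy is negative. It models the two k-bit source prefixes as independent uniform integers and the target prefix as their bitwise XOR. The helper names are hypothetical.

```python
import itertools
import math
from collections import Counter

def entropy(pmf):
    """Shannon entropy in bits of a pmf given as {outcome: probability}."""
    return -sum(p * math.log2(p) for p in pmf.values() if p > 0)

def marginal(joint, idx):
    """Marginal pmf of the coordinates listed in idx."""
    out = Counter()
    for outcome, p in joint.items():
        out[tuple(outcome[i] for i in idx)] += p
    return out

def mutual_information(joint, a, b):
    return (entropy(marginal(joint, a)) + entropy(marginal(joint, b))
            - entropy(marginal(joint, a + b)))

def xor_interaction_information(k):
    """I(X1;Y) + I(X2;Y) - I(X1,X2;Y) for k-bit blocks with Y = X1 XOR X2."""
    n = 2 ** k
    joint = {(x1, x2, x1 ^ x2): 1.0 / n ** 2
             for x1, x2 in itertools.product(range(n), repeat=2)}
    return (mutual_information(joint, (0,), (2,))
            + mutual_information(joint, (1,), (2,))
            - mutual_information(joint, (0, 1), (2,)))
```

Each single source carries no information about the target on its own, while the pair determines all k target bits, so the two marginal terms vanish and the joint term equals k.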
In the MAR-PID representation, keep the sources
and choose a binary target given by one output digit of
.
In this example we take the source universe to consist of the two source prefixes, each treated as a single k-bit finite-valued source variable. Thus the source blocks are treated as the two sources in the MAR-PID computation. One could alternatively use the individual source bits as the source universe; then the positive synergistic contribution for each target bit would be localized at the corresponding pair of source bits and would project back to the original-source atom consisting of the joint source.
Let
Then
For each binary target bit
, the nonzero MAR-PID contribution is the synergistic atom corresponding to the joint source:
Aggregating over the
k target bits gives
Thus the negative interaction information is replaced by a nonnegative atomic description: one bit of synergistic information is present at each resolved target digit (Table 1).
6.2. A Mixed Redundancy–Synergy Example
Let be the binary target, and let , , be an independent latent switching variable. The observed sources are constructed as follows:
If , set and . This regime introduces a redundant mechanism, since either source alone determines the target.
If , sample independently of A, and set and . This regime introduces an XOR-type synergistic mechanism, since neither source alone determines the target, while the pair does.
Let be either independent noise or a noisy copy of A, depending on whether one wants to include an additional weakly informative source.
Since B is latent, MAR-PID is applied to the marginal distribution of , after averaging over the switch. The resulting atoms need not decompose additively into the two latent regimes. The example should therefore be read as a qualitative mechanism test case: one regime introduces redundant information, while the other introduces XOR-type synergistic information. The redundant atom associated with and the synergistic atom associated with are the principal atoms to inspect. Additional atoms involving may appear if carries information about A.
For reference, Table 2 shows population Shannon summaries for the two-source version of this example, without the optional third source. The table is not a MAR-PID benchmark. It only records familiar information quantities for several values of the switch parameter q. The change of sign of the interaction information shows how the same family moves from a redundancy-dominated regime to an XOR-type synergy-dominated regime.
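The sign change can be reproduced with a short population computation. The parametrization below is an assumption made for illustration and may differ from the one behind Table 2: the target A is taken to be Bernoulli(1/2), the switch selects the XOR regime with probability q, and in that regime the first source is an independent fair bit R and the second is A ⊕ R. The names X1, X2, R and the function names are hypothetical.

```python
import math
from collections import Counter

def H(pmf):
    """Shannon entropy in bits."""
    return -sum(p * math.log2(p) for p in pmf.values() if p > 0)

def marg(joint, idx):
    """Marginal pmf of the coordinates listed in idx."""
    out = Counter()
    for outcome, p in joint.items():
        out[tuple(outcome[i] for i in idx)] += p
    return out

def mixed_joint(q):
    """Joint pmf of (A, X1, X2) after averaging over the latent switch."""
    joint = Counter()
    for a in (0, 1):
        joint[(a, a, a)] += 0.5 * (1 - q)            # redundant regime: X1 = X2 = A
        for r in (0, 1):
            joint[(a, r, a ^ r)] += 0.5 * q * 0.5    # XOR regime: X1 = R, X2 = A xor R
    return joint

def interaction_information(q):
    """I(A;X1) + I(A;X2) - I(A;X1,X2) in bits (synergy counts as negative)."""
    j = mixed_joint(q)
    mi = lambda a, b: H(marg(j, a)) + H(marg(j, b)) - H(marg(j, a + b))
    return mi((0,), (1,)) + mi((0,), (2,)) - mi((0,), (1, 2))
```

Under these assumptions the quantity equals one bit when the switch never selects the XOR regime, minus one bit when it always does, and it changes sign in between. In the mixture the pair of sources no longer determines the target exactly, which is one reason the atoms need not split additively across regimes.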
6.3. Discrete Variable with More than 2 Values
Let
be independent Bernoulli$(1/2)$ variables and let
Let
and
. Then
Y is a four-valued target, but its quantile binary representation consists of the two independent bits
and
. MAR-PID applied to
assigns one bit uniquely to
, while MAR-PID applied to
assigns one bit uniquely to
. Aggregation over target bits therefore gives two separate original-target information contributions rather than a single undifferentiated two-bit dependence.
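A small check makes the target-side binarization concrete. It assumes, purely for illustration, that the four-valued target is encoded as Y = 2·U1 + U2 for two independent fair bits U1 and U2. These names and the encoding are assumptions and may differ from the definition used above.

```python
import itertools

def quantile_bits_four_valued(y):
    """Two-level median code of a uniform four-valued target y in {0, 1, 2, 3}."""
    first = int(y >= 2)                   # median split of {0, 1, 2, 3}
    second = int(y - 2 * first >= 1)      # median split inside the selected half
    return first, second

# Enumerate the four equally likely outcomes and compare code bits with (U1, U2).
samples = [(u1, u2, 2 * u1 + u2) for u1, u2 in itertools.product((0, 1), repeat=2)]
recovered = all(quantile_bits_four_valued(y) == (u1, u2) for u1, u2, y in samples)
```

Under this encoding the first target bit coincides with U1 and the second with U2, so each binary target component depends on exactly one of the underlying bits.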
The example also shows where representation dependence enters. Internal permutations of bits within a source variable are removed by the source-side projection, so they do not change the resulting original-variable source atom. On the target side, however, the chosen binary representation matters: a substantially different binarization of Y may lead to a different target-bit summary. This is why the proposed quantities are finite-resolution summaries induced by a specified target representation.
Together, the examples illustrate the role of the finite-resolution representation. The XOR example starts from continuous variables on $[0,1)$, but MAR-PID is applied only after resolving binary coordinates. The four-valued example shows that target binarization can expose separate informational components that would otherwise appear as a single multivalued dependence.
7. Discussion
The present paper separates two tasks that are often conflated: constructing a nonnegative empirical information decomposition, and deriving support- or scale-based summaries from it. We focus on the first task.
The main construction is a finite-resolution empirical MAR-PID pipeline. Continuous or high-cardinality variables are represented by binary coordinates using recursive quantile binarization. MAR-PID is then applied to binary target components, producing nonnegative bit-level atoms. These atoms are aggregated across target bits and lifted from source-bit atoms back to atoms over the original source variables. Thus the binary coordinates serve as computational intermediates rather than final interpretive units.
The finite-depth dyadic property justifies the use of this binary representation. Under non-atomic conditional laws, the binary tree coordinates are balanced and conditionally independent at each finite-tree level. In the present construction, the source side is resolved over the chosen binary source universe: for each target bit, MAR-PID is computed over that universe, and the resulting source-bit atoms are pushed forward to atoms over the original variables. On the target side, the construction remains MAR-PID-nonnegativity-compatible bit by bit. Each bit-level quantity is a genuine nonnegative binary-target MAR-PID atom, and summing over the target-bit index ℓ gives a nonnegative finite-resolution target-bit summary.
The finite-resolution pipeline is neutral with respect to the redundancy–synergy interpretation of the atoms. Once the MAR-PID atoms have been estimated, the same aggregation and lifting procedure applies to atoms interpreted as redundant, unique, or synergistic contributions. Accordingly, the proposed summaries should be interpreted as finite-resolution MAR-PID summaries induced by the chosen binary representation, rather than as a representation-independent continuous-variable PID.
The examples illustrate why the atomic decomposition matters. In the XOR example, classical interaction information is negative, while MAR-PID identifies a nonnegative synergistic contribution at each target bit. In the mixed example, redundant and synergistic mechanisms coexist. A signed net quantity may obscure this coexistence, whereas MAR-PID keeps the mechanisms separated.
The construction has several practical constraints. Finite-sample estimation of high-dimensional conditional channels is statistically demanding. In applied work, one may need restricted atom families, problem-specific source selection, or other regularized channel estimators. Such choices concern large-scale implementation and statistical stabilization; they are not part of the finite-resolution MAR-PID construction itself.
Support-based degrees of freedom, persistent supports, and PID-based dimensions are natural downstream constructions. Once nonnegative empirical atoms are available, one may threshold their support, normalize across source and target resolutions, or study scaling with resolution. These choices require separate treatment. The role of the present paper is to establish the empirical nonnegative MAR-PID layer on which such summaries can be built.
The present paper should be read as a construction paper. It defines a finite-resolution representation, applies MAR-PID at the binary target-component level, and aggregates the resulting nonnegative atoms back to original variables. It does not claim to provide a complete statistical validation procedure. Real-data benchmarking, confidence statements, optimal resolution selection, and large-scale sparse implementation require further statistical and computational choices. These are important for applications, but they are not part of the finite-resolution MAR-PID construction itself.
The full-lattice version is combinatorial. If MAR-PID is computed over $m$ source coordinates, there are $2^m - 1$ nonempty source subsets, and the number of nonempty antichain atoms is $D(m) - 2$, where $D(m)$ is the $m$-th Dedekind number. This gives 4, 18, 166, and 7579 atoms for $m = 2, 3, 4$, and 5, respectively. In the bit-level construction, $m$ may be as large as the number of original source variables multiplied by the source depth $k$, so source-bit unfolding can increase the lattice quickly. Full-lattice computation should therefore be regarded as a small-source construction. For larger systems, one may reduce the source universe by grouping source bits into original variables, limiting the maximum source order, preselecting variables, or using a problem-specific family of source groups. Pruning very small atoms can be useful as a reporting or post-processing step, but it is not a substitute for the exact Möbius inversion on the chosen lattice.
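The atom counts can be verified by direct enumeration for small source universes. The sketch below counts nonempty antichains of nonempty subsets by a simple backtracking search; the function name is hypothetical, and the approach is only practical for a handful of source coordinates, which is the point of the remark above.

```python
from itertools import combinations

def count_atoms(m):
    """Count nonempty antichains of nonempty subsets of {0, ..., m-1}."""
    sources = [frozenset(c) for r in range(1, m + 1)
               for c in combinations(range(m), r)]

    def extend(start, chosen):
        # Count antichains consisting of `chosen` plus sources of index >= start.
        total = 1 if chosen else 0
        for i in range(start, len(sources)):
            s = sources[i]
            # s may be added only if it is incomparable with every chosen source.
            if all(not (s <= t or t <= s) for t in chosen):
                total += extend(i + 1, chosen + [s])
        return total

    return extend(0, [])

atom_counts = [count_atoms(m) for m in range(1, 6)]
```

Each antichain is generated exactly once because sources are added in increasing index order, so the running time is proportional to the number of atoms times the number of sources.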