Bell–CHSH Under Setting-Dependent Selection: Sharp Total-Variation Bounds and an Experimental Audit Protocol

Emmerson (Yaohushuason), Parker

doi:10.3390/quantum8010008

Open AccessArticle

Bell–CHSH Under Setting-Dependent Selection: Sharp Total-Variation Bounds and an Experimental Audit Protocol

by

Parker Emmerson (Yaohushuason)

Independent Researcher, Chapel Hill, NC 27514, USA

Quantum Rep. 2026, 8(1), 8; https://doi.org/10.3390/quantum8010008

Submission received: 18 December 2025 / Revised: 13 January 2026 / Accepted: 20 January 2026 / Published: 23 January 2026

Download

Browse Figures

Versions Notes

Abstract

Bell–CHSH is an inequality about unconditional expectations: under measurement independence, Bell locality, and bounded outcomes, the CHSH value satisfies

S \leq 2

. Experimental correlators, however, are often computed on an accepted subset of trials defined by detection logic, coincidence matching, quality cuts, and analysis windows. We model this by an acceptance probability

γ (a, b, λ) \in [0, 1]

and the resulting accepted hidden-variable law

ν_{a b}

obtained by weighting the measurement-independent prior

ρ

by

γ

and renormalizing. If

ν_{a b}

depends on the setting pair then the four correlators entering CHSH are expectations under four different measures, and a Bell-local measurement-independent model can yield

S_{obs} > 2

by selection alone. We quantify the required setting dependence in total variation (TV) distance. For any reference law

μ

we prove the sharp bound

S_{obs} \leq 2 + 2 \sum_{q \in Q} TV (ν_{q}, μ)

for a CHSH quartet Q. Optimizing over

μ

yields the intrinsic dispersion bound

S_{obs} \leq 2 + 2 Δ_{Q}

, and, in particular,

S_{obs} \leq min {4, 2 + 6 D_{Q}}

, where

D_{Q}

is the quartet TV diameter. The constants are optimal. Consequently, reproducing Tsirelson’s value

2 \sqrt{2}

within Bell-local measurement-independent models via setting-dependent acceptance requires

Δ_{Q} \geq \sqrt{2} - 1

(hence,

D_{Q} \geq (\sqrt{2} - 1) / 3

). We then propose a two-lane experimental audit protocol: (i) prior-relative fair-sampling diagnostics using tags recorded on all trials, and (ii) prior-free dispersion diagnostics using accepted-tag distributions across settings, with

Δ_{Q, X}

computable by linear programming on finite tag alphabets.

Keywords:

Bell’s theorem; CHSH; post-selection; detection loophole; coincidence-time loophole; selection bias; fair sampling; total variation distance; audit; linear programming

Graphical Abstract

1. Introduction

1.1. Background and Main Question

Bell–CHSH is a theorem about unconditional expectations. Under measurement independence (MI), Bell locality, and bounded outcomes, local hidden variable (LHV) models satisfy the CHSH inequality

S \leq 2

[1,2]. Modern experiments violate CHSH and, thereby, exclude the corresponding Bell-local MI model class, subject to the standard experimental controls (spacelike separation, random settings, etc.) [3,4,5,6].

The CHSH algebra is pointwise in the hidden variable

λ

. To apply it to experimental data, however, one also needs the four correlators entering the reported CHSH statistic to be expectations under a single hidden-variable law. This is exactly where selection can matter.

In real Bell-test pipelines, the data used for estimating correlations are typically defined by an acceptance rule (sometimes explicit, sometimes implicit): detector thresholds, hardware gates, coincidence matching, time windows, quality cuts, dead-time exclusions, or analysis filters. Conditioning on acceptance can change the hidden-variable law, and that change can depend on the settings.

This paper studies the following operational question:

If the accepted hidden-variable law depends on the setting pair, how much can CHSH be inflated above 2 within Bell-local, measurement-independent models, and what experimental diagnostics can bound or audit that inflation?

1.2. Why Quantitative Upper Bounds Help

A reported violation

S_{obs} > 2

is often read as “Bell-local MI models are excluded”. That inference is correct once one has justified that the reported correlators are computed under a single (or effectively single) accepted hidden-variable law. If, instead, the accepted ensemble changes with settings then the CHSH value can be inflated by selection alone, and a quantitative analysis must include information about that selection dependence.

A quantitative upper bound serves two roles:

(i): Necessary-condition reading. If $S_{obs}$ is large, the bound yields a minimum required magnitude of setting-dependent selection that any Bell-local MI selection-based explanation would need.
(ii): Exclusion-test reading. If an experiment can independently upper-bound selection dependence below a threshold then the bound can be used to rule out Bell-local MI selection-based explanations.

1.3. Related Work and Positioning

Selection-based loopholes in Bell tests have a long history. Early analyses include Pearle’s data-rejection model [7] and the efficiency thresholds developed by Garg–Mermin [8] and Eberhard [9]. For coincidence-time selection, see Larsson–Gill [10]. Modern “loophole-free” experiments close the detection loophole by event-ready architectures or by using loss-inclusive inequalities rather than coincidence-conditioned CHSH [4,5,6].

Mathematically, the setting-dependent accepted laws studied here are closely related to the measurement-dependence (relaxed MI) literature, which allows

ρ (λ ∣ a, b)

to depend on settings and derives relaxed Bell bounds under variational-distance constraints; see, e.g., [11,12,13]. The physical distinction emphasized here is that we assume MI at emission (a fixed prior

ρ

) and attribute the setting dependence to conditioning on acceptance.

1.4. Contributions and Roadmap

1.4.1. Theory

We model selection by an acceptance probability

γ (a, b, λ) \in [0, 1]

and the corresponding accepted laws

ν_{a b}

. We then prove sharp total variation (TV) bounds on CHSH inflation under setting-dependent acceptance: for any reference law

μ

,

S_{obs} \leq 2 + 2 \sum_{q \in Q} TV (ν_{q}, μ),

where Q is the CHSH quartet of setting pairs. Optimizing over

μ

yields the intrinsic dispersion bound

S_{obs} \leq 2 + 2 Δ_{Q}

and a simple corollary

S_{obs} \leq min {4, 2 + 6 D_{Q}}

. We also provide an explicit local deterministic construction showing the constants are optimal.

1.4.2. Audit Protocol

Because selection dependence is generally not identifiable from accepted outcomes alone, we propose a two-lane audit protocol based on auxiliary tags:

(i): a prior-relative lane comparing accepted-tag distributions to all-trial tag distributions (fair-sampling diagnostics), and
(ii): a prior-free lane comparing accepted-tag distributions across setting pairs (dispersion diagnostics).

On finite tag alphabets, the resolved dispersion statistic

Δ_{Q, X}

is computable by linear programming.

1.4.3. Outline

Section 2 defines the model and the main notation. Section 3 separates two distinct selection effects: prior-relative bias versus across-setting dispersion. Section 4 proves the main bounds and Tsirelson-scale necessary conditions. Section 5 gives a sharpness example. Section 6 develops the audit protocol, including tag sufficiency and LP computation. Section 7 discusses how the framework maps to representative Bell-test architectures.

1.5. Scope and Non-Claims

This paper does not dispute Bell’s theorem and does not allege flaws in any specific experiment. It provides a general selection calculus, sharp quantitative bounds, and audit targets. Whether a given experiment permits significant setting-dependent selection is an empirical question that depends on the detailed pipeline.

2. Model and Notation

2.1. Settings, Hidden Variables, MI, and Locality

Let

S_{A}, S_{B}

be the setting sets for Alice and Bob (finite or general measurable spaces). Let

(Λ, F)

be a measurable space of hidden variables and let

ρ \in P (Λ)

be a probability measure. Measurement independence (MI) means that the prior law

ρ

does not depend on the setting pair

(a, b)

.

A Bell-local model specifies measurable functions

A : S_{A} \times Λ \to [- 1, 1], B : S_{B} \times Λ \to [- 1, 1],

so each party’s output depends only on its local setting and

λ

. Deterministic

{\pm 1}

models are included as a special case.

We write “

ρ

-a.e.” for “

ρ

-almost everywhere”, i.e., except on a measurable set of

ρ

-measure zero.

2.2. Selection as an Acceptance Rule

Selection is modeled by an acceptance probability

γ : S_{A} \times S_{B} \times Λ \to [0, 1], (a, b, λ) \mapsto γ (a, b, λ) .

Intuitively, given

(a, b, λ)

, the trial is accepted with probability

γ (a, b, λ)

. This can represent detector efficiencies, coincidence-window acceptance, quality cuts, or any composite rule that decides whether a trial enters the dataset used to estimate correlations. Any extra randomization inside the acceptance pipeline can be absorbed into an enlarged hidden space (e.g.,

Λ \times [0, 1]

), so treating

γ

as an acceptance probability entails no loss of generality.

Define the acceptance rate

Z (a, b) : = \int_{Λ} γ (a, b, λ) d ρ (λ), assumed Z (a, b) \in (0, 1] for all (a, b) .

(1)

Definition 1

(Accepted hidden-variable law). For each setting pair

(a, b)

, define

ν_{a b} \in P (Λ)

by

ν_{a b} (E) : = \frac{\int_{E} γ (a, b, λ) d ρ (λ)}{\int_{Λ} γ (a, b, λ) d ρ (λ)} = \frac{1}{Z (a, b)} \int_{E} γ (a, b, λ) d ρ (λ), E \in F .

(2)

Equivalently,

ν_{a b}

is the conditional law of λ given acceptance when acceptance occurs with probability

γ (a, b, λ)

.

Remark 1

(Factorized local detection as a special case). If acceptance is generated by independent local detection probabilities

η_{A} (a, λ), η_{B} (b, λ) \in [0, 1]

with acceptance if both sides detect then

γ (a, b, λ) = η_{A} (a, λ) η_{B} (b, λ),

which is the standard detection-loophole structure [7,8,9]. The main bounds in this paper do not assume factorization; they apply to general (possibly joint) acceptance rules.

2.3. Observed Correlators, Unconditional Correlators, and CHSH

Define the accepted-sample (observed) correlator

E_{obs} (a, b) : = \int_{Λ} A (a, λ) B (b, λ) d ν_{a b} (λ),

(3)

and the unconditional correlator

E_{full} (a, b) : = \int_{Λ} A (a, λ) B (b, λ) d ρ (λ) .

(4)

Using Definition 1, the observed correlator has the explicit “weight-and-renormalize” form

E_{obs} (a, b) = \frac{1}{Z (a, b)} \int_{Λ} A (a, λ) B (b, λ) γ (a, b, λ) d ρ (λ) .

(5)

Fix a CHSH quartet of settings

a_{0}, a_{1} \in S_{A}

and

b_{0}, b_{1} \in S_{B}

, and write

E_{i j} : = E_{obs} (a_{i}, b_{j}), i, j \in {0, 1} .

We use the standard CHSH expression

S_{obs} : = | E_{00} + E_{01} + E_{10} - E_{11} | .

(6)

Define

S_{full}

analogously using

E_{full} (a_{i}, b_{j})

.

2.4. Total Variation Distance and a Key Inequality

Definition 2

(Total variation distance). For probability measures

μ, ν \in P (Λ)

,

TV (μ, ν) : = sup_{E \in F} | μ (E) - ν (E) | .

In particular,

TV (μ, ν) \in [0, 1]

. If Λ is finite then

TV (μ, ν) = \frac{1}{2} \sum_{x \in Λ} | μ (x) - ν (x) |

.

Lemma 1

(TV controls bounded expectation errors). Let

μ, ν \in P (Λ)

and let

f : Λ \to R

be measurable with

{∥ f ∥}_{\infty} \leq 1

. Then

| \int f d μ - \int f d ν | \leq 2 TV (μ, ν) .

Proof.

A standard dual characterization is

TV (μ, ν) = \frac{1}{2} sup_{{∥ g ∥}_{\infty} \leq 1} | \int g d μ - \int g d ν | .

Apply this with

g = f

. □

2.5. Notation Summary

Table 1 collects the core symbols. For a CHSH quartet, we write

Q : = {(a_{0}, b_{0}), (a_{0}, b_{1}), (a_{1}, b_{0}), (a_{1}, b_{1})} .

3. Two Distinct Selection Effects: Prior-Relative Bias vs. Across-Setting Dispersion

Acceptance can (i) bias each correlator relative to the unconditional correlator, and (ii) change the accepted hidden-variable law across setting pairs. Only the second effect is what allows CHSH inflation: CHSH is an inequality about expectations taken under a single measure.

3.1. Prior-Relative Deviation (Fair-Sampling Bias)

Definition 3

(Fair-sampling deviation). For each setting pair

(a, b)

, define

δ_{a b} : = 2 TV (ν_{a b}, ρ) \in [0, 2] .

(7)

For a fixed quartet Q and globally, define

δ_{Q} : = max_{(a, b) \in Q} δ_{a b}, δ : = sup_{a \in S_{A}, b \in S_{B}} δ_{a b} .

Remark 2

(Interpretation). How much

E_{obs} (a, b)

can differ from

E_{full} (a, b)

is controlled by

δ_{a b}

, due to acceptance at that setting pair. By itself,

δ_{a b}

does not control CHSH inflation, because CHSH involves four correlators that may be computed under four different measures.

3.2. Across-Setting Dispersion on a CHSH Quartet

Fix a quartet

(a_{0}, a_{1}, b_{0}, b_{1})

and write

ν_{00} : = ν_{a_{0} b_{0}}, ν_{01} : = ν_{a_{0} b_{1}}, ν_{10} : = ν_{a_{1} b_{0}}, ν_{11} : = ν_{a_{1} b_{1}} .

Definition 4

(Quartet dispersion and diameter). Define the quartet dispersion

Δ_{Q} : = inf_{μ \in P (Λ)} (TV (ν_{00}, μ) + TV (ν_{01}, μ) + TV (ν_{10}, μ) + TV (ν_{11}, μ))

(8)

and the quartet diameter

D_{Q} : = max_{(i, j) \neq (k, ℓ)} TV (ν_{i j}, ν_{k ℓ}) .

(9)

Proposition 1

(When dispersion vanishes).

Δ_{Q} = 0

if and only if

ν_{00} = ν_{01} = ν_{10} = ν_{11}

. Equivalently, the accepted hidden-variable law is setting-independent on the quartet.

Proof.

If all four measures are equal, choose

μ = ν_{00}

in (8). Conversely, if

Δ_{Q} = 0

then there exists a sequence

μ_{n}

, such that

\sum_{q \in {00, 01, 10, 11}} TV (ν_{q}, μ_{n}) \to 0

. Then, for any

q, q^{'}

,

TV (ν_{q}, ν_{q^{'}}) \leq TV (ν_{q}, μ_{n}) + TV (μ_{n}, ν_{q^{'}}) \to 0,

so all four measures coincide. □

Proposition 2

(Relations among

Δ_{Q}

,

D_{Q}

, and

δ_{Q}

). For every quartet Q,

D_{Q} \leq Δ_{Q} \leq 3 D_{Q}, Δ_{Q} \leq \sum_{(a, b) \in Q} TV (ν_{a b}, ρ) = \frac{1}{2} \sum_{(a, b) \in Q} δ_{a b} \leq 2 δ_{Q}, D_{Q} \leq δ_{Q} .

(10)

Proof.

For

D_{Q} \leq Δ_{Q}

fix any

μ

and choose

q, q^{'}

, attaining

D_{Q} = TV (ν_{q}, ν_{q^{'}})

. Then, by the triangle inequality,

TV (ν_{q}, ν_{q^{'}}) \leq TV (ν_{q}, μ) + TV (μ, ν_{q^{'}}) \leq \sum_{r \in Q} TV (ν_{r}, μ) .

Taking

{inf}_{μ}

yields

D_{Q} \leq Δ_{Q}

.

For

Δ_{Q} \leq 3 D_{Q}

choose

μ = ν_{00}

in (8). Then,

Δ_{Q} \leq TV (ν_{01}, ν_{00}) + TV (ν_{10}, ν_{00}) + TV (ν_{11}, ν_{00}) \leq 3 D_{Q} .

For

Δ_{Q} \leq \sum_{(a, b) \in Q} TV (ν_{a b}, ρ)

choose

μ = ρ

in (8). The equality with

δ_{a b}

follows from Definition 3.

For

D_{Q} \leq δ_{Q}

, for any two pairs

q, q^{'} \in Q

,

TV (ν_{q}, ν_{q^{'}}) \leq TV (ν_{q}, ρ) + TV (ρ, ν_{q^{'}}) \leq \frac{δ_{Q}}{2} + \frac{δ_{Q}}{2} = δ_{Q} .

Taking the maximum gives

D_{Q} \leq δ_{Q}

. □

Table 2 summarizes which TV quantities answer which inferential questions and what data are typically needed to bound or estimate them.

4. Universal CHSH Bounds Under Setting-Dependent Acceptance

This section proves the main quantitative statement of the paper: even under measurement independence and Bell locality, the observed CHSH value can exceed 2 if the four correlators are computed under four different accepted hidden-variable laws. The amount of possible inflation is controlled by total variation distances among those laws.

Throughout, fix a CHSH quartet of settings

a_{0}, a_{1} \in S_{A}

and

b_{0}, b_{1} \in S_{B}

, and use the shorthand

ν_{i j} : = ν_{a_{i} b_{j}}, E_{i j} : = E_{obs} (a_{i}, b_{j}) = \int A (a_{i}, λ) B (b_{j}, λ) d ν_{i j} (λ), i, j \in {0, 1} .

Recall the standard CHSH functional

S_{obs} = | E_{00} + E_{01} + E_{10} - E_{11} | .

4.1. Pointwise CHSH Algebra and the Unconditional Theorem

Lemma 2

(Pointwise CHSH bound). Fix

λ \in Λ

and settings

a_{0}, a_{1}, b_{0}, b_{1}

. Let

A_{i} : = A (a_{i}, λ) \in [- 1, 1]

and

B_{j} : = B (b_{j}, λ) \in [- 1, 1]

. Then,

| A_{0} B_{0} + A_{0} B_{1} + A_{1} B_{0} - A_{1} B_{1} | \leq 2 .

(11)

Proof.

Rewrite

A_{0} B_{0} + A_{0} B_{1} + A_{1} B_{0} - A_{1} B_{1} = A_{0} (B_{0} + B_{1}) + A_{1} (B_{0} - B_{1}) .

By the triangle inequality and

| A_{0} |, | A_{1} | \leq 1

,

| A_{0} (B_{0} + B_{1}) + A_{1} (B_{0} - B_{1}) | \leq | B_{0} + B_{1} | + | B_{0} - B_{1} | .

For any real

x, y

, one has

| x + y | + | x - y | = 2 max {| x |, | y |}

. With

x = B_{0}

,

y = B_{1}

and

| B_{0} |, | B_{1} | \leq 1

, the right-hand side is at most 2. □

Theorem 1

(Unconditional Bell–CHSH). Assume measurement independence, Bell locality, and bounded outcomes. Then, the unconditional correlators satisfy

S_{full} : = | E_{full} (a_{0}, b_{0}) + E_{full} (a_{0}, b_{1}) + E_{full} (a_{1}, b_{0}) - E_{full} (a_{1}, b_{1}) | \leq 2 .

Proof.

Define the bounded measurable function

C (λ) : = A (a_{0}, λ) B (b_{0}, λ) + A (a_{0}, λ) B (b_{1}, λ) + A (a_{1}, λ) B (b_{0}, λ) - A (a_{1}, λ) B (b_{1}, λ) .

Then,

S_{full} = | \int C d ρ |

. Using

| \int C d ρ | \leq \int | C | d ρ

and Lemma 2,

S_{full} \leq \int 2 d ρ = 2 .

□

Remark 3

(Where selection enters). Theorem 1 is a statement about unconditional expectations under the MI prior ρ. In experiments, correlators are often computed after conditioning on acceptance, i.e., under

ν_{i j}

. The rest of this section quantifies how this conditioning can inflate CHSH when

ν_{i j}

depends on

(i, j)

.

4.2. Main Inflation Bound: Reference-Measure Form

Theorem 2

(CHSH inflation bound (reference-measure form)). Assume measurement independence, Bell locality, and bounded outcomes. Fix a CHSH quartet and abbreviate

ν_{i j} = ν_{a_{i} b_{j}}

. Then, for every reference law

μ \in P (Λ)

,

S_{obs} \leq 2 + 2 (TV (ν_{00}, μ) + TV (ν_{01}, μ) + TV (ν_{10}, μ) + TV (ν_{11}, μ)) .

(12)

Proof.

Let

f_{i j} (λ) : = A (a_{i}, λ) B (b_{j}, λ)

, so

∥ f_{i j} ∥_{\infty} \leq 1

. Define

E_{i j} : = \int f_{i j} d ν_{i j}, {\tilde{E}}_{i j} : = \int f_{i j} d μ, ε_{i j} : = E_{i j} - {\tilde{E}}_{i j} .

Then,

S_{obs} = | E_{00} + E_{01} + E_{10} - E_{11} | = | ({\tilde{E}}_{00} + {\tilde{E}}_{01} + {\tilde{E}}_{10} - {\tilde{E}}_{11}) + (ε_{00} + ε_{01} + ε_{10} - ε_{11}) | .

By the triangle inequality,

S_{obs} \leq \underset{= : S_{μ}}{\underset{︸}{| {\tilde{E}}_{00} + {\tilde{E}}_{01} + {\tilde{E}}_{10} - {\tilde{E}}_{11} |}} + | ε_{00} + ε_{01} + ε_{10} - ε_{11} | \leq S_{μ} + \sum_{i, j \in {0, 1}} | ε_{i j} | .

(13)

Step 1: CHSH under a single reference measure.

Because

{\tilde{E}}_{i j} = \int f_{i j} d μ

are expectations under the same measure

μ

, Theorem 1 (with

ρ

replaced by

μ

) yields

S_{μ} \leq 2

.

Step 2: bound the error terms by TV.

By Lemma 1,

| ε_{i j} | = | \int f_{i j} d ν_{i j} - \int f_{i j} d μ | \leq 2 TV (ν_{i j}, μ) .

Substitute into (13). □

Remark 4

(How to read Theorem 2). Theorem 2 separates two effects:

a single-measure CHSH contribution (bounded by 2);
a penalty for using four measures instead of one, quantified by TV distances to an arbitrary reference law μ.

Optimizing over μ yields an intrinsic “distance-to-one-law” parameter for the quartet.

4.3. Intrinsic Dispersion and Diameter Bounds

Recall the dispersion and diameter from Definition 4:

Δ_{Q} = inf_{μ \in P (Λ)} \sum_{i, j \in {0, 1}} TV (ν_{i j}, μ), D_{Q} = max_{(i, j) \neq (k, ℓ)} TV (ν_{i j}, ν_{k ℓ}) .

Corollary 1

(Intrinsic dispersion bound).

S_{obs} \leq min {4, 2 + 2 Δ_{Q}} .

(14)

Proof.

Take the infimum of (12) over

μ

and use the definition of

Δ_{Q}

. The cap

S_{obs} \leq 4

is trivial because

E_{i j} \in [- 1, 1]

. □

Corollary 2

(CHSH holds on accepted data under quartet setting-independence). If

ν_{00} = ν_{01} = ν_{10} = ν_{11}

(equivalently

Δ_{Q} = 0

), then

S_{obs} \leq 2

.

Proof.

If

Δ_{Q} = 0

then (14) gives

S_{obs} \leq 2

. □

Corollary 3

(Diameter bound).

S_{obs} \leq min {4, 2 + 6 D_{Q}} .

(15)

Proof.

Choose

μ = ν_{00}

in (12). Then,

S_{obs} \leq 2 + 2 (0 + TV (ν_{01}, ν_{00}) + TV (ν_{10}, ν_{00}) + TV (ν_{11}, ν_{00})) \leq 2 + 2 (3 D_{Q}) = 2 + 6 D_{Q} .

Cap by 4. □

Remark 5

(Dispersion vs. diameter).

Δ_{Q}

is the sharp intrinsic “distance to a single accepted law” for the quartet. The diameter

D_{Q}

is easier to estimate from pairwise comparisons and yields the explicit bound (15). By Proposition 2, one always has

Δ_{Q} \leq 3 D_{Q}

, so the diameter bound is generally looser but operationally convenient.

4.4. Prior-Relative (Fair-Sampling) Bounds as Corollaries

Proposition 3

(Single-correlator bias bound). For every setting pair

(a, b)

,

| E_{obs} (a, b) - E_{full} (a, b) | \leq 2 TV (ν_{a b}, ρ) = δ_{a b} .

(16)

Proof.

Let

f (λ) : = A (a, λ) B (b, λ)

so

{∥ f ∥}_{\infty} \leq 1

. Then,

E_{obs} (a, b) - E_{full} (a, b) = \int f d ν_{a b} - \int f d ρ,

and Lemma 1 gives (16). □

Corollary 4

(Prior-relative CHSH inflation bound). Let

Q : = {(a_{i}, b_{j}) : i, j \in {0, 1}}

. Then,

\begin{matrix} S_{obs} & \leq min {4, 2 + \sum_{(a, b) \in Q} δ_{a b}} \\ \leq min {4, 2 + 4 δ_{Q}} \\ \leq min {4, 2 + 4 δ} . \end{matrix}

(17)

Proof.

Apply Theorem 2 with

μ = ρ

:

S_{obs} \leq 2 + 2 \sum_{(a, b) \in Q} TV (ν_{a b}, ρ) = 2 + \sum_{(a, b) \in Q} δ_{a b} .

Then, bound

\sum_{(a, b) \in Q} δ_{a b} \leq 4 δ_{Q} \leq 4 δ

and cap by 4. □

Remark 6

(When prior-relative bounds are loose). If acceptance is setting-independent on the quartet but not fair (i.e.,

ν_{a b} = ν \neq ρ

for all

(a, b) \in Q

) then CHSH holds on accepted data (

S_{obs} \leq 2

), but

TV (ν, ρ)

may be large. In such cases,

δ_{a b}

correctly measures bias relative to unconditional correlators but it overestimates CHSH inflation. For CHSH inflation, dispersion parameters

Δ_{Q}

and

D_{Q}

are the relevant controls.

4.5. Tsirelson-Scale Necessary Conditions

The next corollaries are immediate but operationally useful: they convert an observed CHSH value into necessary amounts of selection dependence.

Corollary 5

(Necessary dispersion for Tsirelson-scale CHSH). If a Bell-local MI model reproduces

S_{obs} = 2 \sqrt{2}

on a quartet by setting-dependent acceptance then

Δ_{Q} \geq \sqrt{2} - 1 \approx 0.4142, D_{Q} \geq \frac{\sqrt{2} - 1}{3} \approx 0.1381 .

(18)

Proof.

From Corollary 1,

2 \sqrt{2} \leq 2 + 2 Δ_{Q}

implies

Δ_{Q} \geq \sqrt{2} - 1

. From Corollary 3,

2 \sqrt{2} \leq 2 + 6 D_{Q}

implies

D_{Q} \geq (\sqrt{2} - 1) / 3

. □

Corollary 6

(Necessary fair-sampling deviation for Tsirelson-scale CHSH). If a Bell-local MI model reproduces

S_{obs} = 2 \sqrt{2}

on a quartet by setting-dependent acceptance then

δ_{Q} \geq \frac{2 \sqrt{2} - 2}{4} = \frac{\sqrt{2} - 1}{2} \approx 0.2071 .

(19)

Equivalently, for at least one pair

(a, b)

in the quartet,

TV (ν_{a b}, ρ) \geq \frac{\sqrt{2} - 1}{4} \approx 0.1036 .

Proof.

From Corollary 4,

2 \sqrt{2} \leq 2 + 4 δ_{Q}

implies (19). □

4.6. A Coarse Acceptance-Rate-Only Fairness Bound

The next bound is coarse but useful in practice: it upper-bounds fair-sampling deviation for a fixed setting pair using only the acceptance rate.

Proposition 4

(Acceptance-rate bound on fair-sampling deviation). For each setting pair

(a, b)

,

TV (ν_{a b}, ρ) \leq 1 - Z (a, b), equivalently δ_{a b} \leq 2 (1 - Z (a, b)) .

(20)

Proof.

Fix

(a, b)

and abbreviate

Z : = Z (a, b)

and

γ (λ) : = γ (a, b, λ)

. From Definition 1,

d ν_{a b} = \frac{γ (λ)}{Z} d ρ (λ) .

Therefore,

TV (ν_{a b}, ρ) = \frac{1}{2} \int_{Λ} | \frac{γ (λ)}{Z} - 1 | d ρ (λ) = \frac{1}{2 Z} \int_{Λ} | γ (λ) - Z | d ρ (λ) .

Now,

γ (λ) \in [0, 1]

and

\int γ d ρ = Z

. The function

x \mapsto | x - Z |

is convex on

[0, 1]

, and the set of

[0, 1]

-valued random variables with fixed mean Z is convex. A convex functional attains its maximum on this set at an extreme point, which here corresponds to a two-point distribution on

{0, 1}

, i.e., a Bernoulli(Z) variable. For Bernoulli(Z),

E | X - Z | = Z | 1 - Z | + (1 - Z) | 0 - Z | = 2 Z (1 - Z) .

Hence,

\int | γ - Z | d ρ \leq 2 Z (1 - Z)

, and substituting yields

TV (ν_{a b}, ρ) \leq \frac{1}{2 Z} \cdot 2 Z (1 - Z) = 1 - Z .

□

Remark 7

(Interpretation). If

Z (a, b) \approx 1

then the accepted law cannot be far from the prior at that setting pair. This does not, by itself, control dispersion across setting pairs: the accepted laws may still differ substantially even if all acceptance rates are moderate.

5. Sharpness: A Saturating Local Construction

The constants in the dispersion and diameter bounds are optimal: there exist Bell-local MI models in which

S_{obs}

attains the bounds as equalities.

Proposition 5

(Sharpness of the constants). Fix any

ε \in [0, 1 / 2]

. There exists a measurement-independent Bell-local deterministic model (with outcomes in

{\pm 1}

) and a setting-dependent acceptance rule

γ (a, b, λ) \in [0, 1]

, such that for a single CHSH quartet

(a_{0}, a_{1}, b_{0}, b_{1})

:

(i): each setting pair has fair-sampling deviation $δ_{a_{i} b_{j}} = ε$ (hence $δ_{Q} = ε$ );
(ii): the quartet dispersion and diameter satisfy $Δ_{Q} = 2 ε$ and $D_{Q} = 2 ε / 3$ ;
(iii): the observed CHSH value satisfies

$S_{obs} = 2 + 4 ε = 2 + 2 Δ_{Q} = 2 + 6 D_{Q},$

so Corollaries 1 and 3 are tight.

Proof of Construction and verification.

Step 1: A Bell-local deterministic model with $S_{full} = 2$ .

Let

Λ = {λ_{1}, λ_{2}, λ_{3}, λ_{4}}

with the uniform prior

ρ ({λ_{k}}) = 1 / 4

. Define deterministic outputs by

\begin{matrix} A (a_{0}, λ) & A (a_{1}, λ) & B (b_{0}, λ) & B (b_{1}, λ) \\ λ_{1} & + 1 & + 1 & + 1 & + 1 \\ λ_{2} & + 1 & + 1 & + 1 & - 1 \\ λ_{3} & + 1 & - 1 & - 1 & + 1 \\ λ_{4} & - 1 & + 1 & - 1 & - 1 \end{matrix}

For

(i, j) \in {0, 1}^{2}

write

f_{i j} (λ) : = A (a_{i}, λ) B (b_{j}, λ) \in {\pm 1}

. A direct check yields

E_{full} (a_{0}, b_{0}) = \frac{1}{2}, E_{full} (a_{0}, b_{1}) = \frac{1}{2}, E_{full} (a_{1}, b_{0}) = \frac{1}{2}, E_{full} (a_{1}, b_{1}) = - \frac{1}{2};

hence,

S_{full} = | 1 / 2 + 1 / 2 + 1 / 2 - (- 1 / 2) | = 2

.

Step 2: Setting-dependent re-weightings that shift each correlator by $\pm ε$ .

For each

(i, j)

, define a density with respect to

ρ

by

w_{i j} (λ) : = 1 + c_{i j} (f_{i j} (λ) - E_{full} (a_{i}, b_{j})), c_{i j} : = \frac{4}{3} s_{i j} ε,

(21)

where

(s_{00}, s_{01}, s_{10}, s_{11}) = (+ 1, + 1, + 1, - 1)

.

Since

E_{ρ} [f_{i j} - E_{full} (a_{i}, b_{j})] = 0

, each

w_{i j}

satisfies

\int w_{i j} d ρ = 1

.

Non-negativity of $w_{i j}$ .

For the three pairs with

E_{full} = + 1 / 2

, the variable

f_{i j} - E_{full}

takes values

+ 1 / 2

on three points and

- 3 / 2

on one point, and

c_{i j} \geq 0

. Thus, the minimum of

w_{i j}

is

w_{min} = 1 - \frac{3}{2} c_{i j} = 1 - 2 ε \geq 0

for

ε \leq 1 / 2

. For the remaining pair

(1, 1)

with

E_{full} = - 1 / 2

, one has

c_{11} \leq 0

, and

f_{11} - E_{full}

takes values

+ 3 / 2

(on one point) and

- 1 / 2

(on three points), so, again,

w_{11} \geq 1 - 2 ε \geq 0

. Hence, each

w_{i j}

is a valid density.

Define

ν_{i j}

by

d ν_{i j} : = w_{i j} d ρ

.

Observed correlations and $S_{obs}$ .

Because

f_{i j}^{2} \equiv 1

,

\int f_{i j} (f_{i j} - E_{full} (a_{i}, b_{j})) d ρ = \int (1 - E_{full} (a_{i}, b_{j}) f_{i j}) d ρ = 1 - E_{full} {(a_{i}, b_{j})}^{2} = \frac{3}{4} .

Therefore,

E_{obs} (a_{i}, b_{j}) = \int f_{i j} d ν_{i j} = \int f_{i j} w_{i j} d ρ = E_{full} (a_{i}, b_{j}) + c_{i j} \cdot \frac{3}{4} = E_{full} (a_{i}, b_{j}) + s_{i j} ε .

Consequently,

S_{obs} = | (\frac{1}{2} + ε) + (\frac{1}{2} + ε) + (\frac{1}{2} + ε) - (- \frac{1}{2} - ε) | = 2 + 4 ε .

Step 3: fair-sampling deviation $δ_{i j} = ε$ .

For

E_{full} = \pm 1 / 2

with the above

f_{i j}

distributions,

\int | f_{i j} - E_{full} (a_{i}, b_{j}) | d ρ = \frac{3}{4} .

Hence,

δ_{a_{i} b_{j}} = 2 TV (ν_{i j}, ρ) = \int | w_{i j} - 1 | d ρ = | c_{i j} | \cdot \frac{3}{4} = ε,

so

δ_{Q} = ε

.

Step 4: compute $D_{Q}$ and $Δ_{Q}$ .

Since

ρ

is uniform on four points, each

ν_{i j}

assigns masses

ν_{i j} (λ_{k}) = w_{i j} (λ_{k}) / 4

. For each

(i, j)

, the density

w_{i j}

takes exactly two values: three “high” values,

w_{high} = 1 + \frac{2 ε}{3},

and one “low” value,

w_{low} = 1 - 2 ε,

with the location of the low value depending on

(i, j)

and distinct across the four setting pairs. Therefore, any two distinct measures differ on exactly two points, and their TV distance is

TV (ν_{i j}, ν_{k ℓ}) = |\frac{w_{high}}{4} - \frac{w_{low}}{4}| = \frac{w_{high} - w_{low}}{4} = \frac{2 ε / 3 + 2 ε}{4} = \frac{2}{3} ε .

Hence,

D_{Q} = 2 ε / 3

, and the diameter bound (15) is attained:

2 + 6 D_{Q} = 2 + 4 ε = S_{obs}

.

For

Δ_{Q}

, Corollary 1 implies

S_{obs} \leq 2 + 2 Δ_{Q}

, so

2 + 4 ε \leq 2 + 2 Δ_{Q}

and

Δ_{Q} \geq 2 ε

. On the other hand, choosing

μ = ρ

in Definition 4 gives

Δ_{Q} \leq \sum_{i, j} TV (ν_{i j}, ρ) = 4 \cdot \frac{δ_{i j}}{2} = 2 ε .

Thus,

Δ_{Q} = 2 ε

and

S_{obs} = 2 + 2 Δ_{Q}

.

Step 5: realize the densities by an acceptance rule $γ \in [0, 1]$ .

Let

w_{max} = w_{high} = 1 + 2 ε / 3

, and choose

Z : = \frac{1}{w_{max}} = \frac{1}{1 + 2 ε / 3} \in [\frac{3}{4}, 1] .

Define

γ (a_{i}, b_{j}, λ) : = Z w_{i j} (λ)

. Then,

0 \leq γ \leq 1

and

\int γ d ρ = Z

, and Definition 1 yields

d ν_{i j} = w_{i j} d ρ

as required. □

Remark 8

(Meaning of the sharpness construction). Proposition 5 is a proof-of-possibility statement: even under Bell locality and measurement independence, setting-dependent acceptance can inflate CHSH up to the limits given by the TV geometry of the accepted laws. It does not claim that such re-weightings occur in any specific experiment.

6. Experimental Audit Protocol

The bounds in Section 4 are informative only if one can independently constrain the relevant TV quantities. In complete generality, neither the fair-sampling deviations

δ_{a b}

nor the dispersions

Δ_{Q}

and

D_{Q}

are identifiable from accepted outcome data alone: acceptance can act on unobserved degrees of freedom. This section therefore formulates audit targets based on auxiliary tags.

6.1. Audit Goals: Two Distinct Questions

A CHSH violation computed on accepted trials can be inflated above 2 if the accepted law

ν_{a b}

depends on

(a, b)

. Accordingly, we audit two different properties:

(i): Prior-relative bias / fair sampling (Lane A). For each setting pair $(a, b)$ , how close is $ν_{a b}$ to the MI prior $ρ$ ? This controls the difference between accepted and unconditional correlators via Proposition 3.
(ii): Across-setting dispersion (Lane B). For a tested quartet, how close are $ν_{00}, ν_{01}, ν_{10}, ν_{11}$ to being a single common law? This controls the CHSH inflation via Corollaries 1 and 3.

Data requirement for Lane A: a trial definition.

Lane A requires an empirical proxy for the prior marginal distribution of a tag, which, in turn, requires a definition of “all trials” (accepted and rejected). This is natural in event-ready or clocked/pulsed experiments. In continuous-wave coincidence-matching experiments, a trial definition is not intrinsic and must be imposed (e.g., by time bins).

6.2. Schematic: Selection and the Two Audit Lanes

Figure 1 summarizes how acceptance can change the hidden-variable law from the MI prior

ρ

to a setting-pair dependent accepted law

ν_{a b}

, and how Lane A and Lane B audits attach to different parts of the pipeline.

6.3. Tags and Pushforward (Tag) Distributions

Let

X : Λ \to X

be a measurable tag. Operationally, X may represent a discretized arrival-time residual, pulse-energy monitor bin, spectral bin, detector-state flag, or any auxiliary feature that can plausibly influence acceptance. For computability, we often take

X = {1, \dots, K}

finite, but the definitions do not require finiteness.

For a measure

μ \in P (Λ)

, define its pushforward (tag distribution)

μ^{X} : = μ \circ X^{- 1} \in P (X) .

In particular,

ρ^{X} : = ρ \circ X^{- 1}, ν_{a b}^{X} : = ν_{a b} \circ X^{- 1} .

6.4. Lane A: Prior-Relative Fair-Sampling Diagnostics

Definition 5

(Resolved fair-sampling deviation). For a setting pair

(a, b)

, define the resolved deviation at tag resolution X by

δ_{X} (a, b) : = 2 TV (ν_{a b}^{X}, ρ^{X}) .

For a quartet Q, define

δ_{X, Q} : = {max}_{(a, b) \in Q} δ_{X} (a, b)

.

Theorem 3

(Data processing for fair-sampling deviation). For every measurable tag X and every setting pair

(a, b)

,

δ_{X} (a, b) \leq δ_{a b} .

In particular,

δ_{X, Q} \leq δ_{Q}

.

Proof.

Total variation is contractive under measurable maps: for any T,

TV (μ \circ T^{- 1}, ν \circ T^{- 1}) \leq TV (μ, ν)

. Apply this with

T = X

,

μ = ν_{a b}

, and

ν = ρ

. □

Remark 9

(Operational meaning and limitation). How different the accepted-tag distribution is from the all-trial tag distribution is measured by

δ_{X} (a, b)

. It is a lower bound on the true hidden-variable deviation

δ_{a b}

: selection that acts only on untagged degrees of freedom may not be visible in

δ_{X}

. To obtain an upper bound on

δ_{a b}

from data, one needs an additional assumption: for example, that acceptance depends on λ only through the observed tag X (or approximately so).

6.5. Acceptance-Rate Representation on Tags (Optional)

Assume

X = {1, \dots, K}

and

ρ^{X} (i) > 0

. Write

ρ^{X} (i) = ρ (X = i), ν_{a b}^{X} (i) = ν_{a b} (X = i) .

From Definition 1,

ν_{a b}^{X} (i) = \frac{1}{Z (a, b)} \int_{Λ} 1_{{X = i}} (λ) γ (a, b, λ) d ρ (λ) .

Define the tag-level re-weighting factor

w_{a b}^{X} (i) : = \frac{ν_{a b}^{X} (i)}{ρ^{X} (i)} = \frac{E_{ρ} [γ (a, b, λ) ∣ X = i]}{Z (a, b)} .

(22)

Interpreting

Pr (Acc ∣ a, b, X = i) : = E_{ρ} [γ (a, b, λ) ∣ X = i]

, this reads

w_{a b}^{X} (i) = \frac{Pr (Acc ∣ a, b, X = i)}{Pr (Acc ∣ a, b)} .

Then,

δ_{X} (a, b) = \sum_{i = 1}^{K} | w_{a b}^{X} (i) - 1 | ρ^{X} (i) = \sum_{i = 1}^{K} | ν_{a b}^{X} (i) - ρ^{X} (i) | .

This expresses the Lane-A audit as follows: how much does acceptance probability vary across tag bins?

6.6. Lane B: Prior-Free Dispersion Diagnostics on Accepted Tags

Lane B targets the quantities that actually control CHSH inflation: how the accepted laws vary across setting pairs. This does not require knowledge of the prior

ρ

. Instead, it uses tag distributions within accepted data at each setting pair.

Fix a quartet and write

ν_{i j}^{X}

for the accepted-tag law at setting pair

(a_{i}, b_{j})

.

Definition 6

(Resolved diameter and resolved dispersion). Define the resolved quartet diameter

D_{Q, X} : = max_{(i, j) \neq (k, ℓ)} TV (ν_{i j}^{X}, ν_{k ℓ}^{X}),

and the resolved quartet dispersion

Δ_{Q, X} : = inf_{μ^{X} \in P (X)} \sum_{i, j \in {0, 1}} TV (ν_{i j}^{X}, μ^{X}) .

Theorem 4

(Data processing for dispersion). For every measurable tag X,

D_{Q, X} \leq D_{Q}, Δ_{Q, X} \leq Δ_{Q} .

Proof.

Contractivity under X yields

TV (ν_{i j}^{X}, ν_{k ℓ}^{X}) \leq TV (ν_{i j}, ν_{k ℓ})

for each pair. Taking maxima gives

D_{Q, X} \leq D_{Q}

. Similarly, for any

μ \in P (Λ)

one has

TV (ν_{i j}^{X}, μ^{X}) \leq TV (ν_{i j}, μ)

; hence,

inf_{μ^{X}} \sum_{i j} TV (ν_{i j}^{X}, μ^{X}) \leq inf_{μ} \sum_{i j} TV (ν_{i j}, μ),

which is

Δ_{Q, X} \leq Δ_{Q}

. □

Remark 10

(Interpretation). A large observed

Δ_{Q, X}

or

D_{Q, X}

is direct evidence that the accepted ensemble depends on settings at the resolution of the measured tag. Conversely, a small resolved value does not guarantee small hidden dispersion unless one has reason to believe the tag is sufficient (Section 6.7).

6.7. Tag Sufficiency: When Resolved Dispersion Becomes Exact

The inequalities

Δ_{Q, X} \leq Δ_{Q}

and

D_{Q, X} \leq D_{Q}

are one-sided in general. The following sufficiency condition (a standard statistical concept) makes them equalities.

Definition 7

(Tag sufficiency for the accepted family). Let

X : Λ \to X

be measurable. We say that X is sufficient for the accepted family

{ν_{a b}}_{(a, b)}

if there exists a Markov kernel

κ (d λ ∣ x)

from

X

to Λ, such that for every setting pair

(a, b)

,

ν_{a b} (d λ) = \int_{X} κ (d λ ∣ x) ν_{a b}^{X} (d x) .

(23)

Equivalently (on standard measurable spaces), the conditional law

ν_{a b} (d λ ∣ X = x)

does not depend on

(a, b)

for

ν_{a b}^{X}

-almost every x.

Proposition 6

(Exactness of TV geometry under sufficiency). Assume X is sufficient in the sense of Definition 7. Then, for any two setting pairs

(a, b)

and

(a^{'}, b^{'})

,

TV (ν_{a b}, ν_{a^{'} b^{'}}) = TV (ν_{a b}^{X}, ν_{a^{'} b^{'}}^{X}) .

Consequently, for any quartet Q,

D_{Q} = D_{Q, X}, Δ_{Q} = Δ_{Q, X} .

Proof.

Let

μ = ν_{a b}

and

ν = ν_{a^{'} b^{'}}

.

Step 1: pushforward (coarse-graining) inequality.

By contractivity under the map X,

TV (ν_{a b}^{X}, ν_{a^{'} b^{'}}^{X}) \leq TV (ν_{a b}, ν_{a^{'} b^{'}}) .

Step 2: reconstruction inequality using the common kernel.

By sufficiency,

ν_{a b} = κ ν_{a b}^{X}

and

ν_{a^{'} b^{'}} = κ ν_{a^{'} b^{'}}^{X}

for the same kernel

κ

. By contractivity under

κ

,

TV (ν_{a b}, ν_{a^{'} b^{'}}) = TV (κ ν_{a b}^{X}, κ ν_{a^{'} b^{'}}^{X}) \leq TV (ν_{a b}^{X}, ν_{a^{'} b^{'}}^{X}) .

Combining the two inequalities yields equality for pairwise Tvs. Taking maxima over the quartet gives

D_{Q} = D_{Q, X}

.

For dispersion, fix any

μ^{X} \in P (X)

and define

μ : = κ μ^{X}

. Then, for each

(i, j)

,

TV (ν_{i j}, μ) = TV (κ ν_{i j}^{X}, κ μ^{X}) \leq TV (ν_{i j}^{X}, μ^{X});

hence,

\sum_{i j} TV (ν_{i j}, μ) \leq \sum_{i j} TV (ν_{i j}^{X}, μ^{X}) .

Taking the infimum over

μ^{X}

yields

Δ_{Q} \leq Δ_{Q, X}

, while Theorem 4 gives

Δ_{Q, X} \leq Δ_{Q}

. Thus,

Δ_{Q} = Δ_{Q, X}

. □

Remark 11

(How sufficiency is used in practice). Sufficiency means that all setting dependence of the accepted law

ν_{a b}

is already visible in the tag marginal

ν_{a b}^{X}

, while the conditional distribution of the remaining (unobserved) degrees of freedom given X is setting-independent. Under this assumption, resolved quantities

Δ_{Q, X}

and

D_{Q, X}

can be substituted for

Δ_{Q}

and

D_{Q}

in Corollaries 1 and 3.

6.8. Computing $Δ_{Q, X}$ on Finite Tag Alphabets (Linear Programming)

When

X

is finite,

Δ_{Q, X}

is computable by a small linear program. This makes the Lane-B dispersion audit operational on accepted-tag histograms.

Theorem 5

(LP form of

Δ

on a finite space). Let

X = {1, \dots, K}

and let

η_{1}, \dots, η_{m} \in P (X)

(in our application

m = 4

and

η_{r} = ν_{i j}^{X}

). Define

Δ (η_{1}, \dots, η_{m}) : = inf_{μ \in P (X)} \sum_{r = 1}^{m} TV (η_{r}, μ) .

Then,

Δ (η_{1}, \dots, η_{m})

equals the optimum of the linear program

\begin{matrix} minimize & \frac{1}{2} \sum_{r = 1}^{m} \sum_{k = 1}^{K} t_{r, k} \\ over & μ_{k} \geq 0, \sum_{k = 1}^{K} μ_{k} = 1, t_{r, k} \geq 0 \\ subject to & t_{r, k} \geq η_{r} (k) - μ_{k}, t_{r, k} \geq μ_{k} - η_{r} (k) (\forall r, k) . \end{matrix}

(24)

An optimal minimizer

μ^{★}

exists.

Proof.

On a finite alphabet,

TV (η_{r}, μ) = \frac{1}{2} \sum_{k = 1}^{K} | η_{r} (k) - μ_{k} |

. Introducing epigraph variables

t_{r, k} \geq | η_{r} (k) - μ_{k} |

yields (24). The feasible set is compact and the objective is continuous, so a minimizer exists. □

Remark 12

(Lower-bound vs. upper-bound use). Because

Δ_{Q, X} \leq Δ_{Q}

(Theorem 4), the LP value computed from tag data provides a certified lower bound on the hidden dispersion:

Δ_{Q} \geq Δ_{Q, X} .

To use dispersion bounds as an exclusion tool for selection-based local explanations, one needs an upper bound on

Δ_{Q}

or

D_{Q}

. Tag sufficiency (Definition 7) is one route: if X is sufficient then

Δ_{Q} = Δ_{Q, X}

(Proposition 6).

6.9. Estimators and Uncertainty (Discrete Tags)

Assume

X = {1, \dots, K}

.

Lane A (prior-relative) estimators.

Assume a trial definition exists and X is recorded on all trials. Let

\hat{ρ^{X}}

be the empirical distribution of X across all trials, and let

\hat{ν_{a b}^{X}}

be the empirical distribution of X across accepted trials at setting pair

(a, b)

. Then, the plug-in estimate

\hat{δ_{X}} (a, b) = 2 TV (\hat{ν_{a b}^{X}}, \hat{ρ^{X}}) = \sum_{k = 1}^{K} | \hat{ν_{a b}^{X}} (k) - \hat{ρ^{X}} (k) |

is natural. Uncertainty can be quantified by nonparametric bootstrap resampling of trials within each setting cell and within the all-trials pool.

Lane B (prior-free) estimators.

For each setting pair in the quartet, estimate

\hat{ν_{i j}^{X}}

from accepted trials. Estimate pairwise TVs by

\hat{TV} (ν_{i j}^{X}, ν_{k ℓ}^{X}) : = \frac{1}{2} \sum_{r = 1}^{K} | \hat{ν_{i j}^{X}} (r) - \hat{ν_{k ℓ}^{X}} (r) | .

Set

{\hat{D}}_{Q, X}

as the maximum of the six pairwise TVs. Compute

{\hat{Δ}}_{Q, X}

by solving the LP (24) with inputs

\hat{ν_{00}^{X}}, \hat{ν_{01}^{X}}, \hat{ν_{10}^{X}}, \hat{ν_{11}^{X}}

. Bootstrap within each setting cell provides uncertainty bands.

6.10. Decision Logic: What Audits Can Certify vs. What They Can Exclude

Because total variation is contractive under coarse-graining, the directly measurable tag quantities

δ_{X} (a, b)

and

Δ_{Q, X}

are, in general, lower bounds on

δ_{a b}

and

Δ_{Q}

. This creates an asymmetry between certification and exclusion.

What audits can certify without extra assumptions (lower-bound logic).

If $δ_{X} (a, b)$ is large then acceptance is demonstrably non-fair at the resolution of X.
If $Δ_{Q, X}$ or $D_{Q, X}$ is large then the accepted ensemble demonstrably differs across settings at the resolution of X.

These findings do not prove that a Bell-local model actually explains the observed CHSH violation, but they demonstrate that the pipeline contains setting-dependent selection structure strong enough (at measured resolution) to support selection-based inflation in principle.

What is required to exclude selection-based Bell-local explanations (upper-bound logic).

To use Corollary 1 or 3 as exclusion tools, one needs an upper bound on

Δ_{Q}

or

D_{Q}

. Such an upper bound typically requires at least one of the following:

an architecture guarantee implying setting-independent acceptance on the tested quartet (forcing $Δ_{Q} = 0$ );
a justified sufficiency assumption that the measured tag X captures essentially all setting dependence of the accepted law (so that $Δ_{Q} \approx Δ_{Q, X}$ ); or
a physics-based constraint on unobserved degrees of freedom that limits unresolved selection dependence.

A one-sided exclusion test.

Suppose an experiment provides an external upper confidence bound

D_{Q} \leq D^{(+)}

and an estimate

{\hat{S}}_{obs}

with standard error

σ_{S}

. Then, Corollary 3 yields the exclusion condition

\begin{matrix} {\hat{S}}_{obs} - z_{1 - α} σ_{S} & > 2 + 6 D^{(+)} \\ \Rightarrow exclude Bell - local MI selection models \\ with D_{Q} \leq D^{(+)} at confidence 1 - α . \end{matrix}

(25)

Analogously, using dispersion:

{\hat{S}}_{obs} - z_{1 - α} σ_{S} > 2 + 2 Δ^{(+)}

excludes models with

Δ_{Q} \leq Δ^{(+)}

.

6.11. A Publication-Ready “Selection Statement” Checklist

To make selection semantics explicit (analogous to standard discussions of MI and spacelike separation), it is useful to include a short selection statement in Bell-test publications:

S1.: Trial definition. What counts as a trial? (Pulse, clock bin, herald, fixed time bins, etc.)
S2.: Outcome alphabet. Are no-click events retained as outcomes (loss-inclusive analysis), or are correlations conditioned on detection/coincidence?
S3.: Acceptance rule. List every step that discards trials or conditions the analysis (thresholds, coincidence windows, invalid flags, quality cuts).
S4.: Acceptance rates. Report $Z (a, b)$ by setting pair (with uncertainty), and note the coarse bound $TV (ν_{a b}, ρ) \leq 1 - Z (a, b)$ from Proposition 4.
S5.: Tag-based fair-sampling audit (Lane A). If feasible, report at least one tag X recorded on all trials and the resolved deviations $δ_{X} (a, b)$ .
S6.: Tag-based dispersion audit (Lane B). Report at least one accepted-tag dispersion diagnostic across settings (pairwise TVs, $D_{Q, X}$ , and/or $Δ_{Q, X}$ via LP).

6.12. Robustness Sweeps (Threshold/Window Sensitivity)

Many acceptance mechanisms enter through analysis parameters (e.g., coincidence window width, timing offsets, discriminator thresholds, filter bandwidth). A useful diagnostic is to repeat the full analysis over a modest grid of such parameters and report the stability of correlators and the CHSH value.

Let

θ \in Θ

denote a tuning parameter controlling acceptance (or a small grid of parameters). Let

{\hat{S}}_{obs} (θ)

be the estimated CHSH value obtained under parameter

θ

. Define the empirical range

{Range}_{Θ} (S) : = max_{θ \in Θ} {\hat{S}}_{obs} (θ) - min_{θ \in Θ} {\hat{S}}_{obs} (θ) .

Large sensitivity indicates that acceptance details matter operationally. Small sensitivity supports robustness, but by itself does not yield an upper bound on hidden dispersion: selection could still act on degrees of freedom not varied or not observed. In this framework, sweeps are most useful for guiding tag choice and checking whether resolved quantities

δ_{X}

and

Δ_{Q, X}

remain stable under reasonable analysis variations.

7. Discussion

7.1. How the Selection Model Maps to Experimental Architectures

The acceptance function

γ (a, b, λ)

is intentionally general: it can represent independent local detections, coincidence-time pairing, and pipeline-level cuts. Nevertheless, different experimental architectures naturally constrain the plausible setting dependence.

Event-ready (heralded) experiments.

In event-ready Bell tests, trials are often defined by a herald event that is generated before the local settings are chosen (or outside their past light cones in the relevant frame). If acceptance is determined solely by that herald event and is causally prior to the setting choices then one expects

γ (a, b, λ) = γ (λ)

and, hence,

ν_{a b}

is setting-independent. In that regime, Corollary 2 implies CHSH holds on accepted data without requiring fair sampling. Setting dependence can re-enter if additional discards occur after settings are chosen (e.g., basis-dependent invalid flags).

High-efficiency photonic experiments (loss-inclusive inequalities).

Many modern photonic experiments avoid conditioning on coincidence by treating no-click as an outcome and using a loss-inclusive inequality (e.g., Clauser–Horne or Eberhard-type analyses) [5,6,9,14]. In such analyses, the Bell functional is evaluated per trial, reducing the specific selection-induced CHSH inflation mechanism studied here. Nevertheless, any additional discards or conditioning steps (overflows, dead-time cuts, invalid-trial flags) reintroduce a nontrivial acceptance rule and should be disclosed and audited.

Continuous-wave time-tag experiments (coincidence matching).

In continuous-wave experiments without a natural trial clock, coincidence pairs are often produced by a matching rule on two time-tag streams. This is a paradigmatic joint acceptance rule and can generate setting-dependent accepted laws if the matching probability depends on settings or if settings induce timing shifts [10]. In that regime, Lane-B dispersion audits using time-residual tags are particularly natural.

7.2. Relation to Relaxed Measurement Independence

Mathematically, the accepted laws

ν_{a b}

behave like a setting-dependent hidden-variable law

ρ (λ ∣ a, b)

. This connects the present bounds to the measurement-dependence literature [11,12,13], which controls the variability of

ρ (λ ∣ a, b)

(often in variational distance) to obtain relaxed Bell inequalities.

The conceptual distinction here is physical: we assume MI at emission (a fixed prior

ρ

) and attribute setting dependence to conditioning on acceptance. In practice, both mechanisms can be present, and separating them experimentally requires careful design and disclosure.

7.3. Limitations and What the Framework Does (And Does Not) Deliver

The main limitation is identifiability: tag-based audits typically provide lower bounds on hidden selection structure (e.g.,

Δ_{Q, X} \leq Δ_{Q}

). To convert an audit into an upper bound, one needs a justified sufficiency assumption or an architecture argument.

The value of the present framework is that it makes the needed information explicit: an observed CHSH violation excludes Bell-local MI models once one has controlled (or quantitatively bounded) the across-setting dispersion of accepted hidden-variable laws.

7.4. Context: “Locality” Has Inequivalent Formalizations; A Small Taxonomy of CHSH Model Classes

The main results of this paper concern a classical Bell-local, measurement-independent (MI) model class with possible setting-dependent acceptance (selection), where CHSH inflation is controlled by the dispersion of the accepted hidden-variable laws

{ν_{a b}}

(Section 4, Section 5 and Section 6).

A recurring source of interpretive confusion in the Bell literature is that the word “locality” is used in multiple inequivalent technical senses. In this paper, Bell locality means the existence of local response functions

A (a, λ)

and

B (b, λ)

on a single probability space

(Λ, ρ)

(Section 2.1). This is stronger than the microcausal (operator-algebraic) locality axiom used in algebraic QFT, which requires only that Alice-side observables commute with Bob-side observables. Under microcausality, the universal CHSH upper bound is Tsirelson’s

2 \sqrt{2}

[15,16], not 2; a self-contained derivation via the Bell-operator commutator identity is given in Appendix G.

Table 3 summarizes a minimal “no-exception-list” taxonomy: different hypothesis classes have different sharp CHSH bounds. This paper’s contribution is the second row: a sharp, quantitative bound for CHSH computed on accepted data when the accepted ensemble can vary with settings.

8. Conclusions

Bell–CHSH is a theorem about unconditional correlations: under measurement independence, Bell locality, and bounded outcomes, the CHSH value satisfies

S \leq 2

(Theorem 1). Experimental correlators are often computed on an accepted subset of trials, and acceptance can depend on settings. When the accepted hidden-variable law

ν_{a b}

varies across setting pairs, the correlators entering CHSH are expectations under different measures, and CHSH can exceed 2 within Bell-local MI models by selection alone.

We quantified this effect in total variation distance. The main bound (Theorem 2) yields sharp universal inequalities:

S_{obs} \leq 2 + 2 Δ_{Q} \leq min {4, 2 + 6 D_{Q}},

where

Δ_{Q}

measures the quartet’s intrinsic distance to any single common accepted law and

D_{Q}

is the quartet diameter. The constants are optimal (Proposition 5). Consequently, Tsirelson-scale values

S_{obs} = 2 \sqrt{2}

require substantial across-setting dispersion within any Bell-local MI selection-based explanation.

Finally, we proposed a two-lane experimental audit protocol: Lane A provides prior-relative fair-sampling diagnostics using all-trial tags, while Lane B provides prior-free dispersion diagnostics using accepted-tag distributions across settings. On finite tag alphabets, the resolved dispersion statistic

Δ_{Q, X}

is computable by linear programming (Theorem 5). The aim is not to dispute Bell’s theorem, but to make explicit which quantitative selection information is needed to interpret observed CHSH violations as excluding Bell-local measurement-independent models. Bell–CHSH is a theorem about unconditional/shared-ensemble expectations; in real pipelines, the key empirical question is whether the reported correlators are expectations under a single accepted law.

Funding

This research received no external funding.

Data Availability Statement

The original data presented in the study are openly available in https://github.com/sphereofrealization/Bell-CHSH-Under-Setting-Dependent-Selection, accessed on 19 January 2026.

Conflicts of Interest

The author declares no conflicts of interest.

Appendix A. Acceptance as Conditionalization and Feasibility of Bounded Acceptance

This appendix records a few optional technical points. None are needed for the main bounds.

Appendix A.1. Conditionalization and the Radon–Nikodym Derivative

Fix a setting pair

(a, b)

and abbreviate

γ (λ) : = γ (a, b, λ)

and

Z : = Z (a, b)

. By Definition 1, the accepted law is

ν_{a b} (E) = \frac{1}{Z} \int_{E} γ (λ) d ρ (λ) .

In particular,

ν_{a b} ≪ ρ

and the Radon–Nikodym derivative exists:

\frac{d ν_{a b}}{d ρ} (λ) = \frac{γ (λ)}{Z} (ρ - a . e .) .

This is simply the standard “weight by acceptance and renormalize” rule.

Appendix A.2. When a Target Re-Weighting Can Come from γ∈[0,1]

Sometimes, it is convenient to specify a desired density

w = d ν / d ρ

and ask when it can be realized by a bounded acceptance rule.

Proposition A1

(Feasibility under bounded acceptance). Fix a prior ρ and a measurable

w : Λ \to [0, \infty)

with

\int w d ρ = 1

. Define ν by

d ν = w d ρ

. There exists an acceptance probability

γ : Λ \to [0, 1]

and an acceptance rate

Z \in (0, 1]

, such that

w (λ) = \frac{γ (λ)}{Z} (ρ - a . e .)

if and only if

w \in L^{\infty} (ρ)

.

Moreover, when

w \in L^{\infty} (ρ)

, any choice

0 < Z \leq 1 / {∥ w ∥}_{\infty}

yields a valid realization

γ : = Z w

, and the maximal achievable acceptance rate is

Z_{max} = 1 / {∥ w ∥}_{\infty}

.

Proof.

If

w = γ / Z

with

0 \leq γ \leq 1

then

w \leq 1 / Z

ρ

-a.e.; hence,

w \in L^{\infty} (ρ)

. Conversely, if

w \in L^{\infty} (ρ)

and

Z \leq 1 / {∥ w ∥}_{\infty}

then

γ : = Z w

satisfies

0 \leq γ \leq 1

and

\int γ d ρ = Z

. □

Appendix A.3. Other Divergences Controlling Total Variation (Optional)

Total variation is used here because it yields sharp constants and is computable on finite tag alphabets. For readers who prefer other divergences, standard inequalities relate TV to KL and

χ^{2}

divergences. For

μ ≪ ν

:

TV (μ, ν) \leq \sqrt{\frac{1}{2} D_{KL} (μ ∥ ν)} (Pinsker inequality),

and writing

d μ = w d ν

,

TV (μ, ν) = \frac{1}{2} \int | w - 1 | d ν \leq \frac{1}{2} \sqrt{\int {(w - 1)}^{2} d ν} = \frac{1}{2} \sqrt{χ^{2} (μ ∥ ν)} .

Thus, an independent upper bound on

D_{KL} (ν_{a b} ∥ ρ)

or

χ^{2} (ν_{a b} ∥ ρ)

implies an upper bound on

δ_{a b} = 2 TV (ν_{a b}, ρ)

and, therefore, feeds into the prior-relative CHSH bound (Corollary 4).

Appendix B. Factorized Detection Implies a Cross-Ratio Constraint (Optional)

This appendix records a simple structural constraint that holds under factorized acceptance. It is not used in the main bounds (which apply to general, possibly joint, acceptance rules).

Proposition A2

(Cross-ratio invariance under factorized detection). Fix a quartet

(a_{0}, a_{1}, b_{0}, b_{1})

and let

ν_{i j} = ν_{a_{i} b_{j}}

. Assume factorized acceptance

γ (a, b, λ) = η_{A} (a, λ) η_{B} (b, λ), η_{A}, η_{B} \in [0, 1],

and assume

Z (a_{i}, b_{j}) > 0

for all

i, j

. Define the RN densities

w_{i j} (λ) : = \frac{d ν_{i j}}{d ρ} (λ) = \frac{γ (a_{i}, b_{j}, λ)}{Z (a_{i}, b_{j})} .

Then, on the set where all four products

η_{A} (a_{i}, λ) η_{B} (b_{j}, λ)

are positive (equivalently, where all four

w_{i j} (λ)

are positive),

\frac{w_{00} (λ) w_{11} (λ)}{w_{01} (λ) w_{10} (λ)} = \frac{Z (a_{0}, b_{1}) Z (a_{1}, b_{0})}{Z (a_{0}, b_{0}) Z (a_{1}, b_{1})} (ρ - a . e .) .

Proof.

Under factorization,

w_{i j} (λ) = \frac{η_{A} (a_{i}, λ) η_{B} (b_{j}, λ)}{Z (a_{i}, b_{j})} .

Therefore, wherever all four factors are positive,

\frac{w_{00} w_{11}}{w_{01} w_{10}} = \frac{η_{A} (a_{0}) η_{B} (b_{0}) η_{A} (a_{1}) η_{B} (b_{1})}{η_{A} (a_{0}) η_{B} (b_{1}) η_{A} (a_{1}) η_{B} (b_{0})} \cdot \frac{Z (a_{0}, b_{1}) Z (a_{1}, b_{0})}{Z (a_{0}, b_{0}) Z (a_{1}, b_{1})} = 1 \cdot \frac{Z (a_{0}, b_{1}) Z (a_{1}, b_{0})}{Z (a_{0}, b_{0}) Z (a_{1}, b_{1})} .

□

Remark A1

(Possible diagnostic use: cross-ratio constancy and exceptional-bin semantics). Proposition A2 says that under factorized detection the pointwise cross-ratio

R (λ) : = \frac{w_{00} (λ) w_{11} (λ)}{w_{01} (λ) w_{10} (λ)}

(A1)

is constant in λ on the common-support set

{γ_{00} γ_{01} γ_{10} γ_{11} > 0}

, with value

R (λ) = \frac{Z_{01} Z_{10}}{Z_{00} Z_{11}} .

Equivalently, the λ-dependent quantity

log w_{00} (λ) + log w_{11} (λ) - log w_{01} (λ) - log w_{10} (λ)

is constant wherever it is defined.

How this becomes an audit in experiments.

Of course, λ is unobserved. However, if one records a tag X on all trials and computes the tag-binned acceptance ratios (cf. Section 6.5)

w_{i j}^{X} (x) : = \frac{Pr (Acc ∣ a_{i}, b_{j}, X = x)}{Pr (Acc ∣ a_{i}, b_{j})},

then a natural resolved cross-ratio diagnostic is

R_{X} (x) : = \frac{w_{00}^{X} (x) w_{11}^{X} (x)}{w_{01}^{X} (x) w_{10}^{X} (x)} .

(A2)

If the tag X is sufficiently informative that the RN densities are approximately constant within each tag bin (i.e.,

w_{i j} (λ)

is approximately X-measurable) then one expects

R_{X} (x)

to be approximately constant in x with target value

Z_{01} Z_{10} / (Z_{00} Z_{11})

. Large systematic deviations across bins suggest that acceptance is not well described by a factorized detection model, and it may instead involve non-factorized (joint) selection (e.g., coincidence-time matching) or unmodeled pipeline dependence.

Exceptional-bin issue (ratio semantics).

Like any ratio-based diagnostic,

R_{X} (x)

is ill-conditioned when the denominator is small. In practice, bins with very small

w_{01}^{X} (x)

or

w_{10}^{X} (x)

form an exceptional locus analogous to

0 / 0

expressions in CAS algebra. Therefore, an audit should explicitly state which convention is used:

(i): Strict semantics: compute $R_{X} (x)$ only on bins where all four terms in (A2) are well-defined and exceed a minimum-count threshold.
(ii): Bracketed/generic semantics: impose a cutoff $w_{01}^{X} (x) w_{10}^{X} (x) > ε$ (or an analogous count-based cutoff) and report results conditional on that cutoff.
(iii): Regularized semantics: add pseudo-counts (e.g., Laplace/Jeffreys smoothing) to avoid zeros and stabilize $log R_{X} (x)$ .
(iv): Resolved (division-free) semantics: avoid forming a ratio when both numerator and denominator are near zero by reporting the pair

$(w_{00}^{X} (x) w_{11}^{X} (x), w_{01}^{X} (x) w_{10}^{X} (x))$

(or a normalized two-vector) instead of $R_{X} (x)$ itself.

The key point is that any use of the cross-ratio as a factorization diagnostic should make the treatment of the exceptional locus explicit, since that choice can dominate the behavior of the statistic in sparse bins.

Appendix C. Constructive Selection Models (Optional)

This appendix collects illustrative constructions that show what becomes possible once one allows setting-dependent acceptance. These constructions are not used in the main bounds.

Appendix C.1. Canonical Re-Weighting for a Single Correlator

Fix a setting pair

(a, b)

and, for simplicity, assume

A (a, λ) B (b, λ) \in {\pm 1}

. Write

f (λ) : = A (a, λ) B (b, λ)

. Assume there exist measurable disjoint sets

U^{+}, U^{-} \subset Λ

with

ρ (U^{\pm}) > 0

, such that

f = + 1

on

U^{+}

and

f = - 1

on

U^{-}

.

Given a target correlation

E^{tgt} \in [- 1, 1]

, set

α : = (1 + E^{tgt}) / 2

and define

w (λ) : = \frac{α}{ρ (U^{+})} 1_{U^{+}} (λ) + \frac{1 - α}{ρ (U^{-})} 1_{U^{-}} (λ) .

Then,

\int w d ρ = 1

and the re-weighted law

d ν = w d ρ

satisfies

\int f d ν = α \cdot (+ 1) + (1 - α) \cdot (- 1) = E^{tgt} .

If

w \in L^{\infty} (ρ)

then Proposition A1 implies w can be realized by a bounded acceptance probability.

Appendix C.2. Simulating an Arbitrary Finite Correlation Table by Factorized Detection

Theorem A1

(Finite-table simulation by factorized detection). Fix finite settings

{a_{1}, \dots, a_{m}}

and

{b_{1}, \dots, b_{n}}

and target correlations

E_{i j}^{tgt} \in [- 1, 1]

. There exists a probability space

(Λ, ρ)

, Bell-local deterministic outputs

A (a_{i}, λ), B (b_{j}, λ) \in {\pm 1}

, and factorized detection indicators

D_{A} (a_{i}, λ), D_{B} (b_{j}, λ) \in {0, 1}

, such that:

(a): ρ is measurement-independent (independent of $(i, j)$ );
(b): the acceptance event is $Acc = {D_{A} = 1} \cap {D_{B} = 1}$ and factorizes: $γ (a_{i}, b_{j}, λ) = D_{A} (a_{i}, λ) D_{B} (b_{j}, λ)$ ;
(c): the accepted correlations satisfy $E_{obs} (a_{i}, b_{j}) = E_{i j}^{tgt}$ for all $i, j$ .

Construction.

Let

Λ : = {1, \dots, m} \times {1, \dots, n} \times [0, 1]

with

ρ

the product of the uniform measure on

{1, \dots, m} \times {1, \dots, n}

and Lebesgue measure on

[0, 1]

. Write

λ = (u, v, t)

.

Define factorized detection indicators:

D_{A} (a_{i}, λ) : = 1 {u = i}, D_{B} (b_{j}, λ) : = 1 {v = j} .

Thus, at setting pair

(i, j)

acceptance occurs if

u = i

and

v = j

, with acceptance rate

Z (a_{i}, b_{j}) = 1 / (m n)

.

Define outcomes

A (a_{i}, λ) : = + 1, B (b_{j}, λ) : = \{\begin{matrix} + 1, & t \leq α_{u j}, \\ - 1, & t > α_{u j}, \end{matrix} α_{u j} : = \frac{1 + E_{u j}^{tgt}}{2} .

Conditioned on acceptance at

(i, j)

, one has

u = i

,

v = j

, and

t \sim Unif [0, 1]

; hence,

E_{obs} (a_{i}, b_{j}) = E [B ∣ Acc, i, j] = (+ 1) α_{i j} + (- 1) (1 - α_{i j}) = 2 α_{i j} - 1 = E_{i j}^{tgt} .

□

Remark A2

(Interpretation). Theorem A1 does not contradict Bell–CHSH, because the accepted laws

ν_{i j}

are maximally setting dependent: each setting pair selects a different hidden slice. It illustrates an identifiability limitation: without controlling acceptance semantics, accepted-sample correlations alone cannot identify nonlocality.

Appendix D. A Common Mistaken “Local Model” and the Correct Correlation

A frequently proposed Bell-local deterministic model for polarization-like settings is

A (θ_{a}, λ) = sgn (cos (θ_{a} - λ)), B (θ_{b}, λ) = - sgn (cos (θ_{b} - λ)),

with

λ \sim Unif [0, 2 π)

. It is sometimes incorrectly claimed that this yields the quantum correlation

- cos (θ_{a} - θ_{b})

. It does not.

Let

Δ : = θ_{a} - θ_{b}

and wrap

Δ

into

[- π, π]

. Then

A (θ_{a}, λ)

and

B (θ_{b}, λ)

differ in sign precisely on an interval of

λ

of length

2 | Δ |

(modulo

2 π

). A direct geometric argument gives

E (Δ) : = E [A (θ_{a}, λ) B (θ_{b}, λ)] = - (1 - \frac{2 | Δ |}{π}), | Δ | \leq π,

extended periodically with period

2 π

. This is a triangular wave, not a cosine, and it cannot yield CHSH values exceeding 2 under unconditional sampling. Obtaining the cosine correlation locally would require relaxing MI, relaxing locality, or introducing setting-dependent acceptance/conditioning.

Appendix E. Pipeline Semantics (Optional)

It is often helpful to distinguish three layers in a Bell-test data pipeline:

(i): Trial definition: what physical attempts count as trials (pump pulses, clock bins, herald events, fixed time bins, etc.).
(ii): Outcome assignment: how raw records are mapped to an outcome alphabet (including, if desired, an explicit no-click outcome).
(iii): Acceptance/conditioning: which trials are discarded or which statistics are conditioned on an event $Acc$ (coincidences, quality cuts, invalid flags).

The selection calculus in this paper concerns (iii): the acceptance event

Acc

and its dependence on settings and hidden variables. When (iii) is absent (e.g., loss-inclusive analyses with no post-hoc discards), the specific selection-induced inflation mechanism studied here is reduced.

Appendix F. Phenomenological Velocity Analogy (Non-Essential)

This appendix is an optional analogy and is not used in any theorem or audit protocol. It provides one broad physical motif by which setting-dependent acceptance can re-weight hidden degrees of freedom without obviously changing coarse observed kinematics.

Appendix F.1. Fibered Non-Injectivity as a Selection Mechanism

Suppose the hidden state can be decomposed as

Λ ≅ V \times Φ,

where V denotes a phenomenological (externally monitored) kinematic parameter such as a time-residual coordinate, pulse-energy monitor, or spectral bin, and

Φ

denotes additional hidden structure. Let

L : V \times Φ \to M

be an operational summary map used in data handling (e.g., a discretized time-difference residual, a quality score, or an inferred kinematic label). If

L

is non-injective on fibers then many distinct

(v, ϕ)

share the same observed label

ℓ = L (v, ϕ)

.

Within the fiber over a fixed observed label ℓ, the hidden coordinate

ϕ

can carry internal sign structure for

A (a, λ) B (b, λ)

. A setting-dependent acceptance rule can then re-weight the mixture over

ϕ

within the same observed kinematic cell, altering accepted correlations while leaving the coarse observed label ℓ unchanged.

Figure A1 sketches this fibered non-injectivity motif.

Figure A1. Optional PV analogy: within a fixed observed label ℓ, hidden substructure can be re-weighted by selection without changing the coarse observed label.

Appendix F.2. Connection to Tag Choice

In the language of Section 6, the monitored coordinate(s) V motivate candidate tag variables X: if selection depends on hidden variables primarily through coarse kinematic labels (arrival-time residuals, pulse energy bins, etc.) then refining those tags may increase the resolved quantities

Δ_{Q, X}

and

δ_{X}

, revealing setting-dependent selection structure.

Appendix G. Tsirelson’s Bound from Commutators (Microcausal Noncommutative Contrast)

Remark A3

(Purpose and scope). The main text studies classical Bell-local, measurement-independent models in which the reported correlators are computed on an accepted subset of trials; if the accepted hidden-variable law

ν_{a b}

depends on settings, CHSH can be inflated above 2 by selection alone, and the inflation is controlled by the dispersion of

{ν_{a b}}

(Section 4, Section 5 and Section 6).

This appendix records a logically independent and widely used contrast: in a noncommutative (operator-algebraic) formulation with microcausal locality (commuting Alice/Bob observable algebras), the correct universal CHSH bound is Tsirelson’s

2 \sqrt{2}

, not 2. This provides a clean way to separate two statements that are sometimes conflated in informal discussions:

CHSH $\leq 2$ is a theorem for classical (commutative) shared-ensemble models (Theorem 1);
microcausal “no-signalling by commutation” locality does not force CHSH $\leq 2$ , but it does force CHSH $\leq 2 \sqrt{2}$ .

No theorem in the main text depends on this appendix.

Appendix G.1. C*-Probability Spaces and Microcausal Locality

Definition A1

(C*-probability space). A C*-probability space is a pair

(A, ω)

, where

A

is a unital C*-algebra and

ω : A \to C

is a state (linear, positive, and normalized:

ω (1_{A}) = 1

).

Definition A2

(Microcausal bipartite structure). Let

(A, ω)

be a C*-probability space. A microcausal bipartite structure is a pair of commuting unital C*-subalgebras

A_{A}, A_{B} \subset A

, such that

[X, Y] = 0 (\forall X \in A_{A}, \forall Y \in A_{B}),

where

[X, Y] : = X Y - Y X

is the commutator.

Remark A4

(Locality across wings vs. within a wing). Microcausality constrains only cross-wing commutation: Alice-side observables commute with Bob-side observables. It does not require that Alice’s alternative observables commute with each other, nor that Bob’s do. This is exactly where Tsirelson’s bound differs from the classical CHSH bound 2.

Appendix G.2. CHSH Bell Operator and Correlators

Definition A3

(CHSH quartet in a microcausal C* model). Fix self-adjoint unitaries (“

\pm 1

observables”),

A_{0}, A_{1} \in A_{A}, B_{0}, B_{1} \in A_{B}, A_{i}^{*} = A_{i}, B_{j}^{*} = B_{j}, A_{i}^{2} = B_{j}^{2} = 1_{A} .

Define correlators,

E_{i j} : = ω (A_{i} B_{j}), i, j \in {0, 1},

and the corresponding CHSH value,

S_{ω} : = | E_{00} + E_{01} + E_{10} - E_{11} | .

Definition A4

(CHSH Bell operator). Define the Bell operator,

B : = A_{0} (B_{0} + B_{1}) + A_{1} (B_{0} - B_{1}) \in A .

(A3)

Then,

S_{ω} = | ω (B) |

.

Appendix G.3. Bell-Operator Square Identity and Tsirelson Bound

Theorem A2

(Bell-operator square identity). Assume the microcausal bipartite structure above, so, in particular,

[A_{i}, B_{j}] = 0

for all

i, j

. Then, the Bell operator

B

satisfies

B^{2} = 4 1_{A} - [A_{0}, A_{1}] [B_{0}, B_{1}] .

(A4)

Proof.

Because

A_{i}

commutes with each

B_{j}

, products can be rearranged across wings. Expand (A3):

B^{2} = A_{0}^{2} {(B_{0} + B_{1})}^{2} + A_{1}^{2} {(B_{0} - B_{1})}^{2} + A_{0} A_{1} (B_{0} + B_{1}) (B_{0} - B_{1}) + A_{1} A_{0} (B_{0} - B_{1}) (B_{0} + B_{1}) .

Use

A_{0}^{2} = A_{1}^{2} = 1_{A}

to get

B^{2} = {(B_{0} + B_{1})}^{2} + {(B_{0} - B_{1})}^{2} + A_{0} A_{1} (B_{0} + B_{1}) (B_{0} - B_{1}) + A_{1} A_{0} (B_{0} - B_{1}) (B_{0} + B_{1}) .

Now

{(B_{0} + B_{1})}^{2} + {(B_{0} - B_{1})}^{2} = 4 1_{A}

(since

B_{0}^{2} = B_{1}^{2} = 1_{A}

). Also,

(B_{0} + B_{1}) (B_{0} - B_{1}) = - [B_{0}, B_{1}], (B_{0} - B_{1}) (B_{0} + B_{1}) = [B_{0}, B_{1}] .

Therefore, the cross terms equal

A_{0} A_{1} (- [B_{0}, B_{1}]) + A_{1} A_{0} ([B_{0}, B_{1}]) = (A_{1} A_{0} - A_{0} A_{1}) [B_{0}, B_{1}] = - [A_{0}, A_{1}] [B_{0}, B_{1}],

which gives (A4). □

Corollary A1

(Tsirelson bound; commutative (flat) subcase). In the setting above, one has

S_{ω} = | ω (B) | \leq ∥ B ∥ \leq 2 \sqrt{2} .

Moreover, if

[A_{0}, A_{1}] = 0

or

[B_{0}, B_{1}] = 0

then

B^{2} = 4 1_{A}

, and, hence,

S_{ω} \leq 2

.

Proof.

For any state

ω

and any

X \in A

,

| ω (X) | \leq ∥ X ∥

. Since

B

is self-adjoint,

{∥ B ∥}^{2} = ∥ B^{2} ∥

. From Theorem A2,

∥ B^{2} ∥ \leq ∥ 4 1_{A} ∥ + ∥ [A_{0}, A_{1}] [B_{0}, B_{1}] ∥ \leq 4 + ∥ [A_{0}, A_{1}] ∥ ∥ [B_{0}, B_{1}] ∥ .

For

∥ X ∥, ∥ Y ∥ \leq 1

, one has

∥ [X, Y] ∥ \leq ∥ X Y ∥ + ∥ Y X ∥ \leq 2 ∥ X ∥ ∥ Y ∥

; hence,

∥ [A_{0}, A_{1}] ∥ \leq 2

and

∥ [B_{0}, B_{1}] ∥ \leq 2

. Therefore,

∥ B^{2} ∥ \leq 4 + 4 = 8

, so

∥ B ∥ \leq 2 \sqrt{2}

and

S_{ω} \leq 2 \sqrt{2}

.

If

[A_{0}, A_{1}] = 0

or

[B_{0}, B_{1}] = 0

then (A4) gives

B^{2} = 4 1_{A}

, so

∥ B ∥ = 2

and, thus,

S_{ω} \leq 2

. □

Appendix G.4. The Commutative Sector: CHSH ≤ 2 as a Commutative-Subalgebra Bound

Definition A5

(Commutative sector for a CHSH quartet). In the microcausal C* setting of Appendix G, fix

A_{0}, A_{1} \in A_{A}

and

B_{0}, B_{1} \in A_{B}

as

\pm 1

observables. We say the quartet lies in the commutative sector if the C*-subalgebra

C : = C^{*} (A_{0}, A_{1}, B_{0}, B_{1}) \subset A

is commutative. Under microcausality

[A_{i}, B_{j}] = 0

, this is equivalent to requiring

[A_{0}, A_{1}] = 0

and

[B_{0}, B_{1}] = 0

.

Proposition A3

(CHSH

\leq 2

in the commutative sector). Let

(A, ω; A_{A}, A_{B})

be a microcausal C* scenario as above. If the CHSH quartet lies in the commutative sector (Definition A5) then

S_{ω} = | ω (A_{0} B_{0}) + ω (A_{0} B_{1}) + ω (A_{1} B_{0}) - ω (A_{1} B_{1}) | \leq 2 .

Proof.

Let

C : = C^{*} (A_{0}, A_{1}, B_{0}, B_{1})

, which is commutative by assumption. By the Gelfand representation theorem, there exists a compact Hausdorff space

Ω

, such that

C ≅ C (Ω)

as C*-algebras. By the Riesz representation theorem, the restricted state

{ω |}_{C}

corresponds to a probability measure P on

Ω

, such that

ω (f) = \int_{Ω} f d P

for all

f \in C (Ω)

.

Under this identification,

A_{0}, A_{1}, B_{0}, B_{1}

become (bounded) real-valued functions on

Ω

with values in

{\pm 1}

, so pointwise on

Ω

one has

| A_{0} B_{0} + A_{0} B_{1} + A_{1} B_{0} - A_{1} B_{1} | \leq 2 .

Integrating against P and using

| \int g d P | \leq \int | g | d P

yields

S_{ω} \leq 2

. □

Remark A5

(How this classifies Bell–CHSH among broader local frameworks). Proposition A3 formalizes the slogan that the classical CHSH bound 2 is a commutative-sector bound: it applies whenever the relevant observables are jointly representable inside a single commutative algebra (equivalently, a single classical probability space).

By contrast, Appendix G shows that microcausal locality (cross-wing commutation) by itself does not force commutativity within each wing, and in that larger noncommutative class the sharp universal bound is Tsirelson’s

2 \sqrt{2}

.

Finally, the main text of this paper treats a different enlargement of the classical CHSH-2 hypothesis class: it keeps the underlying model classical (commutative), but it allows the effective state/ensemble used for each correlator to depend on settings through acceptance (different

ν_{a b}

). In that case, CHSH inflation is controlled not by commutators but by the dispersion of the accepted laws (Corollaries 1 and 3).

Remark A6

(Contrast with selection-based inflation in the main text). Corollary A1 shows a non-selection route to

S > 2

: noncommutativity within a wing (non-zero commutators) enlarges the universal bound from 2 to

2 \sqrt{2}

even while preserving microcausal cross-wing locality.

The main text instead studies a classical route to

S_{obs} > 2

, in which the classical bound 2 continues to hold for expectations taken under any single measure , but the CHSH expression is formed from correlators evaluated under different accepted laws

ν_{a b}

, due to setting-dependent acceptance. These are logically independent mechanisms.

Remark A7

(Future Work: Synopsis of a companion paper (PV, Bell rungs, and the

- cos

derivation)). A companion paper will treat three logically distinct (but connected) topics that are only staged here [17].

(1) Rung-0 (Bell/CHSH) no-go and PV radical audit. We will begin by stating and using the unconditional single-ensemble Bell–CHSH bound: any Kolmogorov (classical) hidden-variable model with measurement independence, Bell locality, and bounded outcomes satisfies

S \leq 2

on every CHSH quartet. As an immediate consequence, the singlet target

E_{tgt} (a, b) = - cos (a - b)

—which achieves

S = 2 \sqrt{2}

on a standard Tsirelson quartet—cannot be realized as unconditional expectations

E (a, b) = \int A (a, λ) B (b, λ) d ρ (λ)

in that model class. Building on this, we will expand the PV radical-equation audit (S0 vs. S1/S3/S4) into a complete semantics catalogue of the exceptional locus and branch structure, including a precise characterization of when CAS outputs such as

v = \pm \sqrt{c^{2} D} / \sqrt{D}

arise from localization/regularization rather than as strict consequences of the original radical equation on its solution locus.

(2) Rung-2 (microcausal/noncommutative) PV model with a step-by-step

- cos

solution. In the same paper we will present, in full detail, the PV-indexed microcausal construction in which PV (v together with

κ (v) = \sqrt{1 - v^{2} / c^{2}}

) is promoted to a canonical unitary

U (v) \in S U (2)

. This unitary is then used to define PV-parameterized local observables by conjugation inside a bipartite operator algebra with commuting Alice/Bob subalgebras (microcausality). We will give the derivation line-by-line: PV

\Rightarrow θ (v)

\Rightarrow U (v)

⇒ rotated local Pauli observables ⇒ the singlet identity

ω ((σ \cdot u) \otimes (σ \cdot w)) = - u \cdot w

\Rightarrow E (a, b) = - cos (a - b)

, together with explicit verification of cross-wing commutation and unconditional (single-state) semantics.

(3) Rung comparison and physical interpretation (what is “hidden” and what is “measured”). Finally, the companion paper will formalize the rung dictionary and its operational meaning: when PV is treated in a classical (commutative) hidden-variable framework, the unconditional Bell–CHSH bound prevents any Tsirelson-scale

- cos

reproduction; when PV is instead promoted to a noncommutative parameter that indexes the local observable representation in a microcausal algebra, Tsirelson-scale

- cos

follows unconditionally in the singlet state. We will also clarify which aspects of PV are operationally identifiable (or provably unidentifiable) from two-point Bell statistics alone, and how the di-cone seam

U (2)

self-adjoint families provide a geometric realization of the same “PV → unitary parameter” mechanism.

References

Bell, J.S. On the Einstein Podolsky Rosen Paradox. Phys. Phys. Fiz. 1964, 1, 195–200. [Google Scholar] [CrossRef]
Clauser, J.F.; Horne, M.A.; Shimony, A.; Holt, R.A. Proposed experiment to test local hidden-variable theories. Phys. Rev. Lett. 1969, 23, 880–884. [Google Scholar] [CrossRef]
Brunner, N.; Cavalcanti, D.; Pironio, S.; Scarani, V.; Wehner, S. Bell nonlocality. Rev. Mod. Phys. 2014, 86, 419–478. [Google Scholar] [CrossRef]
Hensen, B.; Bernien, H.; Dréau, A.E.; Reiserer, A.; Kalb, N.; Blok, M.S.; Ruitenberg, J.; Vermeulen, R.F.L.; Schouten, R.N.; Abellán, C.; et al. Loophole-free Bell inequality violation using electron spins separated by 1.3 kilometres. Nature 2015, 526, 682–686. [Google Scholar] [CrossRef]
Giustina, M.; Versteegh, M.; Wengerowsky, S.; Handsteiner, J.; Hochrainer, A.; Phelan, A.; Steinlechner, F.; Kofler, J.; Larsson, J.-Å.; Abellán, C.; et al. Significant-Loophole-Free Test of Bell’s Theorem with Entangled Photons. Phys. Rev. Lett. 2015, 115, 250401. [Google Scholar] [PubMed]
Shalm, L.K.; Meyer-Scott, E.; Christensen, B.G.; Bierhorst, P.; Wayne, M.A.; Stevens, M.J.; Gerrits, T.; Glancy, S.; Hamel, D.R.; Allman, M.S.; et al. Strong Loophole-Free Test of Local Realism. Phys. Rev. Lett. 2015, 115, 250402. [Google Scholar] [CrossRef] [PubMed]
Pearle, P.M. Hidden-variable example based upon data rejection. Phys. Rev. D 1970, 2, 1418–1425. [Google Scholar] [CrossRef]
Garg, A.; Mermin, N.D. Detector inefficiencies in the Einstein-Podolsky-Rosen experiment. Phys. Rev. D 1987, 35, 3831–3835. [Google Scholar] [CrossRef] [PubMed]
Eberhard, P.H. Background level and counter efficiencies required for a loophole-free Einstein–Podolsky–Rosen experiment. Phys. Rev. A 1993, 47, R747–R750. [Google Scholar] [CrossRef] [PubMed]
Larsson, J.-Å.; Gill, R.D. Bell’s inequality and the coincidence-time loophole. Europhys. Lett. 2004, 67, 707–713. [Google Scholar] [CrossRef]
Hall, M.J.W. Local deterministic model of singlet state correlations based on relaxing measurement independence. Phys. Rev. Lett. 2010, 105, 250404. [Google Scholar] [CrossRef]
Barrett, J.; Gisin, N. How much measurement independence is needed to demonstrate nonlocality? Phys. Rev. Lett. 2011, 106, 100406. [Google Scholar] [CrossRef] [PubMed]
Pütz, G.; Rosset, D.; Barnea, T.J.; Liang, Y.-C.; Gisin, N. Arbitrarily small amount of measurement independence is sufficient to manifest nonlocality. Phys. Rev. Lett. 2014, 113, 190402. [Google Scholar] [CrossRef] [PubMed]
Clauser, J.F.; Horne, M.A. Experimental consequences of objective local theories. Phys. Rev. D 1974, 10, 526–535. [Google Scholar] [CrossRef]
Tsirelson, B.S. Quantum generalizations of Bell’s inequality. Lett. Math. Phys. 1980, 4, 93–100. [Google Scholar] [CrossRef]
Summers, S.J.; Werner, R. Bell’s inequalities and quantum field theory. I. General setting. J. Math. Phys. 1987, 28, 2440–2447. [Google Scholar] [CrossRef]
Emmerson, P. Nearest–Neighbour Cohomology meets Daisy Self–Similarity: A Unified Operator–Homotopy Framework. Authorea 2026. [Google Scholar] [CrossRef]

Figure 1. Acceptance changes the hidden-variable law from

ρ

to

ν_{a b}

, possibly in a setting-dependent way. Lane A audits prior-relative bias; Lane B audits across-setting dispersion.

Figure 1. Acceptance changes the hidden-variable law from

ρ

to

ν_{a b}

, possibly in a setting-dependent way. Lane A audits prior-relative bias; Lane B audits across-setting dispersion.

Table 1. Core notation.

Symbol	Meaning
$ρ$	MI prior law of $λ$ on $Λ$ .
$γ (a, b, λ)$	Acceptance probability (selection rule).
$Z (a, b)$	Acceptance rate $Z (a, b) = \int γ (a, b, λ) d ρ (λ)$ .
$ν_{a b}$	Accepted law (weighted and renormalized): (2).
$E_{obs} (a, b)$	Accepted-sample correlator: (3).
$E_{full} (a, b)$	Unconditional correlator: (4).
$S_{obs}$	CHSH value $\| E_{00} + E_{01} + E_{10} - E_{11} \|$ with $E_{i j} = E_{obs} (a_{i}, b_{j})$ .
$TV (μ, ν)$	Total variation distance: Definition 2.

Table 2. Summary of which TV quantities answer which inferential questions.

Quantity	Definition	What It Controls	Typical Data Needed
$δ_{a b}$	$2 TV (ν_{a b}, ρ)$	Bias from unconditional: $\| E_{obs} - E_{full} \|$	All-trial tags or a prior model
$Δ_{Q}$	${inf}_{μ} \sum_{q \in Q} TV (ν_{q}, μ)$	CHSH inflation: $S_{obs} \leq 2 + 2 Δ_{Q}$	Needs an upper bound (assumptions/architecture)
$D_{Q}$	${max}_{q \neq q^{'}} TV (ν_{q}, ν_{q^{'}})$	Explicit bound: $S_{obs} \leq min {4, 2 + 6 D_{Q}}$	Needs an upper bound (assumptions/architecture)

Table 3. A minimal taxonomy of CHSH hypothesis classes and their sharp bounds.

Model Class	Structural Features (What Ties the Four Correlators Together)	Sharp CHSH Bound
Classical Bell-local MI (unconditional)	One prior $ρ$ (MI) and local response functions $A (a, λ), B (b, λ)$ ; correlators are unconditional expectations under the same $ρ$	$S \leq 2$ (Theorem 1)
Classical Bell-local MI + setting-dependent acceptance	Same $ρ$ and local $A, B$ , but correlators are evaluated under setting-dependent accepted laws $ν_{a b}$ (different measures across settings)	$S_{obs} \leq 2 + 2 Δ_{Q} \leq min {4, 2 + 6 D_{Q}}$ (Corollaries 1,3)
Microcausal noncommutative (operator-algebraic) scenario	Single state $ω$ on a (generally noncommutative) algebra; commuting Alice/Bob subalgebras (microcausality) but not necessarily commutative within each wing; no selection	$S_{ω} \leq 2 \sqrt{2}$ (Appendix G)
No-signalling extremal	Only operational no-signalling constraints (e.g., PR-box class)	$S \leq 4$

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Emmerson, P. Bell–CHSH Under Setting-Dependent Selection: Sharp Total-Variation Bounds and an Experimental Audit Protocol. Quantum Rep. 2026, 8, 8. https://doi.org/10.3390/quantum8010008

AMA Style

Emmerson P. Bell–CHSH Under Setting-Dependent Selection: Sharp Total-Variation Bounds and an Experimental Audit Protocol. Quantum Reports. 2026; 8(1):8. https://doi.org/10.3390/quantum8010008

Chicago/Turabian Style

Emmerson (Yaohushuason), Parker. 2026. "Bell–CHSH Under Setting-Dependent Selection: Sharp Total-Variation Bounds and an Experimental Audit Protocol" Quantum Reports 8, no. 1: 8. https://doi.org/10.3390/quantum8010008

APA Style

Emmerson, P. (2026). Bell–CHSH Under Setting-Dependent Selection: Sharp Total-Variation Bounds and an Experimental Audit Protocol. Quantum Reports, 8(1), 8. https://doi.org/10.3390/quantum8010008

Article Menu

Bell–CHSH Under Setting-Dependent Selection: Sharp Total-Variation Bounds and an Experimental Audit Protocol

Abstract

1. Introduction

1.1. Background and Main Question

1.2. Why Quantitative Upper Bounds Help

1.3. Related Work and Positioning

1.4. Contributions and Roadmap

1.4.1. Theory

1.4.2. Audit Protocol

1.4.3. Outline

1.5. Scope and Non-Claims

2. Model and Notation

2.1. Settings, Hidden Variables, MI, and Locality

2.2. Selection as an Acceptance Rule

2.3. Observed Correlators, Unconditional Correlators, and CHSH

2.4. Total Variation Distance and a Key Inequality

2.5. Notation Summary

3. Two Distinct Selection Effects: Prior-Relative Bias vs. Across-Setting Dispersion

3.1. Prior-Relative Deviation (Fair-Sampling Bias)

3.2. Across-Setting Dispersion on a CHSH Quartet

4. Universal CHSH Bounds Under Setting-Dependent Acceptance

4.1. Pointwise CHSH Algebra and the Unconditional Theorem

4.2. Main Inflation Bound: Reference-Measure Form

4.3. Intrinsic Dispersion and Diameter Bounds

4.4. Prior-Relative (Fair-Sampling) Bounds as Corollaries

4.5. Tsirelson-Scale Necessary Conditions

4.6. A Coarse Acceptance-Rate-Only Fairness Bound

5. Sharpness: A Saturating Local Construction

6. Experimental Audit Protocol

6.1. Audit Goals: Two Distinct Questions

6.2. Schematic: Selection and the Two Audit Lanes

6.3. Tags and Pushforward (Tag) Distributions

6.4. Lane A: Prior-Relative Fair-Sampling Diagnostics

6.5. Acceptance-Rate Representation on Tags (Optional)

6.6. Lane B: Prior-Free Dispersion Diagnostics on Accepted Tags

6.7. Tag Sufficiency: When Resolved Dispersion Becomes Exact

6.8. Computing Δ Q , X on Finite Tag Alphabets (Linear Programming)

6.9. Estimators and Uncertainty (Discrete Tags)

6.10. Decision Logic: What Audits Can Certify vs. What They Can Exclude

6.11. A Publication-Ready “Selection Statement” Checklist

6.12. Robustness Sweeps (Threshold/Window Sensitivity)

7. Discussion

7.1. How the Selection Model Maps to Experimental Architectures

7.2. Relation to Relaxed Measurement Independence

7.3. Limitations and What the Framework Does (And Does Not) Deliver

7.4. Context: “Locality” Has Inequivalent Formalizations; A Small Taxonomy of CHSH Model Classes

8. Conclusions

Funding

Data Availability Statement

Conflicts of Interest

Appendix A. Acceptance as Conditionalization and Feasibility of Bounded Acceptance

Appendix A.1. Conditionalization and the Radon–Nikodym Derivative

Appendix A.2. When a Target Re-Weighting Can Come from γ∈[0,1]

Appendix A.3. Other Divergences Controlling Total Variation (Optional)

Appendix B. Factorized Detection Implies a Cross-Ratio Constraint (Optional)

Appendix C. Constructive Selection Models (Optional)

Appendix C.1. Canonical Re-Weighting for a Single Correlator

Appendix C.2. Simulating an Arbitrary Finite Correlation Table by Factorized Detection

Appendix D. A Common Mistaken “Local Model” and the Correct Correlation

Appendix E. Pipeline Semantics (Optional)

Appendix F. Phenomenological Velocity Analogy (Non-Essential)

Appendix F.1. Fibered Non-Injectivity as a Selection Mechanism

Appendix F.2. Connection to Tag Choice

Appendix G. Tsirelson’s Bound from Commutators (Microcausal Noncommutative Contrast)

Appendix G.1. C*-Probability Spaces and Microcausal Locality

Appendix G.2. CHSH Bell Operator and Correlators

Appendix G.3. Bell-Operator Square Identity and Tsirelson Bound

Appendix G.4. The Commutative Sector: CHSH ≤ 2 as a Commutative-Subalgebra Bound

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

6.8. Computing $Δ_{Q, X}$ on Finite Tag Alphabets (Linear Programming)