Next Article in Journal
Comparative Analysis of Quantum Technology Policies in the United States and China: Strategic Directions and Philosophical Foundations
Previous Article in Journal
Pushing the Limits of Large Language Models in Quantum Operations
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Bell–CHSH Under Setting-Dependent Selection: Sharp Total-Variation Bounds and an Experimental Audit Protocol

by
Parker Emmerson (Yaohushuason)
Independent Researcher, Chapel Hill, NC 27514, USA
Quantum Rep. 2026, 8(1), 8; https://doi.org/10.3390/quantum8010008
Submission received: 18 December 2025 / Revised: 13 January 2026 / Accepted: 20 January 2026 / Published: 23 January 2026

Abstract

Bell–CHSH is an inequality about unconditional expectations: under measurement independence, Bell locality, and bounded outcomes, the CHSH value satisfies S 2 . Experimental correlators, however, are often computed on an accepted subset of trials defined by detection logic, coincidence matching, quality cuts, and analysis windows. We model this by an acceptance probability γ ( a , b , λ ) [ 0 , 1 ] and the resulting accepted hidden-variable law ν a b obtained by weighting the measurement-independent prior ρ by γ and renormalizing. If ν a b depends on the setting pair then the four correlators entering CHSH are expectations under four different measures, and a Bell-local measurement-independent model can yield S obs > 2 by selection alone. We quantify the required setting dependence in total variation (TV) distance. For any reference law μ we prove the sharp bound S obs 2 + 2 q Q TV ( ν q , μ ) for a CHSH quartet Q. Optimizing over μ yields the intrinsic dispersion bound S obs 2 + 2 Δ Q , and, in particular, S obs min { 4 , 2 + 6 D Q } , where D Q is the quartet TV diameter. The constants are optimal. Consequently, reproducing Tsirelson’s value 2 2 within Bell-local measurement-independent models via setting-dependent acceptance requires Δ Q 2 1 (hence, D Q ( 2 1 ) / 3 ). We then propose a two-lane experimental audit protocol: (i) prior-relative fair-sampling diagnostics using tags recorded on all trials, and (ii) prior-free dispersion diagnostics using accepted-tag distributions across settings, with Δ Q , X computable by linear programming on finite tag alphabets.

Graphical Abstract

1. Introduction

1.1. Background and Main Question

Bell–CHSH is a theorem about unconditional expectations. Under measurement independence (MI), Bell locality, and bounded outcomes, local hidden variable (LHV) models satisfy the CHSH inequality S 2 [1,2]. Modern experiments violate CHSH and, thereby, exclude the corresponding Bell-local MI model class, subject to the standard experimental controls (spacelike separation, random settings, etc.) [3,4,5,6].
The CHSH algebra is pointwise in the hidden variable λ . To apply it to experimental data, however, one also needs the four correlators entering the reported CHSH statistic to be expectations under a single hidden-variable law. This is exactly where selection can matter.
In real Bell-test pipelines, the data used for estimating correlations are typically defined by an acceptance rule (sometimes explicit, sometimes implicit): detector thresholds, hardware gates, coincidence matching, time windows, quality cuts, dead-time exclusions, or analysis filters. Conditioning on acceptance can change the hidden-variable law, and that change can depend on the settings.
This paper studies the following operational question:
If the accepted hidden-variable law depends on the setting pair, how much can CHSH be inflated above 2 within Bell-local, measurement-independent models, and what experimental diagnostics can bound or audit that inflation?

1.2. Why Quantitative Upper Bounds Help

A reported violation S obs > 2 is often read as “Bell-local MI models are excluded”. That inference is correct once one has justified that the reported correlators are computed under a single (or effectively single) accepted hidden-variable law. If, instead, the accepted ensemble changes with settings then the CHSH value can be inflated by selection alone, and a quantitative analysis must include information about that selection dependence.
A quantitative upper bound serves two roles:
(i)
Necessary-condition reading. If S obs is large, the bound yields a minimum required magnitude of setting-dependent selection that any Bell-local MI selection-based explanation would need.
(ii)
Exclusion-test reading. If an experiment can independently upper-bound selection dependence below a threshold then the bound can be used to rule out Bell-local MI selection-based explanations.

1.3. Related Work and Positioning

Selection-based loopholes in Bell tests have a long history. Early analyses include Pearle’s data-rejection model [7] and the efficiency thresholds developed by Garg–Mermin [8] and Eberhard [9]. For coincidence-time selection, see Larsson–Gill [10]. Modern “loophole-free” experiments close the detection loophole by event-ready architectures or by using loss-inclusive inequalities rather than coincidence-conditioned CHSH [4,5,6].
Mathematically, the setting-dependent accepted laws studied here are closely related to the measurement-dependence (relaxed MI) literature, which allows ρ ( λ a , b ) to depend on settings and derives relaxed Bell bounds under variational-distance constraints; see, e.g., [11,12,13]. The physical distinction emphasized here is that we assume MI at emission (a fixed prior ρ ) and attribute the setting dependence to conditioning on acceptance.

1.4. Contributions and Roadmap

1.4.1. Theory

We model selection by an acceptance probability γ ( a , b , λ ) [ 0 , 1 ] and the corresponding accepted laws ν a b . We then prove sharp total variation (TV) bounds on CHSH inflation under setting-dependent acceptance: for any reference law μ ,
S obs 2 + 2 q Q TV ( ν q , μ ) ,
where Q is the CHSH quartet of setting pairs. Optimizing over μ yields the intrinsic dispersion bound S obs 2 + 2 Δ Q and a simple corollary S obs min { 4 , 2 + 6 D Q } . We also provide an explicit local deterministic construction showing the constants are optimal.

1.4.2. Audit Protocol

Because selection dependence is generally not identifiable from accepted outcomes alone, we propose a two-lane audit protocol based on auxiliary tags:
(i)
a prior-relative lane comparing accepted-tag distributions to all-trial tag distributions (fair-sampling diagnostics), and
(ii)
a prior-free lane comparing accepted-tag distributions across setting pairs (dispersion diagnostics).
On finite tag alphabets, the resolved dispersion statistic Δ Q , X is computable by linear programming.

1.4.3. Outline

Section 2 defines the model and the main notation. Section 3 separates two distinct selection effects: prior-relative bias versus across-setting dispersion. Section 4 proves the main bounds and Tsirelson-scale necessary conditions. Section 5 gives a sharpness example. Section 6 develops the audit protocol, including tag sufficiency and LP computation. Section 7 discusses how the framework maps to representative Bell-test architectures.

1.5. Scope and Non-Claims

This paper does not dispute Bell’s theorem and does not allege flaws in any specific experiment. It provides a general selection calculus, sharp quantitative bounds, and audit targets. Whether a given experiment permits significant setting-dependent selection is an empirical question that depends on the detailed pipeline.

2. Model and Notation

2.1. Settings, Hidden Variables, MI, and Locality

Let S A , S B be the setting sets for Alice and Bob (finite or general measurable spaces). Let ( Λ , F ) be a measurable space of hidden variables and let ρ P ( Λ ) be a probability measure. Measurement independence (MI) means that the prior law ρ does not depend on the setting pair ( a , b ) .
A Bell-local model specifies measurable functions
A : S A × Λ [ 1 , 1 ] , B : S B × Λ [ 1 , 1 ] ,
so each party’s output depends only on its local setting and λ . Deterministic { ± 1 } models are included as a special case.
We write “ ρ -a.e.” for “ ρ -almost everywhere”, i.e., except on a measurable set of ρ -measure zero.

2.2. Selection as an Acceptance Rule

Selection is modeled by an acceptance probability
γ : S A × S B × Λ [ 0 , 1 ] , ( a , b , λ ) γ ( a , b , λ ) .
Intuitively, given ( a , b , λ ) , the trial is accepted with probability γ ( a , b , λ ) . This can represent detector efficiencies, coincidence-window acceptance, quality cuts, or any composite rule that decides whether a trial enters the dataset used to estimate correlations. Any extra randomization inside the acceptance pipeline can be absorbed into an enlarged hidden space (e.g., Λ × [ 0 , 1 ] ), so treating γ as an acceptance probability entails no loss of generality.
Define the acceptance rate
Z ( a , b ) : = Λ γ ( a , b , λ ) d ρ ( λ ) , assumed Z ( a , b ) ( 0 , 1 ] for all ( a , b ) .
Definition 1 
(Accepted hidden-variable law). For each setting pair ( a , b ) , define ν a b P ( Λ ) by
ν a b ( E ) : = E γ ( a , b , λ ) d ρ ( λ ) Λ γ ( a , b , λ ) d ρ ( λ ) = 1 Z ( a , b ) E γ ( a , b , λ ) d ρ ( λ ) , E F .
Equivalently, ν a b is the conditional law of λ given acceptance when acceptance occurs with probability γ ( a , b , λ ) .
Remark 1 
(Factorized local detection as a special case). If acceptance is generated by independent local detection probabilities η A ( a , λ ) , η B ( b , λ ) [ 0 , 1 ] with acceptance if both sides detect then
γ ( a , b , λ ) = η A ( a , λ ) η B ( b , λ ) ,
which is the standard detection-loophole structure [7,8,9]. The main bounds in this paper do not assume factorization; they apply to general (possibly joint) acceptance rules.

2.3. Observed Correlators, Unconditional Correlators, and CHSH

Define the accepted-sample (observed) correlator
E obs ( a , b ) : = Λ A ( a , λ ) B ( b , λ ) d ν a b ( λ ) ,
and the unconditional correlator
E full ( a , b ) : = Λ A ( a , λ ) B ( b , λ ) d ρ ( λ ) .
Using Definition 1, the observed correlator has the explicit “weight-and-renormalize” form
E obs ( a , b ) = 1 Z ( a , b ) Λ A ( a , λ ) B ( b , λ ) γ ( a , b , λ ) d ρ ( λ ) .
Fix a CHSH quartet of settings a 0 , a 1 S A and b 0 , b 1 S B , and write
E i j : = E obs ( a i , b j ) , i , j { 0 , 1 } .
We use the standard CHSH expression
S obs : = | E 00 + E 01 + E 10 E 11 | .
Define S full analogously using E full ( a i , b j ) .

2.4. Total Variation Distance and a Key Inequality

Definition 2 
(Total variation distance). For probability measures μ , ν P ( Λ ) ,
TV ( μ , ν ) : = sup E F | μ ( E ) ν ( E ) | .
In particular, TV ( μ , ν ) [ 0 , 1 ] . If Λ is finite then TV ( μ , ν ) = 1 2 x Λ | μ ( x ) ν ( x ) | .
Lemma 1 
(TV controls bounded expectation errors). Let μ , ν P ( Λ ) and let f : Λ R be measurable with f 1 . Then
| f d μ f d ν | 2 TV ( μ , ν ) .
Proof. 
A standard dual characterization is
TV ( μ , ν ) = 1 2 sup g 1 | g d μ g d ν | .
Apply this with g = f . □

2.5. Notation Summary

Table 1 collects the core symbols. For a CHSH quartet, we write
Q : = { ( a 0 , b 0 ) , ( a 0 , b 1 ) , ( a 1 , b 0 ) , ( a 1 , b 1 ) } .

3. Two Distinct Selection Effects: Prior-Relative Bias vs. Across-Setting Dispersion

Acceptance can (i) bias each correlator relative to the unconditional correlator, and (ii) change the accepted hidden-variable law across setting pairs. Only the second effect is what allows CHSH inflation: CHSH is an inequality about expectations taken under a single measure.

3.1. Prior-Relative Deviation (Fair-Sampling Bias)

Definition 3 
(Fair-sampling deviation). For each setting pair ( a , b ) , define
δ a b : = 2 TV ( ν a b , ρ ) [ 0 , 2 ] .
For a fixed quartet Q and globally, define
δ Q : = max ( a , b ) Q δ a b , δ : = sup a S A , b S B δ a b .
Remark 2 
(Interpretation). How much E obs ( a , b ) can differ from E full ( a , b ) is controlled by δ a b , due to acceptance at that setting pair. By itself, δ a b does not control CHSH inflation, because CHSH involves four correlators that may be computed under four different measures.

3.2. Across-Setting Dispersion on a CHSH Quartet

Fix a quartet ( a 0 , a 1 , b 0 , b 1 ) and write
ν 00 : = ν a 0 b 0 , ν 01 : = ν a 0 b 1 , ν 10 : = ν a 1 b 0 , ν 11 : = ν a 1 b 1 .
Definition 4 
(Quartet dispersion and diameter). Define the quartet dispersion
Δ Q : = inf μ P ( Λ ) TV ( ν 00 , μ ) + TV ( ν 01 , μ ) + TV ( ν 10 , μ ) + TV ( ν 11 , μ )
and the quartet diameter
D Q : = max ( i , j ) ( k , ) TV ( ν i j , ν k ) .
Proposition 1 
(When dispersion vanishes). Δ Q = 0 if and only if ν 00 = ν 01 = ν 10 = ν 11 . Equivalently, the accepted hidden-variable law is setting-independent on the quartet.
Proof. 
If all four measures are equal, choose μ = ν 00 in (8). Conversely, if Δ Q = 0 then there exists a sequence μ n , such that q { 00 , 01 , 10 , 11 } TV ( ν q , μ n ) 0 . Then, for any q , q ,
TV ( ν q , ν q ) TV ( ν q , μ n ) + TV ( μ n , ν q ) 0 ,
so all four measures coincide. □
Proposition 2 
(Relations among Δ Q , D Q , and δ Q ). For every quartet Q,
D Q Δ Q 3 D Q , Δ Q ( a , b ) Q TV ( ν a b , ρ ) = 1 2 ( a , b ) Q δ a b 2 δ Q , D Q δ Q .
Proof. 
For D Q Δ Q fix any μ and choose q , q , attaining D Q = TV ( ν q , ν q ) . Then, by the triangle inequality,
TV ( ν q , ν q ) TV ( ν q , μ ) + TV ( μ , ν q ) r Q TV ( ν r , μ ) .
Taking inf μ yields D Q Δ Q .
For Δ Q 3 D Q choose μ = ν 00 in (8). Then,
Δ Q TV ( ν 01 , ν 00 ) + TV ( ν 10 , ν 00 ) + TV ( ν 11 , ν 00 ) 3 D Q .
For Δ Q ( a , b ) Q TV ( ν a b , ρ ) choose μ = ρ in (8). The equality with δ a b follows from Definition 3.
For D Q δ Q , for any two pairs q , q Q ,
TV ( ν q , ν q ) TV ( ν q , ρ ) + TV ( ρ , ν q ) δ Q 2 + δ Q 2 = δ Q .
Taking the maximum gives D Q δ Q . □
Table 2 summarizes which TV quantities answer which inferential questions and what data are typically needed to bound or estimate them.

4. Universal CHSH Bounds Under Setting-Dependent Acceptance

This section proves the main quantitative statement of the paper: even under measurement independence and Bell locality, the observed CHSH value can exceed 2 if the four correlators are computed under four different accepted hidden-variable laws. The amount of possible inflation is controlled by total variation distances among those laws.
Throughout, fix a CHSH quartet of settings a 0 , a 1 S A and b 0 , b 1 S B , and use the shorthand
ν i j : = ν a i b j , E i j : = E obs ( a i , b j ) = A ( a i , λ ) B ( b j , λ ) d ν i j ( λ ) , i , j { 0 , 1 } .
Recall the standard CHSH functional
S obs = | E 00 + E 01 + E 10 E 11 | .

4.1. Pointwise CHSH Algebra and the Unconditional Theorem

Lemma 2 
(Pointwise CHSH bound). Fix λ Λ and settings a 0 , a 1 , b 0 , b 1 . Let A i : = A ( a i , λ ) [ 1 , 1 ] and B j : = B ( b j , λ ) [ 1 , 1 ] . Then,
| A 0 B 0 + A 0 B 1 + A 1 B 0 A 1 B 1 |     2 .
Proof. 
Rewrite
A 0 B 0 + A 0 B 1 + A 1 B 0 A 1 B 1 = A 0 ( B 0 + B 1 ) + A 1 ( B 0 B 1 ) .
By the triangle inequality and | A 0 | , | A 1 |     1 ,
| A 0 ( B 0 + B 1 ) + A 1 ( B 0 B 1 ) |     | B 0 + B 1 |   +   | B 0 B 1 | .
For any real x , y , one has | x + y | + | x y | = 2 max { | x | , | y | } . With x = B 0 , y = B 1 and | B 0 | , | B 1 |     1 , the right-hand side is at most 2. □
Theorem 1 
(Unconditional Bell–CHSH). Assume measurement independence, Bell locality, and bounded outcomes. Then, the unconditional correlators satisfy
S full : = | E full ( a 0 , b 0 ) + E full ( a 0 , b 1 ) + E full ( a 1 , b 0 ) E full ( a 1 , b 1 ) |     2 .
Proof. 
Define the bounded measurable function
C ( λ ) : = A ( a 0 , λ ) B ( b 0 , λ ) + A ( a 0 , λ ) B ( b 1 , λ ) + A ( a 1 , λ ) B ( b 0 , λ ) A ( a 1 , λ ) B ( b 1 , λ ) .
Then, S full = | C d ρ | . Using | C d ρ | | C | d ρ and Lemma 2,
S full 2 d ρ = 2 .
Remark 3 
(Where selection enters). Theorem 1 is a statement about unconditional expectations under the MI prior ρ. In experiments, correlators are often computed after conditioning on acceptance, i.e., under ν i j . The rest of this section quantifies how this conditioning can inflate CHSH when ν i j depends on ( i , j ) .

4.2. Main Inflation Bound: Reference-Measure Form

Theorem 2 
(CHSH inflation bound (reference-measure form)). Assume measurement independence, Bell locality, and bounded outcomes. Fix a CHSH quartet and abbreviate ν i j = ν a i b j . Then, for every reference law μ P ( Λ ) ,
S obs 2 + 2 TV ( ν 00 , μ ) + TV ( ν 01 , μ ) + TV ( ν 10 , μ ) + TV ( ν 11 , μ ) .
Proof. 
Let f i j ( λ ) : = A ( a i , λ ) B ( b j , λ ) , so f i j 1 . Define
E i j : = f i j d ν i j , E ˜ i j : = f i j d μ , ε i j : = E i j E ˜ i j .
Then,
S obs = | E 00 + E 01 + E 10 E 11 | = | ( E ˜ 00 + E ˜ 01 + E ˜ 10 E ˜ 11 ) + ( ε 00 + ε 01 + ε 10 ε 11 ) | .
By the triangle inequality,
S obs | E ˜ 00 + E ˜ 01 + E ˜ 10 E ˜ 11 | = : S μ + | ε 00 + ε 01 + ε 10 ε 11 | S μ + i , j { 0 , 1 } | ε i j | .
  • Step 1: CHSH under a single reference measure.
Because E ˜ i j = f i j d μ are expectations under the same measure μ , Theorem 1 (with ρ replaced by μ ) yields S μ 2 .
  • Step 2: bound the error terms by TV.
By Lemma 1,
| ε i j | = | f i j d ν i j f i j d μ | 2 TV ( ν i j , μ ) .
Substitute into (13). □
Remark 4 
(How to read Theorem 2). Theorem 2 separates two effects:
  • a single-measure CHSH contribution (bounded by 2);
  • a penalty for using four measures instead of one, quantified by TV distances to an arbitrary reference law μ.
Optimizing over μ yields an intrinsic “distance-to-one-law” parameter for the quartet.

4.3. Intrinsic Dispersion and Diameter Bounds

Recall the dispersion and diameter from Definition 4:
Δ Q = inf μ P ( Λ ) i , j { 0 , 1 } TV ( ν i j , μ ) , D Q = max ( i , j ) ( k , ) TV ( ν i j , ν k ) .
Corollary 1 
(Intrinsic dispersion bound).
S obs min { 4 , 2 + 2 Δ Q } .
Proof. 
Take the infimum of (12) over μ and use the definition of Δ Q . The cap S obs 4 is trivial because E i j [ 1 , 1 ] . □
Corollary 2 
(CHSH holds on accepted data under quartet setting-independence). If ν 00 = ν 01 = ν 10 = ν 11 (equivalently Δ Q = 0 ), then S obs 2 .
Proof. 
If Δ Q = 0 then (14) gives S obs 2 . □
Corollary 3 
(Diameter bound).
S obs min { 4 , 2 + 6 D Q } .
Proof. 
Choose μ = ν 00 in (12). Then,
S obs 2 + 2 0 + TV ( ν 01 , ν 00 ) + TV ( ν 10 , ν 00 ) + TV ( ν 11 , ν 00 ) 2 + 2 ( 3 D Q ) = 2 + 6 D Q .
Cap by 4. □
Remark 5 
(Dispersion vs. diameter). Δ Q is the sharp intrinsic “distance to a single accepted law” for the quartet. The diameter D Q is easier to estimate from pairwise comparisons and yields the explicit bound (15). By Proposition 2, one always has Δ Q 3 D Q , so the diameter bound is generally looser but operationally convenient.

4.4. Prior-Relative (Fair-Sampling) Bounds as Corollaries

Proposition 3 
(Single-correlator bias bound). For every setting pair ( a , b ) ,
| E obs ( a , b ) E full ( a , b ) | 2 TV ( ν a b , ρ ) = δ a b .
Proof. 
Let f ( λ ) : = A ( a , λ ) B ( b , λ ) so f 1 . Then,
E obs ( a , b ) E full ( a , b ) = f d ν a b f d ρ ,
and Lemma 1 gives (16). □
Corollary 4 
(Prior-relative CHSH inflation bound). Let Q : = { ( a i , b j ) : i , j { 0 , 1 } } . Then,
S obs min { 4 , 2 + ( a , b ) Q δ a b } min { 4 , 2 + 4 δ Q } min { 4 , 2 + 4 δ } .
Proof. 
Apply Theorem 2 with μ = ρ :
S obs 2 + 2 ( a , b ) Q TV ( ν a b , ρ ) = 2 + ( a , b ) Q δ a b .
Then, bound ( a , b ) Q δ a b 4 δ Q 4 δ and cap by 4. □
Remark 6 
(When prior-relative bounds are loose). If acceptance is setting-independent on the quartet but not fair (i.e., ν a b = ν ρ for all ( a , b ) Q ) then CHSH holds on accepted data ( S obs 2 ), but TV ( ν , ρ ) may be large. In such cases, δ a b correctly measures bias relative to unconditional correlators but it overestimates CHSH inflation. For CHSH inflation, dispersion parameters Δ Q and D Q are the relevant controls.

4.5. Tsirelson-Scale Necessary Conditions

The next corollaries are immediate but operationally useful: they convert an observed CHSH value into necessary amounts of selection dependence.
Corollary 5 
(Necessary dispersion for Tsirelson-scale CHSH). If a Bell-local MI model reproduces S obs = 2 2 on a quartet by setting-dependent acceptance then
Δ Q 2 1 0.4142 , D Q 2 1 3 0.1381 .
Proof. 
From Corollary 1, 2 2 2 + 2 Δ Q implies Δ Q 2 1 . From Corollary 3, 2 2 2 + 6 D Q implies D Q ( 2 1 ) / 3 . □
Corollary 6 
(Necessary fair-sampling deviation for Tsirelson-scale CHSH). If a Bell-local MI model reproduces S obs = 2 2 on a quartet by setting-dependent acceptance then
δ Q 2 2 2 4 = 2 1 2 0.2071 .
Equivalently, for at least one pair ( a , b ) in the quartet,
TV ( ν a b , ρ ) 2 1 4 0.1036 .
Proof. 
From Corollary 4, 2 2 2 + 4 δ Q implies (19). □

4.6. A Coarse Acceptance-Rate-Only Fairness Bound

The next bound is coarse but useful in practice: it upper-bounds fair-sampling deviation for a fixed setting pair using only the acceptance rate.
Proposition 4 
(Acceptance-rate bound on fair-sampling deviation). For each setting pair ( a , b ) ,
TV ( ν a b , ρ ) 1 Z ( a , b ) , equivalently δ a b 2 1 Z ( a , b ) .
Proof. 
Fix ( a , b ) and abbreviate Z : = Z ( a , b ) and γ ( λ ) : = γ ( a , b , λ ) . From Definition 1,
d ν a b = γ ( λ ) Z d ρ ( λ ) .
Therefore,
TV ( ν a b , ρ ) = 1 2 Λ | γ ( λ ) Z 1 | d ρ ( λ ) = 1 2 Z Λ | γ ( λ ) Z | d ρ ( λ ) .
Now, γ ( λ ) [ 0 , 1 ] and γ d ρ = Z . The function x | x Z | is convex on [ 0 , 1 ] , and the set of [ 0 , 1 ] -valued random variables with fixed mean Z is convex. A convex functional attains its maximum on this set at an extreme point, which here corresponds to a two-point distribution on { 0 , 1 } , i.e., a Bernoulli(Z) variable. For Bernoulli(Z),
E | X Z | = Z | 1 Z | + ( 1 Z ) | 0 Z | = 2 Z ( 1 Z ) .
Hence, | γ Z | d ρ 2 Z ( 1 Z ) , and substituting yields
TV ( ν a b , ρ ) 1 2 Z · 2 Z ( 1 Z ) = 1 Z .
Remark 7 
(Interpretation). If Z ( a , b ) 1 then the accepted law cannot be far from the prior at that setting pair. This does not, by itself, control dispersion across setting pairs: the accepted laws may still differ substantially even if all acceptance rates are moderate.

5. Sharpness: A Saturating Local Construction

The constants in the dispersion and diameter bounds are optimal: there exist Bell-local MI models in which S obs attains the bounds as equalities.
Proposition 5 
(Sharpness of the constants). Fix any ε [ 0 , 1 / 2 ] . There exists a measurement-independent Bell-local deterministic model (with outcomes in { ± 1 } ) and a setting-dependent acceptance rule γ ( a , b , λ ) [ 0 , 1 ] , such that for a single CHSH quartet ( a 0 , a 1 , b 0 , b 1 ) :
(i) 
each setting pair has fair-sampling deviation δ a i b j = ε (hence δ Q = ε );
(ii) 
the quartet dispersion and diameter satisfy Δ Q = 2 ε and D Q = 2 ε / 3 ;
(iii) 
the observed CHSH value satisfies
S obs = 2 + 4 ε = 2 + 2 Δ Q = 2 + 6 D Q ,
so Corollaries 1 and 3 are tight.
Proof of Construction and verification. 
  • Step 1: A Bell-local deterministic model with S full = 2 .
Let Λ = { λ 1 , λ 2 , λ 3 , λ 4 } with the uniform prior ρ ( { λ k } ) = 1 / 4 . Define deterministic outputs by
A ( a 0 , λ ) A ( a 1 , λ ) B ( b 0 , λ ) B ( b 1 , λ ) λ 1 + 1 + 1 + 1 + 1 λ 2 + 1 + 1 + 1 1 λ 3 + 1 1 1 + 1 λ 4 1 + 1 1 1
For ( i , j ) { 0 , 1 } 2 write f i j ( λ ) : = A ( a i , λ ) B ( b j , λ ) { ± 1 } . A direct check yields
E full ( a 0 , b 0 ) = 1 2 , E full ( a 0 , b 1 ) = 1 2 , E full ( a 1 , b 0 ) = 1 2 , E full ( a 1 , b 1 ) = 1 2 ;
hence, S full = | 1 / 2 + 1 / 2 + 1 / 2 ( 1 / 2 ) | = 2 .
  • Step 2: Setting-dependent re-weightings that shift each correlator by ± ε .
For each ( i , j ) , define a density with respect to ρ by
w i j ( λ ) : = 1 + c i j f i j ( λ ) E full ( a i , b j ) , c i j : = 4 3 s i j ε ,
where ( s 00 , s 01 , s 10 , s 11 ) = ( + 1 , + 1 , + 1 , 1 ) .
Since E ρ [ f i j E full ( a i , b j ) ] = 0 , each w i j satisfies w i j d ρ = 1 .
  • Non-negativity of w i j .
For the three pairs with E full = + 1 / 2 , the variable f i j E full takes values + 1 / 2 on three points and 3 / 2 on one point, and c i j 0 . Thus, the minimum of w i j is
w min = 1 3 2 c i j = 1 2 ε 0
for ε 1 / 2 . For the remaining pair ( 1 , 1 ) with E full = 1 / 2 , one has c 11 0 , and f 11 E full takes values + 3 / 2 (on one point) and 1 / 2 (on three points), so, again, w 11 1 2 ε 0 . Hence, each w i j is a valid density.
Define ν i j by d ν i j : = w i j d ρ .
  • Observed correlations and S obs .
Because f i j 2 1 ,
f i j f i j E full ( a i , b j ) d ρ = 1 E full ( a i , b j ) f i j d ρ = 1 E full ( a i , b j ) 2 = 3 4 .
Therefore,
E obs ( a i , b j ) = f i j d ν i j = f i j w i j d ρ = E full ( a i , b j ) + c i j · 3 4 = E full ( a i , b j ) + s i j ε .
Consequently,
S obs = | ( 1 2 + ε ) + ( 1 2 + ε ) + ( 1 2 + ε ) ( 1 2 ε ) | = 2 + 4 ε .
  • Step 3: fair-sampling deviation δ i j = ε .
For E full = ± 1 / 2 with the above f i j distributions,
| f i j E full ( a i , b j ) | d ρ = 3 4 .
Hence,
δ a i b j = 2 TV ( ν i j , ρ ) = | w i j 1 | d ρ = | c i j | · 3 4 = ε ,
so δ Q = ε .
  • Step 4: compute D Q and Δ Q .
Since ρ is uniform on four points, each ν i j assigns masses ν i j ( λ k ) = w i j ( λ k ) / 4 . For each ( i , j ) , the density w i j takes exactly two values: three “high” values,
w high = 1 + 2 ε 3 ,
and one “low” value,
w low = 1 2 ε ,
with the location of the low value depending on ( i , j ) and distinct across the four setting pairs. Therefore, any two distinct measures differ on exactly two points, and their TV distance is
TV ( ν i j , ν k ) = w high 4 w low 4 = w high w low 4 = 2 ε / 3 + 2 ε 4 = 2 3 ε .
Hence, D Q = 2 ε / 3 , and the diameter bound (15) is attained: 2 + 6 D Q = 2 + 4 ε = S obs .
For Δ Q , Corollary 1 implies S obs 2 + 2 Δ Q , so 2 + 4 ε 2 + 2 Δ Q and Δ Q 2 ε . On the other hand, choosing μ = ρ in Definition 4 gives
Δ Q i , j TV ( ν i j , ρ ) = 4 · δ i j 2 = 2 ε .
Thus, Δ Q = 2 ε and S obs = 2 + 2 Δ Q .
  • Step 5: realize the densities by an acceptance rule γ [ 0 , 1 ] .
Let w max = w high = 1 + 2 ε / 3 , and choose
Z : = 1 w max = 1 1 + 2 ε / 3 3 4 , 1 .
Define γ ( a i , b j , λ ) : = Z w i j ( λ ) . Then, 0 γ 1 and γ d ρ = Z , and Definition 1 yields d ν i j = w i j d ρ as required. □
Remark 8 
(Meaning of the sharpness construction). Proposition 5 is a proof-of-possibility statement: even under Bell locality and measurement independence, setting-dependent acceptance can inflate CHSH up to the limits given by the TV geometry of the accepted laws. It does not claim that such re-weightings occur in any specific experiment.

6. Experimental Audit Protocol

The bounds in Section 4 are informative only if one can independently constrain the relevant TV quantities. In complete generality, neither the fair-sampling deviations δ a b nor the dispersions Δ Q and D Q are identifiable from accepted outcome data alone: acceptance can act on unobserved degrees of freedom. This section therefore formulates audit targets based on auxiliary tags.

6.1. Audit Goals: Two Distinct Questions

A CHSH violation computed on accepted trials can be inflated above 2 if the accepted law ν a b depends on ( a , b ) . Accordingly, we audit two different properties:
(i)
Prior-relative bias / fair sampling (Lane A). For each setting pair ( a , b ) , how close is ν a b to the MI prior ρ ? This controls the difference between accepted and unconditional correlators via Proposition 3.
(ii)
Across-setting dispersion (Lane B). For a tested quartet, how close are ν 00 , ν 01 , ν 10 , ν 11 to being a single common law? This controls the CHSH inflation via Corollaries 1 and 3.
  • Data requirement for Lane A: a trial definition.
Lane A requires an empirical proxy for the prior marginal distribution of a tag, which, in turn, requires a definition of “all trials” (accepted and rejected). This is natural in event-ready or clocked/pulsed experiments. In continuous-wave coincidence-matching experiments, a trial definition is not intrinsic and must be imposed (e.g., by time bins).

6.2. Schematic: Selection and the Two Audit Lanes

Figure 1 summarizes how acceptance can change the hidden-variable law from the MI prior ρ to a setting-pair dependent accepted law ν a b , and how Lane A and Lane B audits attach to different parts of the pipeline.

6.3. Tags and Pushforward (Tag) Distributions

Let X : Λ X be a measurable tag. Operationally, X may represent a discretized arrival-time residual, pulse-energy monitor bin, spectral bin, detector-state flag, or any auxiliary feature that can plausibly influence acceptance. For computability, we often take X = { 1 , , K } finite, but the definitions do not require finiteness.
For a measure μ P ( Λ ) , define its pushforward (tag distribution)
μ X : = μ X 1 P ( X ) .
In particular,
ρ X : = ρ X 1 , ν a b X : = ν a b X 1 .

6.4. Lane A: Prior-Relative Fair-Sampling Diagnostics

Definition 5 
(Resolved fair-sampling deviation). For a setting pair ( a , b ) , define the resolved deviation at tag resolution X by
δ X ( a , b ) : = 2 TV ( ν a b X , ρ X ) .
For a quartet Q, define δ X , Q : = max ( a , b ) Q δ X ( a , b ) .
Theorem 3 
(Data processing for fair-sampling deviation). For every measurable tag X and every setting pair ( a , b ) ,
δ X ( a , b ) δ a b .
In particular, δ X , Q δ Q .
Proof. 
Total variation is contractive under measurable maps: for any T, TV ( μ T 1 , ν T 1 ) TV ( μ , ν ) . Apply this with T = X , μ = ν a b , and ν = ρ . □
Remark 9 
(Operational meaning and limitation). How different the accepted-tag distribution is from the all-trial tag distribution is measured by δ X ( a , b ) . It is a lower bound on the true hidden-variable deviation δ a b : selection that acts only on untagged degrees of freedom may not be visible in δ X . To obtain an upper bound on δ a b from data, one needs an additional assumption: for example, that acceptance depends on λ only through the observed tag X (or approximately so).

6.5. Acceptance-Rate Representation on Tags (Optional)

Assume X = { 1 , , K } and ρ X ( i ) > 0 . Write
ρ X ( i ) = ρ ( X = i ) , ν a b X ( i ) = ν a b ( X = i ) .
From Definition 1,
ν a b X ( i ) = 1 Z ( a , b ) Λ 1 { X = i } ( λ ) γ ( a , b , λ ) d ρ ( λ ) .
Define the tag-level re-weighting factor
w a b X ( i ) : = ν a b X ( i ) ρ X ( i ) = E ρ [ γ ( a , b , λ ) X = i ] Z ( a , b ) .
Interpreting Pr ( Acc a , b , X = i ) : = E ρ [ γ ( a , b , λ ) X = i ] , this reads
w a b X ( i ) = Pr ( Acc a , b , X = i ) Pr ( Acc a , b ) .
Then,
δ X ( a , b ) = i = 1 K | w a b X ( i ) 1 | ρ X ( i ) = i = 1 K | ν a b X ( i ) ρ X ( i ) | .
This expresses the Lane-A audit as follows: how much does acceptance probability vary across tag bins?

6.6. Lane B: Prior-Free Dispersion Diagnostics on Accepted Tags

Lane B targets the quantities that actually control CHSH inflation: how the accepted laws vary across setting pairs. This does not require knowledge of the prior ρ . Instead, it uses tag distributions within accepted data at each setting pair.
Fix a quartet and write ν i j X for the accepted-tag law at setting pair ( a i , b j ) .
Definition 6 
(Resolved diameter and resolved dispersion). Define the resolved quartet diameter
D Q , X : = max ( i , j ) ( k , ) TV ( ν i j X , ν k X ) ,
and the resolved quartet dispersion
Δ Q , X : = inf μ X P ( X ) i , j { 0 , 1 } TV ( ν i j X , μ X ) .
Theorem 4 
(Data processing for dispersion). For every measurable tag X,
D Q , X D Q , Δ Q , X Δ Q .
Proof. 
Contractivity under X yields TV ( ν i j X , ν k X ) TV ( ν i j , ν k ) for each pair. Taking maxima gives D Q , X D Q . Similarly, for any μ P ( Λ ) one has TV ( ν i j X , μ X ) TV ( ν i j , μ ) ; hence,
inf μ X i j TV ( ν i j X , μ X ) inf μ i j TV ( ν i j , μ ) ,
which is Δ Q , X Δ Q . □
Remark 10 
(Interpretation). A large observed Δ Q , X or D Q , X is direct evidence that the accepted ensemble depends on settings at the resolution of the measured tag. Conversely, a small resolved value does not guarantee small hidden dispersion unless one has reason to believe the tag is sufficient (Section 6.7).

6.7. Tag Sufficiency: When Resolved Dispersion Becomes Exact

The inequalities Δ Q , X Δ Q and D Q , X D Q are one-sided in general. The following sufficiency condition (a standard statistical concept) makes them equalities.
Definition 7 
(Tag sufficiency for the accepted family). Let X : Λ X be measurable. We say that X is sufficient for the accepted family { ν a b } ( a , b ) if there exists a Markov kernel κ ( d λ x ) from X to Λ, such that for every setting pair ( a , b ) ,
ν a b ( d λ ) = X κ ( d λ x ) ν a b X ( d x ) .
Equivalently (on standard measurable spaces), the conditional law ν a b ( d λ X = x ) does not depend on ( a , b ) for ν a b X -almost every x.
Proposition 6 
(Exactness of TV geometry under sufficiency). Assume X is sufficient in the sense of Definition 7. Then, for any two setting pairs ( a , b ) and ( a , b ) ,
TV ( ν a b , ν a b ) = TV ( ν a b X , ν a b X ) .
Consequently, for any quartet Q,
D Q = D Q , X , Δ Q = Δ Q , X .
Proof. 
Let μ = ν a b and ν = ν a b .
  • Step 1: pushforward (coarse-graining) inequality.
By contractivity under the map X,
TV ( ν a b X , ν a b X ) TV ( ν a b , ν a b ) .
  • Step 2: reconstruction inequality using the common kernel.
By sufficiency, ν a b = κ ν a b X and ν a b = κ ν a b X for the same kernel κ . By contractivity under κ ,
TV ( ν a b , ν a b ) = TV ( κ ν a b X , κ ν a b X ) TV ( ν a b X , ν a b X ) .
Combining the two inequalities yields equality for pairwise Tvs. Taking maxima over the quartet gives D Q = D Q , X .
For dispersion, fix any μ X P ( X ) and define μ : = κ μ X . Then, for each ( i , j ) ,
TV ( ν i j , μ ) = TV ( κ ν i j X , κ μ X ) TV ( ν i j X , μ X ) ;
hence,
i j TV ( ν i j , μ ) i j TV ( ν i j X , μ X ) .
Taking the infimum over μ X yields Δ Q Δ Q , X , while Theorem 4 gives Δ Q , X Δ Q . Thus, Δ Q = Δ Q , X . □
Remark 11 
(How sufficiency is used in practice). Sufficiency means that all setting dependence of the accepted law ν a b is already visible in the tag marginal ν a b X , while the conditional distribution of the remaining (unobserved) degrees of freedom given X is setting-independent. Under this assumption, resolved quantities Δ Q , X and D Q , X can be substituted for Δ Q and D Q in Corollaries 1 and 3.

6.8. Computing Δ Q , X on Finite Tag Alphabets (Linear Programming)

When X is finite, Δ Q , X is computable by a small linear program. This makes the Lane-B dispersion audit operational on accepted-tag histograms.
Theorem 5 
(LP form of Δ on a finite space). Let X = { 1 , , K } and let η 1 , , η m P ( X ) (in our application m = 4 and η r = ν i j X ). Define
Δ ( η 1 , , η m ) : = inf μ P ( X ) r = 1 m TV ( η r , μ ) .
Then, Δ ( η 1 , , η m ) equals the optimum of the linear program
minimize 1 2 r = 1 m k = 1 K t r , k over μ k 0 , k = 1 K μ k = 1 , t r , k 0 subject to t r , k η r ( k ) μ k , t r , k μ k η r ( k ) ( r , k ) .
An optimal minimizer μ exists.
Proof. 
On a finite alphabet, TV ( η r , μ ) = 1 2 k = 1 K | η r ( k ) μ k | . Introducing epigraph variables t r , k | η r ( k ) μ k | yields (24). The feasible set is compact and the objective is continuous, so a minimizer exists. □
Remark 12 
(Lower-bound vs. upper-bound use). Because Δ Q , X Δ Q (Theorem 4), the LP value computed from tag data provides a certified lower bound on the hidden dispersion:
Δ Q Δ Q , X .
To use dispersion bounds as an exclusion tool for selection-based local explanations, one needs an upper bound on Δ Q or D Q . Tag sufficiency (Definition 7) is one route: if X is sufficient then Δ Q = Δ Q , X (Proposition 6).

6.9. Estimators and Uncertainty (Discrete Tags)

Assume X = { 1 , , K } .
  • Lane A (prior-relative) estimators.
Assume a trial definition exists and X is recorded on all trials. Let ρ X ^ be the empirical distribution of X across all trials, and let ν a b X ^ be the empirical distribution of X across accepted trials at setting pair ( a , b ) . Then, the plug-in estimate
δ X ^ ( a , b ) = 2 TV ( ν a b X ^ , ρ X ^ ) = k = 1 K | ν a b X ^ ( k ) ρ X ^ ( k ) |
is natural. Uncertainty can be quantified by nonparametric bootstrap resampling of trials within each setting cell and within the all-trials pool.
  • Lane B (prior-free) estimators.
For each setting pair in the quartet, estimate ν i j X ^ from accepted trials. Estimate pairwise TVs by
TV ^ ( ν i j X , ν k X ) : = 1 2 r = 1 K | ν i j X ^ ( r ) ν k X ^ ( r ) | .
Set D ^ Q , X as the maximum of the six pairwise TVs. Compute Δ ^ Q , X by solving the LP (24) with inputs ν 00 X ^ , ν 01 X ^ , ν 10 X ^ , ν 11 X ^ . Bootstrap within each setting cell provides uncertainty bands.

6.10. Decision Logic: What Audits Can Certify vs. What They Can Exclude

Because total variation is contractive under coarse-graining, the directly measurable tag quantities δ X ( a , b ) and Δ Q , X are, in general, lower bounds on δ a b and Δ Q . This creates an asymmetry between certification and exclusion.
  • What audits can certify without extra assumptions (lower-bound logic).
  • If δ X ( a , b ) is large then acceptance is demonstrably non-fair at the resolution of X.
  • If Δ Q , X or D Q , X is large then the accepted ensemble demonstrably differs across settings at the resolution of X.
These findings do not prove that a Bell-local model actually explains the observed CHSH violation, but they demonstrate that the pipeline contains setting-dependent selection structure strong enough (at measured resolution) to support selection-based inflation in principle.
  • What is required to exclude selection-based Bell-local explanations (upper-bound logic).
To use Corollary 1 or 3 as exclusion tools, one needs an upper bound on Δ Q or D Q . Such an upper bound typically requires at least one of the following:
  • an architecture guarantee implying setting-independent acceptance on the tested quartet (forcing Δ Q = 0 );
  • a justified sufficiency assumption that the measured tag X captures essentially all setting dependence of the accepted law (so that Δ Q Δ Q , X ); or
  • a physics-based constraint on unobserved degrees of freedom that limits unresolved selection dependence.
  • A one-sided exclusion test.
Suppose an experiment provides an external upper confidence bound D Q D ( + ) and an estimate S ^ obs with standard error σ S . Then, Corollary 3 yields the exclusion condition
S ^ obs z 1 α σ S > 2 + 6 D ( + ) exclude Bell - local MI selection models with D Q D ( + ) at confidence 1 α .
Analogously, using dispersion: S ^ obs z 1 α σ S > 2 + 2 Δ ( + ) excludes models with Δ Q Δ ( + ) .

6.11. A Publication-Ready “Selection Statement” Checklist

To make selection semantics explicit (analogous to standard discussions of MI and spacelike separation), it is useful to include a short selection statement in Bell-test publications:
S1. 
Trial definition. What counts as a trial? (Pulse, clock bin, herald, fixed time bins, etc.)
S2. 
Outcome alphabet. Are no-click events retained as outcomes (loss-inclusive analysis), or are correlations conditioned on detection/coincidence?
S3. 
Acceptance rule. List every step that discards trials or conditions the analysis (thresholds, coincidence windows, invalid flags, quality cuts).
S4. 
Acceptance rates. Report Z ( a , b ) by setting pair (with uncertainty), and note the coarse bound TV ( ν a b , ρ ) 1 Z ( a , b ) from Proposition 4.
S5. 
Tag-based fair-sampling audit (Lane A). If feasible, report at least one tag X recorded on all trials and the resolved deviations δ X ( a , b ) .
S6. 
Tag-based dispersion audit (Lane B). Report at least one accepted-tag dispersion diagnostic across settings (pairwise TVs, D Q , X , and/or Δ Q , X via LP).

6.12. Robustness Sweeps (Threshold/Window Sensitivity)

Many acceptance mechanisms enter through analysis parameters (e.g., coincidence window width, timing offsets, discriminator thresholds, filter bandwidth). A useful diagnostic is to repeat the full analysis over a modest grid of such parameters and report the stability of correlators and the CHSH value.
Let θ Θ denote a tuning parameter controlling acceptance (or a small grid of parameters). Let S ^ obs ( θ ) be the estimated CHSH value obtained under parameter θ . Define the empirical range
Range Θ ( S ) : = max θ Θ S ^ obs ( θ ) min θ Θ S ^ obs ( θ ) .
Large sensitivity indicates that acceptance details matter operationally. Small sensitivity supports robustness, but by itself does not yield an upper bound on hidden dispersion: selection could still act on degrees of freedom not varied or not observed. In this framework, sweeps are most useful for guiding tag choice and checking whether resolved quantities δ X and Δ Q , X remain stable under reasonable analysis variations.

7. Discussion

7.1. How the Selection Model Maps to Experimental Architectures

The acceptance function γ ( a , b , λ ) is intentionally general: it can represent independent local detections, coincidence-time pairing, and pipeline-level cuts. Nevertheless, different experimental architectures naturally constrain the plausible setting dependence.
  • Event-ready (heralded) experiments.
In event-ready Bell tests, trials are often defined by a herald event that is generated before the local settings are chosen (or outside their past light cones in the relevant frame). If acceptance is determined solely by that herald event and is causally prior to the setting choices then one expects γ ( a , b , λ ) = γ ( λ ) and, hence, ν a b is setting-independent. In that regime, Corollary 2 implies CHSH holds on accepted data without requiring fair sampling. Setting dependence can re-enter if additional discards occur after settings are chosen (e.g., basis-dependent invalid flags).
  • High-efficiency photonic experiments (loss-inclusive inequalities).
Many modern photonic experiments avoid conditioning on coincidence by treating no-click as an outcome and using a loss-inclusive inequality (e.g., Clauser–Horne or Eberhard-type analyses) [5,6,9,14]. In such analyses, the Bell functional is evaluated per trial, reducing the specific selection-induced CHSH inflation mechanism studied here. Nevertheless, any additional discards or conditioning steps (overflows, dead-time cuts, invalid-trial flags) reintroduce a nontrivial acceptance rule and should be disclosed and audited.
  • Continuous-wave time-tag experiments (coincidence matching).
In continuous-wave experiments without a natural trial clock, coincidence pairs are often produced by a matching rule on two time-tag streams. This is a paradigmatic joint acceptance rule and can generate setting-dependent accepted laws if the matching probability depends on settings or if settings induce timing shifts [10]. In that regime, Lane-B dispersion audits using time-residual tags are particularly natural.

7.2. Relation to Relaxed Measurement Independence

Mathematically, the accepted laws ν a b behave like a setting-dependent hidden-variable law ρ ( λ a , b ) . This connects the present bounds to the measurement-dependence literature [11,12,13], which controls the variability of ρ ( λ a , b ) (often in variational distance) to obtain relaxed Bell inequalities.
The conceptual distinction here is physical: we assume MI at emission (a fixed prior ρ ) and attribute setting dependence to conditioning on acceptance. In practice, both mechanisms can be present, and separating them experimentally requires careful design and disclosure.

7.3. Limitations and What the Framework Does (And Does Not) Deliver

The main limitation is identifiability: tag-based audits typically provide lower bounds on hidden selection structure (e.g., Δ Q , X Δ Q ). To convert an audit into an upper bound, one needs a justified sufficiency assumption or an architecture argument.
The value of the present framework is that it makes the needed information explicit: an observed CHSH violation excludes Bell-local MI models once one has controlled (or quantitatively bounded) the across-setting dispersion of accepted hidden-variable laws.

7.4. Context: “Locality” Has Inequivalent Formalizations; A Small Taxonomy of CHSH Model Classes

The main results of this paper concern a classical Bell-local, measurement-independent (MI) model class with possible setting-dependent acceptance (selection), where CHSH inflation is controlled by the dispersion of the accepted hidden-variable laws { ν a b } (Section 4, Section 5 and Section 6).
A recurring source of interpretive confusion in the Bell literature is that the word “locality” is used in multiple inequivalent technical senses. In this paper, Bell locality means the existence of local response functions A ( a , λ ) and B ( b , λ ) on a single probability space ( Λ , ρ ) (Section 2.1). This is stronger than the microcausal (operator-algebraic) locality axiom used in algebraic QFT, which requires only that Alice-side observables commute with Bob-side observables. Under microcausality, the universal CHSH upper bound is Tsirelson’s 2 2 [15,16], not 2; a self-contained derivation via the Bell-operator commutator identity is given in Appendix G.
Table 3 summarizes a minimal “no-exception-list” taxonomy: different hypothesis classes have different sharp CHSH bounds. This paper’s contribution is the second row: a sharp, quantitative bound for CHSH computed on accepted data when the accepted ensemble can vary with settings.

8. Conclusions

Bell–CHSH is a theorem about unconditional correlations: under measurement independence, Bell locality, and bounded outcomes, the CHSH value satisfies S 2 (Theorem 1). Experimental correlators are often computed on an accepted subset of trials, and acceptance can depend on settings. When the accepted hidden-variable law ν a b varies across setting pairs, the correlators entering CHSH are expectations under different measures, and CHSH can exceed 2 within Bell-local MI models by selection alone.
We quantified this effect in total variation distance. The main bound (Theorem 2) yields sharp universal inequalities:
S obs 2 + 2 Δ Q min { 4 , 2 + 6 D Q } ,
where Δ Q measures the quartet’s intrinsic distance to any single common accepted law and D Q is the quartet diameter. The constants are optimal (Proposition 5). Consequently, Tsirelson-scale values S obs = 2 2 require substantial across-setting dispersion within any Bell-local MI selection-based explanation.
Finally, we proposed a two-lane experimental audit protocol: Lane A provides prior-relative fair-sampling diagnostics using all-trial tags, while Lane B provides prior-free dispersion diagnostics using accepted-tag distributions across settings. On finite tag alphabets, the resolved dispersion statistic Δ Q , X is computable by linear programming (Theorem 5). The aim is not to dispute Bell’s theorem, but to make explicit which quantitative selection information is needed to interpret observed CHSH violations as excluding Bell-local measurement-independent models. Bell–CHSH is a theorem about unconditional/shared-ensemble expectations; in real pipelines, the key empirical question is whether the reported correlators are expectations under a single accepted law.

Funding

This research received no external funding.

Data Availability Statement

The original data presented in the study are openly available in https://github.com/sphereofrealization/Bell-CHSH-Under-Setting-Dependent-Selection, accessed on 19 January 2026.

Conflicts of Interest

The author declares no conflicts of interest.

Appendix A. Acceptance as Conditionalization and Feasibility of Bounded Acceptance

This appendix records a few optional technical points. None are needed for the main bounds.

Appendix A.1. Conditionalization and the Radon–Nikodym Derivative

Fix a setting pair ( a , b ) and abbreviate γ ( λ ) : = γ ( a , b , λ ) and Z : = Z ( a , b ) . By Definition 1, the accepted law is
ν a b ( E ) = 1 Z E γ ( λ ) d ρ ( λ ) .
In particular, ν a b ρ and the Radon–Nikodym derivative exists:
d ν a b d ρ ( λ ) = γ ( λ ) Z ( ρ - a . e . ) .
This is simply the standard “weight by acceptance and renormalize” rule.

Appendix A.2. When a Target Re-Weighting Can Come from γ∈[0,1]

Sometimes, it is convenient to specify a desired density w = d ν / d ρ and ask when it can be realized by a bounded acceptance rule.
Proposition A1 
(Feasibility under bounded acceptance). Fix a prior ρ and a measurable w : Λ [ 0 , ) with w d ρ = 1 . Define ν by d ν = w d ρ . There exists an acceptance probability γ : Λ [ 0 , 1 ] and an acceptance rate Z ( 0 , 1 ] , such that
w ( λ ) = γ ( λ ) Z ( ρ - a . e . )
if and only if w L ( ρ ) .
Moreover, when w L ( ρ ) , any choice 0 < Z 1 / w yields a valid realization γ : = Z w , and the maximal achievable acceptance rate is Z max = 1 / w .
Proof. 
If w = γ / Z with 0 γ 1 then w 1 / Z ρ -a.e.; hence, w L ( ρ ) . Conversely, if w L ( ρ ) and Z 1 / w then γ : = Z w satisfies 0 γ 1 and γ d ρ = Z . □

Appendix A.3. Other Divergences Controlling Total Variation (Optional)

Total variation is used here because it yields sharp constants and is computable on finite tag alphabets. For readers who prefer other divergences, standard inequalities relate TV to KL and χ 2 divergences. For μ ν :
TV ( μ , ν ) 1 2 D KL ( μ ν ) ( Pinsker inequality ) ,
and writing d μ = w d ν ,
TV ( μ , ν ) = 1 2 | w 1 | d ν 1 2 ( w 1 ) 2 d ν = 1 2 χ 2 ( μ ν ) .
Thus, an independent upper bound on D KL ( ν a b ρ ) or χ 2 ( ν a b ρ ) implies an upper bound on δ a b = 2 TV ( ν a b , ρ ) and, therefore, feeds into the prior-relative CHSH bound (Corollary 4).

Appendix B. Factorized Detection Implies a Cross-Ratio Constraint (Optional)

This appendix records a simple structural constraint that holds under factorized acceptance. It is not used in the main bounds (which apply to general, possibly joint, acceptance rules).
Proposition A2 
(Cross-ratio invariance under factorized detection). Fix a quartet ( a 0 , a 1 , b 0 , b 1 ) and let ν i j = ν a i b j . Assume factorized acceptance
γ ( a , b , λ ) = η A ( a , λ ) η B ( b , λ ) , η A , η B [ 0 , 1 ] ,
and assume Z ( a i , b j ) > 0 for all i , j . Define the RN densities
w i j ( λ ) : = d ν i j d ρ ( λ ) = γ ( a i , b j , λ ) Z ( a i , b j ) .
Then, on the set where all four products η A ( a i , λ ) η B ( b j , λ ) are positive (equivalently, where all four w i j ( λ ) are positive),
w 00 ( λ ) w 11 ( λ ) w 01 ( λ ) w 10 ( λ ) = Z ( a 0 , b 1 ) Z ( a 1 , b 0 ) Z ( a 0 , b 0 ) Z ( a 1 , b 1 ) ( ρ - a . e . ) .
Proof. 
Under factorization,
w i j ( λ ) = η A ( a i , λ ) η B ( b j , λ ) Z ( a i , b j ) .
Therefore, wherever all four factors are positive,
w 00 w 11 w 01 w 10 = η A ( a 0 ) η B ( b 0 ) η A ( a 1 ) η B ( b 1 ) η A ( a 0 ) η B ( b 1 ) η A ( a 1 ) η B ( b 0 ) · Z ( a 0 , b 1 ) Z ( a 1 , b 0 ) Z ( a 0 , b 0 ) Z ( a 1 , b 1 ) = 1 · Z ( a 0 , b 1 ) Z ( a 1 , b 0 ) Z ( a 0 , b 0 ) Z ( a 1 , b 1 ) .
Remark A1 
(Possible diagnostic use: cross-ratio constancy and exceptional-bin semantics). Proposition A2 says that under factorized detection the pointwise cross-ratio
R ( λ ) : = w 00 ( λ ) w 11 ( λ ) w 01 ( λ ) w 10 ( λ )
is constant in λ on the common-support set { γ 00 γ 01 γ 10 γ 11 > 0 } , with value
R ( λ ) = Z 01 Z 10 Z 00 Z 11 .
Equivalently, the λ-dependent quantity
log w 00 ( λ ) + log w 11 ( λ ) log w 01 ( λ ) log w 10 ( λ )
is constant wherever it is defined.
  • How this becomes an audit in experiments.
Of course, λ is unobserved. However, if one records a tag X on all trials and computes the tag-binned acceptance ratios (cf. Section 6.5)
w i j X ( x ) : = Pr ( Acc a i , b j , X = x ) Pr ( Acc a i , b j ) ,
then a natural resolved cross-ratio diagnostic is
R X ( x ) : = w 00 X ( x ) w 11 X ( x ) w 01 X ( x ) w 10 X ( x ) .
If the tag X is sufficiently informative that the RN densities are approximately constant within each tag bin (i.e., w i j ( λ ) is approximately X-measurable) then one expects R X ( x ) to be approximately constant in x with target value Z 01 Z 10 / ( Z 00 Z 11 ) . Large systematic deviations across bins suggest that acceptance is not well described by a factorized detection model, and it may instead involve non-factorized (joint) selection (e.g., coincidence-time matching) or unmodeled pipeline dependence.
  • Exceptional-bin issue (ratio semantics).
Like any ratio-based diagnostic, R X ( x ) is ill-conditioned when the denominator is small. In practice, bins with very small w 01 X ( x ) or w 10 X ( x ) form an exceptional locus analogous to 0 / 0 expressions in CAS algebra. Therefore, an audit should explicitly state which convention is used:
(i) 
Strict semantics: compute R X ( x ) only on bins where all four terms in (A2) are well-defined and exceed a minimum-count threshold.
(ii) 
Bracketed/generic semantics: impose a cutoff w 01 X ( x ) w 10 X ( x ) > ε (or an analogous count-based cutoff) and report results conditional on that cutoff.
(iii) 
Regularized semantics: add pseudo-counts (e.g., Laplace/Jeffreys smoothing) to avoid zeros and stabilize log R X ( x ) .
(iv) 
Resolved (division-free) semantics: avoid forming a ratio when both numerator and denominator are near zero by reporting the pair
w 00 X ( x ) w 11 X ( x ) , w 01 X ( x ) w 10 X ( x )
(or a normalized two-vector) instead of R X ( x ) itself.
The key point is that any use of the cross-ratio as a factorization diagnostic should make the treatment of the exceptional locus explicit, since that choice can dominate the behavior of the statistic in sparse bins.

Appendix C. Constructive Selection Models (Optional)

This appendix collects illustrative constructions that show what becomes possible once one allows setting-dependent acceptance. These constructions are not used in the main bounds.

Appendix C.1. Canonical Re-Weighting for a Single Correlator

Fix a setting pair ( a , b ) and, for simplicity, assume A ( a , λ ) B ( b , λ ) { ± 1 } . Write f ( λ ) : = A ( a , λ ) B ( b , λ ) . Assume there exist measurable disjoint sets U + , U Λ with ρ ( U ± ) > 0 , such that f = + 1 on U + and f = 1 on U .
Given a target correlation E tgt [ 1 , 1 ] , set α : = ( 1 + E tgt ) / 2 and define
w ( λ ) : = α ρ ( U + ) 1 U + ( λ ) + 1 α ρ ( U ) 1 U ( λ ) .
Then, w d ρ = 1 and the re-weighted law d ν = w d ρ satisfies
f d ν = α · ( + 1 ) + ( 1 α ) · ( 1 ) = E tgt .
If w L ( ρ ) then Proposition A1 implies w can be realized by a bounded acceptance probability.

Appendix C.2. Simulating an Arbitrary Finite Correlation Table by Factorized Detection

Theorem A1 
(Finite-table simulation by factorized detection). Fix finite settings { a 1 , , a m } and { b 1 , , b n } and target correlations E i j tgt [ 1 , 1 ] . There exists a probability space ( Λ , ρ ) , Bell-local deterministic outputs A ( a i , λ ) , B ( b j , λ ) { ± 1 } , and factorized detection indicators D A ( a i , λ ) , D B ( b j , λ ) { 0 , 1 } , such that:
(a) 
ρ is measurement-independent (independent of ( i , j ) );
(b) 
the acceptance event is Acc = { D A = 1 } { D B = 1 } and factorizes: γ ( a i , b j , λ ) = D A ( a i , λ ) D B ( b j , λ ) ;
(c) 
the accepted correlations satisfy E obs ( a i , b j ) = E i j tgt for all i , j .
Construction. 
Let Λ : = { 1 , , m } × { 1 , , n } × [ 0 , 1 ] with ρ the product of the uniform measure on { 1 , , m } × { 1 , , n } and Lebesgue measure on [ 0 , 1 ] . Write λ = ( u , v , t ) .
Define factorized detection indicators:
D A ( a i , λ ) : = 1 { u = i } , D B ( b j , λ ) : = 1 { v = j } .
Thus, at setting pair ( i , j ) acceptance occurs if u = i and v = j , with acceptance rate Z ( a i , b j ) = 1 / ( m n ) .
Define outcomes
A ( a i , λ ) : = + 1 , B ( b j , λ ) : = + 1 , t α u j , 1 , t > α u j , α u j : = 1 + E u j tgt 2 .
Conditioned on acceptance at ( i , j ) , one has u = i , v = j , and t Unif [ 0 , 1 ] ; hence,
E obs ( a i , b j ) = E [ B Acc , i , j ] = ( + 1 ) α i j + ( 1 ) ( 1 α i j ) = 2 α i j 1 = E i j tgt .
Remark A2 
(Interpretation). Theorem A1 does not contradict Bell–CHSH, because the accepted laws ν i j are maximally setting dependent: each setting pair selects a different hidden slice. It illustrates an identifiability limitation: without controlling acceptance semantics, accepted-sample correlations alone cannot identify nonlocality.

Appendix D. A Common Mistaken “Local Model” and the Correct Correlation

A frequently proposed Bell-local deterministic model for polarization-like settings is
A ( θ a , λ ) = sgn ( cos ( θ a λ ) ) , B ( θ b , λ ) = sgn ( cos ( θ b λ ) ) ,
with λ Unif [ 0 , 2 π ) . It is sometimes incorrectly claimed that this yields the quantum correlation cos ( θ a θ b ) . It does not.
Let Δ : = θ a θ b and wrap Δ into [ π , π ] . Then A ( θ a , λ ) and B ( θ b , λ ) differ in sign precisely on an interval of λ of length 2 | Δ | (modulo 2 π ). A direct geometric argument gives
E ( Δ ) : = E [ A ( θ a , λ ) B ( θ b , λ ) ] = 1 2 | Δ | π , | Δ | π ,
extended periodically with period 2 π . This is a triangular wave, not a cosine, and it cannot yield CHSH values exceeding 2 under unconditional sampling. Obtaining the cosine correlation locally would require relaxing MI, relaxing locality, or introducing setting-dependent acceptance/conditioning.

Appendix E. Pipeline Semantics (Optional)

It is often helpful to distinguish three layers in a Bell-test data pipeline:
(i)
Trial definition: what physical attempts count as trials (pump pulses, clock bins, herald events, fixed time bins, etc.).
(ii)
Outcome assignment: how raw records are mapped to an outcome alphabet (including, if desired, an explicit no-click outcome).
(iii)
Acceptance/conditioning: which trials are discarded or which statistics are conditioned on an event Acc (coincidences, quality cuts, invalid flags).
The selection calculus in this paper concerns (iii): the acceptance event Acc and its dependence on settings and hidden variables. When (iii) is absent (e.g., loss-inclusive analyses with no post-hoc discards), the specific selection-induced inflation mechanism studied here is reduced.

Appendix F. Phenomenological Velocity Analogy (Non-Essential)

This appendix is an optional analogy and is not used in any theorem or audit protocol. It provides one broad physical motif by which setting-dependent acceptance can re-weight hidden degrees of freedom without obviously changing coarse observed kinematics.

Appendix F.1. Fibered Non-Injectivity as a Selection Mechanism

Suppose the hidden state can be decomposed as
Λ V × Φ ,
where V denotes a phenomenological (externally monitored) kinematic parameter such as a time-residual coordinate, pulse-energy monitor, or spectral bin, and Φ denotes additional hidden structure. Let
L : V × Φ M
be an operational summary map used in data handling (e.g., a discretized time-difference residual, a quality score, or an inferred kinematic label). If L is non-injective on fibers then many distinct ( v , ϕ ) share the same observed label = L ( v , ϕ ) .
Within the fiber over a fixed observed label , the hidden coordinate ϕ can carry internal sign structure for A ( a , λ ) B ( b , λ ) . A setting-dependent acceptance rule can then re-weight the mixture over ϕ  within the same observed kinematic cell, altering accepted correlations while leaving the coarse observed label unchanged.
Figure A1 sketches this fibered non-injectivity motif.
Figure A1. Optional PV analogy: within a fixed observed label , hidden substructure can be re-weighted by selection without changing the coarse observed label.
Figure A1. Optional PV analogy: within a fixed observed label , hidden substructure can be re-weighted by selection without changing the coarse observed label.
Quantumrep 08 00008 g0a1

Appendix F.2. Connection to Tag Choice

In the language of Section 6, the monitored coordinate(s) V motivate candidate tag variables X: if selection depends on hidden variables primarily through coarse kinematic labels (arrival-time residuals, pulse energy bins, etc.) then refining those tags may increase the resolved quantities Δ Q , X and δ X , revealing setting-dependent selection structure.

Appendix G. Tsirelson’s Bound from Commutators (Microcausal Noncommutative Contrast)

Remark A3 
(Purpose and scope). The main text studies classical Bell-local, measurement-independent models in which the reported correlators are computed on an accepted subset of trials; if the accepted hidden-variable law ν a b depends on settings, CHSH can be inflated above 2 by selection alone, and the inflation is controlled by the dispersion of { ν a b } (Section 4, Section 5 and Section 6).
This appendix records a logically independent and widely used contrast: in a noncommutative (operator-algebraic) formulation with microcausal locality (commuting Alice/Bob observable algebras), the correct universal CHSH bound is Tsirelson’s 2 2 , not 2. This provides a clean way to separate two statements that are sometimes conflated in informal discussions:
  • CHSH 2 is a theorem for classical (commutative) shared-ensemble models (Theorem 1);
  • microcausal “no-signalling by commutation” locality does not force CHSH 2 , but it does force CHSH 2 2 .
No theorem in the main text depends on this appendix.

Appendix G.1. C*-Probability Spaces and Microcausal Locality

Definition A1 
(C*-probability space). A C*-probability space is a pair ( A , ω ) , where A is a unital C*-algebra and ω : A C is a state (linear, positive, and normalized: ω ( 1 A ) = 1 ).
Definition A2 
(Microcausal bipartite structure). Let ( A , ω ) be a C*-probability space. A microcausal bipartite structure is a pair of commuting unital C*-subalgebras A A , A B A , such that
[ X , Y ] = 0 ( X A A , Y A B ) ,
where [ X , Y ] : = X Y Y X is the commutator.
Remark A4 
(Locality across wings vs. within a wing). Microcausality constrains only cross-wing commutation: Alice-side observables commute with Bob-side observables. It does not require that Alice’s alternative observables commute with each other, nor that Bob’s do. This is exactly where Tsirelson’s bound differs from the classical CHSH bound 2.

Appendix G.2. CHSH Bell Operator and Correlators

Definition A3 
(CHSH quartet in a microcausal C* model). Fix self-adjoint unitaries (“ ± 1 observables”),
A 0 , A 1 A A , B 0 , B 1 A B , A i * = A i , B j * = B j , A i 2 = B j 2 = 1 A .
Define correlators,
E i j : = ω ( A i B j ) , i , j { 0 , 1 } ,
and the corresponding CHSH value,
S ω : = | E 00 + E 01 + E 10 E 11 | .
Definition A4 
(CHSH Bell operator). Define the Bell operator,
B : = A 0 ( B 0 + B 1 ) + A 1 ( B 0 B 1 ) A .
Then, S ω = | ω ( B ) | .

Appendix G.3. Bell-Operator Square Identity and Tsirelson Bound

Theorem A2 
(Bell-operator square identity). Assume the microcausal bipartite structure above, so, in particular, [ A i , B j ] = 0 for all i , j . Then, the Bell operator B satisfies
B 2 = 4 1 A [ A 0 , A 1 ] [ B 0 , B 1 ] .
Proof. 
Because A i commutes with each B j , products can be rearranged across wings. Expand (A3):
B 2 = A 0 2 ( B 0 + B 1 ) 2 + A 1 2 ( B 0 B 1 ) 2 + A 0 A 1 ( B 0 + B 1 ) ( B 0 B 1 ) + A 1 A 0 ( B 0 B 1 ) ( B 0 + B 1 ) .
Use A 0 2 = A 1 2 = 1 A to get
B 2 = ( B 0 + B 1 ) 2 + ( B 0 B 1 ) 2 + A 0 A 1 ( B 0 + B 1 ) ( B 0 B 1 ) + A 1 A 0 ( B 0 B 1 ) ( B 0 + B 1 ) .
Now ( B 0 + B 1 ) 2 + ( B 0 B 1 ) 2 = 4 1 A (since B 0 2 = B 1 2 = 1 A ). Also,
( B 0 + B 1 ) ( B 0 B 1 ) = [ B 0 , B 1 ] , ( B 0 B 1 ) ( B 0 + B 1 ) = [ B 0 , B 1 ] .
Therefore, the cross terms equal
A 0 A 1 ( [ B 0 , B 1 ] ) + A 1 A 0 ( [ B 0 , B 1 ] ) = ( A 1 A 0 A 0 A 1 ) [ B 0 , B 1 ] = [ A 0 , A 1 ] [ B 0 , B 1 ] ,
which gives (A4). □
Corollary A1 
(Tsirelson bound; commutative (flat) subcase). In the setting above, one has
S ω = | ω ( B ) | B 2 2 .
Moreover, if [ A 0 , A 1 ] = 0 or [ B 0 , B 1 ] = 0 then B 2 = 4 1 A , and, hence, S ω 2 .
Proof. 
For any state ω and any X A , | ω ( X ) | X . Since B is self-adjoint, B 2 = B 2 . From Theorem A2,
B 2 4 1 A + [ A 0 , A 1 ] [ B 0 , B 1 ] 4 + [ A 0 , A 1 ] [ B 0 , B 1 ] .
For X , Y     1 , one has [ X , Y ]     X Y + Y X     2 X Y ; hence, [ A 0 , A 1 ]     2 and [ B 0 , B 1 ]     2 . Therefore, B 2     4 + 4 = 8 , so B 2 2 and S ω 2 2 .
If [ A 0 , A 1 ] = 0 or [ B 0 , B 1 ] = 0 then (A4) gives B 2 = 4 1 A , so B   =   2 and, thus, S ω     2 . □

Appendix G.4. The Commutative Sector: CHSH ≤ 2 as a Commutative-Subalgebra Bound

Definition A5 
(Commutative sector for a CHSH quartet). In the microcausal C* setting of Appendix G, fix A 0 , A 1 A A and B 0 , B 1 A B as ± 1 observables. We say the quartet lies in the commutative sector if the C*-subalgebra
C : = C * ( A 0 , A 1 , B 0 , B 1 ) A
is commutative. Under microcausality [ A i , B j ] = 0 , this is equivalent to requiring [ A 0 , A 1 ] = 0 and [ B 0 , B 1 ] = 0 .
Proposition A3 
(CHSH 2 in the commutative sector). Let ( A , ω ; A A , A B ) be a microcausal C* scenario as above. If the CHSH quartet lies in the commutative sector (Definition A5) then
S ω = | ω ( A 0 B 0 ) + ω ( A 0 B 1 ) + ω ( A 1 B 0 ) ω ( A 1 B 1 ) | 2 .
Proof. 
Let C : = C * ( A 0 , A 1 , B 0 , B 1 ) , which is commutative by assumption. By the Gelfand representation theorem, there exists a compact Hausdorff space Ω , such that C C ( Ω ) as C*-algebras. By the Riesz representation theorem, the restricted state ω | C corresponds to a probability measure P on Ω , such that ω ( f ) = Ω f d P for all f C ( Ω ) .
Under this identification, A 0 , A 1 , B 0 , B 1 become (bounded) real-valued functions on Ω with values in { ± 1 } , so pointwise on Ω one has
| A 0 B 0 + A 0 B 1 + A 1 B 0 A 1 B 1 | 2 .
Integrating against P and using | g d P | | g | d P yields S ω 2 . □
Remark A5 
(How this classifies Bell–CHSH among broader local frameworks). Proposition A3 formalizes the slogan that the classical CHSH bound 2 is a commutative-sector bound: it applies whenever the relevant observables are jointly representable inside a single commutative algebra (equivalently, a single classical probability space).
By contrast, Appendix G shows that microcausal locality (cross-wing commutation) by itself does not force commutativity within each wing, and in that larger noncommutative class the sharp universal bound is Tsirelson’s 2 2 .
Finally, the main text of this paper treats a different enlargement of the classical CHSH-2 hypothesis class: it keeps the underlying model classical (commutative), but it allows the effective state/ensemble used for each correlator to depend on settings through acceptance (different ν a b ). In that case, CHSH inflation is controlled not by commutators but by the dispersion of the accepted laws (Corollaries 1 and 3).
Remark A6 
(Contrast with selection-based inflation in the main text). Corollary A1 shows a non-selection route to S > 2 : noncommutativity within a wing (non-zero commutators) enlarges the universal bound from 2 to 2 2 even while preserving microcausal cross-wing locality.
The main text instead studies a classical route to S obs > 2 , in which the classical bound 2 continues to hold for expectations taken under any single measure , but the CHSH expression is formed from correlators evaluated under different accepted laws ν a b , due to setting-dependent acceptance. These are logically independent mechanisms.
Remark A7 
(Future Work: Synopsis of a companion paper (PV, Bell rungs, and the cos derivation)). A companion paper will treat three logically distinct (but connected) topics that are only staged here [17].
(1) Rung-0 (Bell/CHSH) no-go and PV radical audit. We will begin by stating and using the unconditional single-ensemble Bell–CHSH bound: any Kolmogorov (classical) hidden-variable model with measurement independence, Bell locality, and bounded outcomes satisfies S 2 on every CHSH quartet. As an immediate consequence, the singlet target E tgt ( a , b ) = cos ( a b ) —which achieves S = 2 2 on a standard Tsirelson quartet—cannot be realized as unconditional expectations E ( a , b ) = A ( a , λ ) B ( b , λ ) d ρ ( λ ) in that model class. Building on this, we will expand the PV radical-equation audit (S0 vs. S1/S3/S4) into a complete semantics catalogue of the exceptional locus and branch structure, including a precise characterization of when CAS outputs such as v = ± c 2 D / D arise from localization/regularization rather than as strict consequences of the original radical equation on its solution locus.
(2) Rung-2 (microcausal/noncommutative) PV model with a step-by-step cos solution. In the same paper we will present, in full detail, the PV-indexed microcausal construction in which PV (v together with κ ( v ) = 1 v 2 / c 2 ) is promoted to a canonical unitary U ( v ) S U ( 2 ) . This unitary is then used to define PV-parameterized local observables by conjugation inside a bipartite operator algebra with commuting Alice/Bob subalgebras (microcausality). We will give the derivation line-by-line: PV θ ( v ) U ( v ) ⇒ rotated local Pauli observables ⇒ the singlet identity ω ( σ · u ) ( σ · w ) = u · w   E ( a , b ) = cos ( a b ) , together with explicit verification of cross-wing commutation and unconditional (single-state) semantics.
(3) Rung comparison and physical interpretation (what is “hidden” and what is “measured”). Finally, the companion paper will formalize the rung dictionary and its operational meaning: when PV is treated in a classical (commutative) hidden-variable framework, the unconditional Bell–CHSH bound prevents any Tsirelson-scale cos reproduction; when PV is instead promoted to a noncommutative parameter that indexes the local observable representation in a microcausal algebra, Tsirelson-scale cos follows unconditionally in the singlet state. We will also clarify which aspects of PV are operationally identifiable (or provably unidentifiable) from two-point Bell statistics alone, and how the di-cone seam U ( 2 ) self-adjoint families provide a geometric realization of the same “PV → unitary parameter” mechanism.

References

  1. Bell, J.S. On the Einstein Podolsky Rosen Paradox. Phys. Phys. Fiz. 1964, 1, 195–200. [Google Scholar] [CrossRef]
  2. Clauser, J.F.; Horne, M.A.; Shimony, A.; Holt, R.A. Proposed experiment to test local hidden-variable theories. Phys. Rev. Lett. 1969, 23, 880–884. [Google Scholar] [CrossRef]
  3. Brunner, N.; Cavalcanti, D.; Pironio, S.; Scarani, V.; Wehner, S. Bell nonlocality. Rev. Mod. Phys. 2014, 86, 419–478. [Google Scholar] [CrossRef]
  4. Hensen, B.; Bernien, H.; Dréau, A.E.; Reiserer, A.; Kalb, N.; Blok, M.S.; Ruitenberg, J.; Vermeulen, R.F.L.; Schouten, R.N.; Abellán, C.; et al. Loophole-free Bell inequality violation using electron spins separated by 1.3 kilometres. Nature 2015, 526, 682–686. [Google Scholar] [CrossRef]
  5. Giustina, M.; Versteegh, M.; Wengerowsky, S.; Handsteiner, J.; Hochrainer, A.; Phelan, A.; Steinlechner, F.; Kofler, J.; Larsson, J.-Å.; Abellán, C.; et al. Significant-Loophole-Free Test of Bell’s Theorem with Entangled Photons. Phys. Rev. Lett. 2015, 115, 250401. [Google Scholar] [PubMed]
  6. Shalm, L.K.; Meyer-Scott, E.; Christensen, B.G.; Bierhorst, P.; Wayne, M.A.; Stevens, M.J.; Gerrits, T.; Glancy, S.; Hamel, D.R.; Allman, M.S.; et al. Strong Loophole-Free Test of Local Realism. Phys. Rev. Lett. 2015, 115, 250402. [Google Scholar] [CrossRef] [PubMed]
  7. Pearle, P.M. Hidden-variable example based upon data rejection. Phys. Rev. D 1970, 2, 1418–1425. [Google Scholar] [CrossRef]
  8. Garg, A.; Mermin, N.D. Detector inefficiencies in the Einstein-Podolsky-Rosen experiment. Phys. Rev. D 1987, 35, 3831–3835. [Google Scholar] [CrossRef] [PubMed]
  9. Eberhard, P.H. Background level and counter efficiencies required for a loophole-free Einstein–Podolsky–Rosen experiment. Phys. Rev. A 1993, 47, R747–R750. [Google Scholar] [CrossRef] [PubMed]
  10. Larsson, J.-Å.; Gill, R.D. Bell’s inequality and the coincidence-time loophole. Europhys. Lett. 2004, 67, 707–713. [Google Scholar] [CrossRef]
  11. Hall, M.J.W. Local deterministic model of singlet state correlations based on relaxing measurement independence. Phys. Rev. Lett. 2010, 105, 250404. [Google Scholar] [CrossRef]
  12. Barrett, J.; Gisin, N. How much measurement independence is needed to demonstrate nonlocality? Phys. Rev. Lett. 2011, 106, 100406. [Google Scholar] [CrossRef] [PubMed]
  13. Pütz, G.; Rosset, D.; Barnea, T.J.; Liang, Y.-C.; Gisin, N. Arbitrarily small amount of measurement independence is sufficient to manifest nonlocality. Phys. Rev. Lett. 2014, 113, 190402. [Google Scholar] [CrossRef] [PubMed]
  14. Clauser, J.F.; Horne, M.A. Experimental consequences of objective local theories. Phys. Rev. D 1974, 10, 526–535. [Google Scholar] [CrossRef]
  15. Tsirelson, B.S. Quantum generalizations of Bell’s inequality. Lett. Math. Phys. 1980, 4, 93–100. [Google Scholar] [CrossRef]
  16. Summers, S.J.; Werner, R. Bell’s inequalities and quantum field theory. I. General setting. J. Math. Phys. 1987, 28, 2440–2447. [Google Scholar] [CrossRef]
  17. Emmerson, P. Nearest–Neighbour Cohomology meets Daisy Self–Similarity: A Unified Operator–Homotopy Framework. Authorea 2026. [Google Scholar] [CrossRef]
Figure 1. Acceptance changes the hidden-variable law from ρ to ν a b , possibly in a setting-dependent way. Lane A audits prior-relative bias; Lane B audits across-setting dispersion.
Figure 1. Acceptance changes the hidden-variable law from ρ to ν a b , possibly in a setting-dependent way. Lane A audits prior-relative bias; Lane B audits across-setting dispersion.
Quantumrep 08 00008 g001
Table 1. Core notation.
Table 1. Core notation.
SymbolMeaning
ρ MI prior law of λ on Λ .
γ ( a , b , λ ) Acceptance probability (selection rule).
Z ( a , b ) Acceptance rate Z ( a , b ) = γ ( a , b , λ ) d ρ ( λ ) .
ν a b Accepted law (weighted and renormalized): (2).
E obs ( a , b ) Accepted-sample correlator: (3).
E full ( a , b ) Unconditional correlator: (4).
S obs CHSH value | E 00 + E 01 + E 10 E 11 | with E i j = E obs ( a i , b j ) .
TV ( μ , ν ) Total variation distance: Definition 2.
Table 2. Summary of which TV quantities answer which inferential questions.
Table 2. Summary of which TV quantities answer which inferential questions.
QuantityDefinitionWhat It ControlsTypical Data Needed
δ a b 2 TV ( ν a b , ρ ) Bias from unconditional: | E obs E full | All-trial tags or a prior model
Δ Q inf μ q Q TV ( ν q , μ ) CHSH inflation: S obs 2 + 2 Δ Q Needs an upper bound (assumptions/architecture)
D Q max q q TV ( ν q , ν q ) Explicit bound: S obs min { 4 , 2 + 6 D Q } Needs an upper bound (assumptions/architecture)
Table 3. A minimal taxonomy of CHSH hypothesis classes and their sharp bounds.
Table 3. A minimal taxonomy of CHSH hypothesis classes and their sharp bounds.
Model ClassStructural Features (What Ties the Four Correlators Together)Sharp CHSH Bound
Classical Bell-local MI (unconditional)One prior ρ (MI) and local response functions A ( a , λ ) , B ( b , λ ) ; correlators are unconditional expectations under the same ρ S 2 (Theorem 1)
Classical Bell-local MI + setting-dependent acceptanceSame ρ and local A , B , but correlators are evaluated under setting-dependent accepted laws ν a b (different measures across settings) S obs 2 + 2 Δ Q min { 4 , 2 + 6 D Q } (Corollaries 1,3)
Microcausal noncommutative (operator-algebraic) scenarioSingle state ω on a (generally noncommutative) algebra; commuting Alice/Bob subalgebras (microcausality) but not necessarily commutative within each wing; no selection S ω 2 2 (Appendix G)
No-signalling extremalOnly operational no-signalling constraints (e.g., PR-box class) S 4
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Emmerson, P. Bell–CHSH Under Setting-Dependent Selection: Sharp Total-Variation Bounds and an Experimental Audit Protocol. Quantum Rep. 2026, 8, 8. https://doi.org/10.3390/quantum8010008

AMA Style

Emmerson P. Bell–CHSH Under Setting-Dependent Selection: Sharp Total-Variation Bounds and an Experimental Audit Protocol. Quantum Reports. 2026; 8(1):8. https://doi.org/10.3390/quantum8010008

Chicago/Turabian Style

Emmerson (Yaohushuason), Parker. 2026. "Bell–CHSH Under Setting-Dependent Selection: Sharp Total-Variation Bounds and an Experimental Audit Protocol" Quantum Reports 8, no. 1: 8. https://doi.org/10.3390/quantum8010008

APA Style

Emmerson, P. (2026). Bell–CHSH Under Setting-Dependent Selection: Sharp Total-Variation Bounds and an Experimental Audit Protocol. Quantum Reports, 8(1), 8. https://doi.org/10.3390/quantum8010008

Article Metrics

Back to TopTop