Next Article in Journal
Chebfun in Numerical Analytic Continuation of Solutions to Second Order BVPs on Unbounded Domains
Previous Article in Journal
A Novel Method That Is Based on Differential Evolution Suitable for Large-Scale Optimization Problems
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Complexity Assessments for Decidable Fragments of Set Theory. IV: A Quadratic Reduction from Constraints over Nested Sets to Boolean Formulae

by
Domenico Cantone
1,*,
Andrea De Domenico
2,
Pietro Maugeri
1,* and
Eugenio G. Omodeo
3
1
Department of Mathematics and Computer Science, University of Catania, 95125 Catania, Italy
2
IMDEA Software Institute, 28223 Madrid, Spain; andrea.domenico@imdea.org
3
Department of Mathematics, Informatics, and Geosciences, University of Trieste, 34127 Trieste, Italy; eomodeo@units.it
*
Authors to whom correspondence should be addressed.
Foundations 2026, 6(1), 3; https://doi.org/10.3390/foundations6010003
Submission received: 23 September 2025 / Revised: 8 January 2026 / Accepted: 16 January 2026 / Published: 30 January 2026
(This article belongs to the Section Mathematical Sciences)

Abstract

As a contribution to automated set-theoretic inferencing, a translation is proposed of conjunctions of literals of the forms x = y z , x y z , and z = x , where x , y , z stand for variables ranging over the von Neumann universe of sets, into quantifier-free Boolean formulae of a rather simple conjunctive normal form. The formulae in the target language involve variables ranging over a Boolean ring of sets, along with a difference operator and relators designating equality, non-disjointness, and inclusion. Moreover, the result of each translation is a conjunction of literals of the forms x = y z and x y z and of implications whose antecedents are isolated literals and whose consequents are either inclusions (strict or non-strict) between variables, or equalities between variables. Besides reflecting a simple and natural semantics, which ensures satisfiability preservation, the proposed translation has quadratic algorithmic time complexity and bridges two languages, both of which are known to have an NP -complete satisfiability problem.

1. Introduction

As reported in the survey paper [1], the line of investigation to which this paper belongs began in 1978 (ref. [2]), driven by the goal of developing an automated system for proof-checking and program correctness verification. Since then, identifying decidable fragments of set theory has been regarded as a key step toward achieving that goal. As a matter of fact, in the organization of the proof-checker Referee/ÆtnaNova , which came into existence some thirty years later (ref. [3]), a satisfiability decision tester for a specific class of unquantified set-theoretic formulae plays a central and pervasive role.
When it comes to implementations, complexity emerges as an inescapable issue; this is why we recently undertook a systematic study on the algorithmic complexities of satisfiability testers (see, e.g., [4,5]). In the same frame of mind, this paper enhances a quadratic-cost method (announced ref. [4] and then presented in D. Cantone, A. De Domenico, P. Maugeri, and E.G. Omodeo; a quadratic reduction of constraints over nested sets to purely Boolean formulae in CNF; in Proc. 35th Italian Conference on Computational Logic, volume 2710 of CEUR Workshop Proceedings, pages 214–230. 2020). This translates the formulae of an unquantified language involving set-theoretic variables, Boolean operators, and membership and equality relators into propositional combinations of purely Boolean literals.
The enhancement lies in the availability of a singleton-formation operator ‘{■}’, which increases the expressive power of the source language without affecting the algorithmic time complexity of the translation process; however, the fact that the translation preserves satisfiability as well as unsatisfiability must be proved anew and differently. The material that follows bridges two complexity taxonomies, developed and analyzed in the respective papers [4,5]. These taxonomies, one of which—unlike the other—involves membership, concern fragments of set theory for which satisfiability is decidable.
A fit theoretical framework for the study of our target language is the theory of Boolean rings, a merely equational first-order theory endowed with finitely many axioms—at times, one blends this theory with an arithmetic of cardinals (see, e.g., [6])—or just a surrogate of it, cf. Figure 1 in Section 7. Frameworks for the study of the source language are such all-embracing theories as ZF and NBG (the Zermelo–Fraenkel and von Neumann–Bernays–Gödel theories), within which one can cast the whole corpus of mathematical disciplines. Boolean algebra is decidable in its entirety (cf. Section 3.7 ref. [7]); ZF is essentially undecidable, yet nonetheless an effort to find practical decision algorithms for fragments of it began in 1979. The rationale of this long-standing research is that satisfiability testers embodying some knowledge about ZF can act as key inference mechanisms within a programmed system apt to verifying the correctness of large-scale mathematical proofs as envisaged ref. [3].
A priori, one would expect the distance between the performances of decision algorithms for fragments of Boolean algebra, and of the seemingly much more expressive languages whose dictionaries embody nested membership, to be abysmal. Luckily, though, as we will see, this is not the case.
We introduce in Section 2 an interpreted formal language, dubbed BST , within which one can formulate unquantified Boolean constraints. Despite its syntax being quite minimal— BST only encompasses conjunctions of primitive literals of two forms, namely, x = y z and x y z —, the satisfiability problem for BST is NP -complete (ref. [4]). By way of abbreviations, a number of additional constraints (e.g., literals of the form x y z ) can be expressed in BST .
Alongside BST , we also consider its natural Boolean closure BST + , defined as the language of all propositional combinations—using ∧, ∨, ⟶, ⟷, and ¬ with arbitrary nesting—of atoms of the form x = y z . It is straightforward to verify that the satisfiability problem for BST + can be reduced to that for BST in nondeterministic polynomial time; therefore, it is NP -complete in turn.
According to our semantics, the domain of discourse to which BST (and BST + ) refers is a universe of nested sets; however, as will be seen in Section 5, every satisfiable propositional combination of BST literals (in particular, every satisfiable BST constraint) admits a model consisting of sets which are, in a certain sense, ‘flat’. This makes it clear that there is no straightforward way of expressing membership atoms x y in BST . Yet, many natural set-theoretic constructors (most notably the singleton operator {■}) inherently interact with membership, and understanding whether and how such constructs can be eliminated while preserving satisfiability and complexity is the main motivation of this paper.
To address this research gap, Section 3 introduces a complexity-aware notion of expressibility, which requires not only a semantic reduction but also an explicit bound on the cost of the translation. While related notions already appear ref. [4], the present formulation is tailored to our setting and is a valuable technical tool that enables the complexity-preservation arguments developed here.
In terms of this notion, Section 6 provides an explicit translation of conjunctions of literals of the three forms x = y z , x y z , and  z = x into propositional combinations of BST literals, i.e., into BST + . As shown in Section 6.3 and Section 6.4, the translation preserves satisfiability. It yields a conjunction whose conjuncts are either BST literals or simple disjunctions. Moreover, the translation can be computed in quadratic time (see Section 6.1), which entails that satisfiability remains NP -complete when the singleton operator {■} is added to the constructs of BST . This NP -completeness result was already known (see, e.g., [8]), but our approach derives it anew via an explicit, complexity-controlled elimination procedure.
The concluding section, Section 7, recapitulates what has been accomplished, indicates what we intend to pursue next, and situates our line of investigation in relation to that of other scholars.

2. The Theories BST and BST +

Boolean set theory ( BST ) is the quantifier-free theory consisting of all finite conjunctions of literals of the forms
x = y z , x y z ,
where x, y, and z are set variables, ranging over the universe of well-founded sets.
Remark 1. 
Throughout, we use the term theory in a nonstandard sense, referring to a collection of set-theoretic formulae—possibly involving, along with primitive symbols, some derived constructs—rather than a deductively closed collection of sentences.
Semantics for the theory BST is defined in terms of set assignments. Specifically, given a (finite) collection V of set variables, a set assignment M over V—the domain of definition of M, denoted by dom ( M ) —is any map from V into the von Neumann universe V (see below). (Note that our semantics of BST does not rely on flat sets of urelements (as would be doable). Working with those would call for minor adjustments, unjustified—and perhaps disturbing—in the economy of this paper.) The support of M, denoted by supp ( M ) , is the union of the sets assigned by M to the variables in its domain, namely,
supp ( M ) := x dom ( M ) M x .
We shall refer to the cardinality of the support of M as the cardinality of the assignment M. A set assignment M is said to satisfy, or to model, a given literal x = y z , with  x , y , z dom ( M ) , if  M x = M y M z holds, where ‘∖’ denotes standard set difference. Likewise, M is said to satisfy the literal x y z if M x M y M z holds. Finally, M satisfies a BST -conjunction φ such that Vars ( φ ) dom ( M ) (where Vars ( φ ) denotes the collection of the variables occurring free in φ ) if it satisfies all of the conjuncts of φ , in which case we say that M is a model of φ and write
M φ
(a short for M V φ ). A  BST -conjunction φ is said to be satisfiable if it has some model; otherwise, it is said to be unsatisfiable. It is said to be valid if it is true in every model, in which case we write
φ
(again, a short for V φ ). Two BST -conjunctions φ and ψ are said to be equisatisfiable if either both are satisfiable or both are unsatisfiable—that is, if  φ has a model if and only if ψ has a model. Equisatisfiability does not require the two formulae to have the same models, but only that they agree in satisfiability, irrespective of their models. The notions introduced in this paragraph, here tailored to the case of BST -conjunctions, extend naturally to all theories discussed in the sequel.
In ref. [4], it is proved that the satisfiability problem for BST , namely, the problem of establishing algorithmically the satisfiability status of any given BST -conjunction, is NP -complete.
  • The extension  BST + .
Recall that BST only allows for conjunctions of its primitive literals—in particular, a  BST -formula is a finite conjunction of constraints of the form x = y z and x y z . In contrast, BST + is the Boolean closure of BST : it consists of all propositional combinations of atoms of the form x = y z , obtained by unrestrained use of the logical connectives ∧, ∨, ⟶, ⟷, and ¬. Thus, BST + strictly extends BST by allowing for arbitrary Boolean structure (e.g., disjunctions, implications, and negations at any depth), whereas BST is restricted to conjunctive constraints.
The satisfiability problem for BST + reduces to that for BST in nondeterministic polynomial time (by guessing a satisfying assignment for the Boolean structure and checking the resulting conjunction), and hence the satisfiability problem for BST + is NP -complete as well.

The von Neumann Universe

We recall that the von Neumann universe V of (well-founded) sets, also dubbed von Neumann cumulative hierarchy, is built up through a transfinite sequence of steps as the union V : = α On V α of the levels V α : = β < α P ( V β ) , with P (■) denoting the powerset operator and α ranging over the class On of all ordinals.
It can easily be seen that, for every ordinal α (and, in particular, for every integer), we have
V α + 1 = P ( V α ) ,
and consequently
V α + 1 = 2 V α ,
so that V α < V α + 1 .
Based on the level of first appearance in the von Neumann hierarchy, one can define the rank of any set s, denoted rk s . Specifically, rk s is the ordinal α such that s V α + 1 V α . Hence, for every α On , the set V α + 1 V α , hereinafter denoted W α # , collects all sets having rank α .
The following lower bound on the number of well-founded sets of any positive integer rank n, to be proved as Proposition A2 in Appendix A, will be useful:
W n # 2 n 1 .
A particularly important subclass of V is that of hereditarily finite sets, denoted by  HF . By definition,
HF : = n N V n ,
that is, HF consists of all sets of finite rank.
Some handy properties of the rank function that we shall tacitly use are the following, which hold for all sets s , t V :
  • If s t , then rk s < rk t ;
  • If s t , then rk s rk t ;
  • rk s = 0 if s = , sup u s rk u + 1 ( i.e.,   u s rk u + 1 ) otherwise .
We also recall that well-foundedness, as enforced by the regularity (or foundation) axiom of set theory, precludes the formation of infinite descending membership chains of the form
s 2 s 1 s 0 ,
and, in particular, membership cycles of the form
s 0 s n s 2 s 1 s 0 ,
for any sets s i .

3. From Existential Expressibility to O ( f ) -Expressibility Across Theories

We first recall the definition of existential expressibility (cf. [4] for several applications of this notion).
Definition 1 
(Existential expressibility). A formula ψ ( x ) is said to be existentially expressible in a theory T if there exists a T -formula Ψ ( x , z ) such that
Ψ ( x ) ( z ) Ψ ( x , z ) ,
where x and z stand for tuples of set variables.
In spite of the parsimony of BST as just presented, it turns out (see [4]) that several other Boolean constructs, such as the ones in the list of literals
x = , x y , x = y z , x = y z , ¬ D ISJ ( x , y ) , x y , x , x y , x y z , x y z , ¬ D ISJ ( x , y ) ,
can be expressed existentially in BST , where D ISJ ( a , b ) is a short for a b = .
Ref. [4], existential expressibility was generalized into O ( f ) -expressibility (we use standard asymptotic notations O ( · ) , Ω ( · ) , and Θ ( · ) throughout the paper; see, e.g., Ch. 3 of [9]), a notion that helped develop a fine-grained complexity taxonomy of the subfragments of BST .
Definition 2 
( O ( f ) -expressibility). Let T be a theory and let f : N N be a complexity function. A formula ψ ( x ) —typically involving a construct one aims to eliminate—is said to be  O ( f ) -expressible in T  if there exists a transformation
φ ( y ) Ψ φ ( x , y , z )
from T to T , where no variable in z occurs in x or y , such that, for every φ, the following conditions hold:
(a)
The transformation (5) can be computed in O ( f ( φ ) ) -time;
(b)
If φ ( y ) ( z ) Ψ φ ( x , y , z ) is satisfiable, then φ ( y ) ψ ( x ) is satisfiable;
(c)
φ ( y ) ψ ( x ) ( z ) Ψ φ ( x , y , z ) .
Here, we further generalize O ( f ) -expressibility in two directions. First, we allow for a collection  C of set-theoretic formulae, rather than a single formula ψ as in [4]. Second, we explicitly distinguish a source theory T 1 from a target theory T 2 , whereas [4] works in the single-theory setting, with source and target taken to coincide.
Definition 3 
( O ( f ) -expressibility across theories). Let T 1 and T 2 be any theories and f : N N be a given complexity function. A collection C of formulae is said to be  O ( f ) -expressible from T 1 into T 2  if there exists a map
φ ( y ) , ψ ( x ) Ξ φ ψ ( x , y , z )
from T 1 × C into T 2 , where no variable in z occurs in either x or y , such that the following conditions are satisfied:
(a)
The mapping (6) can be computed in O f ( φ ψ )  time;
(b)
If φ ( y ) Ξ φ ψ ( x , y , z ) is satisfiable, so is φ ( y ) ψ ( x ) ;
(c)
φ ( y ) ψ ( x ) ( z ) Ξ φ ψ ( x , y , z ) .
Observe that the two formulae appearing in condition (b) of Definition 3—namely, φ ( y ) Ξ φ ψ ( x , y , z ) and φ ( y ) ψ ( x ) —are in fact equisatisfiable. This is formalized in the following lemma.
Lemma 1. 
Let T 1 and T 2 be theories, and let C be a collection of formulae that is O ( f ) -expressible from T 1 into T 2 via a given mapping φ ( y ) , ψ ( x ) Ξ φ ψ ( x , y , z ) , as defined in Definition 3. Then, for every φ ( y ) T 1 and every ψ ( x ) C , the formulae
φ ( y ) Ξ φ ψ ( x , y , z ) and φ ( y ) ψ ( x )
are equisatisfiable.
Proof. 
Let φ ( y ) T 1 and ψ ( x ) C . We prove that the two conjunctions φ ( y ) ψ ( x ) and φ ( y ) Ξ φ ψ ( x , y , z ) are equisatisfiable.
  • (⇒) Suppose φ ( y ) Ξ φ ψ ( x , y , z ) is satisfiable. Then, by condition (b) of Definition 3, it follows that φ ( y ) ψ ( x ) is satisfiable.
  • (⇐) Conversely, suppose φ ( y ) ψ ( x ) is satisfiable, and let M be a model of this formula. Then, by condition (c) of Definition 3, we have
M φ ( y ) ( z ) Ξ φ ψ ( x , y , z ) .
This means that there exists an extension M of M to the variables in z —which, by hypothesis, occur neither in x nor in y —such that M Ξ φ ψ ( x , y , z ) ; hence, M φ ( y ) Ξ φ ψ ( x , y , z ) .
From the satisfiability of one formula, we have derived that of the other; hence, the two formulae are equisatisfiable.    □
While Lemma 1 shows that the two formulae φ ( y ) ψ ( x ) and φ ( y ) Ξ φ ψ ( x , y , z ) are equisatisfiable, Definition 3 does not adopt equisatisfiability as a primitive requirement. Instead, it explicitly requires the two separate conditions (b) and (c), which together entail equisatisfiability but provide strictly more structure. This asymmetry is intentional: clause (b) ensures that satisfiability of the translated formula readily implies satisfiability of the original, supporting the soundness of the translation; clause (c) provides a constructive existential guarantee—namely, that any model of φ ψ can be extended to a model of φ Ξ φ ψ , thus ensuring a form of witness-preserving completeness.
To analyze complexity preservation under O ( f ) -expressibility, we encounter expressions like g ( n + f ( n ) ) and, more generally, g ( O ( f ( n ) ) ) . It is therefore useful to assume a mild robustness of g under constant scaling, formalized below.
Definition 4 
(Scale-invariant function). A function g : N N is said to be scale-invariant (up to constants) if, for every real constant c > 0 , there exist constants K c > 0 and N c N such that
g ( c n ) K c g ( n ) , for all n N c .
Equivalently, g ( c n ) = O ( g ( n ) ) for every fixed c > 0 .
Remark 2. 
Definition 4 is closely related to classical doubling-type assumptions. For instance, in the theory of Orlicz spaces, the  Δ 2 -condition for a Young function Φ requires Φ ( 2 t ) C Φ ( t ) for all t 0 (ref. [10]). Similarly, if g is (eventually) nondecreasing and satisfies a doubling-type bound, namely, there exist constants K > 0 and N N such that
g ( 2 n ) K g ( n ) for all n N ,
then the estimate extends to any fixed scaling factor c > 1 by iteration. Indeed, choose k N with c 2 k . For all n 2 k 1 N , we have
g ( c n ) g ( 2 k n ) K k g ( n ) .
Finally, our assumption can be viewed as a discrete, one-sided variant of  O -regular variation. In the Karamata–Matuszewska framework, a function f is called  O -regularly varying if, for every λ 1 , the ratio f ( λ x ) / f ( x ) stays asymptotically bounded away from both 0 and + (via lim inf and lim sup) (cf. [11]). In contrast, we only require the upper control g ( c n ) = O ( g ( n ) ) , which is sufficient for the complexity-preservation steps where constant-factor rescalings arise.
Example 1. 
Let g ( n ) = n 2 and take c > 0 . Then,
g ( c n ) = c n 2 c 2 n 2 = c 2 g ( n ) ,
and hence g ( c n ) = O ( g ( n ) ) .
More generally, the scale-invariance property holds for all standard polynomially bounded complexity functions, such as n k and n log n , as well as for poly-logarithmic functions. More generally, it holds for functions of at most polynomial growth, and it also follows from a weak subadditivity condition: there exist constants C 1 1 and C 2 0 such that
g ( m + n ) C 1 g ( m ) + g ( n ) + C 2 ,
for all m , n N .
In contrast, g ( n ) = 2 n is not scale-invariant: for c = 2 we have g ( 2 n ) = 2 2 n = ( 2 n ) 2 , and the ratio g ( 2 n ) / g ( n ) = 2 n is unbounded, so g ( 2 n ) O ( g ( n ) ) .
The next lemma provides the quantitative part of the reduction: under O ( f ) -expressibility, it transfers the time bound for T 1 T 2 to T 1 C . The assumptions that g is nondecreasing and scale-invariant control the rescaling of input sizes, while the at least linear growth of g ensures that the preprocessing cost O ( f ( n ) ) is absorbed in the final bound.
Lemma 2. 
Let T 1 and T 2 be two theories (in the sense of Remark 1) and suppose that their conjunctive product
T 1 T 2 : = { φ 1 φ 2 : φ 1 T 1 , φ 2 T 2 }
admits a decision procedure running in time O g ( n ) , for some nondecreasing, scale-invariant function g : N N of at least linear growth, i.e.,  g ( n ) = Ω ( n ) .
Suppose moreover that the collection C of formulae is O ( f ) -expressible from T 1 into T 2 , for some complexity function f : N N .
Then, the combined theory T 1 C is decidable in time O g ( n + f ( n ) ) .
Proof. 
Let φ ( y ) T 1 and ψ ( x ) C , and set n : = | φ ψ | . Since C is O ( f ) -expressible from T 1 into T 2 , we can compute, in time O ( f ( n ) ) , a formula Ξ φ ψ ( x , y , z ) T 2 satisfying conditions (a)–(c) of Definition 3. Moreover, by condition (a) (and the fact that the output of an O ( f ( n ) ) -time transformation has size O ( f ( n ) ) ), we have | Ξ φ ψ | = O ( f ( n ) ) . Hence, there exist a constant c > 0 and a threshold N 0 N such that, for every input pair ( φ , ψ ) T 1 × C with | φ ψ | N 0 , we have
| φ Ξ φ ψ | c ( n + f ( n ) ) and thus | φ Ξ φ ψ | c ( n + f ( n ) ) ,
where n : = | φ ψ | as stipulated above.
By Lemma 1, the formulae φ ψ and φ Ξ φ ψ are equisatisfiable. Therefore, to decide satisfiability of φ ψ , it suffices to decide satisfiability of φ Ξ φ ψ , which belongs to T 1 T 2 . By hypothesis, this can be done in time O g ( | φ Ξ φ ψ | ) . Using that g is nondecreasing and scale-invariant, we obtain
g ( | φ Ξ φ ψ | ) g ( c ( n + f ( n ) ) ) = O g ( n + f ( n ) ) .
Finally, the overall running time is the sum of the preprocessing time O ( f ( n ) ) needed to compute Ξ φ ψ and the decision time for φ Ξ φ ψ ; hence,
O ( f ( n ) ) + O g ( n + f ( n ) ) .
Since g has at least linear growth, there exist constants a > 0 and N 1 N such that g ( t ) a t for all t N 1 . For all sufficiently large n, we have n + f ( n ) N 1 , and therefore
g ( n + f ( n ) ) a ( n + f ( n ) ) a f ( n ) ,
whence O ( f ( n ) ) O g ( n + f ( n ) ) . Consequently, the total running time is O g ( n + f ( n ) ) .
Since this procedure works uniformly for every input φ ψ T 1 C , the claim follows.    □

4. BST -Replacements and Flat Models

In this section, we develop two central notions that will be used in the rest of the paper. The first is that of BST -replacement, which captures how set assignments can be modified in a controlled way while preserving the satisfaction of BST + -formulae.
The second is the notion of a ♭-flat model, namely, a set assignment in which every element occurring in the interpretation of a variable has rank exactly ♭. This ‘single-layer support’ property makes it possible to reason cleanly about disjointness and membership when dealing with singleton atoms under replacement.

4.1. Replacement Assignments and BST -Replacements

In preparation for the results that follow, we need a method to modify a set assignment for a BST formula without disrupting satisfiability. Such a method is implemented by means of BST -replacements.
Definition 5 
(Replacement assignments and BST -replacements). Let M be a set assignment over a collection V of variables, let W V , and let S , T be nonempty sets. The replacement (assignment) restricted to W of M from S to T (or with respect to the pair ( S , T ) ), denoted ( M | W ) T S , is the set assignment defined for each x V by
( M | W ) T S x : = ( M x S ) T if x W ¬ D ISJ ( S , M x ) if x W M x otherwise .
The assignment ( M | W ) T S is a  BST -replacement restricted to W(or a BST W -replacement) of M from S to T if in addition the following condition holds:
( x W ) ( D ISJ ( S , M x ) S M x ) D ISJ ( T S , M x ) .
When W = V , we omit W from the notation and write M T S , calling it simply a replacement  of M from S to T. If condition (8) also holds, it is a  BST -replacement  of M from S to T.
Note that the requirement that S and T are nonempty is essential only for T: if S = , then ( M | W ) T S = M for any T, so the replacement has no effect and is of no interest.
Example 2 
( BST W -replacement). Let
V = { x , y , z } , W = { x , y } , S = { 2 , 3 } , T = { 3 , 7 } ,
and define the set assignment M by
M x = { 1 , 2 , 3 } , M y = { 2 , 3 , 4 } , M z = { 5 } .
Note that S , T .
Since x , y W with ¬ D ISJ ( S , M x ) and ¬ D ISJ ( S , M y ) , the restricted replacement ( M | W ) T S acts on x and y. On the other hand, as  z W , z is left unchanged:
( M | W ) T S ( x ) = ( M x S ) T = ( { 1 , 2 , 3 } { 2 , 3 } ) { 3 , 7 } = { 1 } { 3 , 7 } = { 1 , 3 , 7 } , ( M | W ) T S ( y ) = ( M y S ) T = ( { 2 , 3 , 4 } { 2 , 3 } ) { 3 , 7 } = { 4 } { 3 , 7 } = { 3 , 4 , 7 } , ( M | W ) T S ( z ) = M z = { 5 } .
We verify the BST -condition (8) for all u W :
( D ISJ ( S , M u ) S M u ) D ISJ ( T S , M u ) .
For u = x and u = y ,
S = { 2 , 3 } M x = { 1 , 2 , 3 } , S = { 2 , 3 } M y = { 2 , 3 , 4 } ,
and
T S = { 7 } is disjoint from M x = { 1 , 2 , 3 } and M y = { 2 , 3 , 4 } .
Hence, the condition holds for every u W . Therefore,
( M | W ) T S is a BST W - replacement of M from S to T .
Remark 3 
(Restricted setting convention). All results in this subsection are stated in the unrestricted case ( W = V ), except where explicitly noted otherwise (cf. Lemma 6), but they extend plainly to the restricted setting ( M | W ) T S with W V .
To study more closely how BST -replacements affect a set assignment, it is useful to introduce the partition induced by a set assignment, which decomposes its support into blocks determined by membership patterns of elements with respect to the assignment’s variable interpretations.
Definition 6. 
Given a set assignment M over a collection V of variables, the partition induced by Mis defined as
Σ M : = { x U M x y V U M y | U V } { } .
This notion provides a precise way to track the effect of a replacement, as illustrated in the subsequent lemma.
Lemma 3. 
Let Φ be a formula of  BST + , M a set assignment for it, and T a nonempty set such that D ISJ ( T , M v ) holds for all v Vars ( Φ ) . Then, for all σ Σ M
(a)
M T σ is a  BST -replacement of M;
(b)
Σ M T σ = ( Σ M { σ } ) { T } .
Proof.  
Let σ Σ M and set M : = M T σ .
(a) By the definition of Σ M , for every x Vars ( Φ ) either σ M x or D ISJ ( σ , M x ) . Together with the assumption that D ISJ ( T , M v ) holds for all v Vars ( Φ ) , this is precisely condition (8). Hence, M T σ is a BST -replacement of M.
(b) Each block of Σ M collects the elements that share the same membership pattern with respect to the variables of Φ . The replacement M T σ removes σ wherever it occurs and inserts T instead. Since T is disjoint from every M v , no other block is modified, and the pattern corresponding to σ is now realized by T. Therefore,
Σ M = ( Σ M { σ } ) { T } .    □
We note that the hypothesis on T in Lemma 3 (namely, its disjointness from every M v ) is stronger than what is needed in general. The next result shows that a replacement can always be reversed under the sole assumption that the BST -replacement conditions (8) are satisfied.
Lemma 4. 
Let M be a set assignment over a collection V of variables, and  let S , T be nonempty sets such that M T S is a BST -replacement of M from S to T. Then, ( M T S ) S T is a BST -replacement of M from T to S, and 
( M T S ) S T = M .
Proof. 
Set M : = M T S . We first check that the BST -conditions (8) hold for the pair ( T , S ) relative to M and then show that ( M ) S T = M .
Fix x V . From the BST -conditions for ( S , T ) relative to M, we know that either D ISJ ( S , M x ) or S M x , and in all cases D ISJ ( T S , M x ) .
  • Case 1: D ISJ ( S , M x ) . Then, M x = M x . Furthermore,
    T M x = ( T S ) ( T S ) M x = ( T S ) M x ( T S ) M x = ,
since D ISJ ( T S , M x ) and D ISJ ( S , M x ) . Hence, D ISJ ( T , M x ) . Also, D ISJ ( S T , M x ) holds because S T S and D ISJ ( S , M x ) . Thus, the BST -conditions for ( T , S ) hold at x, and the reverse update does nothing:
( M ) S T x = M x = M x .
  • Case 2: S M x . Then, M x = ( M x S ) T , so T M x . Moreover, S T is disjoint from M x S and from T; hence, D ISJ ( S T , M x ) . Therefore, the BST -conditions for ( T , S ) hold at x, and
( M ) S T x = ( M x T ) S = ( ( M x S ) T ) T S = ( M x S ) S = M x .
In both cases, the  ( T , S ) -conditions hold relative to M , and  ( M ) S T x = M x . Since x was arbitrary, we conclude that ( M ) S T is a BST -replacement of M from T to S, and indeed ( M ) S T = M .    □
The next lemma shows that satisfiability of a BST + -formula is preserved under BST -replacements: if a formula holds in a given assignment, it continues to hold after any valid BST -replacement.
Lemma 5. 
For every set assignment M, every formula Φ of   BST + , and every BST -replacement M of M, we have
M Φ M Φ .
Proof.  
Let ( S , T ) be any pair of nonempty sets fulfilling the BST -replacement conditions (8) with respect to M. As BST + -formulae are Boolean combinations of atoms of the form x = y z , it is enough to show that for every such atom (with variables in the domain of M) we have
M x = y z M T S x = y z .
Suppose M x = y z , that is, M x = M y M z .
We analyze separately the two cases D ISJ ( S , M x ) and ¬ D ISJ ( S , M x ) .
  • Case D ISJ ( S , M x ) . Then, M x = M T S x , and since S M y implies S M z , we have
    M T S x = M x = M y M z = ( M y S ) ( M z S ) = ( M y S ) T ( M z S ) T ,
using (8) to ensure D ISJ ( T S , M z ) . Hence, M T S x = y z .
  • Case ¬ D ISJ ( S , M x ) . By (8), we have S M x M y , and hence D ISJ ( S , M z ) . Using the definition of M T S , we compute
    M T S x = ( M x S ) T = ( M y M z ) S T = ( M y S ) M z T = ( M y S ) T M z = M T S y M T S z ,
where we used D ISJ ( T S , M z ) and D ISJ ( S , M z ) to conclude D ISJ ( T , M z ) .
Thus, in all cases,
M x = y z M T S x = y z .
For the converse implication, assume M T S x = y z . By Lemma 4, ( M T S ) S T is a BST -replacement of M T S and moreover ( M T S ) S T = M ; hence, by the first part,
M T S x = y z M x = y z .
Therefore, for every atomic formula of the form x = y z ,
M x = y z M T S x = y z .
It follows that the logical equivalence holds for all BST + -formulae Φ . This proves the lemma.    □
For later use, we also state a restricted variant, in which replacements are carried out only with respect to a chosen subset of variables.
Lemma 6. 
Let V be a collection of variables and W V . Let M be a set assignment over V, and let S , T be nonempty sets such that
M := ( M | W ) T S
is a BST -replacement of M restricted to W. If Φ is a BST + -formula such that for every literal ℓ in Φ either Vars ( ) W or Vars ( ) W = holds, then
M Φ M Φ .
Proof.  
As BST + -formulae are Boolean combinations of atoms x = y z , it suffices to check atoms. Fix an atom α .
If Vars ( α ) W = , then none of its variables is modified by the replacement, so M α iff M α .
If Vars ( α ) W , then satisfaction of α depends only on the restrictions M | W and M = ( M | W ) T S . By Lemma 5, M | W α iff M | W α ; hence, M α iff M α .
The claim follows by propositional logic.    □
With the same restriction on variables—namely, every literal lies wholly inside W or wholly outside it—Lemma 6 yields invariance of satisfiability under any finite sequence of BST -replacements restricted to W.
Corollary 1. 
Let V be a collection of variables and let W V . Let M ( 0 ) , , M ( m ) ( m 1 ) be set assignments over V such that, for each i = 1 , , m ,
M ( i ) = M | W ( i 1 ) T i S i
is a BST -replacement of M ( i 1 ) restricted to W, for some nonempty S i , T i . Let Φ be a  BST + -formula such that for every literal ℓ occurring in Φ we have either Vars ( ) W or D ISJ ( Vars ( ) , W ) . Then, for all 0 h , j m ,
M ( h ) Φ M ( j ) Φ .

4.2. ♭-Flat Models

Definition 7. 
For every ordinal 1 , a set assignment M over a collection V of variables is said to be ♭-flat if all sets in the realm M v | v V of M have rank ♭.
No membership atom x y is satisfied by any ♭-flat set assignment:
Lemma 7. 
Let M be a -flat set assignment over a collection V of variables. Then, M x M y for any x , y V .
Proof.  
By the ♭-flatness of M, for all x V , either rk M x = 0 (when M x = ), or  rk M x = + 1 (when M x ). In either case, rk M x , since ♭-flatness presupposes 1 . Thus, M x 1 p t M y y V for all x V .    □
A satisfiable formula of BST + always admits a ♭-flat model, for sufficiently large ♭. This is proved in the next lemma.
Lemma 8. 
Let Φ be a satisfiable  BST + -formula with n distinct variables. Then, for every ordinal n + 1 , Φ admits a ♭-flat set model of cardinality at most 2 n 1 .
Proof.  
Let Φ be satisfiable, M a model of Φ , and  Σ M = { σ 0 , , σ m 1 } its induced partition. Fix an ordinal n + 1 , where n = Vars ( Φ ) .
We associate to each block σ i Σ M a distinct set s i of rank ♭, as follows:
  • For every block σ i that already contains elements of rank ♭, choose one such element as s i .
  • For the remaining blocks, choose s i among elements of rank ♭ (possibly belonging to other blocks), ensuring that the chosen s i are all distinct and different from those fixed in step 1.
This is possible because the number of blocks is at most exponential in n = Vars ( Φ ) , while the collection W # of sets of rank exactly ♭ is large enough to supply that many distinct elements. Indeed, since each block of Σ M is nonempty and determined by a membership pattern over the n variables, it follows that | Σ M | 2 n 1 . On the other hand, the functions κ 2 κ κ and κ V κ are strictly increasing for κ 1 , so from (2) and (3) it follows that, for every integer n + 1 ,
W # = V + 1 V = 2 V V 2 V n + 1 V n + 1 = V n + 2 V n + 1 = W n + 1 # 2 n .
Therefore, whether ♭ is finite or infinite, W # contains at least 2 n elements of rank ♭, enough to assign distinct representatives to all the blocks of Σ M .
The argument that follows requires that the distinct blocks σ 0 , , σ m 1 of Σ M be ordered so that all blocks containing elements of rank ♭ come first. Based on this ordering, we recursively construct a sequence of assignments M 0 , , M m as follows:
M 0 M , M i + 1 ( M i ) { s i } σ i , i = 0 , , m 1 .
For the blocks considered in step 1, we have { s i } σ i ; hence, ( M i ) { s i } σ i is trivially a BST -replacement of M i from σ i to { s i } . For the blocks in step 2, the choice of s i guarantees that { s i } is disjoint from every block not yet replaced, so the BST -conditions also hold.
Therefore, each M i + 1 is a BST -replacement of M i , and since M 0 is a model of Φ , induction and Lemma 5 yield that every M i + 1 is as well. In particular, M m is a model of Φ with support { s 0 , , s m 1 } , consisting solely of sets of rank ♭. Therefore, M m provides the desired ♭-flat model of Φ , with cardinality at most 2 n 1 .    □
Remark 4. 
Building on an argument developed in [12] (see also [13]), one can show that every satisfiable formula Φ of  BST +  admits a model of cardinality less than the number of its distinct variables. Accordingly, in the proof of Lemma 8, the initial model M may be taken with support of size at most n 1 , where n = | Vars ( Φ ) | . The remainder of the argument then produces a ♭-flat model of cardinality at most n 1 .
This observation leads to the following sharper result.
Corollary 2. 
Let Φ be a satisfiable  BST + -formula with n distinct variables. Then, for every ordinal n + 1 , Φ admits a ♭-flat set model of cardinality at most n 1 . □

5. Existential Inexpressibility of z = x in BST +

We investigate whether atoms of the form z = x can be existentially expressed in BST + . If this were possible, then membership atoms x y would also be expressible: indeed, the presence of ‘{■}’ allows one to derive ‘∈’ as a definable construct.
Lemma 9. 
x y ( z ) ( z = x z y ) , if z is distinct from the variables x , y .
Proof.  
If M z = x z y holds, then M x y , since M x M z M y .
Conversely, if  M x y holds, extend M by putting M z = M x ; then, M x M y yields M z = x z y , so that M ( z ) ( z = x z y ) .    □
In what follows we show that membership atoms x y are not existentially expressible in BST + . Therefore, by Lemma 9, atoms of the form z = x are not existentially expressible either. Specifically, we prove that every satisfiable formula Ψ of BST + admits a flat model M, namely, a model whose support consists solely of elements all having the same positive rank. As a consequence, M x M y for any x , y Vars ( Ψ ) , and thus M x y .
We are now ready to prove that membership atoms x y —and hence, by Lemma 9, atoms of the form z = x —are not existentially expressible in BST + .
Theorem 1.
The atom x y is not existentially expressible in  BST + .
Proof.  
By way of contradiction, assume that x y is existentially expressible by a formula Ψ ( x , y , z ) of BST + involving only atoms of the form x = y z , i.e.,
x y ( z ) Ψ ( x , y , z ) .
Since x     y is trivially satisfiable, so are—by (9)— ( z ) Ψ ( x , y , z ) and Ψ ( x , y , z ) . Thus, by Lemma 8, Ψ ( x , y , z ) is modeled by a ♭-flat set assignment M * ; hence, by Lemma 7, M * x     M * y holds. It follows that M * x y ( z ) Ψ ( x , y , z ) holds, and therefore M * ( z ) Ψ ( x , y , z ) x y , which contradicts (9).    □

6. O ( n 2 ) -Expressibility in BST + of Singleton-Atom Conjunctions

In accordance with Definition 3, we shall prove that any conjunction ψ ( x ) of atoms of the form x = y is O ( n 2 ) -expressible from BST into BST + by exhibiting a map
φ ( y ) , ψ ( x ) Ξ φ ψ ( x , y , z ) ,
computable in quadratic time, such that conditions (a)–(c) of Definition 3 hold. Here, φ ( y ) ranges over BST -conjunctions, and the variables in z are distinct from those in x and in y .
Thus, let φ ( y ) and ψ ( x ) be of the said form. For each variable x Vars ( φ ψ ) (note that Vars ( φ ψ ) = Vars ( φ ) Vars ( ψ ) ), introduce a fresh auxiliary variable x ˜ , chosen distinct from all others. These variables will be used to mimic x y via the relation x ˜ y ˜ . Also, let z be an additional fresh auxiliary variable, to be interpreted as the empty set. Then, we put
Ξ φ ψ z = z z x = y in ψ x y x = y in ψ x = { y } in ψ y = y x = x x = y in ψ v Vars ( φ ψ ) ¬ D ISJ ( x , v ) x v x = y in ψ v Vars ( φ ψ ) ¬ D ISJ ( x , v ) y ˜ v ˜ x , y Vars ( φ ψ ) x = y x ˜ = y ˜
(thus, the list z of variables in Definition 3 consists of the collection x ˜ of all auxiliary set variables x ˜ together with z ).
Remark 5. 
In the formula Ξ φ ψ , the following shorthands are used:
D ISJ ( x , v ) def x = x v , x = v def x = v z , x v def x v = z , x v def x v = z v x z .
Here, z is the auxiliary variable intended to denote the empty set. Indeed, for every model M of Ξ φ ψ , the conjunct z = z z ensures M z = , from which the following equivalences follow:
M D ISJ ( x , v ) D ISJ ( M x , M v ) ; M x = v M x = M v ; M x v M x M v ; M x v M x M v .
For instance, the second equivalence can be justified as follows:
M x = v M x = v z M x = M v M z = M v .
These shorthands are introduced purely for readability: their syntactic form mirrors the underlying semantics, making formulae such as Ξ φ ψ easier to parse at a glance without repeatedly unfolding their full definitions.
By the preceding remark, Ξ φ ψ is a formula of BST + ; indeed, it is just a conjunction of a rather simple form. As a preliminary step, we record a basic semantic property that will be used repeatedly.
Lemma 10. 
For every model M of Ξ φ ψ , if x = { y } occurs in ψ, then
M x and D ISJ ( M x , M y ) .
Proof. 
Let M Ξ φ ψ , and let x = { y } be a literal in ψ . Since Ξ φ ψ includes the conjuncts
z = z z , x y , and ¬ D ISJ ( x , y ) x y ,
the equivalences (11) in Remark 5 imply
M x M y and ¬ D ISJ ( M x , M y ) M x M y .
From this, it follows that M x and D ISJ ( M x , M y ) , as claimed.    □

6.1. Design and Analysis of the Translation Algorithm

We first present an explicit procedure that constructs Ξ φ ψ from the input conjunction φ ψ ; see Algorithm 1. We then analyze its running time, thereby establishing point (a) of Definition 3.
We use the auxiliary program variables V and Ξ in the pseudocode below to store, respectively, the list of variables encountered in φ ψ and the list of generated conjuncts.
Algorithm 1 Construction of Ξ φ ψ
  1:
Initialize V as an empty list of set variables;
  2:
Initialize Ξ as an empty list of conjuncts;
  3:
for each set variable x that appears in φ  do
  4:
    add x to V ;
  5:
for each conjunct x = y in ψ  do
  6:
    add x and y to V ;
  7:
    add x y to Ξ ;
  8:
for each conjunct x = y in ψ  do
  9:
    for each v V  do
10:
        add ¬ D ISJ ( x , v ) x v ¬ D ISJ ( x , v ) y ˜ v ˜ to Ξ ;
11:
for each pair x = y , x = y of distinct conjuncts in ψ  do
12:
    add y = y x = x to Ξ ;
13:
for all x , y V  do
14:
    add ( x = y x ˜ = y ˜ ) to Ξ ;
15:
add z = z z to Ξ ;
16:
return  Ξ .
The running time of Algorithm 1 is bounded as follows.
Lemma 11. 
The formula Ξ φ ψ can be constructed from φ and ψ in time O φ ψ 2 .
Proof.  
In order to prove the lemma, it is convenient to spell out in Algorithm 1 the procedure that constructs Ξ φ ψ from the conjunction φ ψ . We maintain the program variable V as a list of variables and Ξ as a list of conjuncts, so that appending a variable or a conjunct takes constant time. Moreover, each conjunct appended to Ξ has constant size.
The for-loop at lines 3–4 can be performed in Θ ( φ ) time, where φ denotes the total length of the conjunction φ . Similarly, the for-loop at lines 5–7 can be performed in Θ ( ψ ) time.
Letting m be the length of the list V after the execution of the for-loops 3–4 and 5–7, we have m φ + ψ .
  • Upper bound (running time). The only nontrivial operations performed by Algorithm 1 are iterating through the for-loops and appending variables/conjuncts to the lists V and Ξ . By assumption, each append takes constant time, and each conjunct appended to Ξ has constant size. Hence, the running time is asymptotically proportional to the total number of loop iterations.
As noted, lines 5–7 perform Θ ( ψ ) iterations. Lines 8–10 perform Θ ( ψ · m ) iterations (for each conjunct in ψ , the inner loop ranges over all v in V ). Lines 11–12 perform Θ ( ψ 2 ) iterations, since they range over all pairs of distinct conjuncts in ψ . Finally, lines 13–14 perform Θ ( m 2 ) iterations, since they range over all pairs ( x , y ) with x and y in V , whereas line 15 clearly takes constant time.
Therefore, the overall running time is
Θ φ + ψ + ψ · m + ψ 2 + m 2 .
Since m φ + ψ φ ψ and ψ φ ψ , we conclude that Algorithm 1 runs in time
O φ ψ 2 ,
as claimed. □
Before proving the semantic conditions (b) and (c) of Definition 3 for the translation map (10), we briefly illustrate Ξ φ ψ on two concrete examples. We then return to the main line of the argument and show that
  • If φ ( y ) Ξ φ ψ ( x , y , z ) is satisfiable, then so is φ ( y ) ψ ( x ) ;
  • Every model of φ ( y ) ψ ( x ) can be extended into a model of Ξ φ ψ ( x , y , z ) .
As above, this is where the list z of variables consists of the collection x ˜ of all auxiliary set variables x ˜ together with z . This will prove that conditions (b) and (c) of Definition 3 hold and hence that every singleton-atom conjunction is O ( n 2 ) -expressible from BST into BST + .

6.2. Illustrating the Translation Ξ φ ψ on Examples

We start with an unsatisfiable input, showing how Ξ φ ψ preserves unsatisfiability, and then consider a satisfiable instance illustrating the model extension mechanism used for condition (c).
Example 3. 
The conjunction φ ψ , where φ y = x z and ψ x = y y = { z } , is not satisfiable; in fact, any set assignment M satisfying this formula is such that M y = M x M z , M x = { M y } , and M y = { M z } ; thus, M y = M z , and hence M z = M z , a contradiction.
Our translation Ξ φ ψ comprises
y x y ¬ D ISJ ( x , y ) x y .
Any model M for φ Ξ φ ψ is such that M y M x (since M y = M x M z ), M y , and M x M y hold, so that ¬ D ISJ ( M x , M y ) ; but then M x M y must hold, which conflicts with M x M y .
The next example complements the previous one: it is satisfiable and highlights the extension of a model of φ ψ to the auxiliary variables so as to satisfy Ξ φ ψ (cf. Section 6.4).
As we will prove in Section 6.4, satisfiability carries over from φ ψ to φ Ξ φ ψ . Here is an interesting example of this:
Example 4.
Consider the conjunction φ ψ , where
φ x = y y x = z z , and ψ z = { y } .
This formula is satisfiable: for instance, it is satisfied by every set assignment M over Vars ( φ ψ ) of the form
M x , M y s , M z s , M y s s , M z s s ,
where s, s , and s are any well-founded sets.
It can easily be checked that, for every set assignment M of the form (12), the extension M + of M over the auxiliary variables v ˜ , for v Vars ( φ ψ ) and where
M + v ˜ s s if s M v otherwise ,
satisfies Ξ φ ψ , so that M ( x ˜ , y ˜ , z ˜ , y ˜ , z ˜ ) Ξ φ ψ holds. Thus, to show that condition (c) of Definition 3 is satisfied, namely, that φ ψ ( x ˜ , y ˜ , z ˜ , y ˜ , z ˜ ) Ξ φ ψ holds, it is enough to check that the conjunction φ ψ is satisfied by set assignments of the form (12) only. Let then M ¯ be any model for φ ψ . Since M ¯ x M ¯ z and M ¯ z = M ¯ y , either M ¯ x = or M ¯ x = M ¯ y holds. The latter case can be readily ruled out, for in view of M ¯ x M ¯ y it would follow M ¯ y M ¯ y , which is untenable in the realm of well-founded sets. Thus, M ¯ x = must hold. Letting s M ¯ y —so that M ¯ z = s —, since M ¯ y M ¯ y and M ¯ z M ¯ z , we have M ¯ y = s s and M ¯ z = s s , for some sets s and s , and therefore M ¯ has the form (12).

6.3. Satisfiability Preservation

We first establish that if φ Ξ φ ψ is satisfiable, then so is φ ψ , thereby fulfilling condition (b) of Definition 3.
Let M be a model of φ Ξ φ ψ . In order to convert it into a model of φ ψ , we can assume that M is ♭-flat over Vars ( φ Ξ φ ψ ) for some integer
> max Vars ( φ Ξ φ ψ ) , | ψ | .
Indeed, on the one hand, Lemma 8 enables us to do so; on the other hand, Lemma 7 tells us that such an M does not model any of the atoms x = y in ψ .
Set M ( 0 ) M , and let m be the number of singleton conjuncts in ψ . For i = 1 , , m , we select an atom x i = y i in ψ and lift M ( i 1 ) to a model M ( i ) of φ such that every conjunct of ψ already satisfied in M ( i 1 ) remains satisfied in M ( i ) , and the selected conjunct is also satisfied in M ( i ) . At the end of this iterative process, all conjuncts of ψ will be satisfied. More precisely, if ψ i denotes the conjunction of the atoms selected up to step i, then ψ 1 ψ m coincides with ψ (possibly reordered and with repetitions).
In our set up, each ψ i must hence comprise a conjunct x i = y i from ψ not appearing in any ψ j with j < i . The selection of x i = y i determines the transformation from M ( i 1 ) to M ( i ) and is made as follows: among the conjuncts of ψ that are not already present in ψ 1 ψ i 1 , select one that is minimal with respect to the ordering ψ M on the conjuncts of ψ , defined below:
Definition 8.
The relation ψ M is the minimal transitively closed relation over the conjuncts of ψ such that, for all singleton atoms x = { y } and x = { y } in ψ, one has
( x = { y } ) ψ M ( x = { y } ) whenever ¬ D ISJ ( M x , M y ) .
This definition enforces the following:
Lemma 12.
The relation ψ M is a strict partial order.
Proof.  
By definition ψ M is transitive, so it remains only to prove that it is acyclic.
Suppose, by way of contradiction, that ψ M contains some cycle. Then, there are atoms 0 , 1 , , h of ψ with
0 M 1 M M m 1 M h = 0 ,
where each i is of the form x i = y i for i = 0 , 1 , , h . By the definition of ψ M , we have
¬ D ISJ ( M x 0 , M y 1 ) , , ¬ D ISJ ( M x h 1 , M y 0 ) .
Since M Ξ φ ψ , we know in particular that
M x = y in ψ v Vars ( φ ψ ) ¬ D ISJ ( x , v ) y ˜ v ˜ ,
and so we obtain the strict chain
M y ˜ 0 M y ˜ 1 M y ˜ h 1 M y ˜ 0 ,
which yields the contradiction M y ˜ 0 M y ˜ 0 .
Therefore, ψ M has no cycles and is thus acyclic. Together with its transitivity by definition, this shows that ψ M is a strict partial order, as claimed.   □
From the previous lemma, we may arrange the literals in ψ as
1 , , m ,
where each i has the form x i = y i , in such a way that the sequence complies with the strict partial order ψ M ; that is, for all 1 i < j m , we have j ¬ ψ M i .
Such an ordering will guide the iterative process: it is not arbitrary but rather ensures that the satisfaction of atoms established in earlier steps is preserved in all subsequent steps.
The iterative process proceeds as follows: at each step i, an atom i x i = y i is selected from ψ , as specified above, so that i is minimal, in ψ with all atoms selected at previous steps removed, with respect to ψ M ; that is, for all i , j { 1 , , m } ,
if i < j then j ψ M i ,
Then, letting
X i M ( i 1 ) x i and Y i M ( i 1 ) y i ,
we define
M ( i ) M | W ( i 1 ) { Y i } X i ,
where
W Vars ( ϕ ψ ) .
That is, M ( i ) is obtained from M ( i 1 ) by replacing X i with { Y i } while restricting to the variables in Vars ( ϕ ψ ) .
Remark 6.
Throughout the remainder of this subsection, we may occasionally omit to mention the explicit restriction to W = Vars ( φ ψ ) and, for instance, write ( M ( i 1 ) ) { Y i } X i in place of M | W ( i 1 ) { Y i } X i .
The following lemma establishes a useful dichotomy on the possible ranks of the values assigned to variables during the replacement process.
Lemma 13.
For each i = 1 , , m , we have
either rk M ( i 1 ) y i i 1 < m or rk M ( i 1 ) y i > .
Proof.  
Preliminarily, it is immediate to check that for every i = 1 , , m we have
supp ( M ( i ) ) supp ( M ( i 1 ) ) { M ( i 1 ) y i } .
We proceed by induction on i.
Base case ( i = 1 ). Before any replacement we are at M ( 0 ) , which by construction is ♭-flat; hence, every member of every M ( 0 ) v has rank ♭. Thus, rk M ( 0 ) y 1 is either 0 or + 1 , so the stated dichotomy holds.
Inductive step. Fix i > 1 and assume the statement holds for all indices 1 , , i 1 . By iterating (16) we obtain
supp ( M ( i 1 ) ) supp ( M ( 0 ) ) { M ( t 1 ) y t 1 t i 1 } .
Let s M ( i 1 ) y i be arbitrary. Then, s supp ( M ( i 1 ) ) , so by (17) either
(a)
s supp ( M ( 0 ) ) , in which case rk s = (since M ( 0 ) is ♭-flat);
(b)
s = M ( t 1 ) y t for some t { 1 , , i 1 } . By the inductive hypothesis at index t, either rk s = rk M ( t 1 ) y t t 1 i 2 , or rk s > .
If case (a) occurs for some member s of M ( i 1 ) y i , then rk M ( i 1 ) y i + 1 > . If case (b) occurs with rk s > for some member s, again rk M ( i 1 ) y i > . Otherwise, all members of M ( i 1 ) y i (if any) have rank at most i 2 , whence
rk M ( i 1 ) y i ( i 2 ) + 1 = i 1 .
Finally, by our choice of ♭ we have m , so in particular i 1 < m . This yields exactly the stated dichotomy for i.
This completes the proof.   □
Our goal is to prove that the final assignment M ( m ) is a model of φ ψ . To this end, we first establish a stability lemma, from which it follows that after step i the sets assigned to the variables in the atom x i = y i remain unchanged in all later assignments M ( t ) , for t > i .
Lemma 14. 
For every i = 1 , , m , letting x i = y i be the atom of ψ selected at step i, we have the following:
(a)
M ( i ) is a  BST -replacement of M ( i 1 ) restricted to Vars ( φ ψ ) such that M ( i ) φ Ξ φ ψ ;
(b)
for each j with 1 j < i ,
(b1)
M ( j ) y j = M ( i ) y j ;
(b2)
M ( j ) x j = M ( i ) x j ;
(c)
M ( i ) y i = M ( i 1 ) y i ;
(d)
M ( i ) x i = M ( i ) y i .
Proof Sketch. 
The proof proceeds by induction on i, relying on the preservation properties of BST -replacements and the constraints encoded in Ξ φ ψ . The detailed argument, which involves a careful case analysis for clauses (a)–(d), is rather technical. For readability, we defer the full proof to Appendix B. □
The replacement construction of Lemma 14 immediately entails a monotonicity condition on the cardinalities of the assigned sets, stated next.
Corollary 3.
For every i = 1 , , m and every v Vars ( φ Ξ φ ψ ) , we have
M ( i ) v M ( i 1 ) v .
Consequently, for every v Vars ( φ Ξ φ ψ ) , we have
M ( m ) v M ( 0 ) v .
Proof. 
Fix i { 1 , , m } and v Vars ( φ Ξ φ ψ ) . If M ( i ) v = M ( i 1 ) v , the claim is immediate. Otherwise, since M ( i ) is obtained from M ( i 1 ) by a BST -replacement (with respect to the pair M ( i 1 ) x i , { M ( i 1 ) y i } ), the only way M ( i 1 ) v can change is when M ( i 1 ) x i M ( i 1 ) v , in which case
M ( i ) v = M ( i 1 ) v M ( i 1 ) x i { M ( i 1 ) y i } .
Plainly, M ( i 1 ) x i { M ( i 1 ) y i } ; hence, M ( i ) v M ( i 1 ) v .
Applying this inequality successively for i = 1 , 2 , , m yields
M ( m ) v M ( m 1 ) v M ( 0 ) v ,
as required.   □
We are now ready to prove that M ( m ) satisfies φ ψ . By Lemma 14(a), we have
M ( m ) φ .
Moreover, Lemma 14(c) guarantees that, for every i with 1 i m ,
M ( i ) x i = { M ( i ) y i } .
Thus, to establish M ( m ) ψ , it suffices to prove that, for all i,
M ( m ) x i = { M ( m ) y i } .
If i = m , then (19) immediately gives (20). If i < m , clauses (b1) and (b2) of Lemma 14 yield M ( i ) y i = M ( m ) y i and M ( i ) x i = M ( m ) x i , respectively. Combining these with (19), we again obtain (20). Hence, M ( m ) ψ . From this and (18), it follows that M ( m ) φ ψ , thus completing the proof of satisfiability preservation.

6.4. Model Extension

We next prove that every model of φ ψ can be extended into a model of φ Ξ φ ψ , which corresponds to condition (c) of Definition 3.
Let φ ψ be satisfiable, and let M be a model of it. Define ≺ as the minimal transitive relation on Vars ( φ ψ ) such that
y x whenever M y M x , for all x , y Vars ( φ ψ ) .
By set well-foundedness, ≺ is irreflexive; together with transitivity, this makes ≺ a strict ordering on Vars ( φ ψ ) .
Extend M to the auxiliary variables v ˜ , for each v Vars ( φ ψ ) , by setting for each of them:
M v ˜ { M x x v } .
Moreover, extend M to the auxiliary variable z by stipulating
M z ,
so that in particular
M z = z z .
We prove that M Ξ φ ψ holds.
Preliminarily, note that M x is a singleton for every singleton atom x = y in ψ . It then follows directly that
M x = y in ψ v Vars ( φ ψ ) ¬ D ISJ ( x , v ) x v .
Moreover, for each singleton atom x = { y } in ψ , the acyclicity of the membership relation yields M x M y , and so M x y . Therefore,
M x = { y } ψ x y .
In addition, for each pair of singleton atoms x = { y } and x = { y } in ψ , we plainly have
M y = y x = x ,
whence
M x = { y } ψ x = { y } ψ y = y x = x .
Now, consider any implication of the form
¬ D ISJ ( x , v ) y ˜ v ˜ ,
with x = { y } in ψ and v Vars ( φ ψ ) , and assume that M ¬ D ISJ ( x , v ) . Since M x = { M y } , this yields M x M v ; hence, M y M v . Thus, y v . By the transitivity and strictness of ≺, we obtain
M y ˜ = { M y y y } { M y y v } = M v ˜ ,
namely, M y ˜ v ˜ .
Hence, by the arbitrariness of x = { y } in ψ and v Vars ( φ ψ ) , it follows that
M x = { y } ψ v Vars ( φ ψ ) ¬ D ISJ ( x , v ) y ˜ v ˜ .
Finally, consider the implications
x = y x ˜ = y ˜ ,
with x , y Vars ( φ ψ ) . Assume M x = M y . Suppose v x . By definition of ≺, there exists a finite chain of variables v 1 , , v n such that
v = v 1 , M v i M v i + 1 for each i , v n = x .
Hence,
M v = M v 1 M v n = M x = M y ,
which yields v y . By a symmetric argument, v x follows from v y . Therefore,
{ v v x } = { v v y } ,
and thus M x ˜ = M y ˜ . Hence,
M x , y Vars ( φ ψ ) x = y x ˜ = y ˜ .
Collecting (21)–(25), and the above argument, we see that all conjuncts of Ξ φ ψ are satisfied by M. Therefore,
M Ξ φ ψ ,
which establishes the model extension property.
We have so extended a generic M such that M φ ψ into a model of φ Ξ φ ψ ; therefore, we get
φ ( y ) ψ ( x ) ( z ) Ξ φ ψ ( x , y , z ) ,
where the list z consists of the collection x ˜ of all auxiliary set variables x ˜ together with z .
Finally, in view of Definition 3, combining the satisfiability preservation established in Section 6.3 with the model extension property of Section 6.4, and relying on Lemma 11, we obtain our main expressibility result:
Theorem 2.
Singleton-atom conjunctions are O ( n 2 ) -expressible from  BST into  BST + .

7. Conclusions: Related and Planned Work

The main contributions of this paper are as follows:
  • The introduction of the notion of O ( f ) -expressibility across theories, refining the existing notion of existential expressibility, which, while useful, is too coarse for our purposes.
  • The proof that atoms of the form z = x are not existentially expressible in BST + .
  • The proof that, by contrast, any conjunction of such atoms is O ( n 2 ) -expressible from BST into BST + , using a construction we call the nested-to-flat translation.
As noted in the Introduction, the authors’ interest in satisfiability mechanisms for restricted fragments of set theory arises primarily from the design and experimental use of the ÆtnaNova proof verifier.
As discussed ref. [3], ELEM is the core inference mechanism in ÆtnaNova , where it often operates implicitly alongside other methods. It is based on an enhanced form of multilevel syllogistic (ref. [1]), a decision procedure for checking the satisfiability of certain unquantified set-theoretic formulae. This procedure allows ÆtnaNova to establish that a statement follows from a given proof context by showing that its negation yields an unsatisfiable conjunction with earlier statements.
When parts of a proof involve constructs that fall outside the scope of ELEM ’s built-in syllogistic, a preprocessing step replaces them with fresh variables, ensuring uniform treatment of identical structures. However, proof steps are rarely uniquely determined by prior lines and hints alone. This is because ELEM often generates a range of easy consequences from a given context, leaving the user free to choose which of these consequences to assert as the next proof step.
Since inference mechanisms akin to ELEM are likely to play a central role in proof technology, it is important to develop translation methods of low algorithmic cost—such as the nested-to-flat translation discussed above—that make their integration possible.

7.1. Envisaged Enhancements to the Nested-to-Flat Translation

We expect that the satisfiability-preserving translation treated in Section 6 can be tuned to theories richer that BST , such as the following. Let BSTS denote the collection of conjunctions of literals of the forms
x = y z , x y z , x = y
and let BSTS + be the Boolean closure of atoms of the forms
x = y z , x = y .
Then, it seems plausible that conjunctions of atoms of the form
F INITE x , D ENUMERABLE x ,
together with their negations—where F INITE x expresses that the set assigned to x is finite and D ENUMERABLE x expresses that the set assigned to x is denumerable (i.e., finite or countably infinite)—are likewise O ( n 2 ) -expressible from BSTS to BSTS + . Exploring this broader scenario is part of our ongoing research agenda.

7.2. Towards Integrating Set-Theoretic, Boolean, and Numerical Constraint Reasoning

Besides its independent interest, the nested-to-flat translation discussed so far can be seen as a preparatory step toward the quantitative approach to logical inference (cf. [14,15]), as specialized to the field of computable set theory (cf. Ch. 11 ref. [16]).
Our translation, in fact, lays the groundwork for reducing satisfiability problems about sets to satisfiability problems about nonnegative integers—or, if we move from the theory of hereditarily finite sets to a more general setting, to the language of the additive theory of cardinals (which is decidable; ref. [17]).
One such reduction, where the source language embodies a cardinality operator x indicating how many elements belong to the set x, is presented in Sec. 11.1 ref. [16]. Let us briefly outline how it proceeds. We are given a conjunction ψ of literals of the forms
x = y z , x = y , h = x , h = i + j , h = 1 .
Suppose we want to test ψ for satisfiability over the hereditarily finite sets. Here, x , y , z stand for set variables while i , j , h stand for numeric variables drawn from a disjoint infinite set of symbols and are intended to range over the nonnegative integers. Let X be the collection of all set variables occurring in ψ ; let, moreover, Q be the collection of all nonempty subsets Q of X such that the assignment
B Q ( v ) := i f v Q , i f v X Q ,
satisfies all equalities of the form x = y z occurring in ψ . Associate two numeric variables ν Q , ϱ Q with each Q in Q . We have an algorithm that, given ψ , constructs a system C of purely arithmetic constraints such that ψ is satisfiable over sets and natural numbers if and only if C has a solution over the natural numbers. Those literals of the forms h = i + j and h = 1 that were in ψ from the outset are retained in C ; in addition to them, C encompasses conditions specifying the intended meanings of ν Q , ϱ Q relative to a (hypothetical) model x ξ x of ψ , namely,
  • Each ν Q represents the cardinality of the set
    Q ^ := v Q ξ v w X Q ξ w ;
  • Each ϱ Q represents the position occupied by Q ^ in a total ordering that extends the rank comparison relation.
The reduction just outlined can certainly be refined beyond the treatment ref. [16], as we will strive to do in the future. We also expect that it can be boosted with the treatment of explicit rank-related constructs, such as the comparison relator rk x < rk y .
Some reductions of the set-satisfiability problem to integer programming can be found ref. [18], whose line of research aimed at integrating linear programming problems and set constraint manipulation methods in a single logic programming language, as explained refs. [19,20] (the endeavor of integrating cardinality constraints into constraint logic programming with sets has been carried out with a different approach, as reported ref. [21]). A technique for reducing the problem of multilevel syllogistic (cf. [3]) to propositional consistency testing was described ref. [18] (an account of it can also be found in Sec. 11.3 ref. [16]).
In Sec. 4.5 ref. [22], the authors show that satisfiability of nonrecursive Tarskian set constraints is decidable in nondeterministic double-exponential time by reducing the problem to a class of Diophantine constraints called prequadratic. They prove that satisfiability of prequadratic Diophantine constraints is decidable in nondeterministic exponential time and conjecture that it is in NP. If this conjecture holds, satisfiability of nonrecursive Tarskian constraints would be decidable in nondeterministic single-exponential time.
Along a related line of research, ref. [23] reduces quantifier-free constraints on sets involving cardinalities along with direct and inverse images of functions on sets, to systems of numerical constraints in linear integer arithmetic or of the form x y d , where d is a positive integer.

7.3. Difference Algebras

We take this opportunity to mention an issue concerning an alternative semantics for BST , which, although only loosely related to the central theme of this paper, is nevertheless of independent interest. In his bachelor’s thesis Unificazione semantica in strutture booleane (‘Semantic unification in Boolean structures’), defended at the University of Trieste in 2020, Mattia Furlan isolated the valid formulae involving Boolean difference that are displayed in Figure 1.
Let us adopt the universal closures of these formulae as the axioms of a theory in quantificational first-order logic with equality. These axioms characterize an algebraic variety, whose instances we provisionally call difference algebras.
A natural open question is whether every difference algebra D = ( D , D ) is isomorphic to an algebra of the form S = ( S , ) , in which the operator ‘∖’ is interpreted as ordinary set-theoretic difference. In this case, S must be a family of sets closed under difference, and hence under intersection, since
X Y = X ( X Y )
holds for all sets X , Y .
One might hope to settle this question by appealing to Stone’s celebrated representation theorem, which states that every Boolean algebra is isomorphic to a field of sets. However, we see no direct way to apply this theorem, since there exist difference algebras D whose carrier D is not closed under symmetric difference, viewed as an operation Y , Z Y D Z satisfying, for all X , Y , Z in D ,
X = Y D Z ( X D ( Y D Z ) = Z D Y Y D Z = X D Z ) .
Moreover, it is unclear how to embed an arbitrary difference algebra into one that forms a proper Boolean ring by virtue of satisfying this closure property.

Author Contributions

Conceptualization, D.C., A.D.D., P.M. and E.G.O.; Methodology, D.C., A.D.D., P.M. and E.G.O.; Writing – original draft, D.C., A.D.D., P.M. and E.G.O.; Writing – review & editing, D.C. and E.G.O. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The data supporting this theoretical study are contained within the article.

Acknowledgments

This work benefited from interactions with the Research Program PIAno di inCEntivi per la Ricerca di Ateneo 2024–2026—Linea di Intervento I “Progetti di ricerca collaborativa”—University of Catania—Project “Semantic Web of EveryThing through Ontological Protocols” (SWETOP). We also acknowledge INdAM/GNCS.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A. A Lower Bound on the Number of Sets of a Fixed Integer Rank

Here, we figure out inequalities preparatory to the proof of Proposition A2 below.
Proposition A1.
For every positive integer n, we have
2 2 n 2 n 2 ( 2 n n ) .
Proof.  
We prove the proposition by establishing the equivalent inequality
2 2 n 3 · 2 n 2 n ,
which follows by simply adding 2 n to both sides of the original inequality.
We proceed by induction on n 1 .
  • Base case:
For n = 1 , we have
2 2 1 = 4 and 3 · 2 1 2 · 1 = 6 2 = 4 .
Thus, the base case holds with equality.
  • Inductive step:
Fix n 1 and assume that
2 2 n 3 · 2 n 2 n .
We aim to prove that
2 2 n + 1 3 · 2 n + 1 2 ( n + 1 ) .
Note that 2 2 n + 1 = 2 2 n 2 . Since n 1 , we have 2 2 n 2 2 = 4 , and thus
2 2 n 2 4 · 2 2 n .
Using the inductive hypothesis (A2), we obtain
2 2 n + 1 = 2 2 n 2 4 · 2 2 n 4 ( 3 · 2 n 2 n ) = 12 · 2 n 8 n .
Now, observe that
3 · 2 n + 1 2 ( n + 1 ) = 6 · 2 n 2 n 2 ,
so it suffices to verify the inequality
12 · 2 n 8 n 6 · 2 n 2 n 2 ,
which reduces to
6 · 2 n 6 n + 2 0 .
This inequality clearly holds, since n 1 implies 2 n n 0 , and the leftover + 2 ensures strict positivity.
Thus, the inductive step is verified.   □
Next, we come to a proposition which lies in the background of this paper:
Proposition A2. 
For every positive integer n, the number of well-founded sets of rank equal to n is greater than or equal to 2 n 1 , namely,
W n # 2 n 1 .
Proof.  
We proceed by induction on n 1 .
  • Base case:
For n = 1 , we have W 1 # = 1 = 2 1 1 .
  • Inductive step:
Let n 1 be such that W n # 2 n 1 . We aim to show that
W n + 1 # 2 n .
From the equality W n # = V n + 1 V n and the inductive hypothesis, it follows that
V n + 1 = W n # + V n 2 n 1 + V n .
Hence, by (2),
W n + 1 # = V n + 2 V n + 1 = 2 V n + 1 V n + 1 = 2 2 V n 2 V n 2 2 V n V n ( by Proposition , since V n 1 ) = 2 V n + 1 V n = 2 W n # 2 · 2 n 1 ( by the induction hypothesis ) = 2 n .      

Appendix B. Proof of Lemma 14

For convenience, we restate Lemma 14 before giving its proof.
Lemma 14.
For every i = 1 , , m , letting x i = y i be the atom of ψ selected at step i, we have
(a)
M ( i ) is a BST -replacement of M ( i 1 ) such that M ( i ) φ Ξ φ ψ ;
(b)
For each j with 1 j < i
(b1)
M ( j ) y j = M ( i ) y j ;
(b2)
M ( j ) x j = M ( i ) x j ;
(c)
M ( i ) y i = M ( i 1 ) y i ;
(d)
M ( i ) x i = M ( i ) y i .
Proof. 
We prove the lemma by strong induction on i = 1 , , m .
Base case ( i = 1 ). Since M ( 0 ) φ Ξ φ ψ , we have in particular
M ( 0 ) x Vars ( φ ψ ) ( ¬ D ISJ ( x 1 , x ) x 1 x ) .
Moreover, by Lemma 10, M ( 0 ) x 1 . In addition, since rk M ( 0 ) y 1 is either 0 or + 1 , the ♭-flatness of M ( 0 ) ensures that D ISJ ( M ( 0 ) y 1 , M v ) , for all v Vars ( φ ψ ) . Hence, the BST -conditions for the pair M ( 0 ) x 1 , M ( 0 ) y 1 are satisfied, and therefore M ( 1 ) is a BST -replacement of M ( 0 ) from M ( 0 ) x 1 to M ( 0 ) y 1 (restricted to Vars ( φ ψ ) ). Since every literal in φ Ξ φ ψ satisfies either Vars ( ) Vars ( φ ψ ) or Vars ( ) Vars ( φ ψ ) = , Lemma 6 ensures that M ( 1 ) φ Ξ φ ψ . Therefore, clause (a) holds in the base case.
Clause (b) is vacuous when i = 1 .
Finally, concerning clause (c), from (A5) and Lemma 10 we obtain D ISJ ( M ( 0 ) x 1 , M ( 0 ) y 1 ) . Therefore, by the definition of M ( 1 ) , it follows that M ( 1 ) y 1 = M ( 0 ) y 1 .
Inductive step ( 1 < i m ). Assume (a)–(d) hold for all 1 t < i . □
Proof of (a).
We show that M ( i ) is a BST -replacement of M ( i 1 ) by verifying that the BST -conditions (8) hold for the pair M ( i 1 ) x i , M ( i 1 ) y i relative to M ( i 1 ) .
We first show that M ( i 1 ) x i . By the inductive hypothesis (a) at step i 1 , we have M ( i 1 ) φ Ξ φ ψ . Therefore, by Lemma 10, we conclude that M ( i 1 ) x i . Moreover, M ( i 1 ) y i holds trivially. Thus, to complete the proof that M ( i ) is a BST -replacement of M ( i 1 ) , it remains to check that the following conditions hold for all v Vars ( φ ψ ) :
(A)
D ISJ M ( i 1 ) x i , M ( i 1 ) v M ( i 1 ) x i M ( i 1 ) v ;
(B)
D ISJ M ( i 1 ) y i M ( i 1 ) x i , M ( i 1 ) v .
Concerning (A), recalling that M ( i 1 ) φ Ξ φ ψ , in particular we have
M ( i 1 ) v Vars ( φ ψ ) ¬ D ISJ ( x i , v ) x i v ,
as such conjunction is explicitly contained in Ξ φ ψ . Thus,
¬ D ISJ ( M ( i 1 ) x i , M ( i 1 ) v ) M ( i 1 ) x i M ( i 1 ) v , for all v Vars ( φ ψ ) ,
which establishes (A).   □
We now turn to clause (B). Its verification requires a case analysis, carried out for an arbitrary v Vars ( φ ψ ) .
If D ISJ { M ( i 1 ) y i } , M ( i 1 ) v holds, then a fortiori so does D ISJ { M ( i 1 ) y i } M ( i 1 ) x i , M ( i 1 ) v .
The complementary case ¬ D ISJ { M ( i 1 ) y i } , M ( i 1 ) v requires a much more detailed argument, which we provide below.
Thus, let { M ( i 1 ) y i } M ( i 1 ) v , so that M ( i 1 ) y i M ( i 1 ) v .
By Lemma 13, rk M ( i 1 ) y i , so M ( i 1 ) y i M ( 0 ) v , i.e., D ISJ { M ( i 1 ) y i } , M ( 0 ) v . Therefore, there must exist an index k with 1 k i 1 such that
M ( i 1 ) y i M ( k 1 ) v and M ( i 1 ) y i M ( k ) v .
Consequently,
M ( k ) v = M ( k 1 ) v M ( k 1 ) x k { M ( k 1 ) y k } ,
hence,
M ( k 1 ) y k = M ( i 1 ) y i .
Since by inductive hypothesis (a) at step k the assignment M ( k ) is a BST -replacement of M ( k 1 ) , it must be the case that M ( k 1 ) x k , that is, ¬ D ISJ M ( k 1 ) x k , M ( k 1 ) x k ) . Thus, we have
M ( k ) x k = M ( k 1 ) x k M ( k 1 ) x k { M ( k 1 ) y k } = { M ( k 1 ) y k } = { M ( k ) y k } ,
where the final equality follows by the inductive hypothesis (c).
Moreover, whether k = i 1 or k < i 1 , it follows in both cases that
M ( k ) y k = M ( i 1 ) y k
(the former trivially; the latter by the inductive hypothesis (b1)).
Thus, we have
M ( i 1 ) y i = M ( k 1 ) y k ( by ( A6 ) ) = M ( k ) y k ( by inductive hypothesis ( c ) ) = M ( i 1 ) y k ( by ( A8 ) ) .
Recalling again that M ( i 1 ) Ξ φ ψ , we obtain
M ( i 1 ) y k = y i x k = x i ,
since Ξ φ ψ contains the conjunct y k = y i x k = x i . In combination with (A9) and the equivalences (11) in Remark 5, this yields
M ( i 1 ) x k = M ( i 1 ) x i .
Whether k = i 1 (immediate) or k < i 1 (by the inductive hypothesis (b2)), we obtain
M ( i 1 ) x k = M ( k ) x k .
Thus,
M ( i 1 ) x i = M ( i 1 ) x k ( by ( A10 ) ) = M ( k ) x k ( by ( A11 ) ) = M ( k ) y k ( by ( A7 ) ) = M ( i 1 ) y i ( by ( A18 ) ) .
Hence, { M ( i 1 ) y i } = M ( i 1 ) x i . Therefore, { M ( i 1 ) y i } M ( i 1 ) x i = , and so the disjointness condition holds trivially. This completes the verification of (B) also in the case ¬ D ISJ { M ( i 1 ) y i } , M ( i 1 ) v . Since v was arbitrary, property (B) holds for all variables in Vars ( φ ψ ) . Together with (A), it follows that M ( i ) is a BST -replacement of M ( i 1 ) . Finally, recalling that, by inductive hypothesis (a) at step i 1 , we have M ( i 1 ) φ Ξ φ ψ , Lemma 6 yields M ( i ) φ Ξ φ ψ . Thus, clause (a) is established at step i.
Proof of (b).
Concerning (b1), assume by way of contradiction that M ( j ) y j M ( i ) y j for some j with 1 j < i , and let k be an index such that j < k i and
M ( k 1 ) y j = M ( j ) y j while M ( k ) y j M ( j ) y j .
Such an index k must exist; otherwise, M ( h ) y j = M ( j ) y j for all h with j h i , which would give M ( i ) y j = M ( j ) y j and contradict the assumption. Since M ( k 1 ) y j M ( k ) y j , it follows that M ( k 1 ) x k M ( k 1 ) y j and M ( k ) y j = M ( k 1 ) y j M ( k 1 ) x k M ( k 1 ) y k . Moreover, as M ( k 1 ) x k y j , the inductive hypothesis (a) guarantees that this inclusion already holds in M ( 0 ) . Namely, M ( 0 ) x k M ( 0 ) y j . Finally, by Lemma 10, we also know that M ( 0 ) x k . Thus, k M ( 0 ) j , implying k < j , a contradiction to j < k . Therefore, M ( j ) y j = M ( i ) y j for all 1 j < i .   □
Regarding (b2), assume by way of contradiction that M ( j ) x j M ( i ) x j for some j with 1 j < i . Then, there must exist a k with j < k i and such that
M ( k 1 ) x j = M ( j ) x j while M ( k ) x j M ( j ) x j ,
otherwise M ( h ) x j = M ( j ) x j for all h with j h i , yielding M ( i ) x j = M ( j ) x j and contradicting the assumption.
By inductive hypothesis (d) at step j, we have
M ( j ) x j = { M ( j ) y j } .
Moreover, since j k 1 < i , the inductive hypothesis (b2) gives
M ( k 1 ) x j = M ( j ) x j .
At stage k, M ( k ) is a BST -replacement of M ( k 1 ) (by the inductive hypothesis (a) if k < i , or by clause (a) at step i if k = i ). Thus, the pair M ( k 1 ) x k , M ( k 1 ) y k satisfies the BST -replacement conditions relative to M ( k 1 ) , and combined with (A12) this yields
M ( k 1 ) x k M ( k 1 ) x j .
Since M ( j ) x j is a singleton, if follows from (A14) and (A15) that
M ( k 1 ) x k = M ( k 1 ) x j .
Therefore, M ( k 1 ) x k = x j .
Because M ( 0 ) Ξ φ ψ , the inductive hypothesis (a) ensures M ( k 1 ) Ξ φ ψ . As y k = y j x k = x j is a conjunct of Ξ φ ψ , we deduce
M ( k 1 ) y k = y j x k = x j ,
hence M ( k 1 ) y k = y j . By the equivalences (11) in Remark 5, this gives
M ( k 1 ) y k = M ( k 1 ) y j .
Finally, since j < k i , the inductive hypothesis (b1) entails
M ( j ) y j = M ( k 1 ) y j .
Putting everything together, we obtain
M ( k ) x j = M ( k 1 ) x j M ( k 1 ) x k { M ( k 1 ) y k } ( by (A15) and (A16) ) = { M ( k 1 ) y k } = { M ( j ) y j } ( by (A17) and (A18) ) = M ( j ) x j ( by (A13) ) .
This contradicts (A12), and therefore clause (b2) is established at step i.
Proof of (c).
We need to show that M ( i ) y i = M ( i 1 ) y i . To this end, recall first that M ( 0 ) Ξ φ ψ . As already noted, we have M ( i 1 ) Ξ φ ψ . Lemma 10 then gives D ISJ ( M ( i 1 ) x i , M ( i 1 ) y i ) , and by the definition (7) of M ( i ) y i we conclude that M ( i ) y i = M ( i 1 ) y i . This completes the verification of clause (c) at step i.   □
Proof of (d).
Since, by (a), M ( i ) is a BST -replacement of M ( i 1 ) from M ( i 1 ) x i to { M ( i 1 ) y i } , we must have M ( i 1 ) x i , equivalently ¬ D ISJ ( M ( i 1 ) x i , , M ( i 1 ) x i ) . By the definition of BST -replacement, this yields
M ( i ) x i = M ( i 1 ) x i M ( i 1 ) x i { M ( i 1 ) y i } = { M ( i 1 ) y i } = { M ( i ) y i } ,
where the last equality follows from clause (c). This establishes (d). Since the inductive step is now complete, and the base case was already established, the lemma follows.   □

References

  1. Cantone, D.; Omodeo, E.G. Onset and today’s perspectives of multilevel syllogistic. In From Computational Logic to Computational Biology: Essays Dedicated to Alfredo Ferro to Celebrate His Scientific Career; Cantone, D., Pulvirenti, A., Eds.; Springer Nature: Cham, Switzerland, 2024; Volume 14070 of LNCS, pp. 9–55. [Google Scholar] [CrossRef]
  2. Schwartz, J.T. Instantiation and Decision Procedures for Certain Classes of Quantified Set-Theoretic Formulae; Technical Report 78-10; Institute for Computer Applications in Science and Engineering, NASA Langley Research Center: Hampton, Virginia, 1978. [Google Scholar]
  3. Schwartz, J.T.; Cantone, D.; Omodeo, E.G. Computational Logic and Set Theory—Applying Formalized Logic to Analysis; Springer: Berlin/Heidelberg, Germany, 2011. [Google Scholar]
  4. Cantone, D.; Domenico, A.D.; Maugeri, P.; Omodeo, E.G. Complexity assessments for decidable fragments of set theory. I: A taxonomy for the Boolean case. Fundam. Informaticae 2021, 181, 37–69. [Google Scholar] [CrossRef]
  5. Cantone, D.; Maugeri, P.; Omodeo, E.G. Complexity assessments for decidable fragments of set theory. II: A taxonomy for ‘small’ languages involving membership. Theor. Comput. Sci. 2020, 848, 28–46. [Google Scholar] [CrossRef]
  6. Kuncak, V.; Nguyen, H.H.; Rinard, M. Deciding Boolean algebra with Presburger arithmetic. J. Autom. Reason. 2006, 36, 213–239. [Google Scholar] [CrossRef]
  7. Rabin, M.O. Decidable theories. In Handbook of Mathematical Logic, Volume 90 of Studies in Logic and the Foundations of Mathematics; Barwise, J., Ed.; Elsevier (North-Holland Publishing Co.): Amsterdam, The Netherlands, 1977; pp. 595–629. [Google Scholar]
  8. Cantone, D.; Omodeo, E.G.; Policriti, A. The automation of syllogistic. II: Optimization and complexity issues. J. Autom. Reason. 1990, 6, 173–188. [Google Scholar] [CrossRef]
  9. Cormen, T.H.; Leiserson, C.E.; Rivest, R.L.; Stein, C. Introduction to Algorithms, 3rd ed.; MIT Press: Cambridge, MA, USA, 2009. [Google Scholar]
  10. Angrisani, F.; Ascione, G.; Manzo, G. Orlicz spaces with a O- Type Structure. Ric. Mat. 2019, 68, 841–857. [Google Scholar] [CrossRef]
  11. Jiménez-Garrido, J.; Sanz, J.; Schindl, G. Indices of O-regular variation for weight functions and weight sequences. Rev. Real Acad. Cienc. Exactas Físicas Nat. Ser. A Matemáticas 2018, 113, 3659–3697. [Google Scholar] [CrossRef]
  12. Cantone, D.; Ferro, A. Techniques of computable set theory with applications to proof verification. Commun. Pure Appl. Math. 1995, 48, 901–945. [Google Scholar] [CrossRef]
  13. Parlamento, F.; Policriti, A.; Rao, K.P.S.B. Witnessing Differences without Redundancies. Proc. Am. Math. Soc. 1997, 125, 587–594. [Google Scholar] [CrossRef]
  14. Hooker, J.N. A quantitative approach to logical inference. Decis. Support Syst. 1988, 4, 45–69. [Google Scholar] [CrossRef]
  15. Hooker, J.N.; Fedjiki, C. Branch-and-cut solution of inference problems in propositional logic. Ann. Math. Artif. Intell. 1990, 1, 123–139. [Google Scholar] [CrossRef]
  16. Cantone, D.; Omodeo, E.G.; Policriti, A. Set Theory for Computing. From Decision Procedures to Declarative Programming with Sets; Monographs in Computer Science; Springer: Berlin/Heidelberg, Germany, 2001. [Google Scholar]
  17. Tarski, A. Ordinal Algebras; Studies in Logic and the Foundations of Mathematics; North-Holland: Amsterdam, The Netherlands, 1956. [Google Scholar]
  18. Hibti, M. Décidabilité et Complexité de Systèmes de Contraintes Ensemblistes. PhD Thesis, Université de Franche-Comté, Besançon, France, 1995. [Google Scholar]
  19. Hibti, M.; Lombardi, H.; Legeard, B. Deciding in HFS-theory via linear integer programming. In Logic Programming and Automated Reasoning, Proceedings of the 4th Int’l Conference, LPAR’93, St. Petersburg, Russia, 13–20 July 1993; Voronkov, A., Ed.; Springer: Berlin/Heidelberg, Germany, 1993; Volume 698, pp. 170–181. [Google Scholar]
  20. Hibti, M.; Legeard, B.; Lombardi, H. Une procédure de décision pour un problème de satisfiabilité dans un univers ensembliste héréditairement fini. RAIRO Theor. Inform. Appl. 1997, 31, 205–236. [Google Scholar] [CrossRef]
  21. Cristiá, M.; Rossi, G. Integrating cardinality constraints into constraint logic programming with sets. Theory Pract. Log. Program. 2021, 23, 468–502. [Google Scholar] [CrossRef]
  22. Givan, R.; McAllester, D.A.; Witty, C.; Kozen, D. Tarskian Set Constraints. Inf. Comput. 2002, 174, 105–131. [Google Scholar] [CrossRef]
  23. Raya, R.; Hamza, J.; Kunčak, V. On the Complexity of Convex and Reverse Convex Prequadratic Constraints. In LPAR 2023: Proceedings of the 24th International Conference on Logic for Programming, Artificial Intelligence and Reasoning, Manizales, Colombia, 4–9 June 2023; Piskac, R., Voronkov, A., Eds.; EasyChair: Stockport, UK, 2023; Volume 94, pp. 350–368. [Google Scholar] [CrossRef]
Figure 1. Axioms of the variety of difference algebras.
Figure 1. Axioms of the variety of difference algebras.
Foundations 06 00003 g001
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Cantone, D.; De Domenico, A.; Maugeri, P.; Omodeo, E.G. Complexity Assessments for Decidable Fragments of Set Theory. IV: A Quadratic Reduction from Constraints over Nested Sets to Boolean Formulae. Foundations 2026, 6, 3. https://doi.org/10.3390/foundations6010003

AMA Style

Cantone D, De Domenico A, Maugeri P, Omodeo EG. Complexity Assessments for Decidable Fragments of Set Theory. IV: A Quadratic Reduction from Constraints over Nested Sets to Boolean Formulae. Foundations. 2026; 6(1):3. https://doi.org/10.3390/foundations6010003

Chicago/Turabian Style

Cantone, Domenico, Andrea De Domenico, Pietro Maugeri, and Eugenio G. Omodeo. 2026. "Complexity Assessments for Decidable Fragments of Set Theory. IV: A Quadratic Reduction from Constraints over Nested Sets to Boolean Formulae" Foundations 6, no. 1: 3. https://doi.org/10.3390/foundations6010003

APA Style

Cantone, D., De Domenico, A., Maugeri, P., & Omodeo, E. G. (2026). Complexity Assessments for Decidable Fragments of Set Theory. IV: A Quadratic Reduction from Constraints over Nested Sets to Boolean Formulae. Foundations, 6(1), 3. https://doi.org/10.3390/foundations6010003

Article Metrics

Back to TopTop