1. Introduction
As reported in the survey paper [
1], the line of investigation to which this paper belongs began in 1978 (ref. [
2]), driven by the goal of developing an automated system for proof-checking and program correctness verification. Since then, identifying decidable fragments of set theory has been regarded as a key step toward achieving that goal. As a matter of fact, in the organization of the proof-checker
, which came into existence some thirty years later (ref. [
3]), a satisfiability decision tester for a specific class of unquantified set-theoretic formulae plays a central and pervasive role.
When it comes to implementations, complexity emerges as an inescapable issue; this is why we recently undertook a systematic study on the algorithmic complexities of satisfiability testers (see, e.g., [
4,
5]). In the same frame of mind, this paper enhances a quadratic-cost method (announced ref. [
4] and then presented in D. Cantone, A. De Domenico, P. Maugeri, and E.G. Omodeo; a quadratic reduction of constraints over nested sets to purely Boolean formulae in CNF; in
Proc. 35th Italian Conference on Computational Logic, volume 2710 of
CEUR Workshop Proceedings, pages 214–230. 2020). This translates the formulae of an unquantified language involving set-theoretic variables, Boolean operators, and
membership and equality relators into propositional combinations of purely Boolean literals.
The enhancement lies in the availability of a singleton-formation operator ‘{■}’, which increases the expressive power of the source language without affecting the algorithmic time complexity of the translation process; however, the fact that the translation preserves satisfiability as well as unsatisfiability must be proved anew and differently. The material that follows bridges two complexity taxonomies, developed and analyzed in the respective papers [
4,
5]. These taxonomies, one of which—unlike the other—involves membership, concern fragments of set theory for which satisfiability is decidable.
A fit theoretical framework for the study of our target language is the theory of Boolean rings, a merely equational first-order theory endowed with finitely many axioms—at times, one blends this theory with an arithmetic of cardinals (see, e.g., [
6])—or just a surrogate of it, cf.
Figure 1 in
Section 7. Frameworks for the study of the source language are such all-embracing theories as ZF and NBG (the Zermelo–Fraenkel and von Neumann–Bernays–Gödel theories), within which one can cast the whole corpus of mathematical disciplines. Boolean algebra is decidable in its entirety (cf. Section 3.7 ref. [
7]); ZF is essentially undecidable, yet nonetheless an effort to find practical decision algorithms for fragments of it began in 1979. The rationale of this long-standing research is that satisfiability testers embodying some knowledge about ZF can act as key inference mechanisms within a programmed system apt to verifying the correctness of large-scale mathematical proofs as envisaged ref. [
3].
A priori, one would expect the distance between the performances of decision algorithms for fragments of Boolean algebra, and of the seemingly much more expressive languages whose dictionaries embody nested membership, to be abysmal. Luckily, though, as we will see, this is not the case.
We introduce in
Section 2 an interpreted formal language, dubbed
, within which one can formulate unquantified Boolean constraints. Despite its syntax being quite minimal—
only encompasses conjunctions of primitive literals of two forms, namely,
and
—, the satisfiability problem for
is
-complete (ref. [
4]). By way of abbreviations, a number of additional constraints (e.g., literals of the form
) can be expressed in
.
Alongside , we also consider its natural Boolean closure , defined as the language of all propositional combinations—using ∧, ∨, ⟶, ⟷, and ¬ with arbitrary nesting—of atoms of the form . It is straightforward to verify that the satisfiability problem for can be reduced to that for in nondeterministic polynomial time; therefore, it is -complete in turn.
According to our semantics, the domain of discourse to which
(and
) refers is a universe of nested sets; however, as will be seen in
Section 5, every satisfiable propositional combination of
literals (in particular, every satisfiable
constraint) admits a model consisting of sets which are, in a certain sense, ‘flat’. This makes it clear that there is no straightforward way of expressing membership atoms
in
. Yet, many natural set-theoretic constructors (most notably the singleton operator {■}) inherently interact with membership, and understanding whether and how such constructs can be eliminated while preserving satisfiability and complexity is the main motivation of this paper.
To address this research gap,
Section 3 introduces a complexity-aware notion of expressibility, which requires not only a semantic reduction but also an explicit bound on the cost of the translation. While related notions already appear ref. [
4], the present formulation is tailored to our setting and is a valuable technical tool that enables the complexity-preservation arguments developed here.
In terms of this notion,
Section 6 provides an explicit translation of conjunctions of literals of the three forms
,
, and
into propositional combinations of
literals, i.e., into
. As shown in
Section 6.3 and
Section 6.4, the translation preserves satisfiability. It yields a conjunction whose conjuncts are either
literals or simple disjunctions. Moreover, the translation can be computed in quadratic time (see
Section 6.1), which entails that satisfiability remains
-complete when the singleton operator {■} is added to the constructs of
. This
-completeness result was already known (see, e.g., [
8]), but our approach derives it anew via an explicit, complexity-controlled elimination procedure.
The concluding section,
Section 7, recapitulates what has been accomplished, indicates what we intend to pursue next, and situates our line of investigation in relation to that of other scholars.
2. The Theories and
Boolean set theory (
) is the quantifier-free theory consisting of all finite conjunctions of literals of the forms
where
x,
y, and
z are
set variables, ranging over the universe of well-founded sets.
Remark 1. Throughout, we use the term theory in a nonstandard sense, referring to a collection of set-theoretic formulae—possibly involving, along with primitive symbols, some derived constructs—rather than a deductively closed collection of sentences.
Semantics for the theory
is defined in terms of set assignments. Specifically, given a (finite) collection
V of set variables, a
set assignment M over
V—the
domain of definition of
M, denoted by
—is any map from
V into the
von Neumann universe (see below). (Note that our semantics of
does not rely on flat sets of urelements (as would be doable). Working with those would call for minor adjustments, unjustified—and perhaps disturbing—in the economy of this paper.) The
support of
M, denoted by
, is the union of the sets assigned by
M to the variables in its domain, namely,
We shall refer to the cardinality of the support of
M as the
cardinality of the assignment M. A set assignment
M is said to
satisfy, or to
model, a given literal
, with
, if
holds, where ‘∖’ denotes standard set difference. Likewise,
M is said to satisfy the literal
if
holds. Finally,
M satisfies a
-conjunction
such that
(where
denotes the collection of the variables occurring free in
) if it satisfies all of the conjuncts of
, in which case we say that
M is a
model of
and write
(a short for
). A
-conjunction
is said to be
satisfiable if it has some model; otherwise, it is said to be
unsatisfiable. It is said to be
valid if it is true in every model, in which case we write
(again, a short for
). Two
-conjunctions
and
are said to be
equisatisfiable if either both are satisfiable or both are unsatisfiable—that is, if
has a model if and only if
has a model. Equisatisfiability does not require the two formulae to have the same models, but only that they agree in satisfiability, irrespective of their models. The notions introduced in this paragraph, here tailored to the case of
-conjunctions, extend naturally to all theories discussed in the sequel.
In ref. [
4], it is proved that the satisfiability problem for
, namely, the problem of establishing algorithmically the satisfiability status of any given
-conjunction, is
-complete.
Recall that only allows for conjunctions of its primitive literals—in particular, a -formula is a finite conjunction of constraints of the form and . In contrast, is the Boolean closure of : it consists of all propositional combinations of atoms of the form , obtained by unrestrained use of the logical connectives ∧, ∨, ⟶, ⟷, and ¬. Thus, strictly extends by allowing for arbitrary Boolean structure (e.g., disjunctions, implications, and negations at any depth), whereas is restricted to conjunctive constraints.
The satisfiability problem for reduces to that for in nondeterministic polynomial time (by guessing a satisfying assignment for the Boolean structure and checking the resulting conjunction), and hence the satisfiability problem for is -complete as well.
The von Neumann Universe
We recall that the von Neumann universe of (well-founded) sets, also dubbed von Neumann cumulative hierarchy, is built up through a transfinite sequence of steps as the union
of the levels , with denoting the powerset operator and ranging over the class of all ordinals.
It can easily be seen that, for every ordinal
(and, in particular, for every integer), we have
and consequently
so that
.
Based on the level of first appearance in the von Neumann hierarchy, one can define the rank of any set s, denoted . Specifically, is the ordinal such that . Hence, for every , the set , hereinafter denoted , collects all sets having rank .
The following lower bound on the number of well-founded sets of any positive integer rank
n, to be proved as Proposition A2 in
Appendix A, will be useful:
A particularly important subclass of
is that of
hereditarily finite sets, denoted by
. By definition,
that is,
consists of all sets of finite rank.
Some handy properties of the rank function that we shall tacitly use are the following, which hold for all sets :
If , then ;
If , then ;
We also recall that well-foundedness, as enforced by the
regularity (or
foundation) axiom of set theory, precludes the formation of infinite descending membership chains of the form
and, in particular, membership cycles of the form
for any sets
.
3. From Existential Expressibility to -Expressibility Across Theories
We first recall the definition of
existential expressibility (cf. [
4] for several applications of this notion).
Definition 1 (Existential expressibility).
A formula is said to be existentially expressible
in a theory if there exists a -formula such thatwhere and stand for tuples of set variables. In spite of the parsimony of
as just presented, it turns out (see [
4]) that several other Boolean constructs, such as the ones in the list of literals
can be expressed existentially in
, where
is a short for
.
Ref. [
4], existential expressibility was generalized into
-expressibility (we use standard asymptotic notations
,
, and
throughout the paper; see, e.g., Ch. 3 of [
9]), a notion that helped develop a fine-grained complexity taxonomy of the subfragments of
.
Definition 2 (
-expressibility).
Let be a theory and let be a complexity function. A formula —typically involving a construct one aims to eliminate—is said to be -expressible in
if there exists a transformationfrom to , where no variable in occurs in or , such that, for every φ, the following conditions hold:- (a)
The transformation (
5)
can be computed in -time; - (b)
If is satisfiable, then is satisfiable;
- (c)
.
Here, we further generalize
-expressibility in two directions. First, we allow for a
collection of set-theoretic formulae, rather than a single formula
as in [
4]. Second, we explicitly distinguish a
source theory
from a
target theory
, whereas [
4] works in the single-theory setting, with source and target taken to coincide.
Definition 3 (
-expressibility across theories).
Let and be any theories and be a given complexity function. A collection of formulae is said to be
-expressible from
into
if there exists a mapfrom into , where no variable in occurs in either or , such that the following conditions are satisfied:- (a)
The mapping (
6)
can be computed in time; - (b)
If is satisfiable, so is ;
- (c)
.
Observe that the two formulae appearing in condition (b) of Definition 3—namely, and —are in fact equisatisfiable. This is formalized in the following lemma.
Lemma 1. Let and be theories, and let be a collection of formulae that is -expressible from into via a given mapping , as defined in Definition 3. Then, for every and every , the formulaeare equisatisfiable. Proof. Let and . We prove that the two conjunctions and are equisatisfiable.
(⇒) Suppose is satisfiable. Then, by condition (b) of Definition 3, it follows that is satisfiable.
(⇐) Conversely, suppose is satisfiable, and let M be a model of this formula. Then, by condition (c) of Definition 3, we have
This means that there exists an extension
of
M to the variables in
—which, by hypothesis, occur neither in
nor in
—such that
; hence,
.
From the satisfiability of one formula, we have derived that of the other; hence, the two formulae are equisatisfiable. □
While Lemma 1 shows that the two formulae and are equisatisfiable, Definition 3 does not adopt equisatisfiability as a primitive requirement. Instead, it explicitly requires the two separate conditions (b) and (c), which together entail equisatisfiability but provide strictly more structure. This asymmetry is intentional: clause (b) ensures that satisfiability of the translated formula readily implies satisfiability of the original, supporting the soundness of the translation; clause (c) provides a constructive existential guarantee—namely, that any model of can be extended to a model of , thus ensuring a form of witness-preserving completeness.
To analyze complexity preservation under -expressibility, we encounter expressions like and, more generally, . It is therefore useful to assume a mild robustness of g under constant scaling, formalized below.
Definition 4 (Scale-invariant function).
A function is said to be scale-invariant (up to constants)
if, for every real constant , there exist constants and such that Equivalently, for every fixed . Remark 2. Definition 4 is closely related to classical doubling-
type assumptions. For instance, in the theory of Orlicz spaces, the -condition for a Young function Φ
requires for all (ref. [10]). Similarly, if g is (eventually) nondecreasing and satisfies a doubling-type bound, namely, there exist constants and such that then the estimate extends to any fixed scaling factor by iteration. Indeed, choose with . For all , we have Finally, our assumption can be viewed as a discrete, one-sided
variant of
-regular variation.
In the Karamata–Matuszewska framework, a function f is called
-regularly varying
if, for every , the ratio stays asymptotically bounded away from both 0 and (via lim inf
and lim sup
) (cf. [11]). In contrast, we only require the upper control , which is sufficient for the complexity-preservation steps where constant-factor rescalings arise. Example 1. Let and take . Then,and hence . More generally, the scale-invariance property holds for all standard polynomially bounded complexity functions, such as and , as well as for poly-logarithmic functions. More generally, it holds for functions of at most polynomial growth, and it also follows from a weak subadditivity condition: there exist constants and such thatfor all . In contrast, is not scale-invariant: for we have , and the ratio is unbounded, so .
The next lemma provides the quantitative part of the reduction: under -expressibility, it transfers the time bound for to . The assumptions that g is nondecreasing and scale-invariant control the rescaling of input sizes, while the at least linear growth of g ensures that the preprocessing cost is absorbed in the final bound.
Lemma 2. Let and be two theories (in the sense of Remark 1) and suppose that their conjunctive product
admits a decision procedure running in time , for some nondecreasing, scale-invariant function of at least linear growth, i.e., . Suppose moreover that the collection of formulae is -expressible from into , for some complexity function .
Then, the combined theory is decidable in time .
Proof. Let
and
, and set
. Since
is
-expressible from
into
, we can compute, in time
, a formula
satisfying conditions (a)–(c) of Definition 3. Moreover, by condition (a) (and the fact that the output of an
-time transformation has size
), we have
. Hence, there exist a constant
and a threshold
such that, for every input pair
with
, we have
where
as stipulated above.
By Lemma 1, the formulae
and
are equisatisfiable. Therefore, to decide satisfiability of
, it suffices to decide satisfiability of
, which belongs to
. By hypothesis, this can be done in time
. Using that
g is nondecreasing and scale-invariant, we obtain
Finally, the overall running time is the sum of the preprocessing time
needed to compute
and the decision time for
; hence,
Since
g has at least linear growth, there exist constants
and
such that
for all
. For all sufficiently large
n, we have
, and therefore
whence
. Consequently, the total running time is
.
Since this procedure works uniformly for every input , the claim follows. □
4. -Replacements and Flat Models
In this section, we develop two central notions that will be used in the rest of the paper. The first is that of -replacement, which captures how set assignments can be modified in a controlled way while preserving the satisfaction of -formulae.
The second is the notion of a ♭-flat model, namely, a set assignment in which every element occurring in the interpretation of a variable has rank exactly ♭. This ‘single-layer support’ property makes it possible to reason cleanly about disjointness and membership when dealing with singleton atoms under replacement.
4.1. Replacement Assignments and -Replacements
In preparation for the results that follow, we need a method to modify a set assignment for a formula without disrupting satisfiability. Such a method is implemented by means of -replacements.
Definition 5 (Replacement assignments and
-replacements).
Let M be a set assignment over a collection V of variables, let , and let be nonempty
sets.
The replacement (assignment) restricted to
W of
M from
S to
T (
or with respect to the pair
),
denoted ,
is the set assignment defined for each by The assignment is a
-replacement restricted to W(or a -replacement) of M from S to T if in addition the following condition holds: When , we omit W from the notation and write , calling it simply a replacement
of M from S to T. If condition (8) also holds, it is a
-replacement
of M from S to T.Note that the requirement that S and T are nonempty is essential only for T: if , then for any T, so the replacement has no effect and is of no interest.
Example 2 (
-replacement).
Let and define the set assignment M by Note that .Since with and , the restricted replacement acts on x and y. On the other hand, as , z is left unchanged:We verify the -condition (8) for all :For and ,andHence, the condition holds for every . Therefore, Remark 3 (Restricted setting convention). All results in this subsection are stated in the unrestricted case (), except where explicitly noted otherwise (cf. Lemma 6), but they extend plainly to the restricted setting with .
To study more closely how -replacements affect a set assignment, it is useful to introduce the partition induced by a set assignment, which decomposes its support into blocks determined by membership patterns of elements with respect to the assignment’s variable interpretations.
Definition 6. Given a set assignment M over a collection V of variables, the partition induced by Mis defined as This notion provides a precise way to track the effect of a replacement, as illustrated in the subsequent lemma.
Lemma 3. Let Φ be a formula of
, M a set assignment for it, and T a nonempty set such that holds for all . Then, for all
- (a)
is a
-replacement of M;
- (b)
.
Proof. Let and set .
(a) By the definition of
, for every
either
or
. Together with the assumption that
holds for all
, this is precisely condition (
8). Hence,
is a
-replacement of
M.
(b) Each block of
collects the elements that share the same membership pattern with respect to the variables of
. The replacement
removes
wherever it occurs and inserts
T instead. Since
T is disjoint from every
, no other block is modified, and the pattern corresponding to
is now realized by
T. Therefore,
We note that the hypothesis on
T in Lemma 3 (namely, its disjointness from every
) is stronger than what is needed in general. The next result shows that a replacement can always be reversed under the sole assumption that the
-replacement conditions (
8) are satisfied.
Lemma 4. Let M be a set assignment over a collection V of variables, and let be nonempty sets such that is a -replacement of M from S to T. Then, is a -replacement of M from T to S, and Proof. Set
. We first check that the
-conditions (
8) hold for the pair
relative to
and then show that
.
Fix . From the -conditions for relative to M, we know that either or , and in all cases .
Case 1: . Then,
. Furthermore,
since
and
. Hence,
. Also,
holds because
and
. Thus, the
-conditions for
hold at
x, and the reverse update does nothing:
Case 2: . Then, , so . Moreover, is disjoint from and from T; hence, . Therefore, the -conditions for hold at x, and
In both cases, the -conditions hold relative to , and . Since x was arbitrary, we conclude that is a -replacement of from T to S, and indeed . □
The next lemma shows that satisfiability of a -formula is preserved under -replacements: if a formula holds in a given assignment, it continues to hold after any valid -replacement.
Lemma 5. For every set assignment M, every formula Φ
of , and every -replacement of M, we have Proof. Let
be any pair of nonempty sets fulfilling the
-replacement conditions (
8) with respect to
M. As
-formulae are Boolean combinations of atoms of the form
, it is enough to show that for every such atom (with variables in the domain of
M) we have
Suppose , that is, .
We analyze separately the two cases and .
Case . Then,
, and since
implies
, we have
using (
8) to ensure
. Hence,
.
Case . By (
8), we have
, and hence
. Using the definition of
, we compute
where we used and to conclude .
For the converse implication, assume
. By Lemma 4,
is a
-replacement of
and moreover
; hence, by the first part,
Therefore, for every atomic formula of the form
,
It follows that the logical equivalence holds for all
-formulae
. This proves the lemma. □
For later use, we also state a restricted variant, in which replacements are carried out only with respect to a chosen subset of variables.
Lemma 6. Let V be a collection of variables and . Let M be a set assignment over V, and let be nonempty sets such thatis a -replacement of M restricted to W. If Φ
is a -formula such that for every literal ℓ in Φ
either or holds, then Proof. As -formulae are Boolean combinations of atoms , it suffices to check atoms. Fix an atom .
If , then none of its variables is modified by the replacement, so iff .
If , then satisfaction of depends only on the restrictions and . By Lemma 5, iff ; hence, iff .
The claim follows by propositional logic. □
With the same restriction on variables—namely, every literal lies wholly inside W or wholly outside it—Lemma 6 yields invariance of satisfiability under any finite sequence of -replacements restricted to W.
Corollary 1. Let V be a collection of variables and let . Let () be set assignments over V such that, for each ,is a -replacement of restricted to
W, for some nonempty . Let Φ
be a
-formula such that for every
literal ℓ occurring in Φ
we have either or . Then, for all , 4.2. ♭-Flat Models
Definition 7. For every ordinal , a set assignment M over a collection V of variables is said to be ♭-flat if all sets in the realm of M have rank ♭.
No membership atom is satisfied by any ♭-flat set assignment:
Lemma 7. Let M be a -flat set assignment over a collection V of variables. Then, for any .
Proof. By the ♭-flatness of M, for all , either (when ), or (when ). In either case, , since ♭-flatness presupposes . Thus, for all . □
A satisfiable formula of always admits a ♭-flat model, for sufficiently large ♭. This is proved in the next lemma.
Lemma 8. Let Φ be a satisfiable -formula with n distinct variables. Then, for every ordinal , Φ admits a ♭-flat set model of cardinality at most .
Proof. Let be satisfiable, M a model of , and its induced partition. Fix an ordinal , where .
We associate to each block a distinct set of rank ♭, as follows:
For every block that already contains elements of rank ♭, choose one such element as .
For the remaining blocks, choose among elements of rank ♭ (possibly belonging to other blocks), ensuring that the chosen are all distinct and different from those fixed in step 1.
This is possible because the number of blocks is at most exponential in
, while the collection
of sets of rank exactly ♭ is large enough to supply that many distinct elements. Indeed, since each block of
is nonempty and determined by a membership pattern over the
n variables, it follows that
. On the other hand, the functions
and
are strictly increasing for
, so from (
2) and (
3) it follows that, for every integer
,
Therefore, whether ♭ is finite or infinite,
contains at least
elements of rank ♭, enough to assign distinct representatives to all the blocks of
.
The argument that follows requires that the distinct blocks
of
be ordered so that all blocks containing elements of rank ♭ come first. Based on this ordering, we recursively construct a sequence of assignments
as follows:
For the blocks considered in step 1, we have
; hence,
is trivially a
-replacement of
from
to
. For the blocks in step 2, the choice of
guarantees that
is disjoint from every block not yet replaced, so the
-conditions also hold.
Therefore, each is a -replacement of , and since is a model of , induction and Lemma 5 yield that every is as well. In particular, is a model of with support , consisting solely of sets of rank ♭. Therefore, provides the desired ♭-flat model of , with cardinality at most . □
Remark 4. Building on an argument developed in [12] (see also [13]), one can show that every satisfiable formula Φ
of admits a model of cardinality less than the number of its distinct variables. Accordingly, in the proof of Lemma 8, the initial model M may be taken with support of size at most , where . The remainder of the argument then produces a ♭-flat model of cardinality at most . This observation leads to the following sharper result.
Corollary 2. Let Φ be a satisfiable -formula with n distinct variables. Then, for every ordinal , Φ admits a ♭-flat set model of cardinality at most . □
5. Existential Inexpressibility of in
We investigate whether atoms of the form can be existentially expressed in . If this were possible, then membership atoms would also be expressible: indeed, the presence of ‘{■}’ allows one to derive ‘∈’ as a definable construct.
Lemma 9. if z is distinct from the variables .
Proof. If holds, then , since .
Conversely, if holds, extend M by putting ; then, yields , so that . □
In what follows we show that membership atoms are not existentially expressible in . Therefore, by Lemma 9, atoms of the form are not existentially expressible either. Specifically, we prove that every satisfiable formula of admits a flat model M, namely, a model whose support consists solely of elements all having the same positive rank. As a consequence, for any , and thus .
We are now ready to prove that membership atoms —and hence, by Lemma 9, atoms of the form —are not existentially expressible in .
Theorem 1. The atom is not existentially expressible in .
Proof. By way of contradiction, assume that
is existentially expressible by a formula
of
involving only atoms of the form
, i.e.,
Since
is trivially satisfiable, so are—by (
9)—
and
. Thus, by Lemma 8,
is modeled by a ♭-flat set assignment
; hence, by Lemma 7,
holds. It follows that
holds, and therefore
, which contradicts (
9). □
6. -Expressibility in of Singleton-Atom Conjunctions
In accordance with Definition 3, we shall prove that any conjunction
of atoms of the form
is
-expressible from
into
by exhibiting a map
computable in quadratic time, such that conditions (a)–(c) of Definition 3 hold. Here,
ranges over
-conjunctions, and the variables in
are distinct from those in
and in
.
Thus, let
and
be of the said form. For each variable
(note that
), introduce a fresh auxiliary variable
, chosen distinct from all others. These variables will be used to mimic
via the relation
. Also, let
be an additional fresh auxiliary variable, to be interpreted as the empty set. Then, we put
(thus, the list
of variables in Definition 3 consists of the collection
of all auxiliary set variables
together with
).
Remark 5. In the formula , the following shorthands are used:Here, is the auxiliary variable intended to denote the empty set. Indeed, for every model M of , the conjunct ensures , from which the following equivalences follow: For instance, the second equivalence can be justified as follows: These shorthands are introduced purely for readability: their syntactic form mirrors the underlying semantics, making formulae such as easier to parse at a glance without repeatedly unfolding their full definitions.
By the preceding remark, is a formula of ; indeed, it is just a conjunction of a rather simple form. As a preliminary step, we record a basic semantic property that will be used repeatedly.
Lemma 10. For every model M of , if occurs in ψ, then Proof. Let
, and let
be a literal in
. Since
includes the conjuncts
the equivalences (
11) in Remark 5 imply
From this, it follows that
and
, as claimed. □
6.1. Design and Analysis of the Translation Algorithm
We first present an explicit procedure that constructs from the input conjunction ; see Algorithm 1. We then analyze its running time, thereby establishing point (a) of Definition 3.
We use the auxiliary program variables
and
in the pseudocode below to store, respectively, the list of variables encountered in
and the list of generated conjuncts.
| Algorithm 1 Construction of |
- 1:
Initialize as an empty list of set variables; - 2:
Initialize as an empty list of conjuncts; - 3:
for each set variable x that appears in do - 4:
add x to ; - 5:
for each conjunct in do - 6:
add x and y to ; - 7:
add to ; - 8:
for each conjunct in do - 9:
for each do - 10:
add to ; - 11:
for each pair , of distinct conjuncts in do - 12:
add to ; - 13:
for all do - 14:
add to ; - 15:
add to ; - 16:
return .
|
The running time of Algorithm 1 is bounded as follows.
Lemma 11. The formula can be constructed from φ and ψ in time .
Proof. In order to prove the lemma, it is convenient to spell out in Algorithm 1 the procedure that constructs from the conjunction . We maintain the program variable as a list of variables and as a list of conjuncts, so that appending a variable or a conjunct takes constant time. Moreover, each conjunct appended to has constant size.
The for-loop at lines 3–4 can be performed in time, where denotes the total length of the conjunction . Similarly, the for-loop at lines 5–7 can be performed in time.
Letting m be the length of the list after the execution of the for-loops 3–4 and 5–7, we have .
Upper bound (running time). The only nontrivial operations performed by Algorithm 1 are iterating through the for-loops and appending variables/conjuncts to the lists and . By assumption, each append takes constant time, and each conjunct appended to has constant size. Hence, the running time is asymptotically proportional to the total number of loop iterations.
As noted, lines 5–7 perform iterations. Lines 8–10 perform iterations (for each conjunct in , the inner loop ranges over all v in ). Lines 11–12 perform iterations, since they range over all pairs of distinct conjuncts in . Finally, lines 13–14 perform iterations, since they range over all pairs with x and y in , whereas line 15 clearly takes constant time.
Therefore, the overall running time is
Since
and
, we conclude that Algorithm 1 runs in time
as claimed. □
Before proving the semantic conditions (b) and (c) of Definition 3 for the translation map (
10), we briefly illustrate
on two concrete examples. We then return to the main line of the argument and show that
If is satisfiable, then so is ;
Every model of can be extended into a model of .
As above, this is where the list of variables consists of the collection of all auxiliary set variables together with . This will prove that conditions (b) and (c) of Definition 3 hold and hence that every singleton-atom conjunction is -expressible from into .
6.2. Illustrating the Translation on Examples
We start with an unsatisfiable input, showing how preserves unsatisfiability, and then consider a satisfiable instance illustrating the model extension mechanism used for condition (c).
Example 3. The conjunction , where and , is not satisfiable; in fact, any set assignment M satisfying this formula is such that , , and ; thus, , and hence , a contradiction.
Our translation comprisesAny model M for is such that (since ), , and hold, so that ; but then must hold, which conflicts with . The next example complements the previous one: it is satisfiable and highlights the extension of a model of
to the auxiliary variables so as to satisfy
(cf.
Section 6.4).
As we will prove in
Section 6.4, satisfiability carries over from
to
. Here is an interesting example of this:
Example 4. Consider the conjunction , whereThis formula is satisfiable: for instance, it is satisfied by every set assignment M over of the formwhere s, , and are any well-founded sets. It can easily be checked that, for every set assignment M of the form (12), the extension of M over the auxiliary variables , for and wheresatisfies , so that holds. Thus, to show that condition (c) of Definition 3 is satisfied, namely, that holds, it is enough to check that the conjunction is satisfied by set assignments of the form (12) only. Let then be any model for . Since and , either or holds. The latter case can be readily ruled out, for in view of it would follow , which is untenable in the realm of well-founded sets. Thus, must hold. Letting —so that —, since and , we have and , for some sets and , and therefore has the form (12). 6.3. Satisfiability Preservation
We first establish that if is satisfiable, then so is , thereby fulfilling condition (b) of Definition 3.
Let
M be a model of
. In order to convert it into a model of
, we can assume that
M is ♭-flat over
for some integer
Indeed, on the one hand, Lemma 8 enables us to do so; on the other hand, Lemma 7 tells us that such an
M does not model any of the atoms
in
.
Set , and let m be the number of singleton conjuncts in . For , we select an atom in and lift to a model of such that every conjunct of already satisfied in remains satisfied in , and the selected conjunct is also satisfied in . At the end of this iterative process, all conjuncts of will be satisfied. More precisely, if denotes the conjunction of the atoms selected up to step i, then coincides with (possibly reordered and with repetitions).
In our set up, each must hence comprise a conjunct from not appearing in any with . The selection of determines the transformation from to and is made as follows: among the conjuncts of that are not already present in , select one that is minimal with respect to the ordering on the conjuncts of , defined below:
Definition 8. The relation is the minimal transitively closed relation over the conjuncts of ψ such that, for all singleton atoms and in ψ, one has This definition enforces the following:
Lemma 12. The relation is a strict partial order.
Proof. By definition is transitive, so it remains only to prove that it is acyclic.
Suppose, by way of contradiction, that
contains some cycle. Then, there are atoms
of
with
where each
is of the form
for
. By the definition of
, we have
Since
, we know in particular that
and so we obtain the strict chain
which yields the contradiction
.
Therefore, has no cycles and is thus acyclic. Together with its transitivity by definition, this shows that is a strict partial order, as claimed. □
From the previous lemma, we may arrange the literals in
as
where each
has the form
, in such a way that the sequence complies with the strict partial order
; that is, for all
, we have
.
Such an ordering will guide the iterative process: it is not arbitrary but rather ensures that the satisfaction of atoms established in earlier steps is preserved in all subsequent steps.
The iterative process proceeds as follows: at each step
i, an atom
is selected from
, as specified above, so that
is minimal, in
with all atoms selected at previous steps removed, with respect to
; that is, for all
,
Then, letting
we define
where
That is,
is obtained from
by replacing
with
while restricting to the variables in
.
Remark 6. Throughout the remainder of this subsection, we may occasionally omit to mention the explicit restriction to and, for instance, write in place of .
The following lemma establishes a useful dichotomy on the possible ranks of the values assigned to variables during the replacement process.
Lemma 13. For each , we have Proof. Preliminarily, it is immediate to check that for every
we have
We proceed by induction on i.
Base case (). Before any replacement we are at , which by construction is ♭-flat; hence, every member of every has rank ♭. Thus, is either 0 or , so the stated dichotomy holds.
Inductive step. Fix
and assume the statement holds for all indices
. By iterating (
16) we obtain
Let
be arbitrary. Then,
, so by (
17) either
- (a)
, in which case (since is ♭-flat);
- (b)
for some . By the inductive hypothesis at index t, either , or .
If case (a) occurs for some member
s of
, then
. If case (b) occurs with
for some member
s, again
. Otherwise, all members of
(if any) have rank at most
, whence
Finally, by our choice of ♭ we have , so in particular . This yields exactly the stated dichotomy for i.
This completes the proof. □
Our goal is to prove that the final assignment is a model of . To this end, we first establish a stability lemma, from which it follows that after step i the sets assigned to the variables in the atom remain unchanged in all later assignments , for .
Lemma 14. For every , letting be the atom of ψ selected at step i, we have the following:
- (a)
is a -replacement of restricted to such that ;
- (b)
for each j with ,
- (b1)
;
- (b2)
;
- (c)
;
- (d)
.
Proof Sketch. The proof proceeds by induction on
i, relying on the preservation properties of
-replacements and the constraints encoded in
. The detailed argument, which involves a careful case analysis for clauses (a)–(d), is rather technical. For readability, we defer the full proof to
Appendix B. □
The replacement construction of Lemma 14 immediately entails a monotonicity condition on the cardinalities of the assigned sets, stated next.
Corollary 3. For every and every , we have Consequently, for every , we have Proof. Fix
and
. If
, the claim is immediate. Otherwise, since
is obtained from
by a
-replacement (with respect to the pair
), the only way
can change is when
, in which case
Plainly, ; hence, .
Applying this inequality successively for
yields
as required. □
We are now ready to prove that
satisfies
. By Lemma 14(a), we have
Moreover, Lemma 14(c) guarantees that, for every
i with
,
Thus, to establish
, it suffices to prove that, for all
i,
If
, then (
19) immediately gives (
20). If
, clauses (b
1) and (b
2) of Lemma 14 yield
and
, respectively. Combining these with (
19), we again obtain (
20). Hence,
. From this and (
18), it follows that
, thus completing the proof of satisfiability preservation.
6.4. Model Extension
We next prove that every model of can be extended into a model of , which corresponds to condition (c) of Definition 3.
Let
be satisfiable, and let
M be a model of it. Define ≺ as the minimal transitive relation on
such that
By set well-foundedness, ≺ is irreflexive; together with transitivity, this makes ≺ a strict ordering on .
Extend
M to the auxiliary variables
, for each
, by setting for each of them:
Moreover, extend
M to the auxiliary variable
by stipulating
so that in particular
We prove that holds.
Preliminarily, note that
is a singleton for every singleton atom
in
. It then follows directly that
Moreover, for each singleton atom
in
, the acyclicity of the membership relation yields
, and so
. Therefore,
In addition, for each pair of singleton atoms
and
in
, we plainly have
whence
Now, consider any implication of the form
with
in
and
, and assume that
. Since
, this yields
; hence,
. Thus,
. By the transitivity and strictness of ≺, we obtain
namely,
.
Hence, by the arbitrariness of
in
and
, it follows that
Finally, consider the implications
with
. Assume
. Suppose
. By definition of ≺, there exists a finite chain of variables
such that
Hence,
which yields
. By a symmetric argument,
follows from
. Therefore,
and thus
. Hence,
Collecting (
21)–(
25), and the above argument, we see that all conjuncts of
are satisfied by
M. Therefore,
which establishes the model extension property.
We have so extended a generic
M such that
into a model of
; therefore, we get
where the list
consists of the collection
of all auxiliary set variables
together with
.
Finally, in view of Definition 3, combining the satisfiability preservation established in
Section 6.3 with the model extension property of
Section 6.4, and relying on Lemma 11, we obtain our main expressibility result:
Theorem 2. Singleton-atom conjunctions are -expressible from into .
7. Conclusions: Related and Planned Work
The main contributions of this paper are as follows:
The introduction of the notion of -expressibility across theories, refining the existing notion of existential expressibility, which, while useful, is too coarse for our purposes.
The proof that atoms of the form are not existentially expressible in .
The proof that, by contrast, any conjunction of such atoms is -expressible from into , using a construction we call the nested-to-flat translation.
As noted in the Introduction, the authors’ interest in satisfiability mechanisms for restricted fragments of set theory arises primarily from the design and experimental use of the proof verifier.
As discussed ref. [
3],
is the core inference mechanism in
, where it often operates implicitly alongside other methods. It is based on an enhanced form of multilevel syllogistic (ref. [
1]), a decision procedure for checking the satisfiability of certain unquantified set-theoretic formulae. This procedure allows
to establish that a statement follows from a given proof context by showing that its negation yields an unsatisfiable conjunction with earlier statements.
When parts of a proof involve constructs that fall outside the scope of ’s built-in syllogistic, a preprocessing step replaces them with fresh variables, ensuring uniform treatment of identical structures. However, proof steps are rarely uniquely determined by prior lines and hints alone. This is because often generates a range of easy consequences from a given context, leaving the user free to choose which of these consequences to assert as the next proof step.
Since inference mechanisms akin to are likely to play a central role in proof technology, it is important to develop translation methods of low algorithmic cost—such as the nested-to-flat translation discussed above—that make their integration possible.
7.1. Envisaged Enhancements to the Nested-to-Flat Translation
We expect that the satisfiability-preserving translation treated in
Section 6 can be tuned to theories richer that
, such as the following. Let
denote the collection of conjunctions of literals of the forms
and let
be the Boolean closure of atoms of the forms
Then, it seems plausible that conjunctions of atoms of the form
together with their negations—where
expresses that the set assigned to
x is finite and
expresses that the set assigned to
x is denumerable (i.e., finite or countably infinite)—are likewise
-expressible from
to
. Exploring this broader scenario is part of our ongoing research agenda.
7.2. Towards Integrating Set-Theoretic, Boolean, and Numerical Constraint Reasoning
Besides its independent interest, the nested-to-flat translation discussed so far can be seen as a preparatory step toward the quantitative approach to logical inference (cf. [
14,
15]), as specialized to the field of computable set theory (cf. Ch. 11 ref. [
16]).
Our translation, in fact, lays the groundwork for reducing satisfiability problems about sets to satisfiability problems about nonnegative integers—or, if we move from the theory of hereditarily finite sets to a more general setting, to the language of the additive theory of cardinals (which is decidable; ref. [
17]).
One such reduction, where the source language embodies a cardinality operator
indicating how many elements belong to the set
x, is presented in Sec. 11.1 ref. [
16]. Let us briefly outline how it proceeds. We are given a conjunction
of literals of the forms
Suppose we want to test
for satisfiability over the hereditarily finite sets. Here,
stand for set variables while
stand for numeric variables drawn from a disjoint infinite set of symbols and are intended to range over the nonnegative integers. Let
be the collection of all set variables occurring in
; let, moreover,
be the collection of all nonempty subsets
Q of
such that the assignment
satisfies all equalities of the form
occurring in
. Associate two numeric variables
with each
Q in
. We have an algorithm that, given
, constructs a system
of
purely arithmetic constraints such that
is satisfiable over sets and natural numbers if and only if
has a solution over the natural numbers. Those literals of the forms
and
that were in
from the outset are retained in
; in addition to them,
encompasses conditions specifying the intended meanings of
relative to a (hypothetical) model
of
, namely,
The reduction just outlined can certainly be refined beyond the treatment ref. [
16], as we will strive to do in the future. We also expect that it can be boosted with the treatment of explicit rank-related constructs, such as the comparison relator
.
Some reductions of the set-satisfiability problem to integer programming can be found ref. [
18], whose line of research aimed at integrating linear programming problems and set constraint manipulation methods in a single logic programming language, as explained refs. [
19,
20] (the endeavor of integrating cardinality constraints into constraint logic programming with sets has been carried out with a different approach, as reported ref. [
21]). A technique for reducing the problem of multilevel syllogistic (cf. [
3]) to propositional consistency testing was described ref. [
18] (an account of it can also be found in Sec. 11.3 ref. [
16]).
In Sec. 4.5 ref. [
22], the authors show that satisfiability of nonrecursive Tarskian set constraints is decidable in nondeterministic double-exponential time by reducing the problem to a class of Diophantine constraints called
prequadratic. They prove that satisfiability of prequadratic Diophantine constraints is decidable in nondeterministic exponential time and conjecture that it is in NP. If this conjecture holds, satisfiability of nonrecursive Tarskian constraints would be decidable in nondeterministic single-exponential time.
Along a related line of research, ref. [
23] reduces quantifier-free constraints on sets involving cardinalities along with direct and inverse images of functions on sets, to systems of numerical constraints in linear integer arithmetic or of the form
, where
d is a positive integer.
7.3. Difference Algebras
We take this opportunity to mention an issue concerning an alternative semantics for
, which, although only loosely related to the central theme of this paper, is nevertheless of independent interest. In his bachelor’s thesis
Unificazione semantica in strutture booleane (‘Semantic unification in Boolean structures’), defended at the University of Trieste in 2020, Mattia Furlan isolated the valid formulae involving Boolean difference that are displayed in
Figure 1.
Let us adopt the universal closures of these formulae as the axioms of a theory in quantificational first-order logic with equality. These axioms characterize an algebraic variety, whose instances we provisionally call difference algebras.
A natural open question is whether every difference algebra
is isomorphic to an algebra of the form
, in which the operator ‘∖’ is interpreted as ordinary set-theoretic difference. In this case,
must be a family of sets closed under difference, and hence under intersection, since
holds for all sets
.
One might hope to settle this question by appealing to Stone’s celebrated representation theorem, which states that every Boolean algebra is isomorphic to a field of sets. However, we see no direct way to apply this theorem, since there exist difference algebras
whose carrier
is not closed under symmetric difference, viewed as an operation
satisfying, for all
in
,
Moreover, it is unclear how to embed an arbitrary difference algebra into one that forms a proper Boolean ring by virtue of satisfying this closure property.