Complexity Assessments for Decidable Fragments of Set Theory. IV: A Quadratic Reduction from Constraints over Nested Sets to Boolean Formulae

Cantone, Domenico; De Domenico, Andrea; Maugeri, Pietro; Omodeo, Eugenio G.

doi:10.3390/foundations6010003

Open AccessArticle

Complexity Assessments for Decidable Fragments of Set Theory. IV: A Quadratic Reduction from Constraints over Nested Sets to Boolean Formulae

¹

Department of Mathematics and Computer Science, University of Catania, 95125 Catania, Italy

²

IMDEA Software Institute, 28223 Madrid, Spain; andrea.domenico@imdea.org

³

Department of Mathematics, Informatics, and Geosciences, University of Trieste, 34127 Trieste, Italy; eomodeo@units.it

^*

Authors to whom correspondence should be addressed.

Foundations 2026, 6(1), 3; https://doi.org/10.3390/foundations6010003

Submission received: 23 September 2025 / Revised: 8 January 2026 / Accepted: 16 January 2026 / Published: 30 January 2026

(This article belongs to the Section Mathematical Sciences)

Download

Browse Figure

Versions Notes

Abstract

As a contribution to automated set-theoretic inferencing, a translation is proposed of conjunctions of literals of the forms

x = y ∖ z

,

x \neq y ∖ z

, and

z = \{x\}

, where

x, y, z

stand for variables ranging over the von Neumann universe of sets, into quantifier-free Boolean formulae of a rather simple conjunctive normal form. The formulae in the target language involve variables ranging over a Boolean ring of sets, along with a difference operator and relators designating equality, non-disjointness, and inclusion. Moreover, the result of each translation is a conjunction of literals of the forms

x = y ∖ z

and

x \neq y ∖ z

and of implications whose antecedents are isolated literals and whose consequents are either inclusions (strict or non-strict) between variables, or equalities between variables. Besides reflecting a simple and natural semantics, which ensures satisfiability preservation, the proposed translation has quadratic algorithmic time complexity and bridges two languages, both of which are known to have an

NP

-complete satisfiability problem.

Keywords:

satisfiability problem; computable set theory; expressibility; proof verification; NP-completeness; quantitative logical inference

1. Introduction

As reported in the survey paper [1], the line of investigation to which this paper belongs began in 1978 (ref. [2]), driven by the goal of developing an automated system for proof-checking and program correctness verification. Since then, identifying decidable fragments of set theory has been regarded as a key step toward achieving that goal. As a matter of fact, in the organization of the proof-checker

Referee/ÆtnaNova

, which came into existence some thirty years later (ref. [3]), a satisfiability decision tester for a specific class of unquantified set-theoretic formulae plays a central and pervasive role.

When it comes to implementations, complexity emerges as an inescapable issue; this is why we recently undertook a systematic study on the algorithmic complexities of satisfiability testers (see, e.g., [4,5]). In the same frame of mind, this paper enhances a quadratic-cost method (announced ref. [4] and then presented in D. Cantone, A. De Domenico, P. Maugeri, and E.G. Omodeo; a quadratic reduction of constraints over nested sets to purely Boolean formulae in CNF; in Proc. 35th Italian Conference on Computational Logic, volume 2710 of CEUR Workshop Proceedings, pages 214–230. 2020). This translates the formulae of an unquantified language involving set-theoretic variables, Boolean operators, and membership and equality relators into propositional combinations of purely Boolean literals.

The enhancement lies in the availability of a singleton-formation operator ‘{■}’, which increases the expressive power of the source language without affecting the algorithmic time complexity of the translation process; however, the fact that the translation preserves satisfiability as well as unsatisfiability must be proved anew and differently. The material that follows bridges two complexity taxonomies, developed and analyzed in the respective papers [4,5]. These taxonomies, one of which—unlike the other—involves membership, concern fragments of set theory for which satisfiability is decidable.

A fit theoretical framework for the study of our target language is the theory of Boolean rings, a merely equational first-order theory endowed with finitely many axioms—at times, one blends this theory with an arithmetic of cardinals (see, e.g., [6])—or just a surrogate of it, cf. Figure 1 in Section 7. Frameworks for the study of the source language are such all-embracing theories as ZF and NBG (the Zermelo–Fraenkel and von Neumann–Bernays–Gödel theories), within which one can cast the whole corpus of mathematical disciplines. Boolean algebra is decidable in its entirety (cf. Section 3.7 ref. [7]); ZF is essentially undecidable, yet nonetheless an effort to find practical decision algorithms for fragments of it began in 1979. The rationale of this long-standing research is that satisfiability testers embodying some knowledge about ZF can act as key inference mechanisms within a programmed system apt to verifying the correctness of large-scale mathematical proofs as envisaged ref. [3].

A priori, one would expect the distance between the performances of decision algorithms for fragments of Boolean algebra, and of the seemingly much more expressive languages whose dictionaries embody nested membership, to be abysmal. Luckily, though, as we will see, this is not the case.

We introduce in Section 2 an interpreted formal language, dubbed

BST

, within which one can formulate unquantified Boolean constraints. Despite its syntax being quite minimal—

BST

only encompasses conjunctions of primitive literals of two forms, namely,

x = y ∖ z

and

x \neq y ∖ z

—, the satisfiability problem for

BST

is

NP

-complete (ref. [4]). By way of abbreviations, a number of additional constraints (e.g., literals of the form

x \neq y \cup z

) can be expressed in

BST

.

Alongside

BST

, we also consider its natural Boolean closure

{BST}^{+}

, defined as the language of all propositional combinations—using ∧, ∨, ⟶, ⟷, and ¬ with arbitrary nesting—of atoms of the form

x = y ∖ z

. It is straightforward to verify that the satisfiability problem for

{BST}^{+}

can be reduced to that for

BST

in nondeterministic polynomial time; therefore, it is

NP

-complete in turn.

According to our semantics, the domain of discourse to which

BST

(and

{BST}^{+}

) refers is a universe of nested sets; however, as will be seen in Section 5, every satisfiable propositional combination of

BST

literals (in particular, every satisfiable

BST

constraint) admits a model consisting of sets which are, in a certain sense, ‘flat’. This makes it clear that there is no straightforward way of expressing membership atoms

x \in y

in

BST

. Yet, many natural set-theoretic constructors (most notably the singleton operator {■}) inherently interact with membership, and understanding whether and how such constructs can be eliminated while preserving satisfiability and complexity is the main motivation of this paper.

To address this research gap, Section 3 introduces a complexity-aware notion of expressibility, which requires not only a semantic reduction but also an explicit bound on the cost of the translation. While related notions already appear ref. [4], the present formulation is tailored to our setting and is a valuable technical tool that enables the complexity-preservation arguments developed here.

In terms of this notion, Section 6 provides an explicit translation of conjunctions of literals of the three forms

x = y ∖ z

,

x \neq y ∖ z

, and

z = \{x\}

into propositional combinations of

BST

literals, i.e., into

{BST}^{+}

. As shown in Section 6.3 and Section 6.4, the translation preserves satisfiability. It yields a conjunction whose conjuncts are either

BST

literals or simple disjunctions. Moreover, the translation can be computed in quadratic time (see Section 6.1), which entails that satisfiability remains

NP

-complete when the singleton operator {■} is added to the constructs of

BST

. This

NP

-completeness result was already known (see, e.g., [8]), but our approach derives it anew via an explicit, complexity-controlled elimination procedure.

The concluding section, Section 7, recapitulates what has been accomplished, indicates what we intend to pursue next, and situates our line of investigation in relation to that of other scholars.

2. The Theories $BST$ and ${BST}^{+}$

Boolean set theory (

BST

) is the quantifier-free theory consisting of all finite conjunctions of literals of the forms

x = y ∖ z, x \neq y ∖ z,

(1)

where x, y, and z are set variables, ranging over the universe of well-founded sets.

Remark 1.

Throughout, we use the term theory in a nonstandard sense, referring to a collection of set-theoretic formulae—possibly involving, along with primitive symbols, some derived constructs—rather than a deductively closed collection of sentences.

Semantics for the theory

BST

is defined in terms of set assignments. Specifically, given a (finite) collection V of set variables, a set assignment M over V—the domain of definition of M, denoted by

dom (M)

—is any map from V into the von Neumann universe

V

(see below). (Note that our semantics of

BST

does not rely on flat sets of urelements (as would be doable). Working with those would call for minor adjustments, unjustified—and perhaps disturbing—in the economy of this paper.) The support of M, denoted by

supp (M)

, is the union of the sets assigned by M to the variables in its domain, namely,

supp (M) := ⋃_{x \in dom (M)} M x .

We shall refer to the cardinality of the support of M as the cardinality of the assignment M. A set assignment M is said to satisfy, or to model, a given literal

x = y ∖ z

, with

x, y, z \in dom (M)

, if

M x = M y ∖ M z

holds, where ‘∖’ denotes standard set difference. Likewise, M is said to satisfy the literal

x \neq y ∖ z

if

M x \neq M y ∖ M z

holds. Finally, M satisfies a

BST

-conjunction

φ

such that

Vars (φ) \subseteq dom (M)

(where

Vars (φ)

denotes the collection of the variables occurring free in

φ

) if it satisfies all of the conjuncts of

φ

, in which case we say that M is a model of

φ

and write

M ⊧ φ

(a short for

M ⊧_{V} φ

). A

BST

-conjunction

φ

is said to be satisfiable if it has some model; otherwise, it is said to be unsatisfiable. It is said to be valid if it is true in every model, in which case we write

⊧ φ

(again, a short for

⊧_{V} φ

). Two

BST

-conjunctions

φ

and

ψ

are said to be equisatisfiable if either both are satisfiable or both are unsatisfiable—that is, if

φ

has a model if and only if

ψ

has a model. Equisatisfiability does not require the two formulae to have the same models, but only that they agree in satisfiability, irrespective of their models. The notions introduced in this paragraph, here tailored to the case of

BST

-conjunctions, extend naturally to all theories discussed in the sequel.

In ref. [4], it is proved that the satisfiability problem for

BST

, namely, the problem of establishing algorithmically the satisfiability status of any given

BST

-conjunction, is

NP

-complete.

The extension ${BST}^{+}$ .

Recall that

BST

only allows for conjunctions of its primitive literals—in particular, a

BST

-formula is a finite conjunction of constraints of the form

x = y ∖ z

and

x \neq y ∖ z

. In contrast,

{BST}^{+}

is the Boolean closure of

BST

: it consists of all propositional combinations of atoms of the form

x = y ∖ z

, obtained by unrestrained use of the logical connectives ∧, ∨, ⟶, ⟷, and ¬. Thus,

{BST}^{+}

strictly extends

BST

by allowing for arbitrary Boolean structure (e.g., disjunctions, implications, and negations at any depth), whereas

BST

is restricted to conjunctive constraints.

The satisfiability problem for

{BST}^{+}

reduces to that for

BST

in nondeterministic polynomial time (by guessing a satisfying assignment for the Boolean structure and checking the resulting conjunction), and hence the satisfiability problem for

{BST}^{+}

is

NP

-complete as well.

The von Neumann Universe

We recall that the von Neumann universe

V

of (well-founded) sets, also dubbed von Neumann cumulative hierarchy, is built up through a transfinite sequence of steps as the union

V : = \cup_{α \in On} V_{α}

of the levels

V_{α} : = \cup_{β < α} P (V_{β})

, with

P (■)

denoting the powerset operator and

α

ranging over the class

On

of all ordinals.

It can easily be seen that, for every ordinal

α

(and, in particular, for every integer), we have

V_{α + 1} = P (V_{α}),

and consequently

|V_{α + 1}| = 2^{|V_{α}|},

(2)

so that

|V_{α}| < |V_{α + 1}|

.

Based on the level of first appearance in the von Neumann hierarchy, one can define the rank of any set s, denoted

rk (s)

. Specifically,

rk (s)

is the ordinal

α

such that

s \in V_{α + 1} ∖ V_{α}

. Hence, for every

α \in On

, the set

V_{α + 1} ∖ V_{α}

, hereinafter denoted

W_{α}^{#}

, collects all sets having rank

α

.

The following lower bound on the number of well-founded sets of any positive integer rank n, to be proved as Proposition A2 in Appendix A, will be useful:

|W_{n}^{#}| ⩾ 2^{n - 1} .

(3)

A particularly important subclass of

V

is that of hereditarily finite sets, denoted by

HF

. By definition,

HF : = ⋃_{n \in N} V_{n},

that is,

HF

consists of all sets of finite rank.

Some handy properties of the rank function that we shall tacitly use are the following, which hold for all sets

s, t \in V

:

If $s \in t$ , then $rk (s) < rk (t)$ ;
If $s \subseteq t$ , then $rk (s) ⩽ rk (t)$ ;
$rk (s) = \{\begin{matrix} 0 & if s = ⌀, \\ {sup}_{u \in s} (rk (u) + 1) & (i.e., \cup_{u \in s} (rk (u) + 1)) otherwise . \end{matrix}$

We also recall that well-foundedness, as enforced by the regularity (or foundation) axiom of set theory, precludes the formation of infinite descending membership chains of the form

\dots \in s_{2} \in s_{1} \in s_{0},

and, in particular, membership cycles of the form

s_{0} \in s_{n} \in \dots \in s_{2} \in s_{1} \in s_{0},

for any sets

s_{i}

.

3. From Existential Expressibility to $O (f)$ -Expressibility Across Theories

We first recall the definition of existential expressibility (cf. [4] for several applications of this notion).

Definition 1

(Existential expressibility). A formula

ψ (x)

is said to be existentially expressible in a theory

T

if there exists a

T

-formula

Ψ (x, z)

such that

⊧ Ψ (x) ⟷ (\exists z) Ψ (x, z),

where

x

and

z

stand for tuples of set variables.

In spite of the parsimony of

BST

as just presented, it turns out (see [4]) that several other Boolean constructs, such as the ones in the list of literals

\begin{matrix} x = ⌀, x \subseteq y, x = y \cap z, x = y \cup z, D ISJ (x, y), x ⊊ y, \\ x \neq ⌀, x ⊈ y, x \neq y \cap z, x \neq y \cup z, \neg D ISJ (x, y), \end{matrix}

(4)

can be expressed existentially in

BST

, where

D ISJ (a, b)

is a short for

a \cap b = ⌀

.

Ref. [4], existential expressibility was generalized into

O (f)

-expressibility (we use standard asymptotic notations

O (\cdot)

,

Ω (\cdot)

, and

Θ (\cdot)

throughout the paper; see, e.g., Ch. 3 of [9]), a notion that helped develop a fine-grained complexity taxonomy of the subfragments of

BST

.

Definition 2

(

O (f)

-expressibility). Let

T

be a theory and let

f : N \to N

be a complexity function. A formula

ψ (x)

—typically involving a construct one aims to eliminate—is said to be

O (f)

-expressible in

T

if there exists a transformation

φ (y) \mapsto Ψ_{φ} (x, y, z)

(5)

from

T

to

T

, where no variable in

z

occurs in

x

or

y

, such that, for every φ, the following conditions hold:

(a): The transformation (5) can be computed in $O (f (|φ|))$ -time;
(b): If $φ (y) \land (\exists z) Ψ_{φ} (x, y, z)$ is satisfiable, then $φ (y) \land ψ (x)$ is satisfiable;
(c): $⊧ (φ (y) \land ψ (x)) ⟶ (\exists z) Ψ_{φ} (x, y, z)$ .

Here, we further generalize

O (f)

-expressibility in two directions. First, we allow for a collection

C

of set-theoretic formulae, rather than a single formula

ψ

as in [4]. Second, we explicitly distinguish a source theory

T_{1}

from a target theory

T_{2}

, whereas [4] works in the single-theory setting, with source and target taken to coincide.

Definition 3

(

O (f)

-expressibility across theories). Let

T_{1}

and

T_{2}

be any theories and

f : N \to N

be a given complexity function. A collection

C

of formulae is said to be

O (f)

-expressible from

T_{1}

into

T_{2}

if there exists a map

〈φ (y), ψ (x)〉 ⟼ Ξ_{φ}^{ψ} (x, y, z)

(6)

from

T_{1} \times C

into

T_{2}

, where no variable in

z

occurs in either

x

or

y

, such that the following conditions are satisfied:

(a): The mapping (6) can be computed in $O (f (|φ \land ψ|))$ time;
(b): If $φ (y) \land Ξ_{φ}^{ψ} (x, y, z)$ is satisfiable, so is $φ (y) \land ψ (x)$ ;
(c): $⊧ (φ (y) \land ψ (x)) ⟶ (\exists z) Ξ_{φ}^{ψ} (x, y, z)$ .

Observe that the two formulae appearing in condition (b) of Definition 3—namely,

φ (y) \land Ξ_{φ}^{ψ} (x, y, z)

and

φ (y) \land ψ (x)

—are in fact equisatisfiable. This is formalized in the following lemma.

Lemma 1.

Let

T_{1}

and

T_{2}

be theories, and let

C

be a collection of formulae that is

O (f)

-expressible from

T_{1}

into

T_{2}

via a given mapping

〈φ (y), ψ (x)〉 ⟼ Ξ_{φ}^{ψ} (x, y, z)

, as defined in Definition 3. Then, for every

φ (y) \in T_{1}

and every

ψ (x) \in C

, the formulae

φ (y) \land Ξ_{φ}^{ψ} (x, y, z) and φ (y) \land ψ (x)

are equisatisfiable.

Proof.

Let

φ (y) \in T_{1}

and

ψ (x) \in C

. We prove that the two conjunctions

φ (y) \land ψ (x)

and

φ (y) \land Ξ_{φ}^{ψ} (x, y, z)

are equisatisfiable.

(⇒) Suppose $φ (y) \land Ξ_{φ}^{ψ} (x, y, z)$ is satisfiable. Then, by condition (b) of Definition 3, it follows that $φ (y) \land ψ (x)$ is satisfiable.
(⇐) Conversely, suppose $φ (y) \land ψ (x)$ is satisfiable, and let M be a model of this formula. Then, by condition (c) of Definition 3, we have

M ⊧ φ (y) \land (\exists z) Ξ_{φ}^{ψ} (x, y, z) .

This means that there exists an extension

M^{★}

of M to the variables in

z

—which, by hypothesis, occur neither in

x

nor in

y

—such that

M^{★} ⊧ Ξ_{φ}^{ψ} (x, y, z)

; hence,

M^{★} ⊧ φ (y) \land Ξ_{φ}^{ψ} (x, y, z)

.

From the satisfiability of one formula, we have derived that of the other; hence, the two formulae are equisatisfiable. □

While Lemma 1 shows that the two formulae

φ (y) \land ψ (x)

and

φ (y) \land Ξ_{φ}^{ψ} (x, y, z)

are equisatisfiable, Definition 3 does not adopt equisatisfiability as a primitive requirement. Instead, it explicitly requires the two separate conditions (b) and (c), which together entail equisatisfiability but provide strictly more structure. This asymmetry is intentional: clause (b) ensures that satisfiability of the translated formula readily implies satisfiability of the original, supporting the soundness of the translation; clause (c) provides a constructive existential guarantee—namely, that any model of

φ \land ψ

can be extended to a model of

φ \land Ξ_{φ}^{ψ}

, thus ensuring a form of witness-preserving completeness.

To analyze complexity preservation under

O (f)

-expressibility, we encounter expressions like

g (n + f (n))

and, more generally,

g (O (f (n)))

. It is therefore useful to assume a mild robustness of g under constant scaling, formalized below.

Definition 4

(Scale-invariant function). A function

g : N \to N

is said to be scale-invariant (up to constants) if, for every real constant

c > 0

, there exist constants

K_{c} > 0

and

N_{c} \in N

such that

g (⌈ c n ⌉) ⩽ K_{c} g (n), for all n ⩾ N_{c} .

Equivalently,

g (⌈ c n ⌉) = O (g (n))

for every fixed

c > 0

.

Remark 2.

Definition 4 is closely related to classical doubling-type assumptions. For instance, in the theory of Orlicz spaces, the

Δ_{2}

-condition for a Young function Φ requires

Φ (2 t) ⩽ C Φ (t)

for all

t ⩾ 0

(ref. [10]). Similarly, if g is (eventually) nondecreasing and satisfies a doubling-type bound, namely, there exist constants

K > 0

and

N \in N

such that

g (2 n) ⩽ K g (n) for all n ⩾ N,

then the estimate extends to any fixed scaling factor

c > 1

by iteration. Indeed, choose

k \in N

with

c ⩽ 2^{k}

. For all

n ⩾ 2^{k - 1} N

, we have

g (⌈ c n ⌉) ⩽ g (2^{k} n) ⩽ K^{k} g (n) .

Finally, our assumption can be viewed as a discrete, one-sided variant of

O

-regular variation. In the Karamata–Matuszewska framework, a function f is called

O

-regularly varying if, for every

λ ⩾ 1

, the ratio

f (λ x) / f (x)

stays asymptotically bounded away from both 0 and

+ \infty

(via lim inf and lim sup) (cf. [11]). In contrast, we only require the upper control

g (⌈ c n ⌉) = O (g (n))

, which is sufficient for the complexity-preservation steps where constant-factor rescalings arise.

Example 1.

Let

g (n) = n^{2}

and take

c > 0

. Then,

g (⌈ c n ⌉) = {⌈ c n ⌉}^{2} ⩽ {⌈ c ⌉}^{2} n^{2} = {⌈ c ⌉}^{2} g (n),

and hence

g (⌈ c n ⌉) = O (g (n))

.

More generally, the scale-invariance property holds for all standard polynomially bounded complexity functions, such as

n^{k}

and

n log n

, as well as for poly-logarithmic functions. More generally, it holds for functions of at most polynomial growth, and it also follows from a weak subadditivity condition: there exist constants

C_{1} ⩾ 1

and

C_{2} ⩾ 0

such that

g (m + n) ⩽ C_{1} (g (m) + g (n)) + C_{2},

for all

m, n \in N

.

In contrast,

g (n) = 2^{n}

is not scale-invariant: for

c = 2

we have

g (2 n) = 2^{2 n} = {(2^{n})}^{2}

, and the ratio

g (2 n) / g (n) = 2^{n}

is unbounded, so

g (2 n) \notin O (g (n))

.

The next lemma provides the quantitative part of the reduction: under

O (f)

-expressibility, it transfers the time bound for

T_{1} \land T_{2}

to

T_{1} \land C

. The assumptions that g is nondecreasing and scale-invariant control the rescaling of input sizes, while the at least linear growth of g ensures that the preprocessing cost

O (f (n))

is absorbed in the final bound.

Lemma 2.

Let

T_{1}

and

T_{2}

be two theories (in the sense of Remark 1) and suppose that their conjunctive product

T_{1} \land T_{2} : = {φ_{1} \land φ_{2} : φ_{1} \in T_{1}, φ_{2} \in T_{2}}

admits a decision procedure running in time

O (g (n))

, for some nondecreasing, scale-invariant function

g : N \to N

of at least linear growth, i.e.,

g (n) = Ω (n)

.

Suppose moreover that the collection

C

of formulae is

O (f)

-expressible from

T_{1}

into

T_{2}

, for some complexity function

f : N \to N

.

Then, the combined theory

T_{1} \land C

is decidable in time

O (g (n + f (n)))

.

Proof.

Let

φ (y) \in T_{1}

and

ψ (x) \in C

, and set

n : = | φ \land ψ |

. Since

C

is

O (f)

-expressible from

T_{1}

into

T_{2}

, we can compute, in time

O (f (n))

, a formula

Ξ_{φ}^{ψ} (x, y, z) \in T_{2}

satisfying conditions (a)–(c) of Definition 3. Moreover, by condition (a) (and the fact that the output of an

O (f (n))

-time transformation has size

O (f (n))

), we have

| Ξ_{φ}^{ψ} | = O (f (n))

. Hence, there exist a constant

c > 0

and a threshold

N_{0} \in N

such that, for every input pair

(φ, ψ) \in T_{1} \times C

with

| φ \land ψ | ⩾ N_{0}

, we have

| φ \land Ξ_{φ}^{ψ} | ⩽ c (n + f (n)) and thus | φ \land Ξ_{φ}^{ψ} | ⩽ ⌈ c (n + f (n)) ⌉,

where

n : = | φ \land ψ |

as stipulated above.

By Lemma 1, the formulae

φ \land ψ

and

φ \land Ξ_{φ}^{ψ}

are equisatisfiable. Therefore, to decide satisfiability of

φ \land ψ

, it suffices to decide satisfiability of

φ \land Ξ_{φ}^{ψ}

, which belongs to

T_{1} \land T_{2}

. By hypothesis, this can be done in time

O (g (| φ \land Ξ_{φ}^{ψ} |))

. Using that g is nondecreasing and scale-invariant, we obtain

g (| φ \land Ξ_{φ}^{ψ} |) ⩽ g (⌈ c (n + f (n)) ⌉) = O (g (n + f (n))) .

Finally, the overall running time is the sum of the preprocessing time

O (f (n))

needed to compute

Ξ_{φ}^{ψ}

and the decision time for

φ \land Ξ_{φ}^{ψ}

; hence,

O (f (n)) + O (g (n + f (n))) .

Since g has at least linear growth, there exist constants

a > 0

and

N_{1} \in N

such that

g (t) ⩾ a t

for all

t ⩾ N_{1}

. For all sufficiently large n, we have

n + f (n) ⩾ N_{1}

, and therefore

g (n + f (n)) ⩾ a (n + f (n)) ⩾ a f (n),

whence

O (f (n)) \subseteq O (g (n + f (n)))

. Consequently, the total running time is

O (g (n + f (n)))

.

Since this procedure works uniformly for every input

φ \land ψ \in T_{1} \land C

, the claim follows. □

4. $BST$ -Replacements and Flat Models

In this section, we develop two central notions that will be used in the rest of the paper. The first is that of

BST

-replacement, which captures how set assignments can be modified in a controlled way while preserving the satisfaction of

{BST}^{+}

-formulae.

The second is the notion of a ♭-flat model, namely, a set assignment in which every element occurring in the interpretation of a variable has rank exactly ♭. This ‘single-layer support’ property makes it possible to reason cleanly about disjointness and membership when dealing with singleton atoms under replacement.

4.1. Replacement Assignments and $BST$ -Replacements

In preparation for the results that follow, we need a method to modify a set assignment for a

BST

formula without disrupting satisfiability. Such a method is implemented by means of

BST

-replacements.

Definition 5

(Replacement assignments and

BST

-replacements). Let M be a set assignment over a collection V of variables, let

W \subseteq V

, and let

S, T

be nonempty sets. The replacement (assignment) restricted to W of M from S to T (or with respect to the pair

(S, T)

), denoted

{(M_{| W})}_{T}^{S}

, is the set assignment defined for each

x \in V

by

{(M_{| W})}_{T}^{S} x : = \{\begin{matrix} (M x ∖ S) \cup T & if x \in W \land \neg D ISJ (S, M x) \\ M x & otherwise . \end{matrix}

(7)

The assignment

{(M_{| W})}_{T}^{S}

is a

BST

-replacement restricted to W(or a

{BST}_{W}

-replacement) of M from S to T if in addition the following condition holds:

(\forall x \in W) ((D ISJ (S, M x) \lor S \subseteq M x) \land D ISJ (T ∖ S, M x)) .

(8)

When

W = V

, we omit W from the notation and write

M_{T}^{S}

, calling it simply a replacement of M from S to T. If condition (8) also holds, it is a

BST

-replacement of M from S to T.

Note that the requirement that S and T are nonempty is essential only for T: if

S = ⌀

, then

{(M_{| W})}_{T}^{S} = M

for any T, so the replacement has no effect and is of no interest.

Example 2

(

{BST}_{W}

-replacement). Let

V = {x, y, z}, W = {x, y}, S = {2, 3}, T = {3, 7},

and define the set assignment M by

M x = {1, 2, 3}, M y = {2, 3, 4}, M z = {5} .

Note that

S, T \neq ⌀

.

Since

x, y \in W

with

\neg D ISJ (S, M x)

and

\neg D ISJ (S, M y)

, the restricted replacement

{(M_{| W})}_{T}^{S}

acts on x and y. On the other hand, as

z \notin W

, z is left unchanged:

\begin{matrix} {(M_{| W})}_{T}^{S} (x) & = (M x ∖ S) \cup T = ({1, 2, 3} ∖ {2, 3}) \cup {3, 7} = {1} \cup {3, 7} = {1, 3, 7}, \\ {(M_{| W})}_{T}^{S} (y) & = (M y ∖ S) \cup T = ({2, 3, 4} ∖ {2, 3}) \cup {3, 7} = {4} \cup {3, 7} = {3, 4, 7}, \\ {(M_{| W})}_{T}^{S} (z) & = M z = {5} . \end{matrix}

We verify the

BST

-condition (8) for all

u \in W

:

(D ISJ (S, M u) \lor S \subseteq M u) \land D ISJ (T ∖ S, M u) .

For

u = x

and

u = y

,

S = {2, 3} \subseteq M x = {1, 2, 3}, S = {2, 3} \subseteq M y = {2, 3, 4},

and

T ∖ S = {7} is disjoint from M x = {1, 2, 3} and M y = {2, 3, 4} .

Hence, the condition holds for every

u \in W

. Therefore,

{(M_{| W})}_{T}^{S} is a {BST}_{W} - replacement of M from S to T .

Remark 3

(Restricted setting convention). All results in this subsection are stated in the unrestricted case (

W = V

), except where explicitly noted otherwise (cf. Lemma 6), but they extend plainly to the restricted setting

{(M_{| W})}_{T}^{S}

with

W \subseteq V

.

To study more closely how

BST

-replacements affect a set assignment, it is useful to introduce the partition induced by a set assignment, which decomposes its support into blocks determined by membership patterns of elements with respect to the assignment’s variable interpretations.

Definition 6.

Given a set assignment M over a collection V of variables, the partition induced by Mis defined as

Σ_{M} : = {⋂_{x \in U} M x ∖ ⋃_{y \in V ∖ U} M y | ⌀ \neq U \subseteq V} ∖ {⌀} .

This notion provides a precise way to track the effect of a replacement, as illustrated in the subsequent lemma.

Lemma 3.

Let Φ be a formula of

{BST}^{+}

, M a set assignment for it, and T a nonempty set such that

D ISJ (T, M v)

holds for all

v \in Vars (Φ)

. Then, for all

σ \in Σ_{M}

(a): $M_{T}^{σ}$ is a $BST$ -replacement of M;
(b): $Σ_{M_{T}^{σ}} = (Σ_{M} ∖ {σ}) \cup {T}$ .

Proof.

Let

σ \in Σ_{M}

and set

M^{'} : = M_{T}^{σ}

.

(a) By the definition of

Σ_{M}

, for every

x \in Vars (Φ)

either

σ \subseteq M x

or

D ISJ (σ, M x)

. Together with the assumption that

D ISJ (T, M v)

holds for all

v \in Vars (Φ)

, this is precisely condition (8). Hence,

M_{T}^{σ}

is a

BST

-replacement of M.

(b) Each block of

Σ_{M}

collects the elements that share the same membership pattern with respect to the variables of

Φ

. The replacement

M_{T}^{σ}

removes

σ

wherever it occurs and inserts T instead. Since T is disjoint from every

M v

, no other block is modified, and the pattern corresponding to

σ

is now realized by T. Therefore,

Σ_{M^{'}} = (Σ_{M} ∖ {σ}) \cup {T} . □

We note that the hypothesis on T in Lemma 3 (namely, its disjointness from every

M v

) is stronger than what is needed in general. The next result shows that a replacement can always be reversed under the sole assumption that the

BST

-replacement conditions (8) are satisfied.

Lemma 4.

Let M be a set assignment over a collection V of variables, and let

S, T

be nonempty sets such that

M_{T}^{S}

is a

BST

-replacement of M from S to T. Then,

{(M_{T}^{S})}_{S}^{T}

is a

BST

-replacement of M from T to S, and

{(M_{T}^{S})}_{S}^{T} = M .

Proof.

Set

M^{'} : = M_{T}^{S}

. We first check that the

BST

-conditions (8) hold for the pair

(T, S)

relative to

M^{'}

and then show that

{(M^{'})}_{S}^{T} = M

.

Fix

x \in V

. From the

BST

-conditions for

(S, T)

relative to M, we know that either

D ISJ (S, M x)

or

S \subseteq M x

, and in all cases

D ISJ (T ∖ S, M x)

.

Case 1: $D ISJ (S, M x)$ . Then, $M^{'} x = M x$ . Furthermore,

$T \cap M x = ((T ∖ S) \cup (T \cap S)) \cap M x = ((T ∖ S) \cap M x) \cup ((T \cap S) \cap M x) = ⌀,$

since

D ISJ (T ∖ S, M x)

and

D ISJ (S, M x)

. Hence,

D ISJ (T, M^{'} x)

. Also,

D ISJ (S ∖ T, M^{'} x)

holds because

S ∖ T \subseteq S

and

D ISJ (S, M x)

. Thus, the

BST

-conditions for

(T, S)

hold at x, and the reverse update does nothing:

{(M^{'})}_{S}^{T} x = M^{'} x = M x .

Case 2: $S \subseteq M x$ . Then, $M^{'} x = (M x ∖ S) \cup T$ , so $T \subseteq M^{'} x$ . Moreover, $S ∖ T$ is disjoint from $M x ∖ S$ and from T; hence, $D ISJ (S ∖ T, M^{'} x)$ . Therefore, the $BST$ -conditions for $(T, S)$ hold at x, and

{(M^{'})}_{S}^{T} x = (M^{'} x ∖ T) \cup S = (((M x ∖ S) \cup T) ∖ T) \cup S = (M x ∖ S) \cup S = M x .

In both cases, the

(T, S)

-conditions hold relative to

M^{'}

, and

{(M^{'})}_{S}^{T} x = M x

. Since x was arbitrary, we conclude that

{(M^{'})}_{S}^{T}

is a

BST

-replacement of

M^{'}

from T to S, and indeed

{(M^{'})}_{S}^{T} = M

. □

The next lemma shows that satisfiability of a

{BST}^{+}

-formula is preserved under

BST

-replacements: if a formula holds in a given assignment, it continues to hold after any valid

BST

-replacement.

Lemma 5.

For every set assignment M, every formula Φ of

{BST}^{+}

, and every

BST

-replacement

M^{'}

of M, we have

M ⊧ Φ ⟺ M^{'} ⊧ Φ .

Proof.

Let

(S, T)

be any pair of nonempty sets fulfilling the

BST

-replacement conditions (8) with respect to M. As

{BST}^{+}

-formulae are Boolean combinations of atoms of the form

x = y ∖ z

, it is enough to show that for every such atom (with variables in the domain of M) we have

M ⊧ x = y ∖ z ⟺ M_{T}^{S} ⊧ x = y ∖ z .

Suppose

M ⊧ x = y ∖ z

, that is,

M x = M y ∖ M z

.

We analyze separately the two cases

D ISJ (S, M x)

and

\neg D ISJ (S, M x)

.

Case $D ISJ (S, M x)$ . Then, $M x = M_{T}^{S} x$ , and since $S \subseteq M y$ implies $S \subseteq M z$ , we have

$M_{T}^{S} x = M x = M y ∖ M z = (M y ∖ S) ∖ (M z ∖ S) = ((M y ∖ S) \cup T) ∖ ((M z ∖ S) \cup T),$

using (8) to ensure

D ISJ (T ∖ S, M z)

. Hence,

M_{T}^{S} ⊧ x = y ∖ z

.

Case $\neg D ISJ (S, M x)$ . By (8), we have $S \subseteq M x \subseteq M y$ , and hence $D ISJ (S, M z)$ . Using the definition of $M_{T}^{S}$ , we compute

$\begin{matrix} M_{T}^{S} x & = (M x ∖ S) \cup T = ((M y ∖ M z) ∖ S) \cup T \\ = ((M y ∖ S) ∖ M z) \cup T = ((M y ∖ S) \cup T) ∖ M z = M_{T}^{S} y ∖ M_{T}^{S} z, \end{matrix}$

where we used

D ISJ (T ∖ S, M z)

and

D ISJ (S, M z)

to conclude

D ISJ (T, M z)

.

Thus, in all cases,

M ⊧ x = y ∖ z ⟹ M_{T}^{S} ⊧ x = y ∖ z .

For the converse implication, assume

M_{T}^{S} ⊧ x = y ∖ z

. By Lemma 4,

{(M_{T}^{S})}_{S}^{T}

is a

BST

-replacement of

M_{T}^{S}

and moreover

{(M_{T}^{S})}_{S}^{T} = M

; hence, by the first part,

M_{T}^{S} ⊧ x = y ∖ z ⟹ M ⊧ x = y ∖ z .

Therefore, for every atomic formula of the form

x = y ∖ z

,

M ⊧ x = y ∖ z ⟺ M_{T}^{S} ⊧ x = y ∖ z .

It follows that the logical equivalence holds for all

{BST}^{+}

-formulae

Φ

. This proves the lemma. □

For later use, we also state a restricted variant, in which replacements are carried out only with respect to a chosen subset of variables.

Lemma 6.

Let V be a collection of variables and

W \subseteq V

. Let M be a set assignment over V, and let

S, T

be nonempty sets such that

M^{'} := {(M_{| W})}_{T}^{S}

is a

BST

-replacement of M restricted to W. If Φ is a

{BST}^{+}

-formula such that for every literal ℓ in Φ either

Vars (ℓ) \subseteq W

or

Vars (ℓ) \cap W = ⌀

holds, then

M ⊧ Φ ⟺ M^{'} ⊧ Φ .

Proof.

As

{BST}^{+}

-formulae are Boolean combinations of atoms

x = y ∖ z

, it suffices to check atoms. Fix an atom

α

.

If

Vars (α) \cap W = ⌀

, then none of its variables is modified by the replacement, so

M ⊧ α

iff

M^{'} ⊧ α

.

If

Vars (α) \subseteq W

, then satisfaction of

α

depends only on the restrictions

M_{| W}

and

M^{'} = {(M_{| W})}_{T}^{S}

. By Lemma 5,

M_{| W} ⊧ α

iff

M_{| W}^{'} ⊧ α

; hence,

M ⊧ α

iff

M^{'} ⊧ α

.

The claim follows by propositional logic. □

With the same restriction on variables—namely, every literal lies wholly inside W or wholly outside it—Lemma 6 yields invariance of satisfiability under any finite sequence of

BST

-replacements restricted to W.

Corollary 1.

Let V be a collection of variables and let

W \subseteq V

. Let

M^{(0)}, \dots, M^{(m)}

(

m ⩾ 1

) be set assignments over V such that, for each

i = 1, \dots, m

,

M^{(i)} = {(M_{| W}^{(i - 1)})}_{T_{i}}^{S_{i}}

is a

BST

-replacement of

M^{(i - 1)}

restricted to W, for some nonempty

S_{i}, T_{i}

. Let Φ be a

{BST}^{+}

-formula such that for every literal ℓ occurring in Φ we have either

Vars (ℓ) \subseteq W

or

D ISJ (Vars (ℓ), W)

. Then, for all

0 ⩽ h, j ⩽ m

,

M^{(h)} ⊧ Φ ⟺ M^{(j)} ⊧ Φ .

4.2. ♭-Flat Models

Definition 7.

For every ordinal

♭ ⩾ 1

, a set assignment M over a collection V of variables is said to be ♭-flat if all sets in the realm

\cup \{M v | v \in V\}

of M have rank ♭.

No membership atom

x \in y

is satisfied by any ♭-flat set assignment:

Lemma 7.

Let M be a

♭

-flat set assignment over a collection V of variables. Then,

M x \notin M y

for any

x, y \in V

.

Proof.

By the ♭-flatness of M, for all

x \in V

, either

rk (M x) = 0

(when

M x = ⌀

), or

rk (M x) = ♭ + 1

(when

M x \neq ⌀

). In either case,

rk (M x) \neq ♭

, since ♭-flatness presupposes

♭ ⩾ 1

. Thus,

M x \notin - 1 p t \cup \{M y ∣ y \in V\}

for all

x \in V

. □

A satisfiable formula of

{BST}^{+}

always admits a ♭-flat model, for sufficiently large ♭. This is proved in the next lemma.

Lemma 8.

Let Φ be a satisfiable

{BST}^{+}

-formula with n distinct variables. Then, for every ordinal

♭ ⩾ n + 1

, Φ admits a ♭-flat set model of cardinality at most

2^{n} - 1

.

Proof.

Let

Φ

be satisfiable, M a model of

Φ

, and

Σ_{M} = {σ_{0}, \dots, σ_{m - 1}}

its induced partition. Fix an ordinal

♭ ⩾ n + 1

, where

n = |Vars (Φ)|

.

We associate to each block

σ_{i} \in Σ_{M}

a distinct set

s_{i}

of rank ♭, as follows:

For every block $σ_{i}$ that already contains elements of rank ♭, choose one such element as $s_{i}$ .
For the remaining blocks, choose $s_{i}$ among elements of rank ♭ (possibly belonging to other blocks), ensuring that the chosen $s_{i}$ are all distinct and different from those fixed in step 1.

This is possible because the number of blocks is at most exponential in

n = |Vars (Φ)|

, while the collection

W_{♭}^{#}

of sets of rank exactly ♭ is large enough to supply that many distinct elements. Indeed, since each block of

Σ_{M}

is nonempty and determined by a membership pattern over the n variables, it follows that

| Σ_{M} | ⩽ 2^{n} - 1

. On the other hand, the functions

κ \mapsto 2^{κ} - κ

and

κ \mapsto |V_{κ}|

are strictly increasing for

κ ⩾ 1

, so from (2) and (3) it follows that, for every integer

ℓ ⩾ n + 1

,

\begin{matrix} |W_{ℓ}^{#}| & = |V_{ℓ + 1}| - |V_{ℓ}| = 2^{|V_{ℓ}|} - |V_{ℓ}| \\ ⩾ 2^{|V_{n + 1}|} - |V_{n + 1}| = |V_{n + 2}| - |V_{n + 1}| = |W_{n + 1}^{#}| ⩾ 2^{n} . \end{matrix}

Therefore, whether ♭ is finite or infinite,

W_{♭}^{#}

contains at least

2^{n}

elements of rank ♭, enough to assign distinct representatives to all the blocks of

Σ_{M}

.

The argument that follows requires that the distinct blocks

σ_{0}, \dots, σ_{m - 1}

of

Σ_{M}

be ordered so that all blocks containing elements of rank ♭ come first. Based on this ordering, we recursively construct a sequence of assignments

M_{0}, \dots, M_{m}

as follows:

\{\begin{matrix} M_{0} M, \\ M_{i + 1} {(M_{i})}_{{s_{i}}}^{σ_{i}}, i = 0, \dots, m - 1 . \end{matrix}

For the blocks considered in step 1, we have

{s_{i}} \subseteq σ_{i}

; hence,

{(M_{i})}_{{s_{i}}}^{σ_{i}}

is trivially a

BST

-replacement of

M_{i}

from

σ_{i}

to

{s_{i}}

. For the blocks in step 2, the choice of

s_{i}

guarantees that

{s_{i}}

is disjoint from every block not yet replaced, so the

BST

-conditions also hold.

Therefore, each

M_{i + 1}

is a

BST

-replacement of

M_{i}

, and since

M_{0}

is a model of

Φ

, induction and Lemma 5 yield that every

M_{i + 1}

is as well. In particular,

M_{m}

is a model of

Φ

with support

{s_{0}, \dots, s_{m - 1}}

, consisting solely of sets of rank ♭. Therefore,

M_{m}

provides the desired ♭-flat model of

Φ

, with cardinality at most

2^{n} - 1

. □

Remark 4.

Building on an argument developed in [12] (see also [13]), one can show that every satisfiable formula Φ of

{BST}^{+}

admits a model of cardinality less than the number of its distinct variables. Accordingly, in the proof of Lemma 8, the initial model M may be taken with support of size at most

n - 1

, where

n = | Vars (Φ) |

. The remainder of the argument then produces a ♭-flat model of cardinality at most

n - 1

.

This observation leads to the following sharper result.

Corollary 2.

Let Φ be a satisfiable

{BST}^{+}

-formula with n distinct variables. Then, for every ordinal

♭ ⩾ n + 1

, Φ admits a ♭-flat set model of cardinality at most

n - 1

. □

5. Existential Inexpressibility of $z = \{x\}$ in ${BST}^{+}$

We investigate whether atoms of the form

z = \{x\}

can be existentially expressed in

{BST}^{+}

. If this were possible, then membership atoms

x \in y

would also be expressible: indeed, the presence of ‘{■}’ allows one to derive ‘∈’ as a definable construct.

Lemma 9.

⊧ x \in y ⟷ (\exists z) (z = \{x\} \land z \subseteq y),

if z is distinct from the variables

x, y

.

Proof.

If

M ⊧ z = \{x\} \land z \subseteq y

holds, then

M ⊧ x \in y

, since

M x \in M z \subseteq M y

.

Conversely, if

M ⊧ x \in y

holds, extend M by putting

M z = \{M x\}

; then,

M x \in M y

yields

M ⊧ z = \{x\} \land z \subseteq y

, so that

M ⊧ (\exists z) (z = \{x\} \land z \subseteq y)

. □

In what follows we show that membership atoms

x \in y

are not existentially expressible in

{BST}^{+}

. Therefore, by Lemma 9, atoms of the form

z = \{x\}

are not existentially expressible either. Specifically, we prove that every satisfiable formula

Ψ

of

{BST}^{+}

admits a flat model M, namely, a model whose support consists solely of elements all having the same positive rank. As a consequence,

M x \notin M y

for any

x, y \in Vars (Ψ)

, and thus

M ⊭ x \in y

.

We are now ready to prove that membership atoms

x \in y

—and hence, by Lemma 9, atoms of the form

z = \{x\}

—are not existentially expressible in

{BST}^{+}

.

Theorem 1.

The atom

x \in y

is not existentially expressible in

{BST}^{+}

.

Proof.

By way of contradiction, assume that

x \in y

is existentially expressible by a formula

Ψ (x, y, z)

of

{BST}^{+}

involving only atoms of the form

x^{'} = y^{'} ∖ z^{'}

, i.e.,

⊧ x \in y ⟷ (\exists z) Ψ (x, y, z) .

(9)

Since

x \in y

is trivially satisfiable, so are—by (9)—

(\exists z) Ψ (x, y, z)

and

Ψ (x, y, z)

. Thus, by Lemma 8,

Ψ (x, y, z)

is modeled by a ♭-flat set assignment

M^{*}

; hence, by Lemma 7,

M^{*} x \notin M^{*} y

holds. It follows that

M^{*} ⊧ x \notin y \land (\exists z) Ψ (x, y, z)

holds, and therefore

M^{*} ⊭ (\exists z) Ψ (x, y, z) ⟶ x \in y

, which contradicts (9). □

6. $O (n^{2})$ -Expressibility in ${BST}^{+}$ of Singleton-Atom Conjunctions

In accordance with Definition 3, we shall prove that any conjunction

ψ (x)

of atoms of the form

x = \{y\}

is

O (n^{2})

-expressible from

BST

into

{BST}^{+}

by exhibiting a map

〈φ (y), ψ (x)〉 \mapsto Ξ_{φ}^{ψ} (x, y, z),

computable in quadratic time, such that conditions (a)–(c) of Definition 3 hold. Here,

φ (y)

ranges over

BST

-conjunctions, and the variables in

z

are distinct from those in

x

and in

y

.

Thus, let

φ (y)

and

ψ (x)

be of the said form. For each variable

x \in Vars (φ \land ψ)

(note that

Vars (φ \land ψ) = Vars (φ) \cup Vars (ψ)

), introduce a fresh auxiliary variable

\tilde{x}

, chosen distinct from all others. These variables will be used to mimic

x \in y

via the relation

\tilde{x} ⊊ \tilde{y}

. Also, let

z

be an additional fresh auxiliary variable, to be interpreted as the empty set. Then, we put

\begin{matrix} Ξ_{φ}^{ψ} ≔ & z = z ∖ z \land \underset{x = \{y\} in ψ}{⋀} x ⊈ y \land \\ \underset{\begin{matrix} x = \{y\} in ψ \\ x^{'} = {y^{'}} in ψ \end{matrix}}{⋀} (y = y^{'} ⟷ x = x^{'}) \land \underset{\begin{matrix} x = \{y\} in ψ \\ v \in Vars (φ \land ψ) \end{matrix}}{⋀} (\neg D ISJ (x, v) ⟶ x \subseteq v) \land \\ \underset{\begin{matrix} x = \{y\} in ψ \\ v \in Vars (φ \land ψ) \end{matrix}}{⋀} (\neg D ISJ (x, v) ⟶ \tilde{y} ⊊ \tilde{v}) \land \underset{x, y \in Vars (φ \land ψ)}{⋀} (x = y ⟶ \tilde{x} = \tilde{y}) \end{matrix}

(10)

(thus, the list

z

of variables in Definition 3 consists of the collection

\tilde{x}

of all auxiliary set variables

\tilde{x}

together with

z

).

Remark 5.

In the formula

Ξ_{φ}^{ψ}

, the following shorthands are used:

\begin{matrix} D ISJ (x, v) & \overset{def}{\Leftrightarrow} x = x ∖ v, \\ x = v & \overset{def}{\Leftrightarrow} x = v ∖ z, \\ x \subseteq v & \overset{def}{\Leftrightarrow} x ∖ v = z, \\ x ⊊ v & \overset{def}{\Leftrightarrow} x ∖ v = z \land v ∖ x \neq z . \end{matrix}

Here,

z

is the auxiliary variable intended to denote the empty set. Indeed, for every model M of

Ξ_{φ}^{ψ}

, the conjunct

z = z ∖ z

ensures

M z = ⌀

, from which the following equivalences follow:

\begin{matrix} M ⊧ D ISJ (x, v) & ⟺ D ISJ (M x, M v); \\ M ⊧ x = v & ⟺ M x = M v; \\ M ⊧ x \subseteq v & ⟺ M x \subseteq M v; \\ M ⊧ x ⊊ v & ⟺ M x ⊊ M v . \end{matrix}

(11)

For instance, the second equivalence can be justified as follows:

M ⊧ x = v ⟺ M ⊧ x = v ∖ z ⟺ M x = M v ∖ M z = M v .

These shorthands are introduced purely for readability: their syntactic form mirrors the underlying semantics, making formulae such as

Ξ_{φ}^{ψ}

easier to parse at a glance without repeatedly unfolding their full definitions.

By the preceding remark,

Ξ_{φ}^{ψ}

is a formula of

{BST}^{+}

; indeed, it is just a conjunction of a rather simple form. As a preliminary step, we record a basic semantic property that will be used repeatedly.

Lemma 10.

For every model M of

Ξ_{φ}^{ψ}

, if

x = {y}

occurs in ψ, then

M x \neq ⌀ and D ISJ (M x, M y) .

Proof.

Let

M ⊧ Ξ_{φ}^{ψ}

, and let

x = {y}

be a literal in

ψ

. Since

Ξ_{φ}^{ψ}

includes the conjuncts

z = z ∖ z, x ⊈ y, and (\neg D ISJ (x, y) \to x \subseteq y),

the equivalences (11) in Remark 5 imply

M x ⊈ M y and \neg D ISJ (M x, M y) ⟹ M x \subseteq M y .

From this, it follows that

M x \neq ⌀

and

D ISJ (M x, M y)

, as claimed. □

6.1. Design and Analysis of the Translation Algorithm

We first present an explicit procedure that constructs

Ξ_{φ}^{ψ}

from the input conjunction

φ \land ψ

; see Algorithm 1. We then analyze its running time, thereby establishing point (a) of Definition 3.

We use the auxiliary program variables

V

and

Ξ

in the pseudocode below to store, respectively, the list of variables encountered in

φ \land ψ

and the list of generated conjuncts.

Algorithm 1 Construction of

Ξ_{φ}^{ψ}

1:: Initialize $V$ as an empty list of set variables;
2:: Initialize $Ξ$ as an empty list of conjuncts;
3:: for each set variable x that appears in $φ$ do
4:: add x to $V$ ;
5:: for each conjunct $x = \{y\}$ in $ψ$ do
6:: add x and y to $V$ ;
7:: add $x ⊈ y$ to $Ξ$ ;
8:: for each conjunct $x = \{y\}$ in $ψ$ do
9:: for each $v \in V$ do
10:: add $(\neg D ISJ (x, v) \to x \subseteq v) \land (\neg D ISJ (x, v) \to \tilde{y} ⊊ \tilde{v})$ to $Ξ$ ;
11:: for each pair $x = \{y\}$ , $x^{'} = \{y^{'}\}$ of distinct conjuncts in $ψ$ do
12:: add $(y = y^{'} \leftrightarrow x = x^{'})$ to $Ξ$ ;
13:: for all $x, y \in V$ do
14:: add $(x = y \to \tilde{x} = \tilde{y})$ to $Ξ$ ;
15:: add $z = z ∖ z$ to $Ξ$ ;
16:: return $Ξ$ .

The running time of Algorithm 1 is bounded as follows.

Lemma 11.

The formula

Ξ_{φ}^{ψ}

can be constructed from φ and ψ in time

O ({|φ \land ψ|}^{2})

.

Proof.

In order to prove the lemma, it is convenient to spell out in Algorithm 1 the procedure that constructs

Ξ_{φ}^{ψ}

from the conjunction

φ \land ψ

. We maintain the program variable

V

as a list of variables and

Ξ

as a list of conjuncts, so that appending a variable or a conjunct takes constant time. Moreover, each conjunct appended to

Ξ

has constant size.

The for-loop at lines 3–4 can be performed in

Θ (|φ|)

time, where

|φ|

denotes the total length of the conjunction

φ

. Similarly, the for-loop at lines 5–7 can be performed in

Θ (|ψ|)

time.

Letting m be the length of the list

V

after the execution of the for-loops 3–4 and 5–7, we have

m ⩽ |φ| + |ψ|

.

Upper bound (running time). The only nontrivial operations performed by Algorithm 1 are iterating through the for-loops and appending variables/conjuncts to the lists $V$ and $Ξ$ . By assumption, each append takes constant time, and each conjunct appended to $Ξ$ has constant size. Hence, the running time is asymptotically proportional to the total number of loop iterations.

As noted, lines 5–7 perform

Θ (|ψ|)

iterations. Lines 8–10 perform

Θ (|ψ| \cdot m)

iterations (for each conjunct in

ψ

, the inner loop ranges over all v in

V

). Lines 11–12 perform

Θ ({|ψ|}^{2})

iterations, since they range over all pairs of distinct conjuncts in

ψ

. Finally, lines 13–14 perform

Θ (m^{2})

iterations, since they range over all pairs

(x, y)

with x and y in

V

, whereas line 15 clearly takes constant time.

Therefore, the overall running time is

Θ (|φ| + |ψ| + |ψ| \cdot m + {|ψ|}^{2} + m^{2}) .

Since

m \leq |φ| + |ψ| \leq |φ \land ψ|

and

|ψ| \leq |φ \land ψ|

, we conclude that Algorithm 1 runs in time

O ({|φ \land ψ|}^{2}),

as claimed. □

Before proving the semantic conditions (b) and (c) of Definition 3 for the translation map (10), we briefly illustrate

Ξ_{φ}^{ψ}

on two concrete examples. We then return to the main line of the argument and show that

If $φ (y) \land Ξ_{φ}^{ψ} (x, y, z)$ is satisfiable, then so is $φ (y) \land ψ (x)$ ;
Every model of $φ (y) \land ψ (x)$ can be extended into a model of $Ξ_{φ}^{ψ} (x, y, z)$ .

As above, this is where the list

z

of variables consists of the collection

\tilde{x}

of all auxiliary set variables

\tilde{x}

together with

z

. This will prove that conditions (b) and (c) of Definition 3 hold and hence that every singleton-atom conjunction is

O (n^{2})

-expressible from

BST

into

{BST}^{+}

.

6.2. Illustrating the Translation $Ξ_{φ}^{ψ}$ on Examples

We start with an unsatisfiable input, showing how

Ξ_{φ}^{ψ}

preserves unsatisfiability, and then consider a satisfiable instance illustrating the model extension mechanism used for condition (c).

Example 3.

The conjunction

φ \land ψ

, where

φ ≔ y = x ∖ z

and

ψ ≔ x = \{y\} \land y = {z}

, is not satisfiable; in fact, any set assignment M satisfying this formula is such that

M y = M x ∖ M z

,

M x = {M y}

, and

M y = {M z}

; thus,

M y = M z

, and hence

M z = \{M z\}

, a contradiction.

Our translation

Ξ_{φ}^{ψ}

comprises

y \neq ⌀ \land x ⊈ y \land (\neg D ISJ (x, y) ⟶ x \subseteq y) .

Any model M for

φ \land Ξ_{φ}^{ψ}

is such that

M y \subseteq M x

(since

M y = M x ∖ M z

),

M y \neq ⌀

, and

M x ⊈ M y

hold, so that

\neg D ISJ (M x, M y)

; but then

M x \subseteq M y

must hold, which conflicts with

M x ⊈ M y

.

The next example complements the previous one: it is satisfiable and highlights the extension of a model of

φ \land ψ

to the auxiliary variables so as to satisfy

Ξ_{φ}^{ψ}

(cf. Section 6.4).

As we will prove in Section 6.4, satisfiability carries over from

φ \land ψ

to

φ \land Ξ_{φ}^{ψ}

. Here is an interesting example of this:

Example 4.

Consider the conjunction

φ \land ψ

, where

φ ≔ x = y ∖ y^{'} \land x = z ∖ z^{'}, and ψ ≔ z = {y} .

This formula is satisfiable: for instance, it is satisfied by every set assignment M over

Vars (φ \land ψ)

of the form

M x ≔ ⌀, M y ≔ s, M z ≔ \{s\}, M y^{'} ≔ s \cup s^{'}, M z^{'} ≔ \{s\} \cup s^{''},

(12)

where s,

s^{'}

, and

s^{''}

are any well-founded sets.

It can easily be checked that, for every set assignment M of the form (12), the extension

M^{+}

of M over the auxiliary variables

\tilde{v}

, for

v \in Vars (φ \land ψ)

and where

M^{+} \tilde{v} ≔ \{\begin{matrix} s \cup \{s\} & if s \in M v \\ ⌀ & otherwise, \end{matrix}

satisfies

Ξ_{φ}^{ψ}

, so that

M ⊧ (\exists \tilde{x}, \tilde{y}, \tilde{z}, {\tilde{y}}^{'}, {\tilde{z}}^{'}) Ξ_{φ}^{ψ}

holds. Thus, to show that condition (c) of Definition 3 is satisfied, namely, that

⊧ (φ \land ψ) ⟶ (\exists \tilde{x}, \tilde{y}, \tilde{z}, {\tilde{y}}^{'}, {\tilde{z}}^{'}) Ξ_{φ}^{ψ}

holds, it is enough to check that the conjunction

φ \land ψ

is satisfied by set assignments of the form (12) only. Let then

\bar{M}

be any model for

φ \land ψ

. Since

\bar{M} x \subseteq \bar{M} z

and

\bar{M} z = \{\bar{M} y\}

, either

\bar{M} x = ⌀

or

\bar{M} x = \{\bar{M} y\}

holds. The latter case can be readily ruled out, for in view of

\bar{M} x \subseteq \bar{M} y

it would follow

\bar{M} y \in \bar{M} y

, which is untenable in the realm of well-founded sets. Thus,

\bar{M} x = ⌀

must hold. Letting

s ≔ \bar{M} y

—so that

\bar{M} z = \{s\}

—, since

\bar{M} y \subseteq \bar{M} y^{'}

and

\bar{M} z \subseteq \bar{M} z^{'}

, we have

\bar{M} y^{'} = s \cup s^{'}

and

\bar{M} z^{'} = \{s\} \cup s^{''}

, for some sets

s^{'}

and

s^{''}

, and therefore

\bar{M}

has the form (12).

6.3. Satisfiability Preservation

We first establish that if

φ \land Ξ_{φ}^{ψ}

is satisfiable, then so is

φ \land ψ

, thereby fulfilling condition (b) of Definition 3.

Let M be a model of

φ \land Ξ_{φ}^{ψ}

. In order to convert it into a model of

φ \land ψ

, we can assume that M is ♭-flat over

Vars (φ \land Ξ_{φ}^{ψ})

for some integer

♭ > max (|Vars (φ \land Ξ_{φ}^{ψ})|, | ψ |) .

Indeed, on the one hand, Lemma 8 enables us to do so; on the other hand, Lemma 7 tells us that such an M does not model any of the atoms

x = \{y\}

in

ψ

.

Set

M^{(0)} ≔ M

, and let m be the number of singleton conjuncts in

ψ

. For

i = 1, \dots, m

, we select an atom

x_{i} = \{y_{i}\}

in

ψ

and lift

M^{(i - 1)}

to a model

M^{(i)}

of

φ

such that every conjunct of

ψ

already satisfied in

M^{(i - 1)}

remains satisfied in

M^{(i)}

, and the selected conjunct is also satisfied in

M^{(i)}

. At the end of this iterative process, all conjuncts of

ψ

will be satisfied. More precisely, if

ψ_{i}

denotes the conjunction of the atoms selected up to step i, then

ψ_{1} \land \dots \land ψ_{m}

coincides with

ψ

(possibly reordered and with repetitions).

In our set up, each

ψ_{i}

must hence comprise a conjunct

x_{i} = \{y_{i}\}

from

ψ

not appearing in any

ψ_{j}

with

j < i

. The selection of

x_{i} = \{y_{i}\}

determines the transformation from

M^{(i - 1)}

to

M^{(i)}

and is made as follows: among the conjuncts of

ψ

that are not already present in

ψ_{1} \land \dots \land ψ_{i - 1}

, select one that is minimal with respect to the ordering

≺_{ψ}^{M}

on the conjuncts of

ψ

, defined below:

Definition 8.

The relation

≺_{ψ}^{M}

is the minimal transitively closed relation over the conjuncts of ψ such that, for all singleton atoms

x = {y}

and

x^{'} = {y^{'}}

in ψ, one has

(x = {y}) ≺_{ψ}^{M} (x^{'} = {y^{'}}) whenever \neg D ISJ (M x, M y^{'}) .

This definition enforces the following:

Lemma 12.

The relation

≺_{ψ}^{M}

is a strict partial order.

Proof.

By definition

≺_{ψ}^{M}

is transitive, so it remains only to prove that it is acyclic.

Suppose, by way of contradiction, that

≺_{ψ}^{M}

contains some cycle. Then, there are atoms

ℓ_{0}, ℓ_{1}, \dots, ℓ_{h}

of

ψ

with

ℓ_{0} ≺^{M} ℓ_{1} ≺^{M} \dots ≺^{M} ℓ_{m - 1} ≺^{M} ℓ_{h} = ℓ_{0},

where each

ℓ_{i}

is of the form

x_{i} = \{y_{i}\}

for

i = 0, 1, \dots, h

. By the definition of

≺_{ψ}^{M}

, we have

\neg D ISJ (M x_{0}, M y_{1}), \dots, \neg D ISJ (M x_{h - 1}, M y_{0}) .

Since

M ⊧ Ξ_{φ}^{ψ}

, we know in particular that

M ⊧ \underset{\begin{matrix} x = \{y\} in ψ \\ v \in Vars (φ \land ψ) \end{matrix}}{⋀} (\neg D ISJ (x, v) ⟶ \tilde{y} ⊊ \tilde{v}),

and so we obtain the strict chain

M {\tilde{y}}_{0} ⊊ M {\tilde{y}}_{1} ⊊ \dots ⊊ M {\tilde{y}}_{h - 1} ⊊ M {\tilde{y}}_{0},

which yields the contradiction

M {\tilde{y}}_{0} ⊊ M {\tilde{y}}_{0}

.

Therefore,

≺_{ψ}^{M}

has no cycles and is thus acyclic. Together with its transitivity by definition, this shows that

≺_{ψ}^{M}

is a strict partial order, as claimed. □

From the previous lemma, we may arrange the literals in

ψ

as

ℓ_{1}, \dots, ℓ_{m},

where each

ℓ_{i}

has the form

x_{i} = \{y_{i}\}

, in such a way that the sequence complies with the strict partial order

≺_{ψ}^{M}

; that is, for all

1 ⩽ i < j ⩽ m

, we have

ℓ_{j} \neg ≺_{ψ}^{M} ℓ_{i}

.

Such an ordering will guide the iterative process: it is not arbitrary but rather ensures that the satisfaction of atoms established in earlier steps is preserved in all subsequent steps.

The iterative process proceeds as follows: at each step i, an atom

ℓ_{i} x_{i} = \{y_{i}\}

is selected from

ψ

, as specified above, so that

ℓ_{i}

is minimal, in

ψ

with all atoms selected at previous steps removed, with respect to

≺_{ψ}^{M}

; that is, for all

i, j \in {1, \dots, m}

,

if i < j then ℓ_{j} ≺_{ψ}^{M} ℓ_{i},

(13)

Then, letting

X_{i} ≔ M^{(i - 1)} x_{i} and Y_{i} ≔ M^{(i - 1)} y_{i},

(14)

we define

M^{(i)} {≔ (M_{| W}^{(i - 1)})}_{{Y_{i}}}^{X_{i}},

(15)

where

W ≔ Vars (ϕ \land ψ) .

That is,

M^{(i)}

is obtained from

M^{(i - 1)}

by replacing

X_{i}

with

{Y_{i}}

while restricting to the variables in

Vars (ϕ \land ψ)

.

Remark 6.

Throughout the remainder of this subsection, we may occasionally omit to mention the explicit restriction to

W = Vars (φ \land ψ)

and, for instance, write

{(M^{(i - 1)})}_{{Y_{i}}}^{X_{i}}

in place of

{(M_{| W}^{(i - 1)})}_{{Y_{i}}}^{X_{i}}

.

The following lemma establishes a useful dichotomy on the possible ranks of the values assigned to variables during the replacement process.

Lemma 13.

For each

i = 1, \dots, m

, we have

either rk (M^{(i - 1)} y_{i}) ⩽ i - 1 < m ⩽ ♭ or rk (M^{(i - 1)} y_{i}) > ♭ .

Proof.

Preliminarily, it is immediate to check that for every

i = 1, \dots, m

we have

supp (M^{(i)}) \subseteq supp (M^{(i - 1)}) \cup {M^{(i - 1)} y_{i}} .

(16)

We proceed by induction on i.

Base case (

i = 1

). Before any replacement we are at

M^{(0)}

, which by construction is ♭-flat; hence, every member of every

M^{(0)} v

has rank ♭. Thus,

rk (M^{(0)} y_{1})

is either 0 or

♭ + 1

, so the stated dichotomy holds.

Inductive step. Fix

i > 1

and assume the statement holds for all indices

1, \dots, i - 1

. By iterating (16) we obtain

supp (M^{(i - 1)}) \subseteq supp (M^{(0)}) \cup {M^{(t - 1)} y_{t} ∣ 1 ⩽ t ⩽ i - 1} .

(17)

Let

s \in M^{(i - 1)} y_{i}

be arbitrary. Then,

s \in supp (M^{(i - 1)})

, so by (17) either

(a): $s \in supp (M^{(0)})$ , in which case $rk (s) = ♭$ (since $M^{(0)}$ is ♭-flat);
(b): $s = M^{(t - 1)} y_{t}$ for some $t \in {1, \dots, i - 1}$ . By the inductive hypothesis at index t, either $rk (s) = rk (M^{(t - 1)} y_{t}) ⩽ t - 1 ⩽ i - 2$ , or $rk (s) > ♭$ .

If case (a) occurs for some member s of

M^{(i - 1)} y_{i}

, then

rk (M^{(i - 1)} y_{i}) ⩾ ♭ + 1 > ♭

. If case (b) occurs with

rk (s) > ♭

for some member s, again

rk (M^{(i - 1)} y_{i}) > ♭

. Otherwise, all members of

M^{(i - 1)} y_{i}

(if any) have rank at most

i - 2

, whence

rk (M^{(i - 1)} y_{i}) ⩽ (i - 2) + 1 = i - 1 .

Finally, by our choice of ♭ we have

m ⩽ ♭

, so in particular

i - 1 < m ⩽ ♭

. This yields exactly the stated dichotomy for i.

This completes the proof. □

Our goal is to prove that the final assignment

M^{(m)}

is a model of

φ \land ψ

. To this end, we first establish a stability lemma, from which it follows that after step i the sets assigned to the variables in the atom

x_{i} = \{y_{i}\}

remain unchanged in all later assignments

M^{(t)}

, for

t > i

.

Lemma 14.

For every

i = 1, \dots, m

, letting

x_{i} = \{y_{i}\}

be the atom of ψ selected at step i, we have the following:

(a)

M^{(i)}

is a

BST

-replacement of

M^{(i - 1)}

restricted to

Vars (φ \land ψ)

such that

M^{(i)} ⊧ φ \land Ξ_{φ}^{ψ}

;

(b)

for each j with

1 ⩽ j < i

,

(b₁): $M^{(j)} y_{j} = M^{(i)} y_{j}$ ;
(b₂): $M^{(j)} x_{j} = M^{(i)} x_{j}$ ;

(c)

M^{(i)} y_{i} = M^{(i - 1)} y_{i}

;

(d)

M^{(i)} x_{i} = \{M^{(i)} y_{i}\}

.

Proof Sketch.

The proof proceeds by induction on i, relying on the preservation properties of

BST

-replacements and the constraints encoded in

Ξ_{φ}^{ψ}

. The detailed argument, which involves a careful case analysis for clauses (a)–(d), is rather technical. For readability, we defer the full proof to Appendix B. □

The replacement construction of Lemma 14 immediately entails a monotonicity condition on the cardinalities of the assigned sets, stated next.

Corollary 3.

For every

i = 1, \dots, m

and every

v \in Vars (φ \land Ξ_{φ}^{ψ})

, we have

|M^{(i)} v| ⩽ |M^{(i - 1)} v| .

Consequently, for every

v \in Vars (φ \land Ξ_{φ}^{ψ})

, we have

|M^{(m)} v| ⩽ |M^{(0)} v| .

Proof.

Fix

i \in {1, \dots, m}

and

v \in Vars (φ \land Ξ_{φ}^{ψ})

. If

M^{(i)} v = M^{(i - 1)} v

, the claim is immediate. Otherwise, since

M^{(i)}

is obtained from

M^{(i - 1)}

by a

BST

-replacement (with respect to the pair

(M^{(i - 1)} x_{i}, {M^{(i - 1)} y_{i}})

), the only way

M^{(i - 1)} v

can change is when

⌀ \neq M^{(i - 1)} x_{i} \subseteq M^{(i - 1)} v

, in which case

M^{(i)} v = (M^{(i - 1)} v ∖ M^{(i - 1)} x_{i}) \cup {M^{(i - 1)} y_{i}} .

Plainly,

|M^{(i - 1)} x_{i}| ⩾ |{M^{(i - 1)} y_{i}}|

; hence,

|M^{(i)} v| ⩽ |M^{(i - 1)} v|

.

Applying this inequality successively for

i = 1, 2, \dots, m

yields

|M^{(m)} v| ⩽ |M^{(m - 1)} v| ⩽ \dots ⩽ |M^{(0)} v|,

as required. □

We are now ready to prove that

M^{(m)}

satisfies

φ \land ψ

. By Lemma 14(a), we have

M^{(m)} ⊧ φ .

(18)

Moreover, Lemma 14(c) guarantees that, for every i with

1 ⩽ i ⩽ m

,

M^{(i)} x_{i} = {M^{(i)} y_{i}} .

(19)

Thus, to establish

M^{(m)} ⊧ ψ

, it suffices to prove that, for all i,

M^{(m)} x_{i} = {M^{(m)} y_{i}} .

(20)

If

i = m

, then (19) immediately gives (20). If

i < m

, clauses (b₁) and (b₂) of Lemma 14 yield

M^{(i)} y_{i} = M^{(m)} y_{i}

and

M^{(i)} x_{i} = M^{(m)} x_{i}

, respectively. Combining these with (19), we again obtain (20). Hence,

M^{(m)} ⊧ ψ

. From this and (18), it follows that

M^{(m)} ⊧ φ \land ψ

, thus completing the proof of satisfiability preservation.

6.4. Model Extension

We next prove that every model of

φ \land ψ

can be extended into a model of

φ \land Ξ_{φ}^{ψ}

, which corresponds to condition (c) of Definition 3.

Let

φ \land ψ

be satisfiable, and let M be a model of it. Define ≺ as the minimal transitive relation on

Vars (φ \land ψ)

such that

y ≺ x whenever M y \in M x, for all x, y \in Vars (φ \land ψ) .

By set well-foundedness, ≺ is irreflexive; together with transitivity, this makes ≺ a strict ordering on

Vars (φ \land ψ)

.

Extend M to the auxiliary variables

\tilde{v}

, for each

v \in Vars (φ \land ψ)

, by setting for each of them:

M \tilde{v} ≔ {M x ∣ x ≺ v} .

Moreover, extend M to the auxiliary variable

z

by stipulating

M z ≔ ⌀,

so that in particular

M ⊧ z = z ∖ z .

(21)

We prove that

M ⊧ Ξ_{φ}^{ψ}

holds.

Preliminarily, note that

M x

is a singleton for every singleton atom

x = \{y\}

in

ψ

. It then follows directly that

M ⊧ \underset{\begin{matrix} x = \{y\} in ψ \\ v \in Vars (φ \land ψ) \end{matrix}}{⋀} (\neg D ISJ (x, v) ⟶ x \subseteq v) .

(22)

Moreover, for each singleton atom

x = {y}

in

ψ

, the acyclicity of the membership relation yields

M x ⊈ M y

, and so

M ⊧ x ⊈ y

. Therefore,

M ⊧ \underset{x = {y} \in ψ}{⋀} x ⊈ y .

(23)

In addition, for each pair of singleton atoms

x = {y}

and

x^{'} = {y^{'}}

in

ψ

, we plainly have

M ⊧ y = y^{'} ⟷ x = x^{'},

whence

M ⊧ \underset{\begin{matrix} x = {y} \in ψ \\ x^{'} = {y^{'}} \in ψ \end{matrix}}{⋀} (y = y^{'} ⟷ x = x^{'}) .

(24)

Now, consider any implication of the form

\neg D ISJ (x, v) ⟶ \tilde{y} ⊊ \tilde{v},

with

x = {y}

in

ψ

and

v \in Vars (φ \land ψ)

, and assume that

M ⊧ \neg D ISJ (x, v)

. Since

M x = {M y}

, this yields

M x \subseteq M v

; hence,

M y \in M v

. Thus,

y ≺ v

. By the transitivity and strictness of ≺, we obtain

M \tilde{y} = {M y^{'} ∣ y^{'} ≺ y} \subseteq {M y^{'} ∣ y^{'} ≺ v} = M \tilde{v},

namely,

M ⊧ \tilde{y} \subseteq \tilde{v}

.

Hence, by the arbitrariness of

x = {y}

in

ψ

and

v \in Vars (φ \land ψ)

, it follows that

M ⊧ \underset{\begin{matrix} x = {y} \in ψ \\ v \in Vars (φ \land ψ) \end{matrix}}{⋀} (\neg D ISJ (x, v) ⟶ \tilde{y} ⊊ \tilde{v}) .

(25)

Finally, consider the implications

x = y ⟶ \tilde{x} = \tilde{y},

with

x, y \in Vars (φ \land ψ)

. Assume

M x = M y

. Suppose

v ≺ x

. By definition of ≺, there exists a finite chain of variables

v_{1}, \dots, v_{n}

such that

v = v_{1}, M v_{i} \in M v_{i + 1} for each i, v_{n} = x .

Hence,

M v = M v_{1} \in \dots \in M v_{n} = M x = M y,

which yields

v ≺ y

. By a symmetric argument,

v ≺ x

follows from

v ≺ y

. Therefore,

{v ∣ v ≺ x} = {v ∣ v ≺ y},

and thus

M \tilde{x} = M \tilde{y}

. Hence,

M ⊧ \underset{x, y \in Vars (φ \land ψ)}{⋀} (x = y ⟶ \tilde{x} = \tilde{y}) .

(26)

Collecting (21)–(25), and the above argument, we see that all conjuncts of

Ξ_{φ}^{ψ}

are satisfied by M. Therefore,

M ⊧ Ξ_{φ}^{ψ},

which establishes the model extension property.

We have so extended a generic M such that

M ⊧ φ \land ψ

into a model of

φ \land Ξ_{φ}^{ψ}

; therefore, we get

⊧ (φ (y) \land ψ (x)) ⟶ (\exists z) Ξ_{φ}^{ψ} (x, y, z),

where the list

z

consists of the collection

\tilde{x}

of all auxiliary set variables

\tilde{x}

together with

z

.

Finally, in view of Definition 3, combining the satisfiability preservation established in Section 6.3 with the model extension property of Section 6.4, and relying on Lemma 11, we obtain our main expressibility result:

Theorem 2.

Singleton-atom conjunctions are

O (n^{2})

-expressible from

BST

into

{BST}^{+}

.

7. Conclusions: Related and Planned Work

The main contributions of this paper are as follows:

The introduction of the notion of $O (f)$ -expressibility across theories, refining the existing notion of existential expressibility, which, while useful, is too coarse for our purposes.
The proof that atoms of the form $z = \{x\}$ are not existentially expressible in ${BST}^{+}$ .
The proof that, by contrast, any conjunction of such atoms is $O (n^{2})$ -expressible from $BST$ into ${BST}^{+}$ , using a construction we call the nested-to-flat translation.

As noted in the Introduction, the authors’ interest in satisfiability mechanisms for restricted fragments of set theory arises primarily from the design and experimental use of the

ÆtnaNova

proof verifier.

As discussed ref. [3],

ELEM

is the core inference mechanism in

ÆtnaNova

, where it often operates implicitly alongside other methods. It is based on an enhanced form of multilevel syllogistic (ref. [1]), a decision procedure for checking the satisfiability of certain unquantified set-theoretic formulae. This procedure allows

ÆtnaNova

to establish that a statement follows from a given proof context by showing that its negation yields an unsatisfiable conjunction with earlier statements.

When parts of a proof involve constructs that fall outside the scope of

ELEM

’s built-in syllogistic, a preprocessing step replaces them with fresh variables, ensuring uniform treatment of identical structures. However, proof steps are rarely uniquely determined by prior lines and hints alone. This is because

ELEM

often generates a range of easy consequences from a given context, leaving the user free to choose which of these consequences to assert as the next proof step.

Since inference mechanisms akin to

ELEM

are likely to play a central role in proof technology, it is important to develop translation methods of low algorithmic cost—such as the nested-to-flat translation discussed above—that make their integration possible.

7.1. Envisaged Enhancements to the Nested-to-Flat Translation

We expect that the satisfiability-preserving translation treated in Section 6 can be tuned to theories richer that

BST

, such as the following. Let

BSTS

denote the collection of conjunctions of literals of the forms

x = y ∖ z, x \neq y ∖ z, x = \{y\}

and let

{BSTS}^{+}

be the Boolean closure of atoms of the forms

x = y ∖ z, x = \{y\} .

Then, it seems plausible that conjunctions of atoms of the form

F INITE (x), D ENUMERABLE (x),

together with their negations—where

F INITE (x)

expresses that the set assigned to x is finite and

D ENUMERABLE (x)

expresses that the set assigned to x is denumerable (i.e., finite or countably infinite)—are likewise

O (n^{2})

-expressible from

BSTS

to

{BSTS}^{+}

. Exploring this broader scenario is part of our ongoing research agenda.

7.2. Towards Integrating Set-Theoretic, Boolean, and Numerical Constraint Reasoning

Besides its independent interest, the nested-to-flat translation discussed so far can be seen as a preparatory step toward the quantitative approach to logical inference (cf. [14,15]), as specialized to the field of computable set theory (cf. Ch. 11 ref. [16]).

Our translation, in fact, lays the groundwork for reducing satisfiability problems about sets to satisfiability problems about nonnegative integers—or, if we move from the theory of hereditarily finite sets to a more general setting, to the language of the additive theory of cardinals (which is decidable; ref. [17]).

One such reduction, where the source language embodies a cardinality operator

|x|

indicating how many elements belong to the set x, is presented in Sec. 11.1 ref. [16]. Let us briefly outline how it proceeds. We are given a conjunction

ψ

of literals of the forms

\begin{matrix} x = y ∖ z, & x = \{y\}, & h = |x|, & h = i + j, & h = 1 . \end{matrix}

Suppose we want to test

ψ

for satisfiability over the hereditarily finite sets. Here,

x, y, z

stand for set variables while

i, j, h

stand for numeric variables drawn from a disjoint infinite set of symbols and are intended to range over the nonnegative integers. Let

X

be the collection of all set variables occurring in

ψ

; let, moreover,

Q

be the collection of all nonempty subsets Q of

X

such that the assignment

\begin{matrix} B_{Q} (v) & := & \{\begin{matrix} \{⌀\} & i f & v \in Q, \\ ⌀ & i f & v \in X ∖ Q, \end{matrix} \end{matrix}

satisfies all equalities of the form

x = y ∖ z

occurring in

ψ

. Associate two numeric variables

ν_{Q}, ϱ_{Q}

with each Q in

Q

. We have an algorithm that, given

ψ

, constructs a system

C

of purely arithmetic constraints such that

ψ

is satisfiable over sets and natural numbers if and only if

C

has a solution over the natural numbers. Those literals of the forms

h = i + j

and

h = 1

that were in

ψ

from the outset are retained in

C

; in addition to them,

C

encompasses conditions specifying the intended meanings of

ν_{Q}, ϱ_{Q}

relative to a (hypothetical) model

x \mapsto ξ_{x}

of

ψ

, namely,

Each $ν_{Q}$ represents the cardinality of the set

$\hat{Q} := (\cap_{v \in Q} ξ_{v}) ∖ (\cup_{w \in X ∖ Q} ξ_{w});$
Each $ϱ_{Q}$ represents the position occupied by $\hat{Q}$ in a total ordering that extends the rank comparison relation.

The reduction just outlined can certainly be refined beyond the treatment ref. [16], as we will strive to do in the future. We also expect that it can be boosted with the treatment of explicit rank-related constructs, such as the comparison relator

rk (x) < rk (y)

.

Some reductions of the set-satisfiability problem to integer programming can be found ref. [18], whose line of research aimed at integrating linear programming problems and set constraint manipulation methods in a single logic programming language, as explained refs. [19,20] (the endeavor of integrating cardinality constraints into constraint logic programming with sets has been carried out with a different approach, as reported ref. [21]). A technique for reducing the problem of multilevel syllogistic (cf. [3]) to propositional consistency testing was described ref. [18] (an account of it can also be found in Sec. 11.3 ref. [16]).

In Sec. 4.5 ref. [22], the authors show that satisfiability of nonrecursive Tarskian set constraints is decidable in nondeterministic double-exponential time by reducing the problem to a class of Diophantine constraints called prequadratic. They prove that satisfiability of prequadratic Diophantine constraints is decidable in nondeterministic exponential time and conjecture that it is in NP. If this conjecture holds, satisfiability of nonrecursive Tarskian constraints would be decidable in nondeterministic single-exponential time.

Along a related line of research, ref. [23] reduces quantifier-free constraints on sets involving cardinalities along with direct and inverse images of functions on sets, to systems of numerical constraints in linear integer arithmetic or of the form

x ⩽ y^{d}

, where d is a positive integer.

7.3. Difference Algebras

We take this opportunity to mention an issue concerning an alternative semantics for

BST

, which, although only loosely related to the central theme of this paper, is nevertheless of independent interest. In his bachelor’s thesis Unificazione semantica in strutture booleane (‘Semantic unification in Boolean structures’), defended at the University of Trieste in 2020, Mattia Furlan isolated the valid formulae involving Boolean difference that are displayed in Figure 1.

Let us adopt the universal closures of these formulae as the axioms of a theory in quantificational first-order logic with equality. These axioms characterize an algebraic variety, whose instances we provisionally call difference algebras.

A natural open question is whether every difference algebra

D = (D, ∖_{D})

is isomorphic to an algebra of the form

S = (S, ∖)

, in which the operator ‘∖’ is interpreted as ordinary set-theoretic difference. In this case,

S

must be a family of sets closed under difference, and hence under intersection, since

X \cap Y = X ∖ (X ∖ Y)

holds for all sets

X, Y

.

One might hope to settle this question by appealing to Stone’s celebrated representation theorem, which states that every Boolean algebra is isomorphic to a field of sets. However, we see no direct way to apply this theorem, since there exist difference algebras

D

whose carrier

D

is not closed under symmetric difference, viewed as an operation

〈Y, Z〉 \mapsto Y △_{D} Z

satisfying, for all

X, Y, Z

in

D

,

\begin{matrix} X & = & Y △_{D} Z & ⟷ & (X ∖_{D} (Y ∖_{D} Z) & = & Z ∖_{D} Y & \land & Y ∖_{D} Z & = & X ∖_{D} Z) . \end{matrix}

Moreover, it is unclear how to embed an arbitrary difference algebra into one that forms a proper Boolean ring by virtue of satisfying this closure property.

Author Contributions

Conceptualization, D.C., A.D.D., P.M. and E.G.O.; Methodology, D.C., A.D.D., P.M. and E.G.O.; Writing – original draft, D.C., A.D.D., P.M. and E.G.O.; Writing – review & editing, D.C. and E.G.O. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The data supporting this theoretical study are contained within the article.

Acknowledgments

This work benefited from interactions with the Research Program PIAno di inCEntivi per la Ricerca di Ateneo 2024–2026—Linea di Intervento I “Progetti di ricerca collaborativa”—University of Catania—Project “Semantic Web of EveryThing through Ontological Protocols” (SWETOP). We also acknowledge INdAM/GNCS.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A. A Lower Bound on the Number of Sets of a Fixed Integer Rank

Here, we figure out inequalities preparatory to the proof of Proposition A2 below.

Proposition A1.

For every positive integer n, we have

2^{2^{n}} - 2^{n} ⩾ 2 (2^{n} - n) .

Proof.

We prove the proposition by establishing the equivalent inequality

2^{2^{n}} ⩾ 3 \cdot 2^{n} - 2 n,

(A1)

which follows by simply adding

2^{n}

to both sides of the original inequality.

We proceed by induction on

n ⩾ 1

.

Base case:

For

n = 1

, we have

2^{2^{1}} = 4 and 3 \cdot 2^{1} - 2 \cdot 1 = 6 - 2 = 4 .

Thus, the base case holds with equality.

Inductive step:

Fix

n ⩾ 1

and assume that

2^{2^{n}} ⩾ 3 \cdot 2^{n} - 2 n .

(A2)

We aim to prove that

2^{2^{n + 1}} ⩾ 3 \cdot 2^{n + 1} - 2 (n + 1) .

(A3)

Note that

2^{2^{n + 1}} = {(2^{2^{n}})}^{2} .

Since

n ⩾ 1

, we have

2^{2^{n}} ⩾ 2^{2} = 4

, and thus

{(2^{2^{n}})}^{2} ⩾ 4 \cdot 2^{2^{n}} .

Using the inductive hypothesis (A2), we obtain

\begin{matrix} 2^{2^{n + 1}} & = {(2^{2^{n}})}^{2} ⩾ 4 \cdot 2^{2^{n}} ⩾ 4 (3 \cdot 2^{n} - 2 n) = 12 \cdot 2^{n} - 8 n . \end{matrix}

Now, observe that

3 \cdot 2^{n + 1} - 2 (n + 1) = 6 \cdot 2^{n} - 2 n - 2,

so it suffices to verify the inequality

12 \cdot 2^{n} - 8 n ⩾ 6 \cdot 2^{n} - 2 n - 2,

which reduces to

6 \cdot 2^{n} - 6 n + 2 ⩾ 0 .

This inequality clearly holds, since

n ⩾ 1

implies

2^{n} - n ⩾ 0

, and the leftover

+ 2

ensures strict positivity.

Thus, the inductive step is verified. □

Next, we come to a proposition which lies in the background of this paper:

Proposition A2.

For every positive integer n, the number of well-founded sets of rank equal to n is greater than or equal to

2^{n - 1}

, namely,

|W_{n}^{#}| ⩾ 2^{n - 1} .

Proof.

We proceed by induction on

n ⩾ 1

.

Base case:

For

n = 1

, we have

|W_{1}^{#}| = 1 = 2^{1 - 1}

.

Inductive step:

Let

n ⩾ 1

be such that

|W_{n}^{#}| ⩾ 2^{n - 1}

. We aim to show that

|W_{n + 1}^{#}| ⩾ 2^{n} .

(A4)

From the equality

|W_{n}^{#}| = |V_{n + 1}| - |V_{n}|

and the inductive hypothesis, it follows that

|V_{n + 1}| = |W_{n}^{#}| + |V_{n}| ⩾ 2^{n - 1} + |V_{n}| .

Hence, by (2),

\begin{matrix} |W_{n + 1}^{#}| & = |V_{n + 2}| - |V_{n + 1}| \\ = 2^{|V_{n + 1}|} - |V_{n + 1}| \\ = 2^{2^{|V_{n}|}} - 2^{|V_{n}|} \\ ⩾ 2 (2^{|V_{n}|} - |V_{n}|) & (by Proposition, since |V_{n}| ⩾ 1) \\ = 2 (|V_{n + 1}| - |V_{n}|) \\ = 2 |W_{n}^{#}| \\ ⩾ 2 \cdot 2^{n - 1} & (by the induction hypothesis) \\ = 2^{n} . □ \end{matrix}

Appendix B. Proof of Lemma 14

For convenience, we restate Lemma 14 before giving its proof.

Lemma 14.

For every

i = 1, \dots, m

, letting

x_{i} = \{y_{i}\}

be the atom of ψ selected at step i, we have

(a)

M^{(i)}

is a

BST

-replacement of

M^{(i - 1)}

such that

M^{(i)} ⊧ φ \land Ξ_{φ}^{ψ}

;

(b)

For each j with

1 ⩽ j < i

(b₁): $M^{(j)} y_{j} = M^{(i)} y_{j}$ ;
(b₂): $M^{(j)} x_{j} = M^{(i)} x_{j}$ ;

(c)

M^{(i)} y_{i} = M^{(i - 1)} y_{i}

;

(d)

M^{(i)} x_{i} = \{M^{(i)} y_{i}\}

.

Proof.

We prove the lemma by strong induction on

i = 1, \dots, m

.

Base case (

i = 1

). Since

M^{(0)} ⊧ φ ⋀ Ξ_{φ}^{ψ}

, we have in particular

M^{(0)} ⊧ \underset{x \in Vars (φ \land ψ)}{⋀} (\neg D ISJ (x_{1}, x) ⟶ x_{1} \subseteq x) .

(A5)

Moreover, by Lemma 10,

M^{(0)} x_{1} \neq ⌀

. In addition, since

rk (M^{(0)} y_{1})

is either 0 or

♭ + 1

, the ♭-flatness of

M^{(0)}

ensures that

D ISJ (M^{(0)} y_{1}, M v)

, for all

v \in Vars (φ \land ψ)

. Hence, the

BST

-conditions for the pair

(M^{(0)} x_{1}, \{M^{(0)} y_{1}\})

are satisfied, and therefore

M^{(1)}

is a

BST

-replacement of

M^{(0)}

from

M^{(0)} x_{1}

to

\{M^{(0)} y_{1}\}

(restricted to

Vars (φ \land ψ)

). Since every literal ℓ in

φ \land Ξ_{φ}^{ψ}

satisfies either

Vars (ℓ) \subseteq Vars (φ \land ψ)

or

Vars (ℓ) \cap Vars (φ \land ψ) = ⌀

, Lemma 6 ensures that

M^{(1)} ⊧ φ \land Ξ_{φ}^{ψ}

. Therefore, clause (a) holds in the base case.

Clause (b) is vacuous when

i = 1

.

Finally, concerning clause (c), from (A5) and Lemma 10 we obtain

D ISJ (M^{(0)} x_{1}, M^{(0)} y_{1})

. Therefore, by the definition of

M^{(1)}

, it follows that

M^{(1)} y_{1} = M^{(0)} y_{1}

.

Inductive step (

1 < i ⩽ m

). Assume (a)–(d) hold for all

1 ⩽ t < i

. □

Proof of (a).

We show that

M^{(i)}

is a

BST

-replacement of

M^{(i - 1)}

by verifying that the

BST

-conditions (8) hold for the pair

(M^{(i - 1)} x_{i}, \{M^{(i - 1)} y_{i}\})

relative to

M^{(i - 1)}

.

We first show that

M^{(i - 1)} x_{i} \neq ⌀

. By the inductive hypothesis (a) at step

i - 1

, we have

M^{(i - 1)} ⊧ φ \land Ξ_{φ}^{ψ}

. Therefore, by Lemma 10, we conclude that

M^{(i - 1)} x_{i} \neq ⌀

. Moreover,

\{M^{(i - 1)} y_{i}\} \neq ⌀

holds trivially. Thus, to complete the proof that

M^{(i)}

is a

BST

-replacement of

M^{(i - 1)}

, it remains to check that the following conditions hold for all

v \in Vars (φ \land ψ)

:

(A): $D ISJ (M^{(i - 1)} x_{i}, M^{(i - 1)} v) \lor M^{(i - 1)} x_{i} \subseteq M^{(i - 1)} v$ ;
(B): $D ISJ (\{M^{(i - 1)} y_{i}\} ∖ M^{(i - 1)} x_{i}, M^{(i - 1)} v)$ .

Concerning (A), recalling that

M^{(i - 1)} ⊧ φ \land Ξ_{φ}^{ψ}

, in particular we have

M^{(i - 1)} ⊧ \underset{v \in Vars (φ \land ψ)}{⋀} (\neg D ISJ (x_{i}, v) ⟶ x_{i} \subseteq v),

as such conjunction is explicitly contained in

Ξ_{φ}^{ψ}

. Thus,

\neg D ISJ (M^{(i - 1)} x_{i}, M^{(i - 1)} v) ⟹ M^{(i - 1)} x_{i} \subseteq M^{(i - 1)} v, for all v \in Vars (φ \land ψ),

which establishes (A). □

We now turn to clause (B). Its verification requires a case analysis, carried out for an arbitrary

v \in Vars (φ \land ψ)

.

If

D ISJ ({M^{(i - 1)} y_{i}}, M^{(i - 1)} v)

holds, then a fortiori so does

D ISJ ({M^{(i - 1)} y_{i}} ∖ M^{(i - 1)} x_{i}, M^{(i - 1)} v)

.

The complementary case

\neg D ISJ ({M^{(i - 1)} y_{i}}, M^{(i - 1)} v)

requires a much more detailed argument, which we provide below.

Thus, let

{M^{(i - 1)} y_{i}} \cap M^{(i - 1)} v \neq ⌀

, so that

M^{(i - 1)} y_{i} \in M^{(i - 1)} v

.

By Lemma 13,

rk (M^{(i - 1)} y_{i}) \neq ♭

, so

M^{(i - 1)} y_{i} \notin M^{(0)} v

, i.e.,

D ISJ ({M^{(i - 1)} y_{i}}, M^{(0)} v)

. Therefore, there must exist an index k with

1 ⩽ k ⩽ i - 1

such that

M^{(i - 1)} y_{i} \notin M^{(k - 1)} v and M^{(i - 1)} y_{i} \in M^{(k)} v .

Consequently,

M^{(k)} v = (M^{(k - 1)} v ∖ M^{(k - 1)} x_{k}) \cup {M^{(k - 1)} y_{k}},

hence,

M^{(k - 1)} y_{k} = M^{(i - 1)} y_{i} .

(A6)

Since by inductive hypothesis (a) at step k the assignment

M^{(k)}

is a

BST

-replacement of

M^{(k - 1)}

, it must be the case that

M^{(k - 1)} x_{k} \neq ⌀

, that is,

\neg D ISJ M^{(k - 1)} x_{k}, M^{(k - 1)} x_{k})

. Thus, we have

M^{(k)} x_{k} = (M^{(k - 1)} x_{k} ∖ M^{(k - 1)} x_{k}) \cup {M^{(k - 1)} y_{k}} = {M^{(k - 1)} y_{k}} = {M^{(k)} y_{k}},

(A7)

where the final equality follows by the inductive hypothesis (c).

Moreover, whether

k = i - 1

or

k < i - 1

, it follows in both cases that

M^{(k)} y_{k} = M^{(i - 1)} y_{k}

(A8)

(the former trivially; the latter by the inductive hypothesis (b₁)).

Thus, we have

\begin{matrix} M^{(i - 1)} y_{i} & = M^{(k - 1)} y_{k} & (by (A6)) \\ = M^{(k)} y_{k} & (by inductive hypothesis (c)) \\ = M^{(i - 1)} y_{k} & (by (A8)) . \end{matrix}

(A9)

Recalling again that

M^{(i - 1)} ⊧ Ξ_{φ}^{ψ}

, we obtain

M^{(i - 1)} ⊧ y_{k} = y_{i} ⟷ x_{k} = x_{i},

since

Ξ_{φ}^{ψ}

contains the conjunct

y_{k} = y_{i} ⟷ x_{k} = x_{i}

. In combination with (A9) and the equivalences (11) in Remark 5, this yields

M^{(i - 1)} x_{k} = M^{(i - 1)} x_{i} .

(A10)

Whether

k = i - 1

(immediate) or

k < i - 1

(by the inductive hypothesis (b₂)), we obtain

M^{(i - 1)} x_{k} = M^{(k)} x_{k} .

(A11)

Thus,

\begin{matrix} M^{(i - 1)} x_{i} & = M^{(i - 1)} x_{k} & (by (A10)) \\ = M^{(k)} x_{k} & (by (A11)) \\ = \{M^{(k)} y_{k}\} & (by (A7)) \\ = \{M^{(i - 1)} y_{i}\} & (by (A18)) . \end{matrix}

Hence,

{M^{(i - 1)} y_{i}} = M^{(i - 1)} x_{i}

. Therefore,

{M^{(i - 1)} y_{i}} ∖ M^{(i - 1)} x_{i} = ⌀

, and so the disjointness condition holds trivially. This completes the verification of (B) also in the case

\neg D ISJ ({M^{(i - 1)} y_{i}}, M^{(i - 1)} v)

. Since v was arbitrary, property (B) holds for all variables in

Vars (φ \land ψ)

. Together with (A), it follows that

M^{(i)}

is a

BST

-replacement of

M^{(i - 1)}

. Finally, recalling that, by inductive hypothesis (a) at step

i - 1

, we have

M^{(i - 1)} ⊧ φ \land Ξ_{φ}^{ψ}

, Lemma 6 yields

M^{(i)} ⊧ φ \land Ξ_{φ}^{ψ}

. Thus, clause (a) is established at step i.

Proof of (b).

Concerning (b₁), assume by way of contradiction that

M^{(j)} y_{j} \neq M^{(i)} y_{j}

for some j with

1 ⩽ j < i

, and let k be an index such that

j < k ⩽ i

and

M^{(k - 1)} y_{j} = M^{(j)} y_{j} while M^{(k)} y_{j} \neq M^{(j)} y_{j} .

Such an index k must exist; otherwise,

M^{(h)} y_{j} = M^{(j)} y_{j}

for all h with

j ⩽ h ⩽ i

, which would give

M^{(i)} y_{j} = M^{(j)} y_{j}

and contradict the assumption. Since

M^{(k - 1)} y_{j} \neq M^{(k)} y_{j}

, it follows that

M^{(k - 1)} x_{k} \subseteq M^{(k - 1)} y_{j}

and

M^{(k)} y_{j} = (M^{(k - 1)} y_{j} ∖ M^{(k - 1)} x_{k}) \cup \{M^{(k - 1)} y_{k}\}

. Moreover, as

M^{(k - 1)} ⊧ x_{k} \subseteq y_{j}

, the inductive hypothesis (a) guarantees that this inclusion already holds in

M^{(0)}

. Namely,

M^{(0)} x_{k} \subseteq M^{(0)} y_{j}

. Finally, by Lemma 10, we also know that

M^{(0)} x_{k} \neq ⌀

. Thus,

ℓ_{k} ≺^{M^{(0)}} ℓ_{j}

, implying

k < j

, a contradiction to

j < k

. Therefore,

M^{(j)} y_{j} = M^{(i)} y_{j}

for all

1 ⩽ j < i

. □

Regarding (b₂), assume by way of contradiction that

M^{(j)} x_{j} \neq M^{(i)} x_{j}

for some j with

1 ⩽ j < i

. Then, there must exist a k with

j < k ⩽ i

and such that

M^{(k - 1)} x_{j} = M^{(j)} x_{j} while M^{(k)} x_{j} \neq M^{(j)} x_{j},

(A12)

otherwise

M^{(h)} x_{j} = M^{(j)} x_{j}

for all h with

j ⩽ h ⩽ i

, yielding

M^{(i)} x_{j} = M^{(j)} x_{j}

and contradicting the assumption.

By inductive hypothesis (d) at step j, we have

M^{(j)} x_{j} = {M^{(j)} y_{j}} .

(A13)

Moreover, since

j ⩽ k - 1 < i

, the inductive hypothesis (b₂) gives

M^{(k - 1)} x_{j} = M^{(j)} x_{j} .

(A14)

At stage k,

M^{(k)}

is a

BST

-replacement of

M^{(k - 1)}

(by the inductive hypothesis (a) if

k < i

, or by clause (a) at step i if

k = i

). Thus, the pair

(M^{(k - 1)} x_{k}, \{M^{(k - 1)} y_{k}\})

satisfies the

BST

-replacement conditions relative to

M^{(k - 1)}

, and combined with (A12) this yields

⌀ \neq M^{(k - 1)} x_{k} \subseteq M^{(k - 1)} x_{j} .

(A15)

Since

M^{(j)} x_{j}

is a singleton, if follows from (A14) and (A15) that

M^{(k - 1)} x_{k} = M^{(k - 1)} x_{j} .

(A16)

Therefore,

M^{(k - 1)} ⊧ x_{k} = x_{j}

.

Because

M^{(0)} ⊧ Ξ_{φ}^{ψ}

, the inductive hypothesis (a) ensures

M^{(k - 1)} ⊧ Ξ_{φ}^{ψ}

. As

y_{k} = y_{j} ⟷ x_{k} = x_{j}

is a conjunct of

Ξ_{φ}^{ψ}

, we deduce

M^{(k - 1)} ⊧ y_{k} = y_{j} ⟷ x_{k} = x_{j},

hence

M^{(k - 1)} ⊧ y_{k} = y_{j}

. By the equivalences (11) in Remark 5, this gives

M^{(k - 1)} y_{k} = M^{(k - 1)} y_{j} .

(A17)

Finally, since

j < k ⩽ i

, the inductive hypothesis (b₁) entails

M^{(j)} y_{j} = M^{(k - 1)} y_{j} .

(A18)

Putting everything together, we obtain

\begin{matrix} M^{(k)} x_{j} & = (M^{(k - 1)} x_{j} ∖ M^{(k - 1)} x_{k}) \cup {M^{(k - 1)} y_{k}} & (by (A15) and (A16)) \\ = {M^{(k - 1)} y_{k}} \\ = {M^{(j)} y_{j}} & (by (A17) and (A18)) \\ = M^{(j)} x_{j} & (by (A13)) . \end{matrix}

This contradicts (A12), and therefore clause (b₂) is established at step i.

Proof of (c).

We need to show that

M^{(i)} y_{i} = M^{(i - 1)} y_{i}

. To this end, recall first that

M^{(0)} ⊧ Ξ_{φ}^{ψ}

. As already noted, we have

M^{(i - 1)} ⊧ Ξ_{φ}^{ψ}

. Lemma 10 then gives

D ISJ (M^{(i - 1)} x_{i}, M^{(i - 1)} y_{i})

, and by the definition (7) of

M^{(i)} y_{i}

we conclude that

M^{(i)} y_{i} = M^{(i - 1)} y_{i}

. This completes the verification of clause (c) at step i. □

Proof of (d).

Since, by (a),

M^{(i)}

is a

BST

-replacement of

M^{(i - 1)}

from

M^{(i - 1)} x_{i}

to

{M^{(i - 1)} y_{i}}

, we must have

M^{(i - 1)} x_{i} \neq ⌀

, equivalently

\neg D ISJ (M^{(i - 1)} x_{i},, M^{(i - 1)} x_{i})

. By the definition of

BST

-replacement, this yields

\begin{matrix} M^{(i)} x_{i} & = (M^{(i - 1)} x_{i} ∖ M^{(i - 1)} x_{i}) \cup {M^{(i - 1)} y_{i}} \\ = {M^{(i - 1)} y_{i}} = {M^{(i)} y_{i}}, \end{matrix}

where the last equality follows from clause (c). This establishes (d). Since the inductive step is now complete, and the base case was already established, the lemma follows. □

References

Cantone, D.; Omodeo, E.G. Onset and today’s perspectives of multilevel syllogistic. In From Computational Logic to Computational Biology: Essays Dedicated to Alfredo Ferro to Celebrate His Scientific Career; Cantone, D., Pulvirenti, A., Eds.; Springer Nature: Cham, Switzerland, 2024; Volume 14070 of LNCS, pp. 9–55. [Google Scholar] [CrossRef]
Schwartz, J.T. Instantiation and Decision Procedures for Certain Classes of Quantified Set-Theoretic Formulae; Technical Report 78-10; Institute for Computer Applications in Science and Engineering, NASA Langley Research Center: Hampton, Virginia, 1978. [Google Scholar]
Schwartz, J.T.; Cantone, D.; Omodeo, E.G. Computational Logic and Set Theory—Applying Formalized Logic to Analysis; Springer: Berlin/Heidelberg, Germany, 2011. [Google Scholar]
Cantone, D.; Domenico, A.D.; Maugeri, P.; Omodeo, E.G. Complexity assessments for decidable fragments of set theory. I: A taxonomy for the Boolean case. Fundam. Informaticae 2021, 181, 37–69. [Google Scholar] [CrossRef]
Cantone, D.; Maugeri, P.; Omodeo, E.G. Complexity assessments for decidable fragments of set theory. II: A taxonomy for ‘small’ languages involving membership. Theor. Comput. Sci. 2020, 848, 28–46. [Google Scholar] [CrossRef]
Kuncak, V.; Nguyen, H.H.; Rinard, M. Deciding Boolean algebra with Presburger arithmetic. J. Autom. Reason. 2006, 36, 213–239. [Google Scholar] [CrossRef]
Rabin, M.O. Decidable theories. In Handbook of Mathematical Logic, Volume 90 of Studies in Logic and the Foundations of Mathematics; Barwise, J., Ed.; Elsevier (North-Holland Publishing Co.): Amsterdam, The Netherlands, 1977; pp. 595–629. [Google Scholar]
Cantone, D.; Omodeo, E.G.; Policriti, A. The automation of syllogistic. II: Optimization and complexity issues. J. Autom. Reason. 1990, 6, 173–188. [Google Scholar] [CrossRef]
Cormen, T.H.; Leiserson, C.E.; Rivest, R.L.; Stein, C. Introduction to Algorithms, 3rd ed.; MIT Press: Cambridge, MA, USA, 2009. [Google Scholar]
Angrisani, F.; Ascione, G.; Manzo, G. Orlicz spaces with a O- Type Structure. Ric. Mat. 2019, 68, 841–857. [Google Scholar] [CrossRef]
Jiménez-Garrido, J.; Sanz, J.; Schindl, G. Indices of O-regular variation for weight functions and weight sequences. Rev. Real Acad. Cienc. Exactas Físicas Nat. Ser. A Matemáticas 2018, 113, 3659–3697. [Google Scholar] [CrossRef]
Cantone, D.; Ferro, A. Techniques of computable set theory with applications to proof verification. Commun. Pure Appl. Math. 1995, 48, 901–945. [Google Scholar] [CrossRef]
Parlamento, F.; Policriti, A.; Rao, K.P.S.B. Witnessing Differences without Redundancies. Proc. Am. Math. Soc. 1997, 125, 587–594. [Google Scholar] [CrossRef]
Hooker, J.N. A quantitative approach to logical inference. Decis. Support Syst. 1988, 4, 45–69. [Google Scholar] [CrossRef]
Hooker, J.N.; Fedjiki, C. Branch-and-cut solution of inference problems in propositional logic. Ann. Math. Artif. Intell. 1990, 1, 123–139. [Google Scholar] [CrossRef]
Cantone, D.; Omodeo, E.G.; Policriti, A. Set Theory for Computing. From Decision Procedures to Declarative Programming with Sets; Monographs in Computer Science; Springer: Berlin/Heidelberg, Germany, 2001. [Google Scholar]
Tarski, A. Ordinal Algebras; Studies in Logic and the Foundations of Mathematics; North-Holland: Amsterdam, The Netherlands, 1956. [Google Scholar]
Hibti, M. Décidabilité et Complexité de Systèmes de Contraintes Ensemblistes. PhD Thesis, Université de Franche-Comté, Besançon, France, 1995. [Google Scholar]
Hibti, M.; Lombardi, H.; Legeard, B. Deciding in HFS-theory via linear integer programming. In Logic Programming and Automated Reasoning, Proceedings of the 4th Int’l Conference, LPAR’93, St. Petersburg, Russia, 13–20 July 1993; Voronkov, A., Ed.; Springer: Berlin/Heidelberg, Germany, 1993; Volume 698, pp. 170–181. [Google Scholar]
Hibti, M.; Legeard, B.; Lombardi, H. Une procédure de décision pour un problème de satisfiabilité dans un univers ensembliste héréditairement fini. RAIRO Theor. Inform. Appl. 1997, 31, 205–236. [Google Scholar] [CrossRef]
Cristiá, M.; Rossi, G. Integrating cardinality constraints into constraint logic programming with sets. Theory Pract. Log. Program. 2021, 23, 468–502. [Google Scholar] [CrossRef]
Givan, R.; McAllester, D.A.; Witty, C.; Kozen, D. Tarskian Set Constraints. Inf. Comput. 2002, 174, 105–131. [Google Scholar] [CrossRef]
Raya, R.; Hamza, J.; Kunčak, V. On the Complexity of Convex and Reverse Convex Prequadratic Constraints. In LPAR 2023: Proceedings of the 24th International Conference on Logic for Programming, Artificial Intelligence and Reasoning, Manizales, Colombia, 4–9 June 2023; Piskac, R., Voronkov, A., Eds.; EasyChair: Stockport, UK, 2023; Volume 94, pp. 350–368. [Google Scholar] [CrossRef]

Figure 1. Axioms of the variety of difference algebras.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Cantone, D.; De Domenico, A.; Maugeri, P.; Omodeo, E.G. Complexity Assessments for Decidable Fragments of Set Theory. IV: A Quadratic Reduction from Constraints over Nested Sets to Boolean Formulae. Foundations 2026, 6, 3. https://doi.org/10.3390/foundations6010003

AMA Style

Cantone D, De Domenico A, Maugeri P, Omodeo EG. Complexity Assessments for Decidable Fragments of Set Theory. IV: A Quadratic Reduction from Constraints over Nested Sets to Boolean Formulae. Foundations. 2026; 6(1):3. https://doi.org/10.3390/foundations6010003

Chicago/Turabian Style

Cantone, Domenico, Andrea De Domenico, Pietro Maugeri, and Eugenio G. Omodeo. 2026. "Complexity Assessments for Decidable Fragments of Set Theory. IV: A Quadratic Reduction from Constraints over Nested Sets to Boolean Formulae" Foundations 6, no. 1: 3. https://doi.org/10.3390/foundations6010003

APA Style

Cantone, D., De Domenico, A., Maugeri, P., & Omodeo, E. G. (2026). Complexity Assessments for Decidable Fragments of Set Theory. IV: A Quadratic Reduction from Constraints over Nested Sets to Boolean Formulae. Foundations, 6(1), 3. https://doi.org/10.3390/foundations6010003

Article Menu

Complexity Assessments for Decidable Fragments of Set Theory. IV: A Quadratic Reduction from Constraints over Nested Sets to Boolean Formulae

Abstract

1. Introduction

2. The Theories $BST$ and ${BST}^{+}$

The von Neumann Universe

3. From Existential Expressibility to $O (f)$ -Expressibility Across Theories

4. $BST$ -Replacements and Flat Models

4.1. Replacement Assignments and $BST$ -Replacements

4.2. ♭-Flat Models

5. Existential Inexpressibility of $z = \{x\}$ in ${BST}^{+}$

6. $O (n^{2})$ -Expressibility in ${BST}^{+}$ of Singleton-Atom Conjunctions

6.1. Design and Analysis of the Translation Algorithm

6.2. Illustrating the Translation $Ξ_{φ}^{ψ}$ on Examples

6.3. Satisfiability Preservation

6.4. Model Extension

7. Conclusions: Related and Planned Work

7.1. Envisaged Enhancements to the Nested-to-Flat Translation

7.2. Towards Integrating Set-Theoretic, Boolean, and Numerical Constraint Reasoning

7.3. Difference Algebras

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A. A Lower Bound on the Number of Sets of a Fixed Integer Rank

Appendix B. Proof of Lemma 14

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Complexity Assessments for Decidable Fragments of Set Theory. IV: A Quadratic Reduction from Constraints over Nested Sets to Boolean Formulae

Abstract

1. Introduction

2. The Theories BST and BST +

The von Neumann Universe

3. From Existential Expressibility to O ( f ) -Expressibility Across Theories

4. BST -Replacements and Flat Models

4.1. Replacement Assignments and BST -Replacements

4.2. ♭-Flat Models

5. Existential Inexpressibility of z = x in BST +

6. O ( n 2 ) -Expressibility in BST + of Singleton-Atom Conjunctions

6.1. Design and Analysis of the Translation Algorithm

6.2. Illustrating the Translation Ξ φ ψ on Examples

6.3. Satisfiability Preservation

6.4. Model Extension

7. Conclusions: Related and Planned Work

7.1. Envisaged Enhancements to the Nested-to-Flat Translation

7.2. Towards Integrating Set-Theoretic, Boolean, and Numerical Constraint Reasoning

7.3. Difference Algebras

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A. A Lower Bound on the Number of Sets of a Fixed Integer Rank

Appendix B. Proof of Lemma 14

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

2. The Theories $BST$ and ${BST}^{+}$

3. From Existential Expressibility to $O (f)$ -Expressibility Across Theories

4. $BST$ -Replacements and Flat Models

4.1. Replacement Assignments and $BST$ -Replacements

5. Existential Inexpressibility of $z = \{x\}$ in ${BST}^{+}$

6. $O (n^{2})$ -Expressibility in ${BST}^{+}$ of Singleton-Atom Conjunctions

6.2. Illustrating the Translation $Ξ_{φ}^{ψ}$ on Examples