Three Problems for Decision Rule Systems from Closed Classes

Kerven Durdymyradov; Mikhail Moshkov

doi:10.3390/axioms14080648

and

Computer, Electrical and Mathematical Sciences & Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Saudi Arabia

^*

Author to whom correspondence should be addressed.

Axioms2025, 14(8), 648;https://doi.org/10.3390/axioms14080648

Version Notes

Order Reprints

Abstract

The study of the relationships between DRSs (Decision Rule Systems) and DTs (Decision Trees) is of considerable interest in computer science. In this paper, we consider classes of DRSs that are closed under specific operations. First, we examine classes that are closed under the operation of the removal of features and analyze the functions characterizing the worst-case dependence of the minimum depth of DDTs (Deterministic Decision Trees) and NDTs (Nondeterministic Decision Trees), solving the task of finding all true DRs in a DRS on the number of different features in the system. Second, we extend our analysis to classes that are closed under the removal of features and rules, studying the worst-case behavior of the minimum DT depth for the task of finding at least one true DR. Third, we investigate classes closed under the removal of features and rules in the context of finding all right-hand sides of true DRs. We prove that, in all three cases, the corresponding functions characterizing the worst-case minimum depth of DTs are either bounded from above by a constant or grow linearly.

Keywords:

closed class; decision rule system; deterministic decision tree; nondeterministic decision tree

MSC:

68Q25

1. Introduction

DTs (Decision Trees) [1,2,3,4,5] and DRSs (Decision Rule Systems) [6,7,8,9,10,11] are common tools for structuring and expressing knowledge. They act as classifiers, providing predictions for unseen cases, and are also employed as algorithms in diverse domains such as fault diagnosis, combinatorial optimization, and beyond. Among classification and knowledge representation models, DTs and DRSs stand out for their high level of interpretability [12,13]. Exploring the connections and transformations between DTs and DRSs has become an important focus of research within computer science.

In this work, we examine classes of DRSs that are closed under certain operations. The operation of feature removal is natural: if we have a DRS but cannot work with one of its features because it is unavailable for some reason, we can remove this feature from the DR and try to work with the resulting DRS. Similarly, rule removal is a fundamental operation that allows us to disregard certain DRs while maintaining the structure of the system.

One of the main tasks associated with a DRS is to find, given a tuple of feature values, all DRs that are true on this tuple (that have a true left-hand side). To solve this task, we use DDTs (Deterministic Decision Trees) and NDTs (Nondeterministic Decision Trees). In addition, we consider two related tasks: finding at least one true DR in a DRS and finding all right-hand sides of true DRs.

For an arbitrary closed class of DRSs, we investigate functions that describe, in the worst-case scenario, how the minimal depth of DDTs and NDTs required to solve these problems depends on the number of distinct features in the system. Specifically, we analyze the task of finding all true DRs in classes closed under the removal of features, the task of finding at least one true DR in classes closed under the removal of features and rules, and the task of finding all right-hand sides of true DRs in classes closed under the same operations. We prove that, in all three cases, the behavior of these functions is such that they are bounded from above by a constant value or have linear growth.

In this paper, we continue to develop a syntactic approach to the study of the relationships between DRSs and DTs, outlined in [14,15]. This method relies on the assumption that we have access only to the DRS itself, without the underlying input data. An overview of earlier research in this area is presented primarily in one book [16]. In that book, we considered all three tasks: finding all true DRs, finding at least one true DR, and finding all right-hand sides of true DRs. However, unlike in this paper, we did not consider NDTs and classes of DRSs closed under specific operations.

Note that this paper is an extension of two conference papers: in [17], we analyzed the problem of finding all realizable rules, and in [18], we studied the problem of finding at least one realizable rule.

The structure of the paper is as follows. In Section 2, we provide the key definitions and notation, which are closely aligned with those in [16]. Section 3 is devoted to the presentation of the main results. Finally, Section 4 presents brief conclusions.

2. Definitions

This section introduces the notation and key definitions for DRSs and DTs.

2.1. DRSs—Decision Rule Systems

Let

N_{0} = {0, 1, 2, \dots}

and

F = {f_{i} : i \in N_{0}}

. Elements of the set F will be called features. Let

k \in N_{0} ∖ {0, 1}

. Let

E_{k} = {0, 1, \dots, k - 1}

.

Let

E S_{k}

represent the set of equation systems of the form

{f_{i_{1}} = v_{1}, \dots, f_{i_{m}} = v_{m}},

where

m \in N_{0}

,

f_{i_{1}}, \dots,

f_{i_{m}} \in F

and

v_{1}, \dots, v_{m} \in E_{k}

. We say that the system is inconsistent if there exist indices

t, p \in 1, \dots, m

with

t \neq p

, such that

i_{t} = i_{p}

but

v_{t} \neq v_{p}

. Otherwise, the system of equations will be consistent.

Definition 1.

A k-DR (Decision Rule) is defined as an expression of the form

(f_{i_{1}} = v_{1}) \land \dots \land (f_{i_{m}} = v_{m}) \to σ,

where

m \in N_{0} ∖ {0}

,

f_{i_{1}}, \dots, f_{i_{m}}

are pairwise different features from F,

v_{1}, \dots,

v_{m} \in E_{k}

, and

σ \in N_{0}

.

This DR will be denoted by r. The expression

(f_{i_{1}} = v_{1}) \land \dots \land (f_{i_{m}} = v_{m})

is referred to as the left-hand side, while the value

σ

is termed the right-hand side of r. The integer m is called the length of the DR and is written as

l (r)

. Let

F t (r) = {f_{i_{1}}, \dots, f_{i_{m}}}

and

K (r) = {f_{i_{1}} = v_{1}, \dots, f_{i_{m}} = v_{m}}

. Two DRs,

r_{1}

and

r_{2}

, will be called equal if

K (r_{1}) = K (r_{2})

and the right-hand sides of the DRs

r_{1}

and

r_{2}

are equal.

Definition 2.

A k-DRS (Decision Rule System) S is defined as a finite set of k-DRs.

We define

F t (S) = ⋃_{r \in S} F (r)

,

D (S)

as the set of right-hand sides of all DRs in S, and

l (S) = max {l (r) : r \in S}

for a nonempty DRS S. If

S = \emptyset

, then

F t (S) = D (S) = \emptyset

and

l (S) = 0

.

Let

S \neq \emptyset

and

F t (S) = {f_{j_{1}}, \dots, f_{j_{n}}}

, where

j_{1} < \dots < j_{n}

. For

\bar{v} = (v_{1}, \dots, v_{n}) \in E_{k}^{n}

, denote

K (S, \bar{v}) = {f_{j_{1}} = v_{1}, \dots, f_{j_{n}} = v_{n}}

.

A DR

r \in S

is said to be true for a tuple

\bar{v} \in E_{k}^{| F t (S) |}

whenever

K (r) \subseteq K (S, \bar{v})

. We denote by

S (\bar{v})

the subset of S consisting of all such rules that hold for

\bar{v}

.

Definition 3.

All Rules task for a DRS S: given a tuple

\bar{v} \in E_{k}^{| F t (S) |}

, determine the subset of rules

S (\bar{v})

. This task will be denoted by

AR (S)

.

Definition 4.

Some Rules task for a DRS S: given a tuple

\bar{v} \in E_{k}^{| F t (S) |}

, determine a subset

Z \subseteq S

such that

Every rule in Z is true for the $\bar{v}$ ;
If $Z = \emptyset$ , then no rule from S is true for $\bar{v}$ .

We denote this task as

SR (S)

.

Definition 5.

All Decisions task for a DRS S: given

\bar{v} \in E_{k}^{| F t (S) |}

, determine a subset

Z \subseteq S

such that

Every rule in Z is true for the $\bar{v}$ ;
For every $σ \in D (S) ∖ D (Z)$ , any DR from S with the right-hand side equal to σ is not true for the tuple $\bar{v}$ .

We denote this task as

AD (S)

.

We now define the operation of removal of a feature

f_{i} \in F t (S)

from the DRSS. Let

r \in S

. If

f_{i} \notin F t (r)

, then denote

r_{f_{i}} = r

. If

f_{i} \in F t (r)

, then denote by

r_{f_{i}}

the DR derived from r by the removal from the left-hand side of r the equality containing the feature

f_{i}

. We denote

S (f_{i}) = {r_{f_{i}} : r \in F t (S), l (r_{f_{i}}) > 0}

. The DRS

S (f_{i})

is the result of applying to S the operation of removal feature

f_{i}

.

The operation of removal of a rule from the DRS S is defined in a natural way: we can remove from S an arbitrary DR.

Later, we will consider two types of closed classes of DRSs: classes of k-DRSs closed under the removal of features and classes of k-DRSs closed under the removal of features and rules.

It is easy to see that the empty DRS belongs to any closed class.

2.2. DTs—Decision Trees

A finite rooted directed tree is a finite directed tree with exactly one vertex that has no incoming edges, referred to as the root. Vertices with no outgoing edges are called leaves, while those that are neither root nor leaves are termed internal vertices. A complete path is a sequence

ξ = u_{1}, e_{1}, \dots, u_{m}, e_{m}, u_{m + 1}

where

u_{1}

is the root,

u_{m + 1}

is a leaf, and each

e_{i}

connects

u_{i}

to

u_{i + 1}

for

i = 1, \dots, m

.

Consider S to be a nonempty k-DRS.

Definition 6.

A DT (Decision Tree) over the DRS S is a finite rooted labeled directed tree

G

with at least two vertices, such that

The root and its outgoing edges are not labeled;
Every internal vertex of $G$ is labeled by a feature from $F t (S)$ , and the edges leaving it are labeled by elements of $E_{k}$ ;
Each leaf of $G$ is labeled by a subset of S.

Definition 7.

A DT

G

over S is termed a DDT (Deterministic Decision Tree) if exactly one edge leaves from the root and, at every internal vertex, the outgoing edges are labeled with pairwise distinct labels.

Let

G

be a DT over S. Denote by

C P (G)

the set of all complete paths in the DT

G

. For a complete path

ξ = u_{1}, e_{1}, \dots, u_{m}, e_{m}, u_{m + 1}

, we associate an equation system

K (ξ) \in E S_{k}

. If

m = 1

and

ξ = u_{1}, e_{1}, u_{2}

, then

K (ξ) = \emptyset

. For

m \geq 2

, each vertex

u_{j}

(

j = 2, \dots, m

) is labeled by a feature

f_{i_{j}}

, and the edge

e_{j}

is labeled with a value

v_{j} \in E_{k}

. In this case,

K (ξ) = {f_{i_{2}} = v_{2}, \dots, f_{i_{m}} = v_{m}}

. We denote by

τ (ξ)

the collection of DRs attached to the leaf

u_{m + 1}

.

Let

G

be a DT over S,

\bar{v} \in E_{k}^{| F t (S) |}

, and

ξ \in C P (G)

. The path

ξ

is said to accept the tuple

\bar{v}

whenever

K (ξ) \subseteq K (S, \bar{v})

.

For each of the three tasks under consideration, we give definitions of DTs that solve them nondeterministically. After each definition, we add some explanations about a complete path

ξ

accepting a tuple

\bar{v}

in a DT

G

that solves the task. In particular, these definitions and explanations entail Remarks 1, 2, and 3 below.

Definition 8.

We say that

G

nondeterministically solves the task

AR (S)

if, for every tuple

\bar{v} \in E_{k}^{| F t (S) |}

, there exists a complete path

ξ \in C P (G)

that accepts

\bar{v}

, and each complete path ξ with a consistent equation system

K (ξ)

satisfies the following:

For every $r \in τ (ξ)$ , we have $K (r) \subseteq K (ξ)$ .
For every $r \in S ∖ τ (ξ)$ , the union $K (r) \cup K (ξ)$ is inconsistent.

In this situation,

G

is referred to as an NDT (Nondeterministic Decision Tree) solving

AR (S)

.

Let

G

solve the task

AR (S)

nondeterministically,

ξ \in C P (G)

,

\bar{v} \in E_{k}^{| F t (S) |}

, and

ξ

accept

\bar{v}

. This means that

K (ξ) \subseteq K (S, \bar{v})

. From this, it follows that

K (ξ)

is consistent. Then, for any DR

r \in τ (ξ)

, we have

K (r) \subseteq K (ξ)

and

K (r) \subseteq K (S, \bar{v})

. Therefore, r is true for

\bar{v}

.

We also have that, for any DR

r \in S ∖ τ (ξ)

, the system of equations

K (r) \cup K (ξ)

is inconsistent. Since

K (ξ) \subseteq K (S, \bar{v})

, it follows that

K (r) \cup K (S, \bar{v})

is also inconsistent. Therefore, r is not true for

\bar{v}

. Thus,

τ (ξ) = S (\bar{v})

.

Remark 1.

Let

G

solve the task

AR (S)

nondeterministically,

ξ \in C P (G)

,

\bar{v} \in E_{k}^{| F t (S) |}

, and ξ accept

\bar{v}

. Then,

τ (ξ) = S (\bar{v})

.

Definition 9.

We say that

G

nondeterministically solves the task

SR (S)

if, for every tuple

\bar{v} \in E_{k}^{| F t (S) |}

, there exists a complete path

ξ \in C P (G)

that accepts

\bar{v}

, and each complete path ξ with consistent

K (ξ)

satisfies the following:

For each $r \in τ (ξ)$ , we have $K (r) \subseteq K (ξ)$ .
If $τ (ξ) = \emptyset$ , then $K (r) \cup K (ξ)$ is inconsistent for every $r \in S$ .

In this case,

G

is called an NDT (Nondeterministic Decision Tree) that solves

SR (S)

.

Let

G

solve the task

SR (S)

nondeterministically,

ξ \in C P (G)

,

\bar{v} \in E_{k}^{| F t (S) |}

, and

ξ

accept

\bar{v}

. This means that

K (ξ) \subseteq K (S, \bar{v})

. From this, it follows that

K (ξ)

is consistent. Then, for any DR

r \in τ (ξ)

, we have

K (r) \subseteq K (ξ)

and

K (r) \subseteq K (S, \bar{v})

. Therefore, r is true for

\bar{v}

.

If

τ (ξ) = \emptyset

, then for any DR

r \in S

, the system of equations

K (r) \cup K (ξ)

is inconsistent. Since

K (ξ) \subseteq K (S, \bar{v})

, it follows that

K (r) \cup K (S, \bar{v})

is also inconsistent. Therefore, r is not true for

\bar{v}

.

Definition 10.

We say that

G

nondeterministically solves the task

AD (S)

if, for every tuple

\bar{v} \in E_{k}^{| F t (S) |}

, there exists a complete path

ξ \in C P (G)

that accepts

\bar{v}

, and each complete path ξ with consistent

K (ξ)

satisfies the following:

For all $r \in τ (ξ)$ , we have the relation $K (r) \subseteq K (ξ)$ .
If $r \in S ∖ τ (ξ)$ and the right-hand side of r does not belong to the set $D (τ (ξ))$ , then the system of equations $K (r) \cup K (ξ)$ is inconsistent.

In this case,

G

is called an NDT (Nondeterministic Decision Tree) that solves

AD (S)

.

Let

G

solve the task

AD (S)

nondeterministically,

ξ \in C P (G)

,

\bar{v} \in E_{k}^{| F t (S) |}

, and

ξ

accept

\bar{v}

. This means that

K (ξ) \subseteq K (S, \bar{v})

. From this, it follows that

K (ξ)

is consistent. Then, for any DR

r \in τ (ξ)

, we have

K (r) \subseteq K (ξ)

and

K (r) \subseteq K (S, \bar{v})

. Therefore, r is true for

\bar{v}

.

If

r \in S ∖ τ (ξ)

and the right-hand side of r is not contained in

D (τ (ξ))

, then the union

K (r) \cup K (ξ)

is inconsistent. Because

K (ξ) \subseteq K (S, \bar{v})

, this implies that

K (r) \cup K (S, \bar{v})

is also inconsistent. Therefore, r is not true for

\bar{v}

.

Remark 2.

Let

T \in {AR, SR, AD}

,

G

solve the task

T (S)

nondeterministically,

ξ \in C P (G)

,

\bar{v} \in E_{k}^{| F t (S) |}

, ξ accept

\bar{v}

, and

τ (ξ) = \emptyset

. Then, for any DR

r \in S

, the system of equations

K (r) \cup K (ξ)

is inconsistent, and the DR r is not true for

\bar{v}

.

Remark 3.

Let

T \in {AR, SR, AD}

,

G

solve the task

T (S)

nondeterministically,

ξ \in C P (G)

,

\bar{v} \in E_{k}^{| F t (S) |}

, ξ accept

\bar{v}

, and

S (\bar{v}) = \emptyset

. Then,

τ (ξ) = \emptyset

.

Definition 11.

Let

T \in {AR, SR, AD}

. We say that

G

solves the task

T (S)

deterministically if it is a DDT (Deterministic Decision Tree) that also solves task

T (S)

in the nondeterministic sense. In this case,

G

is referred to as a DDT solving

T (S)

.

Definition 12.

For each complete path

ξ \in C P (G)

, let

h (ξ)

denote the number of internal nodes along ξ. The value

h (G) = max {h (ξ) : ξ \in C P (G)}

is defined as the depth of the DT

G

.

Let S be a nonempty k-DRS and let

T \in AR, SR, AD

. Define

h_{T}^{d} (S)

as the minimum depth of a DDT over S that solves

T (S)

, and

h_{T}^{a} (S)

as the minimum depth of an NDT over S that solves

T (S)

. For the empty system

S = \emptyset

, we set

h_{T}^{d} (S) = h_{T}^{a} (S) = 0

.

3. Main Results

Let

T \in {AR, SR, AD}

, and C be a class of k-DRSs closed under the removal of features if

T = AR

; otherwise, C is a class of k-DRSs closed under the removal of features and rules. In this section, we investigate the functions

H_{C, T}^{d} : N_{0} \to N_{0}

and

H_{C, T}^{a} : N_{0} \to N_{0}

, which are defined as follows. Let

n \in N_{0}

, then

\begin{matrix} H_{C, T}^{d} (n) & = max {h_{T}^{d} (S) : S \in C, | F t (S) | \leq n}, \\ H_{C, T}^{a} (n) & = max {h_{T}^{a} (S) : S \in C, | F t (S) | \leq n} . \end{matrix}

These functions characterize the minimum depth of DTs solving the task

T

for systems from the closed class C growthin the worst case with the growth of the number of different features in the DRSs. In the case of the function

H_{C, T}^{d}

, we consider DDTs, and in the case of the functions

H_{C, T}^{a}

, we consider NDTs. We begin by establishing several auxiliary results.

Lemma 1.

For any k-DRS S and

T \in {AR, SR, AD}

, the inequality

h_{T}^{d} (S) \leq | F t (S) |

holds.

Proof.

If the DRS S is empty, then

h_{T}^{d} (S) = | F t (S) | = 0

. Let S be a nonempty DRS. It is easy to show that we can construct a DDT solving the task

T (S)

by the sequential computation of all features from

F t (S)

. The depth of this DT is equal to

| F t (S) |

. Thus,

h_{T}^{d} (S) \leq | F t (S) |

. □

Lemma 2.

Let

T \in {AR, SR, AD}

, and C be a class of k-DRSs closed under the removal of features if

T = AR

; otherwise, C is a class of k-DRSs closed under the removal of features and rules. Then, for any

n \in N_{0}

, the values

H_{C, T}^{d} (n)

and

H_{C, T}^{a} (n)

are defined and the inequalities

0 \leq H_{C, T}^{a} (n) \leq H_{C, T}^{d} (n) \leq n

hold.

Proof.

Let

S \in C

. If the DRS S is empty, then

h_{T}^{a} (S) = h_{T}^{d} (S) = | F t (S) | = 0

. Let S be a nonempty DRS. It is clear that any DDT solving the task

T (S)

is an NDT solving the task

T (S)

. Therefore

h_{T}^{a} (S) \leq h_{T}^{d} (S)

. From Lemma 1, it follows that

h_{T}^{d} (S) \leq | F t (S) |

. From all of the above, the statement of the lemma follows. □

Lemma 3.

Let

T \in {AR, SR, AD}

, and C be a class of k-DRSs closed under the removal of features if

T = AR

; otherwise, C is a class of k-DRSs closed under the removal of features and rules. If the value

| F t (S) |

for DRSs

S \in C

is bounded from above by a positive constant b, then for any

n \in N_{0}

,

0 \leq H_{C, T}^{a} (n) \leq H_{C, T}^{d} (n) \leq b .

Proof.

Let the value

| F t (S) |

for DRSs

S \in C

be bounded from above by a positive constant b. Using Lemma 1, we obtain that for any

S \in C

,

h_{T}^{d} (S) \leq | F t (S) | \leq b

. Therefore,

H_{C, T}^{d} (n) \leq b

for any

n \in N_{0}

. Using Lemma 2, we obtain that

0 \leq H_{C, T}^{a} (n) \leq H_{C, T}^{d} (n) \leq b

for any

n \in N_{0}

. □

Lemma 4.

Let C be a class of k-DRSs closed under the removal of features, and let the value

| F t (S) |

for systems

S \in C

not be bounded from above. If the value

l (S)

for systems

S \in C

is not bounded from above, then for any

n \in N_{0} ∖ {0}

, there exists a DRS

S^{'}

from C with

| F t (S^{'}) | = n

that contains a DR of the length n.

Proof.

Let

n \in N_{0} ∖ {0}

. Let S be a system from C such that

l (S) \geq n

, and r be a DR from S, the length of which is at least n. We remove from r (and from

S)

l (r) - n

features and obtain a DR

r^{'}

of the length n. We also remove from S all features that do not belong to the set

F t (r^{'})

. As a result, we obtain a DRS

S^{'}

from C with

| F t (S^{'}) | = n

containing the DR

r^{'}

of length n. □

Lemma 5.

Let C be a class of k-DRSs closed under the removal of features, and the value

| F t (S) |

for systems

S \in C

be not bounded from above. If the value

l (S)

for system

S \in C

is bounded from above, then for any

n \in N_{0} ∖ {0}

, there is a DRS

S^{*}

from C with

| F t (S^{*}) | = n

that contains n DRs of length 1 with pairwise different features.

Proof.

Let the value

l (S)

for DRSs

S \in C

be bounded from above by a positive integer l.

Let

t \in N_{0} ∖ {0}

and

S \in C

. We denote by

S_{t}

the set of DRs from S, the length of which is equal to t. We know that the value

| F t (S) |

for DRSs

S \in C

is not bounded from above by a constant. We now show that there exists

t \in {1, \dots, l}

for which the value

| F t (S_{t}) |

for DRSs

S \in C

is not bounded from above by a constant. Assume the contrary. Then, there exists a positive constant b such that, for any

t \in {1, \dots, l}

, the value

| F t (S_{t}) |

for DRSs

S \in C

is at most b. It is clear that

| F t (S_{t}) | = 0

if

t > l

. Therefore, for any

S \in C

,

| F t (S) | \leq | F t (S_{1}) | + \dots + | F t (S_{l}) |

. As a result, we obtain that

| F t (S) | \leq b l

for any

S \in C

, which is impossible.

We denote by

t_{0}

the minimum number

t \in {1, \dots, l}

for which the value

| F t (S_{t}) |

for DRSs

S \in C

is not bounded from above by a constant. We will show that

t_{0} = 1

.

Let

S \in C

and

r \in S

. The set

F t (r)

will be called the type of the DR r. For

f_{i} \in F t (S)

, we denote by

d_{f_{i}} (S)

the number of DRs

ρ \in S

with pairwise different types such that

f_{i} \in F t (ρ)

. Denote

d (S) = max {d_{f_{i}} (S) : f_{i} \in F t (S)}

.

First, we consider the case when the value

d (S_{t_{0}})

for DRSs

S \in C

is bounded from above by a positive integer d. Let

n \in N_{0} ∖ {0}

and S be a DRS from C for which

| F t (S_{t_{0}}) | \geq n d (t_{0} - 1)

. Denote

Q = S_{t_{0}}

. Choose a feature

f_{i_{1}} \in F t (Q)

and remove from the DRSs Q and S all features with the exception of

f_{i_{1}}

, which belong to DRs

ρ \in Q

, such that

f_{i_{1}} \in F t (ρ)

. As a result, we remove at most

d (t_{0} - 1)

features and obtain DRSs

Q^{1}

and

S^{1}

. One can show that

| F t (Q^{1}) | \geq (n - 1) d (t_{0} - 1)

,

d (Q^{1}) \leq d

, and

l (Q^{1}) \leq t_{0}

. If

n > 1

, choose a feature

f_{i_{2}} \in F t (Q^{1})

different from

f_{i_{1}}

and remove from the DRSs

Q^{1}

and

S^{1}

all features with the exception of

f_{i_{2}}

, which belong to DRs

ρ \in Q^{1}

such that

f_{i_{2}} \in F t (ρ)

. We denote by

Q^{2}

and

S^{2}

the obtained DRSs, etc. We repeat the described procedure n times and obtain DRSs

Q^{n}

and

S^{n}

. In each of these DRSs, there are n pairwise different features

f_{i_{1}}, \dots, f_{i_{n}}

and n DRs

r_{1}, \dots, r_{n}

such that, for

j = 1, \dots, n

,

l (r_{j}) = 1

and

F t (r_{j}) = {f_{i_{j}}}

. Moreover,

S^{n} \in C

. Since n is an arbitrary number from

N_{0} ∖ {0}

, we obtain that the value

| F t (S_{1}) |

for DRSs

S \in C

is not bounded from above by a constant. Therefore,

t_{0} = 1

.

We now consider the case when the value

d (S_{t_{0}})

for DRSs

S \in C

is not bounded from above by a constant. We will prove that

t_{0} = 1

. Let us assume the contrary:

t_{0} > 1

. Let

n \in N_{0} ∖ {0}

. Choose a DRS

S \in C

such that

d (S_{t_{0}}) \geq n^{t_{0} - 1}

. Let

f_{i} \in F t (S_{t_{0}})

and

d_{f_{i}} (S_{t_{0}}) = d (S_{t_{0}})

. Remove the feature

f_{i}

from the DRS S. As a result, we obtain a DRS

S^{'}

from C, which contains a number of DRs of the length

t_{0} - 1

. Let us show that these DRs contain at least n pairwise different features. Let us assume the contrary: the considered DRs contain only

m < n

pairwise different features. Then, the number of different types of DRs of length

t_{0}

in S that contain the feature

f_{i}

is at most the number of different subsets of the cardinality

t_{0} - 1

of the set of considered m features, which is at most

m^{t_{0} - 1}

. Evidently,

m^{t_{0} - 1} < n^{t_{0} - 1}

, but this is impossible, since

d_{f_{i}} (S_{t_{0}}) \geq n^{t_{0} - 1}

. Taking into account that n is an arbitrary number from

N_{0} ∖ {0}

, we obtain that the value

| F t (S_{t_{0} - 1}) |

for DRSs

S \in C

is not bounded from above by a constant. We obtained the contradiction. Thus,

t_{0} = 1

.

Let

n > 0

and S be a DRS from C for which

| F t (S_{1}) | \geq n

. We remove from S all features with the exception of n pairwise different features from

F t (S_{1})

. We denote by

S^{*}

the obtained DRS from C. It is clear that

| F t (S^{*}) | = n

and

S^{*}

contains n DRs

r_{1}, \dots, r_{n}

of length 1 with pairwise different features. □

We will now formulate and prove the main results of this paper.

Theorem 1.

Let C be a class of k-DRSs closed under the removal of features. Then

(a) If the value

| F t (S) |

for DRSs

S \in C

is bounded from above by a positive constant b, then

0 \leq H_{C, AR}^{a} (n) \leq H_{C, AR}^{d} (n) \leq b

for any

n \in N_{0}

.

(b) Otherwise,

H_{C, AR}^{a} (n) = H_{C, AR}^{d} (n) = n

, for any

n \in N_{0}

.

Proof.

(a) This statement follows from Lemma 3.

(b) Let the value

| F t (S) |

for DRSs

S \in C

be not bounded from above by a positive constant. We now consider two possibilities.

(b.1) Let the value

l (S)

for DRSs

S \in C

be not bounded from above by a constant. Let

n \in N_{0}

. We now show that

H_{C, T}^{a} (n) \geq n

. It is clear that

H_{C, T}^{a} (0) = 0

. Let

n > 0

. From Lemma 4, it follows that there exists a DRS

S^{'}

from C with

| F t (S^{'}) | = n

containing a DR

r^{'}

of the length n.

Let

G

be an NDT over

S^{'}

that solves the task

AR (S^{'})

and satisfies

h (G) = h_{AR}^{a} (S^{'})

. Consider a tuple

\bar{v} \in E_{k}^{n}

for which the DR

r^{'}

is true. Then there exists a complete path

ξ \in C P (G)

that accepts the tuple

\bar{v}

. By Remark 1, the set

τ (ξ)

associated with the leaf of

ξ

coincides with

S (\bar{v})

, the set of rules in S that are true for

\bar{v}

. In particular,

r^{'} \in τ (ξ)

. Hence, it follows that the relation

K (r^{'}) \subseteq K (ξ)

holds, and therefore

h (ξ) \geq n

and

h_{AR}^{a} (S^{'}) = h (G) \geq n

. Since

| F t (S^{'}) | = n

, we obtain

H_{C, AR}^{a} (n) \geq n

. From Lemma 2 we have

H_{C, AR}^{a} (n) = H_{C, AR}^{d} (n) = n

.

(b.2) Let the value

l (S)

for DRSs

S \in C

be bounded from above by a positive integer l. Let

n \in N_{0}

. We now show that

H_{C, T}^{a} (n) \geq n

for any

n \in N_{0}

. It is clear that

H_{C, T}^{a} (0) = 0

. Let

n > 0

. From Lemma 5, this implies the existence of a DRS

S^{*}

from C with

| F t (S^{*}) | = n

that contains n DRs

r_{1}, \dots, r_{n}

of length 1 with pairwise different features.

Let

G

be an NDT over

S^{*}

, which solves the task

AR (S^{*})

and for which

h (G) = h_{AR}^{a} (S^{*})

. Let

\bar{v} \in E_{k}^{n}

be a tuple for which the DRs

r_{1}, \dots, r_{n}

are true. Then there exists a path

ξ \in C P (G)

, which accepts the tuple

\bar{v}

. From Remark 1, it follows that the set

τ (ξ)

attached to the leaf vertex of

ξ

coincides with the set

S (\bar{v})

of DRs from S that are true for the tuple

\bar{v}

. In particular,

{r_{1}, \dots, r_{n}} \subseteq τ (ξ)

. From here it follows that the relation

K (r_{j}) \subseteq K (ξ)

holds for

j = 1, \dots, n

. Therefore

h (ξ) \geq n

and

h_{AR}^{a} (S^{*}) = h (G) \geq n

. Taking into account that

| F t (S^{*}) | = n

, we obtain

H_{C, AR}^{a} (n) \geq n

. From Lemma 2 it follows that

H_{C, AR}^{a} (n) = H_{C, AR}^{d} (n) = n

. □

Theorem 2.

Let

T \in {SR, AD}

, and C be a class of k-DRSs closed under the removal features and rules. Then,

(a) If the value

| F t (S) |

for DRSs

S \in C

is bounded from above by a positive constant b, then

0 \leq H_{C, T}^{a} (n) \leq H_{C, T}^{d} (n) \leq b

for any

n \in N_{0}

.

(b) Otherwise,

H_{C, T}^{a} (n) = H_{C, T}^{d} (n) = n

, for any

n \in N_{0}

.

Proof.

(a) This statement follows from Lemma 3.

(b) Let the value

| F t (S) |

for DRSs

S \in C

be not bounded from above by a positive constant. We now consider two possibilities.

(b.1) Let the value

l (S)

for DRSs

S \in C

be not bounded from above by a constant. Let

n \in N_{0}

. We now show that

H_{C, T}^{a} (n) \geq n

. It is clear that

H_{C, T}^{a} (0) = 0

. Let

n > 0

. From Lemma 4, it follows that there exists a DRS

S^{'}

from C with

| F t (S^{'}) | = n

containing a DR

r^{'}

of length n. Denote by

S^{″}

the DRS obtained from

S^{'}

by removal of all DRs with the exception of

r^{'}

. Then,

S^{″} \in C

.

Let

G

be an NDT over

S^{″}

, which solves the task

T (S^{″})

and for which

h (G) = h_{T}^{a} (S^{″})

. Let

\bar{v} \in E_{k}^{n}

be a tuple for which the DR

r^{'}

is true. Then there exists a path

ξ \in C P (G)

, which accepts the tuple

\bar{v}

. Using Remark 2 and taking into account that there is only

r^{'}

, and it is true for

\bar{v}

, we obtain

τ (ξ) = {r^{'}}

. From here, we have

K (r^{'}) \subseteq K (ξ)

. Therefore,

h (ξ) \geq n

and

h_{T}^{a} (S^{″}) = h (G) \geq n

. Since

| F t (S^{″}) | = n

, we obtain

H_{C, T}^{a} (n) \geq n

. From Lemma 2 it follows that

H_{C, T}^{a} (n) = H_{C, T}^{d} (n) = n

.

(b.2) Let the value

l (S)

for DRSs

S \in C

be bounded from above by a positive integer l. Let

n \in N_{0}

. We now show that

H_{C, T}^{a} (n) \geq n

for any

n \in N_{0}

. It is clear that

H_{C, T}^{a} (0) = 0

. Let

n > 0

. From Lemma 5, it follows that there exists a DRS

S^{*}

from C with

| F t (S^{*}) | = n

that contains n DRs

r_{1}, \dots, r_{n}

of the length 1 with pairwise different features. Denote by

S^{* *}

the DRS obtained from

S^{*}

by removal of all DRs with the exception of

r_{1}, \dots, r_{n}

. Then,

S^{* *} \in C

.

Let

G

be an NDT over

S^{* *}

, which solves the task

T (S^{* *})

and for which

h (G) = h_{T}^{a} (S^{* *})

. Let

\bar{v} \in E_{k}^{n}

be a tuple for which all the DRs

r_{1}, \dots, r_{n}

are not true. Then, there exists a path

ξ \in C P (G)

, which accepts the tuple

\bar{v}

. Using Remark 3, we obtain

τ (ξ) = \emptyset

. By Remark 2, we see that

K (ξ) \cup K (r_{j})

is inconsistent for

j = 1, \dots, n

. Therefore

h (ξ) \geq n

and

h_{T}^{a} (S^{* *}) = h (G) \geq n

. Taking into account that

| F t (S^{* *}) | = n

, we obtain

H_{C, T}^{a} (n) \geq n

. From Lemma 2 it follows that

H_{C, T}^{a} (n) = H_{C, T}^{d} (n) = n

. □

Let

T \in {SR, AD}

, and C be a class of DRS closed under the operation of removal of features only. We now show that in this case the functions

H_{C, T}^{a}

and

H_{C, T}^{d}

can be bounded from above by a constant even if the value

| F t (S) |

for DRSs

S \in C

is not bounded from above by a constant.

Let B be a finite subset of the set

F = {f_{i} : i \in N_{0}}

. Denote

S_{B} = {(f_{i} = 0) \to 0, (f_{i} = 1) \to 0 : i \in B}

. Let us consider the class

C = {S_{B} : B is a finite subset of A}

of 2-DRSs. One can show that the class C is closed under the removal of features, and the value

| F t (S_{B}) |

for DRSs

S_{B} \in C

is not bounded from above by a constant. Let

S_{B} \in C

, B be nonempty, and

f_{i} \in B

. Then, the DDT depicted in Figure 1 solves the task

T (S_{B})

. Therefore, the functions

H_{C, T}^{a}

and

H_{C, T}^{d}

are bounded from above by the constant 1.

Figure 1. DDT (Deterministic Decision Tree) solving the task

T (S_{B})

,

T \in {SR, AD}

.

4. Conclusions

In this paper, for arbitrary closed classes of DRSs, we investigated the functions characterizing the worst-case dependence of the minimum depth of DDTs and NDTs on the number of different features in the DRS for three tasks:

Finding all true DRs in a DRS, considering classes closed under the removal of features operation.
Determining whether at least one true DR exists, considering classes closed under the removal of features and rules.
Finding all right-hand sides of true DRs in a DRS, considering classes closed under the removal of features and rules.

It was proven that, in all three cases, the functions describing the worst-case depth of DTs are either bounded from above by a constant or grow linearly. In the future, we plan to extend this study to further tasks of DRSs and DTs.

Author Contributions

Conceptualization, M.M.; Methodology, K.D. and M.M.; Formal analysis, K.D. and M.M.; Writing—original draft, K.D. and M.M.; Writing—review & editing, K.D. and M.M.; Supervision, M.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by King Abdullah University of Science and Technology (KAUST).

Data Availability Statement

No new data were created or analyzed in this study. Data sharing is not applicable to this article.

Acknowledgments

The research reported in this publication was supported by King Abdullah University of Science and Technology (KAUST).

Conflicts of Interest

The authors declare no conflicts of interest.

References

AbouEisha, H.; Amin, T.; Chikalov, I.; Hussain, S.; Moshkov, M. Extensions of Dynamic Programming for Combinatorial Optimization and Data Mining; Intelligent Systems Reference Library; Springer: Cham, Switzerland, 2019; Volume 146. [Google Scholar]
Alsolami, F.; Azad, M.; Chikalov, I.; Moshkov, M. Decision and Inhibitory Trees and Rules for Decision Tables with Many-valued Decisions; Intelligent Systems Reference Library; Springer: Cham, Switzerland, 2020; Volume 156. [Google Scholar]
Breiman, L.; Friedman, J.H.; Olshen, R.A.; Stone, C.J. Classification and Regression Trees; Springer: Cham, Switzerland, 1984. [Google Scholar]
Quinlan, J.R. C4.5: Programs for Machine Learning; Morgan Kaufmann: Burlington, MA, USA, 1993. [Google Scholar]
Rokach, L.; Maimon, O. Data Mining with Decision Trees—Theory and Applications; Series in Machine Perception and Artificial Intelligence; Springer: Cham, Switzerland, 2007; Volume 69. [Google Scholar]
Boros, E.; Hammer, P.L.; Ibaraki, T.; Kogan, A. Logical analysis of numerical data. Math. Program. 1997, 79, 163–190. [Google Scholar] [CrossRef]
Boros, E.; Hammer, P.L.; Ibaraki, T.; Kogan, A.; Mayoraz, E.; Muchnik, I.B. An implementation of logical analysis of data. IEEE Trans. Knowl. Data Eng. 2000, 12, 292–306. [Google Scholar] [CrossRef]
Chikalov, I.; Lozin, V.V.; Lozina, I.; Moshkov, M.; Nguyen, H.S.; Skowron, A.; Zielosko, B. Three Approaches to Data Analysis—Test Theory, Rough Sets and Logical Analysis of Data; Intelligent Systems Reference Library; Springer: Cham, Switzerland, 2013; Volume 41. [Google Scholar]
Fürnkranz, J.; Gamberger, D.; Lavrac, N. Foundations of Rule Learning; Cognitive Technologies: Arlington, VA, USA, 2012. [Google Scholar]
Pawlak, Z. Rough Sets—Theoretical Aspects of Reasoning about Data; Theory and Decision Library: Series D; Springer: Cham, Switzerland, 1991; Volume 9. [Google Scholar]
Pawlak, Z.; Skowron, A. Rudiments of rough sets. Inf. Sci. 2007, 177, 3–27. [Google Scholar] [CrossRef]
Costa, V.G.; Pedreira, C.E. Recent advances in decision trees: An updated survey. Artif. Intell. Rev. 2023, 56, 4765–4800. [Google Scholar] [CrossRef]
Molnar, C. Interpretable Machine Learning. A Guide for Making Black Box Models Explainable, 2nd ed.; 2022; Available online: https://freecomputerbooks.com/Interpretable-Machine-Learning-A-Guide-for-Making-Black-Box-Models-Explainable.html (accessed on 14 July 2025).
Moshkov, M. Some relationships between decision trees and decision rule systems. In Proceedings of the Rough Sets and Current Trends in Computing, First International Conference, RSCTC’98, Warsaw, Poland, 22–26 June 1998, Proceedings; Polkowski, L., Skowron, A., Eds.; Lecture Notes in Computer Science; Springer: Cham, Switzerland, 1998; Volume 1424, pp. 499–505. [Google Scholar]
Moshkov, M. On transformation of decision rule systems into decision trees. In Proceedings of the Seventh International Workshop Discrete Mathematics and its Applications, Moscow, Russia, 29 January–2 February 2001; Part 1. Center for Applied Investigations of Faculty of Mathematics and Mechanics; Moscow State University: Moscow, Russia, 2001; pp. 21–26. (In Russian) [Google Scholar]
Durdymyradov, K.; Moshkov, M.; Ostonov, A. Decision Trees Versus Systems of Decision Rules. A Rough Set Approach; Studies in Big Data; Springer: Cham, Switzerland, 2024; Volume 160. [Google Scholar]
Durdymyradov, K.; Moshkov, M. Deterministic and nondeterministic decision trees for decision rule systems from closed classes. In Proceedings of the 24th International Conference on Artificial Intelligence and Soft Computing (ICAISC 2025), Zakopane, Poland, 22–26 June 2025. [Google Scholar]
Durdymyradov, K.; Moshkov, M. On depth of deterministic and nondeterministic decision trees for decision rule systems from closed classes. In Proceedings of the 29th International Conference on Knowledge-Based and Intelligent Information & Engineering Systems (KES 2025), Osaka, Japan, 10–12 September 2025. [Google Scholar]

Figure 1. DDT (Deterministic Decision Tree) solving the task

T (S_{B})

,

T \in {SR, AD}

.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Three Problems for Decision Rule Systems from Closed Classes

Abstract

1. Introduction

2. Definitions

2.1. DRSs—Decision Rule Systems

2.2. DTs—Decision Trees

3. Main Results

4. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Article Metrics

Citations

Article Access Statistics