Correctness of Fuzzy Inference Systems Based on f-Inclusion

Díaz-Montarroso, Carolina; Madrid, Nicolás; Ramírez-Poussa, Eloísa

doi:10.3390/math13111897

Open AccessFeature PaperArticle

Correctness of Fuzzy Inference Systems Based on f-Inclusion

by

Carolina Díaz-Montarroso

,

Nicolás Madrid

^*

and

Eloísa Ramírez-Poussa

Department of Mathematics, University of Cádiz, C. Republica Saharaui, 11510 Puerto Real, Spain

^*

Author to whom correspondence should be addressed.

Mathematics 2025, 13(11), 1897; https://doi.org/10.3390/math13111897

Submission received: 10 April 2025 / Revised: 23 May 2025 / Accepted: 30 May 2025 / Published: 5 June 2025

(This article belongs to the Section E1: Mathematics and Computer Science)

Download

Browse Figures

Versions Notes

Abstract

Recent work has shown that the f-index of inclusion can serve as a foundation for modeling Generalized Modus Ponens. In this paper, we develop a novel fuzzy inference system based on this inference rule. To establish its soundness, we connect it to a Fuzzy Description Logic

LU

enriched with fuzzy modifiers (also known as fuzzy hedges). This logic background provides to the approach a strength absent in most fuzzy inference systems in the literature, which allows us to formally prove a series of results that culminate in a final correctness theorem for the proposed fuzzy inference system. This paper also presents a running example aimed at showing the potential applicability of the proposal.

Keywords:

fuzzy logic; f-index of inclusion; fuzzy inference system; fuzzy description logic; inclusion measures

MSC:

68T37

1. Introduction

Fuzzy inference rules have long served as a cornerstone in fuzzy set theory, bridging theoretical concepts with practical applications. There are two main kinds of fuzzy inference systems (FISs) according to how the inference is performed. The first group is formed by the so-called relational fuzzy inference systems and includes the well-known compositional rule of inference, which performs an inference by means of a fuzzy relation and a couple of fuzzy operators, standardly, a fuzzy conjunction and a fuzzy implication. In this group, the most prominent approach is the one of Mamdani [1], but we can find in the literature more general approaches [2,3,4]. On the other hand, the second group of approaches is formed by those whose rules provide crisp values which are combined by means of aggregator operators in order to obtain a final inference. In this latter group, the most prominent approaches are those of Takagi–Sugeno [5] and Tsukamoto’s FIS [6], although some other recent FISs in this family can be found as well, such as those based on F-transforms [7], those considering probability distributions as consequents [8], or those considering nonlinear systems as consequents [9].

While their utility is undeniable (e.g., there are recent approaches dealing with Multicriteria Decision-Making [10,11] or medical diagnosis [12]), a critical flag remains from a theoretical point of view. Rigorously speaking, most of the FISs in the literature are not developed on the basis of a formal fuzzy logic system. In other words, although those FISs rely on fuzzy logic operators, they lack the formal syntax and semantics necessary for a rigorous definition of the notion of consequence. This absence impedes the ability to formally define syllogisms and to prove the correctness of those inference processes. As a result, the literature on FISs focuses mainly on appropriateness rather than on logical correctness. Here are some illustrative examples: in [8], the appropriateness of fuzzy-probabilistic inference systems is measured according to empirical experimentation with ad hoc data and real data; in [7], the appropriateness of dependency rules is rephrased in terms of results that prove the representation of functions via F-transforms; in [3], the appropriateness of relational fuzzy inference rules is reduced to the satisfiability of fuzzy relational equations; and in [9], the appropriateness of a switching Takagi–Sugeno FIS is identified with results on stability.

Most FISs are based on approximate reasoning and the so-called Generalized Modus Ponens (GMP). GMP is a conceptual inference that extends the Modus Ponens syllogism as follows: given

X \to Y

and something similar to X (denoted by

X^{'}

), we can infer something similar to Y (denoted by

Y^{'}

). This inference rule is usually depicted as

\begin{matrix} X \to Y \\ \frac{X^{'}}{∴ Y^{'}} \end{matrix}

(1)

Taking this as the reference point, we present in this paper a novel FIS that makes use of a GMP recently defined by means of the f-inclusion [13] between fuzzy sets [14].

Remarkably, both components needed in the definition of the GMP can be modeled by f-inclusion. This is due to a dual interpretability of this operator. On the one hand, in [15], we proved that f-inclusion can be linked to an optimal choice of an implication operator in order to perform a Modus Ponens. Consequently, this operator can be interpreted as the truth degree of an If–Then rule. On the other hand, f-inclusion was originally designed to represent inclusion, but then, it was also appropriated to represent similarity between two fuzzy sets [16].

The proposed FIS is defined as a formal logic system by means of a syntactic construction of rules and a semantics, i.e., models. The semantics is based on Fuzzy Description Logic [17], a family of formal logics that represents knowledge and reasons with it by using as central element the notion of inclusion between fuzzy sets. This link with Fuzzy Description Logic allows us to prove that the GMP used in our FIS is a correct inference rule; in other words, our FIS performs correct inferences even if the input (a fuzzy set) does not coincide with the antecedents of rules.

It is worth mentioning here the two main differences of our approach with respect to the compositional rule of inference.

The compositional rule of inference requires a fuzzy relation to link the universes of X and Y and to compute the output. In our approach, the connection between universes is left to interpretations and models. This simplifies the computation of the output in our FIS.
The compositional rule of inference requires to fix a pair of operators in advance, in general, an implication operator and a conjunction operator. However, we do not need to fix operators in advance; the inference is performed directly by a mapping that represents the inclusion between fuzzy sets.

These two facts make our approach simpler and more interpretable than the traditional FISs.

It is also worth mentioning here two approaches that deal with providing a formal logical sustain to FIS. The first is [18], which shows that most of the deduction procedures appearing in FISs can be axiomatized in Rational Pavelka Predicate Logic. The other is [4], where the author presents an FIS based on a formal fuzzy logic system with its respective syntax and semantics. The main difference between those approaches and the FIS based on f-inclusions is that the logic theory supporting the latter is Description Logic, which is a much more application-oriented logic theory than Rational Pavelka Predicate Logic [19] (used in [18]) and the

{MTLF}_{Δ}

[20] (used in [4]). This fact makes our approach more application-oriented than [4].

This paper is structured as follows: In Section 2, we present some preliminary notions that are used throughout the paper. Subsequently, in Section 3, we introduce the Description Logic

{LU}_{F H}

, which allows us to consider intersection, union, and concept modifiers (or fuzzy hedges) in the syntax. The semantics provided for modifiers in our approach differs slightly from the one presented in [21], since here, we require them to be part of an adjoint pair. This modification allows us to define a correct GMP in

{LU}_{F H}

through the adjunction property. Section 4 contains the main content of the paper, the FIS based on the notion of f-inclusion. Such a section describes the syntax and semantics of knowledge databases, allowing us to formally define the concept of consequence. In this section, we also present a link between models of knowledge databases and those of the Description Logic

{LU}_{F H}

, allowing us to use any inference of

{LU}_{F H}

on our knowledge databases. Throughout this section, we provide an illustrative example that shows the semantics and the performance of our FIS. Finally, in Section 6, we present some conclusions and future works.

2. Preliminaries

Let us begin by recalling the notion of fuzzy set.

Definition 1.

A fuzzy set is a pair

A = (U, μ_{A})

where

U

is a non-empty set (called universe) and

μ_{A}

a mapping from

U

to

[0, 1]

(called membership function).

The set of fuzzy sets defined on the universe

U

is denoted by

F (U)

. Moreover, since the universe is in general pre-fixed by the context, for the sake of clarity, we refer to a fuzzy set

(U, μ_{A})

directly by its membership function, i.e.,

A (u) = μ_{A} (u)

.

Let us recall that a fuzzy partition of a universe

U

is a set

P = {A_{1}, A_{2}, \dots, A_{n}}

of fuzzy sets on

U

satisfying the covering property, i.e., for all

u \in U

, there exists a fuzzy set

A_{i} \in P

such that

A_{i} (u) > 0

. In this paper, we also use the notions of the core and support of a fuzzy set.

Definition 2.

Given

A \in F (U)

, the support of A is defined as

s u p p (A) = {u \in U | μ_{A} (u) > 0}

and the core of A is defined as

c o r e (A) = {u \in U | μ_{A} (u) = 1} .

Zadeh identified the ordering between fuzzy sets with the ordering of their membership functions, i.e.,

A \leq B

if and only if

μ_{A} (u) \leq μ_{B} (u)

for all

u \in U

. Because this relation is quite restrictive and rigid, many authors have focused on generalizing the ordering between fuzzy sets with a graduated structure. In this paper, we focus on the f-index of inclusion, which is a novel notion that identify the ordering between fuzzy sets via a mapping from

[0, 1]

to

[0, 1]

. Not any mapping from

[0, 1]

to

[0, 1]

is appropriate to represent such an inclusion, so the first step is to fix the set of possible functions used as indexes.

Definition 3.

The set of indexes of inclusion, denoted by Ω, consists of all the deflationary and monotonically increasing mappings, that is, any mapping

f : [0, 1] \to [0, 1]

such that

$f (x) \leq x$ for all $x \in [0, 1]$ ;
$x \leq y$ implies $f (x) \leq f (y)$ for all $x, y \in [0, 1]$ .

The indexes of inclusion are assigned by means of the notion of f-inclusion recalled below.

Definition 4.

Let A and B be two fuzzy sets and consider

f \in Ω

. We say that A is f-included in B, denoted by

A \subseteq_{f} B

, if and only if the inequality

f (A (u)) \leq B (u)

holds for all

u \in U

.

Here, it is convenient to note that each mapping

f \in Ω

determines a different restriction by means of f-inclusion; in other words, given a fuzzy set A, the greater

f \in Ω

, the stronger the restriction imposed on the fuzzy set B by the f-inclusion. The reader is referred to [13,16] for more details about that assertion. Finally, the f-index of inclusion between two fuzzy sets is determined by choosing an index in

Ω

that represents the criterion given by the f-inclusion. Instead of presenting the original one [13], we consider here a recent generalization [22,23], which allows us to consider subsets of

Ω

to determine such a choice.

Definition 5.

Let A and B be two fuzzy sets and Θ be a join-subsemilattice of Ω, i.e., Θ is closed under arbitrary suprema and contains

0

and id. Then, the f-index of inclusion restricted to Θ, denoted by

{Inc}_{Θ} (A, B)

, is defined as

{Inc}_{Θ} (A, B) = sup {f \in Θ ∣ A \subseteq_{f} B} .

If

Θ = Ω

, we write directly

Inc (A, B)

to denote the f-index of inclusion. In [22], the properties of the f-index of inclusion were studied according to the axiomatic properties required by the standard approaches of inclusion between fuzzy sets [24,25]. We refer the reader to [16] for a detailed explanation on how to compute the f-index of inclusion.

Theorem 1.

Let

A, B

, and C be three fuzzy sets and let

Θ \subseteq Ω

be a join-subsemilattice of Ω with

0, id \in Θ

; then,

1.: ${Inc}_{Θ} (A, B) \leq Inc (A, B)$ ;
2.: ${Inc}_{Θ} (A, B) = id$ if and only if $A (u) \leq B (u)$ for all $u \in U$ ;
3.: ${Inc}_{Θ} (A, B) = 0$ if and only if, for each $f \in Θ$ with $f \neq 0$ , there exists an element $u \in U$ such that $f (A (u)) > B (u)$ ;
4.: $Inc (A, B) = 0$ if and only if there exists a set ${u_{i}}_{i \in I} \subseteq U$ such that $A (u_{i}) = 1$ and ${inf}_{i \in I} B (u_{i}) = 0$ ;
5.: If Θ is closed under composition (i.e., for all $f, g \in Θ$ , we have $f \circ g \in Θ$ ), then $I n c_{Θ} (B, C) \circ I n c_{Θ} (A, B) \leq I n c_{Θ} (A, C)$ ;
6.: If $B (u) \leq C (u)$ for every $u \in U$ , then $I n c_{Θ} (C, A) \leq I n c_{Θ} (B, A)$ ;
7.: If $B (u) \leq C (u)$ for every $u \in U$ , then $I n c_{Θ} (A, B) \leq I n c_{Θ} (A, C)$ ;
8.: Let $T : U \to U$ be a bijective mapping on $U$ ; then, $I n c_{Θ} (A, B) = I n c_{Θ} (T (A), T (B))$ ;
9.: $I n c_{Θ} (A, B \cap C) \geq inf {I n c_{Θ} (A, B), I n c_{Θ} (A, C)$ };
10.: $I n c_{Θ} (A \cup B, C) \geq inf {I n c_{Θ} (A, C), I n c_{Θ} (B, C)$ }.

Among all the join-semilattices of

Ω

, the following one, denoted by

G

, is remarkable, and it is used throughout the paper.

Definition 6.

The set

G

contains all the mappings in

f \in Ω

satisfying that there exists

g : [0, 1] \to [0, 1]

such that

(f, g)

is an adjoint pair.

Let us recall that an adjoint pair

(f, g)

in

[0, 1]

is a pair of functions

f, g : [0, 1] \to [0, 1]

such that

f (x) \leq y \Leftrightarrow x \leq g (y)

for all

x, y \in [0, 1]

. This property (called adjointness) is used in this paper to define a Generalized Modus Ponens that is used as the inference engine of our fuzzy inference system. It is worth recalling here a well-known result about adjoint pairs [26]. With a fixed mapping

f : [0, 1] \to [0, 1]

, there exists a mapping

g : [0, 1] \to [0, 1]

such that

(f, g)

forms an adjoint pair if and only if f is left continuous. Moreover, in this case, the mapping g is unique and right continuous.

3. The Description Logic with Concept Modifiers ${LU}_{F H}$

In this section, we provide the syntax and semantics of the Description Logic

{LU}_{F H}

that adds to the standard syntax of

LU

fuzzy modifiers (or fuzzy hedges). Thanks to these modifiers, which are interpreted as mappings in

G

(Definition 6), we can define a correct Generalized Modus Ponens (GMP) in

{LU}_{F H}

. Later, in Section 4, we use this Description Logic as the reference to define our inference engine.

3.1. Syntax of ${LU}_{F H}$

The formal language of

{LU}_{F H}

consists of a set of alphabets of symbols, for individuals, concepts, modifiers, and membership degrees.

The set of individuals is represented by lowercase letters $a, b, c, \dots$ . In general, they are denoted by $a_{i}$ .
The set of primitive concepts, identified by uppercase letters $A, B, \dots$ ( $A_{i}$ in general), denote properties satisfied by individuals. Here, we also include two particular concepts, the top (⊤) and bottom (⊥) concepts.
The set of truth-depressing modifiers is denoted by the symbol $m_{i}$ .
The set of membership degrees is represented through Greek letters $α, β, \dots$ (in general, $α_{i}$ ).

On the other hand, this syntax also includes the connectives ⊓, ⊔, ⊑, ∘, and −. Specifically, given a truth-depressing modifier m, we can construct a new modifier called truth-stressing modifier as follows:

\begin{matrix} \bar{m} ∣ & (truth - stressing modifier) \end{matrix}

Note that by construction, the operator − cannot be applied twice to one modifier, since it is applied directly on truth-depressing modifiers, which are atomic elements in the alphabet. Moreover, given two (truth-stressing or truth-depressing) modifiers

m_{1}

and

m_{2}

, the following expression is a modifier as well:

\begin{matrix} m_{1} \circ m_{2} ∣ & (composition of modifiers) \end{matrix}

Finally, given two concepts C and D and one modifier m, the following expressions are concepts as well:

\begin{matrix} C ⊓ D ∣ & (concept conjunction) \\ C ⊔ D ∣ & (concept disjunction) \\ {}^{m}C ∣ & (modified concept) \end{matrix}

The interpretation of the connectives ⊓, ⊔, ⊑, ∘, and − is left to the following section, which is devoted to the semantics of

{LU}_{F H}

. Moreover, the auxiliary symbols 〈 and 〉 are also used in this syntax to define the following axioms.

Definition 7

([17]). Given a concept C and an individual a, an assertion is an expression of type

C (a)

, where a is called instance of C. A fuzzy assertion is a pair

〈 C (a), α 〉

, where

C (a)

is an assertion and

α \in [0, 1]

.

A fuzzy assertion

〈 C (a), α 〉

is an expression that can be translated as “the membership degree of a being an instance of C is at least

α

”. A finite set of fuzzy assertions is called an A-Box, which is denoted by

A

.

Definition 8

([17]). A concept specialization or terminological axiom is a relation between two concepts C and D denoted by

C ⊑ D

.

A concept specialization

C ⊑ D

is interpreted as “C is more specific than D”. A finite set of terminological axioms, denoted by

T

, is called a T-Box.

Definition 9.

A set of axioms is an ordered pair

〈 T, A 〉

consisting of a T-Box and a A-Box.

Below, we present an example showing a T-Box and an A-Box contextualized on cars.

Example 1.

Let us consider the following T-Box, denoted by

T

, aimed at describing the notions of a sports car and a family car.

\begin{matrix} {}^{H i g h}{H o r s e P o w e r} (x) ⊓ F a s t (x) ⊑ S p o r t s C a r (x) \\ {}^{H i g h}{H o r s e P o w e r} (x) ⊓ S m a l l (x) ⊓ L i g h t (x) ⊑ {}^{v e r y}{F a s t} (x) \\ b i g_I n t e r i o r (x) ⊓ b i g_T r u n k (x) ⊑ F a m i l y C a r (x) \end{matrix}

Let us consider also the following A-box,

A

, with instances about one hypothetical car a

\begin{matrix} 〈 H o r s e P o w e r (a); 0.9 〉 \\ 〈 S m a l l (a); 0.9 〉 \\ 〈 L i g h t (a); 0.7 〉 \end{matrix}

In order to obtain the consequences from the T-Box and the assertions in the A-Box, we need to define the semantics in

{LU}_{F H}

, which is carried out in next section.

Table 1 presents a summary of the main symbols employed in this syntax along with their respective interpretation.

3.2. Semantics of ${LU}_{F H}$

Definition 10

([17]). A fuzzy interpretation (or simply an interpretation)

I

is a pair

I = (Δ^{I}, \cdot^{I})

such that

Δ^{I}

is a non-empty set called domain, and

\cdot^{I}

is an interpretation function mapping:

Different individuals into different elements of $Δ^{I}$ ;
Primitive concepts into membership degree functions $Δ^{I} \to [0, 1]$ ;
Truth-depressing modifiers into mappings in $G$ (see Definition 6).

For convention, given an individual a, a primitive concept C, and a modifier m, the mappings given by a fuzzy interpretation

I

are denoted by

a^{I}, C^{I}

and

m^{I}

, respectively. Moreover, note that fuzzy interpretations can be extended to concepts constructed by means of connectives and other symbols as follows:

\begin{matrix} ⊤^{I} (u) & = 1 \\ ⊥^{I} (u) & = 0 \\ {(C ⊓ D)}^{I} (u) & = min {C^{I} (u), D^{I} (u)} \\ {(C ⊔ D)}^{I} (u) & = max {C^{I} (u), D^{I} (u)} \\ {(^{m} C)}^{I} (u) & = m^{I} (C^{I} (u)) \end{matrix}

for all

u \in Δ^{I}

, and to modifiers constructed by composition and − as follows:

\begin{matrix} {(m_{1} \circ m_{2})}^{I} & = {(m_{1})}^{I} \circ {(m_{2})}^{I} \\ {(\bar{m})}^{I} & = the only mapping f such that (m^{I}, f) is an adjoint pair . \end{matrix}

Now that the notion of interpretation has been introduced, the reader is aware of the reasons why we have called the atomic modifiers truth-depressing and those constructed by −truth-stressing. In more detail, given an interpretation

I

and a truth-depressing modifier m,

m^{I}

is a mapping in

Ω

, so

m^{I} (α) \leq α

for all

α \in [0, 1]

. On the other hand, by construction, the modifier

{\bar{m}}^{I}

satisfies

{\bar{m}}^{I} (α) \geq α

for all

α \in [0, 1]

. The former kind of modifiers are called truth-depressors in the literature, whereas the latter, truth-stressors [27].

Table 2 shows some examples of truth-depressing modifiers that appear in the literature [28,29] along with their associated truth-stressing modifiers.

Definition 11.

An interpretation

I

satisfies a fuzzy assertion

〈 C (a), α 〉

if and only if

α \leq C^{I} (a^{I})

. Moreover,

I

is said to satisfy a concept specialization

C ⊑ D

if and only if

C^{I} (u) \leq D^{I} (u)

for all

u \in Δ^{I}

(i.e.,

C^{I}

is lesser than or equal to

D^{I}

in Zadeh’s sense).

Definition 12.

An interpretation

I

is called a model of a set of axioms

〈 T, A 〉

if and only if

I

satisfies each element of

〈 T, A 〉

.

Definition 13.

A set of axioms

〈 T, A 〉

entails a fuzzy assertion

〈 C (a), α 〉

, written

〈 T, A 〉 ⊧ 〈 C (a), α 〉

, if and only if every model of

〈 T, A 〉

also satisfies

〈 C (a), α 〉

. Similarly, we say that

〈 T, A 〉

entails a concept specialization

C ⊑ D

, written

〈 T, A 〉 ⊧ C ⊑ D

, if and only if every model of

〈 T, A 〉

also satisfies

C ⊑ D

.

Example 2.

Let us reconsider the T-Box T and the A-Box A described in Example 1. Let us consider a model

I = (Δ^{I}, \cdot^{I})

of

〈 T, A 〉

. Then,

I

satisfies:

\begin{matrix} 0.9 & \leq {H o r s e P o w e r}^{I} (a^{I}) \\ 0.9 & \leq {S m a l l}^{I} (a^{I}) \\ 0.7 & \leq {L i g h t}^{I} (a^{I}) \end{matrix}

Consider the truth-depressing modifier ^High. This modifier is interpreted by the model as follows:

{H i g h}^{I} (α) = α^{2}

for every

α \in [0, 1]

. Then, if we apply this modifier to the first assertion of the A-Box, we obtain the new assertion

〈^{H i g h} H o r s e P o w e r (a); 0.81 〉

. By Definition 11 it can be proven that the fuzzy assertions

〈^{v e r y} F a s t (a); 0.7 〉

and

〈 S p o r t s C a r (a); 0.7 〉

are consequences of

〈 T, A 〉

.

The literature on Description Logic contains plenty of algorithms to derive consequences and concept specializations from a set of axioms

〈 T, A 〉

. However, that is out of the scope of this paper, and we refer the reader to [17,21] for more details.

3.3. Generalized Modus Ponens in ${LU}_{F H}$

The inclusion of modifiers in the syntax of

{LU}_{F H}

enables the formulation of new inference rules that extend beyond the standard Description Logic

LU

. The first result shows a couple of tautologies in

{LU}_{F H}

involving modifiers.

Proposition 1.

Given a concept specialization C and a truth-depressing modifier m, we have that

$⊧ {}^{m}C ⊑ C$ ;
$⊧ C ⊑ {}^{\bar{m}}C$ .

Proof.

To prove the first item, we have to show that any interpretation

I = (Δ^{I}, \cdot^{I})

is a model of

{}^{m}C ⊑ C

. By definition of an interpretation,

m^{I} \in G

, so

m^{I} (α) \leq α

for all

α \in [0, 1]

. As a result,

m^{I} (C (u)) \leq C (u)

for all

u \in Δ^{I}

.

To prove the second item, let us consider an interpretation

I = (Δ^{I}, \cdot^{I})

; we see that

I

is a model of

C ⊑ {}^{\bar{m}}C

. By definition of an interpretation,

{\bar{m}}^{I}

is the right part of an adjoint pair

(m^{I}, {\bar{m}}^{I})

with

m^{I} \in G

. Since

m^{I} (α) \leq α

for all

α \in [0, 1]

, we have by the adjoint property that

α \leq {\bar{m}}^{I} (α)

for all

α \in [0, 1]

. As a result,

C (u) \leq {\bar{m}}^{I} (u)

for all

u \in Δ^{I}

. □

The second result shows that the modifier connective − can be used in inferences of concept specializations.

Proposition 2.

Given two concept specialization C and D and a truth-depressing modifier m, we have that

${}^{m}C ⊑ D ⊧ C ⊑ {}^{\bar{m}}D$ .
$C ⊑ {}^{\bar{m}}D ⊧ {}^{m}C ⊑ D$ .

Proof.

Let us prove the first item. Let

I = (Δ^{I}, \cdot^{I})

be a model of

{}^{m}C ⊑ D

. Then, for all

u \in Δ^{I}

, we have that

m^{I} (C^{I} (u)) \leq D^{I} (u)

Since

m^{I} \in G

, then

(m^{I}, {\bar{m}}^{I})

forms an adjoint pair, which implies that

C^{I} (u) \leq {\bar{m}}^{I} (D^{I} (u))

for all

u \in Δ^{I}

, and then

I

satisfies

C ⊑ {}^{\bar{m}}D

. The other item is proved similarly. □

The following results present various versions of Generalized Modus Ponens (GMP) within the

{LU}_{F H}

language. In this framework, the syllogism of GMP is comprised of a concept specialization, which serves the role of implication, and the instance of the antecedent is incorporated in the form of an assertion.

Theorem 2.

Let

〈 T, A 〉

be a set of axioms composed of

T = 〈 C ⊑ D 〉

and

A = 〈^{m} C (a), α 〉

. Then,

〈 T, A 〉 ⊧ 〈^{m} D (a), α 〉

.

Proof.

Consider a model

I

of

〈 T, A 〉

. By definition,

I

satisfies

$α \leq m^{I} (C^{I} (a^{I}))$ , where $m^{I} \in G$ ;
$C^{I} (u) \leq D^{I} (u)$ for all $u \in Δ^{I}$ .

Specifically,

C^{I} (a^{I}) \leq D^{I} (a^{I})

. Since

m^{I} \in G

is a monotonic mapping, from the previous inequality, we can obtain

α \leq m^{I} (C^{I} (a^{I})) \leq m^{I} (D^{I} (a^{I}))

. In other words,

I

satisfies the assertion

〈^{m} D (a), α 〉

. In conclusion,

〈 T, A 〉 ⊧ 〈^{m} D (a), α 〉

. □

The second GMP considers a modifier in a concept specialization and an assertion free of modifier.

Theorem 3.

Let

〈 T, A 〉

be a set of axioms composed of

T = 〈^{m} C ⊑ D 〉

and

A = 〈 C (a), α 〉

. Then,

〈 T, A 〉 ⊧ 〈^{\bar{m}} D (a), α 〉

.

Proof.

Consider a model

I

of

〈 T, A 〉

. By Proposition 2, every model of

〈^{m} C ⊑ D 〉

equally satisfies

〈 C ⊑ {}^{\bar{m}}D 〉

. Therefore, we have

$α \leq C^{I} (a^{I})$ ;
$C^{I} (u) \leq {\bar{m}}^{I} (D^{I} (u))$ for all $u \in Δ^{I}$ .

Specifically, the following chain of inequalities holds:

α \leq C^{I} (a^{I}) \leq {\bar{m}}^{I} (D^{I} (a^{I})) .

Namely,

I

satisfies the assertion

〈^{\bar{m}} D (a), α 〉

. In conclusion,

〈 T, A 〉 ⊧ 〈^{\bar{m}} D (a), α 〉

. □

The third GMP considers two concept modifiers: one of them appears in the antecedent of the concept specialization, and the other one is included in the instance of the antecedent.

Theorem 4.

Let

〈 T, A 〉

be a set of axioms composed of

T = 〈^{m_{1}} C ⊑ D 〉

and

A = 〈^{m_{2}} C (a), α 〉

. Then,

〈 T, A 〉 ⊧ 〈^{m_{2} \circ \bar{m_{1}}} D (a), α 〉

.

Proof.

Let

I

be a model of

〈 T, A 〉

. Then,

$C^{I} (a^{I}) \leq {\bar{m_{1}}}^{I} (D^{I} (a^{I}))$ for all $a^{I} \in Δ^{I}$ ;
$α \leq {m_{2}}^{I} (C^{I} (a^{I}))$ .

Therefore, by using the monotonicity of

{m_{2}}^{I}

, we have

α \leq {m_{2}}^{I} (C^{I} (a^{I})) \leq {m_{2}}^{I} ({\bar{m_{1}}}^{I} (D^{I} (a^{I}))) = ({m_{2}}^{I} \circ {\bar{m_{1}}}^{I}) (D^{I} (a^{I})),

in other words,

I

satisfies the assertion

〈^{m_{2} \circ \bar{m_{1}}} D (a), α 〉

, so

〈 T, A 〉 ⊧ 〈^{m_{2} \circ \bar{m_{1}}} D (a), α 〉

. □

In the following section, the latter rule is used to derive conclusions from an inference system by applying it as the core of an inference engine.

4. A Fuzzy Inference System Based on the $f$ -Index of Inclusion

Although in the literature, we can find different versions of fuzzy inference systems, all of them can be divided, in general, in four steps: fuzzification, knowledge database, inference engine and defuzzification. The first step is the fuzzification process, where crisp data are transformed into fuzzy information. Usually, this step is performed by means of a fuzzy partition. The knowledge database comprises some knowledge by rules of the type If–Then. The inference engine is the central stage, where the fuzzy inference system takes the fuzzified input and returns an output by combining the input with the rules in the knowledge database. Finally, the defuzzification process takes the output of the inference engine (usually a fuzzy set) and returns a crisp output according to the applied context of the FIS. Next, we specify how these four phases are formalized in our inference system based on the f-index of inclusion. We start with the knowledge database, to show how the information is comprised in If–Then-type rules. Then, under the semantics of the rules in the knowledge database, the inference engine is introduced using the Description Logic

{LU}_{F H}

as support. Finally, the fuzzification and defuziffication are analyzed according to the previous consideration.

4.1. The Knowledge Database

Definition 14.

A frame is a tuple

(X, Y, A_{X}, B_{Y})

where X and Y are sets and

A_{X}

and

B_{Y}

are two fuzzy partitions over the universes X and Y, where

A_{X} = {A_{i}}_{i \in I}

and

B_{Y} = {B_{j}}_{j \in J}

.

Definition 15.

A knowledge database (

K B

) on the frame

(X, Y, A_{X}, B_{Y})

is a set of rules of the form

〈 A_{i} \to B_{j}; f_{i j} 〉

where

A_{i} \in A_{X}

,

B_{j} \in B_{Y}

and

f_{i j} \in G

.

To provide an interpretation to the connection between the implication rule

A_{i} \to B_{j}

and its associated function

f_{i j}

, we use the f-inclusion relation between fuzzy sets, as presented in the following definition.

Definition 16.

Let Γ be a

K B

on the frame

(X, Y, A_{X}, B_{Y})

. A subset of pairs

M \subseteq X \times Y

is a model of Γ, if, for every rule

〈 A_{i} \to B_{j}; f_{i j} 〉 \in Γ

, we have that

f_{i j} (A_{i} (x)) \leq B_{j} (y)

for all

(x, y) \in M

. The set of all models of Γ is denoted by

M_{Γ}

.

The use of the f-inclusion to model implication rules in our frame

(X, Y, A_{X}, B_{Y})

is justified by the ability of this operator to serve as fuzzy implication, as explained in [15].

Note that a model of a

K B

is a crisp set, i.e., a subset of

X \times Y

is either a model or not. Consequently, with a fixed

K B

Γ

, we can consider in the set of models of

Γ

,

M_{Γ}

, the standard order between sets and then analyze the structure of

(M_{Γ}, \subseteq)

. The next result shows that

(M_{Γ}, \subseteq)

is a complete lattice:

Theorem 5.

Let Γ be a

K B

on a frame

(X, Y, A_{X}, B_{Y})

, and let

M_{Γ}

be the set of models of Γ. Then,

M_{Γ}

has a complete lattice structure with the standard order between sets.

Proof.

Let us prove that both intersection and union of an arbitrary number of models are also models. Let

{M_{i}}_{i \in I} \subseteq M_{Γ}

be a subset of models of

Γ

and let us begin by showing that

M_{\cup} = ⋃_{i \in I} M_{i}

is a model. Let

(x, y) \in M_{\cup}

, then, necessarily, there exists a model

\bar{M} \in {M_{i}}_{i \in I}

such that

(x, y) \in \bar{M}

. As a result, for any rule

〈 A_{i} \to B_{j}; f_{i j} 〉

in

Γ

, we have

f_{i j} (A_{i} (x)) \leq B_{j} (y)

. In other words,

M_{\cup}

is a model.

The proof that shows that

M_{\cap} = ⋂_{i \in I} M_{i}

is a model of

Γ

is similar. □

Corollary 1.

Let Γ be a

K B

on a frame

(X, Y, A_{X}, B_{Y})

, and let

M_{Γ}

be the set of models of Γ. Then, the greatest element of

(M_{Γ}, \subseteq)

, denoted by

M_{Γ}

, is the join of every model of Γ, and the least model of

(M_{Γ}, \subseteq)

is the empty set.

The previous result highlights a substantial difference with respect to Description Logic and logic programming. In the aforementioned formal theories, the emphasis is placed on minimal models (the least model in logic programming or canonical models in DL). In our work, we focus on the maximal model, as it determines the possible pairs of points in

X \times Y

that are consistent with its associated

K B

. Later, in Section 4.2, it is shown that we can reduce our analysis to the greatest model of

Γ

to check correct inferences.

Below, we provide an illustrative example describing a knowledge database where the step-by-step construction of its greatest model is described. For the sake of showing the potential application of this inference system, it has been contextualized in pediatrics despite the synthetic nature of the example.

Example 3.

Let us assume that we have conducted a study on heart rate in infants and children between the ages of one and nine. In order to establish a relation between these two properties, we consider the frame

(X, Y, A_{X}, B_{Y})

consisting of

The interval $X = [1, 9]$ to represent the ages between 1 and 9, e.g., the age of an infant who is 18 months old would be $1.5$ ;
The interval $Y = [70, 130]$ to cover a wide range of heart rates (in bpm).

The fuzzy partition $A_{X} = {A_{1}, A_{2}, A_{3}, A_{4}}$ consists of four fuzzy sets $A_{i} \in F (X)$ which distinguish four fuzzy age ranges. The membership functions corresponding to each element of $A_{X}$ are

$A_{1} (x) = \{\begin{matrix} 1 & if x \in [1, 2] \\ 2 - 0.5 x & if x \in (2, 4] \\ 0 & if x \in (4, 9], \end{matrix}$

$A_{2} (x) = \{\begin{matrix} 0 & if x \in [1, 2] \\ 0.5 x - 1 & if x \in (2, 4] \\ 3 - 0.5 x & if x \in (4, 6] \\ 0 & if x \in (6, 9], \end{matrix}$

$A_{3} (x) = \{\begin{matrix} 0 & if x \in [1, 4] \\ 0.5 x - 2 & if x \in (4, 6] \\ 4 - 0.5 x & if x \in (6, 8] \\ 0 & if x \in (8, 9], \end{matrix}$

$A_{4} (x) = \{\begin{matrix} 0 & if x \in [1, 6] \\ 0.5 x - 3 & if x \in (6, 8] \\ 1 & if x \in (8, 9] . \end{matrix}$
The fuzzy partition $B_{Y} = {B_{1}, B_{2}, B_{3}}$ only consists of three elements, which can be classified into $B_{1}$ = “low heart rate”, $B_{2}$ = “standard heart rate”, and $B_{3}$ = “high heart rate”, respectively. These fuzzy sets have the following respective associated membership functions:

$B_{1} (y) = \{\begin{matrix} \frac{100 - y}{30} & if y \in [70, 100] \\ 0 & if y \in (100, 130], \end{matrix}$

$B_{2} (y) = \{\begin{matrix} \frac{y - 70}{30} & if y \in [70, 100] \\ \frac{130 - y}{30} & if y \in (100, 130], \end{matrix}$

$B_{3} (y) = \{\begin{matrix} 0 & if y \in [70, 100] \\ \frac{y - 100}{30} & if y \in (100, 130] . \end{matrix}$

Both of these fuzzy partitions are represented in Figure 1.

Once the frame

(X, Y, A_{X}, B_{Y})

has been defined, it is time to set the rules

〈 A_{i} \to B_{j}; f_{i j} 〉

that constitute the knowledge base Γ. This

K B

contains

4 \cdot 3 = 12

different rules, each one corresponding to a different combination of elements

A_{i} \in A_{X}

and

B_{j} \in B_{Y}

. As explained before, the mapping

f_{i j} \in G

associated with each rule determines a relation of f-inclusion

A_{i} \to B_{j}

. In other words, when the relationship between

A_{i}

and

B_{j}

is stronger (i.e., patients with ages corresponding to

A_{i}

tend to have a heart rate within

B_{j}

), the mapping

f_{i j} \in G

is closer to the identity (i.e., the f-inclusion relation is more restrictive). Conversely, when the elements

A_{i}

and

B_{j}

are not related at all, the inclusion function is null, i.e.,

f_{i j} (α) = 0 (α) = 0

for all

α \in [0, 1]

.

Since, in general, the average heart rate of infants and children decreases with age, the knowledge base associated with this frame is represented as in Table 3.

Where

g_{n} (α) = \{\begin{matrix} 0 & i f α \leq n \\ n & i f α > n, \end{matrix}

f_{n} (α) = \{\begin{matrix} α & i f α \leq n \\ n & i f α > n, \end{matrix}

and

0 (α) = 0

for all

α \in [0, 1]

.

Now that the

K B

has been defined (denoted by Γ), it is time to compute its greatest model

M_{Γ}

. Let us recall from Definition 16 that a model M of Γ is a set of pairs in

X \times Y

which satisfy all the following f-inclusion relations:

f_{i j} (A_{i} (x)) \leq B_{j} (y)

for every rule

〈 A_{i} \to B_{j}; f_{i j} 〉 \in Γ

. Thus, in order to obtain the greatest model, we have to compute which pairs

(x, y) \in X \times Y

satisfy each of these f-inclusions. This calculation can be performed graphically in the

X \times Y

plane through the representation of all the restrictions imposed by each f-inclusion associated with each rule

〈 A_{i} \to B_{j}; f_{i j} 〉 \in Γ

. This plotting process is performed step by step in order to facilitate its understanding.

Let us start by focusing on the second element of

A_{X}

,

A_{2}

, which covers the fuzzy age range from 2 to 6 years. According to Table 3, the three rules associated with this fuzzy set are

〈 A_{2} \to B_{1}; 0 〉, 〈 A_{2} \to B_{2}; f_{0.5} 〉, 〈 A_{2} \to B_{3}; g_{0.25} 〉 .

First of all, note that the former rule,

〈 A_{2} \to B_{1}; 0 〉

, does not impose any restriction on the set of models, since all pairs

(x, y) \in X \times Y

satisfy null f-inclusions, i.e.,

0 = 0 (A_{2} (x)) \leq B_{1} (y)

. Therefore, the step-by-step graphical representation is only performed for the next two rules (since the graph associated to the first rule does not impose any constraint on the maximal model).

The graph in Figure 2 illustrates the pairs in

X \times Y

that satisfy the condition imposed by the rule

〈 A_{2} \to B_{2}; f_{0.5} 〉

. This is carried out in two simple steps:

First, the fuzzy set $f_{0.5} (A_{2})$ is computed, and its graph is represented over $A_{2}$ in the partition $A_{X}$ .
Then, the pairs with positive membership degree in $A_{2}$ are selected. Among these, the pairs $(x, y) \in X \times Y$ whose y-coordinate has a membership degree in $B_{2}$ lower than $f_{0.5} (A_{2} (x))$ are discarded from $M_{Γ}$ .

Note that the mapping

f_{0.5}

associated with the rule

〈 A_{2} \to B_{2}; f_{0.5} 〉

implies the existence of a patient whose age belongs to

A_{2}

with the highest possible membership degree, that is, 4 years old, and with a heart rate with a membership degree in

B_{2}

equal to

0.5

, e.g., 85 or 115 beats per minute.

A similar process is followed to plot in Figure 3 the set of pairs

(x, y) \in X \times Y

that satisfy the rule

〈 A_{2} \to B_{3}; g_{0.25} 〉

. The apparition of this mapping implies the existence of two different patients: one of them who is 4 years old and whose heart rate is approximately 110 and another one whose age has a membership degree in

A_{2}

higher than

0.25

(i.e., their age is between 30 and 66 months old) and whose heart rage is lesser or equal than 100 bpm, so it does not belong to partition

B_{3}

.

The final step consists in computing the intersection of the shaded regions obtained in Figure 2 and Figure 3, as the resulting region contains the set of pairs

(x, y) \in X \times Y

that satisfy all the rules associated with the fuzzy set

A_{2}

. Figure 4 shows the result of this computation.

By performing an analogous process with the remaining elements of

A_{X}

, we obtain the graphical representation in the

X \times Y

plane of the greatest model

M_{Γ}

, which corresponds to the shaded region in the

X \times Y

plane shown in Figure 5. Note that the pairs

(x, y) \in X \times Y

that lie on the “boundary” of the shaded section also belong to

M_{Γ}

.

The computation of the greatest model provides an easy way to calculate other models of the

K B

, since the search for a model is reduced to selecting a set of pairs

(x, y) \in X \times Y

that lie within the shaded region. In some applied environments, some models different from the greatest one may be of interest. For example, Figure 6 shows (in bold) two different models. The left graph presents a finite model,

M_{1}

, consisting of seven elements

(x, y) \in M_{Γ}

:

M_{1} = {(2, 120), (3, 115), (4, 110), (4, 115), (6, 95), (7, 85), (9, 75)} .

On the other hand, the right graph in Figure 6 shows a model

M_{2}

consisting of a functional dataset, namely, a set of pairs

(x, f (x)) \in M_{Γ}

.

The following result shows that with this semantics, we can identify each model of a

K B

with one particular T-Box of the Description Logic

{LU}_{F H}

defined in the previous section. As shown in a subsequent section, this link allows the application of inference rules of

{LU}_{F H}

in our

K B

.

Proposition 3.

Let Γ be a

K B

on a frame

(X, Y, A_{X}, B_{Y})

, and let

T_{Γ}

be the T-Box of

{LU}_{F H}

constructed as follows:

For each $A_{i}$ in $A_{X}$ (resp., $B_{j}$ in $B_{Y}$ ) consider the concept $C_{A_{i}}$ (resp., $C_{B_{j}}$ );
For each $f_{i j} \in G$ appearing in Γ, consider the concept modifier $m_{i j}$ ;
For each rule $〈 A_{i} \to B_{j}; f_{i j} 〉 \in Γ$ , consider in $T_{Γ}$ the specialization concept

${}^{m_{i j}}C_{A_{i}} ⊑ C_{B_{j}} .$

Then, given a model M of Γ, the interpretation

I = (Δ_{I}, \cdot^{I})

given by

$Δ^{I} = M$ ;
If $A_{i} \in A_{X}$ , then $C_{A_{i}}^{I} (x, y) = A_{i} (x)$ ;
If $B \in B_{Y}$ , then $C_{B_{j}}^{I} (x, y) = B_{j} (y)$ ;
${(^{m_{i j}} C)}^{I} (x, y) = f_{i j} (C^{I} (x, y))$ ;

is a model of

T_{Γ}

.

Proof.

Let

(x, y) \in M

. Then, by definition of a model,

f_{i j} (A_{i} (x)) \leq B_{j} (y)

for every rule in

Γ

, which implies that

{(^{m_{i j}} C_{A_{i}})}^{I} (x, y)) \leq C_{B_{j}}^{I} (x, y)

. In conclusion,

I

satisfies

{}^{m_{i j}}C_{A_{i}} ⊑ C_{B_{j}}

for every concept specialization in

T_{Γ}

. In other words,

I

is a model of

T_{Γ}

. □

Obviously, the converse of the previous result is not true in general, since the models in

{LU}_{F H}

may be very general. Actually, a

K B

with only empty models may be connected to a T-Box with non-empty models via the translation described in Proposition 3.

Example 4.

Let us consider, on the universes

X = {x_{1}, x_{2}, x_{3}}

and

Y = {y_{1}, y_{2}, y_{3}}

, the fuzzy partitions

A_{X} = {A_{1}, A_{2}} \subseteq F (X)

and

B_{Y} = {B_{1}, B_{2}} \subseteq F (Y)

given by the fuzzy sets defined in Table 4.

On this frame

(X, Y, A_{X}, B_{Y})

, we define the

K B

composed by the next four rules:

$〈 A_{1} \to B_{1}; id 〉$ ; ● $〈 A_{2} \to B_{1}; \frac{id}{2} 〉$ ;
$〈 A_{1} \to B_{2}; \frac{id}{2} 〉$ ; ● $〈 A_{2} \to B_{2}; id 〉$ .

Consider each element

(x_{i}, y_{j}) \in X \times Y

, and let us check that there is at least one rule in Γ that is not satisfied by

(x_{i}, y_{i})

:

$(x_{1}, y_{1})$ and $(x_{2}, y_{1})$ do not satisfy $〈 A_{1} \to B_{2}; \frac{id}{2} 〉$ , since $\frac{id}{2} (A_{1} (x_{1})) = \frac{id}{2} (A_{1} (x_{2})) = 0.5 ≰ B_{2} (y_{1}) = 0$ .
$(x_{3}, y_{1})$ does not satisfy $〈 A_{2} \to B_{2}; id 〉$ , because $A_{2} (x_{3}) = 1 ≰ B_{2} (y_{1}) = 0$ .
$(x_{1}, y_{2})$ and $(x_{2}, y_{2})$ do not satisfy $〈 A_{1} \to B_{1}; id 〉$ , since $A_{1} (x_{1}) = A_{1} (x_{2}) = 1 ≰ B_{1} (y_{2}) = 0.5$ .
$(x_{3}, y_{2})$ does not satisfy $〈 A_{2} \to B_{2}; id 〉$ , because $A_{2} (x_{3}) = 1 ≰ B_{2} (y_{2}) = 0.5$ .
$(x_{1}, y_{3})$ and $(x_{2}, y_{3})$ do not satisfy $〈 A_{1} \to B_{1}; id 〉$ , since $A_{1} (x_{1}) = A_{1} (x_{2}) = 1 ≰ B_{1} (y_{3}) = 0$ .
Finally, $(x_{3}, y_{3})$ does not satisfy $〈 A_{2} \to B_{1}; \frac{id}{2} 〉$ , since $\frac{id}{2} (A_{2} (x_{3})) = 0.5 ≰ B_{1} (y_{3}) = 0$ .

In conclusion, there is no pair

(x, y) \in X \times Y

that belongs to a model of

K B

; in other words, the only model is the empty set,

M_{Γ} = {\emptyset}

.

Now, let us define in

{LU}_{F H}

the set of concepts

Π = {C_{A_{1}}, C_{A_{2}}, C_{B_{1}}, C_{B_{2}}}

associated with each one of the elements of

A_{X}

and

B_{Y}

, respectively, and consider the T-Box defined as in Proposition 3:

T_{Γ} = \{〈^{m_{1}} C_{A_{1}} ⊑ C_{B_{1}} 〉, 〈^{m_{2}} C_{A_{1}} ⊑ C_{B_{2}} 〉, 〈^{m_{2}} C_{A_{2}} ⊑ C_{B_{1}} 〉, 〈^{m_{1}} C_{A_{2}} ⊑ C_{B_{2}} 〉\} .

Given the interpretation

I = (Δ^{I}, \cdot^{I})

where

$Δ^{I} = {u}$ ;
$C^{I} (u) = 0.5$ for all $u \in Δ^{I}$ and for every concept $C \in Π$ ;
${m_{1}}^{I} = id$ and ${m_{2}}^{I} = \frac{id}{2}$ ;

it can be easily checked that

I

satisfies every concept specialization in

T_{Γ}

. Therefore,

I

is a model of

T_{Γ}

.

4.2. Adding Inputs to $K B$ s: Defining Consequences

In the previous subsection, we described the structure of

K B

; now, let us explain how to reason with it. The purpose of this fuzzy inference system is to draw conclusions from a

K B

and a certain input. Let us begin by defining what is a

K B

-input on a frame.

Definition 17.

A

K B

-inputon the frame

(X, Y, A_{X}, B_{Y})

is a fuzzy set defined either on X or on Y.

Thanks to Proposition 3, it looks natural to incorporate inputs to

K B

s by giving a similar semantics to assertions in

{LU}_{L H}

. Since assertions in

{LU}_{L H}

resemble the so-called

α

-cuts in fuzzy set theory, our semantics considers

α

-models that represent the

α

-cuts of a (fuzzy) model.

Definition 18.

Let A (resp., B) be a

K B

-input on a frame

(X, Y, A_{X}, B_{Y})

such that

A \in F (X)

(resp.,

B \in F (Y)

), and let

α \in [0, 1]

. A subset of pairs

M \subseteq X \times Y

is an

α

-model of A (resp., B) if and only if

M \subseteq {(x, y) \in X \times Y ∣ A (x) \geq α}

if and only if

M \subseteq {(x, y) \in X \times Y ∣ B (y) \geq α}

).

Under the previous definition of an

α

-model,

K B

-inputs can be interpreted as constraints that bound the pairs of elements in

X \times Y

. Note that the greater the

K B

-input (as a fuzzy set), the weaker the restriction, since it admits more

α

-models. On the other hand, the value

α

also determines a restriction in the sense that the greater the value

α

, the stronger the restriction imposed to pairs in

X \times Y

to be in an

α

-model.

Note that from a theoretical point of view, we can assume theoretically that only two inputs can be considered: one fuzzy set on the universe X and another on Y. In other words, if we are interested in reasoning with two different fuzzy sets

A_{1}

and

A_{2}

on X, this is equivalent to considering the fuzzy set

A_{1} \cap A_{2}

as a

K B

-input.

Proposition 4.

Let

A_{1}

and

A_{2}

be two

K B

-inputs on a frame

(X, Y, A_{X}, B_{Y})

, both of them defined on the same universe. A subset of pairs

M \subseteq X \times Y

is an α-model of

A_{1}

and

A_{2}

if and only if M is an α-model of

A_{1} \cap A_{2}

.

Proof.

Let

A_{1}, A_{2} \in F (X)

and let M be an

α

-model of

A_{1}

and

A_{2}

. By definition of an

α

-model, we have that for all

(x, y) \in M

,

A_{1} (x) \geq α

and

A_{2} (x) \geq α

. That is equivalent to saying

A_{1} \cap A_{2} (x) = min {A_{1} (x), A_{2} (x)} \geq α .

Therefore, M is a model of

A_{1} \cap A_{2}

. □

As a consequence of the previous result, we can combine different fuzzy sets defined on the same universe into one

K B

-input and keep the semantics. Thus, as mentioned before, from a theoretical point of view, only two inputs are possible, one fuzzy set on the universe X and another on the universe Y. For the sake of simplicity in this approach, we only consider one

K B

-input for reasoning.

The following result shows that the set of

α

-models has a complete lattice structure.

Proposition 5.

Let A be a

K B

-input on a frame

(X, Y, A_{X}, B_{Y})

. Fixing

α \in [0, 1]

, the set of α-models of A has the structure of a complete lattice.

It is also convenient for the reader to keep the following meaning of a

K B

-input A: the fuzzy set A determines the set of possible values of X (or Y) a system may consider for reasoning with. In this way, we focus on determining the greater set of possible values in

X \times Y

that satisfy the constraints represented by the rules in both a

K B

and in a

K B

-input. The following definition determines the semantics for the fusion of the knowledge represented by a

K B

and the restriction imposed by a

K B

-input as a fuzzy set on

X \times Y

.

Definition 19.

Let Γ be a

K B

, and let A be a

K B

-input on a frame

(X, Y, A_{X}, B_{Y})

. A model of

Γ \cup {A}

is a fuzzy set M on the universe

X \times Y

such that for each

α > 0

, the α-cut

M^{α} = {(x, y) \in X \times Y ∣ M (x, y) \geq α}

is a model of Γ and an α-model of A.

Note that a model of the pair of a

K B

Γ

and a

K B

-input A is a fuzzy set constructed by

α

-cuts that intersect models of

Γ

and

α

-models of A. The following result shows an equivalent definition of a model that is used in some proofs of further results.

Theorem 6.

Let Γ be a

K B

, and let A be a

K B

-input on a frame

(X, Y, A_{X}, B_{Y})

. M is a model of

Γ \cup {A}

if and only if

s u p p (M)

is a model of Γ and

M (x, y) \leq A (x)

.

Proof.

Let us start by proving the backward implication, as it is more straightforward:

⇐

Let

M \in F (X \times Y)

, and let us assume that

s u p p (M)

is a model of

Γ

and that

M (x, y) \leq A (x)

for all

(x, y) \in X \times Y

. Let us prove that M is a model of

Γ \cup {A}

. Consider the

α

-cut

M^{α}

with fixed

α > 0

:

–: First, since $M^{α} \subseteq s u p p (M) = {(x, y) \in X \times Y ∣ M (x, y) > 0}$ and $s u p p (M)$ is a model of $Γ$ , $M^{α}$ is also a model of $Γ$ .
–: Second, for each $(x, y) \in M^{α}$ , we have that $α \leq M (x, y) \leq A (x)$ by assumption. Then, $α \leq A (x)$ for all $(x, y) \in M^{α}$ . In consequence, $M^{α} \subseteq {(x, y) \in X \times Y ∣ A (x) \geq α}$ , namely, $M^{α}$ is an $α$ -model of A.

⇒

Assume that

M \in F (X \times Y)

is a model of

Γ \cup {A}

. Then,

–: On the one hand, by Definition 19, every $M^{α}$ with $α > 0$ is a model of $Γ$ . Moreover, the support of a fuzzy set can be characterized by

$s u p p (M) = ⋃_{α > 0} M^{α} .$

Since, by Theorem 5, $(M_{Γ}, \subseteq)$ is a complete lattice, $s u p p (M)$ is also a model of $Γ$ .
–: On the other hand, by Definitions 18 and 19, every $M^{α}$ with $α > 0$ is also an $α$ -model of A, which means that $M^{α} \subseteq {(x, y) \in X \times Y ∣ A (x) \geq α}$ . Let us prove by reductio ad absurdum that $M (x, y) \leq A (x)$ for all $(x, y) \in X \times Y$ . Suppose that there exists an element $(x_{0}, y_{0}) \in X \times Y$ such that

$β = M (x_{0}, y_{0}) > A (x_{0})$

for certain $β \in (0, 1]$ . Then, by definition, $(x_{0}, y_{0})$ belongs to the $β$ -cut of M, i.e., $β \in M^{β}$ . By assumption, $M^{β}$ is a $β$ -model of A, which implies

$(x_{0}, y_{0}) \in M^{β} \subseteq {(x, y) \in X \times Y ∣ A (x) \geq β}$

which implies that $A (x_{0}) \geq β$ . This leads us to a contradiction since

$β \leq A (x_{0}) < M (x_{0}, y_{0}) = β$

which completes the proof.

□

In Theorem 5, it was already demonstrated that for every

K B

, there exists a greatest model associated with it. The following theorem presents an analogous result for models

K B

s with

K B

-inputs.

Theorem 7.

Let Γ be a

K B

, and let A be a

K B

-input on a frame

(X, Y, A_{X}, B_{Y})

. The set of models of

Γ \cup {A}

, denoted by

M_{Γ, A}

, has a complete lattice structure with the standard Zadeh’s ordering between fuzzy sets (i.e.,

A \leq B

if and only if

A (x) \leq B (x))

.

Proof.

Let us begin by proving that

M_{Γ, A}

is closed by the union of arbitrary models. Let

{M_{i}}_{i \in I} \in M_{Γ, A}

, and let us show that both fuzzy sets

M_{\cup} = ⋃_{i \in I} M_{i}

and

M_{\cap} = ⋂_{i \in I} M_{i}

are models of

Γ \cup {A}

, i.e., they are in

M_{Γ, A}

. By Theorem 6:

We know that $M_{i} (x, y) \leq A (x)$ for all $i \in I$ and $(x, y) \in X \times Y$ ;
The result is reduced to prove that prove that $M_{\cup} (x, y) \leq A (x)$ and $M_{\cap} (x, y) \leq A (x)$ for all $(x, y) \in X \times Y$ .

Let

(x, y) \in X \times Y

; then,

\begin{matrix} M_{\cup} (x, y) & = sup_{i \in I} M_{i} (x, y) \leq A (x); \\ M_{\cap} (x, y) & = inf_{i \in I} M_{i} (x, y) \leq A (x) . \end{matrix}

This is what we wanted to prove. □

It it straightforward to check that the empty set is the least model of

M_{Γ, A}

. The following theorem shows the expression of the greatest model of

M_{Γ, A}

.

Theorem 8.

Let Γ be a

K B

and A be a

K B

-input on a frame

(X, Y, A_{X}, B_{Y})

. The greatest model of

Γ \cup {A}

, denoted by

M_{Γ, A}

, is the fuzzy set given by the restriction of the support of A to the greatest model of Γ:

M_{Γ, A} (x, y) = \{\begin{matrix} A (x) & if (x, y) \in M_{Γ} \\ 0 & otherwise . \end{matrix}

(2)

Proof.

Let us prove that

M_{Γ, A}

is a model of

Γ \cup {A}

and that every model M of

Γ \cup {A}

satisfies

M \leq M_{Γ, A}

. Then, necessarily,

M_{Γ, A}

is the greatest model of

Γ \cup {A}

.

By Theorem 6,

M_{Γ, A}

is a model of

Γ \cup {A}

, since by definition,

M_{Γ, A} (x, y) \leq A (x)

for all

(x, y) \in X \times Y

.

On the other hand, let M be a model of

M_{Γ, A}

and let us assume by reductio ad absurdum that there exists

(x_{0}, y_{0}) \in X \times Y

such that

M (x_{0}, y_{0}) > M_{Γ, A} (x_{0}, y_{0})

. By definition, we have that

M (x_{0}, y_{0}) > M_{Γ, A} (x_{0}, y_{0}) = A (x_{0})

, which contradicts the fact that M is a model according to the characterization of Theorem 6. □

We can now introduce the notion of logical consequence as usual in formal logic theories.

Definition 20.

Let Γ be a

K B

, and let A be a

K B

-input on a frame

(X, Y, A_{X}, B_{Y})

. We say that a fuzzy set

B \in F (X)

(resp.,

B \in F (Y)

) is a consequence of

Γ \cup {A}

, denoted by

Γ \cup {A} ⊧ B

, if for every model M of

Γ \cup {A}

, we have

B (x) \geq M (x, y)

for all

(x, y) \in X \times Y

(resp.,

B (y) \geq M (x, y)

for all

(x, y) \in X \times Y

).

Thanks to the complete lattice structure of the set of models of

Γ \cup {A}

(see Theorem 7), we can reduce the validation of consequences to checking the greatest model of

Γ \cup {A}

.

Theorem 9.

Let Γ be a

K B

, and let A be a

K B

-input on a frame

(X, Y, A_{X}, B_{Y})

. A fuzzy set

B \in F (X)

(resp.,

B \in F (Y)

) is a consequence of

Γ \cup {A}

if and only if we have

B (x) \geq M_{Γ, A} (x, y)

for all

(x, y) \in X \times Y

(resp.,

B (y) \geq M_{Γ, A} (x, y)

), where

M_{Γ, A}

is the greatest model of

Γ \cup {A}

.

Proof.

Suppose that

B \in F (X)

(the proof for the case

B \in F (Y)

is similar) and that

Γ \cup {A} ⊧ B

. Then, every model M of

Γ \cup {A}

satisfies

B (x) \geq M (x, y)

for all

(x, y) \in X \times Y

. In particular, that inequality holds for the greatest model

M_{Γ, A}

. That is,

B (x) \geq M_{Γ, A} (x, y)

for all

(x, y) \in X \times Y

.

Conversely, let us assume that

B (y) \geq M_{Γ, A} (x, y)

for all

(x, y) \in X \times Y

. Given a model M of

Γ \cup {A}

, we have that

M (x, y) \leq M_{Γ, A} (x, y)

for all

(x, y) \in X \times Y

, since

M_{Γ, A}

is the greatest model of

(M_{Γ, A}, \leq)

. Therefore,

B (y) \geq M_{Γ, A} (x, y) \geq M (x, y)

for all

(x, y) \in X \times Y

and

Γ \cup {A} ⊧ B

. □

From the previous characterization, it is obvious that in order to determine consequences, we can focus only on the greatest model of

Γ \cup {A}

. The following result shows that when considering a

K B

with a simple condition of coherence, determining whether a fuzzy set defined on the same universe as the

K B

-input is a consequence is trivial.

Theorem 10.

Let Γ be a

K B

on a frame

(X, Y, A_{X}, B_{Y})

. Let us assume that for each

x \in X

and

y \in Y

, there exists

y_{x} \in Y

and

x_{y} \in X

such that

{(x, y_{x}), (x_{y}, y)}

is a model of Γ. Then:

Given $A, B \in F (X)$ , $Γ \cup {A} ⊧ B$ if and only if $A \leq B$ ;
Given $A, B \in F (Y)$ , $Γ \cup {A} ⊧ B$ if and only if $A \leq B$ .

Proof.

Let us prove the first item, since the second is proved similarly. Let

M_{Γ, A}

be the greatest model of

Γ \cup {A}

. From Theorem 8, we have that

M_{Γ, A} (x, y) = A (x)

for all

(x, y) \in M_{Γ}

and

M_{Γ, A} (x, y) = 0

if

(x, y) \notin M_{Γ}

. Let

B \in F (X)

such that

Γ \cup {A} ⊧ B

. Then, by Theorem 9, that is equivalent to say that

M_{Γ, A} (x, y) \leq B (x)

. Then, for each

x \in X

, and choosing

y_{x} \in Y

such that

(x, y_{x}) \in M_{Γ}

, we have

A (x) = M_{Γ, A} (x, y_{x}) \leq B (x),

which is equivalent to say that

A \leq B

. □

From the previous result we have the following: given a

K B

Γ

on a frame

(X, Y, A_{X}, B_{Y})

and a

K B

-input

A \in F (X)

, the only non-trivial consequences of

Γ \cup {A}

are those fuzzy sets defined on Y. The following proposition shows that there is a monotonicity on the set of consequences with respect to the ordering of fuzzy sets with respect to Zadeh’s ordering.

Proposition 6.

Let Γ be a

K B

and

A \in F (X)

be a

K B

-input on a frame

(X, Y, A_{X}, B_{Y})

. Let

B, C \in F (Y)

such that

B \subseteq C

and

Γ \cup {A} ⊧ B

; then,

Γ \cup {A} ⊧ C

.

Proof.

Consider a model M of

Γ \cup {A}

. If

Γ \cup {A} ⊧ B

, then (Theorem 9)

B (y) \geq M (x, y)

for all

(x, y) \in X \times Y

. Since

B (y) \leq C (y)

for all

y \in Y

, then

C (y) \geq M (x, y)

for all

(x, y) \in X \times Y

. Again, by Theorem 9, we can conclude that

Γ \cup {A} ⊧ C

. □

It can be also proved that the set of consequences of

Γ \cup {A}

inherits the complete lattice structure from the set of models of

Γ \cup {A}

.

Proposition 7.

Let Γ be a

K B

and

A \in F (X)

be a

K B

-input on a frame

(X, Y, A_{X}, B_{Y})

. The set of consequences of

Γ \cup {A}

, i.e., the set

{B \in F (Y) ∣ Γ \cup {A} ⊧ B}

, has a complete lattice structure with greatest element Y.

Proof.

Let

C

be the set of consequences of

Γ \cup {A}

, and let

{B_{i}}_{i \in I} \subseteq C

. By Proposition 6, we have directly that

⋃_{i \in I} B_{i}

is a consequence of

Γ \cup {A}

.

Let us prove now that

⋂_{i \in I} B_{i}

is a consequence of

Γ \cup {A}

as well. By definition of a consequence, we have that for all

B_{i}

’s and all models M of

Γ \cup {A}

, we have

B_{i} (y) \geq M (x, y)

for all

(x, y) \in X \times Y

. As a result,

⋂_{i \in I} B_{i} (y) \geq M (x, y)

for all models M and all

(x, y) \in X \times Y

. That is,

⋂_{i \in I} B_{i}

is a consequence of

Γ \cup {A}

. □

Corollary 2.

Let Γ be a

K B

and

A \in F (X)

be a

K B

-input on a frame

(X, Y, A_{X}, B_{Y})

. Let

B, C \in F (Y)

be two fuzzy sets such that

Γ \cup {A} ⊧ B

and

Γ \cup {A} ⊧ C

; then,

$Γ \cup {A} ⊧ B \cap C$ ;
$Γ \cup {A} ⊧ B \cup C$ .

4.3. Inference Engine: Links with ${LU}_{F H}$

In the previous section, we showed that given a

K B

Γ

and a

K B

-input

A \in F (X)

, the set of consequences was determined by the greatest model of

Γ \cup {A}

or by the least consequence on

F (Y)

. Determining the greatest model and checking whether a fuzzy set is a consequence by comparison with it may be complex from a computational point of view. In this section, we propose a simple inference process to obtain correct consequences that upper-bound the least consequence of

Γ \cup {A}

.

In order to keep the support of our fuzzy inference system on the Description Logic

{LU}_{F H}

(and its power of reasoning), it would be desirable to link the models of both constructions, as linked in the previous section by Proposition 3. The following theorem presents an analogous result for a

K B

and a

K B

-input by fixing elements in

α

-cuts of models.

Theorem 11.

Let Γ be a

K B

and let

A \in F (X)

be a

K B

-input on a frame

(X, Y, A_{X}, B_{Y})

. Let

A_{Γ}

be the A-Box of

{LU}_{F H}

constructed as follows:

For each $A_{i}$ in $A_{X}$ , consider the concept $C_{A_{i}}$ ;
For A in $F (X)$ , consider the concept $C_{A}$ ;
For each concept $C_{A_{i}}$ , consider the pair of concept modifiers $〈 m_{i}, {\bar{m}}_{i} 〉$ ;
For each concept $C_{A_{i}}$ , consider in $A_{Γ}$ the assertion

$〈^{{\bar{m}}_{i}} C_{A_{i}} (a) α 〉,$

where $α \in (0, 1]$ .

Consider a non-empty model M of

Γ \cup {A}

, its α-cut

M^{α}

, and

(x_{0}, y_{0}) \in M^{α}

. Then, the interpretation

I = (M^{α}, \cdot^{I})

given in Proposition 3 plus

$a^{I} = (x_{0}, y_{0})$ and
$m_{i}^{I} = {Inc}_{G} (A, A_{i})$

is a model of

A_{Γ}

.

Proof.

By definition of the f-index of inclusion,

{Inc}_{G} (A, A_{i}) (A (x)) \leq A_{i} (x)

for all

A_{i} \in F (X)

and for every element

(x, y)

of

M^{α}

. Then, the interpretation

I = (M^{α}, \cdot^{I})

satisfies

m_{i}^{I} (C_{A}^{I} (x, y)) \leq C_{A_{i}}^{I} (x, y)

for all

(x, y) \in M^{α}

. In other words, it satisfies the concept specialization

{}^{m_{i}}C_{A} ⊑ C_{A_{i}}

. Then, by Theorem 2,

I

must also satisfy

C_{A} ⊑ {}^{{\bar{m}}_{i}}C_{A_{i}}

.

At the same time, since the instance a of

C_{A i}

satisfies

a^{I} = (x_{0}, y_{0}) \in M^{α}

and

M^{α}

is an

α

-model of A,

C_{A}^{I} (a^{I}) = A (x_{0}) \geq α

. Therefore, given the assertion

〈^{{\bar{m}}_{i}} C_{A_{i}} (a), α 〉

:

α \leq C_{A}^{I} (a^{I}) \leq {\bar{m}}_{i}^{I} (C_{A_{i}}^{I} (a^{I})) .

In conclusion,

I

satisfies the assertion

〈^{{\bar{m}}_{i}} C_{A_{i}} (a), α 〉

for all

A_{i} \in A_{X}

, and therefore,

I

is a model of

A_{Γ}

. □

Note that

K B

-inputs

A \in F (X)

in a

K B

and a frame

(X, Y, A_{X}, B_{Y})

are translated as a series of assertions in Description Logic through concept modifiers

{Inc}_{G} (A, A_{i})

for each one of the elements of the partition

A_{i} \in A_{X}

. That is to say, for each

K B

-input,

A \in F (X)

is considered an A-Box consisting of as many assertions as there are elements in the partition

A_{X}

. This identification is carried out individually for each pair

(x, y)

in the support of the model

M \in M_{Γ, A}

, taking into account the

α

-cuts of M due to the specific characteristics of Description Logic, i.e., DL only allows for the inclusion of a countable set of instances, whereas the universes X and Y can be of a fundamentally different nature, such as

R

. However, this identification of the instance a (in

{LU}_{F H}

) with its corresponding

(x, y)

is fixed but arbitrary within the

α

-cut

M^{α}

, as is the choice of the value

α \in (0, 1]

. This makes it possible to translate the assertions

〈 C_{A} (a), α 〉

defined in

{LU}_{F H}

into a fuzzy set A defined on a universe X or Y.

The next result justifies the use of reasoning tools from Description Logic within our framework, and in particular, the application of the Generalized Modus Ponens given in Theorem 4.

Corollary 3.

Let Γ be a

K B

and let A be a

K B

-input on a frame

(X, Y, A_{X}, B_{Y})

. Let

T_{Γ}

be the T-Box of

{LU}_{F H}

and

A_{Γ}

the A-Box of

{LU}_{F H}

constructed as in Proposition 3 and Theorem 11, respectively. Then, all fuzzy assertions entailed by the set of axioms

〈 T_{Γ}, A_{Γ} 〉

(on the context of

{LU}_{F H}

) form a correct inference from Γ and A (interpreted as

K B

and

K B

-input).

Proof.

Let

〈 C (a), β 〉

be a fuzzy assertion entailed by

〈 T_{Γ}, A_{Γ} 〉

, which means that every model of

〈 T_{Γ}, A_{Γ} 〉

satisfies

〈 C (a), β 〉

as well. Then, given a model M of

Γ \cup {A}

and given the interpretation

I = (M^{α}, \cdot^{I})

, with

α \in (0, 1]

, defined in Proposition 3 and Theorem 11, it is guaranteed that

I

is a model of

〈 T_{Γ}, A_{Γ} 〉

; therefore, it also satisfies

〈 C (a), β 〉

. As

C^{I} (a^{I}) \geq β

and

a^{I} \in M^{α}

, it can be concluded that

M^{α}

is a

β

-model of

C^{I}

. □

As a consequence of the previous corollary, we can apply tools of inference from the Description Logic

{LU}_{F H}

on

K B

and

K B

-inputs. Among all of them, we are interested here on the Generalized Modus Ponens (GMP) described on Section 3.3. Specifically, we can join all the possible GMP applicable on the T-Box and A-Box constructed according to Proposition 3 and Theorem 11 and obtain the following inference on

K B

, which is the core of the inference engine considered in our approach.

Theorem 12.

Let Γ be a

K B

and let A be a

K B

-input on a frame

(X, Y, A_{X}, B_{Y})

, where

A \in F (X)

. Then, for all

〈 A_{i} \to B_{j}; f_{i j} 〉 \in Γ

, we have

{〈 A_{i} \to B_{j}; f_{i j} 〉, A} ⊧ B_{i j},

where

B_{i j} \in F (Y)

is the fuzzy set given by

B_{i j} (y) = {\bar{Inc}}_{G} (A, A_{i}) \circ {\bar{f}}_{i j} (B_{j} (y)),

(3)

where

{\bar{f}}_{i j}

is the only mapping such that

(f_{i j}, {\bar{f}}_{i j})

forms an adjoint pair.

Proof.

Consider in

{LU}_{F H}

the T-Box

T_{Γ}

and the A-Box

A_{Γ}

constructed as in Proposition 3 and Theorem 11, respectively, by a fixed model M of

Γ \cup {A}

and a fixed

α

-cut of M,

M^{α}

, with

α \in (0, 1]

. Given the construction of both sets, one can rearrange them in pairs of elements

(〈 {}^{m_{i j}}C_{A_{i}} ⊑ C_{B_{j}} 〉, 〈 {}^{{\bar{m}}_{i}}C_{A_{i}} (a), α 〉) \in T_{Γ} \times A_{Γ}

related through the concept specialization antecedent

C_{A_{i}}

. Then, by applying the Generalized Modus Ponens given in Theorem 4, we have

〈 {}^{m_{i j}}C_{A_{i}} ⊑ C_{B_{j}} 〉, 〈 {}^{{\bar{m}}_{i}}C_{A_{i}} (a), α 〉 ⊧ 〈 {}^{{\bar{m}}_{i} \circ {\bar{m}}_{i j}}C_{B_{i j}} (a), α 〉,

where the concept

{}^{{\bar{m}}_{i} \circ {\bar{m}}_{i j}}C_{B_{i j}}

obtained as an output is interpreted as

{(^{{\bar{m}}_{i} \circ {\bar{m}}_{i j}} C_{B_{i j}})}^{I} (x, y) = {\bar{Inc}}_{G} (A, A_{i}) \circ {\bar{f}}_{i j} (B_{j} (y))

for all

(x, y) \in M

. If we denote this fuzzy set as

B_{i j}

, it is concluded that

M^{α}

is an

α

-model of

B_{i j}

. □

The previous result leads to the application of the following inference engine for a frame

(X, Y, A_{X}, B_{Y})

and a

K B

. Given a

K B

-input

A \in F (X)

, for each pair

(A_{i}, B_{j}) \in A_{X} \times B_{Y}

, there exists a rule

〈 A_{i} \to B_{j}; f_{i j} 〉

from which we can obtain the following inference:

B_{i j} (y) = {\bar{Inc}}_{G} (A, A_{i}) \circ {\bar{f}}_{i j} (B_{j} (y)) \in F (Y),

where

\bar{f}

is the only mapping such that

(f, \bar{f})

is an adjoint pair, as is

{\bar{Inc}}_{G} (A, A_{i})

with respect to

{Inc}_{G} (A, A_{i})

, i.e., the f-index of inclusion of A into

A_{i}

is restricted to the set

G

(Definition 6).

Thanks to Theorem 12, we obtain the consequence

Γ \cup {A} ⊧ B_{i j}

for each pair

(A_{i}, B_{j}) \in A_{X} \times B_{Y}

, where

A_{X} = {A_{i}}_{i \in I}

and

B_{Y} = {B_{j}}_{j \in J}

. By Corollary 2, we can conclude that the intersections and unions of the

B_{i j}

are also consequences. Since our goal is to find the smallest set that is a consequence of

Γ

and A, the inference engine takes the infimum of all of them. This leads to our inference engine:

B (y) = (⋂_{\begin{matrix} i \in I, \\ j \in J \end{matrix}} B_{i j}) (y) = inf_{\begin{matrix} A_{i} \in A_{X}, \\ B_{j} \in B_{Y} \end{matrix}} {\bar{Inc}}_{G} (A, A_{i}) \circ {\bar{f}}_{i j} (B_{j} (y)),

(4)

which is called the

K B

-output of

Γ \cup {A}

. The next result states that the fuzzy set obtained by our inference engine is actually a consequence of a

K B

and a

K B

-input.

Corollary 4.

Let Γ be a

K B

and let

A \in F (X)

be a

K B

-input on a frame

(X, Y, A_{X}, B_{Y})

. Then, the fuzzy set

B \in F (Y)

defined by Equation (4) is a consequence of

Γ \cup {A}

, that is,

Γ \cup {A} ⊧ B

.

Proof.

As a consequence of Theorem 12, given a model of

Γ \cup {A}

, every

α

-model of A with

α \in (0, 1]

is also an

α

-model if

B_{i j}

, which means that every

(x, y) \in M^{α}

satisfies

B_{i j} (y) \geq α

. Then, the fuzzy set defined in Equation (4) also satisfies

B (y) = (⋂_{\begin{matrix} i \in I, \\ j \in J \end{matrix}} B_{i j}) (y) = inf_{\begin{matrix} i \in I, \\ j \in J \end{matrix}} B_{i j} (y) \geq α

for every

(x, y) \in M^{α}

. In consequence,

M^{α}

is an

α

-model of the

K B

-output B. □

Let us look at a pair of examples applying this inference engine to the

K B

defined previously in Example 3. First, we study the case where the

K B

-input introduced is a singleton. Second, the case where the

K B

-input is a more general fuzzy set is studied.

Example 5.

Let

(X, Y, A_{X}, B_{Y})

be the frame and Γ the

K B

defined in Example 3, where a study on the heart rate (in bpm) (Y) in children between 1 and 9 years old (X) is conducted. In this example, the greatest model of Γ,

M_{Γ}

, was plotted in Figure 6. Suppose that we want to study the possible heart rate of a child who is 30 months old, that is, two and a half years old. This case is translated on the frame

(X, Y, A_{X}, B_{Y})

as a crisp singleton

K B

-input

A \in F (X)

, whose membership function is given below in Figure 7:

\begin{matrix} A (x) = \{\begin{matrix} 1 & if x = 2.5 \\ 0 & otherwise \end{matrix} \end{matrix}

The inference engine consists of the application of the GMP based on the f-index of inclusion for each of the elements in Γ. Since

A_{X}

has 4 elements and

B_{Y}

has 3 elements, we apply a total of

4 \cdot 3 = 12

GMP as described in Theorem 12:

{〈 A_{i} \to B_{j}; f_{i j} 〉, A} ⊧ B_{i j},

where

B_{i j} \in F (Y)

is described in terms of Equation (3):

B_{i j} = {\bar{Inc}}_{G} (A, A_{i}) \circ \bar{f} (B_{j} (y)) .

Let us recall that

\bar{f}

is the only mapping such that

(f_{i j}, {\bar{f}}_{i j})

forms an adjoint pair, as is

{\bar{Inc}}_{G} (A, A_{i})

with respect to the f-index of inclusion restricted to

G

of A in

A_{i}

. Therefore, in order to apply the GMP, it is necessary to compute each of the mappings

{\bar{f}}_{i j}

and

{\bar{Inc}}_{G} (A, A_{i})

. The following formula [30] allows us to compute the the adjoint pair associated with each

f_{i j}

that appears in Γ by a straightforward manual calculation:

{\bar{f}}_{i j} (α) = sup_{β \in [0, 1]} {f_{i j} (β) \leq α}

(5)

which results in the Table 5.

Where

{\bar{g}}_{n} (α) = \{\begin{matrix} n & if α < n \\ 1 & if α \geq n, \end{matrix}

{\bar{f}}_{n} (α) = \{\begin{matrix} α & if α < n \\ 1 & if α \geq n, \end{matrix}

and

1 (α) = 1

for all

α \in [0, 1]

.

Secondly, in order to apply the inference engine, the f-index of inclusion restricted to

G

of the

K B

-input A in each of the elements of the partition

A_{i} \in A_{X}

must be computed. The results obtained are as follows:

{Inc}_{G} (A, A_{1}) = f_{0.75}, {Inc}_{G} (A, A_{2}) = f_{0.25}, {Inc}_{G} (A, A_{3}) = {Inc}_{G} (A, A_{4}) = 0 .

Nevertheless, the GMP defined in Theorem 12 employs the adjoint pair associated with each of these f-inclusions. By using the formula described in Equation (5), adapted for each

{Inc}_{G} (A, A_{i})

, we obtain each

{\bar{Inc}}_{G} (A, A_{i})

, with

i = 1, \dots, 4

:

{\bar{Inc}}_{G} (A, A_{1}) = {\bar{f}}_{0.75}, {\bar{Inc}}_{G} (A, A_{2}) = {\bar{f}}_{0.25}, {\bar{Inc}}_{G} (A, A_{3}) = {\bar{Inc}}_{G} (A, A_{4}) = 1 .

Now, we can proceed to apply the GMP based on the f-index of inclusion restricted to

G

. Let us first show an example of computation of this inference rule, considering the rule

〈 A_{1} \to B_{3}; f_{0.5} 〉

. The f-index of inclusion of the

K B

-input A into

A_{1}

is

f_{0.75}

; then, according to Theorem 12, the result of applying GMP is

B_{13} (y) = {\bar{f}}_{0.75} \circ {\bar{f}}_{0.5} (B_{3} (y)) .

Note that this composition of mappings is applied to

B_{3}

; in other words, it does not depend on the

K B

-input A. Recall that the membership function of

B_{3}

is:

B_{3} (y) = \{\begin{matrix} 0 & if y \in [70, 100] \\ \frac{y - 100}{30} & if y \in (100, 130] . \end{matrix}

Let us compute the output step by step. The result of applying this fuzzy set to the mapping

{\bar{f}}_{0.5}

is:

{\bar{f}}_{0.5} (B_{3} (y)) = \{\begin{matrix} B_{3} (y) & if y \in [70, 115) \\ 1 & if y \in [115, 130] \end{matrix} = \{\begin{matrix} 0 & if y \in [70, 100] \\ \frac{y - 100}{30} & if y \in (100, 115) \\ 1 & if y \in [115, 130] . \end{matrix}

Finally, the output of this GMP is:

{\bar{f}}_{0.75} \circ {\bar{f}}_{0.5} (B_{3} (y)) = \{\begin{matrix} {\bar{f}}_{0.5} (B_{3} (y)) & if y \in [70, 122.5) \\ 1 & if y \in [122.5, 130] \end{matrix} = \{\begin{matrix} 0 & if y \in [70, 100] \\ \frac{y - 100}{30} & if y \in (100, 115) \\ 1 & if y \in [115, 130] . \end{matrix}

In other words, applying the mapping

{\bar{f}}_{0.75}

does not affect the final result.

In the vast majority of cases, we obtain the output

B_{i j} = Y

due to the appearance of the top element

1

as one of the components of the function composition. All these cases are consequences of two possibilities: either the respective rule in Γ determines no restriction or the inclusion of A into

A_{i}

is null. Recall that the final inference is the intersection of all the outputs

B_{i j}

; hence, all these cases are degenerate and do not produce any restriction. Accordingly, only those outputs where the mapping

1

is not involved in the application of the GMP are shown below:

\begin{matrix} B_{13} (y) = {\bar{f}}_{0.75} \circ {\bar{f}}_{0.5} (B_{3} (y)) = \{\begin{matrix} B_{1} (y) & if y \in [70, 115) \\ 1 & if y \in [115, 130], \end{matrix} \\ B_{22} (y) = {\bar{f}}_{0.25} \circ {\bar{f}}_{0.5} (B_{2} (y)) = \{\begin{matrix} B_{2} (y) & if y \in [70, 77.5) \\ 1 & if y \in [77.5, 122.5] \\ B_{2} (y) & if y \in [122.5, 130), \end{matrix} \\ B_{23} (y) = {\bar{f}}_{0.25} \circ {\bar{g}}_{0.25} (B_{3} (y)) = 1 (i . e ., B_{23} = Y) \end{matrix}

Once each of the inferences

B_{i j}

is obtained, the

K B

-output is given by the fuzzy set B obtained by computing the intersection of each

B_{i j}

. Its membership function is given by:

B (y) = ⋂_{\begin{matrix} i \in {1, 2, 3, 4} \\ j \in {1, 2, 3} \end{matrix}} B_{i j} (y) = \{\begin{matrix} 0 & if y \in [70, 100] \\ \frac{x - 100}{30} & if y \in (100, 115) \\ 1 & if y \in [115, 122.5] \\ \frac{130 - x}{30} & if y \in (122.5, 130] . \end{matrix}

Figure 8 shows the membership function of this fuzzy set.

Finally, let us study the greatest model of

Γ \cup {A}

. Unlike the greatest model of Γ,

M_{Γ, A}

is a fuzzy set, and therefore, it may be difficult to represent its membership function on the plane

X \times Y

; instead, we can consider their α-cuts to study the relation between the

K B

-output and the greatest model. Since the

K B

-input A is a crisp singleton (i.e., A only takes values in

{0, 1}

), let us consider the core (i.e., the 1-cut) of the

K B

-output B, which is represented in Figure 9. Its algebraic expression is:

c o r e (M_{Γ, A}) = {(x, y) \in X \times Y ∣ x = 2.5, y \in [115, 122.5]} .

(6)

Observe that for every element

(x, y) \in c o r e (M_{Γ, A})

, the X-component is contained in the core of A, and the Y-component is contained in the core of B, which is the interval

[115, 122.5]

, that is,

c o r e (M_{Γ, A}) = c o r e (K B - input) \times c o r e (K B - output) .

Later on, we demonstrate that this phenomenon is not a coincidence. This result is used to justify the defuzzification procedure of our inference system, as can be seen in the following section.

Let us see now another example of application of the inference engine, this time involving a fuzzy set as

K B

-input with a continuous membership function.

Example 6.

Consider the same frame

(X, Y, A_{X}, B_{Y})

and the same

K B

Γ defined in Examples 3 and 5. Suppose now that we want to study the heart rate of a child who is “approximately seven years old”. This statement is represented by the

K B

-input

A \in F (X)

defined in Figure 10.

Just like in the previous example, 12 GMP based on the f-index of inclusion have to be carried out. Each one generates an output

B_{i j}

given by the following expression:

B_{i j} = {\bar{Inc}}_{G} (A, A_{i}) \circ \bar{f} (B_{j} (y)),

where

A_{i} \in A_{X}

and

B_{j} \in B_{Y}

. Let us now compute each f-index of inclusion of the

K B

-input A in each element of the partition

A_{i} \in A_{X}

:

Inc (A, A_{1}) = Inc (A, A_{2}) = 0, Inc (A, A_{3}) = Inc (A, A_{4}) = \frac{id}{2} .

\begin{matrix} A (x) = \{\begin{matrix} 0 & if x \in [1, 6] \\ x - 6 & if x \in (6, 7] \\ 8 - x & if x \in (7, 8] \\ 0 & if x \in (8, 9] . \end{matrix} \end{matrix}

Let us now compute the mapping

\bar{\frac{id}{2}}

such that

(\frac{id}{2}, \bar{\frac{id}{2}})

forms an adjoint pair by using Equation (5):

\bar{\frac{id}{2}} (α) = \{\begin{matrix} 2 x & if α \in [0, 0.5) \\ 1 & if α \in [0.5, 1] . \end{matrix}

Only two outputs

B_{i j}

different from Y are obtained, and they are given by those GMP obtained by applying rules

〈 A_{3} \to B_{2}; f_{0.75} 〉

and

〈 A_{4} \to B_{1}; f_{0.5} 〉

:

\begin{matrix} B_{32} = \bar{\frac{id}{2}} \circ {\bar{f}}_{0.75} (B_{2} (y)) = \{\begin{matrix} 2 \cdot B_{2} (y) & if y \in [70, 85) \\ 1 & if y \in [85, 115] \\ 2 \cdot B_{2} (y) & if y \in (115, 130], \end{matrix} \\ B_{41} = \bar{\frac{id}{2}} \circ {\bar{f}}_{0.5} (B_{1} (y)) = \{\begin{matrix} 1 & if y \in [70, 85] \\ 2 \cdot B_{1} (y) & if y \in (85, 130] . \end{matrix} \end{matrix}

The final inference is obtained through the intersection of each output of the 12 GMP, which results in

B_{32} \cap B_{41}

. This

K B

-output is represented in Figure 11, and it is given by the following expression:

B (y) = ⋂_{\begin{matrix} i \in {1, 2, 3, 4} \\ j \in {1, 2, 3} \end{matrix}} B_{i j} (y) = \{\begin{matrix} \frac{x - 70}{15} & if y \in [70, 85) \\ \frac{100 - x}{15} & if y \in [85, 100) \\ 0 & if y \in [100, 130] . \end{matrix}

Finally, let us study the greatest model of

Γ \cup {A}

, as in the previous example. In Figure 12, we have represented the

K B

-input A, the

K B

-output B, and the core of the greatest model of

Γ \cup {A}

. In this case, we can also check that this set can be rewritten in terms of the cores of both A and B, namely,

c o r e (M_{Γ, A}) = c o r e (A) \times c o r e (B) = {(7, 85)}

. Moreover, there is an additional property satisfied by the greatest model

M_{Γ, A}

, which is related to the support of

M_{Γ, A}

(represented in Figure 12 by the shaded region within

M_{Γ}

), the support of A, and the support of B:

s u p p (A)

is contained in the X-component of

s u p p (M)

, and

s u p p (B)

is contained in its Y-component.

Nevertheless, this last property is not always satisfied by the

K B

-output of a

K B

Γ and a

K B

-input; e.g., the

K B

-output obtained in Example 5 has a greater support than the Y-component of

s u p p (M_{Γ, A})

(see Figure 9).

4.4. On Fuzzification and Defuzzification Procedures

Last but not least, we discuss the procedures of fuzzification and defuzzification. The reason is mainly because firstly, they depend strongly on the particular context of application of the inference system, and secondly, the scope of this paper is to show the correctness of the inference system. Nevertheless, here, we present some discussions and some issues to be considered in further approaches.

Obviously, the definition of rules in the knowledge database requires a pair of fuzzy partitions

A_{X}

and

B_{Y}

on the universes X and Y, respectively. That is a fuzzification procedure, but there is another that transforms the input of the system into a

K B

-input. In most approaches, the same fuzzy partitions used in the construction of the knowledge database are used to fuzzify the input of the system. However, in our approach, the

K B

-input may be any fuzzy set defined independently from the fuzzy partition

A_{X}

considered in X for the construction of the

K B

. That opens an endless number of possibilities for transforming the input of the fuzzy inference system into the

K B

-input. For instance, consider that for the sake of accuracy, we use two fuzzy partitions

A_{X}

and

B_{Y}

on the universes X and Y with hundreds of fuzzy sets each. On the other hand, in order to keep an interpretable input and output of the fuzzy inference system, we use linguistic labels in the form of fuzzy sets on X. These linguistic labels may have more or less elements, may be defined differently for each user, and may even have no relation with the partition

A_{X}

used in X for the

K B

. In all cases, given a linguistic label as a fuzzy set on X, we can reason with the

K B

.

Another possibility for a fuzzification is to iterate fuzzy inference systems, and then, the

K B

-output of one fuzzy inference system becomes the

K B

-input of the next one, so the fuzzification is then the result of a fuzzy inference system. Note that we can even not consider any fuzzification. That is, given a value or an interval of values in X (i.e., a crisp input), we can consider it as a set (or a singleton as a set as in Example 5) and then reason with it. As mentioned above, the fuzzy partitions considered in the construction of knowledge databases do not limit the fuzzification procedure for the

K B

-input; it is open to any possible fuzzification.

On the other hand, the defuzzification also depends on the scope of the application rather than the fuzzification. Therefore, it is hard to talk about it in this approach. Nevertheless, it is convenient to bear in mind the following characteristic of our fuzzy inference systems. The consideration of crisp models for the semantics of

K B

allows us to focus on models as a defuzzification procedure. In this respect, we have two options, to either take into consideration all the possible values that are coherent with the modeling of a

K B

and the

K B

-input (which results in choosing the greatest model as the (crisp) output of our system) or to focus on selecting a specific kind of model. This latter case is certainly more interesting, and the characteristics of these models depend on the scope of the application; for example, a regression model focuses on a functional model, whereas a classification model may focus on measure models with a certain

y \in Y

in the second component.

It is worth finishing this section by focusing on the simplest case of fuzzification and defuzzification procedures: a singleton crisp input and the search for a set of possible values for the variable Y in the output. This is exactly the case illustrated in Example 5, were the crisp value

x = 2.5

is directly considered as a crisp-singleton fuzzy set as

K B

-input. The following result shows that in that case, the core of the

K B

-output determines the greatest model. Note that given

x_{0} \in X

, the singleton

{x_{0}}

can be interpreted as the fuzzy set A such that

A (x_{0}) = 1

and

A (x) = 0

if

x \neq x_{0}

.

Corollary 5.

Let Γ be a

K B

on a frame

(X, Y, A_{X}, B_{Y})

, let

x_{0} \in X

, let M be the greatest model of

Γ \cap {x_{0}}

and let B be the fuzzy set obtained in Equation (4) as the

K B

-output of

Γ \cap {x_{0}}

. Then, M is a crisp set (i.e.,

M (x, y) \in {0, 1}

) and

M (x_{0}, y) = 1 if and only if B (y) = 1

Proof.

Proving that M is crisp comes directly from Definitions 18 and 19, since all

α

-models of

{x_{0}}

coincide with the one-model of

{x_{0}}

.

Let us prove the “if and only if” part. By definition of a consequence and the correctness of Corollary 4,

M (x, y) = 1

implies

B (y) = 1

. In other words, to prove the other implication, let us assume that

B (y) = 1

. By definition of the inference engine in terms of intersections, we have that necessarily, for all rules

〈 A_{i} \to B_{j}; f_{i j} 〉 \in Γ

,

B_{i j} (y) = {\bar{Inc}}_{G} ({x_{0}}, A_{i}) \circ {\bar{f}}_{i j} (B_{j} (y)) = 1

Moreover, since

{x_{0}}

is a singleton, it can be proved that

{Inc}_{G} ({x_{0}}, A_{i}) (x) = \{\begin{matrix} A_{i} (x_{0}) & if x \leq A_{i} (x_{0}) \\ x & otherwise . \end{matrix}

Then, we have the following chain of inequalities:

\begin{matrix} 1 \leq B_{i j} (y) = {\bar{Inc}}_{G} ({x_{0}}, A_{i}) \circ {\bar{f}}_{i j} (B_{j} (y)) \\ \Leftrightarrow & {Inc}_{G} ({x_{0}}, A_{i}) (1) \leq {\bar{f}}_{i j} (B_{j} (y)) \\ \Leftrightarrow & A_{i} (x_{0}) \leq {\bar{f}}_{i j} (B_{j} (y)) \\ \Leftrightarrow & f (A_{i} (x_{0})) \leq B_{j} (y) . \end{matrix}

In other words,

(x_{0}, y)

satisfies all the rules in

Γ

, and

{(x_{0}, y)}

is a model of

Γ

. Moreover, since the

K B

-input is crisp,

{(x_{0}, y)}

is a one-model of

Γ \cap {x_{0}}

. By maximality of M, necessarily, we have that

M (x_{0}, y) = 1

, which is what we wanted to prove. □

In other words, in the case of working in the simplest setting of fuzzification (a crisp entry) and defuzzification (interval of plausible values in the greatest model), the values in the core of the

K B

-output directly determine the greatest model of the

K B

and the

K B

-input. That fact is illustrated in the following example.

Example 7.

Let us reconsider Example 5, where the

K B

-input was the crisp singleton set

{2.5}

. The result of the fuzzy engine is

B (y) = ⋂_{\begin{matrix} i \in {1, 2, 3, 4} \\ j \in {1, 2, 3} \end{matrix}} B_{i j} (y) = \{\begin{matrix} 0 & if y \in [70, 100] \\ \frac{x - 100}{30} & if y \in (100, 115) \\ 1 & if y \in [115, 122.5] \\ \frac{130 - x}{30} & if y \in (122.5, 130] . \end{matrix}

whose core is the interval

[115, 122.5]

. From Corollary 5, we can conclude that the greatest model of

Γ \cup {2.5}

is

M = {(x, y) \in R^{2} ∣ x = 2.5 and y \in [115, 122.5]}

The reader can check in Figure 9 that effectively, all pairs of values

(x, y) \in X \times Y

in the greatest model of Γ with

x = 2.5

are exactly the interval

[115, 122.5]

.

5. Comparison with Relational Fuzzy Inference Systems

Now that the FIS based on the notion of f-inclusion has been introduced, we can better describe the similarities and differences with the existing FISs in the literature. Let us recall that, as mentioned in the Introduction, there exist two families of FISs, namely, the so-called relational fuzzy inference systems and those based on aggregating crisp values (or entities). Our approach is mainly related to the former ones, because they consider relations to link universes (the one of the antecedent and the one of the consequent), because the result of the inference is a fuzzy set, which needs a defuzzification procedure, and because, as we show below, the f-index of inclusion can be related to a pair formed by a fuzzy conjunction and a fuzzy implication.

In the family of relational FISs, we have two main models, the Mamdani and the implicative models [3], which have the following general form: given two sets X and Y, a set of rules

If x is A_{i} THEN y is B_{i} i = 1, \dots n

with

A_{i} \in F (X)

and

B_{i} \in F (Y)

, an adjoint pair

(*, \to)

(although in many papers, the choice of the fuzzy conjunction and implication looks general, the truth is that without the adjointness property, results may be unexpected [3,31]), and a fuzzy set

A^{'} \in F (X)

as input, the inference

B^{'} \in F (Y)

is performed in the implicative model by

B^{'} (y) = (A^{'} \circ R) (y) = \underset{x \in X}{⋁} A^{'} (x) * R (x, y)

where the relation R is defined as

R (x, y) = ⋀_{i}^{n} A_{i} (x) \to B_{i} (y)

and in the Mamdani model by

B^{'} (y) = (A^{'} ⊳ R) (y) = \underset{x \in X}{⋀} A^{'} (x) \to R (x, y)

where the relation R is defined as

R (x, y) = ⋁_{i}^{n} A_{i} (x) * B_{i} (y) .

These two formulae turn into the two following general forms:

\underset{x \in X}{⋁} (A^{'} (x) * ⋀_{i}^{n} A_{i} (x) \to B_{i} (y)) and \underset{x \in X}{⋀} (A^{'} (x) \to ⋁_{i}^{n} A_{i} (x) * B_{i} (y)) .

(7)

Here, we can see a clear difference. Whereas in standard relational FISs, the relations are the core of the inference engine, in the proposed FIS based on f-inclusion, the relations belong to the semantics, providing to the approach a formal logic support. Here, it is convenient to recall the following result from [15]: given

f \in G

, there exists an adjoint pair

(*, \to)

with * a commutative fuzzy conjunction and

α \in [0, 1]

such that

f (x) = x * α .

Moreover, if f is the f-index of inclusion of two fuzzy sets A and B on the universe

U

, then

α = \underset{u \in U}{⋀} A (u) \to B (u) .

Let us recall that in the FIS based on f-inclusion, rules are weighted by f-inclusions. If those weights are interpreted as the f-index of inclusion between antecedent and consequent, then we have that for each rule

〈 A_{i} \to B_{j} : f_{i j} 〉

there exists an adjoint pair

(*_{i j}, \to_{i j})

mapping

f_{i j}

with the form

f_{i j} (z) = z *_{i j} (\underset{(x, y) \in X \times Y}{⋀} A_{i} (x) \to_{i j} B_{j} (y))

and its respective right adjoint with the form

{\bar{f}}_{i j} (z) = (\underset{(x, y) \in X \times Y}{⋀} A_{i} (x) \to_{i j} B_{j} (y)) \to_{i j} z .

Consequently, and keeping the previous notation, the full inference for an input

A^{'} \in F (X)

given in Equation (4) is given by the following formula:

B^{'} (z) = inf_{\begin{matrix} A_{i} \in A_{X} \\ B_{j} \in B_{Y} \end{matrix}} [(inf_{x \in X} A^{'} (x) \to_{i} A (x)) \to_{i} ((inf_{(x, y) \in X \times Y} A_{i} (x) \to_{i j} B_{j} (z)) \to_{i j} B_{j} (y))]

(8)

Here, we can clearly see the difference between the inference performance of our approach (Equation (8)) with the standard relational FIS (Equation (7)).

At this point, and due to the intricate pattern of Equation (8), it is worth focusing on the simplicity of our approach. The formulation of our inference engine, shown in Equation (4), is

B^{'} (y) = inf_{\begin{matrix} A_{i} \in A_{X}, \\ B_{j} \in B_{Y} \end{matrix}} {\bar{Inc}}_{G} (A^{'}, A_{i}) \circ {\bar{f}}_{i j} (B_{j} (y)),

Therefore, our inference is just the intersection of the inference performed by each rule in the

K B

. In turn, the inference performed by each rule

A_{i} \to B_{j}

is just the composition of two mappings:

${\bar{f}}_{i j}$ that represents the inclusion of the antecedent in the consequent;
${\bar{Inc}}_{G} (A^{'}, A_{i})$ that represents the inclusion of the input in the antecedent of the respective rule.

Note that we do not need to consider any adjoint pair of fuzzy conjunctions and implications to perform the inference, we only take mappings in

G

, i.e., Equation (8) is purely theoretical.

6. Conclusions and Future Works

In this article, we showed how to define a fuzzy inference system (FIS) in terms of the notion of f-inclusion. The main difference between this FIS and the ones from the literature is that it is based on a formal logic background. Indeed, we related the models of our knowledge databases with models of a certain Fuzzy Description Logic (FDL)

{LU}_{F H}

, which consists of the standard FDL

LU

enriched with modifiers (or fuzzy hedges). This connection allowed us to apply inferences from

{LU}_{F H}

in our FIS based on f-inclusion, allowing us to use a Generalized Modus Ponens (GMP) in our inference engine. Another advantage of our FIS is that it allowed us to consider any fuzzy set as input of the inference engine. That allowed us to consider fuzzy partitions for the construction of the knowledge database independently on the fuzzification used to transform the input of the FIS into the inference engine. For example, we can consider any set of linguistic labels in our FIS independently from the fuzzy partitions used to define the knowledge database.

From a theoretical point of view, we highlight the following achievements. First, we defined the notion of the consequence of a FIS, which allowed us to prove the correctness of the proposed inference engine. Second, we proved convenient properties of the consequences of a knowledge database and an input concerning monotony and a lattice algebraic structure. Finally, we reduced the search of consequences to the computation of the greatest model.

This approach opens an interesting number of future research lines. First, in this article, we focused on the theoretical correctness of the FIS. Therefore, it is necessary to define more application-oriented procedures aimed at defining knowledge databases from data. Because of the properties of the f-index of inclusion and the definition of the

K B

, we believe that we can define easy machine learning methods for rules in our FIS based on f-inclusion. Second, additional theoretical properties of the set of consequences in our FIS are necessary to provide stronger support for our approach. Along this line, to obtain a completeness inference engine would be ideal. Finally, applications of this FIS to real applied domains are welcome and of interest to the authors. Since the semantics of the FIS is inspired by Fuzzy Description Logic, we expect to have applications of this FIS based on f-inclusion in the development of expert systems, although we do not discard other applications, such as in classification or control systems.

Author Contributions

Conceptualization, N.M.; Methodology, N.M. and E.R.-P.; Formal analysis, C.D.-M., N.M. and E.R.-P.; Writting, C.D.-M., N.M. and E.R.-P.; Investigation, C.D.-M., N.M. and E.R.-P.; Funding acquisition; N.M. and E.R.-P. All authors have read and agreed to the published version of the manuscript.

Funding

Partially supported by the projects PID2022-140630NB-I00 and PID2022-137620NB-I00 funded by MICIU/AEI/10.13039/501100011033 and FEDER, UE, by the grant TED2021-129748B-I00 funded by MCIN/AEI/10.13039/501100011033 and European Union NextGenerationEU/PRTR. Partially suported by Plan Propio–UCA 2025–2027.

Data Availability Statement

Data is contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Mamdani, E.; Assilian, S. An experiment in linguistic synthesis with a fuzzy logic controller. Int. J. Man-Mach. Stud. 1975, 7, 1–13. [Google Scholar] [CrossRef]
Eghbal Ahmadi, M.H.; Royaee, S.J.; Tayyebi, S.; Bozorgmehry Boozarjomehry, R. A new insight into implementing Mamdani fuzzy inference system for dynamic process modeling: Application on flash separator fuzzy dynamic modeling. Eng. Appl. Artif. Intell. 2020, 90, 103485. [Google Scholar] [CrossRef]
Štěpnička, M.; Jayaram, B.; Su, Y. A short note on fuzzy relational inference systems. Fuzzy Sets Syst. 2018, 338, 90–96. [Google Scholar] [CrossRef]
Daňková, M. On approximate reasoning with graded rules. Fuzzy Sets Syst. 2007, 158, 652–673. [Google Scholar] [CrossRef]
Sugeno, M. Industrial Applications of Fuzzy Control; Elsevier Science Ltd.: Amsterdam, The Netherlands, 1985. [Google Scholar]
Gupta, P.K.; Muhuri, P.K. Extended Tsukamoto’s inference method for solving multi-objective linguistic optimization problems. Fuzzy Sets Syst. 2019, 377, 102–124. [Google Scholar] [CrossRef]
Perfilieva, I.; Novák, V.; Dvořák, A. Fuzzy transform in the analysis of data. Int. J. Approx. Reason. 2008, 48, 36–46. [Google Scholar] [CrossRef]
Madrid, N. Significance measures for rules in probabilistic-fuzzy inference systems based on fuzzy transforms. Fuzzy Sets Syst. 2023, 467, 108575. [Google Scholar] [CrossRef]
Benzaouia, A. Switching Takagi-Sugeno Systems. In Saturated Switching Systems; Springer London: London, UK, 2012; pp. 247–274. [Google Scholar] [CrossRef]
Koohathongsumrit, N.; Meethom, W. An integrated approach of fuzzy risk assessment model and data envelopment analysis for route selection in multimodal transportation networks. Expert Syst. Appl. 2021, 171, 114342. [Google Scholar] [CrossRef]
Veeramani, C.; Venugopal, R.; Muruganandan, S. An Exploration of the Fuzzy Inference System for the Daily Trading Decision and Its Performance Analysis Based on Fuzzy MCDM Methods. Comput. Econ. 2023, 62, 1313–1340. [Google Scholar] [CrossRef]
Cao, J.; Zhou, T.; Zhi, S.; Lam, S.; Ren, G.; Zhang, Y.; Wang, Y.; Dong, Y.; Cai, J. Fuzzy inference system with interpretable fuzzy rules: Advancing explainable artificial intelligence for disease diagnosis—A comprehensive review. Inf. Sci. 2024, 662, 120212. [Google Scholar] [CrossRef]
Madrid, N.; Ojeda-Aciego, M.; Perfiljeva, I. ƒ-inclusion indexes between fuzzy sets. In Proceedings of the Conference of the International Fuzzy Systems Association and the European Society for Fuzzy Logic and Technology, Riga, Latvia, 21–25 July 2025; pp. 1528–1533. [Google Scholar] [CrossRef]
Díaz-Montarroso, C.; Madrid, N.; Ramírez-Poussa, E. Towards a Generalized Modus Ponens Based on the f-Index of Inclusion. In Proceedings of the Conceptual Knowledge Structures: First International Joint Conference, CONCEPTS 2024, Cádiz, Spain, 9–13 September 2024; pp. 36–48. [Google Scholar] [CrossRef]
Madrid, N.; Ojeda-Aciego, M. The f-index of inclusion as optimal adjoint pair for fuzzy modus ponens. Fuzzy Sets Syst. 2023, 466, 108474. [Google Scholar] [CrossRef]
Madrid, N.; Ojeda-Aciego, M. Functional degrees of inclusion and similarity between L-fuzzy sets. Fuzzy Sets Syst. 2020, 390, 1–22. [Google Scholar] [CrossRef]
Straccia, U. A fuzzy description logic. In Proceedings of the AAAI/IAAI, 1998, Madison, WI, USA, 26–30 July 1998; pp. 594–599. [Google Scholar]
Godo, L.; Hájek, P. Fuzzy Inference as Deduction. J. Appl. Non-Class. Logics 1999, 9, 37–60. [Google Scholar] [CrossRef]
Hájek, P.; Paris, J.; Shepherdson, J. Rational Pavelka Predicate Logic is a Conservative Extension of Łukasiewicz Predicate Logic. J. Symb. Log. 2000, 65, 669–682. [Google Scholar] [CrossRef]
Flaminio, T.; Marchioni, E. T-norm-based logics with an independent involutive negation. Fuzzy Sets Syst. 2006, 157, 3125–3144. [Google Scholar] [CrossRef]
Hölldobler, S.; Khang, T.D.; Störr, H.P. A Fuzzy Description Logic with Hedges as Concept Modifiers. In Proceedings of the InTech/VJFuzzy’2002; Phuong, N.H., Nguyen, H.T., Ho, N.C., Santiprabhob, P., Eds.; Science and Technics Publishing House: Hanoi, Vietnam, 2002; pp. 25–34. [Google Scholar]
Madrid, N.; Ramírez-Poussa, E. Analysis of the f-index of inclusion restricted to a set of indexes. In Proceedings of the 20th International Conference on Information Processing and Management of Uncertainty in Knowledge-Based Systems (IPMU2024), Lisbon, Portugal, 22–26 July 2024. [Google Scholar]
Madrid, N.; Ojeda-Aciego, M. Composition as a fuzzy conjunction between indexes of inclusion. Fuzzy Sets Syst. submitted.
Sinha, D.; Dougherty, E. Fuzzification of Set Inclusion: Theory and Applications. Fuzzy Sets Syst. 1993, 55, 15–42. [Google Scholar] [CrossRef]
Kitainik, L.M. Fuzzy Inclusions and Fuzzy Dichotomous Decision Procedures. In Optimization Models Using Fuzzy Sets and Possibility Theory; Kacprzyk, J., Orlovski, S.A., Eds.; Springer: Dordrecht, The Netherlands, 1987; pp. 154–170. [Google Scholar] [CrossRef]
Birkhoff, G. Lattice Theory; Colloquium Publications, American Mathematical Society: Providence, RI, USA, 1940; Volume 25. [Google Scholar]
Esteva, F.; Godo, L.; Noguera, C. Fuzzy logics with truth hedges revisited. In Proceedings of the 7th Conference of the European Society for Fuzzy Logic and Technology (EUSFLAT-11), Aix-Les-Bains, France, 18–22 July 2011; pp. 146–152. [Google Scholar] [CrossRef]
Baldwin, J. A new approach to approximate reasoning using a fuzzy logic. Fuzzy Sets Syst. 1979, 2, 309–325. [Google Scholar] [CrossRef]
Dubois, D.; Prade, H. What are fuzzy rules and how to use them. Fuzzy Sets Syst. 1996, 84, 169–185. [Google Scholar] [CrossRef]
Ganter, B.; Wille, R. Formal Concept Analysis: Mathematical Foundations; Springer Science & Business Media: Berlin/Heidelberg, Germany, 1999. [Google Scholar]
Novák, V.; Perfilieva, I.; Močkoř, J. Mathematical Principles of Fuzzy Logic; Springer: Boston, MA, USA, 1999. [Google Scholar] [CrossRef]

Figure 1. Membership functions of every fuzzy set in

A_{X}

and

B_{Y}

in Example 3.

Figure 1. Membership functions of every fuzzy set in

A_{X}

and

B_{Y}

in Example 3.

Figure 2. Graphical representation of rule

〈 A_{2} \to B_{2}; f_{0.5} 〉

(bold line) in Example 3. Dashed lines are auxiliar lines and grey lines represent the partitions

A_{X}

and

B_{Y}

. The shaded region represents the points which satisfy the rule.

Figure 2. Graphical representation of rule

〈 A_{2} \to B_{2}; f_{0.5} 〉

(bold line) in Example 3. Dashed lines are auxiliar lines and grey lines represent the partitions

A_{X}

and

B_{Y}

. The shaded region represents the points which satisfy the rule.

Figure 3. Graphical representation of rule

〈 A_{2} \to B_{3}; g_{0.25} 〉

(bold line) in Example 3. Dashed lines are auxiliar lines and grey lines represent the partitions

A_{X}

and

B_{Y}

. The shaded region represents the points which satisfy the rule.

Figure 3. Graphical representation of rule

〈 A_{2} \to B_{3}; g_{0.25} 〉

(bold line) in Example 3. Dashed lines are auxiliar lines and grey lines represent the partitions

A_{X}

and

B_{Y}

. The shaded region represents the points which satisfy the rule.

Figure 4. Graphical representation of rules

〈 A_{2} \to B_{j}; f_{2 j} 〉

for

j \in {1, 2, 3}

(bold line) in Example 3. Dashed lines are auxiliar lines and grey lines represent the partitions

A_{X}

and

B_{Y}

. The shaded region represents the points which satisfy the rules.

Figure 4. Graphical representation of rules

〈 A_{2} \to B_{j}; f_{2 j} 〉

for

j \in {1, 2, 3}

(bold line) in Example 3. Dashed lines are auxiliar lines and grey lines represent the partitions

A_{X}

and

B_{Y}

. The shaded region represents the points which satisfy the rules.

Figure 5. Graphical representation of the greatest model

M_{Γ}

in Example 3 (the shaded zone). Dashed lines represent the restrictions imposed by each rule in

Γ

and solid lines are part of

M_{Γ}

.

Figure 5. Graphical representation of the greatest model

M_{Γ}

in Example 3 (the shaded zone). Dashed lines represent the restrictions imposed by each rule in

Γ

and solid lines are part of

M_{Γ}

.

Figure 6. Left, graphical representation of the discrite model

M_{1}

(with black dots) of

Γ

in Example 3. Right, graphical representations of the functional model

M_{2}

(with a black curve) of

Γ

in Example 3. Dashed lines represent, in both graphs, the restrictions imposed by each rule in

K B

and solid lines are part of

M_{Γ}

.

Figure 6. Left, graphical representation of the discrite model

M_{1}

(with black dots) of

Γ

in Example 3. Right, graphical representations of the functional model

M_{2}

(with a black curve) of

Γ

in Example 3. Dashed lines represent, in both graphs, the restrictions imposed by each rule in

K B

and solid lines are part of

M_{Γ}

.

Figure 7. Membership function of

K B

-input A (bold) in Example 5 together with partition

A_{X}

(grey). The dashed line represents the discontinuity in the membership function of A.

Figure 7. Membership function of

K B

-input A (bold) in Example 5 together with partition

A_{X}

(grey). The dashed line represents the discontinuity in the membership function of A.

Figure 8.

K B

-output (bold line) obtained by applying the inference engine from

Γ

and A in Example 5. The dashed line represents the discontinuity in the membership function of the

K B

-output.

Figure 8.

K B

-output (bold line) obtained by applying the inference engine from

Γ

and A in Example 5. The dashed line represents the discontinuity in the membership function of the

K B

-output.

Figure 9.

c o r e (M_{Γ, A})

represented together with

K B

-input A and

K B

-output B from Example 5 (bold lines). The shaded region represents the greatest model of

Γ

.

Figure 9.

c o r e (M_{Γ, A})

represented together with

K B

-input A and

K B

-output B from Example 5 (bold lines). The shaded region represents the greatest model of

Γ

.

Figure 10. Membership function of

K B

-input A (bold) in Example 6 together with partition

A_{X}

(grey).

Figure 10. Membership function of

K B

-input A (bold) in Example 6 together with partition

A_{X}

(grey).

Figure 11.

K B

-output (bold) obtained by applying the inference engine from

Γ

and A in Example 6 together with partition

B_{Y}

(grey).

Figure 11.

K B

-output (bold) obtained by applying the inference engine from

Γ

and A in Example 6 together with partition

B_{Y}

(grey).

Figure 12.

c o r e (M_{Γ, A})

and

s u p p (M_{Γ, A})

(dark gray) together with

K B

-input A and

K B

-output B (bold lines) from Example 6. The shaded region represents the greatest model of

Γ

and auxiliary lines are represented in dashed.

Figure 12.

c o r e (M_{Γ, A})

and

s u p p (M_{Γ, A})

(dark gray) together with

K B

-input A and

K B

-output B (bold lines) from Example 6. The shaded region represents the greatest model of

Γ

and auxiliary lines are represented in dashed.

Table 1. Summary of connectors and modifiers included in

{LU}_{F H}

.

Table 1. Summary of connectors and modifiers included in

{LU}_{F H}

.

Symbol	Label
⊤	Top concept
⊥	Bottom concept
$C ⊓ D$	Concept conjunction
$C ⊔ D$	Concept disjunction
$〈 C (a), α 〉$	Fuzzy assertion
$C ⊑ D$	Concept specialization
m	Truth-depressing modifier
$\bar{m}$	Truth-stressing modifier

Table 2. Examples of truth-depressing modifiers and their associated truth-depressing modifiers.

Truth-Depressing Modifiers		Truth-Stressing Modifiers
Label	Analytical Expression	Label	Analytical Expression
True	$m (α) = α$	True	$\bar{m} (α) = α$
Very true	$m (α) = α^{2}$	Fairly true	$\bar{m} (α) = \sqrt{α}$
Extremely true	$m (α) = α^{5}$	Slightly true	$\bar{m} (α) = \sqrt[5]{α}$
At most n-true	$m (α) = \{\begin{matrix} α & if α \leq n \\ n & if α > n \end{matrix}$	At most n-certain	$\bar{m} (α) = \{\begin{matrix} α & if α < n \\ 1 & if α \geq n \end{matrix}$
n-strictly true	$m (α) = \{\begin{matrix} 0 & if α \leq n \\ n & if α > n \end{matrix}$	n-uncertain	$\bar{m} (α) = \{\begin{matrix} n & if α < n \\ 1 & if α \geq n \end{matrix}$
At least n-true	$m (α) = \{\begin{matrix} 0 & if α \leq n \\ α & if α > n \end{matrix}$	At least n-uncertain	$\bar{m} (α) = \{\begin{matrix} n & if α < n \\ α & if α \geq n \end{matrix}$

Table 3.

K B

Γ

associated with

(X, Y, A_{X}, B_{Y})

in Example 3.

Table 3.

K B

Γ

associated with

(X, Y, A_{X}, B_{Y})

in Example 3.

	$B_{1}$	$B_{2}$	$B_{3}$
$A_{1}$	$0$	$0$	$f_{0.5}$
$A_{2}$	$0$	$f_{0.5}$	$g_{0.25}$
$A_{3}$	$0$	$f_{0.75}$	$0$
$A_{4}$	$f_{0.5}$	$0$	$0$

Table 4. Membership functions of fuzzy sets in partitions

A_{X}

and

B_{Y}

in Example 4.

Table 4. Membership functions of fuzzy sets in partitions

A_{X}

and

B_{Y}

in Example 4.

	$x_{1}$	$x_{2}$	$x_{3}$			$y_{1}$	$y_{2}$	$y_{3}$
$A_{1}$	1	1	0		$B_{1}$	1	$0.5$	0
$A_{2}$	0	0	1		$B_{2}$	0	$0.5$	1

Table 5. Adjoint pairs associated with each one of the mappings on Table 3 in Example 3.

	$B_{1}$	$B_{2}$	$B_{3}$
$A_{1}$	$1$	$1$	${\bar{f}}_{0.5}$
$A_{2}$	$1$	${\bar{f}}_{0.5}$	${\bar{g}}_{0.25}$
$A_{3}$	$1$	${\bar{f}}_{0.75}$	$1$
$A_{4}$	${\bar{f}}_{0.5}$	$1$	$1$

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Díaz-Montarroso, C.; Madrid, N.; Ramírez-Poussa, E. Correctness of Fuzzy Inference Systems Based on f-Inclusion. Mathematics 2025, 13, 1897. https://doi.org/10.3390/math13111897

AMA Style

Díaz-Montarroso C, Madrid N, Ramírez-Poussa E. Correctness of Fuzzy Inference Systems Based on f-Inclusion. Mathematics. 2025; 13(11):1897. https://doi.org/10.3390/math13111897

Chicago/Turabian Style

Díaz-Montarroso, Carolina, Nicolás Madrid, and Eloísa Ramírez-Poussa. 2025. "Correctness of Fuzzy Inference Systems Based on f-Inclusion" Mathematics 13, no. 11: 1897. https://doi.org/10.3390/math13111897

APA Style

Díaz-Montarroso, C., Madrid, N., & Ramírez-Poussa, E. (2025). Correctness of Fuzzy Inference Systems Based on f-Inclusion. Mathematics, 13(11), 1897. https://doi.org/10.3390/math13111897

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Correctness of Fuzzy Inference Systems Based on f-Inclusion

Abstract

1. Introduction

2. Preliminaries

3. The Description Logic with Concept Modifiers ${LU}_{F H}$

3.1. Syntax of ${LU}_{F H}$

3.2. Semantics of ${LU}_{F H}$

3.3. Generalized Modus Ponens in ${LU}_{F H}$

4. A Fuzzy Inference System Based on the $f$ -Index of Inclusion

4.1. The Knowledge Database

4.2. Adding Inputs to $K B$ s: Defining Consequences

4.3. Inference Engine: Links with ${LU}_{F H}$

4.4. On Fuzzification and Defuzzification Procedures

5. Comparison with Relational Fuzzy Inference Systems

6. Conclusions and Future Works

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Correctness of Fuzzy Inference Systems Based on f-Inclusion

Abstract

1. Introduction

2. Preliminaries

3. The Description Logic with Concept Modifiers LU F H

3.1. Syntax of LU F H

3.2. Semantics of LU F H

3.3. Generalized Modus Ponens in LU F H

4. A Fuzzy Inference System Based on the f -Index of Inclusion

4.1. The Knowledge Database

4.2. Adding Inputs to K B s: Defining Consequences

4.3. Inference Engine: Links with LU F H

4.4. On Fuzzification and Defuzzification Procedures

5. Comparison with Relational Fuzzy Inference Systems

6. Conclusions and Future Works

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

3. The Description Logic with Concept Modifiers ${LU}_{F H}$

3.1. Syntax of ${LU}_{F H}$

3.2. Semantics of ${LU}_{F H}$

3.3. Generalized Modus Ponens in ${LU}_{F H}$

4. A Fuzzy Inference System Based on the $f$ -Index of Inclusion

4.2. Adding Inputs to $K B$ s: Defining Consequences

4.3. Inference Engine: Links with ${LU}_{F H}$