About Granular Rough Computing—Overview of Decision System Approximation Techniques and Future Perspectives

Artiemjew, Piotr

doi:10.3390/a13040079

Open AccessReview

About Granular Rough Computing—Overview of Decision System Approximation Techniques and Future Perspectives

by

Piotr Artiemjew

Faculty of Mathematics and Computer Science, University of Warmia and Mazury in Olsztyn, 10-710 Olsztyn, Poland

Algorithms 2020, 13(4), 79; https://doi.org/10.3390/a13040079

Submission received: 2 February 2020 / Revised: 26 March 2020 / Accepted: 27 March 2020 / Published: 29 March 2020

(This article belongs to the Special Issue Granular Computing: From Foundations to Applications)

Download

Browse Figures

Versions Notes

Abstract

:

Granular computing techniques are a huge discipline in which the basic component is to operate on groups of similar objects according to a fixed similarity measure. The first references to the granular computing can be seen in the works of Zadeh in fuzzy set theory. Granular computing allows for a very natural modelling of the world. It is very likely that the human brain, while solving problems, performs granular calculations on data collected from the senses. The researchers of this paradigm have proven the unlimited possibilities of granular computing. Among other things, they are used in the processes of classification, regression, missing values handling, for feature selection, and as mechanisms of data approximation. It is impossible to quote all methods based on granular computing—we can only discuss a selected group of techniques. In the article, we have presented a review of recently developed granulation techniques belonging to the family of approximation algorithms founded by Polkowski—in the framework of rough set theory. Starting from the basic Polkowski’s standard granulation, we have described further developed by us concept dependent, layered, and epsilon variants, and our recent homogeneous granulation. We are presenting simple numerical examples and samples of research results. The effectiveness of these methods in terms of decision system size reduction and maintenance of the internal knowledge from the original data are presented. The reduction in the number of objects in our techniques while maintaining classification efficiency reaches 90 percent—for standard granulation with usage of a kNN classifier (we achieve similar efficiency for the concept-dependent technique for the Naive Bayes classifier). The largest reduction achieved in the number of exhaustive set of rules at the efficiency level to the original data are 99 percent—it is for concept-dependent granulation. In homogeneous variants, the reduction is less than 60 percent, but the advantage of these techniques is that it is not necessary to look for optimal granulation parameters, which are selected dynamically. We also describe potential directions of development of granular computing techniques by prism of described methods.

Keywords:

rough sets; granular rough computing; granulation techniques; classification

1. Introduction

Granular computing is dedicated to work on data in the form of grouped, similar information vectors. The idea was introduced by Lotfi Zadeh [1,2]. Granulation is an integral part of the fuzzy set theory by the very definition of fuzzy set, where inverse values of fuzzy membership functions are the basic forms of granules. Shortly after Lotfi Zadeh proposed the idea of granular computing, the granules were introduced in terms of rough set theory [3] by T.Y. Lin, L. Polkowski, and A. Skowron. In this theory, granules are defined as classes of indiscernibility relations. Interesting research on more flexible granules based on blocks was conducted by (Grzymala–Busse) (see the LEM2 algorithm), and templates by (H.S. Nguyen), used in classification processes. The granules based on rough inclusions were introduced by (Polkowski and Skowron [4]), based on tolerance or similarity relations, and, more generally, binary relations by (T.Y. Lin [5], Y. Y. Yao [6,7,8]). In the context of rough mereology was proposed by (L. Polkowski and A. Skowron), in approximation spaces by (A. Skowron and J. Stepaniuk [9,10]), and finally in logic for approximate reasoning by (L. Polkowski, M. Semeniuk-Polkowska [11], and Qing Liu [12]). Of course, many other authors are conducting considerations on groups of similar objects, which is simply the most natural way of modeling problems; it is impossible to name them all. Let us quote a few very interesting works on various research topics from recent years on granular computing [13,14,15,16,17,18]. Additionally, interesting research on the field of granular computation with the use of neural network techniques can be found in the works [19,20,21].

We have developed our methods in terms of granular rough computing paradigm—the internal part of rough sets theory [3]. The computations are based on granules, the groups of objects collected together by fixed similarity measure or metrics. Theoretical background and the framework of discussed methods were proposed by Polkowski in [22,23,24]—the idea of data approximation using rough inclusions. The basic idea was to create the r-indiscernible groups of objects (objects indiscernible in fixed degree) around each training sample, cover the original training decision system using selected granules and create the granular reflection of training data using granules from the covering in the final step. This particular technique is called standard granulation and was proposed in [24]. The initial work was extended later in many variants and contexts—see [25,26], Polkowski [27,28], Polkowski and Artiemjew [29,30]. These methods, among others, have found application in classification processes [31], data approximations [30], missing values absorbtion [26,29], and, in the recent work, these were used as a key component of the new Ensemble model—see [32].

In the review, we are focusing on decision system size reduction and maintaining the internal knowledge at the same time. Despite the fact that the granulation of the decision systems in a pessimistic case has a square complexity, it is possible to apply classical techniques of transferring methods to big data for the purpose mentioned. In the article, we have described standard granulation [24], concept-dependent [25], layered [25] and homogeneous granulation [33]—designed for symbolic data, and exemplary variants developed for numerical one—with descriptors indiscernibility ratio–epsilon granulation [33,34].

The rest of the paper has the following content. In Section 2, there is a detailed description of granulation techniques with toy examples. In Section 3, we present the experimental part for a kNN classifier. In Section 4, we have additional results for the SVM and Naive Bayes classifier. In Section 5, we write about possible future developments of these techniques, and we conclude the paper in Section 6.

2. Granulation Techniques

Our methods are based on rough inclusions. Introduction to rough inclusions in the framework of rough mereology is available in Polkowski [22,35]; a detailed, extensive discussion can be found in Polkowski [23]. We refer the reader for a very precise theoretical introduction, but, in the paper, we include the details that allow for understanding its content.

In Polkowski’s granulation procedure, we can distinguish three basic steps.

2.0.1.: First Step—Granulation
We begin with computation of granules around each training object using a selected method.
2.0.2.: Second Step—The Process of Covering
The training decision system is covered by selected granules.
2.0.3.: Third Step—Building the Granular Reflections
The granular reflection of original training decision system is derived from the granules selected in step 2.
We start with detailed description of the basic method—see [24].

2.1. Standard Granulation

Let us consider the decision system

(U, A, d)

, where U is the universe of objects, A the set of conditional attributes,

d \notin A

is the decision attribute, and

r_{g r a n}

granulation radius from the set {

0, \frac{1}{| A |}, \frac{2}{| A |}, \dots, 1

}.

The standard rough inclusion

μ

, for

u, v \in U

and for selected

r_{g r a n}

is defined as

μ (v, u, r_{g r a n}) \Leftrightarrow \frac{| I N D (u, v) |}{| A |} \geq r_{g r a n}, where I N D (u, v) = {a \in A : a (u) = a (v)},

(1)

For each object

u \in U

, and selected

r_{g r a n}

, we compute the standard granule

g_{r_{g r a n}} (u)

as follows:

g_{r_{g r a n}} (u) is {v \in U : μ (v, u, r_{g r a n})} .

(2)

In the next step, we use a selected strategy to cover the training decision system U by computed granules—the random choice is the simplest among the most effective studied in [30]). All studied methods are available in [30] (pages 105–220).

In addition, in the last step, granular reflection of training decision set is computed with the use of the Majority Voting procedure. The ties are resolved randomly. In the next section, we show the toy example of the method. To present toy examples, we used the same system from Table 1.

Toy Example

For a given training decision system from Table 1, the granulation radius

r_{g r a n} \in {0, 0.25, 0.5, 0.75, 1}

, the steps of the standard granulation are as follows.

In case of

r_{g r a n} = 0

, each single granule is equal U because objects are treated as indiscernible even if they are completely different. In addition, we expected only one object as the granular reflection of the training data.

The second boundary case is

r_{g r a n} = 1

; each granule contains only their central object or duplicates because the objects are indiscernible.

Now, allow us to show how the standard granulation works for radius

r_{g r a n} = 0.5

.

Assuming that g_{r_{g r a n}} (u_{i}) = {u_{j} \in U_{t r n} : \frac{| I N D (u_{i}, u_{j}) |}{| A |} \geq r_{g r a n}}

I N D (u_{i}, u_{j}) = {a \in A; a (u_{i}) = a (u_{j})}, U_{t r n} is the universe of training objects,

and | X | is the cardinality of set

The sample standard granules with a 0.5 radius, derived from decision systems from Table 1 look as follows,

g_{0.5} (u_{1}) = {u_{1}, u_{2}, u_{3}, u_{4}, u_{8}, u_{9}, u_{13},}

,

g_{0.5} (u_{2}) = {u_{1}, u_{2}, u_{3}, u_{8}, u_{11}, u_{12}, u_{14},}

g_{0.5} (u_{3}) = {u_{1}, u_{2}, u_{3}, u_{4}, u_{8}, u_{12}, u_{13},}

,

g_{0.5} (u_{4}) = {u_{1}, u_{3}, u_{4}, u_{5}, u_{8}, u_{10}, u_{12}, u_{14},}

g_{0.5} (u_{5}) = {u_{4}, u_{5}, u_{6}, u_{7}, u_{9}, u_{10}, u_{13},}

,

g_{0.5} (u_{6}) = {u_{5}, u_{6}, u_{7}, u_{9}, u_{10}, u_{11}, u_{14},}

g_{0.5} (u_{7}) = {u_{5}, u_{6}, u_{7}, u_{9}, u_{11}, u_{12}, u_{13},}

,

g_{0.5} (u_{8}) = {u_{1}, u_{2}, u_{3}, u_{4}, u_{8}, u_{9}, u_{10}, u_{11}, u_{12}, u_{14},}

g_{0.5} (u_{9}) = {u_{1}, u_{5}, u_{6}, u_{7}, u_{8}, u_{9}, u_{10}, u_{11}, u_{13},}

,

g_{0.5} (u_{10}) = {u_{4}, u_{5}, u_{6}, u_{8}, u_{9}, u_{10}, u_{11}, u_{13}, u_{14},}

g_{0.5} (u_{11}) = {u_{2}, u_{6}, u_{7}, u_{8}, u_{9}, u_{10}, u_{11}, u_{12}, u_{14},}

,

g_{0.5} (u_{12}) = {u_{2}, u_{3}, u_{4}, u_{7}, u_{8}, u_{11}, u_{12}, u_{14},}

g_{0.5} (u_{13}) = {u_{1}, u_{3}, u_{5}, u_{7}, u_{9}, u_{10}, u_{13},}

,

g_{0.5} (u_{14}) = {u_{2}, u_{4}, u_{6}, u_{8}, u_{10}, u_{11}, u_{12}, u_{14},}

The process of granulation can be tuned with help from the triangular part of granular indiscernibility matrix

{[c_{i j}]}_{(i, j = 1) | U |}

, where

c_{i j} = \{\begin{matrix} 1, if \frac{| I N D (u_{i}, u_{j}) |}{| A |} \geq r_{g r a n}, i < j \\ 0, otherwise \end{matrix}

This matrix for

r_{g r a n} = 0.5

is in Table 2.

Reading the matrix line–wise, we read granules off.

In the next step, we have chosen the random granules to cover the universe of training objects from the Table 1. Our choice is the set.

The U is covered, when, in the set of chosen granules, each object of U appears at least once. The granular reflection of the set Table 1 for the radius 0.5 is in Table 3.

Random coverage of training systems is as follows,

C o v e r (U_{t r n}) = {g_{0.5} (u_{1}), g_{0.5} (u_{4}), g_{0.5} (u_{5}), g_{0.5} (u_{14}),}

The granular reflection is created by application of majority voting inside selected granules. Ties are resolved randomly.

2.2. Concept Dependent Granulation

A concept–dependent (cd) granule

g_{r_{g r a n}}^{c d} (u)

of the radius

r_{g r a n}

about u is defined as follows:

v \in g_{r_{g r a n}}^{c d} (u) if and only if μ (v, u, r_{g r a n}) and (d (u) = d (v))

(3)

Toy Example

For the decision system from Table 1, we have found concept-dependent granules. For the granulation radius

r_{g r a n} = 0.25

, the granular concept–dependent indiscernibility matrix (gcdm)—see Table 4—is

c_{i j}^{c d} = \{\begin{matrix} 1, i f \frac{| I N D (u_{i}, u_{j}) |}{| A |} \geq 0.25, d (u_{i}) = d (u_{j}), i < j \\ 0, o t h e r w i s e \end{matrix}

hence, the granules in this case are

Assuming that g_{r_{g r a n}} (u_{i}) = {u_{j} \in U_{t r n} : \frac{| I N D (u_{i}, u_{j}) |}{| A |} \geq r_{g r a n}, d (u_{i}) = d (u_{j})}

I N D (u_{i}, u_{j}) = {a \in A; a (u_{i}) = a (u_{j})}, U_{t r n} is the universe of training objects,

and | X | is the cardinality of set

The sample concept-dependent granules with a 0.25 radius, derived from decision systems from Table 1 look as follows,

g_{0.25}^{c d} (u_{1}) = {u_{1}, u_{2}, u_{8}, u_{14},}

,

g_{0.25}^{c d} (u_{2}) = {u_{1}, u_{2}, u_{6}, u_{8}, u_{14},}

g_{0.25}^{c d} (u_{3}) = {u_{3}, u_{4}, u_{5}, u_{7}, u_{9}, u_{10}, u_{12}, u_{13},}

,

g_{0.25}^{c d} (u_{4}) = {u_{3}, u_{4}, u_{5}, u_{9}, u_{10}, u_{11}, u_{12}, u_{13},}

g_{0.25}^{c d} (u_{5}) = {u_{3}, u_{4}, u_{5}, u_{7}, u_{9}, u_{10}, u_{11}, u_{13},}

,

g_{0.25}^{c d} (u_{6}) = {u_{2}, u_{6}, u_{14},}

g_{0.25}^{c d} (u_{7}) = {u_{3}, u_{5}, u_{7}, u_{9}, u_{10}, u_{11}, u_{12}, u_{13},}

,

g_{0.25}^{c d} (u_{8}) = {u_{1}, u_{2}, u_{8}, u_{14},}

g_{0.25}^{c d} (u_{9}) = {u_{3}, u_{4}, u_{5}, u_{7}, u_{9}, u_{10}, u_{11}, u_{13},}

,

g_{0.25}^{c d} (u_{10}) = {u_{3}, u_{4}, u_{5}, u_{7}, u_{9}, u_{10}, u_{11}, u_{12}, u_{13},}

g_{0.25}^{c d} (u_{11}) = {u_{4}, u_{5}, u_{7}, u_{9}, u_{10}, u_{11}, u_{12}, u_{13},}

,

g_{0.25}^{c d} (u_{12}) = {u_{3}, u_{4}, u_{7}, u_{10}, u_{11}, u_{12}, u_{13},}

g_{0.25}^{c d} (u_{13}) = {u_{3}, u_{4}, u_{5}, u_{7}, u_{9}, u_{10}, u_{11}, u_{12}, u_{13},}

,

g_{0.25}^{c d} (u_{14}) = {u_{1}, u_{2}, u_{6}, u_{8}, u_{14},}

Random coverage of training systems is as follows,

C o v e r (U_{t r n}) = {g_{0.25}^{c d} (u_{13}), g_{0.25}^{c d} (u_{14}),}

The concept-dependent granular reflection of decision system from Table 1 is in Table 5.

2.3. Homogeneous Granulation

The homogeneous granules are defined based on standard and concept dependent granules previously defined,

g_{r_{g r a n}}^{h o m o g e n e o u s} (u) = {v \in U : | g_{r_{g r a n}}^{c d} (u) | - | g_{r_{g r a n}} (u) | = = 0}

f o r m i n i m a l r_{g r a n} f u l f i l l s t h e e q u a t i o n

Toy Example

Consider the training decision system from Table 1.

Homogeneous granules for all training objects:

g_{1} (u_{1}) = (u_{1})

,

g_{0.75} (u_{2}) = (u_{1}, u_{2})

,

g_{1} (u_{3}) = (u_{3})

,

g_{1} (u_{4}) = (u_{4})

,

g_{1} (u_{5}) = (u_{5})

,

g_{1} (u_{6}) = (u_{6})

,

g_{1} (u_{7}) = (u_{7})

,

g_{1} (u_{8}) = (u_{8})

,

g_{0.75} (u_{9}) = (u_{5}, u_{9})

,

g_{0.75} (u_{10}) = (u_{4}, u_{5}, u_{10})

,

g_{0.75} (u_{11}) = (u_{11})

,

g_{1} (u_{12}) = (u_{12})

,

g_{0.75} (u_{13}) = (u_{3}, u_{13})

,

g_{1} (u_{14}) = (u_{14})

.

Randomly selected coverage granules,

g_{0.75} (u_{2}) = (u_{1}, u_{2}),

,

g_{1} (u_{4}) = (u_{4})

,

g_{1} (u_{6}) = (u_{6})

,

g_{1} (u_{7}) = (u_{7})

,

g_{1} (u_{8}) = (u_{8})

,

g_{0.75} (u_{9}) = (u_{5}, u_{9})

,

g_{0.75} (u_{10}) = (u_{4}, u_{5}, u_{10})

,

g_{1} (u_{12}) = (u_{12})

,

g_{0.75} (u_{13}) = (u_{3}, u_{13})

,

g_{1} (u_{14}) = (u_{14})

.

The granular decision system from the above granules is in Table 6.

2.4. Layered Granulation

Layered granulation leads to a sequence of granular reflections of decreasing sizes, which stabilizes after a finite number of steps; usually, about five steps are sufficient. Another development that may be stressed here is the heuristic rule for finding the optimal granulation radius giving the highest accuracy.

the optimal granulation radius is located around the value which yields the maximal decrease in size of the granular reflection between the first and the second granulation layers—see [30].

Toy Example

Exemplary multiple granulation of Quinlan’s data set [36], see Table 1, for the granulation radius of 0.5 and layers

l_{0}, l_{1}, \dots

runs as follows.

For the decision system from Table 1, granules in the first layer are (

r_{g r a n} = 0.5

):

g_{0.5, l_{1}}^{c d} (u_{1}) = {u_{1}, u_{2}, u_{8}}

,

g_{0.5, l_{1}}^{c d} (u_{2}) = {u_{1}, u_{2}, u_{8}, u_{14}}

,

g_{0.5, l_{1}}^{c d} (u_{3}) = {u_{3}, u_{4}, u_{12}, u_{13}}

,

g_{0.5, l_{1}}^{c d} (u_{4}) = {u_{3}, u_{4}, u_{5}, u_{10}, u_{12}}

,

g_{0.5, l_{1}}^{c d} (u_{5}) = {u_{4}, u_{5}, u_{7}, u_{9}, u_{10}, u_{13}}

,

g_{0.5, l_{1}}^{c d} (u_{6}) = {u_{6}, u_{14}}

,

g_{0.5, l_{1}}^{c d} (u_{7}) = {u_{5}, u_{7}, u_{9}, u_{11}, u_{12}, u_{13}}

,

g_{0.5, l_{1}}^{c d} (u_{8}) = {u_{1}, u_{2}, u_{8}, u_{14}}

,

g_{0.5, l_{1}}^{c d} (u_{9}) = {u_{5}, u_{7}, u_{9}, u_{10}, u_{11}, u_{13}}

,

g_{0.5, l_{1}}^{c d} (u_{10}) = {u_{4}, u_{5}, u_{9}, u_{10}, u_{11}, u_{13}}

,

g_{0.5, l_{1}}^{c d} (u_{11}) = {u_{7}, u_{9}, u_{10}, u_{11}, u_{12}}

,

g_{0.5, l_{1}}^{c d} (u_{12}) = {u_{3}, u_{4}, u_{7}, u_{11}, u_{12}}

,

g_{0.5, l_{1}}^{c d} (u_{13}) = {u_{3}, u_{5}, u_{7}, u_{9}, u_{10}, u_{13}}

,

g_{0.5, l_{1}}^{c d} (u_{14}) = {u_{2}, u_{6}, u_{8}, u_{14}}

.

Covering process of

U_{l_{0}}

with usage of order–preserving strategy yields us the covering:

U_{l_{0}, C o v e r} \leftarrow

∅,

Step1

g_{0.5, l_{1}}^{c d} (u_{1}) \to U_{l_{0}, C o v e r}

,

U_{l_{0}, C o v e r} = {u_{1}, u_{2}, u_{8}}

,

Step2

g_{0.5, l_{1}}^{c d} (u_{2}) \to U_{l_{0}, C o v e r}

,

U_{l_{0}, C o v e r} = {u_{1}, u_{2}, u_{8}, u_{14}}

,

Step3

g_{0.5, l_{1}}^{c d} (u_{3}) \to U_{l_{0}, C o v e r}

,

U_{l_{0}, C o v e r} = {u_{1}, u_{2}, u_{3}, u_{4}, u_{8}, u_{12}, u_{13}, u_{14}}

,

Step4

g_{0.5, l_{1}}^{c d} (u_{4}) \to U_{l_{0}, C o v e r}

,

U_{l_{0}, C o v e r} = {u_{1}, u_{2}, u_{3}, u_{4}, u_{5}, u_{8}, u_{10}, u_{12}, u_{13}, u_{14}}

,

Step5

g_{0.5, l_{1}}^{c d} (u_{5}) \to U_{l_{0}, C o v e r}

,

U_{l_{0}, C o v e r} = {u_{1}, u_{2}, u_{3}, u_{4}, u_{5}, u_{7}, u_{8}, u_{9}, u_{10}, u_{12}, u_{13}, u_{14}}

,

Step6

g_{0.5, l_{1}}^{c d} (u_{6}) \to U_{l_{0}, C o v e r}

,

U_{l_{0}, C o v e r} = {u_{1}, u_{2}, u_{3}, u_{4}, u_{5}, u_{6}, u_{7}, u_{8}, u_{9}, u_{10}, u_{12}, u_{13}, u_{14}}

,

Step7

g_{0.5, l_{1}}^{c d} (u_{7}) \to U_{l_{0}, C o v e r}

,

U_{l_{0}, C o v e r} = U_{l_{0}}

.

The granular reflection of

(U_{l_{0}}, A, d)

based on granules from

U_{l_{0}, C o v e r}

, with use of Majority Voting, where ties are resolved according to the ordering of granules are shown in Table 7.

Exemplary granular reflection formation based on Majority Voting looks as follows. In case, e.g., of the granule

g_{0.5, l_{1}}^{c d} (u_{1})

, we have

M V (g_{0.5, l_{1}}^{c d} (u_{1})) = \{\begin{matrix} \underset{̲}{S u n n y} \underset{̲}{H o t} \underset{̲}{H i g h} \underset{̲}{W e a k} \\ \underset{̲}{S u n n y} \underset{̲}{H o t} \underset{̲}{H i g h} S t r o n g \\ \underset{̲}{S u n n y} M i l d \underset{̲}{H i g h} \underset{̲}{W e a k} \end{matrix}\} = \underset{̲}{S u n n y} \underset{̲}{H o t} \underset{̲}{H i g h} \underset{̲}{W e a k}

Treating all other granules in the same way, we obtain the granular reflection

(U_{l_{1}}, A, d)

shown in Table 7.

Granulation performed in the same manner with the granular reflection

(U_{l_{1}}, A, d)

from Table 7 yields the granule set in the second layer.

g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{1}))) = {M V (g_{0.5, l_{1}}^{c d} (u_{1})), M V (g_{0.5, l_{1}}^{c d} (u_{2}))}

g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{2}))) = {M V (g_{0.5, l_{1}}^{c d} (u_{1})), M V (g_{0.5, l_{1}}^{c d} (u_{2}))}

g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{3}))) = {M V (g_{0.5, l_{1}}^{c d} (u_{3})), M V (g_{0.5, l_{1}}^{c d} (u_{4}))}

g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{4}))) = {M V (g_{0.5, l_{1}}^{c d} (u_{3})), M V (g_{0.5, l_{1}}^{c d} (u_{4})), M V (g_{0.5, l_{1}}^{c d} (u_{5}))}

g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{5}))) = {M V (g_{0.5, l_{1}}^{c d} (u_{4})), M V (g_{0.5, l_{1}}^{c d} (u_{5})), M V (g_{0.5, l_{1}}^{c d} (u_{7}))}

g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{6}))) = {M V (g_{0.5, l_{1}}^{c d} (u_{6}))}

g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{7}))) = {M V (g_{0.5, l_{1}}^{c d} (u_{5})), M V (g_{0.5, l_{1}}^{c d} (u_{7}))}

The covering process of

U_{l_{1}, C o v e r}

runs in the following steps:

Step1

g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{1}))) \to U_{l_{1}, C o v e r}

, Step2

g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{2}))) ↛ U_{l_{1}, C o v e r}

,

Step3

g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{3}))) \to U_{l_{1}, C o v e r}

, Step4

g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{4}))) \to U_{l_{1}, C o v e r}

,

Step5

g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{5}))) \to U_{l_{1}, C o v e r}

, Step6

g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{6}))) \to U_{l_{1}, C o v e r}

,

U_{l_{1}, C o v e r} = U_{l_{1}}

Applying Majority Voting to granules in

U_{l_{1}}

, we obtain the second granular reflection shown in Table 8.

The third layer of granulation based on system

(U_{l_{2}}, A, d)

from Table 8 is as follows:

g_{0.5, l_{3}}^{c d} (M V (g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{1}))))) = {M V (g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{1}))))}

g_{0.5, l_{3}}^{c d} (M V (g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{3}))))) =

{M V (g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{3})))), M V (g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{4}))))}

g_{0.5, l_{3}}^{c d} (M V (g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{4}))))) =

= {M V (g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{3})))), M V (g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{4})))), M V (g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{5}))))}

g_{0.5, l_{3}}^{c d} (M V (g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{5}))))) =

{M V (g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{4})))), M V (g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{5}))))}

g_{0.5, l_{3}}^{c d} (M V (g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{6}))))) = {M V (g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{6}))))}

Covering process for the third layer is as follows:

Step1

g_{0.5, l_{3}}^{c d} (M V (g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{1}))))) \to U_{l_{2}, C o v e r}

,

Step2

g_{0.5, l_{3}}^{c d} (M V (g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{3}))))) \to U_{l_{2}, C o v e r}

,

Step3

g_{0.5, l_{3}}^{c d} (M V (g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{4}))))) \to U_{l_{2}, C o v e r}

,

Step4

g_{0.5, l_{3}}^{c d} (M V (g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{5}))))) ↛ U_{l_{2}, C o v e r}

,

Step5

g_{0.5, l_{3}}^{c d} (M V (g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{6}))))) \to U_{l_{2}, C o v e r}

,

U_{l_{2}, C o v e r} = U_{l_{2}}

Using Majority voting, we get the third layer of granular reflections shown in Table 9.

2.5. Epsilon Variants

These methods are designed for numerical data; we can use, for instance,

ε

-normalized Hamming metric, which, for given

ε

, is defined as

d_{H, ε} (u, v) = | {a \in A : \frac{a b s (a (u) - a (v))}{m a x_{a} - m i n_{a}} > ε} |,

(4)

where abs is absolute value,

The methods work analogously to variants for symbolic data; thus, we show only exemplary definition without toy examples.

$ε$ –Modification of the Standard Rough Inclusion

Given a parameter

ε

valued in the unit interval

[0, 1]

, we define the set

I N D_{ε} (u, v) = {a \in A : d i s t (a (u), a (v)) \leq ε},

(5)

and we set

μ_{ε} (v, u, r) \Leftrightarrow \frac{| I N D_{ε} (u, v) |}{| A |} \geq r

(6)

Epsilon variant of homogeneous granulation can be defined as follows.

2.6. Epsilon Homogeneous Granulation

The method is defined in the following way:

g_{r_{u}}^{ε, h o m o g e n e o u s} = {v \in U : | g_{r_{u}}^{ε - c d} | - | g_{r_{u}}^{ε} | = = 0}, f o r m i n i m a l r_{u} f u l f i l l s t h e e q u a t i o n

where g_{r_{u}}^{ε, c d} (u) = {v \in U : \frac{| I N D_{ε} (u, v) |}{| A |} \leq r_{u} A N D d (u) = = d (v)}

and g_{r_{u}}^{ε} (u) = {v \in U : \frac{| I N D_{ε} (u, v) |}{| A |} \leq r_{u}}, r_{u} = {\frac{0}{| A |}, \frac{1}{| A |}, \dots, \frac{| A |}{| A |}}

I N D_{ε} (u, v) = {a \in A : \frac{a b s (a (u) - a (v))}{m a x_{a} - m i n_{a}} \leq ε}

where

m a x_{a}

,

m i n_{a}

are the maximal and minimal attribute values for

a \in A

in the original data set.

3. A Sample of the Experimental Work Results

In this section, we show the exemplary results for our selected techniques, to show its effectiveness in the context of reducing training data size. For the sake of simplicity, we have chosen the k-NN classifier as a base. We carried out experiments on selected data from the UCI repository [37]—see Table 10. In Table 11, Table 12, Table 13, Table 14, Table 15, Table 16, Table 17, Table 18, Table 19 and Table 20 and Figure 1, we have the results for Cross Validation 5 method.

Let us move on to the discussion of selected detailed results, starting from description of the results for the Australian Credit data set. The result for Standard (SG) and Concept-dependent granulation (CDG) is in Table 11, where, in case of SG for radius 0.5, we have reduction in training size of around 90 percent preserving classification accuracy in the range of

84.7

percent. For the CDG variant, we have reduction in training size of about

99.5

percent for radius

0.071

, where the exhaustive rule set is reduced in

99.9

percent and accuracy of classification is around 77 percent. The results are comparable, but the concept-dependent variant shows a more stable classification as the radius increases. In case of Homogeneous granulation, see Table 15, we have accuracy equal to

0.835

with a 48 percent reduction of training size. The sample of results for exemplary epsilon variant—

ε

Homogeneous Granulation—is in Table 20, where we have reduction in training size about 50 percent, with accuracy of

0.842

. The layered granulation process is visible in Table 16, where the basic method is concept-dependent granulation and the result is similar to a single concept-dependent variant. In the case of Car data set, see Table 12, the concept-dependent variant works best giving accuracy of 0.864, with a reduction in training size of around 73 percent. For a Hepatitis data set, concept-dependent also works best, for radius

0.474

, the accuracy is equal to

0.875

, with a 90 percent reduction in training size. In addition, finally, the spectacular result is obtained for Heart Disease data set, where with 99 percent reduction in training size, we have obtained for concept-dependent and standard granulation the accuracy

0.8

. The results for homogeneous variants are shown in Table 15 and Table 20. The best result we have achieved on the tested data are a reduction of 62 percent in the number of objects with full classification efficiency. Allow us to summarize the results obtained in this section. The internal knowledge from the original training decision systems—measured by ability for classification—seems to be preserved in each mentioned case (the accuracy of classification is fully comparable with nil case, without reduction). Both techniques, standard granulation and concept-dependent, prove to be comparable. In the concept-dependent variant, we observe a higher classification stability with an increasing radius. Another advantage of the concept-dependent variant is the creation of granular reflection, which from the smallest radii contain patterns from all decision-making classes. The multiple variant does not produce spectacular results, but, according to our previous research, see [30]—it allows us to look for the optimal granulation radii. Our research shows that the radius for which the reduction of objects between the first and second layer is greatest is close to the optimal one in most tested systems. In this way, the optimum granulation radius can be estimated without classification tests. The last group of tested techniques are recently discovered homogeneous methods, which work dynamically on every data and do not require estimation of optimal parameters. It is obvious that the effectiveness of our methods depends to a large extent on the data under investigation.

We do not plan to present an overview of the effectiveness of the whole range of classification techniques because our aim was to present an example of the effectiveness of approximation methods for decision-making systems. Let us move on to presenting additional test results for selected previously used classifiers.

4. Application of Selected Other Classifiers on Granular Data

In our previous research, we checked the performance of the tens of classifiers; each variant examined matched well with the granular data. Some of the most interesting results were obtained for the Naive Bayes classifier (see the results in Chapter 7 of [30]), the SVM technique [38], and Deep Learning [39]. Examples of results are presented in this section.

In Figure 2, we have the accuracy of the classification of the granular data using the SVM method with an RBF kernel. We use the

ε

concept-dependent granulation—see Section 2.5. It is the result for Wisconsin Diagnostic Breast Cancer data set (see [37]) 569 objects and 32 attributes. Analyzing Figure 2 and Figure 3, we see that the level of accuracy of the classification is reasonable with a considerable percentage of the size reduction of granular systems.

Considering four variants of classification for the Naive Bayes classifier (for which the parameters determining the classification are as follows):

$P a r a m_{d = d_{i}}^{V 1} = \sum_{m = 1}^{n} P (b_{m} = a_{m} (v) | d = d_{i})$ .
$P a r a m_{d = d_{i}}^{V 2} = P (d = d_{i}) * \sum_{m = 1}^{n} P (b_{m} = a_{m} (v) | d = d_{i})$ .
$P a r a m_{d = d_{i}}^{V 3} = \prod_{m = 1}^{n} P (b_{m} = a_{m} (v) | d = d_{i})$ .
$P a r a m_{d = d_{i}}^{V 4} = P (d = d_{i}) * \prod_{m = 1}^{n} P (b_{m} = a_{m} (v) | d = d_{i})$ .

The results showing the effectiveness of the Naive Bayes classifier can be found in Table 21, Table 22, Table 23 and Table 24 (the details can be found in [30]). The most spectacular approximation is for the 0.428571 radius, where, with an Australian credit data set, accuracy of classification is 0.852, and the average number of objects is reduced by about 94 percent.

In Table 25, we have presented an example of the result of a deep neural network on the granulated data—see [39]. It turns out that it learns the internal knowledge of decision-making systems and maintains a high level of classification effectiveness. In Table 25 and Figure 4, we have the result for Australian Credit data set, for radius

0.66

, with a reduction of 40 percent, and classification efficiency is around 84 percent.

The additional experimental results presented were to show that our granular techniques are compatible with various classification methods. In the next section, we discuss the potential directions of development of granular computing methods, through the prism of the possibilities of our own methods.

5. Future Directions in Granular Computing Paradigm

Granular computing techniques will undoubtedly play a key role in building artificial intelligence because intelligent handling of data are based on analyzing its similarity and abstracting from the vast amount of information available in the environment. One of the problems to be solved is the ability to use real-time granular computing techniques on large data. The only barrier against using these methods is scalability problem. To deal with possible scalability problems, the following methods can be considered: Data sampling method and creation of model based on samples; Decomposition method, to use the algorithms on the split data and work on them separately; the streaming computing method, incremental data processing; the mass parallel computing technique on the computer cluster, with the use of classic ways to compute in parallel, like MPI implementation (Message Passing Interface); and mass parallel computing methods based on future technologies like quantum calculations. Without a doubt, deep neural networks is one of the promising fields for using granular computing. New methods of data preprocessing can be expected to emerge, before feeding it into deep neural networks. In particular, we mean the use of granular computing in the convolutionary and pooling part of the convolutional neural networks. The granular structures of the granular computing paradigm can intuitively be used to build such new network architectures at a time when we have no clear limit of creating neural network structures. Modeling the world using granular computing is a very natural process for us, which will undoubtedly play a crucial role in the development of future technologies.

6. Conclusions

In this work, we offer a review of selected recently developed granular computing techniques dedicated to the approximation of decision systems (from the family of methods proposed by Polkowski in [22,24]). That is, techniques which, among other things, aim at reducing the size of data while maintaining their classification efficiency. A very important family of techniques is dedicated to speeding up decision-making processes. Our approximation techniques reduce the size of decision systems significantly maintaining the internal knowledge at the same time, which was proven in many experimental works. In our research, the main problem for standard, concept dependent, and layered methods is the need to estimate the optimal granulation radius searching among all possible ones. The problem has been partially solved for these methods—in the previous works, we have developed heuristics for searching optimal parameters by a double granulation technique (see [30]). In our last technique, homogeneous granulation, this problem does not apply because parameters are automatically set in the process of approximation. Our last method seems to be an important discovery, as it is immediately applicable, without the need to estimate the parameters, and it turns out to work very well in all the contexts we have studied. Particularly noteworthy is its application in the new technique of boosting classification—Ensemble of Random Granular Reflections [32]. To sum up our work, the presented granulation techniques allow for reducing the number of exhaustive set of rules by up to 99 percent while maintaining classification efficiency at the level obtained on the original unreduced data. Such efficiency was obtained, for example, for the concept-dependent technique using the kNN classifier. On the other hand, our methods achieve a reduction in the number of objects to more than 90 percent while maintaining classification efficiency on the original data. We have achieved such results, for example, for standard granulation with the kNN classification and concept-dependent granulation using the Naive Bayes classifier. As the closest directions of research on the development of our knowledge granulation methods, we can point out the work on hybrids with deep neural network learning and Random Forests technique. Another direction of work is the application in the process of convolution and pooling for the convolutionary neural networks and development of our proposed Ensemble model based on random granular reflections of decision systems. In conclusion of this review, we may add that, without any doubt, real-time granular computing methods will play an important role in creating artificial intelligence. Therefore, it is worthwhile to develop methods for the approximation of decision systems in order to invest in research into this prospective paradigm of knowledge.

Funding

This work has been fully supported by the grant from the Ministry of Science and Higher Education of the Republic of Poland under the project number 23.610.007-000.

Conflicts of Interest

The author declares no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

References

Zadeh, L.A. Fuzzy Sets and Information Granularity. 1979. Available online: https://digitalassets.lib.berkeley.edu/techreports/ucb/text/ERL-m-79-45.pdf (accessed on 13 February 2020).
Zadeh, L.A. Graduation and granulation are keys to computation with information described in natural language. In Proceedings of the 2006 IEEE International Conference on Granular Computing, Atlanta, GA, USA, 10–12 May 2006. [Google Scholar]
Pawlak, Z. Rough sets. Int. J. Comput. Inform. Sci. 1982, 11, 341–356. [Google Scholar] [CrossRef]
Skowron, A.; Polkowski, L. Synthesis of decision systems from data tables. In Rough Sets and Data Mining; Lin, T.Y., Cercone, N., Eds.; Springer: Boston, MA, USA, 1997; pp. 289–299. [Google Scholar]
Lin, T.Y. Granular computing: Examples, intuitions and modeling. In Proceedings of the 2005 IEEE International Conference on Granular Computing, Beijing, China, 25–27 July 2005; Volume 1, pp. 40–44. [Google Scholar]
Yao, Y.Y. Granular computing: Basic issues and possible solutions. In Proceedings of the 5th Joint Conference on Information Sciences, Atlantic, NJ, USA, 27 February 2000; Volume 1, pp. 186–189. [Google Scholar]
Yao, Y. Information Granulation and Approximation in a Decision-Theoretical Model of Rough Sets. In Rough-Neural Computing; Pal, S.K., Polkowski, L., Skowron, A., Eds.; Springer: Berlin, Germany, 2004; pp. 491–516. [Google Scholar]
Yao, Y. Perspectives of granular computing. In Proceedings of the 2005 IEEE International Conference on Granular Computing, Beijing, China, 25–27 July 2005; Volume 1, pp. 85–90. [Google Scholar]
Skowron, A.; Stepaniuk, J. Information granules: Towards foundations of granular computing. Int. J. Intell. Syst. 2001, 16, 57–85. [Google Scholar] [CrossRef]
Skowron, A.; Stepaniuk, J. Information Granules and Rough-Neural Computing. In Rough-Neural Computing; Pal, S.K., Polkowski, L., Skowron, A., Eds.; Springer: Berlin, Germany, 2004; pp. 43–84. [Google Scholar]
Polkowski, L.; Semeniuk–Polkowska, M. On rough set logics based on similarity relations. Fund. Inform. 2005, 64, 379–390. [Google Scholar]
Liu, Q.; Sun, H. Theoretical study of granular computing. In Rough Sets and Knowledge Technology; Wang, G.Y., Peters, J.F., Skowron, A., Yao, Y., Eds.; Springer: Berlin, Germany, 2006; Volume 4062, pp. 92–102. [Google Scholar]
Cabrerizo, F.J.; Al-Hmouz, R.; Morfeq, A.; Martínez, M.A.; Pedrycz, W.; Herrera-Viedma, E. Estimating incomplete information in group decision-making: A framework of granular computing. Appl. Soft Comput. 2020, 86, 105930. [Google Scholar] [CrossRef]
Hryniewicz, O.; Kaczmarek, K. Bayesian analysis of time series using granular computing approach. Appl. Soft Comput. 2016, 47, 644–652. [Google Scholar] [CrossRef]
Martino, A.; Giuliani, A.; Rizzi, A. (Hyper) Graph Embedding and Classification via Simplicial Complexes. Algorithms 2019, 12, 223. [Google Scholar] [CrossRef] [Green Version]
Martino, A.; Giuliani, A.; Todde, V.; Bizzarri, M.; Rizzi, A. Metabolic networks classification and knowledge discovery by information granulation. Comput. Biol. Chem. 2020, 84, 107187. [Google Scholar] [CrossRef] [PubMed]
Pownuk, A.; Kreinovich, V. Granular approach to data processing under probabilistic uncertainty. In Granular Computing; Springer: Berlin, Germany, 2019; pp. 1–17. [Google Scholar]
Zhong, C.; Pedrycz, W.; Wang, D.; Li, L.; Li, Z. Granular data imputation: A framework of granular computing. Appl. Soft Comput. 2016, 46, 307–316. [Google Scholar] [CrossRef]
Leng, J.; Chen, Q.; Mao, N.; Jiang, P. Combining granular computing technique with deep learning for service planning under social manufacturing contexts. Knowl.-Based Syst. 2018, 143, 295–306. [Google Scholar] [CrossRef]
Ghiasi, B.; Sheikhian, H.; Zeynolabedin, A.; Niksokhan, M.H. Granular computing-neural network model for prediction of longitudinal dispersion coefficients in rivers. Water Sci. Technol. 2020, 80, 1880–1892. [Google Scholar] [CrossRef] [PubMed]
Capizzi, G.; Lo Sciuto, G.; Napoli, C.; Połap, D.; Woźniak, M. Small Lung Nodules Detection Based on Fuzzy-Logic and Probabilistic Neural Network with Bio-inspired Reinforcement Learning. 2019. Available online: https://ieeexplore.ieee.org/abstract/document/8895990 (accessed on 13 February 2020).
Polkowski, L. Formal granular calculi based on rough inclusions. In Proceedings of the 2005 IEEE Conference on Granular Computing, Beijing, China, 25–27 July 2005; pp. 57–62. [Google Scholar]
Polkowski, L. Approximate Reasoning by Parts. An Introduction to Rough Mereology; Springer: Berlin, Germany, 2011. [Google Scholar]
Polkowski, L. A model of granular computing with applications. In Proceedings of the 2006 IEEE Conference on Granular Computing, Atlanta, GA, USA, 10 May 2006; pp. 9–16. [Google Scholar]
Artiemjew, P. Classifiers from Granulated Data Sets: Concept Dependent and Layered Granulation. 2007. Available online: https://pdfs.semanticscholar.org/e46a/0e41d0833263220680aa1ec7ae9ed3edbb42.pdf#page=7 (accessed on 13 February 2020).
Artiemjew, P.; Ropiak, K.K. On Granular Rough Computing: Handling Missing Values by Means of Homogeneous Granulation. Computers 2020, 9, 13. [Google Scholar] [CrossRef] [Green Version]
Polkowski, L. Granulation of knowledge in decision systems: The approach based on rough inclusions. The method and its applications. In Rough Sets and Intelligent Systems Paradigms; Kryszkiewicz, M., Peters, J.F., Rybinski, H., Skowron, A., Eds.; Springer: Berlin, Germany, 2007; Volume 4585, pp. 69–79. [Google Scholar]
Polkowski, L. Granulation of Knowledge: Similarity Based Approach in Information and Decision Systems. In Encyclopedia of Complexity and System Sciences; Meyers, R.A., Ed.; Springer: Berlin, Germany, 2009. [Google Scholar]
Polkowski, L.; Artiemjew, P. On granular rough computing with missing values. In Rough Sets and Intelligent Systems Paradigms; Kryszkiewicz, M., Peters, J.F., Rybinski, H., Skowron, A., Eds.; Springer: Berlin, Germany, 2007; Volume 4585, pp. 271–279. [Google Scholar]
Polkowski, L.; Artiemjew, P. Granular Computing in Decision Approximation - An Application of Rough Mereology; Springer: Cham, Switzerland, 2015. [Google Scholar]
Polkowski, L.; Artiemjew, P. On granular rough computing: Factoring classifiers through granular structures. In Rough Sets and Intelligent Systems Paradigms; Kryszkiewicz, M., Peters, J.F., Rybinski, H., Skowron, A., Eds.; Springer: Berlin, Germany, 2007; Volume 4585, pp. 280–290. [Google Scholar]
Artiemjew, P.; Ropiak, K. A Novel Ensemble Model - The Random Granular Reflections. 2018. Available online: http://ceur-ws.org/Vol-2240/paper17.pdf (accessed on 13 February 2020).
Ropiak, K.; Artiemjew, P. Homogenous Granulation and Its Epsilon Variant. Computers 2019, 8, 36. [Google Scholar] [CrossRef] [Green Version]
Artiemjew, P. A Review of the Knowledge Granulation Methods: Discrete vs. Continuous Algorithms. In Rough Sets and Intelligent Systems - Professor Zdzisław Pawlak in Memoriam; Skowron, A., Suraj, Z., Eds.; Springer: Berlin, Germany, 2013; Volume 43, pp. 41–59. [Google Scholar]
Polkowski, L. Rough Sets; Springer: Berlin, Germany, 2002. [Google Scholar]
Quinlan, J.R. C4.5: Programs for Machine Learning; Elsevier: New York, NY, USA, 2004. [Google Scholar]
University of California, Irvine Machine Learning Repository. Available online: https://archive.ics.uci.edu/ml/index.php (accessed on 13 February 2020).
Szypulski, J.; Artiemjew, P. The Rough Granular Approach to Classifier Synthesis by Means of SVM. In Rough Sets, Fuzzy Sets, Data Mining, and Granular Computing; Yao, Y., Hu, Q., Yu, H., Grzymala-Busse, J., Eds.; Springer: Cham, Switzerland, 2015; Volume 9437, pp. 256–263. [Google Scholar]
Ropiak, K.; Artiemjew, P. On a Hybridization of Deep Learning and Rough Set Based Granular Computing. Algorithms 2020, 13, 63. [Google Scholar] [CrossRef] [Green Version]

Figure 1. Visualization of results for Australian credit.

Figure 2. Results of classification accuracy for SVM with RBF kernel, 5 × CV5 test,

ε

concept-dependent granulation; Wisconsin Diagnostic Breast Cancer data set; Epslilon = is descriptors indiscernibility ratio, Radius = granulation radius.

Figure 2. Results of classification accuracy for SVM with RBF kernel, 5 × CV5 test,

ε

concept-dependent granulation; Wisconsin Diagnostic Breast Cancer data set; Epslilon = is descriptors indiscernibility ratio, Radius = granulation radius.

Figure 3. Percentage size of granulated data, 5 × CV5 test,

ε

concept-dependent granulation; Wisconsin Diagnostic Breast Cancer data set; Epslilon = is descriptors indiscernibility ratio, Radius = granulation radius.

Figure 3. Percentage size of granulated data, 5 × CV5 test,

ε

concept-dependent granulation; Wisconsin Diagnostic Breast Cancer data set; Epslilon = is descriptors indiscernibility ratio, Radius = granulation radius.

Figure 4. Visualization of classification efficiency for ten learning cycles of the neural network taking into account the percentage reduction of objects.

Table 1. Exemplary decision system

(U, A, d)

by J. R. Quinlan [36].

Table 1. Exemplary decision system

(U, A, d)

by J. R. Quinlan [36].

Day	Outlook	Temperature	Humidity	Wind	Play.golf
$u_{1}$	Sunny	Hot	High	Weak	No
$u_{2}$	Sunny	Hot	High	Strong	No
$u_{3}$	Overcast	Hot	High	Weak	Yes
$u_{4}$	Rainy	Mild	High	Weak	Yes
$u_{5}$	Rainy	Cool	Normal	Weak	Yes
$u_{6}$	Rainy	Cool	Normal	Strong	No
$u_{7}$	Overcast	Cool	Normal	Strong	Yes
$u_{8}$	Sunny	Mild	High	Weak	No
$u_{9}$	Sunny	Cool	Normal	Weak	Yes
$u_{10}$	Rainy	Mild	Normal	Weak	Yes
$u_{11}$	Sunny	Mild	Normal	Strong	Yes
$u_{12}$	Overcast	Mild	High	Strong	Yes
$u_{13}$	Overcast	Hot	Normal	Weak	Yes
$u_{14}$	Rainy	Mild	High	Strong	No

Table 2. Triangular indiscernibility matrix for standard granulation (

i < j

), derived from Table 1

c_{i j} = 1, i f \frac{| I N D (u_{i}, u_{j}) |}{| A |} \geq 0.5 0, o t h e r w i s e

.

Table 2. Triangular indiscernibility matrix for standard granulation (

i < j

), derived from Table 1

c_{i j} = 1, i f \frac{| I N D (u_{i}, u_{j}) |}{| A |} \geq 0.5 0, o t h e r w i s e

.

	$u_{1}$	$u_{2}$	$u_{3}$	$u_{4}$	$u_{5}$	$u_{6}$	$u_{7}$	$u_{8}$	$u_{9}$	$u_{10}$	$u_{11}$	$u_{12}$	$u_{13}$	$u_{14}$
$u_{1}$	1	1	1	1	0	0	0	1	1	0	0	0	1	0
$u_{2}$		1	1	0	0	0	0	1	0	0	1	1	0	1
$u_{3}$			1	1	0	0	0	1	0	0	0	1	1	0
$u_{4}$				1	1	0	0	1	0	1	0	1	0	1
$u_{5}$					1	1	1	0	1	1	0	0	1	0
$u_{6}$						1	1	0	1	1	1	0	0	1
$u_{7}$							1	0	1	0	1	1	1	0
$u_{8}$								1	1	1	1	1	0	1
$u_{9}$									1	1	1	0	1	0
$u_{10}$										1	1	0	1	1
$u_{11}$											1	1	0	1
$u_{12}$												1	0	1
$u_{13}$													1	0
$u_{14}$														1

Table 3. Standard granular reflection of the exemplary training system from Table 1, in radius 0.5, 5 attributes, 4 objects; MV is Majority Voting procedure (the most frequent descriptors create a granular reflection).

$Day$	$Outlook$	$Temperature$	$Humidity$	$Wind$	Play.golf
$M V (g_{0.5} (u_{1}))$	Sunny	Hot	High	Weak	Yes
$M V (g_{0.5} (u_{4}))$	Rainy	Mild	High	Weak	Yes
$M V (g_{0.5} (u_{5}))$	Rainy	Cool	Normal	Weak	Yes
$M V (g_{0.5} (u_{14}))$	Rainy	Mild	High	Strong	No

Table 4. Triangular indiscernibility matrix for concept-dependent granule generation (

i < j

), derived from Table 1.

Table 4. Triangular indiscernibility matrix for concept-dependent granule generation (

i < j

), derived from Table 1.

	$u_{1}$	$u_{2}$	$u_{3}$	$u_{4}$	$u_{5}$	$u_{6}$	$u_{7}$	$u_{8}$	$u_{9}$	$u_{10}$	$u_{11}$	$u_{12}$	$u_{13}$	$u_{14}$
$u_{1}$	1	1	0	0	0	0	0	1	0	0	0	0	0	1
$u_{2}$		1	0	0	0	1	0	1	0	0	0	0	0	1
$u_{3}$			1	1	1	0	1	0	1	1	0	1	1	0
$u_{4}$				1	1	0	0	0	1	1	1	1	1	0
$u_{5}$					1	0	1	0	1	1	1	0	1	0
$u_{6}$						1	0	0	0	0	0	0	0	1
$u_{7}$							1	0	1	1	1	1	1	0
$u_{8}$								1	0	0	0	0	0	1
$u_{9}$									1	1	1	0	1	0
$u_{10}$										1	1	1	1	0
$u_{11}$											1	1	1	0
$u_{12}$												1	1	0
$u_{13}$													1	0
$u_{14}$														1

Table 5. Concept-dependent granular reflection of the exemplary training system from Table 1, in radius 0.25, 5 attributes, 2 objects; MV is Majority Voting procedure (the most frequent descriptors create a granular reflection).

$Day$	$Outlook$	$Temperature$	$Humidity$	$Wind$	Play.golf
$M V (g_{0.25}^{c d} (u_{13}))$	Overcast	Mild	Normal	Weak	Yes
$M V (g_{0.25}^{c d} (u_{14}))$	Sunny	Hot	High	Strong	No

Table 6. Homogeneous granular decision system formed from covering granules.

Day	Outlook	Temperature	Humidity	Wind	Play Golf
$M V (g_{0.75} (u_{2}))$	Sunny	Hot	High	Weak	No
$M V (g_{1} (u_{4}))$	Rainy	Mild	High	Weak	Yes
$M V (g_{1} (u_{6}))$	Rainy	Cool	Normal	Strong	No
$M V (g_{1} (u_{7}))$	Overcast	Cool	Normal	Strong	Yes
$M V (g_{1} (u_{8}))$	Sunny	Mild	High	Weak	No
$M V (g_{0.75} (u_{9}))$	Rainy	Cool	Normal	Weak	Yes
$M V (g_{0.75} (u_{10}))$	Rainy	Mild	Normal	Weak	Yes
$M V (g_{1} (u_{12}))$	Overcast	Mild	High	Strong	Yes
$M V (g_{0.75} (u_{13}))$	Overcast	Hot	High	Weak	Yes
$M V (g_{1} (u_{14}))$	Rainy	Mild	High	Strong	No

Table 7. The decision system

(U_{l_{1}}, A, d)

.

Table 7. The decision system

(U_{l_{1}}, A, d)

.

Day	Outlook	Temperature	Humidity	Wind	Play Golf
$M V (g_{0.5, l_{1}}^{c d} (u_{1}))$	Sunny	Hot	High	Weak	No
$M V (g_{0.5, l_{1}}^{c d} (u_{2}))$	Sunny	Hot	High	Weak	No
$M V (g_{0.5, l_{1}}^{c d} (u_{3}))$	Overcast	Mild	High	Weak	Yes
$M V (g_{0.5, l_{1}}^{c d} (u_{4}))$	Rainy	Mild	High	Weak	Yes
$M V (g_{0.5, l_{1}}^{c d} (u_{5}))$	Rainy	Cool	Normal	Weak	Yes
$M V (g_{0.5, l_{1}}^{c d} (u_{6}))$	Rainy	Cool	Normal	Strong	No
$M V (g_{0.5, l_{1}}^{c d} (u_{7}))$	Overcast	Cool	Normal	Strong	Yes

Table 8. The decision system

(U_{l_{2}}, A, d)

,

t e m p_{1} = M V (g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{1}))))

,

t e m p_{2} = M V (g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{3}))))

,

t e m p_{3} = M V (g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{4}))))

,

t e m p_{4} = M V (g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{5}))))

,

t e m p_{5} = M V (g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{6}))))

.

Table 8. The decision system

(U_{l_{2}}, A, d)

,

t e m p_{1} = M V (g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{1}))))

,

t e m p_{2} = M V (g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{3}))))

,

t e m p_{3} = M V (g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{4}))))

,

t e m p_{4} = M V (g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{5}))))

,

t e m p_{5} = M V (g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{6}))))

.

Day	Outlook	Temperature	Humidity	Wind	Play Golf
$t e m p_{1}$	Sunny	Hot	High	Weak	No
$t e m p_{2}$	Overcast	Mild	High	Weak	Yes
$t e m p_{3}$	Rainy	Mild	High	Weak	Yes
$t e m p_{4}$	Rainy	Cool	Normal	Weak	Yes
$t e m p_{5}$	Rainy	Cool	Normal	Strong	No

Table 9. The decision system

(U_{l_{3}}, A, d)

,

t e m p_{1} = M V (g_{0.5, l_{3}}^{c d} (M V (g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{1}))))))

,

t e m p_{2} = M V (g_{0.5, l_{3}}^{c d} (M V (g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{3}))))))

,

t e m p_{3} = M V (g_{0.5, l_{3}}^{c d} (M V (g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{4}))))))

,

t e m p_{4} = M V (g_{0.5, l_{3}}^{c d} (M V (g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{6}))))))

.

Table 9. The decision system

(U_{l_{3}}, A, d)

,

t e m p_{1} = M V (g_{0.5, l_{3}}^{c d} (M V (g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{1}))))))

,

t e m p_{2} = M V (g_{0.5, l_{3}}^{c d} (M V (g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{3}))))))

,

t e m p_{3} = M V (g_{0.5, l_{3}}^{c d} (M V (g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{4}))))))

,

t e m p_{4} = M V (g_{0.5, l_{3}}^{c d} (M V (g_{0.5, l_{2}}^{c d} (M V (g_{0.5, l_{1}}^{c d} (u_{6}))))))

.

Day	Outlook	Temperature	Humidity	Wind	Play Golf
$t e m p_{1}$	Sunny	Hot	High	Weak	No
$t e m p_{2}$	Overcast	Mild	High	Weak	Yes
$t e m p_{3}$	Rainy	Mild	High	Weak	Yes
$t e m p_{4}$	Rainy	Cool	Normal	Strong	No

Table 10. Exemplary decision systems from UCI Machine Learning Repository. Australian credit, Car Evaluation, Heartdisease, and Hepatitis were used in the comparison of standard and concept-dependent granulation with a kNN Classifier. Comparing homogeneous variants with a kNN Classifier, we did not use the car system in the epsilon variant because it is symbolic. We used all four systems to present the effectiveness with the Classifier. To present the effectiveness with the SVM classifier, we used a Wisconsin Diagnostic Breast Cancer system [37].

Name	Attr No.	Obj No.	Class No.
Australian-credit	15	690	2
Car Evaluation	7	1728	4
Heartdisease	14	270	2
Hepatitis	20	155	2
Wisconsin Diagnostic Breast Cancer	32	569	2

Table 11. Exemplary result for Standard vs. Concept-Dependent Granulation—5 times Cross Validation 5; Australian Credit data set;

r_{g r a n} =