Relative Reduction of Neighborhood-Covering Pessimistic Multigranulation Rough Set Based on Evidence Theory

Relative reduction of multiple neighborhood-covering with multigranulation rough set has been one of the hot research topics in knowledge reduction theory. In this paper, we explore the relative reduction of covering information system by combining the neighborhood-covering pessimistic multigranulation rough set with evidence theory. First, the lower and upper approximations of multigranulation rough set in neighborhood-covering information systems are introduced based on the concept of neighborhood of objects. Second, the belief and plausibility functions from evidence theory are employed to characterize the approximations of neighborhood-covering multigranulation rough set. Then the relative reduction of neighborhood-covering information system is investigated by using the belief and plausibility functions. Finally, an algorithm for computing a relative reduction of neighborhood-covering pessimistic multigranulation rough set is proposed according to the significance of coverings defined by the belief function, and its validity is examined by a practical example.


Introduction
Relative reduction [1] of a covering information system refers to removing redundant coverings from a family of coverings while preserving the ability of the knowledge to characterize the data. Knowledge reduction [2] is one of the most important research subjects for both theoretical development and application. It has been widely used in the fields of artificial intelligence [3], pattern recognition [4] and machine learning [5].
Rough set [6], an important mathematical tool of granular computing [7], provides an effective method of knowledge discovery [8] and knowledge reduction. Covering rough set [9] and multigranulation rough set are two special models for dealing with real data sets that involve overlapping and multiple knowledge. Many researchers have studied the two models, especially their hybrid model, the covering multigranulation rough set [10]. Qian et al. [11] proposed multigranulation rough sets (MGRS) based on multiple equivalence relations. Liu and Miao [12] introduced four types of covering MGRS models in the covering approximation space. Xu et al. further weakened the equivalence relation, and proposed covering MGRS based on order relation [13], generalized relation [14] and fuzzy compatibility relation [15]. In practice, different MGRS models are generated according to the needs of different data sets, for example, the variable-precision MGRS model. Dou et al. [16] first proposed the variable-precision MGRS model and explored its properties. Ju et al. [17] introduced the model of the variable-precision MGRS, and presented a heuristic algorithm for computing reduction of variable-precision MGRS. Feng et al. [18] proposed the Type-1 variable-precision multigranulation decision-theoretic fuzzy rough set based on three-way decisions.
The Dempster-Shafer evidence theory [19,20] is based on a basic probability distribution, i.e., a mass function, and uses the belief and plausibility functions derived from the mass function to describe the uncertainty of evidence. There is a strong connection between rough set and evidence theory. Many scholars have combined rough set with evidence theory to investigate uncertainty measures and knowledge representation. Yao et al. [21] indicated that the belief and plausibility functions can be derived from the lower and upper approximation operators in rough set theory. Wu et al. [22] combined the belief structure with the rough approximation space and investigated knowledge reductions of rough sets based on evidence theory. Xu et al. [23] employed belief and plausibility functions to describe the attribute reductions of ordered information systems. Chen et al. [24] associated evidence theory with neighborhood-covering rough set, and discussed the connection between a pair of covering approximation operators and the belief and plausibility functions. They did not consider covering rough set with MGRSs. Zhang et al. [25] explored the attribute reductions of neighborhood-covering rough set in covering decision information systems. Tan et al. [26] employed evidence theory to discuss the numerical characterization of multigranulation rough sets in incomplete information systems, and developed an attribute reduction algorithm based on evidence theory. They did not consider the relation under general covering. Che et al. [27] used evidence theory to give a numerical characterization of multigranulation rough sets in a multi-source covering information system.
However, covering MGRS have rarely been considered in reduction theory based on evidence theory. This limits the application of rough set theory to data that are usually formalized as multiple coverings. To address this issue, this paper aims to measure the approximations of covering MGRS and to characterize the reductions of covering MGRS by the belief and plausibility functions. Based on these studies, the relationship between covering MGRS and evidence theory is established, and fusion methods are generated for uncertainty measurement in information systems.
In this paper, the relative reduction of neighborhood-covering pessimistic multigranulation rough set is investigated by using the belief and plausibility functions from evidence theory. First, the lower and upper approximations of multigranulation rough set in neighborhood-covering information systems are introduced. Second, the belief and plausibility functions from evidence theory are employed to characterize the approximations of neighborhood-covering multigranulation rough set. The relative reduction of neighborhood-covering information system is then investigated. Finally, an algorithm for computing a relative reduction of neighborhood-covering pessimistic multigranulation rough set is proposed according to the significance of coverings defined by the belief function, and its validity is examined by a practical example.

Preliminaries
In this section, we review some basic concepts related to covering rough sets, multigranulation rough sets and evidence theory. More details can be found in [9,11,19,20,28,29].

Covering Rough Set Based on Neighborhood
Definition 1. [9] Let U be a universe and C be a family of subsets of U. If no subset in C is empty and $\bigcup_{K \in C} K = U$, then C is called a covering of U. The ordered pair (U, C) is called a covering approximation space.
One can see that a partition of U is certainly a covering of U.
Definition 2. (Neighborhood). [9] Let U be a universe and C be a covering of U. For x ∈ U, $(x)_C = \bigcap \{K \in C \mid x \in K\}$ is called the neighborhood of x regarding C.
Definition 3. (Neighborhood-covering information system). [28] Let (U, C) be a covering approximation space and $N = \{(x)_C \mid x \in U\}$. We call (U, N) the neighborhood-covering information system induced by (U, C).
Definition 4. [9] Let (U, C) be a covering approximation space. A pair of approximation operators $(\underline{C}, \overline{C})$ is defined as: for X ⊆ U, $\underline{C}(X) = \{x \in U \mid (x)_C \subseteq X\}$ and $\overline{C}(X) = \{x \in U \mid (x)_C \cap X \neq \emptyset\}$.
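As an illustration of Definitions 2 and 4, the following minimal Python sketch computes neighborhoods and the neighborhood-based lower and upper approximations. The universe `U`, the covering `C`, and the function names are hypothetical and chosen only for this example; the approximation operators are assumed to be the standard neighborhood-based ones (neighborhood contained in X, respectively meeting X).

```python
from functools import reduce

def neighborhood(x, covering):
    """Intersection of all blocks of the covering that contain x."""
    blocks = [K for K in covering if x in K]
    return frozenset(reduce(lambda a, b: a & b, blocks))

def lower(X, universe, covering):
    """Objects whose neighborhood is contained in X."""
    return {x for x in universe if neighborhood(x, covering) <= X}

def upper(X, universe, covering):
    """Objects whose neighborhood meets X."""
    return {x for x in universe if neighborhood(x, covering) & X}

# Hypothetical data: a covering of U with overlapping blocks.
U = {1, 2, 3, 4}
C = [frozenset({1, 2}), frozenset({2, 3}), frozenset({3, 4})]
X = {2, 3}

print({x: set(neighborhood(x, C)) for x in sorted(U)})
# neighborhoods: 1 -> {1, 2}, 2 -> {2}, 3 -> {3}, 4 -> {3, 4}
print(lower(X, U, C))  # {2, 3}
print(upper(X, U, C))  # {1, 2, 3, 4}
```

Here the overlap of the blocks {1, 2} and {2, 3} shrinks the neighborhood of object 2 to {2}, which is why 2 belongs to the lower approximation of X.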

Multigranulation Approximations
In this subsection, we introduce the multigranulation approximations of MGRS described in [11].
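Definition 5. [11] Let (U, A) be an information system, where U is a universe and A is a set of attributes, and let $A_1, A_2, \ldots, A_m \subseteq A$ and X ⊆ U. The optimistic multigranulation lower and upper approximations of X regarding $A_1, A_2, \ldots, A_m$ are denoted by $\underline{\sum_{i=1}^{m} A_i}^{O}(X)$ and $\overline{\sum_{i=1}^{m} A_i}^{O}(X)$, respectively, where $\underline{\sum_{i=1}^{m} A_i}^{O}(X) = \{x \in U \mid [x]_{A_1} \subseteq X \vee \cdots \vee [x]_{A_m} \subseteq X\}$ and $\overline{\sum_{i=1}^{m} A_i}^{O}(X) = \{x \in U \mid [x]_{A_1} \cap X \neq \emptyset \wedge \cdots \wedge [x]_{A_m} \cap X \neq \emptyset\}$, with $[x]_{A_i}$ the equivalence class of x induced by $A_i$. It is easy to see that the optimistic multigranulation lower and upper approximations are dual, i.e., $\underline{\sum_{i=1}^{m} A_i}^{O}(X) = \sim \overline{\sum_{i=1}^{m} A_i}^{O}(\sim X)$, where ∼X is the complement of X.
Definition 6. [11] Let (U, A) be an information system, where U is a universe and A is a set of attributes, and let $A_1, A_2, \ldots, A_m \subseteq A$ and X ⊆ U. The pessimistic multigranulation lower and upper approximations of X regarding $A_1, A_2, \ldots, A_m$ are denoted by $\underline{\sum_{i=1}^{m} A_i}^{P}(X)$ and $\overline{\sum_{i=1}^{m} A_i}^{P}(X)$, respectively, where $\underline{\sum_{i=1}^{m} A_i}^{P}(X) = \{x \in U \mid [x]_{A_1} \subseteq X \wedge \cdots \wedge [x]_{A_m} \subseteq X\}$ and $\overline{\sum_{i=1}^{m} A_i}^{P}(X) = \{x \in U \mid [x]_{A_1} \cap X \neq \emptyset \vee \cdots \vee [x]_{A_m} \cap X \neq \emptyset\}$. It is easy to see that the pessimistic multigranulation lower and upper approximations are dual, i.e., $\underline{\sum_{i=1}^{m} A_i}^{P}(X) = \sim \overline{\sum_{i=1}^{m} A_i}^{P}(\sim X)$, where ∼X is the complement of X.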
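The difference between the optimistic ("some granulation suffices") and pessimistic ("every granulation is required") approximations can be sketched in Python as follows. Each attribute subset is represented directly by the partition it induces; the partitions `P1`, `P2`, the set `X`, and all function names are hypothetical, and the upper approximations are obtained through the duality with the lower ones.

```python
def eq_class(x, partition):
    """Equivalence class of x in a partition of the universe."""
    return next(block for block in partition if x in block)

def optimistic_lower(X, U, partitions):
    # x qualifies if at least one granulation places its class inside X
    return {x for x in U if any(eq_class(x, p) <= X for p in partitions)}

def pessimistic_lower(X, U, partitions):
    # x qualifies only if every granulation places its class inside X
    return {x for x in U if all(eq_class(x, p) <= X for p in partitions)}

def optimistic_upper(X, U, partitions):
    # dual of the optimistic lower approximation
    return U - optimistic_lower(U - X, U, partitions)

def pessimistic_upper(X, U, partitions):
    # dual of the pessimistic lower approximation
    return U - pessimistic_lower(U - X, U, partitions)

# Hypothetical data: two granulations (partitions) of U.
U = {1, 2, 3, 4}
P1 = [{1, 2}, {3, 4}]
P2 = [{1}, {2, 3}, {4}]
X = {1, 2, 3}

print(optimistic_lower(X, U, [P1, P2]))   # {1, 2, 3}
print(pessimistic_lower(X, U, [P1, P2]))  # {1, 2}
print(optimistic_upper(X, U, [P1, P2]))   # {1, 2, 3}
print(pessimistic_upper(X, U, [P1, P2]))  # {1, 2, 3, 4}
```

Object 3 shows the contrast: its class under `P2` lies inside X, so it is in the optimistic lower approximation, but its class under `P1` does not, so it is excluded from the pessimistic one.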

Evidence Theory
First, we review some basic concepts from Dempster-Shafer evidence theory. More details can be found in [19,20].
Definition 7. (Mass function). [19,20] Let U be a universe. A set function $m : 2^U \to [0, 1]$ ($2^U$ is the power set of U) is referred to as a basic probability assignment (mass function) if it satisfies the following conditions: (1) $m(\emptyset) = 0$; (2) $\sum_{X \subseteq U} m(X) = 1$.
The value m(X) denotes the degree of trust that the evidence places in X. If $m(X) > 0$, we call X a focal element of m. Let M be the union of all focal elements; M is called the core. The pair (M, m) is called a belief structure on U. The belief and plausibility functions can be derived based on the belief structure.
Definition 8. [19,20] Let U be a universe and $m : 2^U \to [0, 1]$ be a basic probability assignment. A set function $Bel : 2^U \to [0, 1]$ is referred to as a belief function on U if $Bel(X) = \sum_{Y \subseteq X} m(Y)$ for all X ⊆ U. A set function $Pl : 2^U \to [0, 1]$ is referred to as a plausibility function on U if $Pl(X) = \sum_{Y \cap X \neq \emptyset} m(Y)$ for all X ⊆ U. It is easy to see that the belief function and the plausibility function are dual, i.e., $Bel(X) = 1 - Pl(\sim X)$, where ∼X is the complement of X.
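The definitions of the belief and plausibility functions can be checked on a small case. The sketch below uses a hypothetical mass function on U = {1, 2, 3} (the focal elements and their masses are made up for illustration) and exact rational arithmetic, and verifies the duality Bel(X) = 1 − Pl(∼X) for every subset X.

```python
from fractions import Fraction
from itertools import combinations

def bel(X, mass):
    """Belief: total mass of focal elements contained in X."""
    return sum(v for Y, v in mass.items() if Y <= X)

def pl(X, mass):
    """Plausibility: total mass of focal elements that meet X."""
    return sum(v for Y, v in mass.items() if Y & X)

def subsets(U):
    items = sorted(U)
    for r in range(len(items) + 1):
        for combo in combinations(items, r):
            yield frozenset(combo)

# Hypothetical mass function: m(empty set) = 0 and the masses sum to 1.
U = frozenset({1, 2, 3})
mass = {
    frozenset({1}): Fraction(1, 2),
    frozenset({1, 2}): Fraction(1, 4),
    frozenset({2, 3}): Fraction(1, 4),
}

X = frozenset({1, 2})
print(bel(X, mass))  # 3/4
print(pl(X, mass))   # 1

# Duality: Bel(X) = 1 - Pl(complement of X) for every X.
duality_holds = all(bel(X, mass) == 1 - pl(U - X, mass) for X in subsets(U))
print(duality_holds)  # True
```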
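In addition, a belief function also satisfies the following: (1) $Bel(\emptyset) = 0$; (2) $Bel(U) = 1$; (3) for any $X_1, X_2, \ldots, X_n \subseteq U$, $Bel(\bigcup_{i=1}^{n} X_i) \geq \sum_{\emptyset \neq J \subseteq \{1, \ldots, n\}} (-1)^{|J|+1} Bel(\bigcap_{j \in J} X_j)$.
Next, we review some basic concepts from Smets evidence theory. More details can be found in [29].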

Definition 9. (mass function). [29]
The function $m : 2^U \to [0, 1]$ ($2^U$ is the power set of U) is called a basic belief assignment (bba) and the m values are called the basic belief masses (bbm), with $\sum_{X \subseteq U} m(X) = 1$.
Definition 10. [29] Based on the bbm, the functions bel(X) and pl(X) are defined for X ⊆ U by $bel(X) = \sum_{\emptyset \neq Y \subseteq X} m(Y)$ and $pl(X) = \sum_{Y \cap X \neq \emptyset} m(Y)$.
In this article, the mass function is used to measure the mass of a set. The sets to be measured are the elements of the power set of U, ordered from the smallest (the empty set) to the largest (the universal set), and they are assigned values ranging from zero to one accordingly. Therefore, Dempster's evidence theory is relatively suitable for the measurement of sets in this paper.

Neighborhood-Covering Multigranulation Rough Set
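Let the pair (U, C) denote a covering information system, where $U = \{x_1, x_2, \ldots, x_n\}$ is a nonempty, finite set of objects called the universe of discourse, and $C = \{C_1, C_2, \ldots, C_m\}$ is a family of coverings of U.
Definition 11. [10] Let (U, C) be a covering information system, where U is a universe and $C = \{C_1, C_2, \ldots, C_m\}$ is a family of coverings of U. For X ⊆ U, the neighborhood-covering optimistic multigranulation lower and upper approximations of X regarding $C_1, C_2, \ldots, C_m$ are denoted by $\underline{\sum_{i=1}^{m} C_i}^{O}(X)$ and $\overline{\sum_{i=1}^{m} C_i}^{O}(X)$, respectively, where $\underline{\sum_{i=1}^{m} C_i}^{O}(X) = \{x \in U \mid (x)_{C_1} \subseteq X \vee \cdots \vee (x)_{C_m} \subseteq X\}$ and $\overline{\sum_{i=1}^{m} C_i}^{O}(X) = \{x \in U \mid (x)_{C_1} \cap X \neq \emptyset \wedge \cdots \wedge (x)_{C_m} \cap X \neq \emptyset\}$. It is easy to see that the neighborhood-covering optimistic multigranulation lower and upper approximations are dual, i.e., $\underline{\sum_{i=1}^{m} C_i}^{O}(X) = \sim \overline{\sum_{i=1}^{m} C_i}^{O}(\sim X)$, where ∼X is the complement of X.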
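Definition 12. [10] Let (U, C) be a covering information system, where U is a universe and $C = \{C_1, C_2, \ldots, C_m\}$ is a family of coverings of U. For X ⊆ U, the neighborhood-covering pessimistic multigranulation lower and upper approximations of X regarding $C_1, C_2, \ldots, C_m$ are denoted by $\underline{\sum_{i=1}^{m} C_i}^{P}(X)$ and $\overline{\sum_{i=1}^{m} C_i}^{P}(X)$, respectively, where $\underline{\sum_{i=1}^{m} C_i}^{P}(X) = \{x \in U \mid (x)_{C_1} \subseteq X \wedge \cdots \wedge (x)_{C_m} \subseteq X\}$ and $\overline{\sum_{i=1}^{m} C_i}^{P}(X) = \{x \in U \mid (x)_{C_1} \cap X \neq \emptyset \vee \cdots \vee (x)_{C_m} \cap X \neq \emptyset\}$. It is easy to see that the neighborhood-covering pessimistic multigranulation lower and upper approximations are dual, i.e., $\underline{\sum_{i=1}^{m} C_i}^{P}(X) = \sim \overline{\sum_{i=1}^{m} C_i}^{P}(\sim X)$, where ∼X is the complement of X.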
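The neighborhood-covering optimistic and pessimistic approximations can be sketched in Python as below. The two coverings `C1` and `C2` and the set `X` are hypothetical and are not the data of any example in this paper; the operators are assumed to take the usual form in which an object enters the pessimistic lower approximation only if its neighborhood under every covering lies inside X.

```python
from functools import reduce

def neighborhood(x, covering):
    """Intersection of all blocks of the covering that contain x."""
    return reduce(lambda a, b: a & b, [K for K in covering if x in K])

def optimistic_lower(X, U, coverings):
    return {x for x in U if any(neighborhood(x, C) <= X for C in coverings)}

def optimistic_upper(X, U, coverings):
    return {x for x in U if all(neighborhood(x, C) & X for C in coverings)}

def pessimistic_lower(X, U, coverings):
    return {x for x in U if all(neighborhood(x, C) <= X for C in coverings)}

def pessimistic_upper(X, U, coverings):
    return {x for x in U if any(neighborhood(x, C) & X for C in coverings)}

# Hypothetical family of two coverings of U.
U = {1, 2, 3, 4}
C1 = [{1, 2}, {2, 3}, {3, 4}]
C2 = [{1}, {1, 2}, {2, 3, 4}]
family = [C1, C2]
X = {1, 2, 3}

print(optimistic_lower(X, U, family))   # {1, 2, 3}
print(pessimistic_lower(X, U, family))  # {1, 2}
print(optimistic_upper(X, U, family))   # {1, 2, 3, 4}
print(pessimistic_upper(X, U, family))  # {1, 2, 3, 4}

# Duality check for the pessimistic pair on this X.
print(pessimistic_lower(X, U, family) == U - pessimistic_upper(U - X, U, family))  # True
```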
Lemma 1. [10] Let (U, C) be a covering information system, where U is a universe and $C = \{C_1, C_2, \ldots, C_m\}$ is a family of coverings of U. The following properties hold for X, Y ⊆ U: (1)
Example 1. Let (U, C) be a covering information system, where $U = \{x_1, x_2, x_3, x_4\}$ is a universe and $C = \{C_1, C_2\}$ is a family of coverings of U. For X ⊆ Y, $X = \{x_1, x_2, x_3\}$, and According to Definition 2, we can calculate According to Definition 11, we can calculate the neighborhood-covering optimistic multigranulation lower and upper approximations of X regarding $C_1, C_2$ as the following: According to Definition 12, we can calculate the neighborhood-covering pessimistic multigranulation lower and upper approximations of X regarding $C_1, C_2$ as the following:

The Belief Structure of Neighborhood-Covering Multigranulation Rough Set
Next, we use the belief and plausibility functions to analyze the belief structure of the neighborhood-covering multigranulation rough set. Tan et al. [26] pointed out that only the pessimistic multigranulation rough sets have the belief structure. Therefore, we only discuss the belief structure of neighborhood-covering pessimistic multigranulation rough sets.
Let P be the average (uniform) probability distribution on U, i.e., $P(X) = |X| / |U|$ for X ⊆ U, where | • | denotes the cardinality of a set.
Chen et al. [24] stated that the neighborhood-covering single-granularity rough sets have the belief structure. If the covering C is a single-granularity covering in the model of neighborhood-covering multigranulation rough set, then the neighborhood-covering multigranulation rough sets reduce to neighborhood-covering single-granularity rough sets. This is a special case of the model of neighborhood-covering multigranulation rough set, where the belief and plausibility functions can be employed to characterize the belief structure.
Theorem 1. [24] Let (U, C) be a covering information system. If the covering C is a single-granularity covering, then there is a belief structure such that for any X ⊆ U, $Bel(X) = P(\underline{C}(X))$ and $Pl(X) = P(\overline{C}(X))$. Then Bel(X) is a belief function on U, and Pl(X) is a plausibility function on U.
Corollary 1. [24] Let (U, C) be a covering information system. If the covering C is a single-granularity covering, then there is a belief structure such that for any X ⊆ U, $Bel(X) = |\underline{C}(X)| / |U|$ and $Pl(X) = |\overline{C}(X)| / |U|$. Then Bel(X) is a belief function on U, and Pl(X) is a plausibility function on U.
However, whether the belief structure exists in the general case of neighborhood-covering multigranulation rough set needs to be further discussed. First, we use the union of sets and transform the neighborhood-covering pessimistic multigranulation rough set to the neighborhood-covering single-granulation rough set. Then we use the relationship partition function to establish the relationship between neighborhood-covering and partition, and transform the neighborhood-covering single-granulation rough set into the single-granulation classic rough set. Finally, we obtain the relationship between the evidence theory and neighborhood-covering multigranulation rough set.
We use the following definition to transform the neighborhood-covering pessimistic multigranulation rough set to the neighborhood-covering single-granulation rough set.
Definition 13. Let (U, C) be a covering information system and $C = \{C_1, C_2, \ldots, C_m\}$ be a family of coverings of U. For x ∈ U, let $\bigcup(x) = \bigcup_{i=1}^{m} (x)_{C_i}$; then $\{\bigcup(x) \mid x \in U\}$ denotes a covering based on the covering family C w.r.t. the neighborhood of x.
The covering in Definition 13 is a single-granulation covering of U; therefore, the pessimistic multigranulation rough set is transformed into a single-granulation rough set. Next, we define the relationship partition function, establish the relationship between covering and partition, and transform the covering rough set into the classic rough set.
Theorem 2. Let U be a universe and C be a covering of U. For x ∈ U, we define the relationship partition function
Proof of Theorem 2. First, we prove that $f(X) \cap f(X') = \emptyset$ for all X, X' ⊆ U with X ≠ X'.
Therefore, f (X) is a partition of U.
Because of Theorem 2, we transform the covering rough set into the classic rough set. Yao et al. [21] showed that the belief and plausibility functions can be derived from the lower and upper approximation operators in rough set theory. So, the following theorem holds.
Theorem 3. Let (U, C) be a covering information system and $C = \{C_1, C_2, \ldots, C_m\}$ be a family of coverings of U. For any X ⊆ U, x ∈ U, a probability assignment function is $m : 2^U \to [0, 1]$, and its definition is as follows: then the belief and plausibility functions on U are $Bel(X) = P(\underline{\sum_{i=1}^{m} C_i}^{P}(X))$ and $Pl(X) = P(\overline{\sum_{i=1}^{m} C_i}^{P}(X))$.
Proof of Theorem 3. By Theorem 2, we have The following proves $Bel(X) = P(\underline{\sum_{i=1}^{m} C_i}^{P}(X))$. We have: The proof of $Pl(X) = P(\overline{\sum_{i=1}^{m} C_i}^{P}(X))$ is similar. Thus, we can assert this conclusion.
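The statement of Theorem 3 can be checked numerically. The sketch below rests on an assumption that is not spelled out in the text above: the mass of a set Y is taken to be the fraction of objects whose union of neighborhoods over all coverings equals Y. With this hypothetical mass function and hypothetical coverings, the code verifies for every X ⊆ U that Bel(X) and Pl(X) agree with the probabilities of the pessimistic lower and upper approximations.

```python
from fractions import Fraction
from functools import reduce
from itertools import combinations

def neighborhood(x, covering):
    return reduce(lambda a, b: a & b, [K for K in covering if x in K])

def union_neighborhood(x, coverings):
    """Union of the neighborhoods of x over all coverings (assumed construction)."""
    return frozenset().union(*(neighborhood(x, C) for C in coverings))

def mass_function(U, coverings):
    """Assumed mass: m(Y) = |{x : union_neighborhood(x) = Y}| / |U|."""
    m = {}
    for x in U:
        Y = union_neighborhood(x, coverings)
        m[Y] = m.get(Y, 0) + Fraction(1, len(U))
    return m

def bel(X, m):
    return sum(v for Y, v in m.items() if Y <= X)

def pl(X, m):
    return sum(v for Y, v in m.items() if Y & X)

def pessimistic_lower(X, U, coverings):
    return {x for x in U if all(neighborhood(x, C) <= X for C in coverings)}

def pessimistic_upper(X, U, coverings):
    return {x for x in U if any(neighborhood(x, C) & X for C in coverings)}

def subsets(U):
    items = sorted(U)
    for r in range(len(items) + 1):
        for combo in combinations(items, r):
            yield set(combo)

# Hypothetical family of two coverings of U.
U = {1, 2, 3, 4}
family = [[{1, 2}, {2, 3}, {3, 4}], [{1}, {1, 2}, {2, 3, 4}]]
m = mass_function(U, family)

theorem_3_holds = all(
    bel(X, m) == Fraction(len(pessimistic_lower(X, U, family)), len(U))
    and pl(X, m) == Fraction(len(pessimistic_upper(X, U, family)), len(U))
    for X in subsets(U)
)
print(theorem_3_holds)  # True
```

The check works because an object lies in the pessimistic lower approximation of X exactly when the union of its neighborhoods is contained in X, which is the single-granulation view described before Definition 13.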
Next, we will give a counterexample to illustrate that neighborhood-covering optimistic multigranulation rough set approximation cannot be characterized by evidence theory.
Example 2. Let (U, C) be a covering information system, U = {x 1 , x 2 , x 3 , We can calculate: As we know, a belief function satisfies the following: Thus, neighborhood-covering optimistic multigranulation rough set approximation cannot be characterized by belief and plausibility functions.
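The failure of the optimistic model can also be reproduced on a small hypothetical instance, which is not the data of Example 2. The sketch uses the property Bel(X ∪ Y) ≥ Bel(X) + Bel(Y) − Bel(X ∩ Y), which every belief function satisfies, and shows that the set function Q(X) = P(optimistic lower approximation of X) violates it for the made-up coverings below.

```python
from fractions import Fraction
from functools import reduce

def neighborhood(x, covering):
    return reduce(lambda a, b: a & b, [K for K in covering if x in K])

def optimistic_lower(X, U, coverings):
    return {x for x in U if any(neighborhood(x, C) <= X for C in coverings)}

def q(X, U, coverings):
    """Candidate 'belief' value: probability of the optimistic lower approximation."""
    return Fraction(len(optimistic_lower(X, U, coverings)), len(U))

# Hypothetical data: two coverings (here partitions) of a three-element universe.
U = {1, 2, 3}
family = [[{1, 2}, {3}], [{1, 3}, {2}]]
X, Y = {1, 2}, {1, 3}

lhs = q(X | Y, U, family)
rhs = q(X, U, family) + q(Y, U, family) - q(X & Y, U, family)
print(lhs, rhs)    # 1 4/3
print(lhs >= rhs)  # False: the belief-function inequality fails
```

Object 1 is in the optimistic lower approximation of both X and Y, but through different coverings, and it is not in the lower approximation of X ∩ Y = {1}; this is what breaks the inequality.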

Relative Reduction of Neighborhood-Covering Pessimistic Multigranulation Rough Set
The relative reduction of neighborhood-covering pessimistic multigranularity rough set is discussed below.First, we give the definition of relative reduction of neighborhood-covering pessimistic multigranulation rough set.
A covering decision information system is a triple (U, C, D), where $U = \{x_1, x_2, \ldots, x_n\}$ is a nonempty, finite set of objects called the universe of discourse, $C = \{C_1, C_2, \ldots, C_m\}$ is a family of coverings of U and $D = \{D_1, D_2, \ldots, D_l\}$ is a decision partition of U.
Definition 14. [11] Let (U, C, D) be a covering decision information system, $C = \{C_1, C_2, \ldots, C_m\}$ be a family of coverings of U, and $D = \{D_1, D_2, \ldots, D_l\}$ be a decision partition of U. We have the following definition.
Next, let (U, C, D) be a covering decision information system, $C = \{C_1, C_2, \ldots, C_m\}$ be a family of coverings of U and $D = \{D_1, D_2, \ldots, D_l\}$ be a decision partition of U.
In Algorithm 1, computing the neighborhoods of all the objects can be done in $O(|U|^2 |C|)$, and the time complexity of computing $\underline{C}^{P}(d)$ is $O(|U||C||D|)$. Since |D| < |U|, the time complexity of the first step is $O(|U|^2 |C|)$. In Steps 2 and 3, the time complexity is $O(|U||C|^2 |D|)$. In sum, the total time complexity of Algorithm 1 does not exceed $O(|U|^2 |C|^2)$. Next, we give an example to calculate the relative reduction of the pessimistic multigranularity covering lower approximation.

Algorithm 1 Relative reduction algorithm of neighborhood-covering pessimistic multigranularity lower approximation
Input: a covering decision information system (U, C, D).
Output: relative reduction set B of neighborhood-covering pessimistic multigranularity lower approximation.
Example 3. Consider a house evaluation problem. Let $U = \{x_1, x_2, \ldots, x_6\}$ be a set of six houses, A = {equally shared area, color, price, surroundings} be a set of attributes, and B = {purchase opinions} be a set of decisions. The values of equally shared area could be {large, ordinary, small}. The values of color could be {excellent, good, bad}. The values of price could be {high, middle, low}. The values of surroundings could be {quiet, noisy, very noisy}. The decision values of purchase opinions could be {support, oppose}, which are randomly chosen from experts. The evaluation results are shown in Table 1.
Through the definition of the core, we can get the relative reduction algorithm of neighborhood-covering pessimistic multigranularity lower approximation.
The mechanism of Algorithm 2 can be described as follows. In Step 2, computing the significance of all coverings can be done in O(|U|
Example 4. We use Algorithm 2 to carry out the relative reduction of the decision system in Example 3.
First, let B = ∅. For the second step, } are all relative reduction sets of neighborhood-covering pessimistic multigranularity lower approximation w.r.t. C.
Algorithm 1 in this paper is the original algorithm for the relative reduction of neighborhood-covering pessimistic multigranulation rough set. Its time complexity is relatively low, but it has to compare many approximation sets directly, which is cumbersome, and only one reduction can be obtained. Algorithm 2 is proposed by combining the neighborhood-covering pessimistic multigranulation rough set with evidence theory. It employs the belief function from evidence theory to measure the quality of the lower approximation of the model. Because the approximations are summarized as numerical values, the comparison is more concise, and all the reductions can be obtained.
The relative reduction in this paper is the reduction that keeps the upper and lower approximations unchanged, and the belief and plausibility functions are used to calculate the mass of the upper and lower approximations. Keeping the upper and lower approximations unchanged is equivalent to keeping their masses unchanged. This algorithm can therefore be applied in the neighborhood-covering pessimistic multigranulation rough set model to compute relative reductions that keep the upper and lower approximations unchanged.
The algorithm in this paper is built on the belief and plausibility functions from evidence theory. Since the neighborhood-covering optimistic multigranulation rough set approximation cannot be characterized by belief and plausibility functions, the proposed algorithm is not applicable to the neighborhood-covering optimistic multigranulation rough set, and is only suitable for computing the relative reduction of neighborhood-covering pessimistic multigranularity rough set.

Conclusions
In this paper, the relative reduction of neighborhood-covering multigranulation rough set is explored by using evidence theory. We introduce the lower and upper approximations of multigranulation rough set in neighborhood-covering information systems based on the concept of neighborhood of objects. The approximations of neighborhood-covering multigranulation rough set are characterized by the belief and plausibility functions from evidence theory. Moreover, according to the significance of coverings defined by the belief function, an algorithm for computing a relative reduction of neighborhood-covering information systems is proposed, and its validity is examined by a practical example. This paper not only enriches the relative reduction theory of multigranulation rough set, but also provides a new idea for relative reduction of data sets based on the covering decision information system. In the future, the relative reduction theory of covering decision information systems under other covering multigranulation rough set approximation operators will be further considered, and the reduction theory and results under different covering multigranulation rough set approximation operators will be compared.

3: Remove a covering from B again to get B'. If $\underline{B'}^{P}(d) \neq \underline{C}^{P}(d)$, return B; else, go to Step 2;
4: Repeat Steps 2 and 3 for each covering in C to get all the relative reductions of the covering family.

Finally, let $B = \{C_3, C_4\}$. By removing any covering from B, we get B', and $\underline{B'}^{P}(d) \neq \underline{B}^{P}(d)$. Therefore B is a d reduction of neighborhood-covering pessimistic multigranularity lower approximation w.r.t. C.
Theorem 4. Let (U, C, D) be a covering decision information system and $D = \{D_1, D_2, \ldots, D_l\}$ be a decision partition of U. Let $\sum_{j=1}^{l} Bel_C(D_j) = M$; then B ⊆ C is a neighborhood-covering pessimistic multigranularity lower approximation relative reduction of C iff $\sum_{j=1}^{l} Bel_B(D_j) = M$, and for any subset B' ⊂ B, $\sum_{j=1}^{l} Bel_{B'}(D_j) > M$.
Proof of Theorem 4. Sufficiency. If B ⊆ C is a relative reduction of neighborhood-covering pessimistic multigranularity lower approximation w.r.t. C, then for all j ∈ {1, 2, ..., l} we have $\underline{\sum_{i=1}^{m} C_i}^{P}(D_j) = \underline{\sum_{i=1}^{m} B_i}^{P}(D_j)$. By Definition 14, we can see that for all j ∈ {1, 2, ..., l}, $Bel_C(D_j) = Bel_B(D_j)$; then $\sum_{j=1}^{l} Bel_B(D_j) = M$, and for any B' ⊂ B, $\sum_{j=1}^{l} Bel_{B'}(D_j) > M$.
Necessity. Since B ⊆ C, we have $Bel_C(D_j) \leq Bel_B(D_j)$ for all j ∈ {1, 2, ..., l}. Since $\sum_{j=1}^{l} Bel_B(D_j) = M$, and for any B' ⊂ B, $\sum_{j=1}^{l} Bel_{B'}(D_j) > M$, it follows that for all j ∈ {1, 2, ..., l}, $Bel_C(D_j) = Bel_B(D_j)$, and $Bel_C(D_j) \neq Bel_{B'}(D_j)$ for some j. By Definition 14, we have for all j ∈ {1, 2, ..., l}, $\underline{\sum_{i=1}^{m} C_i}^{P}(D_j) = \underline{\sum_{i=1}^{m} B_i}^{P}(D_j)$, so $\underline{C}^{P}(d) = \underline{B}^{P}(d)$, and for any B' ⊂ B, $\underline{C}^{P}(d) \neq \underline{B'}^{P}(d)$. Therefore, B ⊆ C is a reduction of neighborhood-covering pessimistic multigranularity lower approximation w.r.t. C.
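The criterion of Theorem 4 suggests a direct, brute-force way to enumerate reductions, sketched below in Python. This is an illustrative enumeration and not Algorithm 1 or Algorithm 2 of this paper. The coverings, the decision partition, and the function names are hypothetical; Bel is assumed to be the probability of the pessimistic lower approximation; only nonempty subfamilies are considered; and minimality is tested by removing one covering at a time, which is enough because removing coverings can only enlarge the pessimistic lower approximation.

```python
from fractions import Fraction
from functools import reduce
from itertools import combinations

def neighborhood(x, covering):
    return reduce(lambda a, b: a & b, [K for K in covering if x in K])

def pessimistic_lower(X, U, coverings):
    return {x for x in U if all(neighborhood(x, C) <= X for C in coverings)}

def bel_sum(U, coverings, decision):
    """Sum over decision classes of Bel(D_j) = |pessimistic lower(D_j)| / |U|."""
    return sum(Fraction(len(pessimistic_lower(D, U, coverings)), len(U))
               for D in decision)

def reductions(U, family, decision):
    """Index tuples B with bel_sum(B) = M and bel_sum > M after any single removal."""
    M = bel_sum(U, family, decision)
    found = []
    for r in range(1, len(family) + 1):
        for B in combinations(range(len(family)), r):
            if bel_sum(U, [family[i] for i in B], decision) != M:
                continue
            smaller = [tuple(i for i in B if i != j) for j in B]
            if all(not S or bel_sum(U, [family[i] for i in S], decision) > M
                   for S in smaller):
                found.append(B)
    return found

# Hypothetical covering decision information system.
U = {1, 2, 3, 4}
C1 = [{1, 2}, {3, 4}]
C2 = [{1}, {2, 3}, {3}, {4}]
C3 = [{1}, {2}, {2, 3}, {4}]
family = [C1, C2, C3]
decision = [{1, 2}, {3, 4}]

print(bel_sum(U, family, decision))     # 1/2
print(reductions(U, family, decision))  # [(1, 2)], i.e. {C2, C3}
```

In this made-up system C1 is redundant: dropping it leaves the sum of beliefs at 1/2, while dropping either C2 or C3 from {C2, C3} raises it to 3/4.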
Definition 15. Let (U, C, D) be a covering information system and $D = \{D_1, D_2, \ldots, D_l\}$ be a decision partition of U. For B ⊆ C, $C_i \in B$, the significance of $C_i$ w.r.t. B is defined as: Sig
It is easy to see that the neighborhood-covering optimistic multigranulation lower and upper approximations are dual, i.e., $\underline{\sum_{i=1}^{m} C_i}^{O}(X) = \sim \overline{\sum_{i=1}^{m} C_i}^{O}(\sim X)$.
(1) If B ⊆ C and $\underline{B}^{P}(d) = \underline{C}^{P}(d)$, but $\underline{B'}^{P}(d) \neq \underline{C}^{P}(d)$ for any B' ⊂ B, then B is a d reduction of neighborhood-covering pessimistic multigranularity lower approximation w.r.t. C;
(2) If B ⊆ C and $\overline{B}^{P}(d) = \overline{C}^{P}(d)$, but $\overline{B'}^{P}(d) \neq \overline{C}^{P}(d)$ for any B' ⊂ B, then B is a d reduction of neighborhood-covering pessimistic multigranularity upper approximation w.r.t. C;