Dynamic Horizontal Union Algorithm for Multiple Interval Concept Lattices

In the era of big data, the data is updating in real-time. How to prepare the data accurately and efficiently is the key to mining association rules. In view of the above questions, this paper proposes a dynamic horizontal union algorithm of multiple interval concept lattices under the same background of the different attribute set and object set. First, in order to ensure the integrity of the lattice structure, the interval concept lattice incremental generation algorithm was improved, and then interval concept was divided into existing concept, redundancy concept and empty concept. Secondly, combining the characteristics of the interval concept lattice, the concept of consistency of interval concept lattice was defined and it is necessary and sufficient for the horizontal union of the lattice structure. Further, the interval concepts united were discussed, and the principle of horizontal unions was given. Finally, the sequence was scanned by the traversal method. This method increased the efficiency of horizontal union. A case study shows the feasibility and efficiency of the proposed algorithm.


Introduction
With the era of big data, the complexity of data processing in time and space have become increasingly demanding.Real-time updating of data requires efficient processing of dynamic data.The concept lattice is a powerful tool for data analysis which was proposed by Professor Wille R in 1982 [1].It has completeness and accuracy, and has been widely applied in information retrieval, digital library, knowledge discovery, and so on [2][3][4].Domestic and overseas scholars have carried out various research on concept lattices that mainly include a construction algorithm and improvement [5][6][7], rule mining based on concept lattices [8,9], and the fusion of other theories such as fuzzy theory, predicate logic, and rough set theory.
For different needs, some expanded concept lattices have been produced, such as, the fuzzy concept lattice, weighted concept lattice, constraint concept lattice, quantitative concept lattice, expansion concept lattice, rough concept lattice etc. [10][11][12].In particular, the classic concept lattice, fuzzy concept lattice and weighted concept lattices, the extent contains the objects which meet all the attributes in the intent.To find the concepts which have partial attributes, we must scan the concept lattice and combine the concepts.The time cost is large especially for large concept lattices.While, in the rough concept lattice, although the concepts which have partial attributes can be searched, there may be a lot of objects which only have an attribute of the intent, thus the support and degree of confidence of constructing association rules will be greatly reduced.In practical applications, we often care about the object set which has a certain number or percent of attributes in intent.Then, some pertinent association rules will be mined through a correlation analysis.
Based on the above questions, the interval concept lattice [13] was put forward in 2012 as a collection of objects which had a certain number or percentage of attributes in the connotation.Its expression of concept is (M α , M β , Y). Classical concept lattices are constructed from all attributes with full connotations of extensions, and its expression of concept is (X, Y).Rough concept lattices are composed of the concepts of a maximum attribute set and minimum attribute set containing connotative attributes respectively by upper approximate extension and lower approximate extension.The conceptual form of rough concept lattices is (M, N, Y).Interval concept lattices degenerate into classical concept lattices or rough concept lattices when the parameters are α = 0, β = 1 or α = 1/|Y|, β = 1 , separately.As will be readily seen, interval concept lattice is a general form of classical concept lattice and rough concept lattice.
On the other hand, with the dynamic change of data, the structure of an interval concept lattice will also change.The association rules are also updated in real time.For example, every day a huge amount of transaction information is generated.If we build the interval concept lattice from the daily trading information, we can only tap the local association rules [14,15].It cannot provide timely and accurate decision-making for the decision-makers from the overall supermarket shopping system.Therefore, it is very necessary to carry out research on the uniting of interval concept lattices to realize data aggregation.Therefore, the dynamic uniting of interval concept lattices is of great significance.
At present, the main union algorithms of concept lattices are as follows: Reference [16] arranges sub-concept lattices vertically or horizontally in ascending or descending order according to the connotation or extension of the concept; Reference [17] proposes synonymous concepts and updates all father-child nodes according to the relationship between father-child concepts; Reference [18] gives an ordered outline by discussing the relationship between the same concept lattices with the same object set.Since the interval concept lattice has only been proposed for about two years, research on it is limited to the progressive generation algorithm of lattice structure, dynamic compression algorithm and association rules mining algorithm.There is no relevant literature on the uniting of interval concept lattices.The uniting of interval concept lattices is divided into vertical and horizontal uniting.The principle and algorithm of vertical uniting of lattices [19] are studied preliminarily.In this paper, the algorithm for dynamic horizontal uniting of interval concept lattices generated from multiple databases was studied.
The structure is as follows: Section 2 introduces the basic concepts of interval concept lattices, the incremental generation algorithm of interval concept lattices [20] and the related concepts of formal context uniting [21,22]; Section 3 proposes the basic theorem of interval concept lattices' horizontal uniting.On the basis of improving the incremental generation algorithm of interval concept lattices, the horizontal uniting algorithm of interval concept lattices was designed.In Section 4, an example is given to demonstrate the feasibility and efficiency of the algorithm.

Theoretical and Methodological Basis
For example, in the supermarket shopping system, the promotional manager often pays more attention to the customers who purchase k (k > 1) kinds of goods or more and the potential demand of these customers, and then carry out product marketing to get the greatest benefit through minimum promotion.However, in the existing concept lattice structure, this kind of query cannot be operated directly, and some union connections or filtrations must be performed.The time and space costs are too high.In order to address this problem, interval concept lattices are required.

Interval Concept Lattice
Definition 1.For the formal context (U, A, R) and its rough concept lattice RL(U, A, R), (M, N, Y) is the rough concept.Set an interval [α, β] (0 ≤ α ≤ β ≤ 1), then α upper bound extension M α and β lower bound extent M β are: Y is the concept intension and |Y| is the number of elements in Y, that is base number, M α refers to the objects which may be covered by α × |Y| attributes or more in Y. M β refers to the objects which may be covered by β × |Y| attributes or more in Y. Definition 2. Suppose (U, A, R) is a formal context and (M α , M β , Y) is an interval concept.Y is the intent.M α is the α upper bound extension and M β is the β lower bound extension.
Then "≤" is called the partial order relationship of L , then the layers of the Lattice Structure is |A| + 1 and node C is at Layer |Y|.In particular, when Y = ∅, C was recorded on the zeroth layer.

Incremental Construction Algorithm of Interval Concept Lattice (ICAICL)
Thought of algorithm [11] is as follows.
(1) Calculate the attribute power set P(A) from formal context and let each element Y of power set be intent.Construct the initial node-set G according to the intension cardinal number in ascending order.For clarity, suppose each concept is a six-point group: where M α , M β is null set, and Parent = Children = "NULL".
(2) Set parameters α, β and the node of (3) Firstly, construct the root and end node, then insert other nodes as new ones into the lattice incrementally to form the lattice structure.After inserting a new one, there are three kinds of nodes in the L β α : New node (the node inserted), invariant and update node.[12,13].Otherwise, K 1 and K 2 are inconsistent.

Basic Principles
The concepts in L β α (U, A, R) can be divided into two categories.The first is that the number of conceptual connotative attributes in the original structure equals the number of layers of the concepts, and the second is that the sum of the number of conceptual connotative attributes in the two original structures equals the number of layers of the concepts.The specific combination of the two concepts is as follows: is the interval concept after vertical union.
Theorem 1 shows that for the first kind of concepts, the extension and intension of upper and lower bounds of the concepts themselves remain unchanged before and after uniting.But the flags of conceptual types are different.When the conceptual labels flag equals 2 and flag equals 3 in the lattice structure, they remain unchanged after uniting; when the conceptual label flag equals 1 in the lattice structure, flags may equal 1 or 2 after uniting.
Proof.Since the formal context corresponding to the primitive structure is contained in the formal context corresponding to the united lattice structure, the ternary ordered pairs of concepts in the primitive structure are completely preserved in the united lattice structure.At the same time, there is no change before and after the uniting of empty concepts and redundant concepts.For the existence concepts, the new concepts generated by uniting may make themselves redundant.
For the second kind of concept, taking the upper boundary extension as an example, the case of horizontal union can be divided into the following two types: See Theorems 2 and 3.
2 − M α are empty, if not empty, then bring them into and verify whether ≥ α, add objects to M α , or not.Symbol x denotes the smallest integer greater than or equal to x.
Proof.Determined by the boundary extension itself ≥ α, the object satisfying this formula contains at least B attributes, and may still contain objects larger than or equal to (|Y 1 ∪ Y 2 |α), so it needs further verification and cannot be eliminated directly.
M α are empty, if not empty, then bring them into and verify whether Here C m |A 1 | refers to all interval concepts whose number of intension attributes is m in ) and all interval concepts whose number of intension attributes is 1 in

Algorithmic Design
In order to generate interval concept lattices while retaining all interval concepts, including existing concepts, redundant concepts and empty concepts, and effectively improve the uniting efficiency of interval concept lattices, the existing incremental interval concept lattice generation algorithm needs to be modified first, and on this basis, a dynamic horizontal uniting algorithm of multiple interval concept lattices is proposed.

Improved Progressive Generation Algorithms for Interval Concept Lattices
In order to distinguish between different interval concepts, concept nodes are defined and stored in structured form as follows: (1) Calculate power set of attribute set P(A) to determine the intent of concept, generate concept node of initialization G.
(2) Determine upper bound extent M α i and lower bound extent M β i , make flag of empty concept be 3 and the other be 1.
(3) According to the partial order relationship, determine the level of the node and the parent-child relationship, make flag of redundant concept be 2.
The method of finding out redundant concept is Romove-redun (Ch,Gi).
Remove-redun(Ch,Gi) //find out redundant concept, sign and store, and delete it { for each children Ch in Gi // Ch pointer point to every children of Gi Delete Gi from L β α }}} (4) To find concept of no = 1, structure the root node.Then insert other nodes into the lattice according to the parent-child relationship successively.Eventually form the structure of interval concept lattice.

Dynamic Lateral Uniting Algorithms for Multi-Interval Concept Lattices
The basic idea of the algorithm is that whenever a new data set is generated, it will be transformed into a formal representation of the interval concept lattice and united with the united lattice structure.In this way, the information of the interval concept lattice can be aggregated again and again, which lays a foundation for further mining association rules. 1 Let A ∧ i = A i ∩ A, delete the concept of attributes in the newly generated interval concept lattice i , and mark its lattice structure as  Horizontal union-M Layer (L

and corresponding attributes set as
The sum of the number of attributes in L The Horizontal union-M Layer algorithm calls three sub-functions, Remove-redun(Ch,Gi), union1- The algorithm starts from the interval concept lattice directly, makes full use of the lattice structure of the original interval concept lattice and covers the uniting of all concepts.Therefore, it has completeness and effectiveness.Compared with the method of uniting formal context first and then using ICAICL algorithm to construct the interval concept lattice, this algorithm reduces time complexity and has a value of O(n * n i * (n + n i )), which proves the efficiency of the algorithm.

Example Analysis
Two formal contexts are listed in Tables 1 and 2. U = {1, 2, 3, 4, 5} is the element set.A 1 = {a, b, c} and A 2 = {c, d, e, f } are attribute sets before and after the change, respectively.Their corresponding lattice structures and union are described as follows.
(1) Set α = 0.6, β = 0.7, applying the improved interval concept lattice progressive generation algorithm to generate the primitive lattice structure is shown in Figure 1a    Meanwhile, {1345} − {1} = {345} = ∅, bring the elements of {345} into | f (x) ∩ acd|/|acd| ≥ 0.6, determining whether the proportion of intension attributes corresponding to deleted objects satisfies the relationship further, and then add the object into M α * if the relationship is satisfied, otherwise it will be eliminated completely.By analogy, the upper and lower boundary extensions are used to determine the uniting case, and the deletion concept is further verified.Finally, the type of the deletion concept is determined according to the upper and lower boundary extensions after the concept is generated.

Conclusions
When facing real-time data updating in the era of big data, how to deal with data effectively which are generated at any time has become one of the key issues.In this paper, the concept of consistency is introduced as a prerequisite for the uniting of interval concept lattices.Lateral uniting of interval concept lattices can be carried out when the parameter intervals of two interval concept

Conclusions
When facing real-time data updating in the era of big data, how to deal with data effectively which are generated at any time has become one of the key issues.In this paper, the concept of consistency is introduced as a prerequisite for the uniting of interval concept lattices.Lateral uniting of interval concept lattices can be carried out when the parameter intervals of two interval concept lattices are identical and the object sets are identical.In order to preserve the integrity of the lattice structure, the concepts in lattice structures are divided into three categories: Existential concept, redundant concept and empty concept by improving the progressive generation algorithm of interval concept lattices.The concept of L(U, A, R) is introduced at the same time.Lateral uniting of lattice structures is specified to the horizontal uniting of lattice nodes.It can be divided into two situations, that is, the number of connotative attributes of concepts in the original structure equals the number of conceptual layers and the sum of the number of connotative attributes of concepts in the two original structures equals the number of conceptual layers, and the two cases are further refined according to the actual situation, so as to realize the horizontal union of interval concept lattices.
However, in the face of large-scale data, the structure of the interval concept lattice will be greatly expanded, which will lead to high operational complexity.How to develop efficient dynamic uniting software of interval concept lattice, further reduce the complexity of time and space, realize the optimal merging of interval concept lattices, and proposing association rules from them will be the next major research work.

Definition 5 .
Suppose (U, A, R) has an interval concept, C = (M α , M β , Y).If C is neither the redundant concept nor the empty concept, then C is called the existence concept.L β α (U, A, R) is a collection of all the existence concepts.Definition 6. L β α (U, A, R) refers to all the [α, β] interval concepts, which includes: existence concepts, redundant concepts and empty concepts, that is: parent, children; int flag; } The concept of category is marked by flag.When flag = 1, flag = 2 and flag = 3, stored concept is exist concept, redundant concept and empty concept separately.Algorithm 1. Improved ICAICL Input: formal context (U, A, R) Output: interval concept lattice L β α and L β α

Algorithm 2 . 2 )
DHM (Dynamic Horizontal union) The newly generated interval concept lattices L β α i and L β α i (i = 2, 3 . . .n) are united with L β α and L β α respectively.The united results are assigned to L β α and L β α .The uniting steps of the two interval concept lattices are as follows: compute the intension of the concept determined by attribute set P(A * ), and generate concept node set G * of the initialized concept L β * α .According to node set G * , the hierarchy and parent-child relationship of the node are determined according to partial order relation.Let flag = 0, and the upper and lower bounds are empty.

3
Scanning the interval concepts in L β α and L β α i * by sequence and generating the concepts of L β * α .There, Horizontal union-M Layer is used for horizontal uniting of layer M.

α
respectively.The function of Remove-redun(Ch,Gi) is to find redundant concepts of united interval concept lattices and mark them.The function of union1-C * i C(C * i , C, C * ) and union2-C * i C(C * i , C, C * ) are to calculate and test the deleted objects for uniting cases, and to get the concept of interval after uniting.separately.Let i = i + 1, go to step 2.

1 *
,b. (2) A = {a, b, c, d, e, f }.Flag = 0.The upper and lower boundaries are empty.Initialization of Interval Concept Constitution Generated by Parent-Child Relation is L β α .(3) A * = A 1 ∩ A 2 = ∅, A * = c.Deleting all the concept nodes in L β α 1 which contain the attribute c.Here, Lattice structure is L β α and its attributes set is A * 1 = A 1 − A * .sequence, and the interval concept is generated in different cases.L β α can be obtained from L β α .The interval concept lattice after horizontal union is shown in Figure 2.

• Case 3 : 1 *
Uniting the nodes which have two intension attributes in L β α and the nodes which have one intension attributes in L β α 2. The interval concepts obtained from the above three cases together constitute the third layer of L β α .