Neutrosophic Association Rule Mining Algorithm for Big Data Analysis

: Big Data is a large-sized and complex dataset, which cannot be managed using traditional data processing tools. Mining process of big data is the ability to extract valuable information from these large datasets. Association rule mining is a type of data mining process, which is indented to determine interesting associations between items and to establish a set of association rules whose support is greater than a speciﬁc threshold. The classical association rules can only be extracted from binary data where an item exists in a transaction, but it fails to deal effectively with quantitative attributes, through decreasing the quality of generated association rules due to sharp boundary problems. In order to overcome the drawbacks of classical association rule mining, we propose in this research a new neutrosophic association rule algorithm. The algorithm uses a new approach for generating association rules by dealing with membership, indeterminacy, and non-membership functions of items, conducting to an efﬁcient decision-making system by considering all vague association rules. To prove the validity of the method, we compare the fuzzy mining and the neutrosophic mining. The results show that the proposed approach increases the number of generated association rules.


Introduction
The term 'Big Data' originated from the massive amount of data produced every day.Each day, Google receives cca. 1 billion queries, Facebook registers more than 800 million updates, and YouTube counts up to 4 billion views, and the produced data grows with 40% every year.Other sources of data are mobile devices and big companies.The produced data may be structured, semi-structured, or unstructured.Most of the big data types are unstructured; only 20% of data consists in structured data.There are four dimensions of big data: (1) Volume: big data is measured by petabytes and zettabytes.
(2) Velocity: the accelerating speed of data flow.
(3) Variety: the various sources and types of data requiring analysis and management.(4) Veracity: noise, abnormality, and biases of generated knowledge.Consequently, Gartner [1] outlines that big data's large volume requires cost-effective, innovative forms for processing information, to enhance insights and decision-making processes.
These various applications help people to obtain better services, experiences, or be healthier, by detecting illness symptoms much earlier than before [2].Some significant challenges of managing and analyzing big data are [4,5]: (1) Analytics Architecture: The optimal architecture for dealing with historic and real-time data at the same time is not obvious yet.(2) Statistical significance: Fulfill statistical results, which should not be random.
(3) Distributed mining: Various data mining methods are not fiddling to paralyze.(4) Time evolving data: Data should be improved over time according to the field of interest.(5) Compression: To deal with big data, the amount of space that is needed to store is highly relevant.(6) Visualization: The main mission of big data analysis is the visualization of results.(7) Hidden big data: Large amounts of beneficial data are lost since modern data is unstructured data.
Due to the increasing volume of data at a matchless rate and of various forms, we need to manage and analyze uncertainty of various types of data.Big data analytics is a significant function of big data, which discovers unobserved patterns and relationships among various items and people interest on a specific item from the huge data set.Various methods are applied to obtain valid, unknown, and useful models from large data.Association rule mining stands among big data analytics functionalities.The concept of association rule (AR) mining already returns to H'ajek et al. [6].Each association rule in database is composed from two different sets of items, which are called antecedent and consequent.A simple example of association rule mining is "if the client buys a fruit, he/she is 80% likely to purchase milk also".The previous association rule can help in making a marketing strategy of a grocery store.Then, we can say that association rule-mining finds all of the frequent items in database with the least complexities.From all of the available rules, in order to determine the rules of interest, a set of constraints must be determined.These constraints are support, confidence, lift, and conviction.Support indicates the number of occurrences of an item in all transactions, while the confidence constraint indicates the truth of the existing rule in transactions.The factor "lift" explains the dependency relationship between the antecedent and consequent.On the other hand, the conviction of a rule indicates the frequency ratio of an occurring antecedent without a consequent occurrence.Association rules mining could be limited to the problem of finding large itemsets, where a large itemset is a collection of items existing in a database transactions equal to or greater than the support threshold [7][8][9][10][11][12][13][14][15][16][17][18][19][20].In [8], the author provides a survey of the itemset methods for discovering association rules.The association rules are positive and negative rules.The positive association rules take the form X → Y , X ⊆ I, Y ⊆ I and X ∩ Y = ϕ, where X, Y are antecedent and consequent and I is a set of items in database.Each positive association rule may lead to three negative association rules, → Y , X → Y , and X → Y .Generating association rules in [9] consists of two problems.The first problem is to find frequent itemsets whose support satisfies a predefined minimum value.Then, the concern is to derive all of the rules exceeding a minimum confidence, based on each frequent itemset.Since the solution of the second problem is straightforward, most of the proposed work goes in for solving the first problem.An a priori algorithm has been proposed in [19], which was the basis for many of the forthcoming algorithms.A two-pass algorithm is presented in [11].It consumes only two database scan passes, while a priori is a multi-pass algorithm and needs up to c+1 database scans, where c is the number of items (attributes).Association rules mining is applicable in numerous database communities.It has large applications in the retail industry to improve market basket analysis [7].Streaming-Rules is an algorithm developed by [9] to report an association between pairs of elements in streams for predictive caching and detecting the previously undetectable hit inflation attacks in advertising networks.Running mining algorithms on numerical attributes may result in a large set of candidates.Each candidate has small support and many rules have been generated with useless information, e.g., the age attribute, salary attribute, and students' grades.Many partitioning algorithms have been developed to solve the numerical attributes problem.The proposed algorithms faced two problems.The first problem was the partitioning of attribute domain into meaningful partitions.The second problem was the loss of many useful rules due to the sharp boundary problem.Consequently, some rules may fail to achieve the minimum support threshold because of the separating of its domain into two partitions.
Fuzzy sets have been introduced to solve these two problems.Using fuzzy sets make the resulted association rules more meaningful.Many mining algorithms have been introduced to solve the quantitative attributes problem using fuzzy sets proposed algorithms in [13][14][15][16][17][18][19][20][21][22][23][24][25][26][27] that can be separated into two types related to the kind of minimum support threshold, fuzzy mining based on single-minimum support threshold, and fuzzy mining based on multi-minimum support threshold [21].Neutrosophic theory was introduced in [28] to generalize fuzzy theory.In [29][30][31][32], the neutrosophic theory has been proposed to solve several applications and it has been used to generate a solution based on neutrosophic sets.Single-valued neutrosophic set was introduced in [33] to transfer the neutrosophic theory from the philosophic field into the mathematical theory, and to become applicable in engineering applications.In [33], a differentiation has been proposed between intuitionistic fuzzy sets and neutrosophic sets based on the independence of membership functions (truth-membership function, falsity-membership function, and indeterminacy-membership function).In neutrosophic sets, indeterminacy is explicitly independent, and truth-membership function and falsity-membership function are independent as well.In this paper, we introduce an approach that is based on neutrosophic sets for mining association rules, instead of fuzzy sets.Also, a comparison resulted association rules in both of the scenarios has been presented.In [34], an attempt to express how neutrosophic sets theory could be used in data mining has been proposed.They define SVNSF (single-valued neutrosophic score function) to aggregate attribute values.In [35], an algorithm has been introduced to mining vague association rules.Items properties have been added to enhance the quality of mining association rules.In addition, almost sold items (items has been selected by the customer, but not checked out) were added to enhance the generated association rules.AH-pair Database consisting of a traditional database and the hesitation information of items was generated.The hesitation information was collected, depending online shopping stores, which make it easier to collect that type of information, which does not exist in traditional stores.In this paper, we are the first to convert numerical attributes (items) into neutrosophic sets.While vague association rules add new items from the hesitating information, our framework adds new items by converting the numerical attributes into linguistic terms.Therefore, the vague association rule mining can be run on the converted database, which contains new linguistic terms.

Research Contribution
Detecting hidden and affinity patterns from various, complex, and big data represents a significant role in various domain areas, such as marketing, business, medical analysis, etc.These patterns are beneficial for strategic decision-making.Association rules mining plays an important role as well in detecting the relationships between patterns for determining frequent itemsets, since classical association rules cannot use all types of data for the mining process.Binary data can only be used to form classical rules, where items either exist in database or not.However, when classical association rules deal with quantitative database, no discovered rules will appear, and this is the reason for innovating quantitative association rules.The quantitative method also leads to the sharp boundary problem, where the item is below or above the estimation values.The fuzzy association rules are introduced to overcome the classical association rules drawbacks.The item in fuzzy association rules has a membership function and a fuzzy set.The fuzzy association rules can deal with vague rules, but not in the best manner, since it cannot consider the indeterminacy of rules.In order to overcome drawbacks of previous association rules, a new neutrosophic association rule algorithm has been introduced in this research.Our proposed algorithm deals effectively and efficiently with vague rules by considering not only the membership function of items, but also the indeterminacy and the falsity functions.Therefore, the proposed algorithm discovers all of the possible association rules and minimizes the losing processes of rules, which leads to building efficient and reliable decision-making system.By comparing our proposed algorithm with fuzzy approaches, we note that the number of association rules is increased, and negative rules are also discovered.The separation of negative association rules from positive ones is not a simple process, and it helps in various fields.As an example, in the medical domain, both positive and negative association rules help not only in the diagnosis of disease, but also in detecting prevention manners.
The rest of this research is organized as follows.The basic concepts and definitions of association rules mining are presented in Section 2. A quick overview of fuzzy association rules is described in Section 3. The neutrosophic association rules and the proposed model are presented in Section 4. A case study of Telecom Egypt Company is presented in Section 5.The experimental results and comparisons between fuzzy and proposed association rules are discussed in Section 6.The conclusions are drawn in Section 7.

Association Rules Mining
In this section, we formulate the |D| transactions from the mining association rules for a database D. We used the following notations: (i) I = {i 1 , i 2 , . . .i m } represents all the possible data sets, called items.(ii) Transaction set T is the set of domain data resulting from transactional processing such as T ⊆ I. (iii) For a given itemset X ⊆ I and a given transaction T, we say that T contains X if and only if X ⊆ T. (iv) σ X : the support frequency of X, which is defined as the number of transactions out of D that contain X. (v) s: the support threshold.
X is considered a large itemset, if σ X ≥ |D| × s.Further, an association rule is an implication of the form X ⇒ Y , where X ⊆ I, Y ⊆ I and X ∩ Y = ϕ.
An association rule X ⇒ Y is addressed in D with confidence c if at least c transactions out of D contain both X and Y.The rule X ⇒ Y is considered as a large itemset having a minimum support s if: For a specific confidence and specific support thresholds, the problem of mining association rules is to find out all of the association rules having confidence and support that is larger than the corresponding thresholds.This problem can simply be expressed as finding all of the large itemsets, where a large itemset L is:

Fuzzy Association Rules
Mining of association rules is considered as the main task in data mining.An association rule expresses an interesting relationship between different attributes.Fuzzy association rules can deal with both quantitative and categorical data and are described in linguistic terms, which are understandable terms [26].
Let T = {t 1 , . . . ,t n } be a database transactions.Each transaction consists of a number of attributes (items).Let I = {i 1 , . . . ,i m } be a set of categorical or quantitative attributes.For each attribute i k , (k = 1, . . ., m), we consider {n 1 , . . . ,n k } associated fuzzy sets.Typically, a domain expert determines the membership function for each attribute.
The tuple < X, A > is called the fuzzy itemset, where X ⊆ I (set of attributes) and A is a set of fuzzy sets that is associated with attributes from X.
Following is an example of fuzzy association rule: IF salary is high and age is old THEN insurance is high Before the mining process starts, we need to deal with numerical attributes and prepare them for the mining process.The main idea is to determine the linguistic terms for the numerical attribute and define the range for every linguistic term.For example, the temperature attribute is determined by the linguistic terms {very cold, cold, cool, warm, hot}.Figure 1 illustrates the membership function of the temperature attribute.The membership function has been calculated for the following database transactions illustrated in Table 1.We add the linguistic terms {very cold, cold, cool, warm, hot} to the candidate set and calculate the support for those itemsets.After determining the linguistic terms for each numerical attribute, the fuzzy candidate set have been generated.
Table 2 contains the support for each itemset individual one-itemsets.The count for every linguistic term has been calculated by summing its membership degree over the transactions.Table 3 shows the support for two-itemsets.The count for the fuzzy sets is the summation of degrees that resulted from the membership function of that itemset.The count for two-itemset has been calculated by summing the minimum membership degree of the 2 items.For example, {cold, cool} has count 0.8, which resulted from transactions T2 and T3.For transaction T2, membership degree of cool is 0.6 and membership degree for cold is 0.4, so the count for set {cold, cool} in T2 is 0.4.Also, T3 has the same count for {cold, cool}.So, the count of set {cold, cool} over all transactions is 0.8.In subsequent discussions, we denote an itemset that contains k items as k-itemset.The set of all k-itemsets in L is referred as L k .

Neutrosophic Association Rules
In this section, we overview some basic concepts of the NSs and SVNSs over the universal set X, and the proposed model of discovering neutrosophic association rules.

Definition 1 ([33]
).Let X be a space of points and x∈X.A neutrosophic set (NS) A in X is definite by a truth-membership function T A (x), an indeterminacy-membership function I A (x) and a falsity-membership function F A (x). T A (x), I A (x) and F A (x) are real standard or real nonstandard subsets of ] − 0, 1 + [.That is T A (x): X → ] − 0, 1 + [, I A (x): X → ] − 0, 1 + [ and F A (x): X → ] − 0, 1 + [.There is no restriction on the sum of T A (x), I A (x) and F A (x), so 0 − ≤ sup T A (x) + sup I A (x) + sup F A (x) ≤ 3 + .
Neutrosophic is built on a philosophical concept, which makes it difficult to process during engineering applications or to use it to real applications.To overcome that, Wang et al. [31], defined the SVNS, which is a particular case of NS.Definition 2. Let X be a universe of discourse.A single valued neutrosophic set (SVNS) A over X is an object taking the form A = { x,T A (x), I A (x), F A (x) : x∈X}, where T A (x): X → [0, 1], I A (x): X → [0, 1] and F A (x): X → [0, 1] with 0 ≤ T A (x) + I A (x) + F A (x) ≤ 3 for all x∈X.The intervals T A (x), I A (x) and F A (x) represent the truth-membership degree, the indeterminacy-membership degree and the falsity membership degree of x to A, respectively.For convenience, a SVN number is represented by A = (a, b, c), where a, b, c∈[0, 1] and a + b + c ≤ 3.

Definition 3 (Intersection) ([31]). For two SVNSs
, the intersection of these SVNSs is again an SVNSs which is defined as C = A ∩ B whose truth, indeterminacy and falsity membership functions are defined as T C (x) = min(T A (x), T B (x)), I A (x) = min(I A (x), I B (x)) and F C (x) = max(F A (x), F B (x)).

Definition 4 (Union) ([31]
).For two SVNSs A = T A (x), I A (x), F A (x) and B = T B (x), I B (x), F B (x) , the union of these SVNSs is again an SVNSs which is defined as C = A ∪ B whose truth, indeterminacy and falsity membership functions are defined as T C (x) = max(T A (x), T B (x)), I A (x) = max(I A (x), I B (x)) and F C (x) = min(F A (x), F B (x)).

Definition 5 (Containment) ([31]). A single valued neutrosophic set A contained in the other SVNS B, denoted by A ⊆ B if and only if T
Next, we propose a method for generating the association rule under the SVNS environment.

Proposed Model for Association Rule
In this paper, we introduce a model to generate association rules of form: X → Y where X ∩ Y = ϕ and X, Y are neutrosophic sets.Our aim is to find the frequent itemsets and their corresponding support.Generating an association rule from its frequent itemsets, which are dependent on the confidence threshold, are also discussed here.This has been done by adding the neutrosophic set into I, where I is all of the possible data sets, which are referred as items.So I = N ∪ M where N is neutrosophic set and M is classical set of items.The general form of an association rule is an implication of the form X → Y , where Therefore, an association rule X → Y is addressed in Database D with confidence 'c' if at least c transactions out of D contains both X and Y. On the other hand, the rule X → Y is considered a large item set having a minimum support s if σ X∪Y ≥ |D| × s.Furthermore, the process of converting the quantitative values into the neutrosophic sets is proposed, as shown in Figure 2. The proposed model for the construction of the neutrosophic numbers is summarized in the following steps: Step 1 Set linguistic terms of the variable, which will be used for quantitative attribute.
Step 2 Define the truth, indeterminacy, and the falsity membership functions for each constructed linguistic term.
Step 3 For each transaction t in T, compute the truth-membership, indeterminacy-membership and falsity-membership degrees.
Step 4 Extend each linguistic term l in set of linguistic terms L into T L , I L , and F L to denote truthmembership, indeterminacy-membership, and falsity-membership functions, respectively.
• calculate count of each linguistic term by summing degrees of membership for each transaction as • calculate support for each linguistic term s = No. o f trnsactions .
Step 6 The above procedure has been repeated for every quantitative attribute in the database.
In order to show the working procedure of the approach, we consider the temperature as an attribute and the terms "very cold", "cold", "cool", "warm", and "hot" as their linguistic terms to represent the temperature of an object.Then, following the steps of the proposed approach, construct their membership function as below: Step 1 The attribute temperature' has set the linguistic terms "very cold", "cold", "cool", "warm", and "hot", and their ranges are defined in Table 4. Step 2 Based on these linguistic term ranges, the truth-membership functions of each linguistic variable are defined, as follows: The falsity-membership functions of each linguistic variable are defined as follows: ; otherwise The indeterminacy membership functions of each linguistic variables are defined as follows: f or x ≤ −2.5 (x + 2.5)/5 ; f or − 2.5 ≤ x ≤ 2.5 (7.5 − x)/5 ; f or 2.5 ≤ x ≤ 7.5 0 ; f or x ≥ 7.5 (x + 2.5)/5 ; f or 2.5 ≤ x ≤ 2.5 (7.5 − x)/5 ; f or 2.5 ≤ x ≤ 7.5 (x − 7.5)/5 ; f or 7.5 ≤ x ≤ 12.5 (17.5 − x)/5 ; f or 12.5 ≤ x ≤ 17.5 0 ; otherwise The graphical membership degrees of these variables are summarized in Figure 3.The graphical falsity degrees of these variables are summarized in Figure 4. Also, the graphical indeterminacy degrees of these variables are summarized in Figure 5. On the other hand, for a particular linguistic term, 'Cool' in the temperature attribute, their neutrosophic membership functions are represented in Figure 6.Step 3 Based on the membership grades, different transaction has been set up by taking different sets of the temperatures.The membership grades in terms of the neutrosophic sets of these transactions are summarized in Table 5. Very cold <0,0,1> cold <0.4,0.9,0.6>cool <0.6,0.9,0.4>warm <0,0,1> hot <0,0,1> T3 12 Very cold <0,0,1> cold <0.6,0.9,0.4>cool <0.4,0.9,0.6>warm <0,0,1> hot <0,0,1> T4 33 Very cold <0,0,1> cold <0,0,1> cool <0,0,1> warm <0.4,0.9,0.6>hot <0.6,0.9,0.4>T5 21 Very cold <0,0,1> cold <0,0,1> cool <0.8,0.7,0.2>warm <0.2,0.7,0.8>hot <0,0,1> T6 25 Very cold <0,0,1> cold <0,0,1> cool <0,0,1> warm <1,0.5,0>hot <0,0,1> Step 4 Now, we count the set of linguistic terms {very cold, cold, cool, warm, hot} for every element in transactions.Since the truth, falsity, and indeterminacy-memberships are independent functions, the set of linguistic terms can be extended to T very−cold , T cold , T cool , T warm , T hot F very−cold , F cold , F cool , F warm , F hot I very−cold , I cold , I cool , I warm , I hot where F warm means not worm and I warm means not sure of warmness.This enhances dealing with negative association rules, which is handled as positive rules without extra calculations.Step 5 By using the membership degrees that are given in Table 5 for candidate 1-itemset, the count and support has been calculated, respectively.The corresponding results are summarized in Table 6.Similarly, the two-itemset support is illustrated in Table 7 and the rest of itemset generation (k-itemset for k = 3, 4 . . .8) are obtained similarly.The count for k-item set in database record is defined by minimum count of each one-itemset exists.
For example: {T Cold , T Cool } count is 0.8 Because they exists in both T2 and T3.In T2: T Cold = 0.4 and T Cool = 0.6 so, count for {T Cold , T Cool } in T2 = 0.4 In T3: T Cold = 0.6 and T Cool = 0.4 so, count for {T Cold , T Cool } in T2 = 0.4 Thus, count of {T Cold , T Cool } in (Database) DB is 0.8.

Case Study
In this section, the case of Telecom Egypt Company stock records has been studied.Egyptian stock market has many companies.One of the major questions for stock market users is when to buy or to sell a specific stock.Egyptian stock market has three indicators, EGX30, EGX70, and EGX100.Each indicator gives a reflection of the stock market.Also, these indicators have an important impact on the stock market users, affecting their decisions of buying or selling stocks.We focus in our study on the relation between the stock and the three indicators.Also, we consider the month and quarter of the year to be another dimension in our study, while the sell/buy volume of the stock per day is considered to be the third dimension.
In this study, the historical data has been taken from the Egyptian stock market program (Mist) during the program September 2012 until September 2017.For every stock/indicator, Mist keeps a daily track of number of values (opening price, closing price, high price reached, low price reached, and volume).The collected data of Telecom Egypt Stock are summarized in Figure 7.The same was performed for the stock market indicators.Now, we take the attributes as "quarter", "month", "stock change rate", "volume percentage", and "indicators change rate".Table 8 illustrates the segment of resulted data after preparation.Based on these linguistic terms, define the ranges under the SVNSs environment.For this, corresponding to the attribute in "change rate" and "volume", the truth-membership functions by defining their linguistic terms as {"high up", "high low", "no change", "low down", "high down"} corresponding to attribute "change rate", while for the attribute "volume", the linguistic terms are (low, medium, and high) and their ranges are summarized in Figures 8 and 9, respectively.The falsity-membership function and indeterminacy-membership function have been calculated and applied as well for change rate attribute.

Experimental Results
We proceeded to a comparison between fuzzy mining and neutrosophic mining algorithms, and we found out that the number of generated association rules increased in neutrosophic mining.
A program has been developed to generate large itemsets for Telecom Egypt historical data.VB.net has been used in creating this program.The obtained data have been stored in an access database.The comparison depends on the number of generated association rules in a different min-support threshold.It should be noted that the performance cannot be part of the comparison because of the number of items (attributes) that are different in fuzzy vs. neutrosophic association rules mining.In fuzzy mining, the number of items was 14, while in neutrosophic mining it is 34.This happens because the number of attributes increased.Spreading each linguistic term into three (True, False, Indeterminacy) terms make the generated rules increase.The falsity-generated association rules can be considered a negative association rules.As pointed out in [36], the conviction of a rule conv(X → Y) is defined as the ratio of the expected frequency that X happened without Y falsity-association rules to be used to generate negative association rules if T(x) + F(x) = 1.In Table 9, the number of generated fuzzy rules in each k−itemset using different min-support threshold are reported, while the total generated fuzzy association rule is presented in Figure 10.As compared to the fuzzy approach, by applying the same min-support threshold, we get a huge set of neutrosophic association rules.Table 10 illustrates the booming that happened to generated neutrosophic association rules.We stop generating itemsets at iteration 4 due to the noted expansion in the results shown in Figure 11, which shows the number of neutrosophic association rules.Experiment has been re-run using different min-support threshold values and the resulted neutrosophic association rules counts has been noted and listed in Table 11.Note the high values that are used for min-support threshold.Figure 12 illustrates the neutrosophic association rules for min-support threshold from 0.5 to 0.9.Using the neutrosophic mining approach makes association rules exist for most of the min-support threshold domain, which may be sometimes misleading.We found that using the neutrosophic approach is useful in generating negative association rules beside positive association rules minings.Huge generated association rules provoke the need to re-mine generated rules (mining of mining association rules).Using suitable high min-support values may help in the neutrosophic mining process.

Conclusions and Future Work
Big data analysis will continue to grow in the next years.In order to efficiently and effectively deal with big data, we introduced in this research a new algorithm for mining big data using neutrosophic association rules.Converting quantitative attributes is the main key for generating such rules.Previously, it was performed by employing the fuzzy sets.However, due to fuzzy drawbacks, which we discussed in the introductory section, we preferred to use neutrosophic sets.Experimental results showed that the proposed approach generated an increase in the number of rules.In addition, the indeterminacy-membership function has been used to prevent losing rules from boundaries problems.The proposed model is more effective in processing negative association rules.By comparing it with the fuzzy association rules mining approaches, we conclude that the proposed model generates a larger number of positive and negative association rules, thus ensuring the construction of a real and efficient decision-making system.In the future, we plan to extend the comparison between the neutrosophic association rule mining and other interval fuzzy association rule minings.Furthermore, we seized the falsity-membership function capacity to generate negative association rules.Conjointly, we availed of the indeterminacy-membership function to prevent losing rules from boundaries problems.Many applications can emerge by adaptions of truth-membership function, indeterminacy-membership function, and falsity-membership function.Future work will benefit from the proposed model in generating negative association rules, or in increasing the quality of the generated association rules by using multiple support thresholds and multiple confidence thresholds for each membership function.The proposed model can be developed to mix positive association rules (represented in the truth-membership function) and negative association rules (represented in the falsity-membership function) in order to discover new association rules, and the indeterminacy-membership function can be put forth to help in the automatic adoption of support thresholds and confidence thresholds.Finally, yet importantly, we project to apply the proposed model in the medical field, due to its capability in effective diagnoses through discovering both positive and negative symptoms of a disease.All future big data challenges could be handled by combining neutrosophic sets with various techniques.

Figure 1 .
Figure 1.Linguistic terms of the temperature attribute.

Figure 3 .
Figure 3. Truth-membership function of temperature attribute.

Figure 10 .
Figure 10.No. of fuzzy association rules with different min-support threshold.

Figure 12 .
Figure 12.No. of neutrosophic rules for min-support threshold from 0.5 to 0.9.

Table 1 .
Membership function for Database Transactions.

Table 5 .
Membership function for database Transactions.

Table 8 .
Segment of data after preparation.

Table 9 .
No. of resulted fuzzy rules with different min-support.

Table 10 .
No. of neutrosophic rules with different min-support threshold.
Figure 11.No. of neutrosophic association rules with different min-support threshold.

Table 11 .
No. of neutrosophic rules with different min-support threshold.