Adaptive Neuro ‐ Fuzzy Inference System Predictor with an Incremental Tree Structure Based on a Context ‐ Based Fuzzy Clustering Approach

: We propose an adaptive neuro ‐ fuzzy inference system (ANFIS) with an incremental tree structure based on a context ‐ based fuzzy C ‐ means (CFCM) clustering process. ANFIS is a combination of a neural network with the ability to learn, adapt and compute, and a fuzzy machine with the ability to think and to reason. It has the advantages of both models. General ANFIS rule generation methods include a method employing a grid division using a membership function and a clustering method. In this study, a rule is created using CFCM clustering that considers the pattern of the output space. In addition, multiple ANFISs were designed in an incremental tree structure without using a single ANFIS. To evaluate the performance of ANFIS in an incremental tree structure based on the CFCM clustering method, a computer performance prediction experiment was conducted using a building heating ‐ and ‐ cooling dataset. The prediction experiment verified that the proposed CFCM ‐ clustering ‐ based ANFIS shows better prediction efficiency than the current grid ‐ based and clustering ‐ based ANFISs in the form of an incremental tree.


Introduction
In optimizing the nonlinear system model, the neuro-fuzzy system has exhibited better performance than the model based on the existing linear system [1][2][3][4][5][6][7][8][9][10][11]. In the case of a neuro-fuzzy system that simulates human learning ability, decision judgment, etc., rather than a mathematical calculation technique, the performance of the model may vary depending on the type of learning model or learning method.
A grid-based rule generation approach and a clustering-based rule generation method can be separated into the adaptive neuro-fuzzy inference system (AFNIS) model rule generation technique. Studies on the grid-based rule generation method include the following: Dovzan [12] proposed a hyperplane-based fuzzy space partitioning method by defining the superplane dividing the problem space and introducing principal component analysis, in which the distance to the superplane is used as a metric instead of the center-oriented cluster. In order to automatically design interpretable fuzzy partitions with maximal granularity, Castiello [13] suggested a dual clustering (DC) method. DC is advanced and works in a two-step phase for classification problems. The first step identifies a cluster of multidimensional samples to derive a prototype with class labels. In the second step, these prototype one-dimensional projections are further clustered at the same time along each dimension, minimizing the number of clusters for each function. Alexandridis [14] proposed a new algorithm to train radial base function (RBF) networks to produce models with increased accuracy and brevity.
The proposed approach is based on the asymmetric deformation of the algorithm of the fuzzy means (FM) with the potential to calculate the number and position of the centers of the silver-winged node RBF, while linear regression is used for the synaptic weights. Verstraete [15] proposed a new approach to remap grid data using the additional data provided so that the system can automatically assume the underlying distribution. The proposed method uses correlation data to imitate intelligent reasoning to provide insight into the distribution of the original data. In the grid-based rule generation method, when the dimension of the input increases or the number of membership functions (MFs) increases, the rule of the neuro-fuzzy system model increases exponentially. Various studies have been undertaken to solve these problems. A typical example is a clustering method in which a given input space and an output space are divided into subspaces, each having a meaning to give a preamble MF.
Studies on creating a rule using a clustering method include the following: Lee [16] introduced an enhanced Mobile Sensor Network (MSN) Low-Energy Adaptive Clustering Layer Protocol to not only prolong the life of the network but also reduce the failure of the package using the fuzzy inference method. Su [17] proposed a belief-peak-based clustering method as an idea, with evidence that all data objects in each sample subsection led to a belief in the possibility that the sample would become a cluster center. Xu [18] proposed a concise zero-order Sugeno-Takagi (TSK) inference system based on enhanced soft subspace clustering (ESSC) and sparse leading (SL) to improve the clarity and interpretability of fuzzy reasoning systems. Sujil [19] proposed wind power generation prediction agents for multiple-agent-based energy management systems in smart microgrids using subtraction clustering and fuzzy clustering methods. A fuzzy-based hyper-round strategy (FHRP) was introduced by Neamatollani [20] to plan clustering operations easily and flexibly. The FHRP performs clustering at the start of all hyper-rounds (HRs) consisting of several rounds other than each round, and the length of the HR is not fixed during the network life and is calculated using the fuzzy reasoning system. To improve the classification and rule-based analytical performance for unbalanced datasets, Gu [21] proposed an imbalanced TSK purge classifier (IB-TSK-FC) for TSK fuzzy classifiers. A hierarchical fuzzy inference tree (HFIT) was constructed by Ojha [22]. In order to construct a natural hierarchy that supports simplicity, HFIT incorporates many low-dimensional fuzzy logic structures with a structure close to the ideal tree. This natural hierarchy provides a high level of approximation accuracy. The clustering-based rule generation method belongs to a cluster that satisfies a given condition by measuring the degree of similarity with each pattern, under the assumption that there are multiple patterns in one nonlinear data space.
Because the information used to create rules has uncertainty, the MFs of the conditional and conclusion parts of the corresponding rules have uncertainty. Studies have been performed to adjust the form of the MF to minimize this instability. Shi [23] proposed the fountain differential proportional-integral-derivative (PID) and fountain differential type 1 purge PID controller to solve this problem because the fountain differential gap type 2 purge PID controller cannot handle the uncertainty of the system. In describing the system's instability dependent on the general type 2 fuzzy logic system, the proposed controller will thoroughly exploit the benefits of the general type 2 fuzzy logic system. The definition of conditional fuzzy sets was suggested by Wang [24] and proved that type 2 fuzzy sets are united with conditional fuzzy sets. Both the conditional fuzzy set and the fuzzy form 2 set are fuzzy relationships for the primary and secondary variables' product space. The distinction is that the primary and secondary variables are usually independent of each one in the conditional fuzzy set system. To resolve the effects of human-made artifacts and a self-regulating interval type 2 neural purge inference system (SRIT2NFIS) to deal with these intrinsic anomalies, Das [25] proposed a powerful general spatial pattern characteristic pursuit algorithm (RoCSP). Das [26] indicated an emerging neural purge inference method (IT2FIS) gap type 2 and its total sequential learning algorithm. Meta-aware learning manages the learning process by choosing the best learning strategy and lets the recommended IT2FIS efficiently estimate the relationship between input and output. The evolving IT2FIS using meta-cognitive learning algorithms is called McTI2FIS. Zhou [27] conducted a study on how footprint of uncertainty (FOU) affects the analysis structure of a wide range of IT2 Mamdani and TSK controllers (i.e., input-output mathematical relationships). A recent application of the hybrid learning approach to the optimization of membership and non-membership functions of the newly developed Type 2 Interval Intuitive Fuzzy Logic Method (IT2IFLS) of the TSK Fuzzy Rationing System using neural networks was introduced by Eyoh [28]. Sumati [29] proposed the gap type 2 mutual subset purge neural inference system (IT2MSFuNIS). A reciprocal subset measurement between the two gap type 2 fuzzy sets is derived and used to determine the similarity between IT2FS inputs and sex items. Biglarbegian [30] proposed a new reasoning mechanism for the interval type 2 TSK fuzzy logic control system (IT2 TSK FLCS) when the condition is a type 2 fuzzy set, and the conclusion is a constant. Gracia [31] proposed a complete framework for type 2 FLS that uses up-to-date perceptions of IT2 FS (a set of gap type 2 fuzzy sets in a typical subsubsidiary form) in which secondary ratings could be nonconvex T1 FS.
As a result of confirming the studies of the ANFIS models summarized above, the existing study focuses on the model rule generation method. In this research, we propose a context-based fuzzy Cmeans (CFCM) clustering-based rule generation approach instead of a general clustering-based rule generation methodology that takes into account the patterns of the input space as well as the output space and ANFIS in the form of an incremental tree structure rather than a single structure. Whereas general clustering methods only take the input space into account, the CFCM clustering approach often takes the output space pattern into account, so that the cluster can be generated more accurately. There are many inputs when using big data in numerous application fields. In the neuro-fuzzy system, as the number of inputs increases, the number of rules increases exponentially. Therefore, it creates meaningful rules by designing a point-of-point tree structure using multiple ANFISs rather than a single ANFIS structure. To evaluate the performance of ANFIS in an incremental tree structure based on the CFCM clustering method, a computer performance prediction experiment was conducted using a building heating-and-cooling dataset [32]. The building heating-and-cooling dataset is a dataset used for energy efficiency forecasting created by Xifara. It consists of eight input variables and two output variables and has a data size of 768 × 10.
The remainder of this paper is structured as follows. Section 1 explains the background of the study. Section 2 describes the method and structure of ANFIS rule creation. In Section 3, the proposed method, ANFIS, with an incremental tree structure based on the CFCM clustering method, is described. Section 4 analyzes the predictive performance of the proposed method, and Section 5 addresses conclusions and future research plans.

ANFIS
Fuzzy inference has the characteristic of effectively explaining the system by organizing professional empirical knowledge that is difficult to quantitatively express in the form of MFs and fuzzy rule bases. [33]. In addition, because neural networks [34] have learning ability, they are highly flexible in the configuration of the system, and they have excellent parallel processing and fault tolerance capabilities. Neuro-fuzzy system neural network theories are actively studied in various fields.
A typical example of this neuro-fuzzy system is ANFIS. The premise of ANFIS depends on how the rule is created. The structure of the conclusion section consists of the form of the first equation stone, the TSK [35] model. Section 2.1 describes how to create rules to determine the premise.

Rule Creation Method
You divide all dimensions of the input space consisting of input variables into separate areas when you deduce a fuzzy law and organize them into segmentation and conquest methods that allow the resulting values of the inference in those areas to be determined. In other words, the premise of a fuzzy rule splits the input space into several regions, and the product of inference from each of those areas is the conclusion of the fuzzy rule. The creation of these fuzzy rules is closely connected to how the input space is separated. ANFIS has a method of grid-based rule development and a method of clustering-based rule creation, mainly based on generating rules consisting of input variables at all stages. Figure 1 shows how to create grid-based rules and how to create clustering-based rules based on ANFIS. Grid partitioning [36] is a method of dividing space into the same structure as the grid so that there is no overlap in the input space. Generally, the application of the grid partitioning method produces uniformly specific partitioning areas, that is, areas with fuzzy rules, which facilitate the analysis of fuzzy rules. When the number of input variables is minimal, that is, when the input space dimension is low, grid partitioning is used. If there are 10 input variables, for example, each input variable is split into two member functions, or 2 128 specific areas. In other terms, for each particular region, one rule is made, and the total number of rules is 128, which is a very complex structure. Therefore, where the number of input variables is limited, the grid partitioning approach is mainly used.
By improving the C-means clustering approach suggested by Bezdek [37,38], the fuzzy C-means (FCM) clustering method is based on a fuzzy set and the least square method. By listing the values belonging to the data in a cluster according to the degree of belonging of each data object in a cluster, the FCM clustering system distinguishes particular subdivided regions. The methods of FCM clustering include the m vector , 1, 2, … , , set in c fuzzy clusters and locate the center in each cluster as it minimizes the objective function of the non-similar calculation. In standard methods of clustering, every data point belongs to a cluster with a membership of 0 or 1. However, there is a gap in the degree of membership of the arbitrary date between 0 and 1 in the FCM clustering process, and it belongs to n clusters. The number of clusters is fixed by the user. The number of clusters here is the number of laws that are fuzzy. Next, we explain the procedure for FCM clustering methods.
Step 1: To have some value between 0 and 1 that satisfies the parameter and membership matrix, initialize (1) here, the Euclidean standard is used to measure the distance between the input data and the middle of the cluster: Step 2: The current cluster's center value is determined by the input data value , , … , and the previously acquired MF : Step 3: The membership matrix is continuously modified with increasing numbers of repetitions using the center value and input data obtained in step 2, : Step 4: The above procedure is repeated until the repeated membership matrix and error is less than any threshold value given by the membership matrix or

Structure
A type of neuro-fuzzy inference method proposed by Jang [35] is the ANFIS model. For given input and output data, the ANFIS model utilizes the least square method and back propagation algorithms to optimally approximate the parameters used in the MF and output. A model consisting of two inputs and n TSK rules with one output and an output from the first linear equation defines the fuzzy inference mechanism briefly: Rule ∶ If X is and is , then … Rule ∶ If X is and is , then here, and represent input variables, and and are fuzzy sets of . Similarly, and represent fuzzy sets of , and , and represent sets of arguments set in rule . The ANFIS model, a forward network structure which consists of two input variables and five levels with four fuzzy rules, is shown in Figure 2. Nodes have multiple functions on each layer of the ANFIS model that are refined through the learning process. The line of relation between two nodes indicates only the flow path between the nodes and has no weight. Next, for the ANFIS model, we define any layer structure and process. Layer 1: Every node in the first layer is able to output values belonging to the language level in the first layer: The MF selects and uses the following Gaussian MF: In addition to the Gaussian MF, a variety of MFs are available, and the learning process selects parameter values that minimize errors.
Layer 2: Every node in the second layer receives a membership value seen in the conditional part of the fuzzy rule in the second layer and outputs it as a weight multiplied by the rule: The output on each node shows that the fuzzy rule is sufficient. Layer 3: In the third layer, each node calculates the ratio of point firepower in rule to the sum of all point fire forces using: The values obtained are displayed as normalized values. Layer 4: Every node in the fourth layer conducts an operation in the fourth layer that multiplies the output function of the conclusion component of each law by the uniform fit: where is the Layer 3 output and the , and output function parameters denote the parameters of the conclusion.
Layer 5: Each node consists of one single node in the fifth and last layer. The output value is determined on the basis of all input values in the lower layer by using: * ∑ ∑ The output value has a continuous type value, not a fuzzy set type.

ANFIS with an Incremental Tree Structure Based on the CFCM Clustering Method
The number of laws increases exponentially as the number of inputs to the fuzzy system increases. The computational utility of the fuzzy system is decreased by this large rule base. It also makes the function of the fuzzy system difficult to understand and complicates the modification of rules and MF parameters. Since many implementations have a small supply of training data, the possibility of generalization of tuned fuzzy structures is diminished by a broad rule base. The fuzzy inference system (FIS) can be implemented as a tree with smaller interconnected FIS objects to solve this challenge, not as a single monolithic FIS entity. This fuzzy trees are also called hierarchical enemy fuzzy structures [39] since fuzzy systems are organized in a hierarchical tree structure. The output of a low-level fuzzy system is used as an input to a high-level fuzzy system in the tree structure. The fuzzy tree is more effective and easier to grasp in terms of computing than a single FIS with the same number of entries.

CFCM-Clustering-Based Rule Creation Method
CFCM clustering is a tool proposed by Pedrycz [40] to construct clusters and partition clusters in order to maintain pattern characteristics related to output variable similarities, as well as input space data. In the output variables, a traditional clustering approach does not take patterns into account but generates only clusters using the Euclidean distance between the centroid cluster and the input data. In comparison, by taking into account not just the pattern of input data but also the pattern of output variables, the CFCM clustering approach generates a cluster, facilitating more detailed space segmentation than the conventional clustering method.
The variations between FCM clustering and the strategies of CFCM clustering are seen in Figure  3. The FCM clustering approach produces two clusters if there is data in the input space, so it gives an initial centroid value and then uses the Euclidean distance between the middle and the data. The CFCM clustering process, by comparison, takes into account the output variable patterns and generates three clusters by taking into account the black and white characteristics of the data in the input space. Next, the CFCM clustering process procedures are defined. Step 1: Let 1 ∞ and set the number of clusters, 2 .
Step 2: Set the initial partition matrix and the threshold value ε, and select the number of repetitions: Step 3: Compute the center of each cluster, 1, 2, … , , using the membership matrix : Step 4: The partitioning matrix is modified with the center value of cluster c: here, represents 's degree of inclusion in the created cluster. The linguistic type specified in the output variable is, in other words, represented as a fuzzy set , : → 0, 1 and computed by an algorithm of fuzzy equalization. Then, the membership value of in can be expressed by , 1, 2, … , . Step the procedure above will be stopped. Otherwise, proceed again from Step 3. For ANFIS models, the methods of CFCM clustering mentioned above apply as follows: Input space data in Layer 1 is broken into input space by CFCM clustering, which outputs the value by considering the output variable pattern. In Layer 2, the values belonging to the previous layer are taken, the weights multiplied by the rules are given, and the impulse force proportion in Layer 3 is expressed as a normalized value. The normalized Layer 4 values are multiplied by the final output function and the final output is determined using Layer 5's weighted average.

ANFIS with an Incremental Tree Structure
For applications, several fuzzy tree structures are available. The input values are integrated into multiple stages in the incremental tree system used in this analysis to optimize the output values at various steps. The previous diagram, for example shows a three-level incremental fuzzy tree with a fuzzy inference method, where represents the FIS index of the nth level. At each step, there is only one fuzzy inference method in an incremental fuzzy tree; 1, that is. The th input of the ith FIS at level is indicated in the previous figure by input , where the kth output of the ith FIS at level n is indicated by the input. The monolithic ( 1) FIS is seen in Figure 4 with four inputs (  1, 2, 3, 4) and three MFs ( 3). Therefore, with the number of input sets, the cumulative number of rules in the incremental fuzzy tree is linear. Based on the contribution to the final output value, input selection at various levels of the incremental fuzzy tree uses input rank. Generally, the input value that contributes the most is used at the lowest level, and at the highest level, the input value that contributes the least is used. This implies that the input value of the low-rank depends upon the input value of the highrank. In the incremental fuzzy tree, irrespective of other important inputs, each input value usually contributes to some degree to the inference method. In this paper, to prevent over-generation of fuzzy rules due to large-scale databases and to generate meaningful rules, we propose a CFCM-ANFIS with an incremental tree structure rather than a single type of CFCM-ANFIS. As seen in Figure 5, this allows one to rank inputs using existing data to create the fuzzy tree.

Experiment and Analysis
In this section, to evaluate the predicted performance of ANFIS with an incremental tree structure based on CFCM clustering methods described in Section 3, experiments were conducted to predict computer performance using the computer hardware dataset. In this experiment, for the predictive performance of ANFIS using the grid-based rule generation method, which is a representative ANFIS, and ANFIS using the FCM clustering-based rule generation method, as well as ANFIS using the proposed method, the incremental tree structure-based CFCM clustering-based rule generation method is compared and analyzed.

Building Heating-and-Cooling Dataset
The building heating-and-cooling dataset is a dataset [41,42] used for energy efficiency forecasting created by Xifara. It consists of eight input variables and two output variables and has a data size of 768 × 10. Relative compaction, surface area, wall area, roof area, total height, direction, glazing area and distribution of the glazing area are input variables. The heating and cooling loads are the output factors, but this analysis uses only the heating load. To conduct the experiment, the building heating dataset was equally divided into learning and verification sets and data values were normalized to between 0 and 1.

Experimental Method and Analysis of Results
The predicted performance of grid-based AFNIS and FCM clustering-based ANFIS, which are general rule generation methods, and of the increasing tree structure based on the CFCM clustering method proposed in this study were compared and analyzed. As described above, a grid-based ANFIS creates rules by dividing the input space into lattices, and FCM clustering-based ANFIS clusters the input space using FCM clustering to create rules. The proposed method uses CFCM clustering in input and output spaces to create contexts and clusters to create rules.
First, the grid-based ANFIS experiment confirmed the predicted performance by increasing the MF by 1 from 2 to 5. By adjusting the number of clusters and the fuzzification coefficient, the FCMclustering-based ANFIS experiment was carried out. The number of clusters increased by 2 from 2 to 20 and the coefficient of fuzzification was set at 2 to confirm the performance anticipated. Finally, for the experiment on AFNIS using the incremental tree structure based on the CFCM clustering method, which is the method proposed in this study, we designed three CFCM-clustering-based ANFISs with two inputs and one output as an incremental tree structure. The entire input variable was then ranked according to the correlation coefficient and used as input to each ANFIS. In the CFCM clustering method, the number of contexts (p) increased by 2 from 2 to 6, and the number of clusters (c) increased by 2 from 2 to 20, confirming the predicted performance. Each ANFIS was performed 10 iterations, and the value with the minimum verification root mean square error (RMSE) was used as the result value. All experiments for Grid-ANFIS, FCM-ANFIS, Incremental-CFCM-ANFIS, LR and RBFN were conducted using Matlab, and the OS is a window10 environment. Table 1 summarizes the prediction performance of the grid-based ANFIS, and Figure 6 shows the prediction results of the grid-based ANFIS. As can be seen in Table 1, when there are two MFs, 256 rules are created, and the verification root mean square error (RMSE) is ~2.2471. When there are three, four, or five MFs, the number of rules increases exponentially, indicating a calculation error.   Table 2 summarizes the prediction performance of ANFIS based on FCM clustering, and Figure  7 shows the prediction results. As can be seen in Table 2, when there are 10 clusters, 10 rules are created and the verification RMSE is ~2.0671. The prediction efficiency of the CFCM-based ANFIS in the form of an incremental tree is summarized in Table 3, which is the approach suggested in this report. Figure 8 shows the prediction results. As can be seen in Table 3, when there are 6 contexts and 20 clusters, 120 rules are created and the verification RMSE is ~1.8705, yielding the best prediction performance. Table 4 Table 3. Results from ANFIS prediction experiment on the incremental tree structure based on CFCM clustering.

Conclusions
ACFCM-based incremental tree-structured ANFIS was proposed. To confirm the validity of the proposed method, the prediction performance was compared with the commonly used grid-based ANFIS and clustering-based ANFIS. As a result of the experiment, the CFCM-based incremental treestructured ANFIS proposed in this study was confirmed to be superior to the existing ANIFS model in terms of performance. In addition, it was confirmed that generating meaningful rules rather than multiple rules can improve prediction performance. As a future research plan, we plan to design a multi-ANFIS in various forms, rather than an incremental tree structure, and apply an optimization algorithm to generate meaningful rules.