Possibility Measure of Accepting Statistical Hypothesis

Abstract: Taking advantage of the possibility of a fuzzy test statistic falling in the rejection region, this study proposes a statistical hypothesis testing approach for fuzzy data. In contrast to classical statistical testing, which yields a binary decision to reject or to accept a null hypothesis, the proposed approach determines the possibility of accepting a null hypothesis (or alternative hypothesis). When data are crisp, the proposed approach reduces to the classical hypothesis testing approach.


Introduction
A statistical hypothesis is a statement about the population distribution. To seek evidence on whether the hypothesis is true or false, a sample needs to be drawn randomly from the population. The main task is, therefore, to select a proper statistical method to analyze the collected data and decide whether the null hypothesis under consideration is tenable. In classical statistical testing, the sample observations are generally crisp, and all the corresponding testing methods can be readily implemented. In practice, however, the data are frequently fuzzy due to imprecise measurement and rough description. For example, in a survey of the yearly starting salary of graduates, respondents are often unwilling to state the precise number, so the collected sample data are generally fuzzy, and data such as "roughly $29,000", "roughly $32,000", or "less than $40,000" are obtained. Therefore, extending the notion of hypothesis testing to the fuzzy environment would be useful in such cases.
Hypothesis testing methods have been effective for solving problems of fuzzy data. Bellman and Zadeh [1] first introduced hypothesis-testing models for application in the fuzzy environment. Casals et al. [2], Son et al. [3], Römer and Kandel [4], Lubiano et al. [5], and Arefi [6] extended classical statistical hypothesis testing methods to perform hypothesis testing for fuzzy data. Watanabe and Imaizumi [7] also fuzzified the statistical hypothesis and then performed fuzzy testing. Delgado et al. [8] considered a Bayesian testing method for fuzzy data. Arnold [9] considered statistical tests with a continuously distributed test statistic and determined a test to maximize the degree of satisfaction under particular fuzzy requirements. Saade and Schwarzlander [10] discussed hypothesis testing for hybrid data, which are composed of fuzzy data and crisp data. Grzegorzewski [11] presented a corresponding fuzzy testing method by using the fuzzy confidence intervals considered by Kruse and Meyer [12]. Filzmoser and Viertl [13] considered testing hypotheses with fuzzy data by means of the fuzzy p-value. Taheri and Arefi [14] introduced testing fuzzy parametric hypotheses according to a fuzzy test statistic. Wu [15] developed a testing rule as well as a step-by-step procedure based on fuzzy critical values and fuzzy p-values for assessing process performance. Parchami et al. [16] presented a method to test hypotheses by comparing a fuzzy p-value and a fuzzy significance level for problems with fuzzy hypotheses and crisp data. Alizadeh et al. [17] proposed a hypothesis test based on a likelihood ratio test for fuzzy hypotheses and fuzzy data. Saeidi et al. [18] considered the problem of testing a hypothesis on the basis of records in a fuzzy environment. Elsherif et al. [19] proposed an algorithm, based on a fuzzy test statistic, for testing a hypothesis when both hypotheses and data are fuzzy.
Habiger [20] developed a framework for the randomized p-value, mid-p-value and abstract randomized p-value, and multiple test function. Icen and Bacanli [21] presented a hypothesis test method for the mean of an inverse Gaussian distribution. In the presented method, confidence intervals obtained with the help of α-cuts are used to obtain a fuzzy test statistic. Yosefi et al. [22] presented an approach for testing fuzzy hypotheses based on a likelihood ratio test statistic. Parchami et al. [23] extended one-way ANOVA to the environment with symmetric triangular and normal fuzzy data. Hesamian and Akbari [24] presented an approach for intuitionistic fuzzy hypotheses by extending the type-I error, type-II error, power of the test, and p-value. Parchami et al. [25] presented a minimax approach to the problem of fuzzy hypotheses while data are crisp. Akbari and Hesamian [26] suggested a degree-based criterion to compare the fuzzy p-value and a specific significance level for making the decision to accept the null hypothesis or not. Kahraman et al. [27] developed interval-valued intuitionistic fuzzy confidence intervals for the population mean and for differences in the means of two populations. Haktanir and Kahraman [28] developed a Z-fuzzy hypothesis testing method. In the developed method, Z-fuzzy numbers are used to capture the vagueness in the sample data, and a Z-fuzzy number is represented by a restriction function that is usually a triangular or trapezoidal fuzzy number. Parchami [29] applied two R packages, "FPV" and "Fuzzy.p.value", to practical hypothesis-testing problems in which the data or hypotheses are fuzzy.
In Theorem 4 of Grzegorzewski [11], the fuzzy test for $H_0: \theta = \theta_0$ is defined through a testing function $\varphi$, where $\tilde{X}_1, \ldots, \tilde{X}_n$ are a fuzzy random sample, $\Pi_\alpha$ is the α-cut of the fuzzy confidence interval $\Pi$ for $\theta$, and $(\neg\Pi)_\alpha$ is the α-cut of the complement of the fuzzy confidence interval $\Pi$. Grzegorzewski [11] claims that the membership function of $\varphi$ is determined by the membership value with which the parameter value of the null hypothesis, $\theta_0$, falls in the fuzzy confidence interval $\Pi$. For example, we get $\varphi(t) = 0.4/0 + 0.6/1$ from Figure 1, and the result may be interpreted as "rather reject $H_0$". Grzegorzewski's approach uses the information on the right-hand side of the fuzzy confidence interval only. This means the testing method of Grzegorzewski [11] is simple but leaves room for improvement.
Figure 1. Testing function $\varphi$ of Grzegorzewski [11].
In classical statistical hypothesis testing, the sample data are substituted into a proper test statistic, the critical value for the test statistic is determined under a given significance level, and the rejection region is determined accordingly. When the observed value of the test statistic falls in the rejection region, the null hypothesis should be rejected. Otherwise, the null hypothesis should not be rejected. This is the so-called binary decision. Intuitively, when data are fuzzy, fuzzy testing methods should be developed by fuzzifying the corresponding classical statistical testing methods. Since the rejection region is a crisp set and the observed value of the test statistic is fuzzy, we can construct a reasonable testing approach to determine whether the fuzzy test statistic falls in the rejection region. Moreover, the proposed fuzzy testing method should reduce to the classical statistical testing method when the data are crisp. Based on these thoughts, the rest of this paper is organized as follows. Section 2 presents the method to determine whether the fuzzy test statistic falls into the rejection region. Section 3 presents the testing of the normal population to illustrate the real-life application of the proposed method. Section 4 gives examples to compare our proposed approach with the testing methods of Grzegorzewski [11] and Filzmoser and Viertl [13]. Conclusions and suggestions are drawn in Section 5.

Fuzzy Test Approach
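A fuzzy number can be defined as follows: given a fuzzy set $\tilde{A}$ of the real line $\mathbb{R}$ with membership function $\mu_{\tilde{A}}$, there exists an $x$ such that $\mu_{\tilde{A}}(x) = 1$ [...]. Following Kruse [33], the fuzzy observations may be treated as a fuzzy perception of the usual random sample $X_1, X_2, \ldots, X_n$ (see [11]), where $P_\theta$ is the population distribution of [...], and the observed sample data are fuzzy numbers [...].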


In the classical test, if the observed value of the test statistic $T$ falls in the rejection region $C$, then the null hypothesis should be rejected. Otherwise, the null hypothesis should not be rejected. Since $\tilde{T}$ is a fuzzy number, it is not clear whether $\tilde{T}$ falls into the rejection region $C$. To solve this problem, Filzmoser and Viertl [13] introduce the concept of a fuzzy p-value.
Each α-cut of $\tilde{T}$ corresponds to an α-cut of the fuzzy p-value, which is defined separately for the left-sided, the right-sided, and the two-sided testing problems.
Given the significance level $\gamma$, three cases arise: if the fuzzy p-value lies entirely below $\gamma$, $H_0$ is rejected; if it lies entirely above $\gamma$, $H_0$ is accepted; otherwise, $H_0$ is neither accepted nor rejected. Note that we cannot make a certain decision in the third case. In this paper, we define the possibility of $\tilde{T} \in C$ and propose another testing approach, so that the total information of the membership function of the test statistic can be used, and a fuzzy decision can be made in any case.
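The three-way outcome of the fuzzy p-value approach can be sketched for a right-sided test as follows. This is only an illustration under stated assumptions: the function names are hypothetical, the null distribution is taken to be standard normal, and the interval mapping relies solely on the fact that $P(T \ge t)$ decreases in $t$; it is not a transcription of the formulas in [13].

```python
from statistics import NormalDist


def right_sided_p_cut(t_lower, t_upper, null_dist=NormalDist()):
    """Map an alpha-cut [t_lower, t_upper] of the fuzzy statistic to the
    corresponding interval of right-sided p-values (assumed construction)."""
    # P(T >= t) is decreasing in t, so the endpoints swap roles.
    p_lower = 1.0 - null_dist.cdf(t_upper)
    p_upper = 1.0 - null_dist.cdf(t_lower)
    return p_lower, p_upper


def three_way_decision(p_lower, p_upper, gamma):
    """Reject, accept, or stay undecided at significance level gamma."""
    if p_upper < gamma:
        return "reject H0"      # the whole p-value interval is below gamma
    if p_lower > gamma:
        return "accept H0"      # the whole p-value interval is above gamma
    return "undecided"          # gamma lies inside the interval


# Hypothetical support of a fuzzy test statistic, for illustration only.
print(three_way_decision(*right_sided_p_cut(0.5, 3.0), gamma=0.05))
```

The "undecided" branch is the third case discussed in the text, which motivates a decision rule that always returns a graded answer.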
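Let $\tilde{A}$ be a fuzzy set of the real line $\mathbb{R}$ with membership function $\mu_{\tilde{A}}$. Zadeh [35] defines the probability of the fuzzy set $\tilde{A}$ as
$$P(\tilde{A}) = \int_{\mathbb{R}} \mu_{\tilde{A}}(y)\, dP, \qquad (2)$$
where $P$ is the probability measure of $Y$ on the real axis $\mathbb{R}$. Based on Equation (2), we can define:

Definition 1. The possibility of the value of the fuzzy test statistic $\tilde{T}$ falling in the rejection region $C$ is the ratio of the probability of $\tilde{T} \cap C$ to the probability of $\tilde{T}$:
$$P_0 = \frac{P(\tilde{T} \cap C)}{P(\tilde{T})}. \qquad (3)$$
When data reduce to crisp, the membership function of $\tilde{T}$ is concentrated at a single point. Then, the denominator of Equation (3) is zero, which means that Equation (3) is meaningless. In this case, the possibility is taken as 1 if the crisp value of the test statistic falls in $C$ and 0 otherwise. This is identical to the classical testing method.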
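Definition 1 can be explored numerically. The sketch below assumes that the possibility is the density-weighted ratio $\int_C \mu_{\tilde{T}}(t) f(t)\,dt \,/\, \int \mu_{\tilde{T}}(t) f(t)\,dt$, which is our reading of the definition rather than a confirmed formula, and it handles the crisp case by the classical 0/1 rule. The trapezoidal membership function, the midpoint integration rule, and all function names are illustrative choices.

```python
from statistics import NormalDist


def trapezoid_membership(a, b, c, d):
    """Membership function of the trapezoidal fuzzy number [a, b, c, d],
    assuming a <= b <= c <= d."""
    def mu(t):
        if t < a or t > d:
            return 0.0
        if t < b:
            return (t - a) / (b - a)
        if t <= c:
            return 1.0
        return (d - t) / (d - c)
    return mu


def integrate(func, lo, hi, steps=20000):
    """Composite midpoint rule on [lo, hi]."""
    h = (hi - lo) / steps
    return h * sum(func(lo + (i + 0.5) * h) for i in range(steps))


def possibility_of_rejection(support, mu, in_region, pdf):
    """Assumed reading of Definition 1: P0 = P(T and C) / P(T)."""
    lo, hi = support
    if lo == hi:
        # Crisp statistic: fall back to the classical binary decision.
        return 1.0 if in_region(lo) else 0.0
    weighted = lambda t: mu(t) * pdf(t)
    numerator = integrate(lambda t: weighted(t) if in_region(t) else 0.0, lo, hi)
    denominator = integrate(weighted, lo, hi)
    return numerator / denominator


# Hypothetical fuzzy statistic and a right-sided region t >= 1.645.
mu = trapezoid_membership(0.5, 1.5, 3.0, 4.0)
p0 = possibility_of_rejection((0.5, 4.0), mu, lambda t: t >= 1.645, NormalDist().pdf)
print(round(p0, 3))
```

Passing the rejection region as a predicate lets the same function cover left-sided and two-sided regions without changes.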

Fuzzy Testing of Hypotheses with Fuzzy Data
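Suppose the sample data are fuzzy numbers $\tilde{X}_1, \tilde{X}_2, \ldots, \tilde{X}_n$ with membership functions $\mu_{\tilde{X}_i}$. The membership function of the fuzzy test statistic $\tilde{T}$ can be obtained by using Zadeh's extension principle [34],
$$\mu_{\tilde{T}}(t) = \sup_{x \in X} \min\{\mu_{\tilde{X}_i}(x_i),\ i = 1, \ldots, n \mid t = T(x_1, \ldots, x_n)\}, \qquad (4)$$
where $X$ is the crisp universal set on which $\tilde{X}_i$ is defined. It is very difficult to deduce the exact membership function $\mu_{\tilde{T}}$ of $\tilde{T}$ because the functional relationship may be nonlinear. The approximate membership function $\mu_{\tilde{T}}$ can be derived by the approach of Liu and Kao [36]. Let $[(X_i)_\alpha^L, (X_i)_\alpha^U]$ be the α-cut of $\tilde{X}_i$; the lower and upper bounds of the α-cut of $\tilde{T}$ are
$$(T)_\alpha^L = \min\{T(x_1, \ldots, x_n) \mid (X_i)_\alpha^L \le x_i \le (X_i)_\alpha^U,\ i = 1, \ldots, n\}, \qquad (5a)$$
$$(T)_\alpha^U = \max\{T(x_1, \ldots, x_n) \mid (X_i)_\alpha^L \le x_i \le (X_i)_\alpha^U,\ i = 1, \ldots, n\}. \qquad (5b)$$
When all fuzzy data reduce to crisp values, Equations (5a) and (5b) become identical and $\tilde{T}$ reduces to $T$ in the classical model. Using Zadeh's extension principle [34], the membership function $\mu_{\tilde{T}}$ can be constructed from these bounds (Equation (6)), where $L(t)$ and $R(t)$ are the left and right shape functions of $\mu_{\tilde{T}}$, respectively.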
Suppose that a right-sided hypothesis is to be tested and the rejection region is the crisp set $C$ in the right tail. The probability of $\tilde{T}$, based on Equations (2) and (6), is defined as
$$P(\tilde{T}) = \int_{\mathbb{R}} \mu_{\tilde{T}}(t) f(t)\, dt, \qquad (7)$$
where $f(t)$ denotes the probability density function of the test statistic $T$. In Figure 2, based on Equations (3), (6), and (7), the possibility $P_0$ can be defined as
$$P_0 = \frac{\int_{C} \mu_{\tilde{T}}(t) f(t)\, dt}{\int_{\mathbb{R}} \mu_{\tilde{T}}(t) f(t)\, dt}.$$
Based on Equations (2), (3), (6), and (7), the possibilities $P_0$ for the cases in Figure 3 are shown in Table 1.
Figure 3. Five different types of membership functions of $\tilde{T}$ for the right-sided test.
Similarly, the possibility $P_0$ can be calculated for the left-sided test and the two-sided test. Figure 4 shows the five different types for the left-sided test, where the rejection region lies in the left tail. The definition of possibility $P_0$ is shown in Table 2.
The two-sided test involves fifteen different types of membership functions of $\tilde{T}$, as shown in Figures 5-7, where the crisp set $C$, consisting of both tails, is the rejection region. The definition of possibility $P_0$ is shown in Table 3.
A numerical method is therefore applied to determine the approximate values of $P_0$. As an illustration, we consider some fuzzy testing problems for the normal population with fuzzy data.

Single Normal Population with Known Population Variance
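Classical tests for the mean of a normal population generally assume that the observations are crisp. Suppose that the population variance is known; the test statistic for the null hypothesis about the population mean, $H_0: \mu = \mu_0$, is
$$Z = \frac{\bar{X} - \mu_0}{\sigma / \sqrt{n}}$$
for a normal population, where $\bar{X}$, $n$, and $\sigma$ are the sample mean, sample size, and the standard deviation of the population, respectively. When the data are measured imprecisely, the test statistic using fuzzy data becomes
$$\tilde{Z} = \frac{\bar{\tilde{X}} - \mu_0}{\sigma / \sqrt{n}}.$$
The exact membership function of the fuzzy test statistic $\tilde{Z}$ can be derived, since the functional relationship between $\tilde{Z}$ and $\tilde{X}_i$ is linear. When all the observations $\tilde{X}_i$ are trapezoidal fuzzy numbers, the α-cuts of $\tilde{X}_i$ can be represented as $[(X_i)_\alpha^L, (X_i)_\alpha^U]$, and
$$(Z)_\alpha^L = \frac{\frac{1}{n}\sum_{i=1}^{n} (X_i)_\alpha^L - \mu_0}{\sigma / \sqrt{n}}, \qquad (11a)$$
$$(Z)_\alpha^U = \frac{\frac{1}{n}\sum_{i=1}^{n} (X_i)_\alpha^U - \mu_0}{\sigma / \sqrt{n}}, \qquad (11b)$$
where $[(Z)_\alpha^L, (Z)_\alpha^U]$ is the α-cut of $\tilde{Z}$. Equations (11a) and (11b) are a pair of linear functions with bound constraints. The membership function $\mu_{\tilde{Z}}(z)$ follows from Equations (11a) and (11b) (Equation (12)). The probability of $\tilde{Z}$, based on Equations (2) and (12), is defined as
$$P(\tilde{Z}) = \int_{\mathbb{R}} \mu_{\tilde{Z}}(z) g(z)\, dz,$$
where $g(z)$ is the probability density function of the standard normal distribution $Z$. In Figure 9, the possibility $P_0$ is defined as
$$P_0 = \frac{\int_{C} \mu_{\tilde{Z}}(z) g(z)\, dz}{\int_{\mathbb{R}} \mu_{\tilde{Z}}(z) g(z)\, dz}.$$
Figure 9. The right-sided test for $\mu$ under fuzzy data.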
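Because the statistic is linear in the data, the fuzzy statistic for trapezoidal observations is itself trapezoidal and can be computed corner by corner. The following sketch illustrates this; the tuple representation `(a, b, c, d)`, the function names, and the sample values are assumptions made for the example and do not come from the paper.

```python
import math


def fuzzy_mean(samples):
    """Component-wise mean of trapezoidal fuzzy numbers given as (a, b, c, d)."""
    n = len(samples)
    return tuple(sum(s[k] for s in samples) / n for k in range(4))


def fuzzy_z(samples, mu0, sigma):
    """Trapezoidal fuzzy Z statistic for a known population standard deviation.
    The map x -> (x - mu0) * sqrt(n) / sigma is increasing, so corner order is kept."""
    scale = math.sqrt(len(samples)) / sigma
    return tuple((v - mu0) * scale for v in fuzzy_mean(samples))


def alpha_cut(trap, alpha):
    """Alpha-cut [lower, upper] of a trapezoidal fuzzy number, 0 <= alpha <= 1."""
    a, b, c, d = trap
    return a + alpha * (b - a), d - alpha * (d - c)


# Hypothetical fuzzy observations such as "roughly 2.5" and "roughly 4.5".
data = [(1, 2, 3, 4), (3, 4, 5, 6)]
z = fuzzy_z(data, mu0=3.0, sigma=1.0)
print(z, alpha_cut(z, 0.5))
```

When every observation is crisp, all four corners coincide and the result collapses to the classical Z value.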

Two Normal Populations with Known Population Variances
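This approach can also be applied to testing hypotheses concerning the difference between two normal population means. Assume the two population variances, $\sigma_X^2$ and $\sigma_Y^2$, are known. The classical test statistic of $H_0: \mu_X - \mu_Y = \delta_0$, where $\delta_0$ denotes the hypothesized difference, is
$$Z = \frac{(\bar{X} - \bar{Y}) - \delta_0}{\sqrt{\sigma_X^2 / n_X + \sigma_Y^2 / n_Y}} \qquad (14)$$
for two independent normal populations. Without loss of generality, assume all data ($\tilde{X}_i$ and $\tilde{Y}_j$) are trapezoidal fuzzy numbers for two independent normal populations with fuzzy data. Equation (14) for calculating the test statistic using fuzzy data becomes
$$\tilde{Z} = \frac{(\bar{\tilde{X}} - \bar{\tilde{Y}}) - \delta_0}{\sqrt{\sigma_X^2 / n_X + \sigma_Y^2 / n_Y}}.$$
Let $[(X_i)_\alpha^L, (X_i)_\alpha^U]$ and $[(Y_j)_\alpha^L, (Y_j)_\alpha^U]$ be the α-cuts of $\tilde{X}_i$ and $\tilde{Y}_j$. The bounds of the α-cut of $\tilde{Z}$ are
$$(Z)_\alpha^L = \frac{\frac{1}{n_X}\sum_{i} (X_i)_\alpha^L - \frac{1}{n_Y}\sum_{j} (Y_j)_\alpha^U - \delta_0}{\sqrt{\sigma_X^2 / n_X + \sigma_Y^2 / n_Y}}, \qquad (16a)$$
$$(Z)_\alpha^U = \frac{\frac{1}{n_X}\sum_{i} (X_i)_\alpha^U - \frac{1}{n_Y}\sum_{j} (Y_j)_\alpha^L - \delta_0}{\sqrt{\sigma_X^2 / n_X + \sigma_Y^2 / n_Y}}. \qquad (16b)$$
When all data are crisp values, Equations (16a) and (16b) become identical and reduce to Equation (14).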
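For the two-sample case, the α-cut of the fuzzy statistic follows from interval arithmetic: the smallest difference pairs the lower bound of one mean with the upper bound of the other. The sketch below assumes that each α-cut is supplied as a `(lower, upper)` pair; the function names and the symbol `delta0` for the hypothesized difference are illustrative assumptions.

```python
import math


def mean_cut(cuts):
    """Alpha-cut of the fuzzy sample mean from per-observation (lower, upper) cuts."""
    n = len(cuts)
    return sum(lo for lo, _ in cuts) / n, sum(hi for _, hi in cuts) / n


def two_sample_z_cut(x_cuts, y_cuts, var_x, var_y, delta0=0.0):
    """Alpha-cut of the fuzzy two-sample Z statistic with known variances."""
    x_lo, x_hi = mean_cut(x_cuts)
    y_lo, y_hi = mean_cut(y_cuts)
    se = math.sqrt(var_x / len(x_cuts) + var_y / len(y_cuts))
    # Smallest difference: low X against high Y; largest: high X against low Y.
    return (x_lo - y_hi - delta0) / se, (x_hi - y_lo - delta0) / se


# Hypothetical alpha-cuts at one fixed alpha level.
x_cuts = [(4.5, 5.5), (6.5, 7.5)]
y_cuts = [(3.5, 4.5), (5.5, 6.5)]
print(two_sample_z_cut(x_cuts, y_cuts, var_x=2.0, var_y=2.0))
```

Repeating the call over a grid of α levels would trace out the left and right shape functions of the fuzzy statistic.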

Single Normal Population with Unknown Population Variance
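The same concept can be applied to tests of the mean of a normal population when the population variance is unknown. In the classical statistical test procedure, suppose $\bar{X}$ and $S$ represent the mean and the standard deviation of the sample, respectively. If the null hypothesis $H_0: \mu = \mu_0$ is true, the test statistic
$$T = \frac{\bar{X} - \mu_0}{S / \sqrt{n}} \qquad (17)$$
has a t distribution with $n - 1$ degrees of freedom. When the observations are fuzzy, $\bar{\tilde{X}}$ and $\tilde{S}$ replace $\bar{X}$ and $S$ in Equation (17), and the fuzzy test statistic becomes
$$\tilde{T} = \frac{\bar{\tilde{X}} - \mu_0}{\tilde{S} / \sqrt{n}}.$$
The bounds of the α-cut of $\tilde{T}$,
$$(T)_\alpha^L = \min\left\{\frac{\bar{x} - \mu_0}{s / \sqrt{n}} \;\middle|\; (X_i)_\alpha^L \le x_i \le (X_i)_\alpha^U,\ i = 1, \ldots, n\right\}, \qquad (19a)$$
$$(T)_\alpha^U = \max\left\{\frac{\bar{x} - \mu_0}{s / \sqrt{n}} \;\middle|\; (X_i)_\alpha^L \le x_i \le (X_i)_\alpha^U,\ i = 1, \ldots, n\right\}, \qquad (19b)$$
where $\bar{x}$ and $s$ are the mean and standard deviation of $x_1, \ldots, x_n$, are a pair of nonlinear functions with bounded constraints. We can obtain the membership function of the fuzzy number $\tilde{T}$, $\mu_{\tilde{T}}(t)$, by using Zadeh's extension principle [34]. When all fuzzy data reduce to crisp values, Equations (19a) and (19b) become identical and reduce to Equation (17) in the classical model.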
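When the variance is unknown, the bounds of each α-cut of the fuzzy t statistic have no closed form, since the statistic is nonlinear in the data. As a rough illustration only, the sketch below approximates the bounds by a brute-force grid search over the box of α-cuts. This is a deliberately simple stand-in for a proper nonlinear programming solver, it can only under-estimate the true range, and the function names and data are hypothetical.

```python
import itertools
import math
import statistics


def t_statistic(xs, mu0):
    """Classical one-sample t statistic."""
    n = len(xs)
    return (statistics.fmean(xs) - mu0) / (statistics.stdev(xs) / math.sqrt(n))


def t_cut_grid(cuts, mu0, points=9):
    """Approximate alpha-cut [min T, max T] over the box defined by `cuts`,
    using an evenly spaced grid in each (lower, upper) interval."""
    grids = [
        [lo + k * (hi - lo) / (points - 1) for k in range(points)] if hi > lo else [lo]
        for lo, hi in cuts
    ]
    values = [
        t_statistic(xs, mu0)
        for xs in itertools.product(*grids)
        if len(set(xs)) > 1  # skip constant samples, where the deviation is zero
    ]
    return min(values), max(values)


# Hypothetical alpha-cuts of four fuzzy observations at one alpha level.
cuts = [(4.5, 4.7), (5.5, 5.7), (5.9, 6.1), (6.9, 7.1)]
print(t_cut_grid(cuts, mu0=4.0, points=5))
```

The number of grid points grows exponentially with the sample size, so this approach is only practical for very small samples.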

Two Normal Populations with Unknown but Equal Population Variances
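When the two normal population variances are unknown but equal, the test statistic for the null hypothesis about the difference between the two population means, $H_0: \mu_X - \mu_Y = \delta_0$, is
$$T = \frac{(\bar{X} - \bar{Y}) - \delta_0}{S_P \sqrt{1/n_X + 1/n_Y}}. \qquad (20)$$
$T$ has a t distribution with $n_X + n_Y - 2$ degrees of freedom, where $S_P^2$ represents the pooled sample variance, which is defined as
$$S_P^2 = \frac{(n_X - 1) S_X^2 + (n_Y - 1) S_Y^2}{n_X + n_Y - 2}.$$
When the observations are fuzzy, a natural test statistic substitutes $\tilde{S}_P^2$ for $S_P^2$, which is defined as
$$\tilde{S}_P^2 = \frac{(n_X - 1) \tilde{S}_X^2 + (n_Y - 1) \tilde{S}_Y^2}{n_X + n_Y - 2}.$$
Accordingly, Equation (20), for calculating the test statistic when using fuzzy data, becomes
$$\tilde{T} = \frac{(\bar{\tilde{X}} - \bar{\tilde{Y}}) - \delta_0}{\tilde{S}_P \sqrt{1/n_X + 1/n_Y}}. \qquad (21)$$
From Equation (21), the test statistic is also a fuzzy number. Let $(T)_\alpha^L$ and $(T)_\alpha^U$ be the minimum and maximum of Equation (20) over all $x_i$ and $y_j$ within the α-cuts of $\tilde{X}_i$ and $\tilde{Y}_j$ (Equations (22a) and (22b)); then $[(T)_\alpha^L, (T)_\alpha^U]$ is the α-cut of $\tilde{T}$. When all fuzzy data reduce to crisp values, Equations (22a) and (22b) become identical and reduce to Equation (20) in the classical model. The construction of the membership function $\mu_{\tilde{T}}$ and the fuzzy test procedure are the same as those for a single normal population with unknown population variance.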

Numerical Examples
To illustrate the application of the proposed fuzzy testing method described in Section 3, two examples are presented in this section; Example 1 is taken from Grzegorzewski [11]. Moreover, we compare the results with those of the testing methods of Grzegorzewski [11] and Filzmoser and Viertl [13].

Example 1
Four random samples [...], where $\bar{X}$ is the sample mean. Therefore, the fuzzy test statistic is [...], where $\bar{\tilde{X}}$ is the fuzzy sample mean. Based on Zadeh's extension principle [34], $\bar{\tilde{X}}$ is the trapezoidal fuzzy number [5, 5.75, 7, 7.75], the membership function [...]. The possibility of rejecting the null hypothesis is 0.901, which is quite high. Grzegorzewski [11] uses the membership function of the fuzzy confidence interval [...], representing that the possibility of accepting $H_0$ is 100%. Therefore, the result obtained by our proposed method is much more reasonable than that obtained by the testing method of Grzegorzewski [11]. The fuzzy p-value in Filzmoser and Viertl [13] approximates to the trapezoidal fuzzy number [$8.5 \times 10^{-9}$, $1.74 \times 10^{-5}$, 0.0505, 0.444], and in this case, we can neither accept nor reject $H_0$ and $H_a$ at significance level γ = 0.05.

Example 2
Consider the statistical model in Example 1, but to test [...] 4 [...]. Assume that the fuzzy sample is $x_1$ = "roughly 4.6", $x_2$ = "roughly 5.6", $x_3$ = "roughly 6", $x_4$ = "roughly [...], representing that the possibility of accepting $H_0$ is 100%. Apparently, the result obtained by our proposed method is much more reasonable than that obtained by the testing method of Grzegorzewski [11]. The fuzzy p-value in Filzmoser and Viertl [13] [...]. With the method of Filzmoser and Viertl [13], we conclude that $H_0$ is rejected at the level γ = 0.08, which is close to our result.
If the significance level is γ = 0.1, then Grzegorzewski [11], Filzmoser and Viertl [13], and our method reach the same conclusion that the null hypothesis $H_0$ is rejected.

Conclusions
In this paper, we propose a fuzzy test approach for the hypothesis testing of fuzzy data, which is an extension of the classical method of statistical hypothesis testing of crisp data. The proposed approach first uses the probability of fuzzy sets to define the possibility that the fuzzy test statistic falls in the rejection region, and then derives a fuzzy decision rule to determine whether the null hypothesis is to be rejected or not. Although the proposed approach is similar to the fuzzy testing method of Grzegorzewski [11], which is conducted by using confidence intervals, our method is more direct and clear since we use fuzzy test statistics directly. In addition, the latter only uses the single point information on the membership function of the fuzzy confidence interval to make a fuzzy decision rule for the possibility of accepting the null hypothesis. When we make decisions, the information used in our proposed approach is therefore more complete than that of Grzegorzewski [11]. Moreover, though our proposed approach is similar to the fuzzy testing method of Filzmoser and Viertl [13], in the latter approach, we can neither accept nor reject $H_0$ and $H_a$ at significance level γ if the value of the membership function of the fuzzy p-value at γ is not zero, while our method can make a clear decision in any case. Therefore, our method is more flexible and useful than that of Filzmoser and Viertl [13]. Although we only present the testing of a single normal population as the illustrative examples, our proposed approach can be applied to all the classical testing methods for fuzzy data. Therefore, the proposed approach is simple and useful.
Funding: This research received no external funding.

Conflicts of Interest: The authors declare no conflict of interest.