Next Article in Journal
A Malicious Webpage Detection Method Based on Graph Convolutional Network
Next Article in Special Issue
Adaptive Hyperparameter Fine-Tuning for Boosting the Robustness and Quality of the Particle Swarm Optimization Algorithm for Non-Linear RBF Neural Network Modelling and Its Applications
Previous Article in Journal
Pricing European and American Installment Options
Previous Article in Special Issue
Numerical Analysis of Alternating Direction Implicit Orthogonal Spline Collocation Scheme for the Hyperbolic Integrodifferential Equation with a Weakly Singular Kernel
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Improved Confidence Interval and Hypothesis Testing for the Ratio of the Coefficients of Variation of Two Uncorrelated Populations

by
Abbas Bahrampour
1,
Zeynab Avazzadeh
2,*,
Mohammad Reza Mahmoudi
3 and
António M. Lopes
4
1
Department of Biostatistics and Epidemiology, Kerman University of Medical Sciences Research Center for Modeling in Health, Institute for Futures Studies in Health, Kerman 7616913555, Iran
2
Department of Biostatistics and Epidemiology, School of Public Health, Kerman University of Medical Sciences, Kerman 7616914115, Iran
3
Department of Statistics, Faculty of Science, Fasa University, Fasa 7461686131, Iran
4
LAETA/INEGI, Faculty of Engineering, University of Porto, 4200-465 Porto, Portugal
*
Author to whom correspondence should be addressed.
Mathematics 2022, 10(19), 3495; https://doi.org/10.3390/math10193495
Submission received: 2 August 2022 / Revised: 5 September 2022 / Accepted: 22 September 2022 / Published: 25 September 2022
(This article belongs to the Special Issue Mathematical Methods for Nonlinear Dynamics)

Abstract

:
One of the most accessible and useful statistical tools for comparing independent populations in different research areas is the coefficient of variation (CV). In this study, first, the asymptotic distribution of the ratio of CV of two uncorrelated populations is investigated. Then, the outputs are used to create a confidence interval and to establish a test of hypothesis about the CV ratio of the populations. The proposed approach is compared with an alternative method, showing its superiority and effectiveness.

1. Introduction

According to the literature, three fundamental measures are used to explain a dataset (random variable). These include central, shape and dispersion tendencies. By obtaining the value of the central tendency, we can know how a random variable is gathered around a central value. The mean, median and mode are the most used criteria to express the central tendency’s measures. Criteria such as range, variance and standard deviation can be used to measure the dispersion of a random variable. In some literature, this is called the random variable distribution scale. Another need of statisticians is to know how a random variable is distributed, or to know its pattern shape, which can be addressed by the use of statistical measures such as kurtosis or skewness.
The coefficient of variation (CV) is obtained by dividing the population standard deviation by the population mean, CV = σ/μ, being an applicable and suitable statistic for evaluating relative variability. The CV is a free parameter that is used in many areas, such as agronomy, biology, engineering, finance, medicine and others, as an indicator of reliability or variability [1,2,3]. In many cases, relating standard deviation to the level of measurement is of great importance to researchers. For this reason, the CV is widely used to measure dispersion. When studying several independent populations, knowing how their CVs are compared is essential. This becomes even more important when populations have skewed distributions. In practical matters, statisticians may be interested in comparing two independent populations’ CV to better understand the data structure. In situations where the means or variances of independent populations are equal, the analysis of variance and Levene’s tests are employed to examine the equality of CVs. If each population has a different mean and variance, it is clearly possible that their CVs are equal. There have been many studies comparing two or more CVs, which we will discuss below. When two CVs are small, the differences between them are likely to be minor, and these minor potential differences can lead to inability to make powerful or definite conclusions. Therefore, the CV ratio is more accurate than the CV difference.
The probability ratio test was introduced by Bennett [4], and later modified by Schaefer and Sullivan [5]. Scientists such as Doornbos et al. [6] and Hedges et al. [7] proposed tests that relied on decentralized meta-analysis. In [8,9,10], some methods were introduced which are known as Wald tests. Pardo and Pardo [11] were able to propose a new method for comparing CVs based on the Rényi divergence. Some researchers, such as Nairy and Rao [12], investigated the inverse CV. Another method proposed for likelihood ratio testing is based on Newton one-step estimators, performed by Verrill and Johnson [13]. The parametric bootstrap (PB) approach was also considered by Jafari and Kazemi [14]. Other statisticians continued these studies and were able to improve them for symmetric distributions, as presented in [15,16,17,18,19,20,21,22,23]. However, comparing two CVs and, in more complex cases, multiple CVs is challenging in practice [24,25]. The approximate interval for the ratio of two CVs in the lognormal distribution was obtained by Nam and Kwon [25]. They obtained the estimate using the Fieller-type log method of variance estimates recovery (MOVER) and the Wald-type method. Two other researchers, Jiang and Wong [26], used the Barlett’s modified likelihood ratio simulation method to make inferences about the ratio of two CVs in the lognormal distribution.
There are many phenomena in the real world that have symmetrical distributions. Due to this reason, most previous studies have focused just on a special type of distributions, especially on the normal distribution. Yue and Baleanu [27] inferred about the CV ratio of two independent symmetric or asymmetric distributions. They studied the asymptotic distribution for the ratio of two populations’ CV. Then, the outputs were used to create the confidence interval and to test the hypothesis about the ratio of CV of the populations.
In this paper, we adopt a procedure similar to those employed in other studies [28,29,30,31,32] to improve the method given by Yue and Baleanu [27]. The rational estimation for the ratio of CV of two populations given by Yue and Baleanu is considered and its asymptotic distribution is investigated. In computing the asymptotic variance, a formulation different from that given by Yue and Baleanu [27] is proposed. Then, using the asymptotic distribution of the given estimator, the confidence interval and hypothesis testing about the ratio of CV of two populations are studied. The new formulation leads to a more powerful estimator and a more accurate confidence interval than those in [27].
The paper is organized as follows: In Section 2, first, a rational estimation for the ratio of CV of two populations is given. Then, the asymptotic distribution of this estimator is considered. Finally, using the asymptotic distribution of the given estimator, the confidence interval and hypothesis testing about the ratio of CV of two populations are studied. In Section 3, various simulated datasets are reviewed and analyzed to verify the accuracy of the theoretical findings. The applicability of the proposed technique in real situations is studied in Section 4. Additionally, Section 5 includes the concluding remarks.

2. Materials and Methods

2.1. Asymptotic Outcomes

Let us consider two uncorrelated variables X and Y , with non-zero means of μ X and μ Y . Their finite i th central moments are, respectively:
μ i X = E ( X μ X ) i ,   μ i Y = E ( Y μ Y ) i ,   i 2 , 3 , 4 .
Furthermore, suppose that X 1 , , X m and Y 1 , , Y n are samples that are identically and independently distributed from X and Y , respectively. According to what was mentioned in Section 1, given that C V X and C V Y are the CV of X and Y , respectively, the parameter
γ = C V Y C V X
is compelling for statistical inference.
Suppose that
m i X = 1 m k = 1 m x k x ¯ i ,   m i Y = 1 n k = 1 n y k y ¯ i ,   i 2 , 3 , 4 ,  
then, C V X and C V Y are consistently estimated [27,33] by C V ^ X = m 2 X X ¯ and C V ^ Y = m 2 Y Y ¯ , respectively. Therefore, it follows that
γ ^ = C V ^ Y C V ^ X
has the ability to estimate the parameter γ reasonably. For convenience,   n = m can be defined. At n m ,   n * = m i n m , n can be used instead of m and n.
Lemma 1.
If the mentioned assumptions are accepted, thus
n C V ^ X C V X L N 0 , δ X 2 ,   a s   n ,
where
δ X 2 = μ 4 X μ 2 X 2 4 μ X 2 μ 2 X μ 3 X μ X 3 + μ 2 X 2 μ 4 X
represents the asymptotic variance.
Proof. 
The proof sketch is given in [33]. □
In the subsequent theorem, Yue and Baleanu [33] used the asymptotic distribution of γ ^ to build the confidence interval and test the hypothesis about γ.
Lemma 2.
If the previous assumptions are accepted, thus
n γ ^ γ L N 0 , λ 2 ,   a s   n ,
where
λ 2 = 1 C V X 2 γ 2 δ X 2 + δ Y 2 ,
and
δ Y 2 = μ 4 Y μ 2 Y 2 4 μ Y 2 μ 2 Y μ 3 Y μ Y 3 + μ 2 Y 2 μ 4 Y .
Proof of Lemma 2.
The proof sketch is given in [27]. □
It can be observed that the asymptotic variance λ 2 depends on the unknown parameters C V X , δ X 2 , δ Y 2 and γ. Yue and Baleanu [33] estimated λ 2 by
λ ^ 2 = 1 C ^ V X 2 γ ^ 2 δ ^ X 2 + δ ^ Y 2 ,
where
δ ^ X 2 = m 4 X m 2 X 2 4 X ¯ 2 m 2 X m 3 X X ¯ 3 + m 2 X 2 m 4 X ,
and
δ ^ Y 2 = m 4 Y m 2 Y 2 4 Y ¯ 2 m 2 Y m 3 Y Y ¯ 3 + m 2 Y 2 m 4 Y .
They used Slutsky’s theorem and proved that λ ^ p λ , as n .
In this work, we rewrite the asymptotic variance λ 2 in an alternative form and re-estimate it using a more robust estimator. The asymptotic variance λ 2 can be represented by
λ 2 = 1 γ 2 δ X C V X 2 + δ Y C V Y 2 .
Therefore, we can construct the asymptotic distribution as
T n = n γ ^ γ λ = n δ X C V X 2 + δ Y C V Y 2 1 / 2 γ ^ γ 1 L N 0 , 1 ,   a s   n .
Theorem 1.
If the previous assumptions are accepted, thus
T n * = n γ ^ γ λ ^ o p t L N 0 , 1 ,   a s   n ,
where
λ ^ o p t 2 = 1 γ ^ 2 S X C V ^ X 2 + S Y C V ^ Y 2 .
Proof of Theorem 1.
Due to the weak law of large numbers, it is clear that
X ¯ p μ X ,   Y ¯ p μ Y ,   m i X p μ i X ,   m i Y p μ i Y ,   i 2 , 3 , 4 ,
as  n .
As a result, by using Slutsky’s theorem, we know that λ ^ o p t p λ , as n . With the help of Lemma 2 the proof is completed.

2.1.1. Building the Confidence Interval

To construct an asymptotic confidence interval for γ, we need a pivotal quantity for the parameter γ. Based on the previous results, T is a pivotal quantity for γ. Therefore, T can be used for this purpose, as follows:
γ ^ 1 + Z α / 2 n S X C V ^ X 2 + S Y C V ^ Y 2 1 / 2 , γ ^ 1 Z α / 2 n S X C V ^ X 2 + S Y C V ^ Y 2 1 / 2 .

2.1.2. Hypothesis Testing

In practical situations, scientists need to test the parameter γ. As a particular instance, the null hypothesis H 0 : γ = 1 shows that the CVs are the same in both populations. To accomplish the hypothesis test H 0 : γ =   γ 0 , the test statistic
T 0 = n S X C V ^ X 2 + S Y C V ^ Y 2 1 2 γ ^ γ 0 1    
can be generally applied.
If the null hypothesis H 0 :   γ = γ 0 is verified, thus the asymptotic distribution of T 0 is standard normal.
It should be noted that this method can be used for all distributions, because asymptotic methods have been used and the technique includes all exponential or non-exponential distributions. Moreover, the method is also appropriate if the populations have different distributions.

3. Results

3.1. Simulation Study

In this section, various simulated datasets are reviewed and analyzed to verify the accuracy of the theoretical findings. To compare the results obtained with the proposed method and those from the method given by Yue and Baleanu [27], we consider the distributions used therein. To simulate different samples of one symmetric and two asymmetric distributions, populations X and Y are considered. For this purpose, we selected the normal, and the gamma and beta distributions, respectively, with a variety of values of CV, C V X , C V Y 1 , 1 , 1 , 2 , 2 , 3 , 2 , 5 , that is, identical to γ 1 , 2 , 1.5 , 2.5 . Figure 1, Figure 2 and Figure 3 depict the probability density function (PDF) for the intended distributions.
The simulations are performed using the software R 3.3.2, adopting 1000 repetitions.
For investigating the accuracy of the proposed method, we compute the coverage probability, by
CP = number   of   repetitions   where   proposed   confidence   interval   includes   right   γ 1000 .
Furthermore, the value of test statistic T 0 is calculated. Then, we use the Shapiro–Wilk test to check the normality. After that, Q–Q plots are drawn to confirm the normality assumption for the offered test statistic. Table 1 summarizes the values of CP of the proposed method and those of the method by Yue and Baleanu [27], for various settings of parameters.

3.2. Results

From Table 1, we conclude that both methods have been successful in controlling type I error. However, as can be seen, our method achieved better results.
From Table 2, we verify that the average length of the confidence interval of our method is relatively smaller than the length of the confidence interval of Ballino’s method. Additionally, as the sample size increases, the length of the confidence interval decreases.
From Table 1, we verify that the CP of the proposed method is close to the intended level (1 − α = 0.95). This is more visible when the sample size increases. As a result, the introduced approach controls type I error. This means that about 95% of the simulated confidence intervals contain true γ, so it is accepted that the proposed confidence is an asymptomatic confidence interval for γ. Moreover, since the CP of the proposed method is closer to the intended level than the one of the alternative method [27], our approach is more robust. Table 3 shows the CPU time needed for running the proposed and the Yue and Baleanu [27] methods, for various combinations of parameters. We verify that our approach is faster than the comparative one.
Table 4 summarizes the p-values for assessing the normality of the statistic T0. We verify that all p-values are greater than 0.05, meaning that the normality of the introduced statistical test is confirmed by the Shapiro–Wilk test. This is also confirmed in the Q–Q plots depicted in Figure 4. Indeed, the points are around the direct line, so the observed values (quantiles) are very much like the quantiles of the normal distribution. Accordingly, the simulation outcomes verified that the asymptotic theoretical findings seem to be satisfied towards a parameter setting. As a result, our method is a suitable selection to perform a test hypothesis and build a confidence interval for the CV ratio of two independent populations.
As the sample size increases, the power also increases. From Table 5, it can also be seen that our method is more powerful than the one by Yue and Baleanu [27].

4. Discussion

4.1. Application

The applicability of the proposed technique in real situations is studied. For this purpose, we analyze a dataset containing two features: The humidity (%) and the scale of mouth, foot and hand disease outbreak from 2010 until 2017 for two regions in Malaysia (Sarawak and Malacca). Details about the datasets can be found in [34,35]. Table 6 and Figure 5 report the descriptive statistics and histograms of the features. The results show that the CVs of the humidity in Sarawak and Malacca are 0.8872 and 1.0788, respectively. The CVs of the disease outbreak in Sarawak and Malacca are 0.0107 and 0.0159, respectively.

4.2. Discussion

The proposed approach is applied to provide 95% confidence intervals for the ratio of the CV of the two considered features in Malacca with respect to Sarawak. The computed confidence intervals are reported in Table 7. It can be observed that the intervals (0.973, 1.459) and (1.189, 1.783) are the 95% confidence intervals for the ratio of the CV of the disease outbreak and the ratio of the CV of the humidity in Malacca with respect to Sarawak. Since the lower bound (1.189, 1.783) is greater than 1, it can be concluded that the CV of the humidity in Malacca is significantly higher than the CV of the humidity in Sarawak. Moreover, since the interval (0.973, 1.459) contains the value 1, the hypothesis of the equality of the CV of the disease outbreak in Sarawak and Malacca cannot be rejected.
It is not necessary that the distribution of x and y be the same, and it is omitted from the article and the table due to length. Therefore, this method can be used when the x and y distributions are different.
As can be seen in Table 7, the confidence interval of our method is slightly smaller than the confidence interval of Yue and Baleanu’s method. This is because we wrote the variance in such a way that not only simplifies the calculations, but also increases the power.

5. Conclusions

To compare populations, the CV is a useful, convenient and simple tool. In countless cases, two populations have the same CV, despite different means and variances. One way to understand the data structure is to study the equality of the two separate populations’ CVs. However, when the difference between the two CVs is slight, it does not provide a solid and valuable interpretation. For this reason, the CV ratio is used, which is more accurate. Asymptotic distribution was proposed in this paper, and then the test of hypothesis and the asymptotic confidence interval for the CV ratio of two independent populations were extracted. The findings showed that the probability of coverage is very close to the desired level when increasing the sample. Based on this, it can be concluded that the introduced methodology controls the type I error. In addition, CPU times confirmed that the proposed method does not involve excessive computational burden. In addition, the normality of the introduced test statistics was confirmed using the normal Shapiro–Wilk test and Q–Q plots. According to the results, it was observed that the asymptotic approximation acted very well for the whole simulated dataset. Moreover, the proposed method is more powerful than the comparative method given by Yue and Baleanu [27].

Author Contributions

Conceptualization, A.B., Z.A. and M.R.M.; Investigation, A.B., Z.A., M.R.M. and A.M.L.; Methodology, A.B., Z.A., M.R.M. and A.M.L.; Software, Z.A.; Writing—original draft, A.B., Z.A., M.R.M. and A.M.L.; Writing—review & editing, A.M.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Meng, Q.; Yan, L.; Chen, Y.; Zhang, Q. Generation of Numerical Models of Anisotropic Columnar Jointed Rock Mass Using Modified Centroidal Voronoi Diagrams. Symmetry 2018, 10, 618. [Google Scholar] [CrossRef]
  2. Aslam, M.; Aldosari, M.S. Inspection Strategy under Indeterminacy Based on Neutrosophic Coefficient of Variation. Symmetry 2019, 11, 193. [Google Scholar] [CrossRef]
  3. Iglesias-Caamaño, M.; Carballo-López, J.; Álvarez-Yates, T.; Cuba-Dorado, A.; García-García, O. Intrasession Reliability of the Tests to Determine Lateral Asymmetry and Performance in Volleyball Players. Symmetry 2018, 10, 416. [Google Scholar] [CrossRef]
  4. Bennett, B.M. On an approximate test for homogeneity of coefficients of variation. In Contribution to Applied Statistics; Ziegler, W.J., Ed.; Birkhauser Verlag: Basel, Switzerland; Stuttgart, Germany, 1976; pp. 169–171. [Google Scholar]
  5. Shafer, N.J.; Sullivan, J.A. A simulation study of a test for the equality of the coefficients of variation. Commun. Stat. Simul. Comput. 1986, 15, 681–695. [Google Scholar] [CrossRef]
  6. Doornbos, R.; Dijkstra, J.B. A multi sample test for the equality of coefficients of variation in normal populations. Commun. Stat. Simul. Comput. 1983, 12, 147–158. [Google Scholar] [CrossRef]
  7. Hedges, L.; Olkin, I. Statistical Methods for Meta-Analysis; Academic Press: Orlando, FL, USA, 1985. [Google Scholar]
  8. Rao, K.A.; Vidya, R. On the performance of test for coefficient of variation. Calcutta. Stat. Assoc. Bull. 1992, 42, 87–95. [Google Scholar] [CrossRef]
  9. Gupta, R.C.; Ma, S. Testing the equality of coefficients of variation in k normal populations. Commun. Stat. Theory Methods 1996, 25, 115–132. [Google Scholar] [CrossRef]
  10. Rao, K.A.; Jose, C.T. Test for equality of coefficient of variation of k populations. In Proceedings of the 53rd Session of International Statistical Institute, Seoul, Korea, 22–29 August 2001. [Google Scholar]
  11. Pardo, M.C.; Pardo, J.A. Use of Rényi’s divergence to test for the equality of the coefficient of variation. J. Comput. Appl. Math. 2000, 116, 93–104. [Google Scholar] [CrossRef]
  12. Nairy, K.S.; Rao, K.A. Tests of coefficient of variation of normal population. Commun. Stat. Simul. Comput. 2003, 32, 641–661. [Google Scholar] [CrossRef]
  13. Verrill, S.; Johnson, R.A. Confidence bounds and hypothesis tests for normal distribution coefficients of variation. Commun. Stat. Theory Methods 2007, 36, 2187–2206. [Google Scholar] [CrossRef]
  14. Jafari, A.A.; Kazemi, M.R. A parametric bootstrap approach for the equality of coefficients of variation. Comput. Stat. 2013, 28, 2621–2639. [Google Scholar] [CrossRef]
  15. Feltz, G.J.; Miller, G.E. An asymptotic test for the equality of coefficients of variation from k normal populations. Stat. Med. 1996, 15, 647–658. [Google Scholar] [CrossRef]
  16. Fung, W.K.; Tsang, T.S. A simulation study comparing tests for the equality of coefficients of variation. Stat. Med. 1998, 17, 2003–2014. [Google Scholar] [CrossRef]
  17. Tian, L. Inferences on the common coefficient of variation. Stat. Med. 2005, 24, 2213–2220. [Google Scholar] [CrossRef]
  18. Forkman, J. Estimator and Tests for Common Coefficients of Variation in Normal Distributions. Commun. Stat. Theory Methods 2009, 38, 233–251. [Google Scholar] [CrossRef]
  19. Liu, X.; Xu, X.; Zhao, J. A new generalized p-value approach for testing equality of coefficients of variation in k normal populations. J. Stat. Comput. Simul. 2011, 81, 1121–1130. [Google Scholar] [CrossRef]
  20. Krishnamoorthy, K.; Lee, M. Improved tests for the equality of normal coefficients of variation. Comput. Stat. 2013, 29, 215–232. [Google Scholar]
  21. Jafari, A.A. Inferences on the coefficients of variation in a multivariate normal population. Commun. Stat. Theory Methods 2015, 44, 2630–2643. [Google Scholar] [CrossRef]
  22. Hasan, M.S.; Krishnamoorthy, K. Improved confidence intervals for the ratio of coefficients of variation of two lognormal distributions. J. Stat. Theory Appl. 2017, 16, 345–353. [Google Scholar] [CrossRef]
  23. Shi, X.; Wong, A. Accurate tests for the equality of coefficients of variation. J. Stat. Comput. Simul. 2018, 88, 3529–3543. [Google Scholar] [CrossRef]
  24. Miller, G.E. Use of the squared ranks test to test for the equality of the coefficients of variation. Commun. Stat. Simul. Comput. 1991, 20, 743–750. [Google Scholar] [CrossRef]
  25. Nam, J.; Kwon, D. Inference on the ratio of two coefficients of variation of two lognormal distributions. Commun. Stat. Theory Methods 2016, 46, 8575–8587. [Google Scholar] [CrossRef]
  26. Wong, A.; Jiang, L. Improved Small Sample Inference on the Ratio of Two Coefficients of Variation of Two Independent Lognormal Distributions. J. Probab. Stat. 2019, 2019, 7173416. [Google Scholar] [CrossRef]
  27. Yue, Z.; Baleanu, D. Inference about the Ratio of the Coefficients of Variation of Two Independent Symmetric or Asymmetric Populations. Symmetry 2019, 11, 824. [Google Scholar] [CrossRef]
  28. Haghbin, H.; Mahmoudi, M.R.; Shishebor, Z. Large Sample Inference on the Ratio of Two Independent Binomial Proportions. J. Math. Ext. 2011, 5, 87–95. [Google Scholar]
  29. Mahmoudi, M.R.; Mahmoodi, M. Inference on the Ratio of Variances of Two Independent Populations. J. Math. Ext. 2014, 7, 83–91. [Google Scholar]
  30. Mahmoudi, M.R.; Mahmoodi, M. Inference on the Ratio of Correlations of Two Independent Populations. J. Math. Ext. 2014, 7, 71–82. [Google Scholar]
  31. Mahmoudi, M.R.; Nasirzadeh, R.; Mohammadi, M. On the Ratio of Two Independent Skewnesses. Commun. Stat. Theory Methods 2019, 48, 1721–1727. [Google Scholar] [CrossRef]
  32. Mahmoudi, M.R.; Behboodian, J.; Maleki, M. Large Sample Inference about the Ratio of Means in Two Independent Populations. J. Stat. Theory Appl. 2017, 16, 366–374. [Google Scholar] [CrossRef]
  33. Ferguson Thomas, S. A Course in Large Sample Theory; Chapman & Hall: London, UK, 1996. [Google Scholar]
  34. Nelson, B.R.; Edinur, H.A.; Abdullah, M.T. Compendium of hand, foot and mouth disease data in Malaysia from years 2010 to 2017. Data Brief 2019, 24, 103868. [Google Scholar] [CrossRef]
  35. Mahmoudi, M.R.; Tuan, B.A.; Pho, K.H. On kurtoses of two symmetric or asymmetric populations. J. Comput. Appl. Math. 2021, 391, 113370. [Google Scholar] [CrossRef]
Figure 1. The PDF of the distribution normal (μ, σ 2 ) for different values of CV (black: CV = 1; red: CV = 2; green: CV = 3; blue: CV = 5).
Figure 1. The PDF of the distribution normal (μ, σ 2 ) for different values of CV (black: CV = 1; red: CV = 2; green: CV = 3; blue: CV = 5).
Mathematics 10 03495 g001
Figure 2. The PDF of the distribution gamma (α, λ) for different values of CV (blue: α = 0.04, λ = 0.001, CV = 5; green: α = 0.11, λ = 0.001, CV = 3; red: α = 0.25, λ = 0.001, CV = 2; black: α = 1, λ = 0.001, CV = 1).
Figure 2. The PDF of the distribution gamma (α, λ) for different values of CV (blue: α = 0.04, λ = 0.001, CV = 5; green: α = 0.11, λ = 0.001, CV = 3; red: α = 0.25, λ = 0.001, CV = 2; black: α = 1, λ = 0.001, CV = 1).
Mathematics 10 03495 g002
Figure 3. PDF of distribution of beta (α, β) in the presence of different values of CV (blue: α = 0.009, β = 0.285, CV = 5; green: α = 0.08, β = 2.51; red: α = 0.21, β = 6.87, CV = 2; black: α = 0.94, β = 30.39, CV = 1).
Figure 3. PDF of distribution of beta (α, β) in the presence of different values of CV (blue: α = 0.009, β = 0.285, CV = 5; green: α = 0.08, β = 2.51; red: α = 0.21, β = 6.87, CV = 2; black: α = 0.94, β = 30.39, CV = 1).
Mathematics 10 03495 g003
Figure 4. The Q–Q plots used to investigate the normality of the statistic T 0 . First column: Up: ( C V X , C V Y ) = 1 , 1 and m , n = 50 , 100 ; middle: ( C V X , C V Y ) = 1 , 2 and m , n = 75 , 100 ; down: ( C V X , C V Y ) = 2 , 3 and m , n = 100 , 100 ; Second column: Up: ( C V X , C V Y ) = 1 , 2 and m , n = 100 , 200 ; middle: ( C V X , C V Y ) = 2 , 3 and m , n = 200 , 300 ; down: ( C V X , C V Y ) = 2 , 5 and m , n = 500 , 500 ; Third column: Up: ( C V X , C V Y ) = 1 , 1 and m , n = 500 , 700 ; middle: ( C V X , C V Y ) = 1 , 2 and m , n = 700 , 1000 ; down: ( C V X , C V Y ) = 2 , 5 and m , n = 1000 ,   1000 .
Figure 4. The Q–Q plots used to investigate the normality of the statistic T 0 . First column: Up: ( C V X , C V Y ) = 1 , 1 and m , n = 50 , 100 ; middle: ( C V X , C V Y ) = 1 , 2 and m , n = 75 , 100 ; down: ( C V X , C V Y ) = 2 , 3 and m , n = 100 , 100 ; Second column: Up: ( C V X , C V Y ) = 1 , 2 and m , n = 100 , 200 ; middle: ( C V X , C V Y ) = 2 , 3 and m , n = 200 , 300 ; down: ( C V X , C V Y ) = 2 , 5 and m , n = 500 , 500 ; Third column: Up: ( C V X , C V Y ) = 1 , 1 and m , n = 500 , 700 ; middle: ( C V X , C V Y ) = 1 , 2 and m , n = 700 , 1000 ; down: ( C V X , C V Y ) = 2 , 5 and m , n = 1000 ,   1000 .
Mathematics 10 03495 g004
Figure 5. The humidity and the scale of mouth, foot and hand disease outbreak in Sarawak and Malacca.
Figure 5. The humidity and the scale of mouth, foot and hand disease outbreak in Sarawak and Malacca.
Mathematics 10 03495 g005aMathematics 10 03495 g005b
Table 1. The values of CP of the proposed and the Yue and Baleanu [27] methods, for various settings of parameters.
Table 1. The values of CP of the proposed and the Yue and Baleanu [27] methods, for various settings of parameters.
m , n Mean Absolute Difference from 0.95
Distribution C V X , C V Y Method(50,100)(75,100)(100,100)(100,200)(200,300)(500,500)(500,700)(700,1000)(1000,1000)
Normal 1 , 1 Proposed0.9470.9490.9490.9490.9510.9500.9490.9500.9500.001
Yue and Baleanu0.9450.9470.9480.9510.9530.9570.9590.9600.9510.005
1 , 2 Proposed0.9480.9490.9490.9510.9500.9500.9500.9490.9510.001
Yue and Baleanu0.9450.9480.9480.9520.9530.9530.9580.9590.9520.005
2 , 3 Proposed0.9470.9480.9480.9490.9500.9500.9500.9500.9500.001
Yue and Baleanu0.9440.9480.9490.9530.9530.9560.9590.9610.9520.006
2 , 5 Proposed0.9470.9490.9490.9500.9500.9500.9510.9510.9500.001
Yue and Baleanu0.9460.9500.9500.9500.9550.9540.9560.9600.9520.004
Gamma 1 , 1 Proposed0.9470.9500.9500.9510.9500.9500.9500.9500.9510.001
Yue and Baleanu0.9460.9480.9480.9520.9560.9560.9580.9610.9520.006
1 , 2 Proposed0.9460.9490.9490.9510.9500.9500.9500.9490.9500.001
Yue and Baleanu0.9470.9490.9490.9510.9540.9550.9580.9610.9510.005
2 , 3 Proposed0.9460.9490.9500.9500.9490.9500.9510.9490.9500.001
Yue and Baleanu0.9470.9500.9500.9520.9530.9560.9590.9610.9520.005
2 , 5 Proposed0.9460.9480.9490.9500.9510.9490.9490.9490.9500.002
Yue and Baleanu0.9450.9490.9490.9520.9560.9570.9580.9620.9530.006
Beta 1 , 1 Proposed0.9470.9500.9500.9500.9510.9500.9500.9500.9500.001
Yue and Baleanu0.9440.9500.9500.9500.9540.9550.9580.9610.9540.005
1 , 2 Proposed0.9470.9490.9490.9490.9490.9500.9510.9510.9490.001
Yue and Baleanu0.9460.9480.9490.9520.9540.9560.9570.9600.9530.005
2 , 3 Proposed0.9460.9490.9490.9500.9500.9500.9500.9500.9500.001
Yue and Baleanu0.9450.9470.9480.9520.9540.9560.9580.9590.9520.005
2 , 5 Proposed0.9480.9490.9500.9500.9490.9500.9510.9500.9500.001
Yue and Baleanu0.9450.9480.9480.9510.9540.9550.9560.9600.9520.005
Mean Absolute Difference from 0.95Proposed0.0030.0010.0020.0010.0010.0010.0010.0010.0020.001
Yue and Baleanu0.0050.0020.0030.0020.0040.0020.0080.0100.0040.005
Table 2. The values of average lengths of the proposed and the Yue and Baleanu [27] methods, for various settings of parameters.
Table 2. The values of average lengths of the proposed and the Yue and Baleanu [27] methods, for various settings of parameters.
Distribution C V X , C V Y Method m , n
(50,100)(75,100)(100,100)(100,200)(200,300)(500,500)(500,700)(700,1000)(1000,1000)
Normal 1 , 1 Proposed0.1940.1730.1420.1390.1140.0920.0840.0730.063
Yue and Baleanu0.3850.3460.3030.2560.2040.1980.1740.1510.135
1 , 2 Proposed0.1930.1720.1590.1240.1070.0990.0890.0730.063
Yue and Baleanu0.3660.3270.3120.2650.2250.1960.1710.1480.123
2 , 3 Proposed0.1850.1610.1450.1360.1190.0970.0860.0770.061
Yue and Baleanu0.3830.3360.2940.2530.2310.1870.1620.1530.134
2 , 5 Proposed0.2000.1700.1560.1320.1150.0920.0820.0710.061
Yue and Baleanu0.3750.3500.2910.2490.2020.1950.1670.1480.124
Gamma 1 , 1 Proposed0.1950.1750.1480.1230.1120.0920.0860.0730.061
Yue and Baleanu0.3760.3250.2800.2610.2170.1810.1770.1550.133
1 , 2 Proposed0.1890.1740.1460.1370.1080.0960.0800.0760.069
Yue and Baleanu0.3980.3490.3180.2450.2110.1890.1660.1480.134
2 , 3 Proposed0.1880.1800.1400.1240.1060.0970.0810.0710.066
Yue and Baleanu0.3630.3210.3020.2500.2110.1920.1720.1470.128
2 , 5 Proposed0.1960.1750.1500.1250.1050.0930.0830.0740.070
Yue and Baleanu0.3750.3330.2910.2680.2200.1970.1800.1510.137
Beta 1 , 1 Proposed0.1870.1670.1450.1270.1120.0910.0890.0740.068
Yue and Baleanu0.3600.3580.3170.2790.2060.1970.1660.1500.130
1 , 2 Proposed0.1900.1680.1480.1380.1020.0980.0810.0740.065
Yue and Baleanu0.3830.3590.2960.2780.2110.1860.1790.1470.128
2 , 3 Proposed0.1950.1660.1410.1400.1040.0940.0850.0720.063
Yue and Baleanu0.3710.3270.3120.2410.2170.1880.1690.1410.140
2 , 5 Proposed0.1920.1670.1440.1390.1050.0960.0890.0780.063
Yue and Baleanu0.3730.3390.2800.2640.2320.1870.1700.1470.136
Table 3. The CPU time needed for running the proposed and the Yue and Baleanu [27] methods, for various settings of parameters.
Table 3. The CPU time needed for running the proposed and the Yue and Baleanu [27] methods, for various settings of parameters.
m , n
Distribution C V X , C V Y Method 50 , 100 75 , 100 100 , 200 200 , 300 500 , 700 700 , 1000
Normal 1 , 1 Proposed7.008.009.0016.0043.0057.00
Yue and Baleanu8.6410.08140.823.0951.9268.67
1 , 2 .Proposed8.008.0012.0019.0021.0048.00
Yue and Baleanu8.7210.2916.4121.5852.1974.17
2 , 3 Proposed8.008.0010.0015.0038.0062.00
Yue and Baleanu9.529.5014221.1051.0565.87
2 , 5 Proposed7.008.0011.0019.0026.0061.00
Yue and Baleanu9.3510.9015.2524.3149.9774.90
Gamma 1 , 1 Proposed8.009.0011.0016.0026.0058.00
Yue and Baleanu9.459.0215.1622.1347.0574.92
1 , 2 Proposed8.009.0012.0020.0027.0065.00
Yue and Baleanu8.009.5814.6524.8749.9666.20
2 , 3 Proposed7.008.0011.0017.0023.0052.00
Yue and Baleanu9.639.2914.4721.9152.5266.84
2 , 5 Proposed7.008.009.0017.0028.0050.00
Yue and Baleanu8.699.8316.2924.2750.6866.11
Beta 1 , 1 Proposed8.009.0011.0020.0043.0049.00
Yue and Baleanu9.5310.5714.1921.4753.2666.58
1 , 2 Proposed7.009.0012.0015.0042.0057.00
Yue and Baleanu9.209.5014.1524.8548.6775.00
2 , 3 Proposed7.008.0012.0017.0037.0055.00
Yue and Baleanu9.029.6314.2523.5250.2969.67
2 , 5 Proposed7.008.009.0015.0022.0063.00
Yue and Baleanu8.759.1715.7322.8950.7973.95
Table 4. The p-values to investigate the normality of the statistic T 0 .
Table 4. The p-values to investigate the normality of the statistic T 0 .
m , n
Distribution C V X , C V Y (50,100)(75,100)(100,100)(100,200)(200,300)(500,500)(500,700)(700,1000)(1000,1000)
Normal 1 , 1 0.061 0.165 0.849 0.541 0.885 0.863 0.875 0.886 0.861
1 , 2 0.605 0.224 0.871 0.559 0.892 0.861 0.895 0.901 0.870
2 , 3 0.112 0.104 0.845 0.891 0.903 0.895 0.874 0.896 0.887
2 , 5 0.099 0.163 0.855 0.435 0.898 0.858 0.895 0.890 0.844
Gamma 1 , 1 0.130 0.236 0.564 0.905 0.885 0.845 0.896 0.903 0.853
1 , 2 0.119 0.099 0.590 0.892 0.808 0.851 0.884 0 , 981 0.845
2 , 3 0.120 0.231 0.866 0.900 0.903 0.863 0.896 0.871 0.895
2 , 5 0.310 0.490 0.787 0.902 0.908 0.857 0.893 0.891 0.866
Beta 1 , 1 0.079 0.410 0.856 0.249 0.904 0.863 0.843 0.601 0.841
1 , 2 0.319 0.325 0.864 0.903 0.876 0.896 0.815 0.877 0.890
2 , 3 0.216 0.086 0.857 0.853 0.891 0.850 0.912 0.908 0.866
2 , 5 0.106 0.286 0.835 0.884 0.901 0.874 0.902 0.909 0.904
Table 5. The values of powers of the proposed and the Yue and Baleanu [27] methods, for various settings of parameters.
Table 5. The values of powers of the proposed and the Yue and Baleanu [27] methods, for various settings of parameters.
Distribution C V X , C V Y Method m , n
(50,100)(75,100)(100,100)(100,200)(200,300)(500,500)(500,700)(700,1000)(1000,1000)
Normal 1 , 2 Proposed0.8140.8630.8910.9090.9410.9660.9830.9960.998
Yue and Baleanu0.6250.6550.6870.7230.7530.7940.8750.9170.934
2 , 3 Proposed0.8120.8450.8600.9380.9460.9780.9790.9960.999
Yue and Baleanu0.6210.6520.7050.7170.7620.8240.8520.9110.934
2 , 5 Proposed0.8410.8500.8750.9320.9600.9710.9861.0001.000
Yue and Baleanu0.6190.6700.6980.7140.7770.7940.8450.9180.929
Gamma 1 , 2 Proposed0.8400.8330.8710.9190.9500.9670.9760.9940.998
Yue and Baleanu0.6010.6630.6970.7360.7710.8300.8510.9180.935
2 , 3 Proposed0.8440.8320.8930.9350.9420.9720.9810.9900.996
Yue and Baleanu0.6430.6790.6800.7260.7520.8290.8790.9030.947
2 , 5 Proposed0.8170.8430.8980.9230.9480.9610.9770.9900.995
Yue and Baleanu0.6100.6750.6830.7250.7570.8170.8520.8950.934
Beta 1 , 2 Proposed0.8410.8390.8960.9130.9490.9710.9800.9991.000
Yue and Baleanu0.6480.6580.6840.7310.7570.8170.8580.9180.948
2 , 3 Proposed0.8030.8660.8650.9240.9510.9750.9840.9950.996
Yue and Baleanu0.6460.6670.7020.7200.7510.8250.8680.9150.921
2 , 5 Proposed0.8300.8750.8770.9120.9500.9750.9800.9960.999
Yue and Baleanu0.6210.6530.6940.7290.7520.8060.8670.9100.938
Table 6. Descriptive statistics about the humidity and the scale of mouth, foot and hand disease outbreak in Sarawak and Malacca.
Table 6. Descriptive statistics about the humidity and the scale of mouth, foot and hand disease outbreak in Sarawak and Malacca.
FeatureNRegionMeanStandard DeviationCV
Disease outbreak6220Malacca64.71969.9241.078
62061Sarawak646.468573.5970.887
Humidity (%)6220Malacca75.7041.1850.015
62061Sarawak75.1500.8210.010
Table 7. The lower and upper bounds of the computed confidence intervals for the ratio of the CV of the humidity and the ratio of the CV of the scale of hand, foot and mouth disease outbreak in Malacca with respect to Sarawak.
Table 7. The lower and upper bounds of the computed confidence intervals for the ratio of the CV of the humidity and the ratio of the CV of the scale of hand, foot and mouth disease outbreak in Malacca with respect to Sarawak.
FeatureRatioLower BoundUpper Bound
Proposed MethodDisease outbreak1.2161.1221.309
Humidity (%)1.4310.5712.292
Yue and Baleanu’s methodDisease outbreak1.2161.0781.353
Humidity (%)1.4310.2532.610
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Bahrampour, A.; Avazzadeh, Z.; Mahmoudi, M.R.; Lopes, A.M. Improved Confidence Interval and Hypothesis Testing for the Ratio of the Coefficients of Variation of Two Uncorrelated Populations. Mathematics 2022, 10, 3495. https://doi.org/10.3390/math10193495

AMA Style

Bahrampour A, Avazzadeh Z, Mahmoudi MR, Lopes AM. Improved Confidence Interval and Hypothesis Testing for the Ratio of the Coefficients of Variation of Two Uncorrelated Populations. Mathematics. 2022; 10(19):3495. https://doi.org/10.3390/math10193495

Chicago/Turabian Style

Bahrampour, Abbas, Zeynab Avazzadeh, Mohammad Reza Mahmoudi, and António M. Lopes. 2022. "Improved Confidence Interval and Hypothesis Testing for the Ratio of the Coefficients of Variation of Two Uncorrelated Populations" Mathematics 10, no. 19: 3495. https://doi.org/10.3390/math10193495

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop