Abstract
In this article, we propose approaches for constructing confidence intervals for the single signal-to-noise ratio (SNR) of a log-normal distribution and the difference in the SNRs of two log-normal distributions. The performances of all of the approaches were compared, in terms of the coverage probability and average length, using Monte Carlo simulations for varying values of the SNRs and sample sizes. The simulation studies demonstrate that the generalized confidence interval (GCI) approach performed well, in terms of coverage probability and average length. As a result, the GCI approach is recommended for the confidence interval estimation for the SNR and the difference in SNRs of two log-normal distributions.
1. Introduction
In statistics, it is well-known that the standard deviation and the variance are used to measure dispersion. Although the standard deviation has an important advantage and is easier to interpret than the variance, the former is not an appropriate indicator when we compare the dispersion in distributions of several variables. Therefore, the coefficient of variation (CV), which is defined as the ratio of the standard deviation to the mean, is used to measure the relative dispersion. CV is free from the unit of measurement and is useful in comparing the variability between groups of observations. Many authors have proposed confidence intervals for CV. For instance, Niwitpong [1] constructed the confidence intervals for the CV of a log-normal distribution with restricted parameter space, while Ng [2] studied the confidence interval for the common CV of log-normal distributions. Furthermore, Thangjai [3] proposed the simultaneous fiducial generalized confidence intervals (SFGCIs) for the differences in the CVs of log-normal distributions.
The signal-to-noise ratio (SNR) is the inverse of the CV. It is the ratio of the mean to the standard deviation. SNR has been used in many fields, such as finance, quality control, medicine, imaging, economics, marketing, and biology. For the application of this ratio in the theory of finance, SNR measures the relationship between excess return and the risk of financial assets. In analog and digital communications, SNR is a measure of the signal strength relative to the background noise. In quality control, SNR represents the magnitude of the mean of a process compared to its variation. In medicine, SNR can be used to analyze the blood pressure of patients in a longitudinal study. In image processing, the ratio of the mean pixel values over a given neighborhood is calculated by the SNR of an image. Furthermore, the SNR is used for the analysis of portfolio selection models and market risk (see [4,5]).
A log-normal distribution is right-skewed and is used in models for various applications, such as medicine, economics, biology, agriculture, entomology, and finance. Applications of the log-normal distribution can be found in [6,7,8].
Suppose that a random variable follows all possible distributions. The lower and the upper limits of the confidence interval for the CV are denoted by and , respectively. By definition, if is a random sample from a probability distribution with statistical parameters, then the confidence interval for the CV () with nominal confidence level is an interval with and : These are determined with the property . Then, can be achieved by taking the inverse values of , , and . That is to say, the confidence interval for the inverse of CV () with nominal confidence level is the interval with and . Confidence interval estimation in terms of SNR has received attention in the literature; see [9,10,11,12,13,14,15]. In this article, we propose two approaches for constructing the confidence intervals for the SNR of a log-normal distribution, using the GCI and the large sample approaches. Furthermore, three confidence intervals for the difference between the SNRs of log-normal distributions are constructed based on the GCI, large sample, and method of variance estimates recovery (MOVER) approaches.
The rest of this article is organized as follows. In Section 2, the confidence intervals for the SNR of a log-normal distribution are presented, and the confidence intervals for the difference between the SNRs of log-normal distributions are given in Section 3. In Section 4, the results of simulation studies to assess the coverage probabilities and the average lengths of all of the proposed confidence intervals are presented. Next, two examples are given to illustrate the proposed approaches in Section 5, and the concluding remarks are presented in Section 6.
2. The Confidence Intervals for a Single SNR
Suppose that a random variable follows a normal distribution with mean and variance . Then, the random variable Y follows a log-normal distribution with mean and variance . Thus, the SNR of Y is given by
We are interested in constructing the confidence interval for the SNR . Let be a random sample from Y: Let and be the sample mean and sample variance for log-transformed data , where ; and let and be the observed sample mean and observed sample variance, respectively. The estimator of is
The variance of , given in [3], is in the form . Therefore, it is easy to derive the variance of , as follows:
2.1. The GCI Approach for a Single SNR
The concept of GCI was introduced by Weerahandi [16]. Let be a random sample having a density function , where is the parameter of interest and is a nuisance parameter. Let x be the observed sample of X. A generalized pivotal quantity is considered and satisfies the following conditions:
- (i)
- The distribution of is free of all unknown parameters.
- (ii)
- The observed value of is the parameter of interest.
Condition (i) is imposed to guarantee that a subset of the sample space of the possible values of can be found at a given value of the confidence coefficient, with no knowledge of the parameters. Condition (ii) is imposed to ensure that such probability statements, based on a generalized pivotal quantity, lead to confidence regions involving the observed data x only. The GCI for is computed using the percentiles of the generalized pivotal quantity. Let be a two-sided GCI for the parameter of interest, where and denote the -th and the -th percentiles of , respectively.
Suppose that and are the mean and variance of the log-transformed sample from a log-normal distribution. Furthermore, let and be the observed values of and , respectively. Since has a chi-squared distribution with degrees of freedom, defined by , then . We define the generalized pivotal quantity for as
where denotes a chi-squared distribution with degrees of freedom.
From Equations (1) and (4), the generalized pivotal quantity for , based on the generalized pivotal quantity for , is given by
The two-sided confidence interval for the SNR of log-normal distribution , based on the GCI approach, is given by
where and denote the -th and -th quantiles of , respectively.
The following algorithm is used to construct the GCI for the SNR of a log-normal distribution (Algorithm 1):
| Algorithm 1: The GCI for the SNR. |
| For a given and For 1 to h: Generate from chi-squared distribution with degrees of freedom Compute from Equation (4) Compute from Equation (5) End g loop Compute the -th quantiles of defined by Compute the -th quantiles of defined by |
2.2. The Large Sample Approach for a Single SNR
From Equations (2) and (3), the two-sided confidence interval for the SNR of log-normal distribution , based on the large sample approach, is given by
where denotes the -th quantile of a standard normal distribution and is defined as in Equation (3), with replaced by s.
3. The Confidence Intervals for the Difference between SNRs
Suppose that follows a normal distribution with mean and variance . Similarly, let be a normal distribution with mean and variance . Moreover, X and T are independent. The single SNRs of Y and W are, respectively, given by
The estimators of and are
The variances of and are, respectively,
Therefore, the difference between and is
Let n and m be the sample sizes of X and T, respectively. Using the Bienaym formula, the variance of the sum of uncorrelated random variables is the sum of their variances. Moreover, using the linearity of the expectation operator and the assumption that X and T are independent, the variance of is obtained as
3.1. The GCI Approach for the Difference between SNRs
Suppose that and denote the variances of the log-transformed sample, and let and be the observed values of and , respectively. The generalized pivotal quantities for and are obtained from
where and denote chi-squared distributions with and degrees of freedom, respectively.
Therefore, the difference between the generalized pivotal quantities , based on the generalized pivotal quantities for and , can be written as
The two-sided confidence interval for the difference between the SNRs of log-normal distributions , based on the GCI approach, is given by
where and denote the -th and -th quantiles of , respectively.
3.2. The Large Sample Approach for the Difference between SNRs
Using the central limit theorem, the two-sided confidence interval for the difference between SNRs of log-normal distributions , based on the large sample approach, is given by
where is the -th quantile of the standard normal distribution, and and are defined as in Equations (11) and (12), respectively, with and replaced by and .
3.3. The MOVER Approach for the Difference between SNRs
Let and be the lower and upper limits of the confidence interval for the SNR of X, respectively, then they can be defined by
where is the -th quantile of a Student’s t distribution, and and are defined as in Equations (9) and (10), respectively, with replaced by .
Similarly, let and be the lower and upper limits of the confidence interval for the SNR of T, respectively, then they can be written as
where is the -th quantile of a Student’s t distribution, and and are defined as in Equations (9) and (10), respectively, with replaced by .
4. Simulation Studies
Two simulation studies were conducted to evaluate the coverage probabilities and average lengths of the proposed confidence intervals. The aim of the first simulation was to assess the performance of the GCI approach, in comparison with the large sample approach, for the confidence interval estimation for the single SNR of a log-normal distribution. The aim of the second simulation was to examine the performance of the GCI approach, in comparison with the large sample and MOVER approaches.
In the single SNR simulation study, the sample sizes were 10, 20, 30, 50, 100, and 200; the population mean of normal data was 1; the population standard deviation was computed as for the normally distributed data; and the SNR was 1, 3, 5, and 10. A total of 5000 random samples were generated for each set of parameters. For the GCI approach, 2500 were obtained for each of the random samples. Table 1 reports the coverage probabilities and average lengths of the 95% two-sided confidence intervals for the SNR of the log-normal distribution. The results show that the coverage probabilities of both approaches were close to the nominal confidence level of 0.95. Moreover, the average lengths of the GCI approach were shorter than those of the large sample approach, when the sample size was small. For a large sample size ( 100), the GCI approach performed as well as the large sample approach, in terms of the average length, when the SNR was small; otherwise, the average lengths of the GCI approach were shorter than those of the large sample approach.
Table 1.
Coverage probabilities (CP) and average lengths (AL) of the 95% two-sided confidence intervals for the signal-to-noise ratio (SNR) of the log-normal distribution.
In the simulation study of the difference of SNRs, the sample sizes were (10, 10), (10, 20), (20, 20), (20, 30), (30, 30), (30, 50), (50, 50), (50, 100), (100, 100), (100, 200), and (200, 200); the population means were 1; and the population SNRs were (10, 1), (10, 2), (10, 5), and (10,10) for the normally distributed data. Therefore, the population standard deviations of the normally distributed data and were computed. The coverage probabilities and average lengths of the 95% two-sided confidence intervals for the difference between the SNRs of the log-normal distributions were evaluated, based on 5000 replications, and 2500 were obtained for the GCI approach. The results are given in Table 2, in which it can be seen that the GCI approach and the large sample approach were preferable for all cases. However, the average lengths of the GCI approach were shorter than those of the large sample approach. Furthermore, the coverage probabilities of the MOVER approach provided more than 0.97 for (10, 10) and (10, 20); thus, the MOVER confidence interval was conservative for those two sample sizes. For large sample sizes, the coverage probabilities of the MOVER approach were close to the nominal confidence level of 0.95, although the average lengths were wider than those of the GCI and large sample approaches.
Table 2.
CP and AL of the 95% two-sided confidence intervals for the difference between the SNRs of the log-normal distributions.
5. Empirical Applications
Two examples are given to illustrate our proposed approach for confidence intervals for the SNR of a log-normal distribution and the difference between the SNRs of log-normal distributions. The GCIs are computed using Algorithm 1, with 2500.
Example 1.
The data are from Fung and Tsang [19] and Ng [2]. The data-set contains hemoglobin values from one normal and one abnormal blood sample of Hb1995. The summary statistics are 65, 14.64, and 0.0665. Therefore, the SNR of the log-normal distribution is 3.8135. The procedures in Section 2 are applied to compute the 95% two-sided confidence intervals for the SNR of the log-normal distribution. The 95% GCI and large sample confidence interval for the SNR are [3.1365, 4.4748] with a length of interval of 1.3383 and [3.1307, 4.4964] with a length of interval of 1.3657, respectively. Note that the GCI and the large sample confidence intervals contain the true value of the SNR. However, the length of the GCI is shorter than the length of the large sample confidence interval and, thus, the former is better when the sample size is small (Table 1).
Example 2.
The data are from the Regenstrief Medical Record System, as reported in MCDonald et al. [20], Zhou et al. [21], and Jafari and Abdollahnezhad [8]. The data represent the effects of race on medical charges for patients with type I diabetes who received inpatient or outpatient care, on at least two occasions, during the period from 1 January 1993 to 30 June 1994. The dataset consists of African American and white patients. For African American patients, the summary statistics are 119, 9.0670, and 1.8240 and, for the white patients, the summary statistics are 106, 8.6930, and 2.6920. The difference between the SNRs is 0.1691. Zhou et al. [19] showed that both datasets come from log-normal distributions. The 95% two-sided confidence intervals for the difference between the SNRs of the log-normal distributions were constructed, using the three approaches given in Section 3. The 95% GCI, large sample, and MOVER confidence intervals for the difference between SNRs are [0.0101, 0.3258] with a length of interval of 0.3157, [0.0082, 0.3300] with a length of interval of 0.3218, and [0.0064, 0.3318] with a length of interval of 0.3254, respectively. The results indicate that all of the confidence intervals contain the true difference between the SNRs, but GCI provided the shortest length, and so is much more satisfactory than the others.
6. Discussion and Conclusions
In this article, we considered the confidence intervals for the single SNR of a log-normal distribution and for the difference of SNRs between the two log-normal distributions. First, we used the GCI approach and the large sample approach to construct the confidence intervals for the SNR, and then we used the GCI, large sample, and MOVER approaches to estimate the confidence interval for the difference between the SNRs.
For the confidence interval for SNR, the coverage probabilities of both approaches were satisfactory. However, the GCI approach was better than the large sample approach, in terms of the average length. For the difference between the SNRs, the GCI approach and the large sample approach were preferable to MOVER. However, the average lengths of the GCI approach were shorter than those of the large sample approach. As a result, comparing the GCI approach and the large sample approach, the former was therefore more preferable, in terms of the average length.
Author Contributions
All authors contributed significantly to the study and preparation of the article.
Funding
This research was funded by Faculty of Applied Science, King Mongkut’s University of Technology North Bangkok, Contract No. 6245106.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Niwitpong, S. Confidence intervals for coefficient of variation of lognormal distribution with restricted parameter space. Appl. Math. Sci. 2013, 7, 3805–3810. [Google Scholar] [CrossRef]
- Ng, C.K. Inference on the common coefficient of variation when populations are lognormal: A simulation-based approach. J. Stat. Adv. Theory Appl. 2014, 11, 117–134. [Google Scholar]
- Thangjai, W.; Niwitpong, S.-A.; Niwitpong, S. Simultaneous fiducial generalized confidence intervals for all differences of coefficients of variation of log-normal distributions. Lecture Notes Artif. Intell. 2016, 9978, 552–561. [Google Scholar]
- Holgersson, H.E.T.; Karlsson, P.S.; Mansoor, R. Estimating mean-standard deviation ratios of financial data. J. Appl. Stat. 2012, 39, 657–671. [Google Scholar] [CrossRef]
- Soliman, A.A.; Abd Ellah, A.H.; Abou-Elheggag, N.A.; Modhesh, A.A. Estimation of the coefficient of variation for non-normal model using progressive first-failure-censoring data. J. Appl. Stat. 2012, 39, 2741–2758. [Google Scholar] [CrossRef]
- Johnson, N.L.; Kotz, S.; Balakrishnan, N. Continuous Univariate Distributions; John Wiley: New York, NY, USA, 1994. [Google Scholar]
- Crow, E.L.; Shimizu, K. Lognormal Distributions; Marcel Dekker: New York, NY, USA, 1988. [Google Scholar]
- Jafari, A.A.; Abdollahnezhad, K. Inferences on the means of two log-normal distributions: A computational approach test. Commun. Stat.-Simul. Comput. 2015, 44, 1659–1672. [Google Scholar] [CrossRef]
- Sharma, K.K.; Krishna, H. Asymptotic sampling distribution of inverse coefficient of variation and its applications. IEEE Trans. Reliab. 1994, 43, 630–633. [Google Scholar] [CrossRef]
- George, F.; Kibria, B.M.G. Confidence intervals for signal-to-noise ratio of a poisson distribution. Am. J. Biostat. 2011, 2, 44–55. [Google Scholar]
- George, F.; Kibria, B.M.G. Confidence intervals for estimating the population signal-to-noise ratio: A simulation study. J. Appl. Stat. 2012, 39, 1225–1240. [Google Scholar] [CrossRef]
- Albatineh, A.N.; Kibria, B.M.G.; Zogheib, B. Asymptotic sampling distribution of inverse coefficient of variation and its applications: Revisited. Int. J. Adv. Stat. Probab. 2014, 2, 15–20. [Google Scholar] [CrossRef]
- Albatineh, A.N.; Boubakari, I.; Kibria, B.M.G. New confidence interval estimator of the signal-to-noise ratio based on asymptotic sampling distribution. Commun. Stat.-Theory Methods 2017, 46, 574–590. [Google Scholar] [CrossRef]
- Niwitpong, S. Confidence intervals for functions of signal-to-noise ratios of normal distributions. Stud. Comput. Intell. 2018, 760, 196–212. [Google Scholar]
- Saothayanun, L.; Thangjai, W. Confidence intervals for the signal to noise ratio of two-parameter exponential distribution. Stud. Comput. Intell. 2018, 760, 255–265. [Google Scholar]
- Weerahandi, S. Generalized confidence intervals. J. Am. Stat. Assoc. 1993, 88, 899–905. [Google Scholar] [CrossRef]
- Zou, G.Y.; Donner, A. Construction of confidence limits about effect measures: A general approach. Stat. Med. 2008, 27, 1693–1702. [Google Scholar] [CrossRef] [PubMed]
- Zou, G.Y.; Taleban, J.; Hao, C.Y. Confidence interval estimation for lognormal data with application to health economics. Comput. Stat. Data Anal. 2009, 53, 3755–3764. [Google Scholar] [CrossRef]
- Fung, W.K.; Tsang, T.S. A simulation study comparing tests for the equality of coefficients of variations. Stat. Med. 1998, 17, 2003–2014. [Google Scholar] [CrossRef]
- McDonald, C.J.; Blemis, L.; Tierny, W.M.; Martin, D.K. The regenstrief medical records. MD Comput. 1988, 5, 34–47. [Google Scholar] [PubMed]
- Zhou, X.H.; Gao, S.; Hui, S.L. Methods for comparing the means of two independent log-normal samples. Biometrics 1997, 53, 1129–1135. [Google Scholar] [CrossRef] [PubMed]
© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).