1. Introduction
Poisson distribution is widely used to model the number of events that occur randomly over time or space and is commonly used to model data from areas such as epidemiology, industrial quality control, environmental statistics, etc. When the data are from multiple sources, it is of particular interest to determine if the underlying Poisson populations share the same event rate or mean. Note that testing the homogeneity of Poisson distributions is fundamentally a test of symmetry across datasets, and rejection of the test corresponds to evidence of asymmetry in the mean parameters, leading to different stochastic behaviors across populations. For this problem, ref. [
1] proposed a Bayesian approach. However, the proposed method requires each population to have more than one observation. On the other hand, refs. [
2,
3] considered the problem of testing the homogeneity of the means of
n independent Poisson populations when there is only one observation for each population. Traditional methods that rely on large sample sizes are not applicable in this setting, making it crucial to develop specialized techniques tailored to the constraints of single observations.
Mathematically, let
, for
, denote the independent Poisson random variables with mean
. Let
denote the observed value of
. The problem of interest is to test
where
is the unknown common mean parameter. One of the many tests considered by ref. [
2] is the likelihood ratio test (LRT), where the likelihood ratio test statistic (LRTS),
, is given by
with
, and
refers to the natural logarithmic function. Ref. [
2] showed that as
,
is asymptotically distributed as a Chi-square distribution with
degrees of freedom,
. However, ref. [
3] proved that for a given
, the limiting distribution of
, as
, is not
. More precisely, ref. [
3] showed that, under the null hypothesis
, for a given
, as
,
where
and
are the approximate mean and variance of
, respectively, with
Note that if the exact mean and variance of
, denoted as
and
, were available, the central limit theorem would imply that
The difficulty lies in the fact that , and do not have closed-form expressions.
In this paper, we propose two methods to approximate
and
. The first is a truncation method, which provides simple closed-form approximations of
and
. The second is a parametric bootstrap method, which, based on the observed sample, numerically approximates
and
. We compare the accuracy of the
p-values obtained by these two methods for testing the hypothesis stated in Equation (
1).
Furthermore, in real-life applications, such as online sequential testing in sequential sampling inspection procedures (see ref. [
4]) and quality control charts (see ref. [
5]), where
n is typically fixed, while
, a setting different from that considered in ref. [
3], we prove that the limiting distribution of the LRTS is indeed
.
The rest of the paper is organized as follows. In
Section 2, we derive the LRTS for testing the homogeneity of the means across
n independent Poisson populations, which is symmetrical across populations, each with one observation. For a fixed
and
, we propose two methods to approximate the mean and variance of the LRTS and use them to compute the
p-value for testing Equation (
1). In
Section 3, the case that
n is fixed and
is considered. The asymptotic distrtibution of the LRTS is derived for testing Equation (
1). With
, a simple closed-form approximation of the mean of the LRTS is derived and used in the Bartlett correction to provide a more accurate approximation of the
p-value.
In addition, a parametric bootstrap method is also proposed to approximate the mean of the LRTS, and, using the Bartlett correction again, an accurate approximation of the
p-value is obtained.
Section 4 presents numerical examples that demonstrate the application and accuracy of the proposed methods. Simulation results show that even for small
n and small
, the Bartlett correction using the bootstrap-approximated mean of the LRTS yields highly accurate results, although it is computationally intensive. Some concluding remarks are given in
Section 5.
2. Asymptotic Distribution of LRTS When Is Large and Is Fixed
Let
, denote independent Poisson
random variables with only one observation,
, from each distribution. To test the symmetry across the
n independent Poisson populations, the null and alternative hypotheses are stated in Equation (
1). For this problem, the log-likelihood function is
It can be shown that the overall maximum likelihood estimate (MLE) of
is given by
, where
. The log-likelihood function evaluated at the overall MLE is
When
is true, the constrained log-likelihood function is
Then, the constrained MLE of
is
, and the log-likelihood function evaluated at the constrained MLE is
Thus, as given in [
2], the LRTS is
Note that this LRTS, , measures deviation from symmetry across the n independent Poisson populations. Under the null hypothesis, the means are assumed to be equal, reflecting a symmetric structure. A significant value of , therefore, signals a symmetry-breaking event, indicating heterogeneity among the populations. Accurately deriving and approximating the distribution of is crucial as it underpins valid statistical inference about such structural deviations. Beyond its inferential utility, this analysis contributes to the broader understanding of invariant properties and symmetry-related transformations in statistical systems—core themes within the scope of Symmetry. By identifying and quantifying departures from symmetry, this work aligns with the journal’s emphasis on both the theoretical foundations and applied implications of symmetry and asymmetry in mathematical and scientific contexts.
In ref. [
2], the authors proved that as
,
However, ref. [
3] showed that this result is wrong. More specifically, the authors showed that for a fixed
, and as
,
where
Since is unknown, and assuming is true, can be estimated by . However, and do not have closed-form expressions.
In the following subsections, we first propose a truncation method to approximate
and
with closed-form expressions. Thus,
and
can be approximated. Alternatively, we propose a parametric bootstrap method to numerically estimate
and
. Once
and
are available, either through the truncation method or the parametric bootstrap method, we apply the central limit theorem to obtain the
p-value for testing the symmetry across the
n independent distribution:
where
w is the observed LRTS. A small
p-value suggests evidence of symmetry breaking, while a large
p-value indicates no evidence against the assumed symmetry.
2.1. Truncation Method
The truncated Poisson random variable in mixed effects models was introduced in ref. [
6]. Based on the saddlepoint approximation, ref. [
7] approximated the nonlinear moments of truncated Poisson random variables. In this section and in the
Appendix A, we followed the arguments in ref. [
7] and derived a closed-form expression of the mean and variance of LRTS. More specifically, let
,
, be independent Poisson
variables. Assume that
m and
ℓ are positive integers. Then,
Hence, for
and
, we have
where
Moreover, for
, we have
. Thus,
Therefore, for a given
, as
,
. Since
can be expressed as
as
, for a given
, we have
where
is the probability mass function of the Poisson
distribution.
Following the same argument, in the
Appendix A, we show that for a given
, and as
, applying the truncation method gives
where
is given in Equation (
8). Finally, the mean and variance of
can be approximated by
and
, respectively.
In the case that
is unknown, the estimate of
under
is the constrained MLE
. Hence,
and
can be approximated by
and
, respectively. Thus, the
p-value for testing Equation (
1) can be obtained using Equation (6). Finally, evidence of symmetry across the
n independent Poisson populations can be determined from the
p-value.
2.2. Parametric Bootstrap Method
As an alternative to the truncation method proposed in the previous subsection, we propose a parametric bootstrap method to numerically approximate the mean and variance of
. The idea of approximating the mean of an LRTS has been demonstrated in ref. [
8] via cointegration testing, and also in ref. [
9] for testing the equality of Gaussian graphical models. In this paper, we adopt the same approach to approximate the mean and variance of
. The procedure can be summarized by the following Algorithm 1.
Algorithm 1: Approximating the mean and varaince of LRT | |
Have: | From the observed data , we can calculate the observed
LRTS w by |
| |
|
where |
Step 1: | Obtain a parametric bootstrap sample of size n, ,
from Poisson (). |
Step 2: | From the bootstrap sample, calculate the observed
LRTS by |
|
where |
Step 3: | Repeat Steps 1 and 2 M times, where M is large and we have
. |
Step 4: | Then |
|
are the unbiased estimate of and , respectively. |
Hence, the
p-value for testing Equation (
1) can be obtained from Equation (6). Finally, evidence of symmetry across the
n independent Poisson populations can be determined from the
p-value.
2.3. Numerical Examples
Example 1. We generated one set of data from each of the following five cases stated in Table 1: Table 2 reports that the
p-values for testing
calculated from the four methods discussed in this paper:
Note that rejecting
corresponds to evidence of having asymmetry in the mean parameters. For FWT, calculating
and
requires an infinite sum, and numerically, we summed 1,000,000 times. Also, for Bootstrap, we use
M = 1,000,000 resamples. Note that for cases 1 to 4, the data were generated using the same mean parameters. Therefore, we expect to obtain large
p-values. In contract, for case 5, the data were generated using different mean parameters, so we expect small
p-values. The results are recorded in
Table 2.
It is clear from
Table 2 that the results from ref. [
2] are significantly different from those of the other three methods, and theoretically, we know they are the wrong results. The results from ref. [
3] and the results from the proposed truncation method are the same. However, in terms of the required time in calculation, the method in ref. [
3] takes, on average, 10 to 15 times longer to obtain the results than the proposed truncation method. The parametric bootstrap method gives result similar to those obtained by the truncation method. But due to the required large number of simulations (we use
M = 1,000,000), the parametric bootstrap method takes, on average, 600 times longer than the truncated method to obtain the results.
Example 2. We consider the five cases studied in Example 1. For each case, we perform the following calculations:
- 1.
Generate a sample of size n.
- 2.
Record the p-value calculated from the truncation method and the parametric bootstrap method. We did not include method from ref. [2] because it is wrong, and we did not include method ref. [3] because it gives the same answer as the truncation method. - 3.
Repeat the above steps N times.
- 4.
Report the proportion of p-values that is larger than α out of these N simulated samples, where α is a preset significance level.
In our study, we set and N = 10,000. The corresponding standard error is . To reduce computation time, we use resamples for the parametric bootstrap method in this example.
As discussed in ref. [
10], if the data generated under
are correct (symmetry across populations), then the proportion of
p-values less than
should be close to the nominal level
with standard error
. On the other hand, if the data generated under
are correct (asymmetry in mean parameters), then the proportion of
p-values less than
should correspond to the power of the test at a 5% level of significance. Hence, a higher proportion corresponds to a more powerful test.
For our simulation, data in cases 1 to 4 are generated from independent Poisson distributions with the same mean parameter. The best method is the one that yields the proportion of p-values less than closest to the nominal-level . However, for case 5, data are generated from independent Poisson distributions with different mean parameters. The best method is the one that yields the largest proportion of p-values less than .
The results of this simulated study are recorded in
Table 3. We observed that the parametric bootstrap method gives results that are slightly closer to the nominal
than those obtained by the truncation method (see cases 1 to 4). Moreover, the parametric bootstrap method also has slightly higher power than the truncation method (see case 5). But the results are not significantly different. The advantage of the truncation method is the simplicity in the calculation, whereas the parametric bootstrap method is very time-consuming because of the required bootstrapping.
3. Asymptotic Distribution of LRTS When Is Small but Is Large
In many applied contexts, especially in real-time monitoring systems, one often encounters a small number of independent Poisson-distributed observations with large expected counts. A relevant example arises in online sequential testing, where a detector continuously monitors radiation by recording the number of emissions per second. The detector maintains a moving average of the most recent
n counts, with
n being fixed and small, while the underlying Poisson means are large. Under normal operating conditions, these counts are expected to be statistically similar, reflecting distributional symmetry over time. However, when a newly observed emission count deviates substantially from the previous ones, this symmetry is broken, suggesting an anomaly or system shift. In such cases, the detector triggers a reset mechanism. Currently, systems often rely on a moving average of four observations to determine when this symmetry-breaking event occurs. This scenario can be formulated as a statistical hypothesis testing problem. Let
denote the
ith observed number of emissions asummed to follow an independent Poisson
distribution, for
. Each
represents the mean emission rate per second and is assumed to be large. The goal is to test symmetry across
n independent Poisson populations based on the observed sample
. And the statistical hypothesis is stated in Equation (
1). The LRTS
is defined as in
Section 2, measuring the deviation from symmetry of the independent populations. However, the problem differs from the setting discussed in
Section 2, as here,
n is small and the common mean
is large. Thus, the asymptotic distribution of
is different from the one derived in
Section 2.
To address this problem, we first derived the asymptotic distribution of
under the condition that
n is fixed and
. Then, we propose three methods for approximating the
p-value for testing the hypothesis stated in Equation (
1). The first method directly utilizes the asymptotic distribution of
. The second and third methods are modifications of the Bartlett correction method (see ref. [
11]), where the mean of
is approximated by using a special case of the truncation method described in
Section 2, and by the approximated parametric bootstrap method, as given in
Section 2. An accurate approximation of the distribution of
is essential as it enables precise statistical inference regarding potential departures from symmetry in the underlying system. Such approximations not only support the detection of symmetry-breaking phenomena but also contribute to the broader understanding of invariant structures and transformations that align with the theoretical and applied focus of Symmetry.
3.1. Obtaining p-Value Based on LRTS
Since we are considering the case where
, we can rewrite
, where
is fixed, and
. The problem can then be reformulated as
, where each
follows a Poisson distribution with parameter
. Hence, as
, the central limit theorem gives
Since
converges to 1 in probability, it follows that
The LRTS given in Equation (
2) is
Using the Taylor expansion of
, we have
The first term simplifies to
and the second term simplies to
Therefore,
, and it follows that
Thus, the
p-value for testing the hypothesis stated in Equation (
1) is
where
w is the observed LRTS.
3.2. Bartlett Correction of LRTS
As stated in ref. [
12], Equation (
10) has a convergence rate of order
. Bartlett (see ref. [
11]) proposed the Bartlett correction method to improve the convergence rate of the
p-value to
, and ref. [
13] gives a detailed summary of the Bartlett correction method. Although the Bartlett correction method has a hight convergence rate, as is demonstrated in refs. [
14,
15], the exact Bartlett correction factor is very difficult to obtain. An alternative way of explaining the Bartlett correction method is to obtain a transformation of
such that the limiting distribution of the transformed statistic remains as a Chi-square distribution but the mean of the transformed statistic exactly matches the mean of the limiting distribution. This can be achieve by using the scale transformation:
which ensures that the mean of
is
. Thus, according to ref. [
11],
where
is the observed value of
, and this method achieves a convergence rate of
. A more accurate approximation of the
p-value provides stronger evidence for detecting whether the underlying symmetry of mean homogeneity across Poisson populations is preserved or broken. This leads to more precise decision-making and directly supports the journal’s focus on identifying and understanding symmetry-breaking phenomena in statistical structures.
However,
does not have a closed-form expression. As discussed in
Section 2, we propose two methods to approximate
: a version of the truncation method, and a parametric bootstrap method.
3.2.1. Truncation Method
Let
X be a Poisson random variable with mean
. By following the arguments in [
7], we have
Let
; then, we have
Moreover, let
where
Poisson(
). Then,
Since
, by Taylor expansion of
, we have
Furthermore, by Taylor expansion of
, we have
and similarly,
Finally,
Therefore, under
, we have
3.2.2. Parametric Bootstrap Method
We can also apply the parametric bootstrap method discussed in
Section 2 to this problem. Using the Algorithm given in
Section 2, we obtain
, which is an unbiased estimate of
.
3.3. Numerical Studies
Example 3. We generated a dataset from each of the following six cases stated in Table 4: The table reports the
p-values calculated from the three methods—the likelihood ratio method (LRT), the Bartlett correction method where the mean of the LRTS is approximated by Equation (
15) (BCTruncated), and the Bartlett correction method where the mean of the LRTS is approximated by the parametric bootstrap method (BCBootstrap)—for testing the hypothesis stated in Equation (
1). For the parametric bootstrap, we use M = 1,000,000 resamples. Note that rejecting
corresponds to evidence of having asymmetry in the mean parameters. Similar to Example 1, for the first four cases, the data were generated using the same mean parameter. Hence, we expect large
p-values. For cases 5 and 6, the data were generated using different mean parameters, so we expect small
p-values.
In
Table 5, the
p-values across all three methods are evidently similar. The parametric bootstrap method takes approximately 500 times longer to obtain the result than the other two methods.
Example 4. To compare the accuracy of the three methods discussed in Example 3, simulation studies with N = 10,000 were performed using the parameter settings given in Example 3. At the 5% level of significance, the standard error of each simulated study is 0.0022. Table 6 reports the proportion of p-values that are less than the chosen . For cases 1 to 4, the best method is the one which gives results closest to α. However, for cases 5 and 6, the best method is the one that yields the largest value. As shown in Table 6, the results are not significantly different from each other. Example 5. We consider additional combinations of and , where both n and λ are small. Table 7 reports the proportion of p-values less than α, obtained using the method in [3] (FWT), the truncation method discussed in Section 2 (Truncated), the parametric bootstrap method discussed in Section 2 (Bootstrap), the LRT method discussed in Section 3 (LRT), the Bartlett correction method where the mean of LRTS is approximated by the truncation method discussed in Section 3 (BCTruncated), and the Bartlett correction method where the mean of LRTS is approximated by the parametric bootstrap method discussed in Section 3 (BCBootstrap). The nominal
value is
, and with
N = 10,000, the standard error of each of the simulated studies is 0.0022. Again, the criterion for the best method is given in ref. [
10]. It is evident that results from FWT and Truncated are not satisfactory when
n is small. This suggests that the approximated mean and variance of the LRTS may not be accurate in such cases. The LRT method discussed in
Section 3 performed reasonably well, but still falls short of being ideal. The results from BCTruncated and Bootstrap are much better, with Bootstrap seeming to produce slightly more accurate results. Overall, BCBootstrap is the most accurate method regardless of the size of
n and
. However, the trade-off is that both Bootstrap and BCBootstrap are computationally intensive. If computational time is not a constraint, we recommend using either Bootstrap or BCBootstrap. If computational time is a concern, then we recommend using BCTruncated. Truncated should only be used when the
n is large relative to
. LRT is acceptable if the
is large, regardless of the size of
n.
Other simulation studies have also been performed, and they give similar results to those reported here. These results are available from the authors upon request.
5. Conclusions
This paper investigates the problem of testing the homogeneity of n independent Poisson means when only one observation per population is available—a common situation in fields such as epidemiology, environmental statistics, and industrial quality control. Traditional large-sample methods are not applicable in this setting. We proposed a LRTS-based approach, where the mean and variance of the LRTS are approximated using either a truncation method or a parametric bootstrap. These approximations enable p-value computation via the central limit theorem. For small n and large , we also derived the asymptotic distribution of the LRTS. The mean of the LRTS can be approximated analytically or via bootstraping, allowing for Bartlett correction to improve test accuracy. Simulations show that the bootstrap-based correction provides the most accurate results but is computationally intensive. In contrast, the truncation-based method is faster but less precise.
The framework also extends to constructing confidence intervals for the common mean by inverting the test based on values. The resulting interval uses chi-square critical values, with degrees of freedom depending on the chosen method. Overall, this work provides a practical and flexible approach to testing Poisson mean homogeneity in data-limited settings.