1. Introduction
The bootstrap method is often used for estimating the variance of survey estimates in Statistics Canada’s social surveys and sometimes in its economic surveys. It is implemented by producing a set of bootstrap weights that is made available to users along with the survey data. This implementation of the bootstrap method is convenient for users as they can typically easily replicate their statistics using the bootstrap weights instead of the original survey weights.
Most of Statistics Canada’s surveys that implement the bootstrap method have a complex stratified two-stage or three-stage sampling design. The Rao–Wu–Yue bootstrap weights [
1] are often computed in those surveys. The Rao–Wu–Yue bootstrap weights are applicable when the first-stage sample is drawn with replacement within strata. With-replacement sampling is never applied in practice; however, the Rao–Wu–Yue method provides a reasonable approximation for fixed-size without-replacement sampling when the first-stage sampling fractions are small.
In some multistage surveys, the first-stage sampling fractions may be large in some strata, and the Rao–Wu–Yue bootstrap weights may lead to substantial overestimation of the variance of survey estimates. The literature on this topic is mostly limited to the case of simple random sampling at each stage (e.g., [
2,
3,
4]). Chaudhuri and Saha [
5] considered two-stage sampling with the Rao–Hartley–Cochran (RHC) method [
6] at both stages. Inspired by [
7], Osiewicz and Pérez-Duarte [
8] provided bootstrap weights for the case of RHC sampling at the first stage followed by simple random sampling without replacement at the second stage.
In addition to RHC and simple random sampling, Poisson sampling and sampling without replacement with probability proportional to size are sometimes used in one of the stages. Beaumont and Patak [
9] proposed a general bootstrap method that can be applied to any design, including multistage sampling designs, as long as joint selection probabilities can be computed or at least accurately approximated. However, the method is somewhat complex to implement, as it requires a spectral decomposition, and negative bootstrap weights may occur. A recent review of bootstrap methods for finite population sampling is provided in [
10].
To our knowledge, there does not appear to exist a simple and general method for constructing non-negative bootstrap weights in multistage sampling with different sampling methods at each stage. We thus developed a unified bootstrap procedure that is straightforward to implement to fill this methodological gap. Our procedure can be applied to any number of stages as long as valid bootstrap weights can be produced for each distinct stage of sampling.
Our bootstrap procedure is also directly applicable to two-phase sampling designs when Poisson sampling is used at the second phase. This specific two-phase design can be viewed as a special case of two-stage sampling where first-stage units contain only one second-stage unit. It is also a useful design to model survey nonresponse. Our bootstrap procedure can thus be applied to estimate the variance of nonresponse-weighted estimators. Replication variance estimation for this specific two-phase design was also considered by [
11].
In
Section 2, we introduce notation, provide textbook variance estimators for the double expansion estimator of the population total under two-stage sampling and describe basic bootstrap concepts for finite populations. A few commonly-used methods of deriving bootstrap weights for single-stage cluster sampling are briefly reviewed in
Section 3 for different sampling designs. For probability proportional to size without replacement sampling, we propose a simple heuristic extension of the Rao–Wu–Yue bootstrap weights.
Our bootstrap procedure for two-stage sampling is presented in
Section 4.
Section 5 considers the special case of two-phase sampling with Poisson sampling at the second phase, and the application to nonresponse is discussed in
Section 6.
Section 7 describes how to apply our bootstrap procedure for multistage sampling using the example of three-stage sampling. Bootstrap theory for nonlinear estimators is briefly discussed in
Section 8.
Section 9 provides the results of three limited simulation studies to evaluate the performance of our proposed bootstrap methods. Finally, a few concluding remarks are given in
Section 10.
2. Background and Notation
Let us first consider the estimation of the population total
where
is the population of clusters,
is the population of units within cluster
,
is the value of variable of interest
for unit
in cluster
and
is the total of variable
within cluster
.
2.1. Single-Stage Cluster Sampling
At the first stage, a sample of clusters,
, is selected from the population
according to a probability sampling design. The probability of selecting a given cluster
at the first stage is
, and the first-stage design weight of cluster
is defined as
. The joint probability of selecting clusters
and
is
. We assume that both
and
are strictly greater than 0 for
and
. The basic estimator of
under single-stage cluster sampling is the simple expansion estimator,
It is design-unbiased, i.e.,
, where the subscript
indicates that the expectation is taken with respect to the sampling design. The design variance of
is given by
We consider estimators of
that takes the following quadratic form:
where
is determined so as to ensure that
is design-unbiased for
, i.e.,
. The typical choice is
Under regularity conditions, this also ensures that is design-consistent for .
2.2. Two-Stage Sampling
At the second stage, a sample is drawn from , independently from one selected cluster to another, using some probability sampling design. The probability of selecting unit in sample cluster is , and the second-stage design weight of unit in sample cluster is defined as . The joint probability of selecting units in the sample cluster and unit in the sample cluster is .
Again, we assume that
and
are all strictly greater than 0. Due to the above independence assumption,
for two units
and
in different clusters. This assumption is key for the validity of our bootstrap method, but we do not require the typical invariance assumption in two-stage sampling (see [
12], Chapter 4, and [
13]).
The double expansion estimator of the population total
is
where
. It follows from
,
, that
is design-unbiased for
. The design variance of
is given by
where
The first term on the right-hand side of (5), , is the variance under single-stage cluster sampling given in (1). The second term, , reflects the increase in variance due to the second stage of sampling.
The estimator
of
, given in (2), cannot be used in two-stage sampling because the cluster totals
,
, are unknown. Instead, we consider estimators of
that take the form:
where
The quantity
in (7) is determined so as to ensure that
,
, is conditionally design-unbiased for
, i.e.,
. The typical choice is
The independence assumption at the second stage implies that for two different sample clusters and . As a result, it is straightforward to show that in (6) is design-unbiased for .
A design-unbiased estimator of
is simply obtained as
The overall design variance in (5) can thus be estimated by
. Under regularity conditions,
is design-consistent for the design variance (5). An alternative decomposition for
is
, where
and
This alternative decomposition is used to develop our bootstrap methodology in
Section 4. For stratified sampling at the first stage, when all the first-stage sampling fractions are small, the term
is typically negligible. In that scenario, it is common practice to estimate the overall design variance (5) by
only and assume with-replacement sampling at the first stage. The Rao–Wu–Yue bootstrap method is based on this simplification. Although it is valid in many surveys, there are cases where some first-stage sampling fractions are not small, and
cannot be neglected.
Using (7), (9) and (10), the overall variance estimator
can be rewritten as
where
and
with
, for
, due to the independence assumption at the second stage. Assuming (3) and (8) are used, Equation (12) can be expressed as
where
and
. Expression (11) has a form similar to (2) for single-stage cluster sampling and is useful for the extension to three-stage sampling discussed in
Section 7.
2.3. Bootstrap Weights and Bootstrap Variance
Let us define the first-stage bootstrap weight for a sample cluster as
,
, where
is a random bootstrap weight adjustment for cluster
that is intended to account for the first-stage sampling variability.
Section 3 discusses how to generate
with a few examples for some common sampling designs. The second-stage bootstrap weight for a sample unit
in cluster
is defined as
,
and
, where
is a random bootstrap weight adjustment for unit
in cluster
that is intended to account for the second-stage sampling variability. A procedure for generating
will be discussed in
Section 4. The overall bootstrap weight for a sample unit
in cluster
is defined as
.
Using the bootstrap weights
and
, we obtain bootstrap versions of the simple expansion estimator,
, and the double expansion estimator
, given in (4), as
and
respectively, where
. The bootstrap weights are constructed so that the bootstrap expectation and variance of
and
are design-unbiased and design-consistent for the design expectation and variance of
and
, respectively. Greater detail is given in
Section 3 and
Section 4.
Suppose now that
is a nonlinear design-weighted estimator of a certain population parameter
, not necessarily a total, such that, for large samples,
. The bootstrap version
of
is obtained by replacing the first-stage and second-stage design weights by their corresponding bootstrap weights.
Section 8 provides an example where
is a smooth function of design-weighted estimators of totals. A valid bootstrap method ensures that, for large samples,
and
is design-consistent for
, where the subscript
* indicates the moments are taken with respect to the bootstrap mechanism, conditionally on the selected two-stage sample.
Except for linear estimators, such as
and
, discussed in
Section 3 and
Section 4, respectively, a closed-form expression for the bootstrap variance
does not exist. It is generally approximated through Monte Carlo simulations by generating bootstrap adjustments independently a large number of times, say
. We denote these
replicate versions of
and
by
and
, respectively,
. From these
bootstrap adjustments, we can compute
replicate versions
of
,
. The bootstrap variance,
, can then be approximated as
3. Bootstrap for Single-Stage Cluster Sampling
We consider distributions for generating the bootstrap adjustments
,
, that satisfy the following two conditions:
and
for every pair of sample clusters
and
. Conditions (15) and (16) ensure that
and
Conditions (17) and (18) are key for the validity of any bootstrap method. They ensure that the bootstrap expectation and variance of are design-unbiased and design-consistent for the design expectation and variance of , respectively.
Any distribution can be used to generate bootstrap adjustments as long as conditions (15) and (16) are satisfied. It may also be desirable to ensure that the bootstrap adjustments are all non-negative leading to non-negative bootstrap weights . We provide below some common expressions of bootstrap weight adjustments that achieve these objectives for different sampling designs.
3.1. Poisson Sampling
Under Poisson sampling, clusters are selected independently of one another. Assuming (3) is used, we have
,
, and
. Beaumont and Patak [
9] studied several distributions that satisfy conditions (15) and (16), including some that ensured non-negative bootstrap adjustments
. A main conclusion of their simulation study is that the choice of the distribution does not have a large impact on the properties of the bootstrap variance estimator as long as heavily-skewed distributions are avoided. We thus propose to generate
,
, independently of one another using the gamma distribution with
and
. This distribution satisfies conditions (15) and (16), ensures non-negative values of
and is easy to implement.
3.2. Sampling with Replacement with Unequal Probabilities
Suppose that a sample
of size
is drawn with replacement from the population
of size
. At each draw, the population cluster
is chosen with probability
and we have
. Note that some population clusters may appear multiple times in the sample
. In other words, each element of
is one of the
draws. Here, we define
and
. (This is a slight abuse of notation as
is not the selection probability of population cluster
in with-replacement sampling.) The standard variance estimator for
, proposed by [
14], can be written in the form (2) with
,
, and
. The Rao–Wu–Yue bootstrap weight adjustment can be used for this design. It is given by
for
. The so-called multiplicity
is the number of times cluster
is chosen out of
independent random trials, where each
has a constant probability of being chosen at each trial equal to
. By observing that the multiplicities
follow a multinomial distribution, it is straightforward to show that the bootstrap weight adjustments (19) satisfy conditions (15) and (16). They are non-negative provided that
. The typical choice is
. Using this choice, the bootstrap adjustment (19) reduces to
.
If
is large and
, the bootstrap weight adjustment (19) can be approximated as
. Using
is equivalent to applying the original bootstrap method of [
15] for independently and identically distributed observations.
3.3. Simple Random Sampling without Replacement
Suppose now that a simple random sample
of size
is drawn without replacement from the population
of size
. Under this design, it can be easily shown that (3) reduces to
,
, and
, where
is the sampling fraction. Rao, Wu and Yue [
1] did not provide bootstrap weights for this design. However, [
9] showed that the bootstrap method of [
7] can be implemented by using the following bootstrap weight adjustment:
for
. Expression (20) is similar to (19) but it incorporates the finite population correction
. Again, it is straightforward to show that the bootstrap weight adjustments (20) satisfy conditions (15) and (16). They are non-negative provided that
. The choice
dominates in practice.
3.4. Sampling without Replacement with Unequal Probabilities
The case of sampling
clusters without replacement with unequal probabilities is, in general, more difficult to handle as a simple expression, like (19) or (20), does not exist, and the joint selection probabilities
are often unknown. For the RHC sampling design, a design-unbiased variance estimator that takes the form (2) is available, and it is thus possible to construct valid bootstrap weights. The RHC sampling design was first considered by [
7], and an expression for the bootstrap weights is given in [
8]. Chaudhuri and Saha [
5] also developed a bootstrap method for the RHC sampling design.
Beaumont and Patak [
9] considered a method for generating bootstrap weight adjustments that satisfy (15) and (16) for general sampling designs. Their method requires the computation of joint selection probabilities or an accurate approximation of these probabilities. It is somewhat complex to implement and may produce negative bootstrap weights. As a simple alternative, we suggest below a heuristic extension of the bootstrap adjustment (20):
The bootstrap adjustment (21) satisfies condition (15) and, if (3) is used, also satisfies
. However, it does not satisfy (16) entirely since
for
; the right-hand side of (22) can only be viewed as an approximation of
given in (3). As a result, replacing
in (2) with
does not lead to an exactly design-unbiased estimator of
. By equating the right-hand side of (3) to the right-hand side of (22), we obtain the following approximation of the joint selection probability
,
:
We note that, if all the selection probabilities are equal to , then the right-hand side of (23) reduces to the joint selection probability under simple random sampling without replacement and the bootstrap adjustment (21) reduces to (20). The bootstrap adjustment (21) is also approximately equal to (19) if all the selection probabilities are small. Note also that the bootstrap adjustment (21) is exactly equal to 1, as expected, for clusters selected with certainty in the sample.
A drawback of the bootstrap adjustment (21) is that the bootstrap version of
,
, is random and not necessarily equal to the fixed sample size
. This can be dealt with by calibrating the bootstrap adjustment (21) so as to ensure that the calibrated bootstrap adjustments
satisfy
. An alternative to (21) is thus
with
given in (21). Note that clusters selected with certainty, if any, should be excluded from this calibration and keep their bootstrap adjustment of 1. Both (21) and (24) yield non-negative bootstrap adjustments when
and are easy to implement. This simplicity comes at the expense of losing the strict design-unbiasedness property of the resulting variance estimator.
It is beyond the scope of this paper to rigorously assess the quality of the approximation (23) and the bootstrap adjustments (21) and (24). However, these bootstrap adjustments are evaluated in a limited simulation study in
Section 9.1. Our simulation study shows that the bootstrap adjustment (21) alone leads to substantial overestimation of the design variance but yields small biases if it is modified by the calibration adjustment (24). In [
16], and recently in [
17], bootstrap methods were proposed for without-replacement sampling with probability proportional to size that could be used as an alternative to (24).
3.5. Stratified Sampling
All the above sampling designs are typically applied within strata, and sampling is done independently from one stratum to another. The extension of the above bootstrap methods to stratified sampling is straightforward. It only suffices to generate the bootstrap weight adjustments independently from one stratum to another using the appropriate method.
4. Bootstrap for Two-Stage Sampling
First, bootstrap adjustments
,
, that satisfy conditions (15) and (16) given in
Section 3, are generated. The objective is then to generate
for all sample units so as to ensure that
and
where
is given in (14). We assume that the adjustments
are generated independently from
,
and that
. This ensures that condition (25) is satisfied. If we further assume that
,
, are generated independently from one cluster
to another, then it is straightforward to show that
As a result, conditions (25) and (26) are satisfied if
and
There are many possible ways of generating
,
, that satisfy conditions (27) and (28) for each
. We propose generating the second-stage bootstrap adjustments as
where
,
, are preliminary second-stage bootstrap adjustments, generated independently from one cluster
to another, that satisfy
and
The methods described in
Section 3.1,
Section 3.2,
Section 3.3,
Section 3.4, which satisfy conditions (15) and (16), can be adapted in a straightforward manner to generate
,
, that satisfy conditions (30) and (31). For instance, if Poisson sampling is used at the second stage along with Equation (8), then
,
, can be generated independently of one another using the gamma distribution with
and
. In
Section 9.2, we provide an expression for
under simple random sampling without replacement at the second stage.
Three remarks are in order: (i) the bootstrap adjustment
in (29) is computable only if
; (ii) the bootstrap adjustment
is non-negative when
is non-negative and
; and (iii) if (3) is used,
, and the bootstrap adjustment (29) can be rewritten as
It is non-negative provided that . The bootstrap adjustments (29) or (32) satisfy conditions (27) and (28).
We can also show that
where
is the total of a certain variable
within cluster
. The right-hand side of (33) is a design-unbiased and design-consistent estimator of
. The property (33) is required for estimators that involve sums over first-stage and second-stage sample units (see
Section 8). The nonresponse-adjusted estimator discussed in
Section 6 is a special case that involves sums over first-phase and second-phase sample units.
Our bootstrap method can be summarized using the following steps:
- Step 1:
Generate first-stage bootstrap adjustments , , that satisfy conditions (15) and (16).
- Step 2:
Generate preliminary second-stage bootstrap adjustments , , independently from one cluster to another, that satisfy conditions (30) and (31).
- Step 3:
Compute the second-stage bootstrap adjustments , , using (29) or (32) for each .
- Step 4:
Compute the first-stage bootstrap weights , and, for each , the overall bootstrap weights , .
The above steps are repeated independently
times. Bootstrap variance estimates can then be computed, as described in
Section 2.3, using the bootstrap weights obtained in step 4. Note that standard bootstrap methods for single-stage sampling can be used to obtain bootstrap adjustments in steps 1 and 2. This is discussed in
Section 3 for the first-stage bootstrap adjustments in step 1. Preliminary second-stage bootstrap adjustments in step 2 can be obtained similarly.
5. Bootstrap for Two-Phase Sampling with Poisson Sampling at the Second Phase
In the next two sections, we consider, in greater detail, the case where all the population clusters are of size 1. Due to the independence assumption at the second stage, this special case of two-stage sampling is usually called two-phase sampling with Poisson sampling at the second phase. For the single unit
in population cluster
, we thus have
. The selection of a unit
at the second stage (or phase) is equivalent to the selection of its cluster
, and we can write
,
and
. Assuming (8) is used, we can also write
. The estimated cluster total
can be written as
, where
if the unit in cluster
is selected at the second phase and
, otherwise. We denote by
, the set of clusters (or units) selected at the second phase, i.e., the set of units
such that
. The double expansion estimator (4) reduces to
. The estimated variance (7) becomes
The alternative variance estimators
and
, given in (9) and (10), become
and
respectively. The bootstrap procedure in
Section 4 can still be applied. With the modified notation in this section, it becomes:
- Step 1:
Generate first-stage (first-phase) bootstrap adjustments , , that satisfy conditions (15) and (16).
- Step 2:
Generate preliminary second-stage (second-phase) bootstrap adjustments , , independently of one another so as to satisfy and . We suggest using the gamma distribution.
- Step 3:
Compute the second-stage (second-phase) bootstrap adjustments , , using (29) or (32) with and replaced with and , respectively.
- Step 4:
Compute the first-stage (first-phase) bootstrap weights , and the overall bootstrap weights , .
The application of these four steps ensures that conditions (25) and (26) are satisfied with and given in (34) and (35), respectively. The bootstrap estimator involved in these conditions can be expressed as .
6. Application to Nonresponse
Nonresponse is often viewed as a second phase of sampling, and it is typically assumed that sample units respond independently of one another. Poisson sampling is thus a useful model to handle nonresponse. However, the response probability
,
, is unknown. We assume that a consistent estimator
of
can be obtained under a specified nonresponse model. The bootstrap procedure of
Section 5 can still be used but
in step 2 must be replaced with
. Condition (25) remains satisfied, and we can show that
where
and
are given in (34) and (35), respectively. Condition (26) is thus asymptotically satisfied provided that
.
The linear double expansion estimator of the population total cannot be used in the context of nonresponse because is unknown. It can be replaced with to obtain the nonresponse-adjusted estimator . The bootstrap version of the nonresponse-adjusted estimator is , where is the bootstrap nonresponse-adjusted weight for unit and is the bootstrap version of .
In
Section 6.1, we provide two examples of response probability models with their associated estimators
and
. An alternative replication method was proposed by [
11]. This is discussed in
Section 6.2.
6.1. Examples
Uniform model
In the uniform model, the unknown response probability
is assumed to be constant, i.e.,
,
. It can be estimated by the design-weighted response rate
provided that the design weight
does not explain the response indicator
. The bootstrap version of
in (36) is
Unweighted versions of (36) and (37), obtained by omitting the design weight in the above two equations, are also used in practice.
The uniform model is usually assumed to hold within response homogeneous groups so that for all sample units that fall into group , where is the number of groups. The response rates (36) and (37) are then computed separately for each group so as to obtain and , .
Logistic model
Suppose now that the unknown response probability
can be modelled using the logistic function
, where
is a vector of auxiliary variables available for all
and
is a vector of unknown model parameters. The estimator
of
can be obtained by solving for
in the design-weighted (or pseudo) maximum likelihood estimating equation
provided that the design weight
does not explain
after conditioning on
. The response probability can then be estimated as
. The bootstrap version of (38) is
Solving (39) for yields and the bootstrap version of is . When , , the logistic model reduces to the uniform model.
6.2. The Kim-Yu Method
Kim and Yu [
11] proposed a replication variance estimation method for two-phase sampling with Poisson sampling at the second phase. Their method is not applicable to the double expansion estimator but can be used with reweighted estimators, such as the nonresponse-adjusted estimator discussed above under the uniform model. Using the notation of our paper, the second-phase bootstrap adjustments proposed by [
11] are given by
where
follows a Bernoulli distribution with probability
and
The adjustments (40) proposed by [
11] can be rewritten as
where
has a mean equal to 1 and a variance equal to
.
The second-phase adjustment (41) is similar but not identical to our proposed adjustments. It would be identical to the adjustment (32) if
in (41) were replaced with
. The adjustment (41) is expected to lead to overestimation of the variance. However, our simulation study in
Section 9.3 shows that the overestimation remains moderate.
7. Bootstrap for Multistage Sampling
Designs with more than two stages of sampling are sometimes used in practice. For these multistage sampling designs, our bootstrap procedure in
Section 4 can be applied repeatedly to generate bootstrap adjustments for each additional stage. First, bootstrap weight adjustments are generated for the first two stages of sampling. Then, the first two stages are viewed as a single stage, bootstrap weight adjustments for the third stage are generated and so on.
We provide below greater detail on our bootstrap procedure for three-stage sampling. To simplify the notation in this section, the first two stages are combined into a single stage so that the subscript
refers to second-stage units and the subscript
refers to third-stage units. As a result,
denotes the sample of second-stage units, and
denotes the sample of third-stage units within the selected second-stage unit
. Using this revised notation,
and the weight
, defined in
Section 2.2, and
, given in (12), are now written
,
and
, respectively.
For three-stage sampling, our bootstrap method can be summarized using the following steps:
- Step 1:
Generate combined first-stage and second-stage bootstrap adjustments
,
, that satisfy
and
. The bootstrap procedure in
Section 4 can be used to generate
,
, as the product of the first-stage and second-stage bootstrap adjustments.
- Step 2:
Generate preliminary third-stage bootstrap adjustments
,
, independently from one second-stage unit
to another, that satisfy
and
, where
is determined similarly to
in
Section 2.2.
- Step 3:
Compute the third-stage bootstrap adjustments
,
, similarly to (29), using
for each second-stage unit
. Note that
if
can be written in the form (13).
- Step 4:
Compute the overall bootstrap weights , for each and , where , and is the probability of selecting unit at the third stage.
8. Bootstrap for Nonlinear Estimators
In
Section 4, we developed the theory for the linear double expansion estimator (4) and proposed random bootstrap weights that satisfy conditions (25) and (26). In practice, nonlinear estimators are often used. The nonresponse-adjusted estimator in
Section 6 is an example. The bootstrap continues to be valid for nonlinear estimators under certain regularity conditions.
We provide below, through linearization techniques, an informal justification of the validity of the bootstrap method for nonlinear estimators that can be written as smooth functions of totals. A more formal justification is given in [
18] for simple random sampling without replacement at the first stage with negligible first-stage sampling fractions. Justification of the bootstrap method for nonlinear estimators that are implicitly defined through estimating equations is discussed in [
9,
19]. The latter provides a rigorous treatment of this topic. Greater detail on the theory of bootstrap can be found in [
20].
Suppose that we are interested in estimating the design variance of the nonlinear estimator
, where the function
is a smooth function,
,
,
and
and
are vectors for unit
in cluster
. Under regularity conditions, the estimator
is design-consistent for the population parameter
, where
,
and
. The bootstrap version of
is
, where
and
Using a first-order Taylor linearization, the sampling error
can be approximated as
where
and
. Using (42), the design variance of
is approximated as
Similarly, we can again use a first-order Taylor linearization to approximate the bootstrap error
as
From (44), the bootstrap variance of
is approximated as
The right-hand side of (45) is design-consistent for the right-hand side of (43) provided that , , , and are design-consistent for , , , and , respectively. Conditions (15), (16), (27) and (28) are key to obtain a valid bootstrap method. They ensure that , and are identical to standard textbook estimators of , and , respectively.
9. Simulation Studies
We conducted three simulation studies to evaluate our proposed bootstrap methods under three different sampling designs. First, our population for the simulation studies was taken from 50 clusters of the Canadian Health Measures Survey (CHMS). We considered two variables of interest from the CHMS: Body Mass Index (BMI) and Height. The CHMS aims to collect important health information through a household interview and direct physical measures and is based on a stratified three-stage sampling design. Greater detail on the CHMS can be found at:
https://www23.statcan.gc.ca/imdb/p2SV.pl?Function=getSurvey&SDDS=5071 (accessed on 28 January 2022).
From this population, we generated
samples using one of the three sampling designs described in
Section 9.1,
Section 9.2,
Section 9.3. Suppose that
is a weighted estimate of a certain population parameter
for the
th generated sample. The design variance of the estimator
,
, is approximated by the Monte Carlo variance
Then, for each of the
samples, 1000 sets of bootstrap weights were generated using one of the methods discussed in
Section 3,
Section 4,
Section 5,
Section 6, and the bootstrap variance was obtained as described at the end of
Section 2.3. Let us denote by
, the bootstrap variance estimate for the
th sample for any given sampling design and bootstrap method. The percent Monte Carlo Relative Bias (RB) of the bootstrap variance estimator is defined as
and its percent Monte Carlo Relative Root Mean Square Error (RRMSE) is defined as
where
is the percent relative error. We evaluated bootstrap methods for the three sampling designs described in
Section 9.1,
Section 9.2,
Section 9.3 by comparing their RBs and RRMSEs. The SAS code used to obtain our simulation results is available upon request from the authors.
9.1. Single-Stage Cluster Sampling without Replacement with Unequal Probabilities
The objective of this simulation study is to evaluate the accuracy of the bootstrap adjustments (21) and (24) under single-stage cluster sampling without replacement with probability proportional to size.
From our population of
clusters, we generated
samples using sequential Poisson sampling [
21], where the size measure is the cluster size. We considered two sample sizes:
and
. The sampling fraction is 0.20 for
and 0.60 for
. We compared the RB and RRMSE of different bootstrap methods of estimating the design variance of the expansion estimator
for two variables of interest: BMI and Height.
Table 1 describes the five methods we considered in this simulation study to obtain bootstrap weight adjustments
,
.
Relative biases are provided in
Table 2. As expected, the RWY method shows large positive biases, especially for the largest sampling fraction, as it is valid under with-replacement sampling. The SRS-RWY method is surprisingly not overly poor in terms of bias, except for the variable BMI and
, where the RB is not small (34.6%). The PPS-RWY method shows very large biases, whereas PPS-RWY-CAL is the best method in terms of bias.
In other words, it appears that modifying the bootstrap adjustment (21) by the calibration adjustment (24) is effective for taking into account the unknown joint selection probabilities and the fixed sample size. The POI-CAL method is also effective, although it shows negative RBs, especially for the smallest sample size. The POI-CAL method is sometimes used in practice for its simplicity.
Table 3 gives the RRMSEs. Both RWY and PPS-RWY show large RRMSEs due to their large biases. The other three methods are comparable, perhaps with a small advantage for POI-CAL.
Overall, the proposed PPS-RWY-CAL method offers a good compromise in terms of both bias and mean square error. The POI-CAL method is an interesting alternative, especially for larger sample sizes.
9.2. Two-Stage Sampling with Poisson Sampling at the First Stage and Simple Random Sampling at the Second Stage
The objective of this second simulation study is to evaluate the accuracy of the bootstrap adjustment (32) under a two-stage sampling design. At the first stage, Poisson sampling was used to select a sample of clusters, with probability proportional to cluster size, among the population clusters. We considered two values of the expected sample size: and . For each of the selected clusters at the first stage, a second-stage sample of 10 units was selected using simple random sampling without replacement.
This process was repeated times. For each of the 10,000 samples, we computed double expansion estimates of the population total for the variables BMI and Height. We also computed weighted estimates of the population median for these variables using the methodology implemented in the procedure Univariate of SAS for weighted percentiles.
We considered two bootstrap methods for estimating design variances: (i) the RWY method described in
Table 1 and (ii) the proposed method described in
Section 4. In our proposed method, we generated first-stage bootstrap adjustments using the gamma distribution described in
Section 3.1. Then, we generated preliminary second-stage bootstrap adjustments within each selected cluster
by applying the method in
Section 3.3 for simple random sampling without replacement, i.e., we used
where
is the number of units selected in cluster
,
is the sampling fraction in cluster
,
is the number of times sample unit
in cluster
is chosen out of
independent random trials, where each sample unit
i in cluster
has a constant probability of being chosen at each trial equal to
. Finally, we computed second-stage bootstrap adjustments using Equation (32).
Table 4 provides RBs for both the RWY bootstrap method and the proposed method, whereas the RRMSEs are given in
Table 5. The RWY method leads to significant negative biases for the estimation of population totals because it does not account for the random sample size at the first stage due to Poisson sampling. As expected, the proposed method shows small biases and smaller RRMSEs compared with the RWY method.
Estimators of population means or medians are typically less affected by random sample sizes than estimators of population totals. As a result, the RWY method shows moderate positive biases for the estimation of population medians due to the non-negligible first-stage sampling fractions. Again, the proposed method shows smaller biases and RRMSEs than the RWY method.
These results are not surprising since the RWY method is not intended for Poisson sampling and/or non-negligible first-stage sampling fractions. It remains a useful method in many practical scenarios. This simulation study simply illustrates that the sampling design must be properly taken into account when choosing a bootstrap method.
9.3. Two-Phase Sampling with Simple Random Sampling at the First Phase and Nonresponse at the Second Phase
The objective of this last simulation study is to evaluate our bootstrap procedure in
Section 5 and
Section 6 under two-phase sampling with nonresponse at the second phase. Our population for the variable BMI is a random subset of all the units in the 50 population clusters that were used in
Section 9.1 and
Section 9.2. This gave a total of 6227 population units. The population for the variable Height was constructed similarly by selecting a random subset of all the units in the same 50 clusters, which resulted in 6439 units.
For both populations, simple random sampling without replacement was used to select a first-phase sample with a sampling fraction of 0.5. From the first-phase sample units, nonresponse was then generated using Bernoulli sampling with probability 0.05 (uniform nonresponse). This process was again repeated times. For each of the 10,000 samples, we computed nonresponse-adjusted estimates of the population total for the variables BMI and Height, where the estimated probability is given in (36).
We considered two bootstrap methods for estimating design variances: (i) the Kim–Yu method described in
Section 6.2 with
and (ii) the proposed method described in
Section 5 and
Section 6. In our proposed method, we generated first-phase bootstrap adjustments using Equation (20) in
Section 3.3. Then, we generated preliminary second-phase bootstrap adjustments using the gamma distribution with mean 1 and variance
and computed the second-phase bootstrap adjustments using (32) as described in step (3) of
Section 5. In both methods, the bootstrap version of
is
, where
is given in (37).
Table 6 provides RBs and RRMSEs for both bootstrap methods. The Kim–Yu method shows a moderate positive bias for both variables whereas the proposed method has a negligible bias, albeit negative. The larger RRMSEs of the Kim–Yu method can be explained by its larger bias.
10. Concluding Remarks
We developed a simple and unified method for producing non-negative bootstrap weights that can be applied to any multistage sampling design provided that valid bootstrap weights can be produced for each distinct stage of sampling. This is not a major limitation in practice as most bootstrap methods for single-stage sampling can be implemented through the computation of bootstrap weights, including the pseudo-population method (see [
9,
10,
16,
22]).
We provided a few expressions for bootstrap weights under different single-stage sampling designs, including probability proportional to size without replacement sampling. For this design, we proposed a simple heuristic extension of the Rao–Wu–Yue bootstrap weights that performed well in our empirical investigations.
We assumed that the selection and joint selection probabilities were strictly greater than 0 at each stage, which excludes systematic sampling in one of the stages. For systematic sampling, the usual practical solution is to approximate this design with stratified simple random sampling without replacement, where strata are formed based on the order of the population units in the sampling frame.
Our bootstrap method for multistage sampling can be used even when first-stage sampling fractions are not small—unlike the commonly used Rao–Wu–Yue bootstrap weights. The results of simulation studies indicate that our method has good properties in terms of bias and mean square error.
As indicated in the introduction, there are alternatives bootstrap methods for some specific multistage sampling designs, such as simple random sampling without replacement at each stage. Although we did not investigate these methods, we do not expect our bootstrap method to have necessarily better (or worse) statistical properties. They are all valid methods that correctly track the first two design moments of estimators. The main appeal of our method is its simplicity and generality; it can be applied to most multistage sampling designs used in practice through a simple formula that combines the bootstrap weights at each distinct stage of sampling.
Our bootstrap method is also directly applicable to two-phase sampling designs when Poisson sampling is used at the second phase; it can thus handle nonresponse weighting. More research is required for general two-phase or multi-phase sampling designs.
Author Contributions
Methodology, J.-F.B.; software, N.É.; validation, J.-F.B. and N.É.; writing—original draft preparation, J.-F.B.; writing—review and editing, J.-F.B. and N.É. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
The data are not publicly available due to confidentiality restrictions.
Acknowledgments
We would like to thank Fulvia Mecatti for the kind invitation to contribute to this special issue and all three reviewers for their comments, which helped improve the clarity of our manuscript.
Conflicts of Interest
The authors declare no conflict of interest.
Disclaimer
The content of this paper represents the authors’ opinions and not necessarily those of Statistics Canada.
References
- Rao, J.N.K.; Wu, C.F.J.; Yue, K. Some recent work on resampling methods for complex surveys. Surv. Methodol. 1992, 18, 209–217. [Google Scholar]
- Funaoka, F.; Saigo, H.; Sitter, R.R.; Toida, T. Bernoulli bootstrap for stratified multistage sampling. Surv. Methodol. 2006, 32, 151–156. [Google Scholar]
- Preston, J. Rescaled bootstrap for stratified multistage sampling. Surv. Methodol. 2009, 35, 227–234. [Google Scholar]
- Saigo, H. Comparing four bootstrap methods for stratified three-stage sampling. J. Off. Stat. 2010, 26, 193–207. [Google Scholar]
- Chaudhuri, A.; Saha, A. Extending Sitter’s mirror-match bootstrap to cover Rao-Hartley-Cochran sampling in two-stages with simulated illustrations. Sankhyā Indian J. Stat. 2004, 66, 791–802. [Google Scholar]
- Rao, J.N.K.; Hartley, H.O.; Cochran, W.G. On a simple procedure of unequal probability sampling without replacement. J. R. Stat. Soc. Ser. B 1962, 24, 482–491. [Google Scholar] [CrossRef]
- Rao, J.N.K.; Wu, C.F.J. Resampling inference with complex survey data. J. Am. Stat. Assoc. 1988, 83, 231–241. [Google Scholar] [CrossRef]
- Osiewicz, M.; Pérez-Duarte, S. Flexible variance estimation in complex sample surveys: Rescaled bootstrap in multistage, pps surveys. In Proceedings of the New Techniques and Technologies for Statistics, Brüssel, Belgium, 5–7 March 2013. [Google Scholar]
- Beaumont, J.F.; Patak, Z. On the generalized bootstrap for sample surveys with special attention to Poisson sampling. Int. Stat. Rev. 2012, 80, 127–148. [Google Scholar] [CrossRef]
- Mashreghi, Z.; Haziza, D.; Léger, C. A survey of bootstrap methods in finite population sampling. Stat. Surv. 2016, 10, 1–52. [Google Scholar] [CrossRef]
- Kim, J.K.; Yu, C.L. Replication variance estimation under two-phase sampling. Surv. Methodol. 2011, 37, 67–74. [Google Scholar]
- Särndal, C.-E.; Swensson, B.; Wretman, J. Model Assisted Survey Sampling; Springer: New York, NY, USA, 1992. [Google Scholar]
- Beaumont, J.-F.; Haziza, D. A note on the concept of invariance in two-phase sampling designs. Surv. Methodol. 2016, 42, 319–323. [Google Scholar]
- Hansen, M.H.; Hurwitz, W.N. On the theory of sampling from finite populations. Ann. Math. Stat. 1943, 14, 333–362. [Google Scholar] [CrossRef]
- Efron, B. Bootstrap methods: Another look at the jackknife. Ann. Stat. 1979, 7, 1–26. [Google Scholar] [CrossRef]
- Ranalli, M.G.; Mecatti, F. Comparing recent approaches for bootstrapping sample survey data: A first step towards a unified approach. In JSM Proceedings; Survey Research Methods Section; American Statistical Association: Alexandria, VA, USA, 2012; pp. 4088–4099. [Google Scholar]
- Conti, P.L.; Mecatti, F.; Nicolussi, F. Efficient unequal probability resampling from finite populations. Comput. Stat. Data Anal. 2022, 167, 107366. [Google Scholar] [CrossRef]
- Chauvet, G. Coupling methods for multistage sampling. Ann. Stat. 2015, 43, 2484–2506. [Google Scholar] [CrossRef]
- Kim, J.K.; Rao, J.N.K.; Wang, Z. Hypotheses testing from complex survey data using bootstrap weights: A Unified Approach. arXiv 2019, arXiv:1902.08944. [Google Scholar]
- Shao, J.; Tu, D. The Jackknife and Bootstrap; Springer Science & Business Media: New York, NY, USA, 2012. [Google Scholar]
- Ohlsson, E. Sequential poisson sampling. J. Off. Stat. 1998, 14, 149–162. [Google Scholar]
- Conti, P.L.; Mecatti, F. Resampling under complex sampling designs: Roots, development and the way forward. Stats 2022, 5, 258–269. [Google Scholar] [CrossRef]
Table 1.
Bootstrap methods for single-stage cluster sampling.
Table 1.
Bootstrap methods for single-stage cluster sampling.
Method | Description |
---|
RWY | Bootstrap adjustment (19) obtained under with-replacement sampling with as proposed in [1]. |
SRS-RWY | Bootstrap adjustment (20) obtained under simple random sampling without replacement. |
PPS-RWY | Proposed bootstrap adjustment (21) for sampling without replacement with probability proportional to size. |
PPS-RWY-CAL | Proposed bootstrap adjustment (21) for sampling without replacement with probability proportional to size modified by the calibration adjustment (24). |
POI-CAL | Bootstrap adjustment under Poisson sampling, using the gamma distribution discussed in Section 3.1, modified by the calibration adjustment (24). |
Table 2.
The percent relative biases of the bootstrap variance estimators.
Table 2.
The percent relative biases of the bootstrap variance estimators.
Method | BMI | Height |
---|
| | | |
---|
RWY | 31.2 | 236.4 | 25.1 | 145.8 |
SRS-RWY | 4.9 | 34.6 | 0.1 | −1.7 |
PPS-RWY | 127.7 | 4288.6 | 1847.8 | 68,959.1 |
PPS-RWY-CAL | 1.0 | 1.5 | −0.2 | −1.9 |
POI-CAL | −15.8 | −2.1 | −17.3 | −6.3 |
Table 3.
The percent relative root mean square errors of the bootstrap variance estimators.
Table 3.
The percent relative root mean square errors of the bootstrap variance estimators.
Method | BMI | Height |
---|
| | | |
---|
RWY | 64.0 | 243.4 | 56.0 | 152.3 |
SRS-RWY | 45.0 | 41.6 | 40.1 | 17.7 |
PPS-RWY | 161.4 | 4335.6 | 2021.1 | 69,700.3 |
PPS-RWY-CAL | 44.2 | 30.0 | 41.8 | 24.9 |
POI-CAL | 39.9 | 28.8 | 38.5 | 24.8 |
Table 4.
The percent relative biases of the bootstrap variance estimators.
Table 4.
The percent relative biases of the bootstrap variance estimators.
Method | Population Parameter | BMI | Height |
---|
| | | |
---|
RWY | Total | −98.8 | −97.2 | −99.7 | −99.3 |
Proposed | Total | −1.8 | 1.3 | 0.71 | 4.3 |
RWY | Median | 14.6 | 24.7 | 21.5 | 12.8 |
Proposed | Median | −2.9 | 10.0 | 3.8 | 6.8 |
Table 5.
The percent relative root mean square errors of the bootstrap variance estimators.
Table 5.
The percent relative root mean square errors of the bootstrap variance estimators.
Method | Population Parameter | BMI | Height |
---|
| | | |
---|
RWY | Total | 98.8 | 97.2 | 99.7 | 99.3 |
Proposed | Total | 29.1 | 16.6 | 28.9 | 17.5 |
RWY | Median | 88.3 | 58.6 | 126.4 | 50.6 |
Proposed | Median | 65.9 | 42.9 | 87.6 | 41.1 |
Table 6.
The percent RBs and percent RRMSEs of the bootstrap variance estimators.
Table 6.
The percent RBs and percent RRMSEs of the bootstrap variance estimators.
Method | RB | RRMSE |
---|
BMI | Height | BMI | Height |
---|
Kim–Yu | 22.8 | 23.0 | 30.6 | 29.2 |
Proposed | −2.6 | −2.4 | 16.4 | 14.4 |
| Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).