Next Article in Journal
New Results About Aggregation Functions of Quasi-Pseudometric Modulars
Previous Article in Journal
A Review of Link Prediction Algorithms in Dynamic Networks
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Point and Interval Estimation of Population Prevalence Using a Fallible Test and a Non-Probabilistic Sample: Post-Stratification Correction

by
Jorge Mario Estrada Alvarez
1,†,
Juan de Dios Luna del Castillo
2,† and
Miguel Ángel Montero-Alonso
2,*,†
1
Caja de Compensación Familiar de Risaralda, Salud Comfamiliar, Pereira 660003, Colombia
2
Department of Statistics and Operational Research, University of Granada, 18071 Granada, Spain
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Mathematics 2025, 13(5), 805; https://doi.org/10.3390/math13050805
Submission received: 17 January 2025 / Revised: 25 February 2025 / Accepted: 26 February 2025 / Published: 28 February 2025
(This article belongs to the Special Issue Current Research in Biostatistics, 2nd Edition)

Abstract

:
Accurate prevalence estimation is crucial for public health planning, particularly for rare diseases or low-prevalence conditions. This study evaluated frequentist and Bayesian methods for estimating prevalence, addressing challenges such as imperfect diagnostic tests, partial disease status verification, and non-probabilistic samples. Post-stratification was applied as a novel method and was used to improve representativeness and correct biases. Three scenarios were analyzed: (1) complete verification using a gold standard, (2) estimation with a diagnostic test of known sensitivity and specificity, and (3) partial verification of disease status limited to test positives. In all scenarios, post-stratification adjustments increased prevalence estimates and interval lengths, highlighting the importance of accounting for population variability. Bayesian methods demonstrated advantages in integrating prior information and modeling uncertainty, particularly under high-variability and low-prevalence conditions. Key findings included the flexibility of Bayesian approaches to maintain estimates within plausible ranges and the effectiveness of post-stratification in correcting biases in non-probabilistic samples. Frequentist methods provided narrower intervals but were limited in addressing inherent uncertainties. This study underscores the need for methodological adjustments in epidemiological studies, offering robust solutions for real-world challenges. These results have significant implications for improving public health decision-making and the design of prevalence studies in resource-constrained or non-probabilistic contexts.

1. Introduction

Estimating prevalence in epidemiological studies is a fundamental component of public health planning, particularly for pathological conditions with low population prevalence. However, this process faces numerous methodological challenges, such as the need for large sample sizes and the implementation of complex designs for probabilistic sample selection [1,2]. Additionally, using diagnostic tests with inaccuracies in the absence of a gold standard or the limited application of such tests to only a fraction of the sample adds complexity to the analysis and may introduce bias. Furthermore, it is crucial to employ advanced interval estimation methods that overcome the inherent limitations of extremely low prevalence rates, particularly those below 0.1 [3].
Various authors have addressed the methodological challenges associated with estimating prevalence in the context of low-prevalence conditions and imperfect diagnostic tests. For example, Rogan and Gladen [4] proposed a classical approach for inference and bias correction in estimated prevalence when using tests with sensitivities and specificity below 1.0 . This approach has been revisited and expanded more recently by Izbicki [5]. Similarly, Reiczigel [6] proposed an exact method to construct confidence intervals when employing tests with known sensitivity and specificity, a contribution later developed further by Lang et al. [7].
Another methodological challenge in estimating low prevalence is the combined use of a fallible diagnostic test within a two-phase cross-sectional design, where verification via a gold standard is performed only on positive cases. This issue was addressed by Thomas et al. [8], who developed frequentist and Bayesian estimators to correct the bias introduced by this verification scheme.
However, while probabilistic samples enable robust population-level inference, their implementation often involves high costs and significant limitations in resource-constrained settings. In such contexts, convenience and nonprobabilistic samples present a practical alternative. Various methodological approaches have been proposed for the estimation with nonprobabilistic samples [9,10]. A notable example is the use of post-stratification estimators, as proposed by Smith [11], which leverage auxiliary population-level covariate distributions to correct prevalence estimates. Designed initially to mitigate nonresponse bias in survey sampling, this technique is an effective tool to adjust bias in nonprobabilistic samples by estimating selection probabilities.
This study aims to present and evaluate some of the methods described previously to estimate the prevalence of rare or orphan diseases, incorporating adjustments proposed by various authors. These methods address contexts involving fallible diagnostic tests and partial verification schemes applied exclusively to individuals who test positive. Additionally, a novel post-stratification approach is integrated as a strategy to correct prevalence estimates derived from nonprobabilistic samples. The methods and adjustments are applied and analyzed in different scenarios using real-world data.

2. Materials and Methods

Three scenarios were established using different interval estimation methods within both frequentist and Bayesian frameworks. For each approach, estimates with and without post-stratification adjustment were compared. This sensitivity analysis permits the evaluation of the impact of post-stratification on probably bias reduction, showing that although complete bias removal is not guaranteed, the adjustment likely improves the accuracy of prevalence estimates in non-probabilistic samples.

2.1. Estimation of a Proportion in Non-Probabilistic Samples with Post-Stratification Adjustment

Post-stratification is a weighting adjustment technique used in nonprobabilistic samples. It was initially proposed to correct various biases in probabilistic samples, such as under-coverage, over-coverage, and nonresponse bias. This technique involves adjusting the weights of the sample units to reflect the known proportions of a target population divided into strata. However, these strata are constructed after the sample has been selected. In the ideal case, using a gold standard test, the prevalence estimate adjusted for post-stratification, as proposed by Holt et al. [12], in standard notation, assumes that the population comprises N units, which can be uniquely divided into H strata of sizes N 1 , N 2 , , N H , such that h = 1 H N h = N .
Let y i be a variable taking values y h i , where h = 1 , , H and i = 1 , , N h . The selected sample is of fixed size n, which, after selection, is distributed among the strata according to the vector n = ( n 1 , n 2 , , n H ) , where h = 1 H n h = n . The components of n are unknown until the sample is drawn. The prevalence estimator for the population in each stratum h is defined as:
p ^ h = i = 1 n h y i h n h
where y i is an indicator variable such that y i = 1 if the individual in the sample is positive and y i = 0 otherwise. Furthermore, x h = i = 1 n h y i h represents the total number of cases identified in stratum h.
Thus, the population prevalence estimator adjusted for post-stratification, p ^ w , and its variance, V ^ ( p ^ w ) , are expressed as follows:
p ^ w = h = 1 H N h N p ^ h
The variance of the weighted prevalence estimator p ^ w is derived using the formula for the variance of a weighted sum of independent random variables, incorporating the finite population correction factor for each stratum. Specifically, the expression considers the variance of p ^ h in each stratum and the proportion of the total population in that stratum:
V ^ ( p ^ w ) = h = 1 H 1 n h N h N h N 2 p ^ h ( 1 p ^ h ) n h 1
Here, V ^ ( p ^ h ) = p ^ h ( 1 p ^ h ) n h 1 represents the estimated variance for each stratum with the finite population correction factor ( 1 n h N h ) applied.

2.2. Scenario 1: Prevalence Estimation with Full Verification Using a Gold Standard

2.2.1. Frequentist Approach

Let P denote the unknown population prevalence to be estimated from a probabilistic sample of size n. Following the interval proposed by Agresti and Coull [3], we employ an adjustment—the commonly known “add two successes and two failures” method—that modifies both the sample size and the observed number of successes. Specifically, we define the adjusted sample size as follows:
n ˜ n + z α 2
where z α / 2 denotes the critical value of the standard normal distribution corresponding to a significance level of α . Likewise, the adjusted prevalence estimator is given as follows:
p ˜ = x + z α / 2 2 2 n ˜
where x represents the number of observed successes. Based on these definitions, an interval for the population prevalence P is expressed as follows:
P p ˜ ± z α p ˜ n ˜ ( 1 p ˜ )
  • Post-Stratification Adjustment
    Let P ˜ denote the adjusted prevalence, where w h represents the weight of stratum h in the population, and p ^ h is the estimated prevalence within stratum h. The adjustment for the sample size per stratum, n ˜ h , is given as follows:
    n ˜ h = n h + z α / 2 2
    The estimator p ˜ must also be adjusted using the weights and prevalences of each stratum and would be expressed as follows:
    p ˜ h = x h + z α 2 2 n ˜ h
    For the weighted prevalence, the formula is as follows:
    P ˜ = h = 1 H w h p ˜ h
    The weighted variance is given as follows:
    Var ( P ˜ ) = h = 1 H w h 2 p ˜ h ( 1 p ˜ h ) n ˜ h
    The confidence interval is as follows:
    P P ˜ ± z α Var ( P ˜ )

2.2.2. Bayesian Approach

From a Bayesian perspective, an interval for the parameter P can be derived using a beta-binomial model, which is recognized for its favorable frequentist properties [13]. This approach also accommodates post-stratification adjustments to account for sampling biases.
For the prevalence in each stratum, we use a beta distribution as the prior:
P h Beta ( α h , β h )
In this specific case, we use a non-informative prior P h Beta ( 1 , 1 ) . The likelihood of observing x h positives in a sample of size n h in stratum h follows a binomial distribution; that is, x h Binomial ( n h , P h ) . Consequently,
P ( x h | n h , P h ) = n h x h P h x h ( 1 P h ) n h x h
By applying Bayes’ theorem, the posterior distribution for P h is obtained by combining the prior and the likelihood:
P ( P h | x h , n h ) P ( x h | n h , P h ) · P ( P h )
Substituting the binomial likelihood and the Beta ( α h , β h ) prior into this expression yields the following:
P ( P h | x h , n h ) P h x h ( 1 P h ) n h x h · P h α h 1 ( 1 P h ) β h 1 .
where α h = 1 and β h = 1 , the posterior simplifies to the following:
P ( P h | x h , n h ) P h x h ( 1 P h ) n h x h
The adjusted prevalence, ( P ˜ ) , is a weighted combination of subgroup prevalences, where the weights are the proportions of each stratum in the total population. During the Bayesian inference process, samples are drawn from the posterior distribution of P h for each stratum h, which we denote as P h s , where s indicates a specific sample from the posterior distribution.
P ˜ s = h = 1 H w h P h s
Based on the above, a 95 % credible interval for the adjusted prevalence P ˜ can be defined as follows:
CIr 95 % ( P ˜ ) = P ˜ 2.5 , P ˜ 97.5
where P ˜ 2.5 and P ˜ 97.5 denote the 2.5th and 97.5th percentiles, respectively, of the posterior distribution of P ˜ .

2.3. Scenario 2: Estimation with a Single Diagnostic Test with Known Sensitivity and Specificity

In most practical situations, using a gold standard test is not feasible either because of the complexity of its application or because of the high associated costs. This limitation is further exacerbated when, due to sampling schemes, whether simple or complex, the gold standard is not available for all selected individuals. In addition, its use may be contraindicated in some cases due to specific indications.

2.3.1. Frequentist Approach

Reiczigel [6] proposes the estimation of prevalence using an exact method that corrects the bias introduced by an imperfect diagnostic test, assuming that its sensitivity and specificity are known. Later, Lang et al. [7] extended this proposal by combining the formula of Rogan and Gladen with an adjustment based on the method proposed by Agresti and Coull.
Let S e and S p denote the sensitivity and specificity obtained from an independent study of an imperfect diagnostic test used to estimate the prevalence P of a condition or disease. The apparent prevalence ( A P ) is the proportion of positive results obtained from the test in the target population. According to Rogan and Gladen, the point estimator of P is calculated as follows:
p ^ = A P ^ + S p ^ 1 S e ^ + S p ^ 1
if the adjustment is applied to the interval proposed by Agresti and Coull on A P ^ :
n ˜ = n + z α 2
A P ^ = n · A P ^ + z α / 2 2 2 n ˜
Thus, both the corrected point estimate of P and its confidence interval are constructed as follows:
p ˜ = A P ^ + S p 1 S e + S p 1
p ˜ ± z α / 2 1 ( S e + S p 1 ) A P ^ ( 1 A P ) ^ n 1 / 2
  • Post-Stratification Adjustment
    Applying the adjustment proposed by Agresti and Coull for each stratum h:
    A P h ^ = x h + z α / 2 2 2 n h + z α / 2 2
    p ˜ h = A P h ^ + S p 1 S e + S p 1
    The estimator for the prevalence adjusted by post-stratification and the known sensitivity and specificity of the test, P ˜ , is as follows:
    P ˜ = h = 1 H w h p ˜ h
    The variance for each stratum is as follows:
    Var ( p ˜ h ) = 1 ( S e + S p 1 ) 2 · A P h ^ ( 1 A P h ^ ) n ˜ h
    The combined total variance is as follows:
    Var ( P ˜ ) = h = 1 H w h 2 · Var ( p ˜ h )
    Finally, the confidence interval adjusted for post-stratification is as follows:
    P ˜ ± z α / 2 · Var ( P ˜ )

2.3.2. Bayesian Approach

Flor et al. [14] proposed a comparative evaluation of point and interval estimation methods utilizing both frequentist and Bayesian approaches to estimate prevalence with an imperfect diagnostic test. In this context, the Bayesian approach offers the advantage of incorporating prior information about the test’s sensitivity and specificity, which can improve the precision of the estimates.
In a Bayesian framework, the true prevalence ( P ) is modeled as a random parameter with an associated prior distribution. The sensitivity ( S e ) and specificity ( S p ) of the diagnostic test, previously defined, are treated as known parameters obtained from independent studies. The probability of observing x positive cases in a sample of size n, assuming that the probability of success is given by the apparent prevalence, is modeled by a binomial distribution x Binomial ( n , A P ) .
The apparent prevalence is related to the true prevalence P, as well as the sensitivity and specificity of the diagnostic test, by the following equation:
A P = P · S e + ( 1 P ) · ( 1 S p )
The likelihood function for observing x positive cases is as follows:
P ( x | n , P , S e , S p ) = n x · A P x · ( 1 A P ) n x .
Based on Bayes’ theorem, the posterior distribution for P is as follows:
P ( P | x , n , S e , S p ) P ( x | n , P , S e , S p ) · P ( P )
Substituting the binomial likelihood and non-informative prior yields the following:
P ( P | x , n , S e , S p ) n x · P · S e + ( 1 P ) · ( 1 S p ) x · 1 P · S e + ( 1 P ) · ( 1 S p ) n x · Beta ( 1 , 1 )
The posterior simplifies to the following:
P ( P | x , n , S e , S p ) P · S e + ( 1 P ) · ( 1 S p ) x · 1 P · S e + ( 1 P ) · ( 1 S p ) n x
Numerical methods such as MCMC (Markov Chain Monte Carlo) can be used to obtain the posterior distribution of P. This approach allows for the generation of samples from the posterior distribution of P and the provision of inferences about the true prevalence.
  • Post-Stratification Adjustment
    The previous Bayesian estimation can be complemented with a post-stratification adjustment. For H strata, each with a prevalence P h , and each stratum having a proportion w h of the total population, the total prevalence adjusted for post-stratification can be calculated as follows:
    P ( P h | x h , n h , S e , S p ) P h · S e + ( 1 P h ) · ( 1 S p ) x h · 1 P h · S e + ( 1 P h ) · ( 1 S p ) n h x h
    P ˜ = h = 1 H w h · P h
    However, when estimating the prevalence P h for each stratum h under a Bayesian approach, we do not obtain a single point estimate but rather a posterior distribution for P h . Thus, this post-stratification must be based on posterior distributions.
    Suppose we have obtained S samples from the posterior distribution of P h for H subgroups. We denote these samples for each subgroup h as follows:
    P h ( 1 ) , P h ( 2 ) , , P h ( S )
    For each iteration s in the posterior, we weigh the prevalence P h ( s ) by the proportion w h of the stratum h in the total population. In this way, the adjusted prevalence in iteration s is as follows:
    P ˜ ( s ) = h = 1 H w h · P h ( s )
    This process is repeated for each sample s = 1 , , S , resulting in a combined posterior distribution for the total prevalence adjusted by post-stratification:
    P ˜ ( 1 ) , P ˜ ( 2 ) , , P ˜ ( S )
    From the samples P ˜ ( s ) , it is possible to calculate any credible interval. For example, the 2.5th and 97.5th percentiles of the posterior distribution of P ˜ can be used to define a credible interval 95 % . Similarly, the point estimate is obtained as the median of the posterior distribution of P ˜ .

2.4. Scenario 3: Estimation When the Status of the Disease Is Verified Only Among Test Positives

2.4.1. Frequentist Approach

In diagnostic processes or public health surveys, verifying the true disease status is common only in individuals who test positive on an initial screening test. For cost or ethical reasons, this verification is typically restricted to those with a high probability of disease based on screening results. In this context, Thomas et al. [8] derived maximum likelihood estimators for disease prevalence using both frequentist and Bayesian approaches, assuming that the sensitivity ( S e ) and specificity ( S p ) of the screening test are known with certainty. The key steps are briefly described below (for additional algebraic details, we refer the reader to the supplementary materials of Thomas et al. [8], where a complete derivation is provided):
The likelihood function is given as follows:
L ( P ) = [ ( 1 S e ) P + S p ( 1 P ) ] n . 0 ( S e · P ) n 11 ( ( 1 S p ) ( 1 P ) ) n 01
where n 11 denotes the number of true positives, n 01 denotes the number of false positives among those tested positive, n . 0 denotes the total number of test negatives, and P denotes the prevalence and the sensitivity and specificity.
The likelihood function is constructed based on the observed values and the probability of test results given the true health condition of the individuals (see [8] for more details). The maximum likelihood estimator (MLE) of prevalence, p ^ , arises from differentiating the log-likelihood with respect to P and setting it to zero. Upon expanding and simplifying the derivative of
log L ( P ) = n . 0 log [ ( 1 S e ) P + S p ( 1 P ) ] + n 11 log ( S e P ) + n 01 log [ ( 1 S p ) ( 1 P ) ]
a quadratic equation in P is obtained, and the explicit solution to this equation is given as follows:
p ^ = B B 2 4 A C 2 A
where the terms A, B, and C are defined as follows:
A = ( 1 S e S p ) n
B = ( 1 S e S p ) ( n . 0 + n 11 ) S p ( n 11 + n 01 )
C = n 11 S p
Here, n represents the total sample size (the sum of individuals who tested positive and negative on the screening test).
The asymptotic variance of p ^ follows from the standard large-sample theory for maximum likelihood estimators, where it is given by the inverse of the Fisher information (i.e., the negative second derivative of the log-likelihood) evaluated at p ^ . Substituting p ^ into the resulting expression yields the following:
σ ^ 2 = S e p ^ + 1 S p 1 p ^ + ( 1 S e S p ) 2 ( 1 S e ) p ^ + S p ( 1 p ^ ) 1
Using this variance, an asymptotic 100 ( 1 α ) % confidence interval can be constructed as follows:
p ^ ± z 1 α / 2 σ ^ n
However, in daily practice, screening programs are typically not based on a random sample from the population. Therefore, disease prevalence estimates require adjustments to reduce the bias associated with this practice. In this context, post-stratification is a suitable methodological option for such adjustments.
  • Post-Stratification Adjustment
    The prevalence for each stratum is calculated using the method proposed by [8]. A p ^ h value is defined for each of the H strata in the sample:
    p h ^ = B h B h 2 4 A h C h 2 A h
    where A h , B h , and C h are defined previously (28). The variance for each stratum is as follows:
    σ ^ h 2 = S e p h ^ + 1 S p 1 p h ^ + ( 1 S e S p ) 2 ( 1 S e ) p h ^ + S p ( 1 p h ^ ) 1
    The overall prevalence adjusted by post-stratification is as follows:
    p ˜ = h = 1 H w h · p ^ h
    The weighted variance is given as follows:
    V a r ( p ˜ ) = h = 1 H N h N 2 σ ^ h 2
    A confidence interval for p ˜ is as follows:
    C I ( p ˜ ) = p ˜ ± z 1 α / 2 V a r ( p ˜ )

2.4.2. Bayesian Approach

In the same work published by Thomas et al. [8], the Bayesian version for prevalence estimation is presented for cases where verification using a gold standard is performed only on individuals who test positive in a screening test. Starting from the previously defined likelihood and assuming that S e and S p are known and fixed for the screening test, the prevalence is estimated as the parameter of interest under a non-informative prior, defined as the following distribution: P Beta ( 1 , 1 ) . Thus, the posterior distribution for P is as follows:
P P D P D P · P ( P )
where D corresponds to true positives ( n 11 ) , false positives among those who tested positive in the test ( n 01 ) , total negatives according to the test ( n . 0 ) , and sensitivity and specificity of the test ( S e , S p ) .
P P D = [ ( 1 S e ) P + S p ( 1 P ) ] n . 0 ( S e · P ) n 11 ( ( 1 S p ) ( 1 P ) ) n 01 · Beta ( 1 , 1 )
Finally, to obtain the posterior distribution of P, MCMC (Markov Chain Monte Carlo) methods can be implemented, with the calculation of a credible interval for P derived from the estimated posterior distribution.
  • Post-Stratification Adjustment
    The previous Bayesian estimation can also include a post-stratification adjustment. In this case, as previously defined, for the H strata formed, P h is estimated by stratifying the posterior of P:
    P P h D = [ ( 1 S e ) P h + S p ( 1 P h ) ] n . 0 h ( S e · P h ) n 11 h ( ( 1 S p ) ( 1 P h ) ) n 01 h · Beta ( 1 , 1 )
    P ˜ = h = 1 H w h · P h
    As previously defined, when estimating the prevalence P h for each stratum h under a Bayesian approach, a single point estimate is not obtained; instead, a posterior distribution for P h is generated. In this context, post-stratification must be based on these posterior distributions, allowing for the incorporation of uncertainty into the stratum-adjusted estimates. We have obtained S samples from the posterior distribution of P h for H strata. These samples for each stratum h are denoted as follows:
    P h ( 1 ) , P h ( 2 ) , , P h ( S )
    For each iteration s of the posterior distribution, we weight the prevalence P h ( s ) by the proportion w h of the stratum h in the total population. In this way, the adjusted prevalence in iteration s is as follows:
    P ˜ ( s ) = h = 1 H w h · P h ( s )
    This process is repeated for each sample s = 1 , , S , resulting in a combined posterior distribution for the total prevalence adjusted by post-stratification:
    P ˜ ( 1 ) , P ˜ ( 2 ) , , P ˜ ( S )
    From the samples P ˜ ( s ) , we can calculate any credible interval.

2.5. Application of Prevalence Estimation for Mutation in Familial Chylomicronemia with Post-Stratification Adjustment

Data from a two-phase descriptive cross-sectional study, in which one of the authors (JMEA) participated [15], were used to evaluate the applicability of the proposed post-stratification adjustment. The study began with an initial nonprobabilistic sample of n = 74,145 adult subjects from Pereira, Colombia. The progressive selection of patients was carried out in two phases. In the first phase, positive cases were identified by applying the FCS clinical score (Family Chylomicronemia Syndrome), considered a fallible diagnostic test. With a cutoff point 8 , this score has a reported sensitivity of 0.88 and a specificity of 0.85 for detecting mutations associated with FCS [16]. As a result of this stage, n = 18 positive subjects were identified.
In the second phase, these 18 individuals underwent a gold standard test using molecular sequencing of the coding region of the genome (exome: 20,000 genes), with coverage greater than 98 % and a minimum depth of 20 X . This analysis included the APOA5, APOC2, GPIHBP1, LMF1, and LPL genes, covering all pathogenic variants in exonic regions, splicing sites, as well as small insertions and deletions. As a final result, n = 6 subjects were confirmed to have mutations associated with familial chylomicronemia syndrome.
The strata used for the post-stratification adjustment were based on sex and single-year age (ranging from 18 to 85 years or older), obtained from the official population projections published by The National Administrative Department of Statistics (DANE) of Colombia.
Frequentist estimates were performed using the R software, version 4.2.3 while Bayesian estimates were conducted using Stan through the RStan library. In the MCMC simulation, the convergence towards a stationary distribution was verified by running four chains with 1000 iterations each, ensuring that R ^ < 1.1 is used as the convergence criterion. This approach yields the complete posterior distribution, thereby facilitating the selection of appropriate 95 % credible intervals [17].

3. Results

Table 1 presents the results obtained for the estimation with and without post-stratification adjustment, corresponding to each scenario. Table 2 shows the lengths of each interval with and without post-stratification adjustment. According to each scenario, the following was found:

3.1. Scenario 1: Estimation with Full Verification Using a Gold Standard

In the first scenario, the Agresti–Coull method showed a point estimate of prevalence of 0.0081 % without post-stratification adjustment, with a confidence interval of [ 0.0032 % , 0.0181 % ] . After applying the post-stratification adjustment, the estimate increased to P = 0.4621%. This result highlights a significant increase in the prevalence estimate, likely due to the correction of biases in the sample.
The confidence interval length without post-stratification was 0.0149 , while it increased to 0.1211 after adjustment, reflecting a notable increase. This result aligns with the objective of post-stratification: to account for variability previously unconsidered due to using a nonprobabilistic sample.
Similarly, the Bayesian approach initially presented a prevalence of P = 0.0088%, with a 95 % credible interval ( C I r 95 % ) of [ 0.0028 % , 0.0156 % ] . With adjustment, the estimate increased to P = 0.2417%, with an adjusted interval of [ 0.2014 % , 0.2886 % ] . The interval length increased from 0.0128 without post-stratification to 0.0872 with adjustment, reflecting the same trend observed with the frequentist method.

3.2. Scenario 2: Estimation with a Diagnostic Test of Known Sensitivity and Specificity

In this scenario, the Lang method presented an initial prevalence of P = 0.000% without post-stratification adjustment, indicating severe underestimation due to the limited sensitivity of the method. After applying post-stratification, the estimator increased to 0.0836 % , highlighting the importance of adjustment to improve bias in designs with imperfect diagnostic tests. The interval length without post-stratification was 0.0162 , while it increased to 0.0836 with post-stratification.
The Bayesian method proposed by Flor showed an initial prevalence of P = 0.0016%, with a credible interval 95 % ( C I r 95 % ) of [ 0.0000 % , 0.0058 % ] . After adjustment, the estimator increased significantly, as shown in Table 1; this result emphasizes the flexibility of the Bayesian approach to address the limitations of fallible diagnostic tests and adjust prevalence estimates. In this case, the interval length increased from 0.0058 to 0.1006 . This significant increase is due to the Bayesian method explicitly incorporating the additional uncertainty associated with the nonprobabilistic sample.

3.3. Scenario 3: Estimation When the Status of the Disease Is Verified Only Among Positives

In this final scenario, the frequentist method of Thomas estimated an initial prevalence of P = 0.0094%, with a confidence interval 95 % ( C I 95 % ) of [ 0.0020 % , 0.0169 % ] . After applying the correction by the inverse of inclusion in the sample, the estimator decreased, resulting in a narrower interval. The interval length was 0.0149 without post-stratification and decreased slightly to 0.0121 with adjustment. This result is unique to this scenario, as post-stratification appears to reduce variability by addressing the specific limitations of partial verification. This change reflects the adjustment’s ability to improve precision in partial verification designs, a situation more representative of daily practice.
The Bayesian approach by Thomas et al. initially estimated a prevalence of P = 0.0104%, with a credible interval 95% ( C I r 95 % ) of [ 0.0045 % , 0.0201 % ] . The adjusted estimator using the post-stratification method increased, demonstrating the ability of Bayesian estimates to integrate additional information and adjust prevalence estimates in partial verification scenarios. The length of the credible interval increased from 0.0156 to 0.1024 with adjustment. This considerable increase is consistent with the Bayesian approach’s ability to model the additional uncertainty introduced by post-stratification.

4. Discussion

Accurate prevalence estimation in epidemiological studies is essential for public health planning, particularly for rare diseases or conditions with low prevalence. The methods evaluated in this study address key challenges such as imperfect diagnostic tests, partial verification of disease status, and reliance on nonprobabilistic samples. These limitations can lead to biased estimates and inadequate public health decisions if not corrected. Thus, this study highlights the need to implement robust methodological adjustments to improve the accuracy of prevalence estimates. A comparison of frequentist and Bayesian approaches revealed significant differences in their ability to address inherent limitations in study designs and diagnostic tests. These observations align with McNamee et al. [18], who noted that despite their complexity, two-phase designs can optimize precision and reduce costs in prevalence studies.

4.1. First Scenario

In the first scenario, the results show that both methodological approaches indirectly mitigate the biases in the unadjusted estimates, as evidenced by the observed differences in the point estimates and the length of the intervals. However, post-stratification adjustment had a significant impact by incorporating additional information on the distribution of population variables. This result is consistent with Holt et al. [12], who reported the effectiveness of post-stratification in addressing the lack of representativeness in nonprobabilistic samples. Furthermore, the Bayesian approach adjusted with post-stratification generated wider confidence intervals than the Agresti–Coull method, as observed in the study by Flor et al. [14]. This behavior reflects the Bayesian approach’s ability to explicitly model uncertainty, making it an advantageous alternative in contexts of high variability.

4.2. Second Scenario

The second scenario, where a diagnostic test with known sensitivity and specificity was performed, highlights the importance of considering the quality of the initial tests in the study design. As noted by Shrout and Newman [19], the limited sensitivity of screening methods can significantly underestimate prevalence, consistent with the unadjusted results obtained in this investigation. However, post-stratification adjustments and Bayesian estimates allowed for correction of these limitations, aligning with previous studies that highlight the flexibility of the Bayesian approach in incorporating prior information and improving precision. Regarding interval lengths, unadjusted intervals were observed to be shorter, which could create false confidence in underrepresented estimates. In contrast, post-stratification adjustment, while increasing interval lengths, corrected the inherent bias in the data and improved population representativeness.
A particular situation, well documented in the literature, became evident in the interval proposed by Lang and was also observed in this study. When using the adjusted prevalence estimator proposed by Rogan and Gladen, the adjusted estimate may fall outside the allowed range [ 0 , 1 ] , especially for negative values when the true prevalence is very low. This situation, previously studied by authors such as Viana [20] and Hasselt [21], was reflected in the present study, where the reported prevalence (Table 1) was zero, consistently underestimating the true prevalence in low-prevalence scenarios.
In contrast, the Bayesian approach proved beneficial, as evidenced by the results of this study. Using a prior distribution P  Beta ( α , β ) , this approach ensures that the estimated prevalence remains within the allowed range of [ 0 , 1 ] , even in situations of extremely low prevalence. This advantage was also reflected in the Bayesian approach adjusted by post-stratification used in Scenario 2, where a more robust and representative estimate was achieved, effectively addressing the limitations observed with frequentist methods.

4.3. Third Scenario

In Scenario 3, where the disease status is verified only among positives, the results confirm the advantages of combining maximum likelihood methods and post-stratification adjustments. The frequentist approach, while traditional, showed a reduction in the interval length after adjustment, highlighting its ability to address specific biases in such designs. On the other hand, the Bayesian approach provided wider intervals but with a better representation of uncertainty, consistent with the observations of Flor et al. [14] in partial verification scenarios.
Regarding the interval lengths, the unadjusted intervals were shorter, potentially leading to overconfidence in underrepresented estimates. In contrast, post-stratification adjustment, while increasing interval lengths, corrected inherent bias in the data and improved population representativeness.
In this scenario, a different pattern was observed in the lengths of the confidence intervals. The frequentist approach showed a reduction in interval length after adjustment. In contrast, Bayesian approaches presented wider intervals post-adjustment, reflecting their ability to incorporate the inherent uncertainty of partial verification designs using a gold standard test. These results underscore the importance of selecting the most appropriate method based on the study context, as suggested in previous research [18].
Finally, Bayer et al. [22] proposed an innovative approach for estimating prevalence derived from complex sampling surveys based on a melding method that uses gamma and beta distributions. While their method is particularly robust in addressing bias introduced by the imperfect sensitivity and specificity of diagnostic tests, its context is limited to probabilistic samples, ensuring population representativeness from the sampling design but not directly addressing bias in nonprobabilistic samples or post-stratification adjustments.
In contrast, the approach in this study focuses on contexts where samples are nonprobabilistic, presenting an additional challenge. Integrating Bayesian methods with post-stratification adjustments allows for correcting biases related to population representativeness, a dimension not explicitly addressed in Bayer’s method. This methodological difference makes this work complementary to that of Bayer et al. It provides a flexible solution for situations where practical limitations—such as the inability to use probabilistic sampling designs or incomplete verification of diagnostic tests—necessitate methods that balance representativeness and adaptability.
In epidemiological studies, frequentist methods offer computational efficiency and straightforward interpretation in large samples but may underestimate uncertainty in low-prevalence settings or designs with partial verification. In contrast, Bayesian methods integrate prior information and provide more comprehensive uncertainty quantification through credible intervals. The integration of both approaches, along with post-stratification adjustments, effectively corrects biases related to nonprobabilistic sampling and imperfect diagnostics. This combined strategy offers a robust framework for prevalence estimation, enhancing representativeness and supporting more informed public health decision-making.
Despite favorable findings, this study has some limitations. Reliance on known metrics, such as the sensitivity and specificity of diagnostic tests, can restrict the applicability of methods in contexts where these parameters are not well established. Additionally, while robust, the Bayesian approach critically depends on the quality of prior information (i.e., without reliable data, using noninformative priors may lead to less precise estimates).
It was not feasible to generate or acquire a synthetic dataset that accurately represents the complexities associated with nonprobabilistic sampling processes, which are a crucial part of our study design. This limitation underscores the need for future research to rigorously evaluate the coverage properties of intervals adjusted through post-stratification and quantify the bias linked to point estimators under different conditions. These evaluations are essential for determining the robustness and validity of the proposed methodologies, especially when compared to alternative approaches.
The results have important implications for public health practice and epidemiological research. Adjusting prevalence estimates through post-stratification significantly improves representativeness in contexts with nonprobabilistic samples and imperfect diagnostic tests. Bayesian methods have proven valuable tools in high-uncertainty scenarios, particularly when prior information is incorporated.

Author Contributions

Conceptualization, J.M.E.A., J.d.D.L.d.C. and M.Á.M.-A.; Methodology, J.M.E.A., J.d.D.L.d.C. and M.Á.M.-A.; Investigation, J.M.E.A., J.d.D.L.d.C. and M.Á.M.-A.; Writing—original draft, J.M.E.A., J.d.D.L.d.C. and M.Á.M.-A.; Writing—review & editing, J.M.E.A., J.d.D.L.d.C. and M.Á.M.-A. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The data presented in this study are available upon request from the corresponding author due to restrictions stemming from their origin in a separate study conducted by another research team [15], in which one of the authors participated. Access to the data requires an explicit request and agreement with the original authors. Requests can be sent to jestradaa@comfamiliar.com.

Acknowledgments

We thank Comfamiliar Risaralda.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
MCMCMarkov Chain Monte Carlo
FCSFamilial Chylomicronemia Syndrome

References

  1. Arya, R.; Antonisamy, B.; Kumar, S. Sample Size Estimation in Prevalence Studies. Indian J. Pediatr. 2012, 79, 1482–1488. [Google Scholar] [CrossRef]
  2. Lewis, F.I.; Torgerson, P.R. A tutorial in estimating the prevalence of disease in humans and animals in the absence of a gold standard diagnostic. Emerg. Themes Epidemiol. 2012, 9, 9. [Google Scholar] [CrossRef]
  3. Agresti, A.; Coull, B.A. Approximate is Better than “Exact” for Interval Estimation of Binomial Proportions. Am. Stat. 1998, 52, 119–126. [Google Scholar] [CrossRef]
  4. Rogan, W.J.; Gladen, B. Estimating prevalence from the results of a screening test. Am. J. Epidemiol. 1978, 107, 71–76. [Google Scholar] [CrossRef]
  5. Izbicki, R.; Diniz, M.A.; Bastos, L.S. Sensitivity and specificity in prevalence studies: The importance of considering uncertainty. Clinics 2020, 75, e2449. [Google Scholar] [CrossRef]
  6. Reiczigel, J.; Foldi, J.; Ózsvári, L. Exact confidence limits for prevalence of a disease with an imperfect diagnostic test. Epidemiol. Infect. 2010, 138, 1674–1678. [Google Scholar] [CrossRef]
  7. Lang, Z.; Reiczigel, J. Confidence limits for prevalence of disease adjusted for estimated sensitivity and specificity. Prev. Vet. Med. 2014, 113, 13–22. [Google Scholar] [CrossRef] [PubMed]
  8. Thomas, E.G.; Peskoe, S.B.; Spiegelman, D. Prevalence estimation when disease status is verified only among test positives: Applications in HIV screening programs. Stat. Med. 2018, 37, 1101–1114. [Google Scholar] [CrossRef]
  9. Elliott, M.R.; Valliant, R. Inference for nonprobability samples. Stat. Sci. 2017, 32, 249–264. [Google Scholar] [CrossRef]
  10. Lohr, S.L. Sampling: Design and Analysis, 2nd ed.; Cengage Learning: Boston, MA, USA, 2010. [Google Scholar]
  11. Smith, T.M.F. Post-Stratification. J. R. Stat. Society Ser. D (Stat.) 1991, 40, 315–323. [Google Scholar] [CrossRef]
  12. Holt, D.; Smith, T.M.F. Post Stratification. J. R. Stat. Society Ser. A (Gen.) 1979, 142, 33–46. [Google Scholar] [CrossRef]
  13. Tony Cai, T. One-sided confidence intervals in discrete distributions. J. Stat. Plan. Inference 2005, 131, 63–88. [Google Scholar] [CrossRef]
  14. Flor, M.; Weiß, M.; Selhorst, T.; Müller-Graf, C.; Greiner, M. Comparison of Bayesian and frequentist methods for prevalence estimation under misclassification. BMC Public Health 2020, 20, 1135. [Google Scholar] [CrossRef]
  15. Rodriguez, F.H.; Estrada, J.M.; Quintero, H.M.A.; Nogueira, J.P.; Porras-Hurtado, G.L. Analyses of familial chylomicronemia syndrome in Pereira, Colombia 2010–2020: A cross-sectional study. Lipids Health Dis. 2023, 22, 43. [Google Scholar] [CrossRef] [PubMed]
  16. Moulin, P.; Dufour, R.; Averna, M.; Arca, M.; Cefalù, A.B.; Noto, D.; D’Erasmo, L.; Costanzo, A.D.; Marçais, C.; Walther, L.A.A.S.; et al. Identification and diagnosis of patients with familial chylomicronaemia syndrome (FCS): Expert panel recommendations and proposal of an “FCS score”. Atherosclerosis 2018, 275, 265–272. [Google Scholar] [CrossRef] [PubMed]
  17. Gelman, A.; Rubin, D.B. Inference from Iterative Simulation Using Multiple Sequences. Stat. Sci. 1992, 7, 457–472. [Google Scholar] [CrossRef]
  18. McNamee, R. Two-Phase Sampling for Simultaneous Prevalence Estimation and Case Detection. Biometrics 2004, 60, 783–792. [Google Scholar] [CrossRef]
  19. Shrout, P.E.; Newman, S.C. Design of Two-Phase Prevalence Surveys of Rare Disorders. Biometrics 1989, 45, 549–555. [Google Scholar] [CrossRef] [PubMed]
  20. Viana, M.A.G.; Ramakrishnan, V.; Levy, P.S. Bayesian analysis of prevalence from the results of small screening samples. Commun. Stat.-Theory Methods 1993, 22, 575–585. [Google Scholar] [CrossRef]
  21. Van Hasselt, M.; Bollinger, C.R.; Bray, J.W. A Bayesian approach to account for misclassification in prevalence and trend estimation. J. Appl. Econ. 2022, 37, 351–367. [Google Scholar] [CrossRef]
  22. Bayer, D.M.; Fay, M.P.; Graubard, B.I. Confidence intervals for prevalence estimates from complex surveys with imperfect assays. Stat. Med. 2023, 42, 1822–1867. [Google Scholar] [CrossRef] [PubMed]
Table 1. Point and interval estimates of the population prevalence ( P ) obtained using the frequentist and Bayesian methods, both unadjusted and adjusted by post-stratification.
Table 1. Point and interval estimates of the population prevalence ( P ) obtained using the frequentist and Bayesian methods, both unadjusted and adjusted by post-stratification.
ScenarioMethodUnadjusted (%)Adjusted (%)
PLowerUpperPLowerUpper
Scenario 1Agresti–Coull0.00810.00320.01810.46210.40150.5226
Bayesian (beta-binomial)0.00880.00280.01560.24170.20140.2886
Scenario 2Lang0.00000.00000.01620.00000.00000.0836
Bayesian–Flor0.00160.00000.00580.27400.22800.3286
Scenario 3Frequentist–Thomas0.00940.00200.01690.00750.00140.0135
Bayesian–Thomas0.01040.00450.02010.27970.23280.3352
Table 2. Comparison of Interval Lengths by Estimation Method.
Table 2. Comparison of Interval Lengths by Estimation Method.
MethodWithout AdjustmentWith Adjustment
Agresti–Coull0.01490.1212
Bayesian (beta-binomial)0.01280.0872
Lang0.01620.0836
Flor–Bayesian0.00580.1005
Frequentist–Thomas0.01490.0121
Bayesian–Thomas0.01560.1025
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Estrada Alvarez, J.M.; Luna del Castillo, J.d.D.; Montero-Alonso, M.Á. Point and Interval Estimation of Population Prevalence Using a Fallible Test and a Non-Probabilistic Sample: Post-Stratification Correction. Mathematics 2025, 13, 805. https://doi.org/10.3390/math13050805

AMA Style

Estrada Alvarez JM, Luna del Castillo JdD, Montero-Alonso MÁ. Point and Interval Estimation of Population Prevalence Using a Fallible Test and a Non-Probabilistic Sample: Post-Stratification Correction. Mathematics. 2025; 13(5):805. https://doi.org/10.3390/math13050805

Chicago/Turabian Style

Estrada Alvarez, Jorge Mario, Juan de Dios Luna del Castillo, and Miguel Ángel Montero-Alonso. 2025. "Point and Interval Estimation of Population Prevalence Using a Fallible Test and a Non-Probabilistic Sample: Post-Stratification Correction" Mathematics 13, no. 5: 805. https://doi.org/10.3390/math13050805

APA Style

Estrada Alvarez, J. M., Luna del Castillo, J. d. D., & Montero-Alonso, M. Á. (2025). Point and Interval Estimation of Population Prevalence Using a Fallible Test and a Non-Probabilistic Sample: Post-Stratification Correction. Mathematics, 13(5), 805. https://doi.org/10.3390/math13050805

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop