A Software Tool for Estimating Uncertainty of Bayesian Posterior Probability for Disease

Chatzimichail, Theodora; Hatjimihail, Aristides T.

doi:10.3390/diagnostics14040402

Open AccessArticle

A Software Tool for Estimating Uncertainty of Bayesian Posterior Probability for Disease

by

Theodora Chatzimichail

and

Aristides T. Hatjimihail

^*

Hellenic Complex Systems Laboratory, Kostis Palamas 21, 66131 Drama, Greece

^*

Author to whom correspondence should be addressed.

Diagnostics 2024, 14(4), 402; https://doi.org/10.3390/diagnostics14040402

Submission received: 4 January 2024 / Revised: 4 February 2024 / Accepted: 5 February 2024 / Published: 12 February 2024 / Corrected: 26 June 2024

(This article belongs to the Section Clinical Laboratory Medicine)

Download

Browse Figures

Versions Notes

Abstract

The role of medical diagnosis is essential in patient care and healthcare. Established diagnostic practices typically rely on predetermined clinical criteria and numerical thresholds. In contrast, Bayesian inference provides an advanced framework that supports diagnosis via in-depth probabilistic analysis. This study’s aim is to introduce a software tool dedicated to the quantification of uncertainty in Bayesian diagnosis, a field that has seen minimal exploration to date. The presented tool, a freely available specialized software program, utilizes uncertainty propagation techniques to estimate the sampling, measurement, and combined uncertainty of the posterior probability for disease. It features two primary modules and fifteen submodules, all designed to facilitate the estimation and graphical representation of the standard uncertainty of the posterior probability estimates for diseased and non-diseased population samples, incorporating parameters such as the mean and standard deviation of the test measurand, the size of the samples, and the standard measurement uncertainty inherent in screening and diagnostic tests. Our study showcases the practical application of the program by examining the fasting plasma glucose data sourced from the National Health and Nutrition Examination Survey. Parametric distribution models are explored to assess the uncertainty of Bayesian posterior probability for diabetes mellitus, using the oral glucose tolerance test as the reference diagnostic method.

Keywords:

Bayesian diagnosis; Bayesian inference; prior probability; posterior probability; likelihood; parametric distribution; probability density function; uncertainty; combined uncertainty; measurement uncertainty; sampling uncertainty; confidence intervals; diabetes mellitus; fasting plasma glucose; oral glucose tolerance test

1. Introduction

1.1. Diagnosis in Medicine

Diagnosis in medicine fundamentally involves identifying the unique characteristics of a disease and distinguishing it from other conditions with similar presentations. The term “diagnosis”, originating from the Greek word “διάγνωσις” meaning “discernment” [1], emphasizes the critical role of distinguishing between healthy and diseased states in individuals. Diagnostic tests are essential in classifying individuals based on their health status. However, the reliance on a singular threshold for diagnosis across a range of data points introduces uncertainty, owing to the overlapping probability distributions of a measurand in both healthy and diseased populations [2]. While traditional diagnostic methods have been broadly effective, they may not fully encompass the diversity of disease manifestations, particularly across varied groups of people [3].

As underlined previously [2], Bayesian inference represents a paradigm shift in the field of medical diagnosis, offering a robust framework for integrating various sources of information to make probabilistic assessments. At its core, Bayesian inference relies on the Bayes’ theorem for updating beliefs in light of new evidence, integrating prior disease probabilities with the distribution of diagnostic measurands to calculate posterior probabilities for disease [4,5,6,7]. This approach enables a more comprehensive probabilistic assessment, evaluation of the information conveyed by diagnostic measurements, and a personalized patient approach [3,8].

Historically, the application of Bayesian methods in medicine has undergone significant evolution. Despite facing several challenges and being met with skepticism, these methods have gradually gained acceptance.

1.1.1. Bayes’ Theorem in Medical Diagnostics

Bayes’ theorem, a fundamental principle in probability theory [5], forms a connection between the direct probability P(H|E) of a hypothesis H given specific data E, and the inverse probability P(E|H) of data E given the hypothesis H [9]. In medical diagnostics, Bayes’ theorem is instrumental in transforming the prior probability for disease into a posterior probability following diagnostic tests [4].

1.1.2. Challenges in Applying Bayesian Inference

The application of Bayesian inference in diagnostics, however, faces significant challenges.

Computational Complexity

The computational complexity of Bayesian inference requires considerable resources.

Statistical Distributions in Diagnostics

A major challenge involves comprehensively understanding the statistical distributions of diagnostic test measurands in both diseased and nondiseased populations [10]. Calculation of posterior probabilities requires probability density functions (PDF) for the measurands in these populations. The normal distribution, often used for its simplicity, may not be suitable for measurands with non-normal characteristics like skewness or multimodality. Critical evaluation and potential adoption of alternative distributions are necessary for more accurate Bayesian diagnostic methods [10,11,12]. Bayesian Diagnosis, our previously published software, addresses this challenge [2].

Uncertainty of Bayesian Posterior Probabilities

Another significant challenge involves estimating the uncertainty associated with Bayesian posterior probabilities in disease diagnosis. This uncertainty can substantially affect their clinical utility. Despite its critical importance, the task of estimating, evaluating, and mitigating uncertainty in Bayesian diagnostic test interpretation has seldom been addressed in medical literature [13]. To confront this issue, we have developed Bayesian Diagnostic Uncertainty, a software tool for the estimation of uncertainty in Bayesian diagnosis, which is presented in detail in this study.

Both Bayesian Diagnostic Uncertainty and Bayesian Diagnosis, enhance the applicability of Bayesian methods in medical diagnostics.

1.1.3. Quantifying Uncertainty in Diagnostics

Uncertainty can be quantified and is often expressed probabilistically [14].

Combined Uncertainty

In the context of Bayesian posterior probability for disease, we consider two main components of combined uncertainty:

Measurement Uncertainty

This reflects the inherent variability in measurement processes and is defined as a parameter characterizing the dispersion of values that could reasonably be attributed to the measurand [15]. While crucial for laboratory quality assurance, the impact of measurement uncertainty on clinical decision-making and outcomes is often underexplored and rarely quantified [16,17]. Emerging research focuses on its effects on misclassification [18] and on diagnostic accuracy measures [19].

Sampling Uncertainty

The variability in sampling contributes to the uncertainty of posterior probability for disease [20], and it is essential in evaluating diagnostic methods.

2. Methods

2.1. Computational Methods

2.1.1. Bayes’ Theorem

Bayes’ theorem calculates the posterior probability

P (D | T)

of a disease

D

given a test result

T = x

and a parameter vector θ, as follows:

P (D | T) = \frac{f_{D} (x | θ) r}{f_{D} (x | θ) r + f_{\bar{D}} (x | θ) (1 - r)}

Here

r

denotes the prior probability for disease,

f_{D} (x | θ)

the PDF in disease presence, while

f_{\bar{D}} (x; θ)

denotes the PDF in its absence (refer to Appendix A.1 for details).

2.1.2. Parametric Distributions

Parametric statistics operate under the assumption that data from a population can be accurately represented by a probability distribution with a fixed set of parameters [21]. The program supports the following parametric distributions:

Normal distribution
Lognormal distribution
Gamma distribution.

2.1.3. Uncertainty Quantification

Uncertainty of input parameters can manifest as standard uncertainty

u (x)

, the standard deviation of

x

, and expanded uncertainty

U (x)

, a range around

x

encompassing

x

with a probability

p

[16].

Measurement Uncertainty

Measurement uncertainty is computed following guidelines in the “Guide to the expression of uncertainty in measurement” (GUM) [15] and “Expression of measurement uncertainty in laboratory medicine” [16]. Bias is considered a component of this uncertainty [22].

The relationship between the standard measurement uncertainty

u (x)

to the value of the measurand

x

, is generally expressed as:

u_{m} (x) = \sqrt{b_{0}^{2} + b_{1}^{2} x^{2}}

where

b_{0}

is a constant and

b_{1}

is a proportionality constant.

If needed, it is approximated linearly as:

u_{m} (x) ≅ b_{0} + b_{1} x

The general approach to estimating the coefficients of the above equations is delineated in Appendix A5 of “Quantifying Uncertainty in Analytical Measurement” [23].

Sampling Uncertainties of Means and Standard Deviations

If

m_{P}

and

s_{P}

are the mean and standard deviation of a measurand in a population sample of size

n_{P}

, then the standard sampling uncertainties of

m_{P}

and

s_{P}

are estimated as:

u_{s} (m_{P}) ≅ \frac{s_{P}}{\sqrt{n_{P}}}

u_{s} (s_{P}) ≅ \frac{s_{p}}{\sqrt{2 (n_{P} - 1)}}

using the central limit theorem and the chi-square distribution [24,25,26].

Sampling Uncertainty of Prior Probability for Disease

If

n_{D}

and

n_{\bar{D}}

are the respective numbers of diseased and nondiseased in a population sample, then the standard uncertainty of the prior probability for disease

r = \frac{n_{D}}{n_{\bar{D}} + n_{D}}

is estimated as:

u_{s} (r) ≅ \sqrt{\frac{(2 + n_{\bar{D}}) (2 + n_{D})}{{(4 + n_{\bar{D}} + n_{D})}^{3}}}

using the Agresti–Coull adjustment of the Waldo interval [27].

Combined Uncertainty of Posterior Probability for Disease

The standard combined uncertainty

u_{c} (x)

of posterior probability for disease is computed via uncertainty propagation rules, employing a first-order Taylor series approximation [28] (refer to Supplementary File S2).

When there are l components of uncertainty, with standard uncertainties

u_{i} (x)

, then:

u_{c} (x) = \sqrt{\sum_{i = 1}^{l} u_{i} {(x)}^{2}}

2.1.4. Expanded Uncertainty of Posterior Probability for Disease

When there are l components of uncertainty, with standard uncertainties

u_{i} (x)

and

v_{i}

degrees of freedom, then the effective degrees of freedom

v_{e f f}

of the combined uncertainty

u_{c} (x)

are obtained from the Welch–Satterthwaite formula [29,30]:

v_{e f f} (x) ≅ \frac{u_{c} {(x)}^{4}}{\sum_{i = 1}^{l} \frac{u_{i} {(x)}^{4}}{v_{i}}}

If

v_{m i n}

the minimum of

v_{1}, v_{2}, \dots, v_{l}

, then:

v_{m i n} \leq v_{e f f} (x) \leq \sum_{i = 1}^{l} v_{i}

If

F_{v} (z)

is the Student’s t-distribution cumulative distribution function with

v

degrees of freedom and

u_{c} (x)

is the standard combined uncertainty of posterior probability for disease, its expanded combined uncertainty

U_{c} (x)

at a confidence level

p

is:

U_{c} (x) ≅ (F_{v}^{- 1} (\frac{1 - p}{2}) u_{c} (x), F_{v}^{- 1} (\frac{1 + p}{2}) u_{c} (x))

The confidence interval of

x

at the same confidence level

p

is approximated as:

C I_{p} (x) ≅ (x + F_{v}^{- 1} (\frac{1 - p}{2}) u_{c} (x), x + F_{v}^{- 1} (\frac{1 + p}{2}) u_{c} (x))

The confidence intervals of the posterior probability for disease were truncated to the [0, 1] range.

2.2. The Software

2.2.1. Program Overview

To facilitate the estimation of the uncertainty of Bayesian posterior probability for disease, the software program Bayesian Diagnostic Uncertainty was developed in Wolfram Language, using Wolfram Mathematica^® Ver. 13.3 (Wolfram Research, Inc., Champaign, IL, USA). Bayesian Diagnostic Uncertainty was designed to estimate and plot the standard sampling, measurement, and combined uncertainty and the confidence intervals of the Bayesian posterior probability for disease of a screening or diagnostic test (See Figure 1).

This interactive program is freely available as a Wolfram Language notebook (.nb) (Supplementary File S2: BayesianUncertainty.nb). It can be run on Wolfram Player^® (Wolfram Research, Inc., Champaign, IL, USA (2023)) or Wolfram Mathematica^® (see Appendix A.2). Due to the complexity of the calculations required, it is computationally intensive.

2.2.2. Input Parameters

The program allows for the definition of three parametric distributions of a measurand for the diseased and nondiseased populations.

Distribution Selection: The user selects the type of distribution of each population from a predefined list:

Normal distribution
Lognormal distribution
Gamma distribution.

Definition of Statistical Parameters: For each population, the user defines its size n, the mean μ, and the standard deviation σ of the measurand.

Measurement Uncertainty

The user selects a linear or nonlinear equation of the measurement uncertainty versus the value x of the measurand and defines the constant contribution

b_{0}

to the standard measurement uncertainty, the proportionality constant

b_{1}

, and the number of quality control samples that have been analyzed for its estimation.

2.2.3. Output Specifications

Visualizations

The program generates a series of plots designed to elucidate various uncertainty measures and statistics:

Uncertainty of posterior probability for disease: Plots are generated to show the standard sampling, measurement, and combined uncertainty of the posterior probability for disease.
Relative uncertainty of posterior probability for disease: Plots are generated to show the relative standard sampling, measurement, and combined uncertainty of the posterior probability for disease.
Confidence intervals of posterior probability for disease: Plots are generated to show the confidence intervals of the posterior probability for disease, for a user defined confidence level.

Tables

For each combination of parametric distributions of the diseased and nondiseased populations, the program tabulates for a user defined measurand value:

The standard sampling, measurement, and combined uncertainty of the posterior probability for disease.
The relative standard sampling, measurement, and combined uncertainty of the posterior probability for disease.
The confidence intervals of the posterior probability for disease for a user defined confidence level.

By providing this comprehensive set of input parameters and output specifications (see Figure 2), the program offers a robust platform for exploring the uncertainty in Bayesian diagnosis of disease using parametric distributions of medical diagnostic measurands.

3. Illustrative Case Study

To demonstrate the application of the program, fasting plasma glucose (FPG) was used as the diagnostic test measurand for the Bayesian diagnosis of diabetes mellitus (From now on, when mentioning “diabetes”, we are referring to diabetes mellitus). The oral glucose tolerance test (OGTT) was used as the reference diagnostic method. A diagnosis of diabetes was confirmed if the plasma glucose value was equal to or greater than 200 mg/dL, measured two hours after oral administration of 75 g of glucose [31], during an OGTT (2-h PG). The study population was confined to individuals aged between 70 and 80 years, a decision guided by the well-documented strong correlation between age and the prevalence of diabetes [32].

National Health and Nutrition Examination Survey (NHANES) data from participants was retrieved for the period from 2005 to 2016 (n = 60,936) [33]. NHANES is a series of studies designed to evaluate the health and nutritional status of adults and children in the United States.

The inclusion criteria for participants were:

Valid FPG and OGTT results (n = 13,836).
A negative response to NHANES question DIQ010 regarding a diabetes diagnosis [34] (n = 13,465).
Age 70–80 years (n = 976).

Participants with a 2-h PG measurement ≥200 mg/dL were considered diabetic (n = 154).

The prior probability for diabetes was estimated as:

v ≅ \frac{154}{976} = 0.158

The statistics of the FPG datasets are presented in Table 1 (Hereafter, FPG and its uncertainty are expressed in mg/dL).

Lognormal distributions were estimated to model FPG measurements in diabetic and nondiabetic participants, using the maximum likelihood estimation method [35]. The respective distributions, parametrized for their means

μ_{D}

and

μ_{\bar{D}}

, and standard deviations

σ_{D}

and

σ_{\bar{D}}

, were the following:

L_{D} = L o g n o r m a l (μ_{D}, σ_{D}) = L o g n o r m a l (120.671, 17.720) L_{\bar{D}} = L o g n o r m a l (μ_{\bar{D}}, σ_{\bar{D}}) = L o g n o r m a l (102.642, 10.653)

The NHANES quality control data of the FPG measurements was retrieved for the same period (2005–2016). In total, 1350 QC samples had been analyzed. The weighted nonlinear least squares analysis [36] yielded the following function relating the standard measurement uncertainty

u_{m} (x)

to the measurement value,

x

:

u_{m} (x) = \sqrt{b_{0}^{2} + b_{1}^{2} x^{2}} = \sqrt{0.7501 + 0.00012 x^{2}}

where

b_{0} = 0.866

and

b_{1} = 0.0109

.

The means of the standard measurement uncertainty of FPG of the included diabetic and nondiabetic participants were estimated as:

{\hat{u}}_{D} ≅ 1.586 mg / dL {\hat{u}}_{\bar{D}} ≅ 1.028 mg / dL

Consequently, the distributions of the measurands, assuming negligible uncertainty, were estimated as:

l_{D} ≅ L o g n o r m a l (μ_{D}, \sqrt{σ_{D}^{2} - {\hat{u}}_{D}^{2}}) ≅ L o g n o r m a l (120.671, 17.720)

l_{\bar{D}} ≅ L o g n o r m a l (μ_{\bar{D}}, \sqrt{σ_{\bar{D}}^{2} - {\hat{u}}_{\bar{D}}^{2}}) ≅ L o g n o r m a l (102.642, 10.653)

Table 2 displays the descriptive statistics of the estimated lognormal distributions of the diabetic and nondiabetic populations, including the respective p-values of the Cramér–von Mises goodness-of-fit test [37].

Figure 3 and Figure 4 show the estimated PDFs of FPG in the diabetic and nondiabetic populations, assuming a lognormal distribution and negligible measurement uncertainty, and the histograms of the respective NHANES datasets.

Likelihoods and posterior probabilities were estimated accordingly.

4. Results

Using the settings of Table 3, the program generated the plots of Figure 5, Figure 6, Figure 7, Figure 8, Figure 9, Figure 10, Figure 11, Figure 12, Figure 13, Figure 14, Figure 15, Figure 16 and Figure 17 and the tables of Figure 17, Figure 18 and Figure 19.

Figure 5 shows the plots of the standard sampling, measurement, and combined uncertainty of posterior probability for diabetes versus FPG, while Figure 6 shows the respective plots of the relative standard uncertainty.

Figure 7 shows the plots of the confidence intervals of posterior probability for diabetes versus FPG for a confidence level

p = 0.95

.

Assessing the combined standard uncertainty of the posterior probability for diabetes, we note the following:

It is substantially affected by measurement uncertainty of FPG.
Two local maxima are observed, corresponding to the regions near the steepest segments of the posterior probability curve, which exhibits an approximately double sigmoidal configuration. These maxima are quantitatively defined as following:
2.1.
At an FPG value of 58.7 mg/dL, the posterior probability for disease is equal to 0.585, while the combined standard uncertainty is equal to 0.893.
2.2.
At an FPG value of 133.2 mg/dL, the posterior probability for disease is equal to 0.725, while the combined standard uncertainty is equal to 0.182.

This pattern of local maxima is indicative of heightened uncertainty in the regions where the posterior probability curve demonstrates its most pronounced inflections. The confidence intervals are affected accordingly.

Assessing the relative combined standard uncertainty of the posterior probability for diabetes, we note that two local maxima are observed as well, quantitatively defined as following:

At an FPG value of 64.1 mg/dL, the posterior probability for disease is equal to 0.257, while the relative combined standard uncertainty is equal to 2.044.
At an FPG value of 128.1 mg/dL, the posterior probability for disease is equal to 0.561, while the relative combined standard uncertainty is equal to 0.278.

Figure 8 shows the plots of the standard sampling, measurement, and combined uncertainty of posterior probability for diabetes versus the constant contribution

b_{0}

of measurement uncertainty of FPG, while Figure 9 shows the respective plots of the relative standard uncertainty.

Figure 10 shows the plots of the confidence intervals of posterior probability for diabetes versus the constant contribution

b_{0}

of measurement uncertainty of FPG, for a confidence level

p = 0.95

.

Figure 11 shows the plots of the standard sampling, measurement, and combined uncertainty of posterior probability for diabetes versus the proportionality constant

b_{1}

of measurement uncertainty of FPG, while Figure 12 shows the respective plots of the relative standard uncertainty.

Figure 13 shows the plots of the confidence intervals of posterior probability for diabetes versus the proportionality constant

b_{1}

of measurement uncertainty of FPG for a confidence level

p = 0.95

.

Figure 14 shows the plots of the standard sampling, measurement, and combined uncertainty of posterior probability for diabetes versus the total population size n, while Figure 15 shows the respective plots of the relative standard uncertainty.

Figure 16 shows the plots of the confidence intervals of posterior probability for diabetes versus the total population size n, for a confidence level

p = 0.95

.

As anticipated, the impact of sampling uncertainty decreases markedly as the size of the population sample increases.

Figure 17 shows a table of the standard sampling, measurement, and combined standard uncertainty of posterior probability for diabetes for FPG value equal to 126 mg/dL, while Figure 18 shows a table of the respective values of relative standard uncertainty.

Figure 18 shows the confidence intervals of posterior probability for diabetes for FPG value equal to 126 mg/dL and confidence level

p = 0.95

.

The tables distinctly demonstrate the considerable magnitude of uncertainty and relative uncertainty associated with the posterior probability for diabetes at an FPG level of 126 mg/dL, the established threshold for the diagnosis of diabetes. Furthermore, the posterior probabilities delineated in the tables suggest a limited concordance between the classification criteria of diabetes derived from the OGTT and FPG tests [31], as found previously in existing literature [38].

5. Discussion

5.1. Reevaluation of Traditional Diagnostic Methods

Traditional diagnostic methods rely on the use of predetermined thresholds; however, this often fails to consider the complexities of disease pathology. While this has been historically effective, it may lack the ability to offer a holistic approach in today’s patient-centered medicine, where personalized care is paramount [39]. The evolving nature of diseases and shifts in patient demographics increase the complexity of the diagnostic process, pushing the boundaries of conventional methodologies. In this challenging context, Bayesian inference emerges as an alternative approach, offering probabilistic evaluations that can adapt to the individual patient profiles [2,3].

Nevertheless, estimating the uncertainty of posterior probabilities within Bayesian inference remains a pivotal challenge [13]. This issue is critically important in the context of diagnostic and screening tests for life-threatening conditions or those associated with considerable morbidity risk. It underscores the need for well-informed clinical judgments and comprehensive uncertainty evaluation in medical decision-making. Key examples include:

Cardiac troponin for diagnosing myocardial injury and infarction [40];
Natriuretic peptides for the diagnosis of heart failure [41];
D-dimer for diagnosing thromboembolic events [42];
FPG, OGTT, and glycated hemoglobin (HbA1c) for diagnosing diabetes [31];
OGTT for the diagnosis of gestational diabetes [43];
Thyroid stimulating hormone (TSH), free serum triiodothyronine (T₃), and free serum thyroxine (T₄) for diagnosing thyroid dysfunction [44];
Protein-to-creatinine ratio for the diagnosis of preeclampsia [45];
Creatinine or cystatin C derived glomerular filtration rate (GFR), and albuminuria for diagnosing chronic kidney disease [46].

The ability to quantify this uncertainty is not a purely academic concern but also a practical necessity in improving diagnosis and patient outcomes.

To address this, our software explores the sampling, measurement, and combined uncertainty of Bayesian posterior probabilities. This exploration is not only vital for enhancing clinical decision-making but also plays a significant role in the fields of quality and risk management in laboratory medicine [47]. Additionally, it may contribute to the design and implementation of test accuracy studies [48,49]. As mentioned in Section 1, despite the extensive body of research on Bayesian diagnosis and uncertainty as separate entities, the intersection of these two areas remains relatively unexplored [50,51].

The illustrative case study, focusing on individuals aged 70 to 80 years, was designed to mitigate age-related variations in disease prevalence. This focus exemplifies the considerations required in modern diagnostics, where factors such as age, genetics, and lifestyle choices should be accounted for in the diagnostic equation.

Our software manages through its analysis of sampling, measurement, and combined uncertainty (as illustrated in Figure 5, Figure 8, Figure 11, Figure 14 and Figure 17), relative uncertainty (Figure 6, Figure 9, Figure 12, Figure 15 and Figure 18) and the corresponding confidence limits (Figure 7, Figure 10, Figure 13, Figure 16 and Figure 19), to display its versatility in addressing these diagnostic challenges. Although the software’s calculations are highly sophisticated, its user-friendly interface renders it an effective tool for medical researchers and professionals.

The case study from Section 4 highlights the substantial impact of combined uncertainty on the diagnostic process. This finding emphasizes the predominant role of measurement uncertainty, and thus stresses the demanding path toward enhancing diagnostic accuracy. By improving the analytical methods of screening and diagnostic tests, the medical community could achieve more accurate diagnosis, leading to more effective and tailored patient care.

Looking ahead, future research should focus on improving the estimations of the uncertainty of posterior probabilities under a diverse array of clinically relevant parameter settings. To transition from research into practical application, it is necessary to focus on clinical decision analysis, studies on cost-effectiveness, and research on quality of care, which includes conducting implementation studies [48]. Such efforts are necessary in addressing the complex issues in diagnostic medicine and finding new and effective approaches to tackle ongoing challenges.

5.2. Limitations of the Program

This program’s limitations, which provide paths for further research, include:

Underlying assumptions:
1.1.
The existence of “gold standards” in diagnostics. If a “gold standard” does not exist, there are alternative approaches for classification [52,53,54].
1.2.
The hypothesis of parametric distribution of measurements or their transformations. However, existing literature underlines the robustness of nonparametric techniques in capturing complex data distributions [55].
1.3.
The generally accepted bimodality of the measurands, although unimodal distributions could be considered [56,57].

If these assumptions are not valid, the program may underestimate the standard uncertainty of the posterior probability for disease.

2.: The use of first-order Taylor series approximations in uncertainty propagation calculations, where higher-order approximations may provide more accurate estimations [15].
3.: The approximation of the uncertainty of the prior probability for disease using the Agresti–Coull-adjusted Waldo interval, despite more accurate methods being available [58].
4.: The approximations of the sampling uncertainties for both the sample means and standard deviations, which can be improved for smaller samples or pronounced skewness observed in lognormal and gamma distributions [59,60].
5.: The use of confidence intervals derived from the t-distribution despite the high relative uncertainty [61]. Though not typical in a Bayesian context, this can be employed instead of credible intervals as a practical tool under certain circumstances [5,62].

While addressing these limitations would increase considerably computational complexity, they represent key areas for future enhancement [63,64].

5.3. Limitations of the Case Study

The case study’s main limitations include reliance on the OGTT as the reference method for diagnosing diabetes mellitus, despite several factors influencing glucose tolerance [65,66,67,68,69,70,71,72]. Additionally, the lognormal distributions used only approximate the distributions of the FPG measurements from NHANES datasets, highlighting the need for more flexible statistical models.

5.4. Challenges in Bayesian Analysis for Disease Diagnosis

While Bayesian analysis may be beneficial in medical diagnostics, it presents certain challenges. For instance, the substantial uncertainty of the posterior probability for disease revealed in our study could lead to clinical indecision. Additionally, there is a notable lack of comprehensive statistical research on the distribution of measurands in both diseased and nondiseased populations, hindering further advancements in Bayesian analysis in this field.

5.5. Implications of Incomplete Data

Over-reliance on prior probabilities: Limited empirical data may cause an overdependence on prior probabilities, leading to distorted posterior probabilities and potentially flawed clinical decisions [73].
Increased uncertainty: Insufficient data amplifies the uncertainty of computed posterior probabilities, which in turn could exacerbate clinical indecision [74].
Bias risks: Unrepresentative datasets could introduce systemic bias, increasing the uncertainty in Bayesian computations [5].

5.6. Analysis of the Double Sigmoidal Curve in Posterior Probability Estimation and Its Impact on Uncertainty

The posterior probability for disease curve, characterized by a double sigmoidal shape featuring two symmetrical sigmoid functions, presents compelling analytical perspectives in the field of medical diagnostic statistics. This configuration implies that the risk associated with the disease may escalate at both the lower and upper extremes of a given measurand, while a zone of relative safety exists in the intermediate range. Notably, the uncertainty associated with the posterior probability for disease becomes markedly pronounced along the steep segments of the double sigmoidal curve. This heightened uncertainty is attributable to the fact that minor variations in the measurand value can lead to significant alterations in the computed posterior probability.

5.7. Software Comparison

Our software easily generates a wide array of parametric plots and comprehensive tables for the analysis of uncertainty of posterior probability. To the best of our knowledge, no exixting software, including all major general or medical or Bayesian statistical and uncertainty quantification software packages (JASP^® ver. 0.20.0, Mathematica^® ver. 14.0, Matlab^® ver. R2023b, MedCalc^® ver. 20.2.1, metRology ver. 2023, NCSS^® ver. 24.0.0, NIST Uncertainty Machine ver. 2.0.0, OpenBUGS ver. 3.3.0, R ver. 4.3.1, SAS^® ver. 9.5, SPSS^® ver. 29, Stan ver. 2.33.0, Stata^® ver. 19, and UQLab ver. 2.0.0) provides this range of plots and tables without requiring advanced programming.

6. Conclusions

The program we have developed represents a novel approach to estimating and analyzing the uncertainty of Bayesian posterior probabilities in disease diagnosis. This tool stands out not only for its innovative capabilities in the field of medical diagnostics but also as a significant educational and research asset. Considering the difficulties and complexities we have outlined, this software offers essential assistance in applying Bayesian methods and dealing with diagnostic uncertainties, thereby enhancing well-informed decision-making.

Looking forward, it seems imperative that future research should focus on improving this method with advanced statistical concepts and empirically validating it with comprehensive test accuracy studies. Such studies are essential to verify the efficacy and reliability of the program in real clinical settings. Additionally, it is necessary to expand its application across a diverse range of diagnostic modalities. Doing so could enable the program to address a broader spectrum of diagnostic challenges, further enhancing its utility and impact on the medical field.

Our research, undertaken alongside our prior work on the uncertainty of diagnostic accuracy measures [19], creates a foundation for understanding uncertainties in diagnostic tests. With this consideration, we would recommend employing our approach in diagnostic accuracy research, aiming at formulating clear guidelines and establishing best practices to effectively integrate such information into clinical practice [48,75,76,77].

Regarding regulatory issues, it is necessary to ensure that the application of the software adheres to the standards set forth by local regulatory authorities.

The potential of this program seems to be extending beyond its practical implications in medical diagnostics. As an educational resource, it could offer significant opportunities for training in medical statistics, particularly in the understanding of the uncertainty of Bayesian posterior probabilities. Its user-friendly interface, coupled with the depth of its analytical capabilities, makes it an effective learning tool for both aspiring and experienced professionals in the medical community.

In conclusion, the development and refinement of the Bayesian Diagnostic Uncertainty program are pivotal steps towards navigating the complexities of modern medical diagnostics. Its role in enhancing Bayesian diagnostic methods, coupled with its educational benefits, highlights its capability as a supporting tool in the ongoing evolution of medical practice and research.

Supplementary Materials

The following supporting information can be downloaded at: Supplementary File S1: BayesianDiagnosticUncertainty.nb: The program as a Wolfram Mathematica Notebook. Available at https://www.hcsl.com/Tools/BayesianDiagnosticUncertainty/BayesianDiagnosticUncertainty.nb (accessed on 4 February 2024); Supplementary File S2: BayesianUncertaintyCalculations.nb: The calculations for the estimation Bayesian posterior probability for disease and its standard uncertainty in a Wolfram Mathematica Notebook. Available at https://www.hcsl.com/Supplements/SBDU.zip (accessed on 4 February 2024); Supplementary File S3: BayesianDiagnosticUncertaintyInterface.pdf: A brief description of the interface of the program. Available at: https://www.hcsl.com/Documents/BayesianDiagnosticUncertaintyInterface.pdf (accessed on 4 February 2024).

Author Contributions

Conceptualization: T.C.; methodology: T.C. and A.T.H.; software: T.C. and A.T.H.; validation: T.C.; formal analysis: T.C. and A.T.H.; investigation: T.C.; resources: A.T.H.; data curation: T.C.; writing—original draft preparation: T.C.; writing—review and editing A.T.H.; visualization: T.C.; supervision: A.T.H.; project administration: T.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Data collection was carried out following the rules of the Declaration of Helsinki. The Ethics Review Board of the National Center for Health Statistics approved data collection and posting the data online for public use. National Center for Health Statistics NHANES—NCHS Research Ethics Review Board Approval (Protocols #2005-06 and #2011-17), available online at: https://www.cdc.gov/nchs/nhanes/irba98.htm (accessed on 20 December 2023).

Informed Consent Statement

Written consent was obtained from each subject participating in the survey.

Data Availability Statement

The data presented in this study are available at https://wwwn.cdc.gov/nchs/nhanes/default.aspx (accessed on 20 December 2023).

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

Appendix A.1. Formalisms and Notation

Acronyms

PDF: probability density function

FPG: fasting plasma glucose

OGTT: oral glucose tolerance test

NHANES: National Health and Nutrition Examination Survey

Notation

Parameters

n_{D}

: size of diseased population

μ_{D}

: mean of diseased population

σ_{D}

: standard deviation of diseased population

n_{\bar{D}}

: size of nondiseased population

μ_{\bar{D}}

: mean of nondiseased population

σ_{\bar{D}}

: standard deviation of nondiseased population

r

: prior probability for disease (prevalence rate)

u_{s} (x)

: standard sampling uncertainty of

x

u_{m} (x)

: standard measurement uncertainty of

x

u_{c} (x)

: standard combined uncertainty of

x

n_{U}

: number of quality control measurements

b_{0}

: constant contribution to measurement uncertainty

b_{1}

: measurement uncertainty proportionality constant

\hat{u}

: mean standard measurement uncertainty

p

: confidence level

v_{e f f}

: effective degrees of freedom

Functions

P (A)

: probability of the event A

P (A | B)

: conditional probability of the event A given the event B

L (θ | z)

: likelihood function

F^{- 1} (.) :

the inverse function F

(.)

Bayes’ Theorem

For the purposes of our study, Bayes’ theorem is formulated as:

P (D | T) = \frac{P (T | D) P (D)}{P (T)} = \frac{P (T | D) P (D)}{P (T | D) P (D) + P (T | \bar{D}) (1 - P (D))}

where

P (D | T)

denotes the posterior probability of having a disease

D

given a test result

T

.

P (T | D)

denotes the likelihood of obtaining the result

T

given the presence of the disease

D

.

P (T | \bar{D})

denotes the likelihood of obtaining the result

T

given the absence of the disease

D

.

P (D)

is the prior probability or prevalence

r

of the disease

D

.

P (T)

is the overall probability of the result

T

.

According to Bayes’ theorem, the posterior probability for a disease

D

given a test result

T = x

and a parameter vector θ is calculated as:

P (D | T) = \frac{L_{D} (θ | x) r}{L_{D} (x | θ) r + L_{\bar{D}} (x | θ) (1 - r)} = \frac{f_{D} (x | θ) r}{f_{D} (x | θ) r + f_{\bar{D}} (x | θ) (1 - r)}

where

r

denotes the prior probability for disease,

L_{D} (θ | x)

and

f_{D} (x | θ)

denote the likelihood function and the PDF of the test measurand in the presence of the disease, respectively, while

L_{\bar{D}} (x | θ)

and

f_{\bar{D}} (x | θ)

are the respective functions in the absence of the disease.

Appendix A.1.1. Parametric Distributions

It is assumed that the test measurands of the diseased or nondiseased populations follow the normal, lognormal or gamma distribution. The domains of random variables for the respective distributions are defined as follows:

The domain of a random variable X following a normal distribution is the set of all real numbers, denoting $- \infty < X < \infty$ .
The domain of a random variable X following a lognormal distribution is the set of all positive real numbers, denoting $0 < X < \infty$ .
The domain of a random variable X following a gamma distribution is the set of all positive real numbers, denoting $0 < X < \infty$ .

Appendix A.1.2. Calculations of the Posterior Probability for Disease and Its Uncertainty

These calculations are detailed in Supplementary File S2 (Refer to Supplementary Files).

Appendix A.2. Software Availability and Requirements

Program name: Bayesian Diagnostic Uncertainty

Version: 1.0.0

Project home page: https://www.hcsl.com/Tools/BayesianDiagnosticUncertainty/ (accessed on 4 January 2024)

Available at: https://www.hcsl.com/Tools/BayesianDiagnosticUncertainty/BayesianDiagnosticUncertainty.nb (accessed on 4 January 2024)

Operating systems: Microsoft Windows 10+, Linux 3.15+, Apple macOS 11+

Programming language: Wolfram Language

Other software requirements:

For running the program and reading the BayesianDiagnosticUncertaintyCalculations.nb file Wolfram Player^® ver. 12.0+ is required, freely available at: https://www.wolfram.com/player/ (accessed 18 December 2023) or Wolfram Mathematica^® ver. 12.0+

System requirements: Intel^® i9™ or equivalent CPU and 32 GB of RAM

License: Attribution—Noncommercial—ShareAlike 4.0 International Creative Commons License

Appendix A.3. A Note about the Program

About the Program Controls

The program features an intuitive tabbed user interface, designed to streamline user interaction and facilitate effortless navigation across its multiple modules and submodules.

The numerical settings are defined by the user with menus or sliders. Sliders can be finely manipulated by holding down the alt key or opt key while dragging the mouse. They be even more finely manipulated by also holding the shift and/or ctrl keys.

Dragging with the mouse while pressing the ctrl, alt, or opt keys zooms plots in or out.

Range of input parameters

x : m a x i m u m (μ_{\bar{D}} - 5 σ_{\bar{D}}, 0) - μ_{D} + 5 σ_{\bar{D}}

n_{D}

: 2–10,000

μ_{D}

: 0.1–10,000

σ_{D}

: 0.01–1000

n_{\bar{D}}

: 2–10,000

μ_{\bar{D}}

: 0.1–10,000

σ_{\bar{D}}

: 0.01–1000

r

: 0.010–0.500

n_{U}

: 20–10,000

b_{0}

: 0–

σ_{\bar{D}}

b_{1} :

0–0.1

p

: 0.900–0.999

References

Weiner, E.S.C.; Simpson, J.A. The Oxford English Dictionary; Oxford Univeristy Press: Oxford, UK, 1989; ISBN 9780198611868. [Google Scholar]
Chatzimichail, T.; Hatjimihail, A.T. A Bayesian Inference Based Computational Tool for Parametric and Nonparametric Medical Diagnosis. Diagnostics 2023, 13, 3135. [Google Scholar] [CrossRef]
Choi, Y.-K.; Johnson, W.O.; Thurmond, M.C. Diagnosis Using Predictive Probabilities without Cut-Offs. Stat. Med. 2006, 25, 699–717. [Google Scholar] [CrossRef] [PubMed]
Bours, M.J. Bayes’ Rule in Diagnosis. J. Clin. Epidemiol. 2021, 131, 158–160. [Google Scholar] [CrossRef] [PubMed]
Gelman, A.; Carlin, J.B.; Stern, H.S.; Dunson, D.B.; Vehtari, A.; Rubin, D.B. Bayesian Data Analysis; CRC Press: Boca Raton, FL, USA, 2013; ISBN 9781439898208. [Google Scholar]
van de Schoot, R.; Depaoli, S.; King, R.; Kramer, B.; Märtens, K.; Tadesse, M.G.; Vannucci, M.; Gelman, A.; Veen, D.; Willemsen, J.; et al. Bayesian Statistics and Modelling. Nat. Rev. Methods Primers 2021, 1, 1. [Google Scholar] [CrossRef]
Viana, M.A.G.; Ramakrishnan, V. Bayesian Estimates of Predictive Value and Related Parameters of a Diagnostic Test. Can. J. Stat. 1992, 20, 311–321. [Google Scholar] [CrossRef]
Topol, E.J. Individualized Medicine from Prewomb to Tomb. Cell 2014, 157, 241–253. [Google Scholar] [CrossRef] [PubMed]
Joyce, J. Bayes’ Theorem. In The Stanford Encyclopedia of Philosophy; Stanford University: Stanford, CA, USA, 2021. [Google Scholar]
Lehmann, E.L.; Romano, J.P. Testing Statistical Hypotheses; Springer: New York, NY, USA, 2008; ISBN 9780387988641. [Google Scholar]
Box, G.E.P.; Cox, D.R. An Analysis of Transformations. J. R. Stat. Soc. Series B Stat. Methodol. 1964, 26, 211–243. [Google Scholar] [CrossRef]
D’Agostino, R.; Pearson, E.S. Tests for Departure from Normality. Empirical Results for the Distributions of b₂ and √b₁. Biometrika 1973, 60, 613–622. [Google Scholar] [CrossRef]
Srinivasan, P.; Westover, M.B.; Bianchi, M.T. Propagation of Uncertainty in Bayesian Diagnostic Test Interpretation. South Med. J. 2012, 105, 452–459. [Google Scholar] [CrossRef]
Ayyub, B.M.; Klir, G.J. Uncertainty Modeling and Analysis in Engineering and the Sciences; Chapman and Hall/CRC: Boca Raton, FL, USA, 2006. [Google Scholar]
Joint Committee for Guides in Metrology. Evaluation of Measurement Data—Supplement 2 to the “Guide to the Expression of Uncertainty in Measurement”—Extension to Any Number of Output Quantities; BIPM: Sèvres, France, 2011. [Google Scholar]
Kallner, A.; Boyd, J.C.; Duewer, D.L.; Giroud, C.; Hatjimihail, A.T.; Klee, G.G.; Lo, S.F.; Pennello, G.; Sogin, D.; Tholen, D.W.; et al. Expression of Measurement Uncertainty in Laboratory Medicine; Approved Guideline; Clinical and Laboratory Standards Institute: Wayne, PA, USA, 2012. [Google Scholar]
Smith, A.F.; Shinkins, B.; Hall, P.S.; Hulme, C.T.; Messenger, M.P. Toward a Framework for Outcome-Based Analytical Performance Specifications: A Methodology Review of Indirect Methods for Evaluating the Impact of Measurement Uncertainty on Clinical Outcomes. Clin. Chem. 2019, 65, 1363–1374. [Google Scholar] [CrossRef]
Ceriotti, F.; Fernandez-Calle, P.; Klee, G.G.; Nordin, G.; Sandberg, S.; Streichert, T.; Vives-Corrons, J.-L.; Panteghini, M. Criteria for Assigning Laboratory Measurands to Models for Analytical Performance Specifications Defined in the 1st EFLM Strategic Conference. Clin. Chem. Lab. Med. 2017, 55, 189–194. [Google Scholar] [CrossRef]
Chatzimichail, T.; Hatjimihail, A.T. A Software Tool for Calculating the Uncertainty of Diagnostic Accuracy Measures. Diagnostics 2021, 11, 406. [Google Scholar] [CrossRef]
Rostron, P.D.; Fearn, T.; Ramsey, M.H. Confidence Intervals for Robust Estimates of Measurement Uncertainty. Accredit. Qual. Assur. 2020, 25, 107–119. [Google Scholar] [CrossRef]
Geisser, S.; Johnson, W.O. Modes of Parametric Statistical Inference; John Wiley & Sons: Hoboken, NJ, USA, 2006; ISBN 9780471743125. [Google Scholar]
White, G.H. Basics of Estimating Measurement Uncertainty. Clin. Biochem. Rev. 2008, 29 (Suppl. S1), S53–S60. [Google Scholar]
Ellison, S.L.R.; Williams, A. Quantifying Uncertainty in Analytical Measurement, 3rd ed.; EURACHEM/CITAC: Teddington, UK, 2012. [Google Scholar]
Agresti, A.; Franklin, C.; Klingenberg, B. Statistics: The Art and Science of Learning from Data, Global Edition, 4th ed.; Pearson Education: London, UK, 2023; ISBN 9781292442464. [Google Scholar]
Miller, J.; Miller, J.C. Statistics and Chemometrics for Analytical Chemistry, 7th ed.; Pearson Education: London, UK, 2018; ISBN 9781292186719. [Google Scholar]
Aitchison, J.; Brown, J.A.C. The Lognormal Distribution with Special Reference to Its Uses in Econometrics; Cambridge University Press: Cambridge, UK, 1957. [Google Scholar]
Agresti, A.; Coull, B.A. Approximate Is Better than “Exact” for Interval Estimation of Binomial Proportions. Am. Stat. 1998, 52, 119–126. [Google Scholar] [CrossRef]
Wilson, B.M.; Smith, B.L. Taylor-Series and Monte-Carlo-Method Uncertainty Estimation of the Width of a Probability Distribution Based on Varying Bias and Random Error. Meas. Sci. Technol. 2013, 24, 035301. [Google Scholar] [CrossRef]
Welch, B.L. The Generalization of ‘Student’s’ Problem When Several Different Population Variances Are Involved. Biometrika 1947, 34, 28–35. [Google Scholar] [CrossRef]
Satterthwaite, F.E. An Approximate Distribution of Estimates of Variance Components. Biometrics 1946, 2, 110–114. [Google Scholar] [CrossRef]
ElSayed, N.A.; Aleppo, G.; Aroda, V.R.; Bannuru, R.R.; Brown, F.M.; Bruemmer, D.; Collins, B.S.; Hilliard, M.E.; Isaacs, D.; Johnson, E.L.; et al. 2. Classification and Diagnosis of Diabetes: Standards of Care in Diabetes—2023. Diabetes Care 2023, 46, S19–S40. [Google Scholar] [CrossRef]
Sun, H.; Saeedi, P.; Karuranga, S.; Pinkepank, M.; Ogurtsova, K.; Duncan, B.B.; Stein, C.; Basit, A.; Chan, J.C.N.; Mbanya, J.C.; et al. IDF Diabetes Atlas: Global, Regional and Country-Level Diabetes Prevalence Estimates for 2021 and Projections for 2045. Diabetes Res. Clin. Pract. 2022, 183, 109119. [Google Scholar] [CrossRef]
National Center for Health Statistics. National Health and Nutrition Examination Survey Data. Available online: https://wwwn.cdc.gov/nchs/nhanes/default.aspx (accessed on 4 September 2023).
National Center for Health Statistics. National Health and Nutrition Examination Survey Questionnaire. Available online: https://wwwn.cdc.gov/nchs/nhanes/Search/variablelist.aspx?Component=Questionnaire (accessed on 4 September 2023).
Myung, I.J. Tutorial on Maximum Likelihood Estimation. J. Math. Psychol. 2003, 47, 90–100. [Google Scholar] [CrossRef]
Nielsen, A.A. Least Squares Adjustment: Linear and Nonlinear Weighted Regression Analysis; Technical University of Denmark: Kongens Lyngby, Denmark, 2007. [Google Scholar]
Darling, D.A. The Kolmogorov-Smirnov, Cramer-von Mises Tests. Ann. Math. Stat. 1957, 28, 823–838. [Google Scholar] [CrossRef]
Tucker, L.A. Limited Agreement between Classifications of Diabetes and Prediabetes Resulting from the OGTT, Hemoglobin A1c, and Fasting Glucose Tests in 7412 U.S. Adults. J. Clin. Med. Res. 2020, 9, 2207. [Google Scholar] [CrossRef]
Obermeyer, Z.; Emanuel, E.J. Predicting the Future—Big Data, Machine Learning, and Clinical Medicine. N. Engl. J. Med. 2016, 375, 1216–1219. [Google Scholar] [CrossRef]
Wereski, R.; Kimenai, D.M.; Taggart, C.; Doudesis, D.; Lee, K.K.; Lowry, M.T.H.; Bularga, A.; Lowe, D.J.; Fujisawa, T.; Apple, F.S.; et al. Cardiac Troponin Thresholds and Kinetics to Differentiate Myocardial Injury and Myocardial Infarction. Circulation 2021, 144, 528–538. [Google Scholar] [CrossRef]
Roberts, E.; Ludman, A.J.; Dworzynski, K.; Al-Mohammad, A.; Cowie, M.R.; McMurray, J.J.V.; Mant, J.; NICE Guideline Development Group for Acute Heart Failure. The Diagnostic Accuracy of the Natriuretic Peptides in Heart Failure: Systematic Review and Diagnostic Meta-Analysis in the Acute Care Setting. BMJ 2015, 350, h910. [Google Scholar] [CrossRef]
Freund, Y.; Chauvin, A.; Jimenez, S.; Philippon, A.-L.; Curac, S.; Fémy, F.; Gorlicki, J.; Chouihed, T.; Goulet, H.; Montassier, E.; et al. Effect of a Diagnostic Strategy Using an Elevated and Age-Adjusted D-Dimer Threshold on Thromboembolic Events in Emergency Department Patients with Suspected Pulmonary Embolism: A Randomized Clinical Trial. JAMA 2021, 326, 2141–2149. [Google Scholar] [CrossRef]
Rani, P.R.; Begum, J. Screening and Diagnosis of Gestational Diabetes Mellitus, Where Do We Stand. J. Clin. Diagn. Res. 2016, 10, QE01–QE04. [Google Scholar] [CrossRef]
Reyes Domingo, F.; Avey, M.T.; Doull, M. Screening for Thyroid Dysfunction and Treatment of Screen-Detected Thyroid Dysfunction in Asymptomatic, Community-Dwelling Adults: A Systematic Review. Syst. Rev. 2019, 8, 260. [Google Scholar] [CrossRef]
Rodriguez-Thompson, D.; Lieberman, E.S. Use of a Random Urinary Protein-to-Creatinine Ratio for the Diagnosis of Significant Proteinuria during Pregnancy. Am. J. Obstet. Gynecol. 2001, 185, 808–811. [Google Scholar] [CrossRef]
Moynihan, R.; Glassock, R.; Doust, J. Chronic Kidney Disease Controversy: How Expanding Definitions Are Unnecessarily Labelling Many People as Diseased. BMJ 2013, 347, f4298. [Google Scholar] [CrossRef] [PubMed]
Haeckel, R.; Wosniok, W.; Gurr, E.; Peil, B.; on behalf of Arbeitsgruppe Richtwer. Supplements to a Recent Proposal for Permissible Uncertainty of Measurements in Laboratory Medicine. LaboratoriumsMedizin 2016, 40, 141–145. [Google Scholar] [CrossRef]
Knottnerus, J.A.; Buntinx, F. (Eds.) The Evidence Base of Clinical Diagnosis. In Evidence-Based Medicine, 2nd ed.; BMJ Books: London, UK, 2011; ISBN 9781444360639. [Google Scholar]
Hajian-Tilaki, K. Sample Size Estimation in Diagnostic Test Studies of Biomedical Informatics. J. Biomed. Inform. 2014, 48, 193–204. [Google Scholar] [CrossRef] [PubMed]
Baron, J.A. Uncertainty in Bayes. Med. Decis. Mak. 1994, 14, 46–51. [Google Scholar] [CrossRef]
Ashby, D.; Smith, A.F. Evidence-Based Medicine as Bayesian Decision-Making. Stat. Med. 2000, 19, 3291–3305. [Google Scholar] [CrossRef]
Knottnerus, J.A.; Dinant, G.J. Medicine Based Evidence, a Prerequisite for Evidence Based Medicine. BMJ 1997, 315, 1109–1110. [Google Scholar] [CrossRef]
Pfeiffer, R.M.; Castle, P.E. With or without a Gold Standard. Epidemiology 2005, 16, 595–597. [Google Scholar] [CrossRef]
van Smeden, M.; Naaktgeboren, C.A.; Reitsma, J.B.; Moons, K.G.M.; de Groot, J.A.H. Latent Class Models in Diagnostic Studies When There Is No Reference Standard—A Systematic Review. Am. J. Epidemiol. 2014, 179, 423–431. [Google Scholar] [CrossRef]
Wasserman, L. All of Nonparametric Statistics; Springer: Berlin/Heidelberg, Germany, 2006; ISBN 9780387306230. [Google Scholar]
Wilson, J.M.G.; Jungner, G. Principles and Practice of Screening for Disease; Public health papers; World Health Organization: Geneva, Switzerland, 1968; Volume 34. [Google Scholar]
Petersen, P.H.; Horder, M. 2.3 Clinical Test Evaluation. Unimodal and Bimodal Approaches. Scand. J. Clin. Lab. Investig. 1992, 52, 51–57. [Google Scholar] [CrossRef]
Pires, A.M.; Amado, C. Interval Estimators for a Binomial Proportion: Comparison of Twenty Methods. Revstat Stat. J. 2008, 6, 165–197. [Google Scholar] [CrossRef]
Schmoyeri, R.L.; Beauchamp, J.J.; Brandt, C.C.; Hoffman, F.O. Difficulties with the Lognormal Model in Mean Estimation and Testing. Environ. Ecol. Stat. 1996, 3, 81–97. [Google Scholar] [CrossRef]
Bhaumik, D.K.; Kapur, K.; Gibbons, R.D. Testing Parameters of a Gamma Distribution for Small Samples. Technometrics 2009, 51, 326–334. [Google Scholar] [CrossRef]
Williams, A. Calculation of the Expanded Uncertainty for Large Uncertainties Using the Lognormal Distribution. Accredit. Qual. Assur. 2020, 25, 335–338. [Google Scholar] [CrossRef]
Stephens, M. The Bayesian Lens and Bayesian Blinkers. Philos. Trans. A Math. Phys. Eng. Sci. 2023, 381, 20220144. [Google Scholar] [CrossRef] [PubMed]
Joint Committee for Guides in Metrology. Evaluation of Measurement Data—Supplement 1 to the “Guide to the Expression of Uncertainty in Measurement”—Propagation of Distributions Using a Monte Carlo Method Joint Committee for Guides in Metrology; BIPM: Sèvres, France, 2008. [Google Scholar]
Joint Committee for Guides in Metrology. Guide to the Expression of Uncertainty in Measurement—Part 6: Developing and Using Measurement Models; BIPM: Sèvres, France, 2020. [Google Scholar]
Meneilly, G.S.; Elliott, T. Metabolic Alterations in Middle-Aged and Elderly Obese Patients with Type 2 Diabetes. Diabetes Care 1999, 22, 112–118. [Google Scholar] [CrossRef]
Geer, E.B.; Shen, W. Gender Differences in Insulin Resistance, Body Composition, and Energy Balance. Gend. Med. 2009, 6 (Suppl. S1), 60–75. [Google Scholar] [CrossRef] [PubMed]
Van Cauter, E.; Polonsky, K.S.; Scheen, A.J. Roles of Circadian Rhythmicity and Sleep in Human Glucose Regulation. Endocr. Rev. 1997, 18, 716–738. [Google Scholar] [CrossRef]
Colberg, S.R.; Sigal, R.J.; Fernhall, B.; Regensteiner, J.G.; Blissmer, B.J.; Rubin, R.R.; Chasan-Taber, L.; Albright, A.L.; Braun, B.; American College of Sports Medicine; et al. Exercise and Type 2 Diabetes: The American College of Sports Medicine and the American Diabetes Association: Joint Position Statement. Diabetes Care 2010, 33, e147–e167. [Google Scholar] [CrossRef]
Salmerón, J.; Manson, J.E.; Stampfer, M.J.; Colditz, G.A.; Wing, A.L.; Willett, W.C. Dietary Fiber, Glycemic Load, and Risk of Non-Insulin-Dependent Diabetes Mellitus in Women. JAMA 1997, 277, 472–477. [Google Scholar] [CrossRef] [PubMed]
Surwit, R.S.; van Tilburg, M.A.L.; Zucker, N.; McCaskill, C.C.; Parekh, P.; Feinglos, M.N.; Edwards, C.L.; Williams, P.; Lane, J.D. Stress Management Improves Long-Term Glycemic Control in Type 2 Diabetes. Diabetes Care 2002, 25, 30–34. [Google Scholar] [CrossRef]
Pandit, M.K.; Burke, J.; Gustafson, A.B.; Minocha, A.; Peiris, A.N. Drug-Induced Disorders of Glucose Tolerance. Ann. Intern. Med. 1993, 118, 529–539. [Google Scholar] [CrossRef]
Dupuis, J.; Langenberg, C.; Prokopenko, I.; Saxena, R.; Soranzo, N.; Jackson, A.U.; Wheeler, E.; Glazer, N.L.; Bouatia-Naji, N.; Gloyn, A.L.; et al. New Genetic Loci Implicated in Fasting Glucose Homeostasis and Their Impact on Type 2 Diabetes Risk. Nat. Genet. 2010, 42, 105–116. [Google Scholar] [CrossRef]
O’Hagan, A.; Buck, C.E.; Daneshkhah, A.; Richard Eiser, J.; Garthwaite, P.H.; Jenkinson, D.J.; Oakley, J.E.; Rakow, T. Uncertain Judgements: Eliciting Experts’ Probabilities; John Wiley & Sons: Hoboken, NJ, USA, 2006; ISBN 9780470033302. [Google Scholar]
Berger, J.O. Statistical Decision Theory and Bayesian Analysis; Springer: Berlin/Heidelberg, Germany, 1985; ISBN 9780387960982. [Google Scholar]
Whiting, P.F.; Rutjes, A.W.S.; Westwood, M.E.; Mallett, S.; QUADAS-2 Steering Group. A Systematic Review Classifies Sources of Bias and Variation in Diagnostic Test Accuracy Studies. J. Clin. Epidemiol. 2013, 66, 1093–1104. [Google Scholar] [CrossRef] [PubMed]
Salameh, J.-P.; Bossuyt, P.M.; McGrath, T.A.; Thombs, B.D.; Hyde, C.J.; Macaskill, P.; Deeks, J.J.; Leeflang, M.; Korevaar, D.A.; Whiting, P.; et al. Preferred Reporting Items for Systematic Review and Meta-Analysis of Diagnostic Test Accuracy Studies (PRISMA-DTA): Explanation, Elaboration, and Checklist. BMJ 2020, 370, m2632. [Google Scholar] [CrossRef] [PubMed]
Schlattmann, P. Tutorial: Statistical Methods for the Meta-Analysis of Diagnostic Test Accuracy Studies. Clin. Chem. Lab. Med. 2023, 61, 777–794. [Google Scholar] [CrossRef] [PubMed]

Figure 1. A simplified flowchart of the program Bayesian Diagnostic Uncertainty with the number of input parameters and of output types for each submodule.

Figure 2. A screenshot of the program Bayesian Diagnostic Uncertainty.

Figure 3. The estimated PDF of the FPG (mg/dL) in diabetic participants, assuming a lognormal distribution and negligible measurement uncertainty, and the histogram of the respective NHANES dataset, with the parameters of the distribution in Table 2.

Figure 4. The estimated PDF of the FPG (mg/dL) in nondiabetic participants, assuming a lognormal distribution and negligible measurement uncertainty, and the histogram of the respective NHANES dataset, with the parameters of the distribution in Table 2.

Figure 5. Standard sampling, measurement, and combined uncertainty of the posterior probability for diabetes versus FPG curve plot, with the settings of the program in Table 2.

Figure 6. Relative standard sampling, measurement, and combined uncertainty of the posterior probability for diabetes versus FPG curve plot, with the settings of the program in Table 2.

Figure 7. Confidence intervals of the posterior probability for diabetes versus FPG curves plot, with the settings of the program in Table 2.

Figure 8. Standard sampling, measurement, and combined uncertainty of the posterior probability for diabetes versus measurement uncertainty constant contribution

b_{0}

curve plot, with the settings of the program in Table 2.

Figure 8. Standard sampling, measurement, and combined uncertainty of the posterior probability for diabetes versus measurement uncertainty constant contribution

b_{0}

curve plot, with the settings of the program in Table 2.

Figure 9. Relative standard sampling, measurement, and combined uncertainty of the posterior probability for diabetes versus measurement uncertainty constant contribution

b_{0}

curve plot, with the settings of the program in Table 2.

Figure 9. Relative standard sampling, measurement, and combined uncertainty of the posterior probability for diabetes versus measurement uncertainty constant contribution

b_{0}

curve plot, with the settings of the program in Table 2.

Figure 10. Confidence intervals of the posterior probability for diabetes versus measurement uncertainty constant contribution

b_{0}

curves plot, with the settings of the program in Table 2.

Figure 10. Confidence intervals of the posterior probability for diabetes versus measurement uncertainty constant contribution

b_{0}

curves plot, with the settings of the program in Table 2.

Figure 11. Standard sampling, measurement, and combined uncertainty of the posterior probability for diabetes versus measurement uncertainty proportionality constant

b_{1}

curve plot, with the settings of the program in Table 2.

Figure 11. Standard sampling, measurement, and combined uncertainty of the posterior probability for diabetes versus measurement uncertainty proportionality constant

b_{1}

curve plot, with the settings of the program in Table 2.

Figure 12. Relative standard sampling, measurement, and combined uncertainty of the posterior probability for diabetes versus measurement uncertainty proportionality constant

b_{1}

curve plot, with the settings of the program in Table 2.

Figure 12. Relative standard sampling, measurement, and combined uncertainty of the posterior probability for diabetes versus measurement uncertainty proportionality constant

b_{1}

curve plot, with the settings of the program in Table 2.

Figure 13. Confidence intervals of the posterior probability for diabetes versus measurement uncertainty proportionality constant

b_{1}

curves plot, with the settings of the program in Table 2.

Figure 13. Confidence intervals of the posterior probability for diabetes versus measurement uncertainty proportionality constant

b_{1}

curves plot, with the settings of the program in Table 2.

Figure 14. Standard sampling, measurement, and combined uncertainty of the posterior probability for diabetes versus total population sample size n curve plot, with the settings of the program in Table 2.

Figure 15. Relative standard sampling, measurement, and combined uncertainty of the posterior probability for diabetes versus total population sample size n curve plot, with the settings of the program in Table 2.

Figure 16. Confidence intervals of the posterior probability for diabetes versus total population sample size n curves plot, with the settings of the program in Table 2.

Figure 17. Table of the standard sampling, measurement, and combined uncertainty of the posterior probability for diabetes, with the settings of the program in Table 2.

Figure 18. Table of the relative standard sampling, measurement, and combined uncertainty of the posterior probability for diabetes, with the settings of the program in Table 2.

Figure 19. Confidence intervals of the posterior probability for diabetes, with the settings of the program in Table 2.

Table 1. Descriptive statistics of the fasting plasma glucose datasets.

	Diabetic Participants	Nondiabetic Participants
n	154	822
Mean	120.7	102.6
Median	117.0	102.0
Standard Deviation	19.1	10.9
Skewness	1.448	0.523
Kurtosis	6.354	3.445

Table 2. Descriptive statistics of the estimated lognormal distributions of the diabetic and nondiabetic populations.

	Diabetic Participants		Nondiabetic Participants
Estimated Distribution	$L_{D}$	$l_{D}$	$L_{\bar{D}}$	$l_{\bar{D}}$
Mean Uncertainty	1.586	0	1.028	0
Mean	120.7	120.7	102.6	102.6
Median	119.4	119.4	102.1	102.1
Standard Deviation	17.8	17.7	10.9	10.7
Skewness	0.446	0.444	0.315	0.312
Kurtosis	3.355	3.352	3.177	3.174
p-value (Cramér–von Mises test)	0.294	0.295	0.281	0.299

Table 3. The settings of the program Bayesian Diagnostic Uncertainty for Figure 5, Figure 6, Figure 7, Figure 8, Figure 9, Figure 10, Figure 11, Figure 12, Figure 13, Figure 14, Figure 15, Figure 16, Figure 17, Figure 18 and Figure 19.

Settings	Figure 5 and Figure 6	Figure 7	Figure 8 and Figure 9	Figure 10	Figure 11 and Figure 12	Figure 13	Figure 14 and Figure 15	Figure 16	Figure 17 and Figure 18	Figure 19
p	-	0.95	-	0.95	-	0.95	-	0.95	-	0.95
x	31.0–192.0	31.0–192.0	126.0	126.0	126.0	126.0	126.0	126.0	126.0	126.0
$μ_{D}$	120.7	120.7	120.7	120.7	120.7	120.7	120.7	120.7	120.7	120.7
$σ_{D}$	17.7	17.7	17.7	17.7	17.7	17.7	17.7	17.7	17.7	17.7
$n_{D}$	154	154	154	154	154	154	-	-	154	154
$μ_{\bar{D}}$	102.7	102.7	102.7	102.7	102.7	102.7	102.7	102.7	102.7	102.7
$σ_{\bar{D}}$	10.7	10.7	10.7	10.7	10.7	10.7	10.7	10.7	10.7	10.7
$n_{\bar{D}}$	822	822	822	822	822	822	-	-	822	822
n	-	-	-	-	-	-	65–5000	65–5000	-	-
r	-	-	-	-	-	-	0.158	0.158	-	-
$b_{0}$	0.866	0.866	0.0–0.161	0.0–0.161	0.866	0.866	0.866	0.866	0.866	0.866
$b_{1}$	0.0109	0.0109	0.0109	0.0109	0.0–0.1	0.0–0.1	0.0109	0.0109	0.0109	0.0109
$n_{U}$	-	1350	-	1350	-	1350	-	1350	-	1350
$l_{D}$	lognormal	lognormal	lognormal	lognormal	lognormal	lognormal	lognormal	lognormal	normal lognormal gamma	normal lognormal gamma
$l_{\bar{D}}$	lognormal	lognormal	lognormal	lognormal	lognormal	lognormal	lognormal	lognormal	normal lognormal gamma	normal lognormal gamma

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Chatzimichail, T.; Hatjimihail, A.T. A Software Tool for Estimating Uncertainty of Bayesian Posterior Probability for Disease. Diagnostics 2024, 14, 402. https://doi.org/10.3390/diagnostics14040402

AMA Style

Chatzimichail T, Hatjimihail AT. A Software Tool for Estimating Uncertainty of Bayesian Posterior Probability for Disease. Diagnostics. 2024; 14(4):402. https://doi.org/10.3390/diagnostics14040402

Chicago/Turabian Style

Chatzimichail, Theodora, and Aristides T. Hatjimihail. 2024. "A Software Tool for Estimating Uncertainty of Bayesian Posterior Probability for Disease" Diagnostics 14, no. 4: 402. https://doi.org/10.3390/diagnostics14040402

APA Style

Chatzimichail, T., & Hatjimihail, A. T. (2024). A Software Tool for Estimating Uncertainty of Bayesian Posterior Probability for Disease. Diagnostics, 14(4), 402. https://doi.org/10.3390/diagnostics14040402

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Software Tool for Estimating Uncertainty of Bayesian Posterior Probability for Disease

Abstract

1. Introduction

1.1. Diagnosis in Medicine

1.1.1. Bayes’ Theorem in Medical Diagnostics

1.1.2. Challenges in Applying Bayesian Inference

Computational Complexity

Statistical Distributions in Diagnostics

Uncertainty of Bayesian Posterior Probabilities

1.1.3. Quantifying Uncertainty in Diagnostics

Combined Uncertainty

Measurement Uncertainty

Sampling Uncertainty

2. Methods

2.1. Computational Methods

2.1.1. Bayes’ Theorem

2.1.2. Parametric Distributions

2.1.3. Uncertainty Quantification

Measurement Uncertainty

Sampling Uncertainties of Means and Standard Deviations

Sampling Uncertainty of Prior Probability for Disease

Combined Uncertainty of Posterior Probability for Disease

2.1.4. Expanded Uncertainty of Posterior Probability for Disease

2.2. The Software

2.2.1. Program Overview

2.2.2. Input Parameters

Measurement Uncertainty

2.2.3. Output Specifications

Visualizations

Tables

3. Illustrative Case Study

4. Results

5. Discussion

5.1. Reevaluation of Traditional Diagnostic Methods

5.2. Limitations of the Program

5.3. Limitations of the Case Study

5.4. Challenges in Bayesian Analysis for Disease Diagnosis

5.5. Implications of Incomplete Data

5.6. Analysis of the Double Sigmoidal Curve in Posterior Probability Estimation and Its Impact on Uncertainty

5.7. Software Comparison

6. Conclusions

Supplementary Materials

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Appendix A

Appendix A.1. Formalisms and Notation

Appendix A.1.1. Parametric Distributions

Appendix A.1.2. Calculations of the Posterior Probability for Disease and Its Uncertainty

Appendix A.2. Software Availability and Requirements

Appendix A.3. A Note about the Program

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI