Joint Statistical Inference for the Area under the ROC Curve and Youden Index under a Density Ratio Model
Abstract
:1. Introduction
2. Methodology
2.1. Maximum Empirical Likelihood Estimators of AUC and J
2.2. Joint Asymptotic Normality of
- C1.
- for any .
- C2.
- and are continuous in the neighborhood of , with and .
- C3.
- The total sample size , and remains constant.
- C4.
- The DRM (2) is satisfied by and . Additionally, is positive definite, and for in a neighborhood of ,
3. Joint Inference Procedures for under the DRM
3.1. Confidence Region of
- Step 1.
- Calculate and the corresponding based on the observed data and .
- Step 2.
- For , draw a bootstrap sample of size with replacements from and another bootstrap sample of size with replacements from .
- Step 3.
- For , based on the lth bootstrap two-sample data in Step 2, calculate the estimate for and the corresponding for , and compute
- Step 4.
- Obtain the th quantile of , which is denoted as .
- Step 5.
- The bootstrap confidence region of is given by
3.2. Joint Hypothesis Testing on
- (a)
- The maximum likelihood estimator of μ in based on is given by
- (b)
- The likelihood ratio test statistic for testing versus iswhere is the positive part of x.
- Step 1.
- Calculate the test statistic based on the observed data and .
- Step 2.
- For , generate l-th boostrap two-sample data as follows:
- (a)
- If or , draw a bootstrap sample of size from and another bootstrap sample of size from .
- (b)
- If , , and , draw a sample of size from and another sample of size from .
- (c)
- If , , and , draw a sample of size from and another sample of size from .
- Step 3.
- For , calculate the test statistic based on the l-th bootstrap two-sample data in Step 2 (using the same method as in Step 1).
- Step 4.
- Calculate the p-value of as
- Step 5.
4. Simulation Study
4.1. Simulation Parameter Settings
- (1)
- and ;
- (2)
- and .
4.2. Simulation for Confidence Regions
- Empirical likelihood method (proposed in (6)), which is denoted as “EL”;
- Bootstrap empirical likelihood method with (proposed in (7)), which is denoted as “BEL”;
- Parametric Box–Cox asymptotic delta method, which is denoted as “AD”;
- Parametric Box–Cox generalized inference approach, which is denoted as “GPQ”;
- Nonparametric bootstrap confidence region, which is denoted as “BTI”;
- Nonparametric bootstrap confidence region with the arcsin-square-root transformation, which is denoted as “BTAT”.
4.3. Simulation for Joint Hypothesis Testing
- Parametric bootstrap joint test method, which is denoted as “PBA”;
- Nonparametric kernel-smoothed-based joint test method, which is denoted as “NKS”.
5. Real Data Analysis
6. Summary
7. Discussion
Supplementary Materials
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Appendix A. Proof of Theorem 1
Appendix A.1. Some Preparation
Appendix A.2. Proof of Theorem 1
Appendix B. Proof of Proposition 1
Appendix C. Numerical Calculations of
- Step 1.
- Find all s such that
- Step 2.
- Obtain
- Step 3.
- Calculate and
- Step 1.
- Find all s such that
- Step 2.
- Obtain
- Step 3.
- Calculate and
References
- Yin, J.; Tian, L. Joint confidence region estimation for area under ROC curve and Youden index. Stat. Med. 2014, 33, 985–1000. [Google Scholar] [CrossRef] [PubMed]
- Yin, J.; Mutiso, F.; Tian, L. Joint hypothesis testing of the area under the receiver operating characteristic curve and the Youden index. Pharm. Stat. 2021, 20, 657–674. [Google Scholar] [CrossRef] [PubMed]
- Pepe, M.S. Receiver operating characteristic methodology. J. Am. Stat. Assoc. 2000, 95, 308–311. [Google Scholar] [CrossRef]
- Qin, J.; Zhang, B. Using logistic regression procedures for estimating receiver operating characteristic curves. Biometrika 2003, 90, 585–596. [Google Scholar] [CrossRef]
- Zhou, X.H.; Obuchowski, N.A.; McClish, D.K. Statistical Methods in Diagnostic Medicine, 2nd ed.; John Wiley & Sons: Hoboken, NJ, USA, 2011. [Google Scholar]
- Chen, B.; Li, P.; Qin, J.; Yu, T. Using a monotonic density ratio model to find the asymptotically optimal combination of multiple diagnostic tests. J. Am. Stat. Assoc. 2016, 111, 861–874. [Google Scholar] [CrossRef]
- Bradley, A.P. The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognit. 1997, 30, 1145–1159. [Google Scholar] [CrossRef]
- Faraggi, D.; Reiser, B. Estimation of the area under the ROC curve. Stat. Med. 2002, 21, 3093–3106. [Google Scholar] [CrossRef] [PubMed]
- Youden, W.J. Index for rating diagnostic tests. Cancer 1950, 3, 32–35. [Google Scholar] [CrossRef]
- Fluss, B.; Faraggi, D.; Reiser, B. Estimation of the Youden Index and its associated cutoff point. Biom. J. 2005, 47, 458–472. [Google Scholar] [CrossRef] [PubMed]
- Schisterman, E.F.; Perkins, N.J.; Liu, A.; Bond, H. Optimal cut-point and its corresponding Youden index to discriminate individuals using pooled blood samples. Epidemiology 2005, 16, 73–81. [Google Scholar] [CrossRef] [PubMed]
- Lavrentieva, A.; Kontakiotis, T.; Lazaridis, L.; Tsotsolis, N.; Koumis, J.; Kyriazis, G.; Bitzani, M. Inflammatory markers in patients with severe burn injury: What is the best indicator of sepsis? Burns 2007, 33, 189–194. [Google Scholar] [CrossRef] [PubMed]
- Bantis, L.E.; Nakas, C.T.; Reiser, B. Constr.Construction of confidence regions in the ROC space after the estimation of the optimal Youden index-based cut-off point. Biometrics 2014, 70, 212–223. [Google Scholar] [CrossRef] [PubMed]
- Wotschofsky, Z.; Busch, J.; Jung, M.; Kempkensteffen, C.; Weikert, S.; Schaser, K.D.; Melcher, I.; Kilic, E.; Miller, K.; Kristiansen, G. Diagnostic and prognostic potential of differentially expressed miRNAs between metastatic and non-metastatic renal cell carcinoma at the time of nephrectomy. Clin. Chim. Acta 2013, 416, 5–10. [Google Scholar] [CrossRef] [PubMed]
- Jiang, S.; Tu, D. Inference on the probability P(T1<T2) as a measurement of treatment effect under a density ratio model and random censoring. Comput. Stat. Data Anal. 2012, 56, 1069–1078. [Google Scholar]
- Wang, C.; Marriott, P.; Li, P. Testing homogeneity for multiple nonnegative distributions with excess zero observations. Comput. Stat. Data Anal. 2017, 114, 146–157. [Google Scholar] [CrossRef]
- Yuan, M.; Li, P.; Wu, C. Semiparametric inference of the Youden index and the optimal cut-off point under density ratio models. Can. J. Stat. 2021, 49, 965–986. [Google Scholar] [CrossRef]
- Anderson, J.A. Multivariate logistic compounds. Biometrika 1979, 66, 17–26. [Google Scholar] [CrossRef]
- Qin, J.; Zhang, B. A goodness-of-fit test for logistic regression models based on case-control data. Biometrika 1997, 84, 609–618. [Google Scholar] [CrossRef]
- Qin, J. Biased Sampling, Over-Identified Parameter Problems and Beyond; Springer: Singapore, 2017. [Google Scholar]
- Hu, D.; Yuan, M.; Yu, T.; Li, P. Statistical inference for the two-sample problem under likelihood ratio ordering, with application to the ROC curve estimation. Stat. Med. 2023, 42, 3649–3664. [Google Scholar] [CrossRef] [PubMed]
- Zhang, B. A semiparametric hypothesis testing procedure for the ROC curve area under a density ratio model. Comput. Stat. Data Anal. 2006, 50, 1855–1876. [Google Scholar] [CrossRef]
- Owen, A.B. Empirical Likelihood; Chapman and Hall/CRC: New York, NY, USA, 2001. [Google Scholar]
- Cai, S.; Chen, J.; Zidek, J.V. Hypothesis testing in the presence of multiple samples under density ratio models. Stat. Sin. 2017, 27, 761–783. [Google Scholar] [CrossRef]
- Hsieh, F.; Turnbull, B.W. Nonparametric methods for evaluating diagnostic tests. Stat. Sin. 1996, 6, 47–62. [Google Scholar]
- Percy, M.E.; Andrews, D.F.; Thompson, M.W. Duchenne muscular dystrophy carrier detection using logistic discrimination: Serum creatine kinase, hemopexin, pyruvate kinase, and lactate dehydrogenase in combination. Am. J. Med. Genet. 1982, 13, 27–38. [Google Scholar] [CrossRef] [PubMed]
- Andrews, D.F.; Herzberg, A.M. Data: A Collection of Problems from Many Fields for the Student and Research Worker; Springer: New York, NY, USA, 2012. [Google Scholar]
- Yin, J.; Samawi, H.; Tian, L. Joint inference about the AUC and Youden index for paired biomarkers. Stat. Med. 2022, 41, 37–64. [Google Scholar] [CrossRef] [PubMed]
- Wang, J.; Yin, J.; Tian, L. Evaluating joint confidence region of hypervolume under ROC manifold and generalized Youden index. Stat. Med. 2024, 43, 869–889. [Google Scholar] [CrossRef]
- McClish, D.K. Analyzing a portion of the ROC curve. Med. Decis. Mak. 1989, 9, 190–195. [Google Scholar] [CrossRef] [PubMed]
- Jiang, Y.; Metz, C.E.; Nishikawa, R.M. A receiver operating characteristic partial area index for highly sensitive diagnostic tests. Radiology 1996, 201, 745–750. [Google Scholar] [CrossRef] [PubMed]
- Zhang, D.D.; Zhou, X.H.; Freeman, D.H., Jr.; Free, J.L. A non-parametric method for the comparison of partial areas under ROC curves and its application to large health care data sets. Stat. Med. 2002, 21, 701–715. [Google Scholar] [CrossRef]
- Dodd, L.E.; Pepe, M.S. Partial AUC estimation and regression. Biometrics 2003, 59, 614–623. [Google Scholar] [CrossRef] [PubMed]
- Ma, H.; Bandos, A.I.; Rocket, H.E.; Gur, D. On use of partial area under the ROC curve for evaluation of diagnostic performance. Stat. Med. 2013, 32, 3449–3458. [Google Scholar] [CrossRef] [PubMed]
Distribution | ||||||
---|---|---|---|---|---|---|
lognormal | 0.707 | 0.3 | 0 | 1 | 0.77 | 1 |
0.830 | 0.5 | 0 | 1 | 1.35 | 1 | |
0.928 | 0.7 | 0 | 1 | 2.07 | 1 | |
Distribution | ||||||
beta | 0.704 | 0.3 | 1.5 | 3 | 2.77 | 3 |
0.824 | 0.5 | 1.5 | 3 | 4.25 | 3 | |
0.922 | 0.7 | 1.5 | 3 | 7.09 | 3 |
EL | BEL | AD | GPQ | BTI | BTAT | |||
---|---|---|---|---|---|---|---|---|
(50, 50) | 0.707 | 0.3 | ||||||
0.830 | 0.5 | |||||||
0.928 | 0.7 | |||||||
(100, 100) | 0.707 | 0.3 | ||||||
0.830 | 0.5 | |||||||
0.928 | 0.7 | |||||||
(150, 50) | 0.707 | 0.3 | ||||||
0.830 | 0.5 | |||||||
0.928 | 0.7 |
EL | BEL | AD | GPQ | BTI | BTAT | |||
---|---|---|---|---|---|---|---|---|
(50, 50) | 0.704 | 0.3 | ||||||
0.824 | 0.5 | |||||||
0.922 | 0.7 | |||||||
(100, 100) | 0.704 | 0.3 | ||||||
0.824 | 0.5 | |||||||
0.922 | 0.7 | |||||||
(150, 50) | 0.704 | 0.3 | ||||||
0.824 | 0.5 | |||||||
0.922 | 0.7 |
BELT | PBA | NKS | ||
---|---|---|---|---|
(0.830, 0.5) | (50, 50) | 5.60 | 2.65 | 0.25 |
(75, 50) | 5.60 | 2.40 | 0.10 | |
(75, 75) | 5.50 | 2.35 | 0.05 | |
(0.928, 0.7) | (50, 50) | 90.85 | 58.35 | 24.45 |
(75, 50) | 96.35 | 66.25 | 23.10 | |
(75, 75) | 98.25 | 70.75 | 37.25 |
BELT | PBA | NKS | ||
---|---|---|---|---|
(0.824, 0.5) | (50, 50) | 5.65 | 7.30 | 3.05 |
(75, 50) | 5.40 | 8.75 | 2.30 | |
(75, 75) | 5.05 | 7.90 | 2.45 | |
(0.922, 0.7) | (50, 50) | 91.00 | 94.00 | 81.55 |
(75, 50) | 95.70 | 97.60 | 90.25 | |
(75, 75) | 97.70 | 99.05 | 93.90 |
Biomarker | BEL | GPQ | BTAT | |
---|---|---|---|---|
PK | PE | |||
ACR | ||||
H | PE | |||
ACR |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Liu, S.; Tian, Q.; Liu, Y.; Li, P. Joint Statistical Inference for the Area under the ROC Curve and Youden Index under a Density Ratio Model. Mathematics 2024, 12, 2118. https://doi.org/10.3390/math12132118
Liu S, Tian Q, Liu Y, Li P. Joint Statistical Inference for the Area under the ROC Curve and Youden Index under a Density Ratio Model. Mathematics. 2024; 12(13):2118. https://doi.org/10.3390/math12132118
Chicago/Turabian StyleLiu, Siyan, Qinglong Tian, Yukun Liu, and Pengfei Li. 2024. "Joint Statistical Inference for the Area under the ROC Curve and Youden Index under a Density Ratio Model" Mathematics 12, no. 13: 2118. https://doi.org/10.3390/math12132118
APA StyleLiu, S., Tian, Q., Liu, Y., & Li, P. (2024). Joint Statistical Inference for the Area under the ROC Curve and Youden Index under a Density Ratio Model. Mathematics, 12(13), 2118. https://doi.org/10.3390/math12132118