Modernising Receiver Operating Characteristic (ROC) Curves †
Abstract
:1. Introduction
Article Organistion
2. Related Work
3. Materials and Methods
3.1. Traditional ROC Curves
3.2. Measurement System Analysis (MSA)
3.3. Item Response Theory (IRT) and the Rasch Model
3.4. Modernising ROC
3.4.1. Linacre Linearisation
3.4.2. Linearised ROC Curves Adopting the Rasch Model
- (i)
- (ii)
- and calculated from the cumulative rates for TPR and FPR (Figure A1) summed vertically for these columns over the range of target value, j, for each device i, according to the ROC expressions. An assessment of these two alternative scorings in producing the most relevant results is one main aim of the current work.
4. Results and Discussions
4.1. Logistic Regressions
4.2. Rasch Parameters Versus Assigned Value, x
4.3. Evaluating Device Parameters
4.4. Theoretical Curves
4.5. Assessing Relative Performance of Devices
4.6. Measurement Uncertainties
4.7. Limitations
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Appendix A. ROC, Item Response Theory, Control Charts, Rankits and Other Measures
Appendix A.1. ROC in the Presence of Counted-Fractions Ordinality
- The Youden index, which is a measure of the vertical distance between the 45° line (Section 3.1) and the corresponding point on the ROC curve, as given by the expression . A better diagnosis has a higher Youden index value [23]
- Sensitivity + Specificity, which is a simple addition where a better diagnosis has a higher index value.
- Distance to corner, a better diagnosis which has a smaller distance, d, to the top-left corner of the ROC curve for each cut-off value,
Appendix A.2. Rasch, MSA and Specific Objectivity
Appendix A.3. Ability and Task Difficulty in Control Charts
Appendix B. Explaining Curve Parameters
- The straight line intercepts the vertical axis, where x = 0 (and ), at a place which shifts with the ability, , of each device;
- The straight line crosses the horizontal (x-) axis at z = 0, where and . The different values of SL obtained in this fashion match the different intercepts on the vertical axis, that are due to the varying device ability.
References
- Pendrill, L.R. Using Measurement Uncertainty in Decision-Making and Conformity Assessment. Metrologia 2014, 51, S206–S218. [Google Scholar] [CrossRef]
- Hibbert, D.B. Evaluation of Measurement Data: The Role of Measurement Uncertainty in Conformity Assessment. Chem. Int.—Newsmag. IUPAC 2013, 35, 22–23. [Google Scholar] [CrossRef]
- Peterson, W.; Birdsall, T.; Fox, W. The Theory of Signal Detectability. Trans. IRE Prof. Group Inf. Theory 1954, 4, 171–212. [Google Scholar] [CrossRef]
- Pendrill, L. Quality Assured Measurement: Unification across Social and Physical Sciences; Springer Series in Measurement Science and Technology; Springer International Publishing: Berlin/Heidelberg, Germany, 2019; ISBN 978-3-030-28694-1. [Google Scholar]
- Mosteller, F.; Tukey, J. Data Analysis and Regression: A Second Course in Statistics, 1st ed.; Pearson: Reading, MA, USA, 1977; ISBN 978-0-201-04854-4. [Google Scholar]
- Koczula, K.M.; Gallotta, A. Lateral Flow Assays. Essays Biochem. 2016, 60, 111–120. [Google Scholar] [CrossRef] [PubMed]
- Petersen, P.H.; Christensen, N.G.; Sandberg, S.; Nordin, G.; Pedersen, M. How to Deal with Semi-Quantitative Tests? Application of an Ordinal Scale Model to Measurements of Urine Glucose. Scand. J. Clin. Lab. Investig. 2009, 69, 662–672. [Google Scholar] [CrossRef] [PubMed]
- Somborac Bačura, A.; Dorotić, M.; Grošić, L.; Džimbeg, M.; Dodig, S. Current Status of the Lateral Flow Immunoassay for the Detection of SARS-CoV-2 in Nasopharyngeal Swabs. Biochem. Med. 2021, 31, 020601. [Google Scholar] [CrossRef] [PubMed]
- Majnik, M.; Bosnić, Z. ROC Analysis of Classifiers in Machine Learning: A Survey. Intell. Data Anal. 2013, 17, 531–558. [Google Scholar] [CrossRef]
- Krouwer, J.S. Cumulative Distribution Analysis Graphs—An Alternative to ROC Curves. Clin. Chem. 1987, 33, 2305–2306. [Google Scholar] [CrossRef] [PubMed]
- Birdsall, T. The Theory of Signal Detectability: ROC Curves and Their Character; University of Michigan Library: Ann Arbor, MI, USA, 1973. [Google Scholar]
- Linacre, J. Evaluating a Screening Test. Rasch Meas. Trans. 1994, 7, 317–318. [Google Scholar]
- Cipriani, D.; Fox, C.; Khuder, S.; Boudreau, N. Comparing Rasch Analyses Probability Estimates to Sensitivity, Specificity and Likelihood Ratios When Examining the Utility of Medical Diagnostic Tests. J. Appl. Meas. 2005, 6, 180–201. [Google Scholar] [PubMed]
- Fisher, W.P., Jr.; Burton, E. Embedding Measurement within Existing Computerized Data Systems: Scaling Clinical Laboratory and Medical Records Heart Failure Data to Predict ICU Admission. J. Appl. Meas. 2010, 11, 271–287. [Google Scholar] [PubMed]
- Wright, B.D.; Perkins, K.; Dorsey, K. Multiple Regression via Measurement. Rasch Meas. Trans. 2000, 14, 729. [Google Scholar]
- Schlattmann, P. Tutorial: Statistical Methods for the Meta-Analysis of Diagnostic Test Accuracy Studies. Clin. Chem. Lab. Med. 2023, 61, 777–794. [Google Scholar] [CrossRef] [PubMed]
- E11 Committee. Guide for Measurement Systems Analysis (MSA); ASTM International: West Conshohocken, PA, USA, 2022. [Google Scholar]
- Rasch, G. Studies in Mathematical Psychology: I. Probabilistic Models for Some Intelligence and Attainment Tests; Nielsen & Lydiche: Oxford, UK, 1960. [Google Scholar]
- Bashkansky, E.; Turetsky, V. Ability Evaluation by Binary Tests: Problems, Challenges & Recent Advances. J. Phys. Conf. Ser. 2016, 772, 012012. [Google Scholar] [CrossRef]
- Montgomery, D.C. Introduction to Statistical Quality Control, 3rd ed.; Wiley: New York, NY, USA, 1996; ISBN 978-0-471-30353-4. [Google Scholar]
- Statology. Zach How to Create a ROC Curve in Excel (Step-by-Step). 2021. Available online: https://www.statology.org/roc-curve-excel/ (accessed on 10 May 2023).
- Safari, S.; Baratloo, A.; Elfil, M.; Negida, A. Evidence Based Emergency Medicine; Part 5 Receiver Operating Characteristic Curve and Area under the Curve. Emergency 2016, 4, 111. [Google Scholar] [PubMed]
- Youden, W.J. Index for Rating Diagnostic Tests. Cancer 1950, 3, 32–35. [Google Scholar] [CrossRef] [PubMed]
- Pendrill, L.R.; Melin, J. Assuring Measurement Quality in Person-Centered Care. In Person-Centered Outcome Metrology: Principles and Applications for High Stakes Decision Making; Fisher, W.P., Jr., William, P., Cano, S.J., Eds.; Springer Series in Measurement Science and Technology; Springer International Publishing: Cham, Switzerland, 2023; pp. 311–355. ISBN 978-3-031-07465-3. [Google Scholar]
- Rasch, G. On General Laws and the Meaning of Measurement. In Psychology, Proceedings of the Fourth Berkeley Symposium on Mathematical Statistics and Probability, Volume 4: Contributions to Biology and Problems of Medicine, Berkeley, CA, USA, 20 June–30 July 1960; University of California Press: Oakland, CA, USA, 1961; pp. 321–333. [Google Scholar]
- Wind, S.A. Examining Rating Scales Using Rasch and Mokken Models for Rater-Mediated Assessments. J. Appl. Meas. 2014, 15, 100–132. [Google Scholar] [PubMed]
Device Type | Assigned Value hCG (IU/L) for the Distributed Materials | Number of Negative Results | Number of Positive Results | TPR (Sensitivity) | FPR (Fall-Out) |
---|---|---|---|---|---|
Device 1 | 6 | 360 | 91 | 1.00 | 1.00 |
Device 2 | 6 | 304 | 41 | 1.00 | 1.00 |
Device 3 | 6 | 248 | 16 | 1.00 | 1.00 |
Device 4 | 6 | 69 | 8 | 1.00 | 1.00 |
Device 5 | 6 | 62 | 4 | 1.00 | 1.00 |
Device 5 | 12 | 28 | 2 | 0.99 | 0.88 |
Device 5 | 24 | 15 | 38 | 0.83 | 0.44 |
Device 5 | 31 | 36 | 121 | 0.57 | 0.20 |
Device 5 | 51 | 11 | 183 | 0.24 | 0.02 |
Device Type | M | C |
---|---|---|
Device 1 | 0.84 | 2.23 |
Device 2 | 0.86 | 2.59 |
Device 3 | 0.87 | 2.35 |
Device 4 | 0.95 | 1.75 |
Device 5 | 0.91 | 2.05 |
TPR | FPR | |||||||||
---|---|---|---|---|---|---|---|---|---|---|
Device Type | m | c | SL | u | m | c | SL | u * | ||
Device 1 | −0.85 (14) | 0.16 | −5.80 | 35.45 | 6.11 | −3.73 (16) | −0.29 | 5.30 | 18.43 | −3.473 |
Device 2 | −0.87 (12) | 0.16 | −5.92 | 36.18 | 6.11 | −4.48 (18) | −0.29 | 4.53 | 15.75 | −3.47 |
Device 3 | −0.84 (14) | 0.16 | −5.89 | 36.00 | 6.11 | −3.67 (18) | −0.29 | 5.49 | 19.07 | −3.47 |
Device 4 | −1.08 (12) | 0.16 | −6.13 | 37.46 | 6.11 | −1.62 (12) | −0.30 | 6.59 | 22.21 | −3.47 |
Device 5 | −1.17 (12) | 0.16 | −6.22 | 38.01 | 6.11 | −1.54 (14) | −0.29 | 7.49 | 26.03 | −3.47 |
M | C | |||
---|---|---|---|---|
Device Type | Experiment | Theory | Experiment | Theory |
Device 1 | −0.50 | −0.57 | −2.61 | −2.78 |
Device 2 | −0.50 | −0.57 | −3.02 | −3.34 |
Device 3 | −0.50 | −0.57 | −2.58 | −2.77 |
Device 4 | −0.50 | −0.55 | −1.79 | −2.50 |
Device 5 | −0.50 | −0.56 | −1.84 | −1.96 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Pendrill, L.R.; Melin, J.; Stavelin, A.; Nordin, G. Modernising Receiver Operating Characteristic (ROC) Curves. Algorithms 2023, 16, 253. https://doi.org/10.3390/a16050253
Pendrill LR, Melin J, Stavelin A, Nordin G. Modernising Receiver Operating Characteristic (ROC) Curves. Algorithms. 2023; 16(5):253. https://doi.org/10.3390/a16050253
Chicago/Turabian StylePendrill, Leslie R., Jeanette Melin, Anne Stavelin, and Gunnar Nordin. 2023. "Modernising Receiver Operating Characteristic (ROC) Curves" Algorithms 16, no. 5: 253. https://doi.org/10.3390/a16050253
APA StylePendrill, L. R., Melin, J., Stavelin, A., & Nordin, G. (2023). Modernising Receiver Operating Characteristic (ROC) Curves. Algorithms, 16(5), 253. https://doi.org/10.3390/a16050253