Next Article in Journal
Self-Supervised Representation Learning for Data-Efficient DRIL Classification in OCT Images
Previous Article in Journal
Reliability of Large Language Model-Based Artificial Intelligence in AIS Assessment: Lenke Classification and Fusion-Level Suggestion
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Residual Safety Margin-Based Risk Stratification for Hospital-Wide POCT Glucose Meters Anchored to ISO 15197: Moving Beyond Pass-Fail

Department of Laboratory Medicine, Maternal and Child Health Hospital of Hubei Province, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430070, China
*
Author to whom correspondence should be addressed.
Diagnostics 2025, 15(24), 3220; https://doi.org/10.3390/diagnostics15243220
Submission received: 29 October 2025 / Revised: 11 December 2025 / Accepted: 11 December 2025 / Published: 16 December 2025
(This article belongs to the Section Point-of-Care Diagnostics and Devices)

Abstract

Background: In this hospital-wide evaluation of point-of-care testing (POCT) glucose meters, we introduced a residual safety margin (r) anchored to ISO 15197:2013 thresholds to quantify tolerance, move beyond binary pass/fail assessments, and enable risk stratification. Methods: Thirty-five departmental glucose meters were compared with a central laboratory reference at five predefined glucose concentrations. Compliance was assessed using ISO 15197:2013 point-wise limits, Bland–Altman analysis was used to estimate bias and limits of agreement, and the mean absolute relative difference (MARD) and root mean square error (RMSE) were calculated to summarize overall error. At each concentration, r was calculated for every department, ranked, and classified into low, medium, or high risk using allowable error thresholds based on biological variation, specifically total allowable error (TEa), mapped to the ISO limits. Results: All departments met ISO criteria (100% compliance; 95% CI: 97.9–100%). Mean bias was −1.43 mg/dL, with limits of agreement from −15.6 to 12.8 mg/dL; MARD was 3.8% (95% CI: 3.4–4.3%), and RMSE was 7.4 mg/dL (95% CI: 6.6–8.2 mg/dL). Despite universal compliance, r-based analysis revealed concentration-related heterogeneity and highlighted borderline-performing departments that were overlooked by conventional metrics. Conclusions: By anchoring residual safety margins to ISO thresholds, the r framework shifts POCT glucose assessment from a binary pass/fail decision to a risk-stratified ranking approach, exposing latent performance variation and supporting targeted quality improvement at the hospital level.

1. Introduction

Point-of-care testing (POCT) for blood glucose is widely adopted for clinical glucose monitoring and control, from emergencies to routine ward care, due to its minimal blood volume requirement and rapid turnaround time [1]. However, POCT glucose systems are susceptible to analytic and pre-analytic errors, so robust quality assurance is essential [2]. In hospitalized patients, POCT results often guide insulin dosing and other critical decisions, and even small deviations may have important clinical consequences [3]. In hospital wide comparison studies, POCT performance is commonly assessed by analyzing five whole blood samples at predefined glucose concentrations, comparing the results with a laboratory reference, and evaluating compliance with ISO 15197:2013 criteria, which defines allowable limits of ±15 mg/dL for glucose values below 100 mg/dL and ±15% for values at or above 100 mg/dL [4].
However, this compliance framework typically yields only a binary outcome—pass or fail—that confirms whether minimum standards are met but conceals important nuances in performance relative to the thresholds. Some departments operate well within the limits, while others approach the boundary, where minor fluctuations may result in non-compliance [5]. In such scenarios, binary pass/fail evaluations provide insufficient information for hospital-level risk stratification or quality governance.
Traditional agreement methods, including correlation analysis, Bland–Altman (BA) plots, and Passing–Bablok or Deming regression, support method comparison but offer limited utility for hospital-wide, cross-departmental risk stratification. Correlation summarizes global linear association but may overstate agreement in the presence of systematic bias. BA visualizes bias and variability, yet does not assess compliance with ISO-defined acceptability limits [6,7]. Aggregate error metrics, such as the mean absolute relative difference (MARD) and root mean square error (RMSE), are informative but cannot replace point-specific ISO evaluations [8]. While Passing–Bablok regression can reveal department-specific systematic bias (intercept/slope), it does not directly assess compliance and therefore lacks sufficiency for risk-based quality management [9].
To address the above limitations, the ISO anchored residual safety margin r was introduced as a conceptual framework to quantify the remaining safety buffer between observed values and allowable error. Rather than yielding a binary classification of “pass” or “fail”, r enables concentration-specific, department-level risk stratification and near-threshold detection that can be acted upon within a quality-governance loop and is congruent with the risk-based emphasis in ISO 15189 revisions [10,11,12]. In this hospital-wide POCT evaluation, the r framework was applied to five predefined glucose concentrations, allowing stratification of residual risk across departments at each level. This method enabled clearer prioritization of quality improvement efforts, supported data-driven decisions, and facilitated continuous monitoring and periodic reassessment.

2. Materials and Methods

2.1. Design and Setting

This study was conducted at a large tertiary Maternal and Child Health hospital and included 35 clinical departments, each with one POCT glucose meter. Devices were from three different brands, and for each brand, one lot number was used throughout the hospital to minimize inter-lot variability. EDTA-anticoagulated whole blood was used, with five predefined reference glucose levels: 41, 82, 123, 227, and 291 mg/dL. Each device performed triplicate measurements at each level, and the mean was compared with the central laboratory reference (Beckman Coulter AU5800; glucose oxidase) [4]. Detailed information is summarized in Table 1 and Table S1.

2.2. Hospital-Wide POCT Comparison Quality Assurance

All POCT in the hospital-wide comparison followed a unified institutional quality assurance (QA) system. Each device had completed method verification; operators were certified after standardized training and competency checks. On the study day, test strips and quality-control (QC) materials were in date and handled per the instructions for use (IFU) under specified environmental conditions. Recent external quality assessment (EQA) met program criteria, and two levels of internal quality control (IQC) were within range. Sample-stability requirements and exclusion of invalid specimens followed ISO 15197:2013.

2.3. Endpoints and Calculations

Primary analytical endpoints included: ISO 15197:2013 per-point acceptability [4], BA mean bias, and 95% limits of agreement (LOA), MAR, and RMSE [7,8]. Statistical methods follow standard references.
r was calculated as a new metric: r = max {0, 1–use}, where use = |bias|/ISO limit. Here, bias is defined for each department at each reference level as the difference between the mean of three POCT replicates and the paired reference value: Bias (mg/dL) = mean (POCT1, POCT2, POCT3) − Reference (mg/dL). ISO 15197:2013 acceptance limits are 15 mg/dL when the reference value is <100 mg/dL and 0.15 × the reference value when it is ≥100 mg/dL. Because r is calculated separately at each reference level using these absolute (<100 mg/dL) and relative (≥100 mg/dL) ISO limits, it does not rely on an assumption of constant variance across the measurement range.
At five reference levels, department-level r values were calculated and ranked. Two threshold lines were defined under the EFLM/BV allowable-error framework: total allowable error (TEa) (≤54 mg/dL: 3.78 mg/dL; >54 mg/dL: 7% of the reference) and half-TEa (≤54 mg/dL: 1.89 mg/dL; >54 mg/dL: 3.5% of the reference), consistent with management approaches based on allowable error and sigma-metrics [13,14]. Using these cutoffs, departments at each level were classified as high risk (r < r*(TEa)), medium risk (r*(TEa) ≤ r < r*(1/2TEa)), or low risk (r ≥ (r*1/2)TEa). Here, r*(TEa) = 1 − TEa limit/ISO limit and r*(1/2TEa) = 1 − half-TEa limit/ISO limit.

2.4. Statistical Analysis

All statistical analyses and visualizations were performed using R (version 4.5.1). ISO 15197:2013 per-point compliance was estimated with two-sided Wilson 95% confidence intervals. Confidence intervals for BA limits of agreement, MARD, and RMSE were calculated using a two-sided percentile bootstrap approach (2000 resamples); when observations were nested within departments, a clustered bootstrap by department was used [15].

3. Results

3.1. Overall Agreement and Accuracy Assessment

A total of 175 paired measurements were analyzed across all departments. The per-point compliance with ISO 15197:2013 was 100.0% (95% CI: 97.9–100.0%). BA analysis showed a mean bias of −1.43 mg/dL, with 95% LoA from −15.6 to 12.8 mg/dL. Overall, MARD was 3.8% (95% CI: 3.4–4.3%), and the RMSE was 7.4 mg/dL (95% CI: 6.6–8.2 mg/dL) (Table 2 and Figure 1a). Brand-level estimates are provided in Supplementary Table S2. As shown in Figure 1b, differences clustered around the BA mean line (−1.43 mg/dL). Dispersion increased at higher concentrations (227 and 291 mg/dL), and at 291 mg/dL, biases shifted more negative with a few points near or slightly beyond the lower LoA. These patterns indicate greater error at higher glucose levels, although all observations remained within ISO per-point acceptability limits.

3.2. Performance and Agreement Across Glucose Concentration Levels

Across the five reference glucose levels (41, 82, 123, 227, and 291 mg/dL), BA analysis by level (see Figure 2 and Supplementary Table S3) showed a slight positive bias at 41 mg/dL, similarly mild positive bias at 82 and 123 mg/dL, near-zero mean bias at 227 mg/dL (95% CI crossing zero), and a more pronounced negative bias at 291 mg/dL (Figure 2b). Per-point ISO 15197:2013 acceptability was 100% at every level (Figure 2a). MARD followed the expected U-shape: lowest at 82–123 mg/dL (~2–3%) and higher at the extremes (~6–8% at 41 mg/dL; ~3–5% at 291 mg/dL; Figure 2c). RMSE (absolute error, mg/dL) increased with concentration (Figure 2d). In sum, all levels met ISO criteria; absolute and relative errors were smallest in the low-to-mid range, while absolute error rose at higher concentrations, but relative error remained acceptable.

3.3. Department-Level Risk Stratification by ISO-Anchored r with TEa-Referenced Bands

Across the five reference levels, concentration dependence and inter-department variability in r were evident. Pediatric Neurology was classified as low at most levels, whereas the Emergency Department more often fell into medium/high (Figure 3a). At the low reference levels (41 and 82 mg/dL), the ISO-anchored r heat map (Figure 3a) flagged several departments as medium or high risk with only small residual safety margins near the TEa-based thresholds, even though the pooled Bland–Altman plot (Figure 1b) suggested overall acceptable agreement. Departmental rankings at each level are shown in Figure 3b–f. Department-level r values and per-level ranks are provided in Supplementary Table S4a–e.

4. Discussion

In this single-center evaluation of 35 POCT glucose meters across five predefined levels, all devices met the per-point ISO 15197:2013 acceptance criteria under standardized conditions, indicating acceptable system accuracy. With concentration stratification, RMSE increased with glucose level, whereas MARD was lowest in the mid-range and slightly higher at both ends [8]. At higher concentrations, a shift toward more negative bias and wider dispersion was observed [5,16]. Taken together, these findings suggest that, when pre-analytical steps are controlled and staff are trained, POCT glucose testing can approach central-laboratory accuracy across departments [17,18].
Traditional tools such as Bland–Altman plots and Passing–Bablok regression remain essential for method comparison but are limited for cross-department governance: they use pooled data, do not map directly to ISO point-wise limits, and give little weight to near-threshold risk [4,6,19]. To address this, we introduced an ISO-anchored residual safety margin r, calculated separately at five reference levels. This level-specific r is directly tied to ISO 15197 limits, separates low/high from mid-range performance, is more sensitive to near-limit risk, allows cross-department ranking at each level, and, when combined with TEa and 1/2TEa, maps results into actionable red/yellow/green risk bands, while the continuous r ∈ [0, 1] scale supports threshold setting and trend tracking [13,20]. At low reference levels (41 and 82 mg/dL), for example, the pooled Bland–Altman plot (Figure 1b) still suggests good overall agreement, whereas the ISO-anchored r heat map (Figure 3a) flags several departments as medium or high risk with only narrow residual safety margins to the TEa-based thresholds, revealing level-specific critical risk that is not apparent from the pooled plot alone. At higher concentrations, dispersion increases and bias shifts more negative, yet because r is defined separately at each reference level and anchored to the absolute (<100 mg/dL) and relative (≥100 mg/dL) ISO limits, its clinical interpretation remains straightforward across the range, and the per-level heatmaps and rankings further facilitate communication and implementation in hospital quality-management practice.
Applied to the hospital-wide dataset, r-based stratification revealed heterogeneity in residual safety margins despite overall ISO compliance, consistent with prior multi-system evaluations showing substantial between-device variability even when ISO criteria are used as the yardstick [5]. From the heatmap and the risk-stratification plots at the five reference levels, the mid-range was predominantly low, whereas medium/high occurred more often at the extremes (41 and 291 mg/dL), in line with reports that absolute error tends to rise with concentration, while MARD is lowest around the mid-range [8]. Mechanistically, low-level anomalies are compatible with zero-point offset/limited sensitivity/background drift and matrix effects, all known contributors to bias at low glucose [21]. In contrast, high-level deviations can reflect amplified proportional bias and upper-range nonlinearity/saturation, and performance can deteriorate under temperature/humidity stress, which increases dispersion—practical concerns for ward-based POCT [22]. Occasional mid-range departures are consistent with slope–intercept mismatch on method comparison, which is routinely characterized via Passing–Bablok/Deming frameworks [23].
Accordingly, remediation should be level-specific and device-focused: at low levels, prioritize zero-point checks/adjustment and denser checks near the lower end, supported by strengthened IQC; at mid-levels, perform traceability checks and recalibrate slope and intercept (with confidence intervals verified by method-comparison regression); and at high levels, consider segmented or nonlinear calibration, add denser upper-range checks, and verify environmental robustness (temperature/power stability), escalating to vendor module evaluation when needed. These steps are consistent with contemporary guidance on quality indicators/sigma-metrics for continuous improvement in laboratory medicine [12,13], and with recent POCT-specific frameworks that use process mapping and risk-based selection of quality indicators for glucose point-of-care testing [24,25], as well as with evidence that comparison-method choice influences apparent accuracy, hence the need for traceability verification [12].
For hospital-level governance, POCT comparisons should go beyond a binary pass/fail and proceed in four layers: first, apply ISO single-point acceptability as the compliance gate; second, conduct performance evaluation using BA (mean bias and limits of agreement), PB regression (slope/intercept with confidence intervals), MARD, and RMSE, followed by concentration-stratified review that assesses absolute-error impact at low levels and dispersion plus negative-bias risk at high levels to locate the problematic ranges; third, implement r-based departmental banding, using green/yellow/red to quantify distance to the ISO boundary and enable within-level comparability and risk stratification; finally, for high-risk devices, define concentration-specific corrective actions and verify their effectiveness to complete a closed improvement loop. Within this framework, pooled BA, PB regression, MARD, and RMSE describe overall system performance, while r provides an ISO-anchored, department-level risk banding for governance.
This study has limitations: single center, conducted under controlled workflows, and used a single lot, which may underestimate real-world variability. Follow-up work will apply this ISO-anchored framework across departments, lots, and time within department-specific representative concentration ranges, re-evaluate the discriminative performance of r, and longitudinally assess its impact on IQC and external quality assessment/proficiency testing (EQA/PT) performance. In this initial implementation, brand-level results (including Ascensia, n = 10) are reported as descriptive only; future applications that specifically focus on between-brand comparisons will predefine simple minimum sample-size and confidence-interval-width criteria before brand rankings are used to inform governance decisions.
In summary, an ISO-anchored residual safety margin r, used within a TEa-referenced framework, enables an assessment that goes beyond pass/fail. Even when ISO criteria are met, safety margins can be stratified and ranked to expose near-threshold risks and heterogeneity (by concentration, department, and device), turning “compliant” into a comparable risk gradient with clear priorities. This approach uses appropriate statistical characterization (e.g., r-based risk banding) to identify potential issues in advance and to guide precision quality control and continuous improvement.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/diagnostics15243220/s1: Table S1: Distribution of participating departments by POCT glucose meter brand; Table S2: Brand-level accuracy and agreement with the laboratory reference (mg/dL); Table S3: Per-level analytical performance of POCT glucose versus the laboratory reference; Table S4a–e: Department-level r-based performance rankings at five reference levels.

Author Contributions

Conceptualization, H.B. and Q.Y.; methodology, H.B. and Q.Y.; data curation and analysis, H.B., Y.C., Y.W., Z.S. and J.X.; writing—original draft preparation, H.B.; writing—review and editing, Q.Y.; visualization, H.B.; supervision, Q.Y.; project administration, Q.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki. The Institutional Ethics Committee of the Maternal and Child Health Hospital of Hubei Province reviewed the protocol and determined the study to be exempt from full ethics review as a secondary analysis of de-identified residual data from routine hospital POCT glucose testing; therefore, written informed consent was waived. No approval number is issued for exempt determinations under the institutional policy.

Informed Consent Statement

Patient consent was waived because the study involved only a secondary analysis of de-identified residual data generated during routine point-of-care glucose testing, with no patient intervention or identifiable information. Consent for Publication: Not applicable; no identifiable patient information is included in this manuscript, and no images or case details enabling identification are presented.

Data Availability Statement

De-identified data supporting the findings of this study are available from the corresponding author upon reasonable request.

Acknowledgments

The authors sincerely thank the Medical Affairs Department of the Hubei Maternal and Child Health Hospital for their organization and coordination of this hospital-wide POCT glucose comparison study.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
BABland–Altman
BVBiological variation
CIConfidence interval
CVCoefficient of variation
EFLMEuropean Federation of Clinical Chemistry and Laboratory Medicine
EQAExternal quality assessment
FADFlavin adenine dinucleotide
GDH-FADFAD-dependent glucose dehydrogenase
GODGlucose oxidase
IFUInstructions for use
IQCInternal quality control
ISOInternational Organization for Standardization
LoALimits of agreement
MARDMean absolute relative difference
Mut. Q-GDHMutant quinoprotein glucose dehydrogenase
POCTPoint-of-care testing
QCQuality control
QAQuality assurance
RMSERoot mean square error
SDStandard deviation
TEaTotal allowable error

References

  1. Luan, X.; Ke, L.; Feng, M.; Peng, W.; Luo, H.; Xue, H.; Xia, Y. Risk management in POCT blood glucose monitoring: FMEA approach aligned with ISO 15189:2022. PLoS ONE 2025, 20, e0319817. [Google Scholar] [CrossRef] [PubMed]
  2. Rebel, A.; Rice, M.A.; Fahy, B.G. Accuracy of point-of-care glucose measurements. J. Diabetes Sci. Technol. 2012, 6, 396–411. [Google Scholar] [CrossRef] [PubMed]
  3. Huysal, K.; Budak, Y.U.; Demirci, H.; Önelge, M.A. Evaluation of CareSens POCT Devices for Glucose Testing in the Routine Hospital Setting. J. Clin. Diagn. Res. 2015, 9, Bc04-7. [Google Scholar] [CrossRef] [PubMed]
  4. ISO 15197:2013; In Vitro Diagnostic Test Systems—Requirements for Blood-Glucose Monitoring Systems for Self-Testing in Managing Diabetes Mellitus. International Organization for Standardization: Geneva, Switzerland, 2013.
  5. Pleus, S.; Baumstark, A.; Jendrike, N.; Mende, J.; Link, M.; Zschornack, E.; Haug, C.; Freckmann, G. System accuracy evaluation of 18 CE-marked current-generation blood glucose monitoring systems based on EN ISO 15197:2015. BMJ Open Diabetes Res. Care 2020, 8, e001067. [Google Scholar] [CrossRef]
  6. Giavarina, D. Understanding Bland Altman analysis. Biochem. Med. 2015, 25, 141–151. [Google Scholar] [CrossRef]
  7. Bland, J.M.; Altman, D.G. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet 1986, 1, 307–310. [Google Scholar] [CrossRef]
  8. Freckmann, G.; Mende, J.; Pleus, S.; Waldenmaier, D.; Baumstark, A.; Jendrike, N.; Haug, C. Mean Absolute Relative Difference of Blood Glucose Monitoring Systems and Relationship to ISO 15197. J. Diabetes Sci. Technol. 2022, 16, 1089–1095. [Google Scholar] [CrossRef]
  9. Bilić-Zulle, L. Comparison of methods: Passing and Bablok regression. Biochem. Med. 2011, 21, 49–52. [Google Scholar] [CrossRef]
  10. De la Villa Porras, I. Novelties in the ISO 15189:2023 standard. Adv. Lab. Med. 2023, 4, 337–338. [Google Scholar] [CrossRef]
  11. Linko, S.; Boursier, G.; Bernabeu-Andreu, F.A.; Dzneladze, N.; Vanstapel, F.; Brguljan, P.M.; Tosheska-Trajkovska, K.; Mehay, H.; Panteghini, M.; Brugnoni, D.; et al. EN ISO 15189 revision: EFLM Committee Accreditation and ISO/CEN standards (C: A/ISO) analysis and general remarks on the changes. Clin. Chem. Lab. Med. 2025, 63, 1084–1098. [Google Scholar] [CrossRef]
  12. Plebani, M.; Sciacovelli, L.; Aita, A. Quality Indicators for the Total Testing Process. Clin. Lab. Med. 2017, 37, 187–205. [Google Scholar] [CrossRef]
  13. Bayat, H.; Westgard, S.A.; Westgard, J.O. The value of Sigma-metrics in laboratory medicine. Clin. Chem. Lab. Med. 2024, 62, 2401–2404. [Google Scholar] [CrossRef]
  14. WS/T 403—2024; Analytical Performance Standard for Routine Analytes in Clinical Chemistry. National Health Commission of the People’s Republic of China: Beijing, China, 2024. Available online: https://www.nhc.gov.cn/fzs/c100048/202407/758b8a012f4748c9b23ee48d22d0a79a/files/1733821991150_35767.pdf (accessed on 9 May 2024).
  15. Carpenter, J.; Bithell, J. Bootstrap confidence intervals: When, which, what? A practical guide for medical statisticians. Stat. Med. 2000, 19, 1141–1164. [Google Scholar] [CrossRef]
  16. Jendrike, N.; Baumstark, A.; Kamecke, U.; Haug, C.; Freckmann, G. ISO 15197: 2013 Evaluation of a Blood Glucose Monitoring System’s Measurement Accuracy. J. Diabetes Sci. Technol. 2017, 11, 1275–1276. [Google Scholar] [CrossRef] [PubMed]
  17. Oliver, P.; Fernandez-Calle, P.; Buno, A. POCT Accreditation ISO 15189 and ISO 22870: Making the Point. Ejifcc 2021, 32, 131–139. [Google Scholar] [PubMed]
  18. Khan, A.I.; Pratumvinit, B.; Jacobs, E.; Kost, G.J.; Kary, H.; Balla, J.; Shaw, J.; Milevoj Kopcinovic, L.; Vaubourdolle, M.; Oliver, P.; et al. Point-of-care testing performed by healthcare professionals outside the hospital setting: Consensus based recommendations from the IFCC Committee on Point-of-Care Testing (IFCC C-POCT). Clin. Chem. Lab. Med. 2023, 61, 1572–1579. [Google Scholar] [CrossRef]
  19. Krouwer, J.S. The Danger of Using Total Error Models to Compare Glucose Meter Performance. J. Diabetes Sci. Technol. 2014, 8, 419–421. [Google Scholar] [CrossRef]
  20. Bowman, C.F.; Nichols, J.H. Comparison of Accuracy Guidelines for Hospital Glucose Meters. J. Diabetes Sci. Technol. 2020, 14, 546–552. [Google Scholar] [CrossRef]
  21. Tonyushkina, K.; Nichols, J.H. Glucose meters: A review of technical challenges to obtaining accurate results. J. Diabetes Sci. Technol. 2009, 3, 971–980. [Google Scholar] [CrossRef]
  22. Lam, M.; Louie, R.F.; Curtis, C.M.; Ferguson, W.J.; Vy, J.H.; Truong, A.T.; Sumner, S.L.; Kost, G.J. Short-Term Thermal-Humidity Shock Affects Point-of-Care Glucose Testing: Implications for Health Professionals and Patients. J. Diabetes Sci. Technol. 2014, 8, 83–88. [Google Scholar] [CrossRef]
  23. Lee, S. Agreement Evaluation in Statistical Analyses: Misconceptions and Key Features. Ann. Lab. Med. 2025, 45, 276–278. [Google Scholar] [CrossRef]
  24. Shaw, J.L.V.; Arnoldo, S.; Beach, L.; Bouhtiauy, I.; Brinc, D.; Brun, M.; Collier, C.; Kostantin, E.; Fung, A.W.S.; Füzéry, A.K.; et al. Establishing quality indicators for point of care glucose testing: Recommendations from the Canadian Society for Clinical Chemists Point of Care Testing and Quality Indicators Special Interest Groups. Clin. Chem. Lab. Med. 2023, 61, 1280–1287. [Google Scholar] [CrossRef]
  25. Shaw, J.L.V.; Arnoldo, S.; Bouhtiany, I.; Brinc, D.; Brun, M.; Collier, C.; Fuezery, A.; Fung, A.W.S.; Huang, Y.; Kaur, S.; et al. Recommendations for the integration of standardized quality indicators for glucose point-of-care testing. Clin. Chem. Lab. Med. 2025, 63, 1965–1973. [Google Scholar] [CrossRef]
Figure 1. Agreement of POCT glucose with the laboratory reference across 175 paired measurements. (a) Per-point acceptability relative to ISO 15197:2013 allowable-error limits (reference < 100 mg/dL: ±15 mg/dL; ≥100 mg/dL: ±15%). Differences are plotted against the reference value; the dashed lines represent perfect agreement (y = x), and solid lines indicate the ISO 15197:2013 allowable-error limits. (b) BA plot showing the mean bias (−1.43 mg/dL) and 95% limits of agreement (LoA) (−15.6 to 12.8 mg/dL). The solid line represents the mean bias, and the dashed lines indicate the 95% LoA. Abbreviations: POCT, point-of-care testing.
Figure 1. Agreement of POCT glucose with the laboratory reference across 175 paired measurements. (a) Per-point acceptability relative to ISO 15197:2013 allowable-error limits (reference < 100 mg/dL: ±15 mg/dL; ≥100 mg/dL: ±15%). Differences are plotted against the reference value; the dashed lines represent perfect agreement (y = x), and solid lines indicate the ISO 15197:2013 allowable-error limits. (b) BA plot showing the mean bias (−1.43 mg/dL) and 95% limits of agreement (LoA) (−15.6 to 12.8 mg/dL). The solid line represents the mean bias, and the dashed lines indicate the 95% LoA. Abbreviations: POCT, point-of-care testing.
Diagnostics 15 03220 g001
Figure 2. Concentration-stratified analytical performance at five predefined reference glucose levels. (a) ISO 15197:2013 per-point acceptability (Wilson 95% CIs). (b) Bland–Altman mean bias (POCT − reference) by level with percentile-bootstrap 95% CIs (B = 2000). (c) MARD (%) by level with percentile-bootstrap 95% CIs (B = 2000). (d) RMSE (mg/dL) by level with percentile-bootstrap 95% CIs (B = 2000).
Figure 2. Concentration-stratified analytical performance at five predefined reference glucose levels. (a) ISO 15197:2013 per-point acceptability (Wilson 95% CIs). (b) Bland–Altman mean bias (POCT − reference) by level with percentile-bootstrap 95% CIs (B = 2000). (c) MARD (%) by level with percentile-bootstrap 95% CIs (B = 2000). (d) RMSE (mg/dL) by level with percentile-bootstrap 95% CIs (B = 2000).
Diagnostics 15 03220 g002
Figure 3. Department-level risk stratification by ISO-anchored residual r. (a) Heatmap of department-level bands across five reference glucose levels (41, 82, 123, 227, and 291 mg/dL). Colors denote risk bands: high (red, r < r*(TEa)), medium (yellow, r*(TEa) ≤ r < r*(1/2TEa)), and low (green, rr*(1/2TEa)); (bf) Departmental rankings at each level: (b) 41 mg/dL; (c) 82 mg/dL; (d) 123 mg/dL; (e) 227 mg/dL; and (f) 291 mg/dL. Bars show 100 × r; vertical dashed lines indicate r*(TEa) and r*(1/2TEa).
Figure 3. Department-level risk stratification by ISO-anchored residual r. (a) Heatmap of department-level bands across five reference glucose levels (41, 82, 123, 227, and 291 mg/dL). Colors denote risk bands: high (red, r < r*(TEa)), medium (yellow, r*(TEa) ≤ r < r*(1/2TEa)), and low (green, rr*(1/2TEa)); (bf) Departmental rankings at each level: (b) 41 mg/dL; (c) 82 mg/dL; (d) 123 mg/dL; (e) 227 mg/dL; and (f) 291 mg/dL. Bars show 100 × r; vertical dashed lines indicate r*(TEa) and r*(1/2TEa).
Diagnostics 15 03220 g003aDiagnostics 15 03220 g003bDiagnostics 15 03220 g003c
Table 1. Characteristics of POCT blood glucose monitoring systems and the laboratory reference analyzer.
Table 1. Characteristics of POCT blood glucose monitoring systems and the laboratory reference analyzer.
Brand
(Manufacturer,
Country)
Departments (n)Assay Chemistry
(Strip/Enzyme)
Measurement Range
(mg/dL)
Hematocrit Range (%)Operating Temperature (°C)Operating Humidity (%)Strip/
Reagent Lot
Method Type
LifeScan OneTouch Verio (LifeScan, Malvern, PA, USA)21FAD-dependent glucose dehydrogenase (GDH-FAD)20–60020–6015–4010–905869580POCT
Roche Accu-Chek Inform II (Roche Diagnostics, Mannheim, Germany) 12Mutant quinoprotein glucose Dehydrogenase (Mut. Q-GDH; maltose-independent)11–60010–658–4410–90671596POCT
Ascensia CONTOUR TS (Ascensia Diabetes Care, Basel, Switzerland) 2FAD-dependent glucose dehydrogenase (GDH-FAD)20–60015–655–4510–93DP4HM3F31EPOCT
Beckman Coulter AU5800 (Beckman Coulter Diagnostics, Brea, CA, USA)1Glucose oxidase (GOD)10–600N/A18–3230–8025030502Reference analyzer
Abbreviations: FAD = flavin adenine dinucleotide; GDH-FAD = FAD-dependent glucose dehydrogenase; Mut. Q-GDH = mutant quinoprotein glucose dehydrogenase; GOD = glucose oxidase; POCT = point-of-care testing; N/A = not applicable.
Table 2. Overall accuracy and agreement with the laboratory reference.
Table 2. Overall accuracy and agreement with the laboratory reference.
Pairs (n)ISO 15197, %
[95% CI]
BA Mean Bias, mg/dL
(95% LoA)
MARD, %
[95% CI]
RMSE, mg/dL
[95% CI]
175100.0 [97.9, 100.0]−1.43 (−15.6, 12.8)3.8 [3.4, 4.3]7.4 [6.6, 8.2]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Bi, H.; Chen, Y.; Wu, Y.; Shi, Z.; Xia, J.; Yan, Q. Residual Safety Margin-Based Risk Stratification for Hospital-Wide POCT Glucose Meters Anchored to ISO 15197: Moving Beyond Pass-Fail. Diagnostics 2025, 15, 3220. https://doi.org/10.3390/diagnostics15243220

AMA Style

Bi H, Chen Y, Wu Y, Shi Z, Xia J, Yan Q. Residual Safety Margin-Based Risk Stratification for Hospital-Wide POCT Glucose Meters Anchored to ISO 15197: Moving Beyond Pass-Fail. Diagnostics. 2025; 15(24):3220. https://doi.org/10.3390/diagnostics15243220

Chicago/Turabian Style

Bi, Hao, Yuting Chen, Yihan Wu, Zuliang Shi, Jianbo Xia, and Qiuyue Yan. 2025. "Residual Safety Margin-Based Risk Stratification for Hospital-Wide POCT Glucose Meters Anchored to ISO 15197: Moving Beyond Pass-Fail" Diagnostics 15, no. 24: 3220. https://doi.org/10.3390/diagnostics15243220

APA Style

Bi, H., Chen, Y., Wu, Y., Shi, Z., Xia, J., & Yan, Q. (2025). Residual Safety Margin-Based Risk Stratification for Hospital-Wide POCT Glucose Meters Anchored to ISO 15197: Moving Beyond Pass-Fail. Diagnostics, 15(24), 3220. https://doi.org/10.3390/diagnostics15243220

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop