Next Article in Journal
Do Diet and Dietary Supplements Mitigate Clinical Outcomes in COVID-19?
Previous Article in Journal
Relationship between Serum 25-Hydroxyvitamin D Level and Risk of Recurrent Stroke
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Large-Scale Data Analysis for Glucose Variability Outcomes with Open-Source Automated Insulin Delivery Systems

1
CeADAR—Ireland’s Centre for Applied AI, University College Dublin, D04 V2N9 Dublin, Ireland
2
OpenAPS, Seattle, WA 98101, USA
*
Author to whom correspondence should be addressed.
Nutrients 2022, 14(9), 1906; https://doi.org/10.3390/nu14091906
Submission received: 17 March 2022 / Revised: 19 April 2022 / Accepted: 28 April 2022 / Published: 2 May 2022
(This article belongs to the Section Nutrition and Diabetes)

Abstract

:
Open-source automated insulin delivery (AID) technologies use the latest continuous glucose monitors (CGM), insulin pumps, and algorithms to automate insulin delivery for effective diabetes management. Early community-wide adoption of open-source AID, such as OpenAPS, has motivated clinical and research communities to understand and evaluate glucose-related outcomes of such user-driven innovation. Initial OpenAPS studies include retrospective studies assessing high-level outcomes of average glucose levels and HbA1c, without in-depth analysis of glucose variability (GV). The OpenAPS Data Commons dataset, donated to by open-source AID users with insulin-requiring diabetes, is the largest freely available diabetes-related dataset with over 46,070 days’ worth of data and over 10 million CGM data points, alongside insulin dosing and algorithmic decision data. This paper first reviews the development toward the latest open-source AID and the performance of clinically approved GV metrics. We evaluate the GV outcomes using large-scale data analytics for the n = 122 version of the OpenAPS Data Commons. We describe the data cleaning processes, methods for measuring GV, and the results of data analysis based on individual self-reported demographics. Furthermore, we highlight the lessons learned from the GV outcomes and the analysis of a rich and complex diabetes dataset and additional research questions that emerged from this work to guide future research. This paper affirms previous studies’ findings of the efficacy of open-source AID.

Graphical Abstract

1. Introduction

More than 536.6 million people globally (10.5% between the ages of 20–79) are estimated to be living with diabetes [1]. Regardless of type of diabetes, in the US alone, 150–250 million people are estimated to be living with insulin-requiring diabetes [2]. The tools for insulin administration have improved over the last decades, moving from syringe injections to insulin pens and insulin pumps. Glucose monitoring technology has also improved with the advent of continuous glucose monitors (CGM), which enable the measurement of interstitial glucose levels every few minutes. While interstitial glucose does have a lag time, the frequency and trend of more-frequent measurements enable more precise insulin dosing than is possible with single point-in-time fingerstick blood glucose data measurements.
Traditionally, insulin pumps and CGMs were standalone devices that did not communicate with one another, and people with diabetes (PwD) were required to assess data from both the CGM and insulin pump to determine when insulin dosing should be adjusted. As a result, while glycemic outcomes and variability improved with this technology, they still heavily required PwD to take on the burden of diabetes management and decision making hundreds of times per day.
The next evolution in diabetes technology was the shift toward closed-loop systems, where the pump and CGM data are processed through an algorithm (on the pump or on a separate mobile device) and adjustments to insulin dosing are auomatically enacted. These are now known more broadly as automated insulin delivery (AID) systems. Some of the early systems were considered to be “hybrid” closed-loop systems (HCL), because due to the capability of the system and the timing of insulin, PwD were still required to enter a meal estimate or carb entry and also initiate manual insulin dosing for meals (meal-time bolus) [3]. These AID systems have continued to evolve and advance, and some of the latest systems are approaching “fully” closed-loop systems that require less input and interaction by the human user [4].
Typically, medical device technology is created and distributed through commercial channels. However, due to the lag in innovations in diabetes technologies, some individuals living with diabetes leveraged off-the-shelf hardware connected with custom-built software and algorithms, combined with existing on-the-market insulin pumps and CGM, arriving at an automated insulin delivery system. The first of these systems is known as OpenAPS and was shared as open source many years before the first commercial AID system was available [5]. As a result, thousands of PwD worldwide have accessed open-source AID systems, and there is estimated to be dozens of millions of hours of diabetes data from these individuals [6]. Open-source AID has been repeatedly assessed as safe and effective when compared to commercial AID [7] as well as to other methods of diabetes treatment including standard insulin pumps (continuous subcutaneous insulin infusion (CSII)) [8].
The data from early AID adopters can be used to understand not only the capabilities of the systems, but to assess and analyse the undiscovered trends and phenomena in diabetes. This includes deeper assessments and gaining knowledge around glycemic variability, which after HbA1c and time in range (TIR) is becoming a recommended metric to assess clinical outcomes for PwD [9,10].

1.1. Automated Insulin Delivery (AID) Technologies

Automated insulin delivery systems leverage three components: an insulin pump, a CGM, and an algorithm to communicate and process the data from both devices. The algorithm can be contained within the physical body of the insulin pump or held on a separate mobile device. If the algorithm is held on a separate mobile device, decisions are communicated via Bluetooth to a connected and authorised insulin pump. Some open-source AIDs use a small radio bridge device and translate Bluetooth commands to 900 MHz radio-frequency, which older insulin pumps were programmed with.
Because of the early adoption of open-source AID, such as OpenAPS [11], there was early interest from clinical and research communities regarding data and outcomes from such systems, especially because they were self-built by patients. Early studies performed on OpenAPS were primarily retrospective studies [12,13,14] and in some cases relied on self-reported data. Studies that followed began analysing retrospective data (that were not self-reported) to further assess outcomes with open-source AID. However, a majority of studies focused primarily on high-level outcomes of average glucose levels, estimated HbA1c, and overall TIR metrics without looking deeper into system features or concrete concepts such as glycemic variability [15,16,17,18].
With ever-increasing interest in accessing data from open-source AID users, a method was formulated to allow individuals to anonymously and seamlessly donate data for research. The goal was to remove the burden on individual PwD who wanted to support research but did not want to be frequently contacted individually and asked to share their data. A key developer and founder of OpenAPS worked to build the OpenAPS Data Commons on the Open Humans platform, which enabled individuals to anonymously upload their data and share it [19]. The OpenAPS Data Commons then coordinated with interested researchers to manage access to the anonymised pool of data. All individuals have insulin-requiring diabetes by virtue of using an open-source AID. The dataset includes individuals using a variety of open-source AID, such as OpenAPS (OpenAPS, https://OpenAPS.org/, accessed on 25 April 2022), AndroidAPS (AndroidAPS, https://androidaps.readthedocs.io/en/latest/, accessed on 25 April 2022), and Loop (Loop, https://loopkit.github.io/loopdocs/, accessed on 25 April 2022).
The OpenAPS Data Commons is a rich dataset that includes CGM data with timestamps, insulin pump dosing data, and a log of algorithmic processing such as glucose predictions and dosing decisions made by the open-source AID. Other features include any manual entries by PwD such as carbohydrates or temporary targets. A previous version of the OpenAPS Data Commons (when n = 119) was assessed to have 46,586 total days’ worth of data and over 10 million CGM data points [20].
Based on CGM glucose data alone, this is one of the largest glucose datasets available from individuals with insulin-requiring diabetes. Most studies with “big data” approaches on diabetes technologies perform analysis on limited-scale real-world datasets. Some examples include employing random forest and support vector machine algorithms using 14 days’ worth of data collected from 25 people [21], while others include testing in 10 people across 4 weeks to achieve 255 days of data after training on 27,466 days from an unspecified number of individuals [22]. The next largest dataset used in diabetes research appears to be from the Tidepool Big Data Donation Project with up to 41,318 days of data from n = 175 individuals as used in at least one paper [23]. The Tidepool Big Data Donation Dataset may have more data available [24], although it requires a fee to license the datasets for research [25], which may be one reason a limited portion of the dataset is more frequently used. The OpenAPS Data Commons, at the time of writing this paper, includes more than 184 individuals’ donated data and surpasses the size of the number of days of data from other datasets reported in the literature. This paper uses the n = 122 version that was the size of the dataset at the start of this paper’s analysis work; additional data were donated to the dataset following the commencement of this paper.
In addition to the increased number of days, total number of individuals, and overall glucose and insulin dosing dataset, the potential from the OpenAPS Data Commons dataset also comes from the additional data collected from the AID with both a log of all insulin dosing performed by the system and human-entered inputs (such as carbohydrates consumed, targets adjusted for exercise, etc.) as well as the recording of algorithmic processing by the open-source AID system.
As a result of the OpenAPS Data Commons, it is now possible to assess research questions and expand the knowledge of insulin-requiring diabetes in ways that were not possible before due to a lack of truly large-scale diabetes data.

1.2. Motivation to Study Glucose Variability

There has been increased interest in glycemic variability, both as a potential metric to assess outcomes in PwD as well as a metric that could be used to assess and evaluate changes in other metrics such as correlations with hypoglycemia [26]. Previous studies have found glucose variability to be a predictor of severe hypoglycemia [27], but the exact relationship between them is unknown. According to the Diabetes Complications and Control Trial (DCCT), one of the concerns in insulin-requiring diabetes is that lower average glucose and HbA1c might be achieved by increasing rates of hypoglycemia and resulting correlations with increased morbidity. Modern diabetes technology typically enables lowered average glucose and HbA1c without increasing rates of hypoglycemia, yet the risk of hypoglycemia still exists, even with AID. Furthermore, hyperglycemia and glycemic variability are considered to be a factor inducing oxidative stress and overproducing reactive oxygen species. With AID data, such as OpenAPS Data Commons, it is possible to assess rates and incidents of hypoglycemia and hyperglycemia and study their exact relationship with glycemic variabilty.
It is also possible to assess whether there are any differences within subgroups in the dataset, such as by self-reported gender identity, age, or other basic demographic variables. Currently, all PwD using AID use the same algorithms and adjust their baseline settings individually. However, if any sub-group can be identified with different glucose outcomes from using the same system, it may be possible to either adjust systems in the future or recommend individuals to change settings or feature choices to better address differences within groups using the same AID.

1.3. Paper Contributions and Organisation

This paper seeks to describe methods of assessing glycemic variability using the CGM data from the largest freely available dataset of individuals with insulin-requiring diabetes, the OpenAPS Data Commons, along with a partial self-reported dataset of basic demographic variables.
In summary, the main original contributions of this work are:
  • Methods and techniques for data cleaning and glycemic variability analysis for large-scale diabetes data, i.e., OpenAPS Data Commons, originating from open-source AID technologies. We further calculate standard clinical metrics to analyse the glucose variability outcomes and add to previous demonstrations of the effectiveness of open-source AID technologies.
  • The application of a machine-learning-based hierarchical clustering algorithm in order to understand distinct patterns across glucose profiles of insulin-requiring individuals. We discovered that there are no obvious sub-populations in the open-source AID user community that are being underserved.
  • The first in-depth timeseries analysis for glucose variability outcomes and data-driven comparative analysis of the outcomes in open-source AID users based on self-reported population genders. We demonstrate some gender-wise differences in glucose mean and distribution during the times of a day, days of a week and month, and months of a year.
All programming scripts and tools developed for the analysis of demographics and glucose data in this paper are made public at [28].
Section 2 presents the literature review of existing glucose analysis software tools and technologies making use of CGM data and provides a comprehensive analysis of state-of-the-art research and challenges in glycemic variability. In Section 3, we provide methods and techniques adopted for diabetes data collection, anonymisation, and cleaning along with a list of employed glucose analysis metrics in this paper. Section 4 presents the results of demographics and glucose data analysis conducted in this paper, including the timeseries analysis of glycemic variability outcomes in the OpenAPS Data Commons dataset. Section 5 presents discussions on the analysed glucose outcomes, highlights the lessons learned and criticises the limitations, and provides a roadmap for future considerations. Section 6 concludes the paper.

2. Related Work

This section reviews the existing tools and techniques in the area of glucose variability analysis for continuous glucose monitoring (CGM) data and highlights the major challenges.

2.1. Popular Metrics for Assessing Glucose Variability

Insulin-requiring diabetes management is commonly aided by CGM, which yields longitudinal glucose data and assists clinicians and researchers to understand various factors of glucose variability. Since CGM generates timeseries data, there are a number of statistical and machine-learning (ML) methods to help understand and summarise it. According to PubMed, the glucose variability keyword is mentioned in over 26,000 publications and is considered a key metric in clinical research [29]. There are over 25 glucose variability (GV) metrics proposed in the literature, and only a few have been clinically validated. Since the reported results and statistics do not converge based on a single methodology, comparing and analysing them across various studies is a non-trivial task. Table 1 lists the clinically validated metrics that employ CGM data to calculate glucose variability.

2.2. Software Tools for Automated Variability Analysis of Continuous Glucose Monitoring (CGM) Data

Cgmquantify [37] is an open-source toolbox developed to assist the calculation and visualisation of various clinically validated metrics of glucose variability. Its functions are implemented as Python and R programming libraries but require glucose data in Dexcom CGM format. In [39], the CGM-GUIDE tool is proposed that provides a user-friendly graphical user interface for monitoring CGM data and different variability metrics including the percentage of time spent in different glucose levels during an interval. The CGDA [40] data analysis tool implemented as an R programming package provides a simple interface and list of functions to analyse glucose data from any available CGM.
Other software tools featuring support for glucose management and variability analysis include EasyGV [41], cgmanalysis [42], and GlyCulator [43].

2.3. Comprehensive Review of Efficiency and Performance of Glucose Variability Metrics

Although the DCCT had considered HbA1C as the standard metric for glycemic control (in part due to the different set of tools available at the time of the landmark study), minimising the blood glucose variability (GV) has been more recently explored over the past decade as another promising metric for glucose management [44]. Understanding GV is integral to both the physiology and pathophysiology of diabetes, and it is interlinked with the risk of hypoglycemia. Quantifying the interconnection between GV and hypoglycemia is tough as the glucose fluctuations are determined by both the amplitude and timing [45].
Simple statistical metrics such as mean blood glucose and standard deviations have not been observed to be contributing factors in the development of microvascular complications of diabetes [46]. A strong correlation has been shown between HbA1c and diabetes-related complications for both Type 1 and Type 2 diabetes [47]. However, in recent years, instability of glucose has been found to contribute more than HbA1c in the development of diabetes-related complications. This has been further validated by clinical studies where significant glucose fluctuations have been observed in children with Type 1 diabetes while they maintain a normal HbA1c. A review of medical conditions such as oxidative stress and intensive care settings shows a difference in GV [47].
It has been discovered that GV increases progressively from initial diabetes-related conditions through the development of Type 2 diabetes and is comparatively higher in people with Type 1 diabetes. A review of GV metrics in clinical and research applications shows that the coefficient of variation (CV) and standard deviation (SD) are the most used metrics for GV in the literature [9].
A critical analysis of the efficiency of the mean amplitude of glycemic excursion (MAGE) shows that it lacks the ability to capture excursion frequency and distance travelled. Another drawback of using MAGE as a GV metric is the differences in its implementations [48]. The available MAGE implementations are provided by EasyGV, cgmanalysis, cgmquantify, and iglu, yielding median errors of 20%, 78%, 11%, and 42%, respectively, when compared with the manual calculations. In Fernandes et al. [49], an approximation algorithm to access MAGE is developed with an objective to improve the accuracy of calculations. The accuracy of the technique was evaluated using a five-fold cross-validation technique and was found to have a median error of 1%.
In Marling et al. [50], CGM charts were classified using three ML models including a naive Bayes classifier, a multilayer perceptron, and a logistic model tree. For model training, daily CGM curves were labelled by two physicians and compared to GlycoMark [51] (serum levels of 1,5-anhydroglucitol) as reference measures. However, the labelling consistency between the two physicians was around 81%. Furthermore, the performance of ML models was also affected by the limited training data.
Dovc et al. [52] evaluated the correlation between CGM use and GV in pre-school children with Type 1 diabetes using data from the Slovenian National Registry. GV was analysed for a period of 5 years with and without the use of CGM among the participants. The results indicate that the use of CGM reduced the GV. The mean glucose with and without CGM was 3.6 mmol/L and 4.3 mmol/L, respectively. Similarly, the coefficient of variation was 44% and 46.1% with and without CGM use, respectively.
In Moscardo et al. [41], the glycemic variability assessment methods are developed in the EasyGV software tool that yields a correlation of 98% with most of the GV metric except the calculation of MAGE. The difference in calculating MAGE is because EasyGV calculates MAGE using a fuzzy-logic-based method.
Boris et al. [26] argued that diabetes management requires balancing between mean blood glucose and frequency of hypoglycemia. Therefore, with the adoption and evolution of the automated insulin delivery systems (AID), it is necessary to standardise the GV metrics. The authors computed various metrics including SD, CV, MAGE, MAG, LBGI, HBGI, and ADRR to understand the principal components of glucose variations, i.e., amplitude and timing.
The existing GV metrics mainly focus on measuring the amplitude components of the fluctuations and lack an integration of the timing component [45]. However, because of the inherent inaccuracies of various existing GV metrics, such as J-index, MAG, and CONGA, there is a need to develop and fine-tune metrics that characterise primary features of glucose activity such as time to peak, and time to recovery to the baseline [53]. With the evolution of CGM-based data collection technologies, calculating timing factors has become possible. Timeseries analysis techniques serve as promising techniques to measure the timing components of the glucose fluctuation profiles. The metrics for GV calculations include time in range (TIR) and time out of range (TOR).
In Rodbard et al. [9], a methodology using timeseries analysis to profile CGM data during the day is proposed to understand the relationships of GV during and after meal intakes by setting specific time windows. The need for large-scale diabetes datasets is highlighted for different populations to interpret GV and other CGM data metrics. This would further serve to better understand the practical pros and cons of traditional timeseries approaches to analysing glucose data.
Although experimental discoveries suggest oxidative stress has a strong link with short-term GV, the mechanisms for long-term GV based on visit-to-visit measurements of HbA1c and fasting plasma glucose along with their SD and CV calculations are not well defined [10].
Siegelaar et al. [54] reviewed the research methods to measure GV with a view to finding a link between GV and oxidative stress and extreme out-of-whack glycemic conditions. GV has been found to be an important predictor variable for extreme hypoglycemic conditions in people with Type 1 diabetes. GV is greater in people with diabetes who experience severe hypoglycemia [27]. However, the exact relationship is still unknown. In order to find the predictive power of the model variables for hypoglycemia, a general boosted model was developed, and coefficient of variation and MAG were found to have over 40% and 10% influence, respectively [27].
To summarise, understanding glucose variability is promising for improving diabetes management. Major challenges in the area of GV include a lack of consensus on the best approach to measure the GV given the CGM data. A better understanding of GV metrics fed by the evidence from the evaluation of real-world diabetes datasets can result in better tools to minimise the metabolic ups and downs of blood glucose levels and prevention of or reduction in, wherever possible, diabetes-related complications.

3. Materials and Methods

This section first details the procedures and setups put in place for diabetes data collection and anonymisation. We then describe the methods used to establish the data cleaning pipeline and list the statistical and variability metrics for glucose analysis used in this paper. Lastly, the section provides a summary of the cleaned OpenAPS Data Commons dataset and the self-reported demographics of the insulin-requiring AID users.

3.1. Diabetes Data Collection and Anonymisation Highlights

The primary dataset for this analysis comes from the OpenAPS Data Commons, a dataset collated as a project on the Open Humans platform. The Open Humans platform (Open Humans platform, https://www.openhumans.org/, accessed on 25 April 2022) allows individuals to connect their data to the platform and donate it to projects such as the OpenAPS Data Commons (OpenAPS Data Commons project on Open Humans, https://www.openhumans.org/activity/openaps-data-commons/, accessed on 25 April 2022). They are typically donated through an uploader project to Open Humans that anonymises the data [55], so that data stored in the Open Humans platform and then donated to the OpenAPS Data Commons are anonymised. Additionally, the OpenAPS Data Commons leverages other privacy-preserving features of Open Humans, including no collection of username or email addresses. Project participants are assigned a random, 8-digit identifier for the project. Participants can be contacted only via the Open Humans messaging platform, which again does not provide any identification of participants to the project administrator. As such, the OpenAPS Data Commons holds a complex, rich anonymous dataset that can then be used by research projects such as the one described within this paper.
The OpenAPS Data Commons dataset is provided upon request to researchers who agree to simple terms of use of the data. Once agreed upon, researchers receive a Dropbox link holding two versions of the dataset. One is an untouched, .gzip version of the dataset downloaded from OpenHumans. The second is an unzipped version that has also been converted to csv format, using an open-source tool designed to convert complex data files without known data structures [56]. The unzipped json version files also are provided.
Part of the reason for the complexity and unspecified file structure for each participant within the OpenAPS Data Commons dataset is due to the nature of the flexibility of the diabetes open-source community and the tools commonly used, such as Nightscout. Nightscout is an open-source remote monitoring platform that many use for real-time personal monitoring of diabetes data from multiple devices; other features include data analytics. Each Nightscout site is self-managed, so individuals coordinate their own data storage. Nightscout has typical device fields and data structures for common devices such as continuous glucose monitors (CGM), insulin pumps, blood glucose meters, etc. However, because a plethora of devices can be connected to Nightscout, there is often disparate formatting with regard to data labels or date and timestamps across devices. Such complexity is also therefore transferred into the data structures within the OpenAPS Data Commons, since Nightscout is typically the platform most commonly used for data donation. As a result, significant data cleaning must be done to provide uniformity across individuals’ datasets before further analysis.

Demographics Data Collection

Alongside the diabetes data discussed above, there is also a partial dataset of demographics information associated with the OpenAPS Data Commons. At the time that individuals first join the OpenAPS Data Commons project on Open Humans, they are redirected to a Google Form which asks for voluntary demographics information. This includes age, gender, geographic location, ethnicity; estimated weight, height, and insulin usage; relevant diabetes dates (diagnosis, pump or CGM commencement, open-source AID commencement); and type of open-source AID.
Participants are not required to fill out this survey, so some either do not see the redirect or choose not to fill it out. Gender was added as a question to the survey after the project was established, so early participants do not have gender-reported data. As the data are self-reported and only reported at the initial joining of the project, they reflect demographic data such as estimated height, age, weight, etc., only representing that point in time, and may not be representative throughout their data, as many individuals have donated multiple years’ worth of diabetes data.
Like the OpenAPS Data Commons dataset, the companion demographics file is made available via a Dropbox link for download as a .csv or .xls file upon request.

3.2. Glucose and Demographics Data Cleaning

The data pre-processing and cleaning pipeline for glucose and demographics data was implemented using Python frameworks and shell scripts.
  • File formatting: We use glucose entries files in .csv format to ease data visualisation support using spreadsheets. Since the originating source contains a large volume of .json data, there are multiple .csv files for each individual. The files were previously converted to .csv using the Unzip-Zip-CSVify-OpenHumans-data.sh script (this is an open-source script along with other open-source tools used for processing large complex data such as the OpenAPS Data Commons data coming from the Open Humans platform, https://github.com/danamlewis/OpenHumansDataTools, accessed on 25 April 2022).
  • Unified datastore: We pull glucose entries data in .csv format for all individuals to a common directory representing a unified data store.
  • Timestamp cleaning and consistency: Each glucose entry file contains inconsistent timestamps. The cleanest timestamps were appended with letters such as T and Z represented in the following example format: “2018-08-04T23:58:50Z”. These were cleaned simply by trimming the alphabet letter. Multiple types of timestamp formats represented in different time zones—GMT, PDT, CES, CST, etc.—were found in single glucose entries files. We cleaned such instances programmatically by employing “regex” functions exposed by the Python Pandas package and lambda functions. Furthermore, we noticed an overlap of different formats of timestamps between the data rows. Some timestamp values were accompanied by the abbreviations for the day of the week such as “Mon”, “Tue”, “Wed”, etc., alongside the year and date. Although a lot of the inconsistent timestamps were cleaned programmatically, some of them required manual labour efforts as well. After all the pre-processing, we converted the timestamps to a consistent date–time format.
  • Glucose entries cleaning and consistency: To maintain the consistency of glucose entries, we performed the following steps during the data cleaning phase:
    • We noticed some glucose data samples contain text such as “null”. Data rows with “null” were removed in each glucose entry file.
    • Multiple .csv files for the same individual were merged, and data were organised in the increasing order of the timestamps in dataframe columns.
    • All the duplicate timeseries values (if any) were removed, and only the first entry for duplicate timestamps was kept.
    • All infinite numbers represented as “inf” were replaced with “NaN”, and all rows with “NaN” were dropped.
    • Based on CGM device knowledge, decisions were made to remove data values that represent error terms, as they should not be included in calculations for glucose values (units mg/dL). This includes removing every data point less than 39 and greater than 1000. Any data point greater than 400 and less than 1000 was replaced with 400. No further interpolations were performed to cover the error terms.
  • Validation of data consistency: Finally, we plot glucose data and visualise it for each individual to identify any anomalies or inconsistencies in timestamps and corrected them (See Figure A1).
Since demographics data collection was a self-reported process, there was a difference in the format and units of the reported data. Due to the range of errors in reported demographics, a manual data cleaning process was employed to develop a consistent and reliable dataset for use in this paper.

3.3. Glucose Analysis Metrics

The following statistical and variability metrics for glucose analysis are calculated in this paper.
  • Count—Total CGM points to calculate the amount of data.
  • Mean—Average of CGM data.
  • Min—Minimum CGM data value.
  • Max—Maximum CGM data value.
  • Q1 or 25%—First quartile.
  • Q2 or 50%—Second quartile.
  • Q3 or 75%—Third quartile.
  • IQR—Interquartile range.
  • SD—Interday standard deviation of CGM data.
  • CV—Interday coefficient of variation.
  • TOR < 70—Time outside range, i.e., hypoglycemia. Total glucose data points less than 70 mg/dL in percentage.
  • TIR—Time inside range, i.e., total glucose data points within target range between 70 mg/dL and 180 mg/dL in percentage.
  • TOR > 180—Time outside range, i.e., hyperglycemia. Total glucose data points greater than 180 mg/dL in percentage.
  • POR—Total percentage of time outside range (range in standard deviations from mean).
  • J_index—Glycemic variability.
  • LBGI—Glycemic variability metric to calculate low blood glucose index.
  • HBGI—Glycemic variability metric to calculate high blood glucose index.
  • GMI—Glycemic management indicator.

3.4. Summary of Glucose and Demographics Data

In total, we cleaned 46,070 days’ worth of glucose data donated by 122 individuals. On average, each individual donated 377 days’ worth of data. Over 70% of the insulin-requiring individuals donated data representing five months or more. Figure 1 summarises the mean glucose and SD ranges for each insulin-requiring individual. The average mean and distribution of glucose data is 139 ± 49.8 mg/dL, respectively. The mean glucose data quartiles Q1, Q2, Q3, and interquartile range (IQR) are {102.83, 129.40, 166.13, 63.3} mg/dL, respectively.
Table 2 lists the collected demographic features alongside the count of reported demographic features. The average reported age at the point in time in which individuals donated data is 36 years. A total of 78 individuals reported their gender, out of which 50 are males and 28 are females. Insulin-requiring individuals from 21 countries reported their demographics. Most are from the USA, Germany, and UK with a total count of 45, 12, and 6, respectively. The average and median per-day self-reported insulin intake, in general, is 44.57 and 40 units, respectively. For 25% of people, insulin intake is less than 31.53 units per day and more than 51.29 units per day. The average and median insulin intake reported for males is {45.21, 39.84} units and for females is {49.06, 36.85} units, respectively.

4. Results

This section present the results of demographics and glucose data analysis followed by glucose variability and timeseries analysis for OpenAPS Data Commons datasets based on gender classification.

4.1. Demographics Data Analysis

To understand the relationship between the demographic features listed in Table 2, we performed statistical tests using Spearman correlation. Figure 2a shows the heat-map with correlation matrix between demographics features where light colour or 0 means no correlation and dark colour or ±1 means high (anti) correlation.
We observed a maximum correlation of 69% between self-reported total daily insulin units and daily basal insulin units. It is a statistically significant linear correlation with p < 0.001 within a 95% confidence interval. An increase in daily insulin by 1 unit increases daily basal insulin on average by 1.40 units. The second most statistically significant linearly correlated features are weight and self-reported total daily insulin units, i.e., 63% (p < 0.001). Finally, the correlation between weight and self-reported basal units is 61% (p < 0.001). Other demographic features show a poor and statistically insignificant correlation between each other.
Feature distributions were further analysed by classifying the data based on gender. Figure 2b shows the box plots with demographic distributions where the data are normalised altogether before classifying with respect to gender. The normalisation is performed using MinMaxScaler function exposed by Python scikit-learn Library. It can be seen that the self-reported total daily insulin intake for females is comparatively greater than males. The high positive and statistically significant correlation (p < 0.001) between self-reported insulin intake and weight can be observed as female weight and insulin intake are greater than the male weight and insulin intake, respectively. The average self-reported insulin intake by males and females is 45.58 units and 49 units, respectively. We observe greater carbs and basal intake for males as compared to females.

4.2. Glucose Data Analysis

After independently profiling the glucose data for each individual (Figure A1), this section calculates the glucose analysis metrics listed in Section 3.3. Table 3 shows the summarised statistics with minimum, maximum, average, quartiles, and interquartile ranges for each glucose variability metric. Figure 3 shows a plot with portions of glucose TIR and TOR ranges in percentages for each individual in our dataset. In general, it can be seen that a number of individuals have well achieved a TIR above recommended standards [57]. If we compare the TOR < 70 and TOR > 180, we observe that there is more tendency within this dataset for an individual to have TOR > 180 than TOR < 70. Salient results include:
  • It can be seen that the interday glucose standard deviation is 49.75 mg/dL with a minimum and maximum of 14.71 mg/dL and 77.33 mg/dL, respectively.
  • We calculated the glucose rate of change for each individual in our dataset using the formula [ g l u c o s e ( t i ) g l u c o s e ( t i 1 ) ] / ( t i t i 1 ) . We analysed visually and statistically the glucose rate of change (ROC) for each insulin-requiring individual (see Figure 4). The standard deviation of glucose ROC has a minimum, average, and maximum of {0.61, 1.42, 2.69} mg/dL per minute, respectively. The Shapiro–Wilk test (statistical test for normality) performed on SD of glucose ROC data yielded p = 0.205 (i.e., >0.05) indicating that the data follow a normal distribution. According to 3 σ rule of distribution statistics, 99.7% of the data lies between 0.29 and 2.46.
  • We further analysed the distribution of standard deviation of rate of change (ROC) for individual glucose profiles in the dataset (see Figure A2). The minimum, average, and maximum of SD ROC calculated separately for males and females is {0.60, 1.36, 2.6} and {0.7, 1.4, 2} mg/dL per minute, respectively.
  • The average interday coefficient of variation (CV) is 35.43. A total of 25% of the insulin-requiring individuals have CV less than 32.42 and greater than 38.47, whereas the interquartile range is 6.
  • We observe that the average TIR is 77.27% for people using DIY technologies. Furthermore, less than 25% of the individuals have TIR less than 71%. However, over 25% of the insulin-requiring individuals achieved a TIR higher than 84%. The minimum, average, and maximum for TOR < 70 and TOR > 180 is {0.23%, 4%, 16.97%} and {0.05%, 18.74%, 49.67%}, respectively.
  • The minimum, average, and maximum for TOR < 70 and TOR > 180 highly correlate to LBGI and HBGI, respectively.
  • The mean J_index and GMI for insulin-requiring individuals in our dataset is 36.42 and 6.63, respectively.
Table 3. Summarised statistics for glucose variability. Total number of individuals (n) = 122.
Table 3. Summarised statistics for glucose variability. Total number of individuals (n) = 122.
MinMaxAverageQ1Q2Q3IQR
Interday SD (mg/dL)14.7177.3349.7543.1349.3458.1014.97
Glucose ROC SD0.612.691.421.151.411.660.51
Interday CV (%)16.8644.9435.4332.4235.8738.476.05
TIR (%)49.7598.4577.2671.1877.9184.0712.89
TOR < 70 (%)0.2316.974.011.793.225.613.83
TOR > 180 (%)0.0549.6718.7412.7217.1425.5212.81
LGBI0.133.821.090.640.951.420.78
HBGI0.0313.254.362.833.955.893.06
J_index10.3973.9336.4229.7135.4943.8414.14
GMI5.407.966.636.386.636.920.53
Figure 3. Glucose TIR and TOR for insulin-requiring individuals using open-source AID systems. Total number of individuals (n) = 122. The average TIR for insulin-requiring individuals in OpenAPS Data Commons dataset is 77%. There is a higher tendency for individuals to have more TOR situations over the higher TIR limit (180 mg/dL) as compared to TOR situations less than the lower TIR limit (70 mg/dL).
Figure 3. Glucose TIR and TOR for insulin-requiring individuals using open-source AID systems. Total number of individuals (n) = 122. The average TIR for insulin-requiring individuals in OpenAPS Data Commons dataset is 77%. There is a higher tendency for individuals to have more TOR situations over the higher TIR limit (180 mg/dL) as compared to TOR situations less than the lower TIR limit (70 mg/dL).
Nutrients 14 01906 g003
Figure 4. Standard deviation of glucose rate of change. Total number of individuals (n) = 122.
Figure 4. Standard deviation of glucose rate of change. Total number of individuals (n) = 122.
Nutrients 14 01906 g004

4.3. Clustering Glucose Profiles

To understand whether there are distinct patterns of glucose data profiles in insulin-requiring individuals in our dataset, we used an unsupervised learning approach and employed a hierarchical/agglomerative clustering algorithm.
The main steps involved in agglomerative clustering include starting by treating each glucose data profile as one cluster. Therefore, the number of clusters initially are equal to the number of glucose profiles. The clusters are then formed based on joining the closest profiles and ultimately form one big cluster by joining small clusters. Dendrograms divide the single big cluster into multiple clusters using the Euclidean distance as a metric. We used linkage and fcluster tools from Python Scipy library to perform the clustering.
As an outcome of our initial experimental investigation, we did not find any distinct clusters of glucose profiles. There is a possibility to obtain some distinct and meaningful clusters by tuning the distance metrics in other ML-based clustering approaches. However, we can argue that not observing distinct clusters of glucose profiles indicates that there is no obvious flaw in open-source AID systems at the highest level, which directly rebuts potential concerns related to the potential additive harm of open-source AID [58].

4.4. Glucose Variability Analysis Based on Gender

This section presents our results for glucose variability analysis by mapping glucose entries in OpenAPS Data Commons dataset with individuals’ self-reported gender. We divided the datasets for males (n = 50) and females (n = 28) and calculated the GV metrics including HBGI, LBGI, GMI, J_index, CV, SD, TIR, and POR. Figure 5 shows the distribution of GV outcomes for insulin-requiring individuals based on gender. Salient observations from the GV outcomes include:
  • The minimum, average, and maximum LBGI for females and males are {0.13, 0.87, 1.74} and {0.24, 1.11, 3.82}, respectively. Similarly, the minimum, average, and maximum HBGI for females and males is {0.80, 4.90, 8.75} and {0.33, 4.44, 13.24}, respectively. It can be seen that LBGI and HBGI distributions for males and females are positively skewed. The Shapiro–Wilk test on LBGI and HBGI yielded p < 0.05, confirming that the data do not belong to a normal distribution.
  • The minimum, average, and maximum GMI for females and males is {6.04, 6.76, 7.46} and {5.66, 6.64, 7.96}, respectively. We do not observe any differences in GMI distribution patterns for males and females. However, the Shapiro–Wilk test yielded p > 0.05, indicating that the GMI follow a normal distribution.
  • J_index distributions show a slight positive skew with higher average statistics for females with 38.99 as compared to males with 36.63. p > 0.05 is obtained from the Shapiro–Wilk test, indicating that J_index follows a normal distribution.
  • There are no distinct differences between females and males for both CV and SD. The average CV in percentage and SD in mg/dL are {35.80, 51.93} for females and {35.23, 49.62} for males. Furthermore, there is a negative skew for both males and females in CV distributions. Using the Shapiro–Wilk test, p < 0.05 for CV confirms its non-uniform distribution. The SD distributions are more uniform (p > 0.05).
  • Average TIR for females and males is 75.13 and 76.90. The distributions for both females and males are uniform (p > 0.05) and exhibit similar patterns. For our datasets, more than 75% of males and females achieve TIR of over 70%. The maximum TIR achieved for females and males in 96.25% and 98.45%
  • The POR distributions for both genders are uniform (p > 0.05), where average POR for females is 28.92% and males is 28.26%.
Figure 5. Glucose variability outcomes for individuals using open-source AID systems, based on gender. Total number of males and females is 50 and 28, respectively. (a) HBGI. (b) LBGI. (c) GMI. (d) J_index. (e) Glucose Coefficient of Variation. (f) Glucose Standard Deviation. (g) Glucose Percentage of Time Inside Range (TIR). (h) Glucose Percentage of Time Outside Range (POR).
Figure 5. Glucose variability outcomes for individuals using open-source AID systems, based on gender. Total number of males and females is 50 and 28, respectively. (a) HBGI. (b) LBGI. (c) GMI. (d) J_index. (e) Glucose Coefficient of Variation. (f) Glucose Standard Deviation. (g) Glucose Percentage of Time Inside Range (TIR). (h) Glucose Percentage of Time Outside Range (POR).
Nutrients 14 01906 g005aNutrients 14 01906 g005b

Timeseries Analysis for Glucose Data based on Gender

This section presents the experimental results of timeseries analysis to study the variations in glucose mean and standard deviation (SD) during the hours of a day, days of week and month, and months of the year for all the individuals in the OpenAPS Data Commons dataset. We further analyse the average trends and differences between males and females in terms of glucose mean and SD.
To perform the analysis, we separated the timestamps into hours, dates, days, and months for all the glucose entries for all the individuals and append them in separate columns as Python Pandas dataframe. We grouped the glucose entries based on hours, dates, days, and months for each individual and calculate the statistics such as count, mean, SD, minimum, maximum, Q1, Q2, and Q3. We plotted the distributions to visually inspect the differences in glucose patterns for each individual. Figure A3 shows an example for timeseries results for one insulin-requiring individual in the OpenAPS Data Commons dataset.
To summarise the results and draw meaningful understandings from our dataset, we calculated and analysed the glucose mean and SD that are averaged for all individuals. Figure 6 shows the glucose mean and SD that is averaged for all individuals during the days of a week. It can be seen that both males and females follow similar trends in terms of mean glucose. The minimum mean glucose for females is 139.86 mg/dL on Friday. We also observe that the glucose mean for females is slightly greater than for males throughout the week. We observe a similar pattern in glucose SD, where females have a higher SD as compared to males; however, there is a difference in the trend of SD profiles. The maximum SD for females is 54.14 mg/dL on Monday and for males is 52.3 mg/dL on Saturday.
Figure 7 shows the glucose mean and SD that is averaged for all individuals during the hours of a day. It can be seen that the female glucose mean and SD is greater than males during all the hours of a day except for the first hour, i.e., 12:00 a.m., where male SD is greater than female SD. Both females and males have a minimum mean glucose of 145.32 mg/dL and 139.04 mg/dL at 8:00 a.m, respectively. The variations in the form of SD for both males and females drop a few hours after midnight until 8:00 a.m. and then continue to rise during the day until 12:00 a.m. The maximum SD for females and males is 49.35 mg/dL and 45.87 mg/dL during the last hour of the day (i.e., 23:00–24:00), respectively.
Figure 8 shows the glucose mean and SD that is averaged for all individuals during the months of a year. We observe a similar pattern (such as in Figure 6 and Figure 7) for glucose mean and SD with female measures greater than the males during all months of a year. It can be seen that the variations in terms of SD are greater during the first and last four months of the year. However, during the summer months (May, June, July, and August), the SD for both males and females is comparatively lower.
Figure 9 shows the glucose mean and SD that are averaged for all individuals during the days of a month. The glucose mean and SD for females are greater than the males for all of the days. The variations in terms of SD are higher for females during the first 20 days of the month. There are a few overlaps between the SDs for males and females during the last 10 days of the month.

5. Discussion

This paper assessed demographics and glucose data analysis followed by glucose variability and timeseries analysis for diabetes-related data from the OpenAPS Data Commons, a dataset with anonymised, donated data from individuals with insulin-requiring diabetes.
Demographics were reported at the time of the first data donation, so they may not be accurate throughout the time period of glucose data available, but they nevertheless provide an opportunity to assess demographic-related correlations with glucose data. For example, within this dataset, there is a positive correlation between distributions of insulin intake and weight in females (p < 0.001 using Spearman correlation), whereas we observed higher carbohydrate and basal rate requirements in males. This matches existing studies showing higher carbohydrate intake in males [59] and higher weight in females with diabetes compared to males [60].
Individuals within this dataset are using open-source AID systems, which have been frequently studied and shown to achieve greater TIR [61] than recommended standards [57]. This dataset further adds to this demonstration of efficacy, with an average TIR of 77.27%. This includes data from early-era open-source AID as well as open-source AID in more recent years, which is relevant because the algorithms and feature sets of open-source AID have changed over time. As such, the TIR from this study should not only be taken as an indicator of what is currently being achieved in the “DIY” community. If such an assessment were to be preferred, a sub-analysis of the OpenAPS Data Commons could be performed looking at recent data only to reflect modern system use and a more homogeneous feature set influencing the glycemic outcomes. Additionally, early concerns from healthcare providers regarding open-source AID included the concern that open-source AID would cause additional hypoglycemia [62]; however, our analysis shows there were not excess levels of hypoglycemia alongside the overall positive efficacy observed with above-goal time in range outcomes.
Notably, within this study, machine-learning-based hierarchical clustering methods were performed, and despite the differences in glucose variability by gender described below, no obvious clusters were found. This clustering analysis was conducted to determine if there were obvious sub-populations within this dataset that were not being effectively served or were particularly effectively served by their choice of open-source AID. This is a piece of additional evidence, other than the glycemic outcomes highlighted above, in support of open-source AID being effective at solving the majority of “noise” and glucose excursions on a regular basis.
This paper is one of the first to perform an in-depth analysis of glucose variability analysis by gender. As described previously, most studies on open-source AID are simply assessing efficacy or outcomes at a population level. This study found that in this particular version of the dataset, the average LBGI for females (0.87) is lower than for males (1.11), and the average HBGI was also higher in females (4.90) than in males (4.44). However, GMI distribution patterns were not distinguished between males and females. Similarly, CV and SD within this dataset showed no distinct differences between males and females. CV distributions for both genders have negative skewness (p < 0.05 obtained using the Shapiro–Wilk test), whereas SD distributions are more uniform (p > 0.05 obtained using the Shapiro–Wilk test). Average TIR distributions between genders were also similar and uniform (p > 0.05), including time above range.
The timeseries analysis for glucose data based on gender provided more distinct differences between genders, although both show similar trends. The mean glucose levels for females are slightly greater than for males throughout the week, and similarly, SD is higher in females than males, although there is a different pattern throughout the week between genders. In this dataset, female SD is highest on Monday whereas males have a higher SD on Saturday (Figure 6). Moving to an assessment of the hours throughout the day (Figure 7), both females and males follow a similar pattern of glucose mean dropping between midnight and 8:00 am, then rising throughout the day and evening until midnight.
For assessing changes throughout a calendar year (Figure 8), glucose mean and SD is again higher in females than males; however, the variation in terms of SD between gender are greater in the first and last 4 months of the year. The summer months (i.e., May through August) result in lower SD for both males and females. This matches the existing population-based literature findings of seasonal variation among people with diabetes [63].
Throughout a typical calendar month, the mean glucose of females is relatively flat throughout the month but with changes in SD throughout the month. This matches findings from studies with commercial AID systems where glucose is well-managed on average by AID systems throughout the menstrual cycle [64]. However, up to two-thirds of women may experience a menstrual cycle phenomenon [65], so further analysis should be done within the female-specific sub-population to assess the two thirds who experience cyclical changes separate from those who do not. A better understanding of menstrual cycle changes [66] could lead to improvement in AID systems or education for those with menstrual cycles and their healthcare providers regarding existing features and options within AID systems that could support menstruating individuals with menstruation-related glycemic changes.

Limitations

One of the limitations of this study is that the timeseries and gender-based analyses were performed on datasets of different lengths, e.g., some individuals may have 2–3 weeks of data while others have months and even years of data. Some analyses adjust for this, whereas others do not. These analyses are not meant to be taken as an assessment of all modern AID (open-source or commercial) but as a demonstration of methods that should be built on with additional studies.
Notably, the data within this dataset include data ranging from 2015 through 2021. Open-source AID systems have changed over time, so data from 2015 do not necessarily reflect real-world open-source AID usage in 2022 and beyond. Data from RCTs such as the CREATE trial [67] provide a modern representation of open-source AID potential instead. Lastly, the demographics are self-reported and were provided at the onset of data donation, and therefore do not necessarily match the age, weight, etc., of the individual over time.
This work primarily focused on glucose variability and demographics analysis and demonstrates the effectiveness of a large-scale data-driven analysis for the developments of open-source AID technologies. However, the OpenAPS Data Commons dataset has great potential for further applied research and development. In our future work, we will employ the data processing and analysis framework established in this paper and expand it by including algorithmic-derived novel variables such as autosenstivity [68].

6. Conclusions

This paper presented methods and techniques employed for anonymising, cleaning, and analysing the largest freely available diabetes dataset, i.e., OpenAPS Data Commons, donated by insulin-requiring individuals using open-source automated insulin delivery technologies. These data further validate previous findings of improved glycemic outcomes, including increased time in range without significant levels of hypoglycemia, with open-source automated insulin delivery systems. Furthermore, we employed clinically approved standard glucose variability metrics in timeseries analysis and machine-learning-based hierarchical clustering to evaluate data-driven glycemic variability outcomes. Overall, this paper showed that we can not only measure glucose variability but should be doing so within more studies. More AID studies should assess GV in addition to TIR and HbA1c and GMI, as it provides useful data for individuals living with diabetes as well as a comparison across the increasing number of AID systems becoming available. To build upon this study, future work should evaluate GV for specific situations, such as post-prandial or post-exercise, and for specific sub-populations such as menstruating individuals.

Author Contributions

Conceptualisation, A.S. and D.M.L.; methodology, A.S. and D.M.L.; software, A.S. and D.M.L.; validation, A.S. and D.M.L.; formal analysis, A.S. and D.M.L.; investigation, A.S. and D.M.L.; resources, A.S. and D.M.L.; data curation, A.S. and D.M.L.; writing—original draft preparation, A.S. and D.M.L.; writing—review and editing, A.S. and D.M.L.; visualisation, A.S.; project administration, D.M.L.; funding acquisition, D.M.L. All authors have read and agreed to the published version of the manuscript.

Funding

This work, as part of the OPEN project (www.open-diabetes.eu, accessed on 25 April 2022), has received funding from the European Commission’s Horizon 2020 Research and Innovation Program under the Marie Skłodowska-Curie Action Research and Innovation Staff Exchange (RISE), grant agreement number 823902.

Institutional Review Board Statement

The study was conducted according to the guidelines of the Declaration of Helsinki and approved by the Institutional Review Board of University College Dublin (LS-20-37-ODonnell).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

All programming scripts and tools developed for the analysis of demographics and glucose data in this paper are made public and online at https://github.com/danamlewis/OpenHumansDataTools/tree/master/bin/GV-demographics-scripts (accessed on 25 April 2022).

Acknowledgments

Thank you to Drew Cooper and Bernd Reinhold for feedback and input on the manuscript, and to members of the #WeAreNotWaiting community who have donated their data to the OpenAPS Data Commons.

Conflicts of Interest

The authors declare no financial conflict of interest. D.M.L. is a volunteer developer of one of the open-source AID systems, OpenAPS. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

Abbreviations

The following abbreviations are used in this manuscript:
OPENOutcomes of Patients’ Evidence with Novel, Do-it-Yourself Artificial Pancreas Technology
OpenAPSOpen-Source Artificial Pancreas System
AIDAutomated Insulin Delivery
APSArtificial Pancreas System
HCLHybrid Closed Loop
T1DType 1 Diabetes
CGMContinuous Glucose Monitoring
PwDPeople With Diabetes (any type)
HbA1cHemoglobin A1c
TIRTime In Range
GVGlucose Variability

Appendix A. Individual-Level Glucose Data Analysis

Figure A1. Visualisation for Glucose Data for an Open-Source AID User.
Figure A1. Visualisation for Glucose Data for an Open-Source AID User.
Nutrients 14 01906 g0a1
Figure A2. Visualisation for Glucose Rate of Change for an AID User. ‘Mu’ is mean and ‘Sigma’ is the standard deviation of distribution.
Figure A2. Visualisation for Glucose Rate of Change for an AID User. ‘Mu’ is mean and ‘Sigma’ is the standard deviation of distribution.
Nutrients 14 01906 g0a2
Figure A3. Timeseries analysis showing glucose distribution of an insulin-requiring individual using open-source AID during hours, week, days, and month. (a) Glucose distribution during the days of a week for an open-source AID user. (b) Glucose distribution during the days of a month for an open-source AID user. (c) Glucose distribution during the hours of a day for an open-source AID user. (d) Glucose distribution during the month of a year for an open-source AID user.
Figure A3. Timeseries analysis showing glucose distribution of an insulin-requiring individual using open-source AID during hours, week, days, and month. (a) Glucose distribution during the days of a week for an open-source AID user. (b) Glucose distribution during the days of a month for an open-source AID user. (c) Glucose distribution during the hours of a day for an open-source AID user. (d) Glucose distribution during the month of a year for an open-source AID user.
Nutrients 14 01906 g0a3

References

  1. Sun, H.; Saeedi, P.; Karuranga, S.; Pinkepank, M.; Ogurtsova, K.; Duncan, B.B.; Stein, C.; Basit, A.; Chan, J.C.; Mbanya, J.C.; et al. IDF diabetes atlas: Global, regional and country-level diabetes prevalence estimates for 2021 and projections for 2045. Diabetes Res. Clin. Pract. 2022, 183, 109119. [Google Scholar] [CrossRef]
  2. Garg, S.K.; Rewers, A.H.; Akturk, H.K. Ever-Increasing Insulin-Requiring Patients Globally. Diabetes Technol. Ther. 2018, 20, S2-1–S2-4. [Google Scholar] [CrossRef] [PubMed]
  3. Lewis, D. Setting expectations for successful artificial pancreas/hybrid closed loop/automated insulin delivery adoption. J. Diabetes Sci. Technol. 2018, 12, 533–534. [Google Scholar] [CrossRef] [PubMed]
  4. Lewis, D. How it started, how it is going: The future of artificial pancreas systems (automated insulin delivery systems). J. Diabetes Sci. Technol. 2021, 15, 1258–1261. [Google Scholar] [CrossRef] [PubMed]
  5. Lewis, D.; Leibrand, S.; Community, O. Real-world use of open source artificial pancreas systems. J. Diabetes Sci. Technol. 2016, 10, 1411. [Google Scholar] [CrossRef]
  6. Lewis, D.M. Do-it-yourself artificial pancreas system and the OpenAPS movement. Endocrinol. Metab. Clin. 2020, 49, 203–213. [Google Scholar] [CrossRef] [PubMed]
  7. Knoll, C.; Peacock, S.; Wäldchen, M.; Cooper, D.; Aulakh, S.K.; Raile, K.; Hussain, S.; Braune, K. Real-world evidence on clinical outcomes of people with type 1 diabetes using open-source and commercial automated insulin dosing systems: A systematic review. Diabet. Med. 2021, 39, e14741. [Google Scholar] [CrossRef]
  8. Patel, R.; Crabtree, T.; Taylor, N.; Langeland, L.; Gazis, T.; Mendis, B.; Wilmot, E.; Idris, I. Safety and effectiveness of Do-It-Yourself Artificial Pancreas System (DIYAPS) compared with continuous subcutaneous insulin infusions (CSII) in combination with Free Style Libre (FSL) in people with Type 1 diabetes. Diabet. Med. 2022, 39, e14793. [Google Scholar] [CrossRef]
  9. Rodbard, D. Glucose variability: A review of clinical applications and research developments. Diabetes Technol. Ther. 2018, 20, S2–S5. [Google Scholar] [CrossRef]
  10. Ceriello, A. Glucose variability and diabetic complications: Is it time to treat? Diabetes Care 2020, 43, 1169–1171. [Google Scholar] [CrossRef]
  11. OpenAPS. Available online: https://OpenAPS.org (accessed on 25 April 2022).
  12. Lewis, D.M.; Swain, R.S.; Donner, T.W. Improvements in A1C and Time-in-Range in DIY Closed-Loop (OpenAPS) Users. Diabetes 2020, 67, 352-OR. [Google Scholar] [CrossRef]
  13. Zabinsky, J.; Howell, H.; Ghezavati, A.; Lewis, D.M.; Nguyen, A.; Wong, J.C. 988-P: Do-it-yourself artificial pancreas systems for type 1 diabetes reduce hyperglycemia without increasing hypoglycemia. Diabetes 2020, 69, 988-P. [Google Scholar] [CrossRef]
  14. Melmer, A.; Züger, T.; Lewis, D.M.; Leibrand, S.; Stettler, C.; Laimer, M. Glycaemic control in individuals with type 1 diabetes using an open source artificial pancreas system (OpenAPS). Diabetes Obes. Metab. 2019, 21, 2333–2337. [Google Scholar] [CrossRef] [PubMed]
  15. Wu, Z.; Luo, S.; Zheng, X.; Bi, Y.; Xu, W.; Yan, J.; Yang, D.; Weng, J. Use of a do-it-yourself artificial pancreas system is associated with better glucose management and higher quality of life among adults with type 1 diabetes. Ther. Adv. Endocrinol. Metab. 2020, 11, 1–11. [Google Scholar] [CrossRef] [PubMed]
  16. Volkova, A.R.; Chernaya, M.; Vlasova, K.A. Experience of using insulin therapy with the closed loop method among patients with type 1 diabetes mellitus in Russia. In Proceedings of the Endocrine Abstracts, Virtual, UK, 5–9 September 2020; Volume 70. [Google Scholar]
  17. Gawrecki, A.; Zozulinska-Ziolkiewicz, D.; Michalak, M.A.; Adamska, A.; Michalak, M.; Frackowiak, U.; Flotynska, J.; Pietrzak, M.; Czapla, S.; Gehr, B.; et al. Safety and glycemic outcomes of do-it-yourself AndroidAPS hybrid closed-loop system in adults with type 1 diabetes. PLoS ONE 2021, 16, e0248965. [Google Scholar] [CrossRef] [PubMed]
  18. Jeyaventhan, R.; Gallen, G.; Choudhary, P.; Hussain, S. A Real-World Study of User Characteristics, Safety and Efficacy of Open-Source Closed-Loop Systems and Medtronic 670G. Diabetes Obes. Metab. 2021, 23, 1989–1994. [Google Scholar] [CrossRef] [PubMed]
  19. Greshake Tzovaras, B.; Angrist, M.; Arvai, K.; Dulaney, M.; Estrada-Galiñanes, V.; Gunderson, B.; Head, T.; Lewis, D.; Nov, O.; Shaer, O.; et al. Open Humans: A platform for participant-centered research and personal data exploration. GigaScience 2019, 8, giz076. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  20. Papadopoulos, A.; Salinas, J.; Crump, C. Computational modeling approaches to characterize risk and achieve safe, effective, and trusted designs in the development of artificial intelligence and autonomous closed-loop medical systems. In Proceedings of the Artificial Intelligence and Machine Learning for Multi-Domain Operations Applications III, Virtual, FL, USA, 12–17 April 2021; Volume 11746, p. 1174623. [Google Scholar]
  21. Rodríguez-Rodríguez, I.; Chatzigiannakis, I.; Rodríguez, J.V.; Maranghi, M.; Gentili, M.; Zamora-Izquierdo, M. Utility of big data in predicting short-term blood glucose levels in type 1 diabetes mellitus through machine learning techniques. Sensors 2019, 19, 4482. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  22. Mosquera-Lopez, C.; Dodier, R.; Tyler, N.; Resalat, N.; Jacobs, P. Leveraging a Big Dataset to Develop a Recurrent Neural Network to Predict Adverse Glycemic Events in Type 1 Diabetes. IEEE J. Biomed. Health Inform. 2019, in press. [Google Scholar] [CrossRef] [PubMed]
  23. Mosquera-Lopez, C.; Jacobs, P.G. Incorporating glucose variability into glucose forecasting accuracy assessment using the new glucose variability impact index and the prediction consistency index: An LSTM case example. J. Diabetes Sci. Technol. 2021, 16, 7–18. [Google Scholar] [CrossRef]
  24. Celebrating 10,000 donations to the Tidepool Big Data Donation Project. Available online: https://www.tidepool.org/blog/celebrating-10000-donations (accessed on 25 April 2022).
  25. Tidepool: Big Data Donation Project. Available online: https://www.tidepool.org/bigdata/ (accessed on 25 April 2022).
  26. Kovatchev, B.; Cobelli, C. Glucose variability: Timing, risk analysis, and relationship to hypoglycemia in diabetes. Diabetes Care 2016, 39, 502–510. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  27. DeVries, J.H. Glucose variability: Where it is important and how to measure it. Diabetes 2013, 62, 1405–1408. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  28. Shahid, A. Programming Scripts for Demographics and Glucose Variability Analysis for OpenAPS Data Commons Dataset. 2022. Available online: https://github.com/danamlewis/OpenHumansDataTools/tree/master/bin/GV-demographics-scripts (accessed on 25 April 2022).
  29. Kovatchev, B.P. Metrics for glycemic control—From HbA 1c to continuous glucose monitoring. Nat. Rev. Endocrinol. 2017, 13, 425–436. [Google Scholar] [CrossRef] [PubMed]
  30. Kovatchev, B.P.; Otto, E.; Cox, D.; Gonder-Frederick, L.; Clarke, W. Evaluation of a new measure of blood glucose variability in diabetes. Diabetes Care 2006, 29, 2433–2438. [Google Scholar] [CrossRef] [Green Version]
  31. McDonnell, C.; Donath, S.; Vidmar, S.; Werther, G.; Cameron, F. A novel approach to continuous glucose analysis utilizing glycemic variation. Diabetes Technol. Ther. 2005, 7, 253–263. [Google Scholar] [CrossRef] [PubMed]
  32. Service, F.J.; Molnar, G.D.; Rosevear, J.W.; Ackerman, E.; Gatewood, L.C.; Taylor, W.F. Mean amplitude of glycemic excursions, a measure of diabetic instability. Diabetes 1970, 19, 644–655. [Google Scholar] [CrossRef]
  33. Baghurst, P.A. Calculating the mean amplitude of glycemic excursion from continuous glucose monitoring data: An automated algorithm. Diabetes Technol. Ther. 2011, 13, 296–302. [Google Scholar] [CrossRef]
  34. Fritzsche, G.; Kohnert, K.D.; Heinke, P.; Vogt, L.; Salzsieder, E. The use of a computer program to calculate the mean amplitude of glycemic excursions. Diabetes Technol. Ther. 2011, 13, 319–325. [Google Scholar] [CrossRef]
  35. Yu, X.; Lin, L.; Shen, J.; Chen, Z.; Jian, J.; Li, B.; Xin, S.X. Calculating the mean amplitude of glycemic excursions from continuous glucose data using an open-code programmable algorithm based on the integer nonlinear method. Comput. Math. Methods Med. 2018, 2018, 6286893. [Google Scholar] [CrossRef]
  36. Bergenstal, R.M.; Beck, R.W.; Close, K.L.; Grunberger, G.; Sacks, D.B.; Kowalski, A.; Brown, A.S.; Heinemann, L.; Aleppo, G.; Ryan, D.B.; et al. Glucose management indicator (GMI): A new term for estimating A1C from continuous glucose monitoring. Diabetes Care 2018, 41, 2275–2280. [Google Scholar] [CrossRef] [Green Version]
  37. Bent, B.; Henriquez, M.; Dunn, J.P. Cgmquantify: Python and R Software Packages for Comprehensive Analysis of Interstitial Glucose and Glycemic Variability from Continuous Glucose Monitor Data. IEEE Open J. Eng. Med. Biol. 2021, 2, 263–266. [Google Scholar] [CrossRef] [PubMed]
  38. Saisho, Y.; Tanaka, C.; Tanaka, K.; Roberts, R.; Abe, T.; Tanaka, M.; Meguro, S.; Irie, J.; Kawai, T.; Itoh, H. Relationships among different glycemic variability indices obtained by continuous glucose monitoring. Prim. Care Diabetes 2015, 9, 290–296. [Google Scholar] [CrossRef] [PubMed]
  39. Rawlings, R.A.; Shi, H.; Yuan, L.H.; Brehm, W.; Pop-Busui, R.; Nelson, P.W. Translating Glucose Variability Metrics into the Clinic via C ontinuous G lucose M onitoring: AG raphical U ser I nterface for D iabetes E valuation (CGM-GUIDE©). Diabetes Technol. Ther. 2011, 13, 1241–1248. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  40. Attaye, I.; van der Vossen, E.W.; Mendes Bastos, D.N.; Nieuwdorp, M.; Levin, E. Introducing the Continuous Glucose Data Analysis (CGDA) R Package: An Intuitive Package to Analyze Continuous Glucose Monitoring Data. J. Diabetes Sci. Technol. 2022, in press. [Google Scholar] [CrossRef] [PubMed]
  41. Moscardó, V.; Giménez, M.; Oliver, N.; Hill, N.R. Updated software for automated assessment of glucose variability and quality of glycemic control in diabetes. Diabetes Technol. Ther. 2020, 22, 701–708. [Google Scholar] [CrossRef]
  42. Vigers, T.; Chan, C.L.; Snell-Bergeon, J.; Bjornstad, P.; Zeitler, P.S.; Forlenza, G.; Pyle, L. cgmanalysis: An R package for descriptive analysis of continuous glucose monitor data. PLoS ONE 2019, 14, e0216851. [Google Scholar] [CrossRef]
  43. Czerwoniuk, D.; Fendler, W.; Walenciak, L.; Mlynarski, W. GlyCulator: A glycemic variability calculation tool for continuous glucose monitoring data. J. Diabetes Sci. Technol. 2011, 5, 447–451. [Google Scholar] [CrossRef] [Green Version]
  44. Hirsch, I.B.; Brownlee, M. Should minimal blood glucose variability become the gold standard of glycemic control? J. Diabetes Its Complicat. 2005, 19, 178–181. [Google Scholar] [CrossRef]
  45. Cobelli, C.; Facchinetti, A. Yet another glucose variability index: Time for a paradigm change? Diabetes Technol. Ther. 2018, 20, 1–3. [Google Scholar] [CrossRef]
  46. Kilpatrick, E.S.; Rigby, A.S.; Atkin, S.L. The effect of glucose variability on the risk of microvascular complications in type 1 diabetes. Diabetes Care 2006, 29, 1486–1490. [Google Scholar] [CrossRef] [Green Version]
  47. Frontoni, S.; Di Bartolo, P.; Avogaro, A.; Bosi, E.; Paolisso, G.; Ceriello, A. Glucose variability: An emerging target for the treatment of diabetes mellitus. Diabetes Res. Clin. Pract. 2013, 102, 86–95. [Google Scholar] [CrossRef] [PubMed]
  48. Sechterberger, M.K.; Luijf, Y.M.; DeVries, J.H. Poor agreement of computerized calculators for mean amplitude of glycemic excursions. Diabetes Technol. Ther. 2014, 16, 72–75. [Google Scholar] [CrossRef] [PubMed]
  49. Fernandes, N.J.; Nguyen, N.; Chun, E.; Punjabi, N.M.; Gaynanova, I. Open-Source Algorithm to Calculate Mean Amplitude of Glycemic Excursions Using Short and Long Moving Averages. J. Diabetes Sci. Technol. 2022, 16, 576–577. [Google Scholar] [CrossRef] [PubMed]
  50. Marling, C.R.; Shubrook, J.H.; Vernier, S.J.; Wiley, M.T.; Schwartz, F.L. Characterizing blood glucose variability using new metrics with continuous glucose monitoring data. J. Diabetes Sci. Technol. 2011, 5, 871–878. [Google Scholar] [CrossRef] [Green Version]
  51. Buse, J.B.; Freeman, J.L.; Edelman, S.V.; Jovanovic, L.; McGill, J.B. Serum 1, 5-anhydroglucitol (GlycoMark™): A short-term glycemic marker. Diabetes Technol. Ther. 2003, 5, 355–363. [Google Scholar] [CrossRef]
  52. Dovc, K.; Cargnelutti, K.; Sturm, A.; Selb, J.; Bratina, N.; Battelino, T. Continuous glucose monitoring use and glucose variability in pre-school children with type 1 diabetes. Diabetes Res. Clin. Pract. 2019, 147, 76–80. [Google Scholar] [CrossRef] [Green Version]
  53. Service, F.J. Glucose variability. Diabetes 2013, 62, 1398–1404. [Google Scholar] [CrossRef] [Green Version]
  54. Siegelaar, S.E.; Holleman, F.; Hoekstra, J.B.; DeVries, J.H. Glucose variability; Does it matter? Endocr. Rev. 2010, 31, 171–182. [Google Scholar] [CrossRef] [Green Version]
  55. Nightscoutfoundation/Dataxfer: POC to Develop a Web-Based Data Transfer Tool from Nightscout DBS to Other Platforms. Available online: https://github.com/NightscoutFoundation/dataxfer (accessed on 25 April 2022).
  56. Lewis, D. OpenHumansDataTools. 2018. Available online: https://github.com/danamlewis/OpenHumansDataTools/blob/master/bin/unzip-split-csvify-OpenHumans-data.sh (accessed on 25 April 2022).
  57. Assessment, G. 6. Glycemic Targets: Standards of Medical Care in Diabetes—2022. Diabetes Care 2022, 45, S83. [Google Scholar]
  58. Lewis, D.M. Errors of commission or omission: The net risk safety analysis conversation we should be having around automated insulin delivery systems. Diabet. Med. 2021, 39, e14687. [Google Scholar] [CrossRef]
  59. Toeller, M.; Buyken, A.; Heitkamp, G.; Cathelineau, G.; Ferriss, B. Nutrient intakes as predictors of body weight in European people with type 1 diabetes. Int. J. Obes. 2001, 25, 1815–1822. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  60. Szadkowska, A.; Madej, A.; Ziólkowska, K.; Szymanska, M.; Jeziorny, K.; Mianowska, B.; Pietrzak, I. Gender and Age–Dependent effect of type 1 diabetes on obesity and altered body composition in young adults. Ann. Agric. Environ. Med. 2015, 22, 124–128. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  61. Asarani, N.; Reynolds, A.; Elbalshy, M.; Burnside, M.; de Bock, M.; Lewis, D.; Wheeler, B. Efficacy, safety, and user experience of DIY or open-source artificial pancreas systems: A systematic review. Acta Diabetol. 2021, 58, 539–547. [Google Scholar] [CrossRef]
  62. Palmer, W.; Greeley, S.A.W.; Letourneau-Freiberg, L.R.; Naylor, R.N. Using a do-it-yourself artificial pancreas: Perspectives from patients and diabetes providers. J. Diabetes Sci. Technol. 2020, 14, 860–867. [Google Scholar] [CrossRef] [PubMed]
  63. Kershenbaum, A.; Kershenbaum, A.; Tarabeia, J.; Stein, N.; Lavi, I.; Rennert, G. Unraveling seasonality in population averages: An examination of seasonal variation in glucose levels in diabetes patients using a large population-based data set. Chronobiol. Int. 2011, 28, 352–360. [Google Scholar] [CrossRef]
  64. Levy, C.; O’Malley, G.; Raghinaru, D.; Kudva, Y.C.; Laffel, L.M.; Pinsker, J.E.; Lum, J.; Brown, S. Insulin Delivery and Glucose Variability throughout the Menstrual Cycle on Closed Loop Control for Women with Type 1 Diabetes. Diabetes Technol. Ther. 2022, in press. [Google Scholar] [CrossRef] [PubMed]
  65. Herranz, L.; Saez-de Ibarra, L.; Hillman, N.; Gaspar, R.; Pallardo, L.F. Glycemic changes during menstrual cycles in women with type 1 diabetes. Med. Clin. 2016, 146, 287–291. [Google Scholar] [CrossRef]
  66. Mewes, D.; Wäldchen, M.; Knoll, C.; Raile, K.; Braune, K. Variability of Glycemic Outcomes and Insulin Requirements Throughout the Menstrual Cycle: A Qualitative Study on Women With Type 1 Diabetes Using an Open-Source Automated Insulin Delivery System. J. Diabetes Sci. Technol. 2022, in press. [Google Scholar] [CrossRef]
  67. Burnside, M.; Lewis, D.; Crocket, H.; Wilson, R.; Williman, J.; Jefferies, C.; Paul, R.; Wheeler, B.J.; de Bock, M. Create (community derived automated insulin delivery) trial. randomised parallel arm open label clinical trial comparing automated insulin delivery using a mobile controller (anydana-loop) with an open-source algorithm with sensor augmented pump therapy in type 1 diabetes. J. Diabetes Metab. Disord. 2020, 19, 1615–1629. [Google Scholar] [CrossRef]
  68. Lewis, D.M.; Leibrand, S.; Street, T.J.; Phatak, S.S. Detecting insulin sensitivity changes for individuals with type 1 diabetes. Diabetes 2018, 67, 79-LB. [Google Scholar] [CrossRef]
Figure 1. Glucose mean and distribution for insulin-requiring individuals using open-source AID systems. Total number of individuals (n) = 122. Average glucose mean and SD across the individuals is 139 ± 49.8.
Figure 1. Glucose mean and distribution for insulin-requiring individuals using open-source AID systems. Total number of individuals (n) = 122. Average glucose mean and SD across the individuals is 139 ± 49.8.
Nutrients 14 01906 g001
Figure 2. Analysis of Self-reported Demographic Attributes of Insulin-Requiring Individuals using AID Technologies. Overall data (male and female combined) are normalised using MinMaxScaler function provided by Python scikit-learn library. (a) Shows the highest correlation of 69% between weight and daily insulin units. Second-highest correlation is between weight and basal units equal to 61%. (b) The distribution of self-reported demographics with respect to gender shows a greater intake of carbs and basal in males as compared to females. Average insulin intake by males and females is 45.58 and 49 units, respectively.
Figure 2. Analysis of Self-reported Demographic Attributes of Insulin-Requiring Individuals using AID Technologies. Overall data (male and female combined) are normalised using MinMaxScaler function provided by Python scikit-learn library. (a) Shows the highest correlation of 69% between weight and daily insulin units. Second-highest correlation is between weight and basal units equal to 61%. (b) The distribution of self-reported demographics with respect to gender shows a greater intake of carbs and basal in males as compared to females. Average insulin intake by males and females is 45.58 and 49 units, respectively.
Nutrients 14 01906 g002
Figure 6. Average glucose mean and standard deviation for insulin-requiring individuals in OpenAPS Data Commons dataset during days of the week based on gender. Total number of males and females is 50 and 28, respectively.
Figure 6. Average glucose mean and standard deviation for insulin-requiring individuals in OpenAPS Data Commons dataset during days of the week based on gender. Total number of males and females is 50 and 28, respectively.
Nutrients 14 01906 g006
Figure 7. Average glucose mean and standard deviation for insulin-requiring individuals in OpenAPS Data Commons dataset during hours of a day based on gender. Total number of males and females is 50 and 28, respectively.
Figure 7. Average glucose mean and standard deviation for insulin-requiring individuals in OpenAPS Data Commons dataset during hours of a day based on gender. Total number of males and females is 50 and 28, respectively.
Nutrients 14 01906 g007
Figure 8. Average glucose mean and standard deviation for insulin-requiring individuals in OpenAPS Data Commons dataset during months of a year based on gender. Total number of males and females is 50 and 28, respectively. m1 represents January, m2 represents February, and similarly m12 represents December.
Figure 8. Average glucose mean and standard deviation for insulin-requiring individuals in OpenAPS Data Commons dataset during months of a year based on gender. Total number of males and females is 50 and 28, respectively. m1 represents January, m2 represents February, and similarly m12 represents December.
Nutrients 14 01906 g008
Figure 9. Average glucose mean and standard deviation for insulin-requiring individuals in OpenAPS Data Commons dataset during days of a month based on gender. Total number of males and females is 50 and 28, respectively.
Figure 9. Average glucose mean and standard deviation for insulin-requiring individuals in OpenAPS Data Commons dataset during days of a month based on gender. Total number of males and females is 50 and 28, respectively.
Nutrients 14 01906 g009
Table 1. Clinically Validated Glucose Variability Metrics.
Table 1. Clinically Validated Glucose Variability Metrics.
MetricAcronymDefinition
Average daily risk rangeADRRAssessment of overall total daily glucose variations within risk range [30]. Risk scores are defined relative to a target.
Continuous overall net glycemic actionCONGAA GV metric similar to standard deviation (SD) that assesses glucose fluctuations for a predetermined interval [31].
Mean amplitude of glycemic excursionMAGEMean of blood glucose values that exceed one SD from the 24 h mean blood glucose value [32]. Multiple implementations of automatically calculating MAGE are available in the literature [33,34,35].
Glycemic management indicatorGMIIndicates the expected mean hemoglobin A1C using mean glucose of individuals with diabetes [36].
High blood glucose indexHBGIA quantifying metric indicating the risk of hyperglycemia calculated using self-monitoring of blood glucose (SMBG) samples [30].
Low blood glucose indexLBGIA quantifying metric indicating the risk of hypoglycemia calculated using SMBG samples [30].
Coefficient of variationCVA statistical metric to compute the diversity of glucose data. Commonly used sub-metrics for glucose data include the interday and intraday CV in CGM data [37].
Glycemic variability metricJ_indexA quality assessment metric of glucose management using a combination of information from the mean and SD [38].
Time in rangeTIRA quantifiable metric to calculate the percentage of time spent within normal glucose levels, i.e., a target range defined between 70 mg/DL to 180 mg/dL.
Time outside rangeTORA quantifiable metric to calculate the percentage of time spent outside normal glucose levels, i.e., either less than 70 mg/DL or greater than 180 mg/dL.
Table 2. Count of self-reported demographics data.
Table 2. Count of self-reported demographics data.
Demographic FeaturesNumber of Available ReportsMissing Reports
Total Number of Individuals1220
Diagnosed Date1220
Date of Pump Use10319
Date of CGM Use10517
Date of Closed Loop Initiation10220
Open-Source AID Type10715
Date of Birth1175
Country1211
Weight1184
Height1193
Total Daily Insulin Units1148
Daily Basal Insulin Units1193
Total Daily Carbs10517
Last HbA1C1166
Last A1C Date1166
Gender7844
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Shahid, A.; Lewis, D.M. Large-Scale Data Analysis for Glucose Variability Outcomes with Open-Source Automated Insulin Delivery Systems. Nutrients 2022, 14, 1906. https://doi.org/10.3390/nu14091906

AMA Style

Shahid A, Lewis DM. Large-Scale Data Analysis for Glucose Variability Outcomes with Open-Source Automated Insulin Delivery Systems. Nutrients. 2022; 14(9):1906. https://doi.org/10.3390/nu14091906

Chicago/Turabian Style

Shahid, Arsalan, and Dana M. Lewis. 2022. "Large-Scale Data Analysis for Glucose Variability Outcomes with Open-Source Automated Insulin Delivery Systems" Nutrients 14, no. 9: 1906. https://doi.org/10.3390/nu14091906

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop