1. Introduction
Time series (TS) is a sequence of data points indexed by time, typically collected at consistent intervals [
1]. TS analysis involves developing methodologies to extract meaningful insights and patterns from this temporal data [
2,
3]. A key area within TS analysis is time series forecasting (TSF), which focuses on projecting historical trends and patterns into future time stamps [
4]. TSF has a wide range of applications across various fields, including healthcare [
5]. This drives extensive research efforts aimed at establishing robust forecasting frameworks [
6,
7,
8]. Numerous studies have successfully developed robust TSF models across diverse domains, incorporating classical techniques such as autoregressive models, moving averages, exponential smoothing, and autoregressive integrated moving average [
9,
10,
11,
12,
13,
14].
Traditional TSF approaches, while widely utilised, face several limitations, including challenges in tuning, a heavy reliance on domain expertise, restricted capacity to capture complex patterns, and limited ability to handle outliers and missing values [
15,
16]. As a result, there has been a shift toward more advanced algorithms, particularly machine learning (ML), to enhance TSF capabilities [
17,
18,
19,
20]. Various ML techniques, such as penalised linear methods and nonlinear regression trees, have been explored for developing more effective TSF models [
21,
22,
23,
24].
Deep learning (DL), a subset of ML, has proven to be a powerful tool for complex computational tasks [
25,
26,
27,
28,
29,
30]. Its capacity to model nonlinear dynamics has made DL particularly effective in TSF applications [
31,
32]. Furthermore, advancements in data collection technology have led to the accumulation of large time series datasets, a crucial resource for DL analysis [
33,
34]. Consequently, studies have employed DL architectures like recurrent neural networks (RNNs), convolutional neural networks (CNNs), and transformers to develop robust TSF models [
35,
36,
37,
38,
39,
40]. Despite the demonstrated success of DL-based TSF models, further refinement and improvement remain critical, driving the development of more advanced interdependent techniques with enhanced analytical potential beyond independent techniques.
One such advancement is the incorporation of adversarial learning (AL) [
41]. Initially designed for generating image data, AL represents an intricate DL training framework consisting of two subnetworks: a generator and a discriminator. The generator aims to produce realistic data, while the discriminator distinguishes between real and synthetic data. These subnetworks engage in an adversarial loop, where the generator progressively improves its ability to produce convincing outputs, and the discriminator becomes more adept at distinguishing real from synthetic data [
42].
Building on its success in image synthesis, AL has been adapted for time series data generation and forecasting [
43,
44,
45,
46]. In these adaptations, a main generator predicts future values based on historical data, while a discriminator evaluates the similarity between the predicted values and the actual data [
47,
48,
49,
50,
51,
52]. These innovations have established AL as a promising tool for improving TSF models [
41].
Enhancing advanced techniques like AL is essential for addressing real-world challenges, such as predicting blood glucose (BG) levels in individuals with type 1 diabetes (T1D) [
53]. This TSF task is particularly complex [
54], and continuous improvements are critical [
55] for effective diabetes management [
56,
57,
58]. Accurate BG predictions (BGP) can help reduce both acute and chronic complications associated with T1D [
59,
60]. However, the volatility of BG values poses significant challenges, underscoring the need for more advanced models capable of providing accurate and reliable forecasting. At the same time, the widespread use of wearable devices, such as continuous glucose monitoring (CGM) systems, enables the automated collection of vast datasets necessary for training such advanced models [
61]. This technological development creates an ideal environment for the application of advanced techniques like AL, making them increasingly valuable for BGP [
62].
BGP is also closely connected to personalised medicine, because glucose dynamics, glycaemic variability, and treatment responses differ substantially across individuals with T1D. As a result, clinically useful forecasting systems should not only be accurate in general but should also support subject-specific prediction and individualised glycaemic risk management using patient-level CGM data.
In this context, this paper introduces an interdependent learning framework for blood glucose prediction that combines adversarial and collaborative optimisation. The study is designed to address two methodological questions: first, whether adversarial training alone improves sequence-to-sequence blood glucose prediction relative to independent learning; and second, whether augmenting adversarial training with a collaborative post-horizon objective improves the temporal relevance of predictions. To this end, we evaluate four frameworks—independent learning (IL), adversarial learning (AL), collaborative learning (CL), and adversarial collaborative learning (ACL)—under identical preprocessing, data partitioning, optimisation, and evaluation conditions. In addition to this controlled internal comparison, the best-performing proposed framework is also benchmarked against established approaches reported in the literature, including classical statistical, machine-learning, feed-forward neural network, and recurrent neural network baselines. This design allows the contribution of the proposed learning strategy to be assessed separately from differences in data handling or training conditions. To assess the framework under practically relevant forecasting conditions, experiments were conducted for two prediction horizons, 30 min and 60 min ahead. These horizons were selected because they represent short-term forecasting intervals that are highly relevant in diabetes management: 30 min supports near-term anticipatory decision-making, while 60 min provides a more challenging but clinically valuable longer warning window.
2. Adversarial Learning in Time Series
This section provides an overview of representative recent work on adversarial learning (AL) and deep learning for time-series analysis and forecasting. In addition to individual application studies, recent survey papers have highlighted the rapid expansion of deep learning for time-series forecasting, including recurrent, convolutional, and transformer-based models, as well as the growing role of generative adversarial approaches in modelling, forecasting, augmentation, and imputation. For a broader overview of these developments, several dedicated reviews are already available in the literature [
41,
63,
64,
65].
The article [
66] presents a promising new approach for modelling financial TS data using AL. The article first argues that traditional models, such as autoregressive or moving average techniques, struggle to capture the complex dynamics of financial data. Then, in response, the work proposes using AL and shows that this approach outperforms traditional models in terms of accuracy and ability to capture complex patterns in the data. Additionally, the article explores potential applications of AL in various financial domains, including fraud detection.
Another article [
67] proposes a novel DL methodology that employs AL for anomaly detection in TS data. The authors assess the efficacy of the method on benchmark datasets and demonstrate its superior performance over several state-of-the-art anomaly detection techniques. This research introduces a promising solution for detecting anomalies in TS data, with potential applications across various domains.
A different study proposes a new approach to addressing the problem of missing values in multivariate TS data [
68]. The authors introduce a specialized AL architecture designed for imputing TS data. The evaluation analysis of real-world datasets demonstrates that the model outperforms several existing imputation methods. This research offers a promising solution to the challenge of imputing missing values in multivariate time series data that has practical applications in diverse fields, such as finance, healthcare, and environmental monitoring.
Another research article [
69] introduces a method for predicting hourly photovoltaic power output using a conditional AL. The proposed approach addresses the issue of limited training data by using conditional AL to generate synthetic data, which helps to increase the size of the training dataset. A DL model then uses the augmented dataset to predict the future output. The study compares the performance of this approach with other popular TSF methods and finds that it outperforms them in terms of accuracy. The new method’s potential for improving the accuracy of photovoltaic power forecasting could play a crucial role in integrating renewable energy sources into the power grid.
Article [
70] introduces a new approach for TS prediction and classification using a combination of AL and recurrent neural networks with an attention mechanism. The recurrent neural network unit is used to analyse the temporal patterns in the data, while the AL unit is employed to create synthetic data that can be used to enhance the training dataset. Comparative analysis with other prevalent methods for TS prediction and classification demonstrates the superior accuracy and efficiency of this novel approach.
Finally, ref. [
43] presents a new method for TS resampling that addresses the issue of unevenly spaced data. The proposed method uses AL to generate synthetic data that can fill in the gaps between the original data points. AL is trained to learn the statistical patterns of the original data and then used to generate synthetic data points that complete the missing values. The study compares the performance of this method with other popular techniques used for resampling TS data, and the results show that it is more accurate. The authors conclude that this new approach has the potential to improve the accuracy of time series resampling.
More broadly, recent deep learning studies in time-series forecasting have shown that forecasting performance can benefit from increasingly expressive architectures, including recurrent, convolutional, hybrid, and transformer-based models. In parallel, review papers on GANs for time series have shown that adversarial approaches are being used not only for direct forecasting, but also for sequence generation, augmentation, imputation, anomaly detection, and representation learning. Within healthcare and glucose forecasting specifically, recent studies have explored recurrent and multitask forecasting models, as well as GAN-based modelling of CGM-related signals. These developments confirm both the momentum of deep-learning-based forecasting and the emerging relevance of adversarial techniques for biomedical time series.
In conclusion, AL has found application in certain TSF domains. However, their use in BGP, where reliable forecasting is vital for diabetes management, remains underexplored. Additionally, conventional adversarial learning often yields suboptimal results in BGP, focusing primarily on adversarial loss within a limited prediction horizon and frequently neglecting longer-term trends that are clinically critical.
It is important to distinguish the proposed approach from existing GAN-based sequence models and multi-step forecasting strategies. In standard adversarial TSF frameworks, the discriminator evaluates whether predicted sequences within the PH are distributionally consistent with real sequences. This adversarial objective improves realism but does not explicitly encourage predictions to be informative of future trends beyond the PH boundary. Multi-step forecasting approaches, on the other hand, extend the prediction range by directly forecasting additional future steps; however, they treat the extended horizon as an end in itself, optimising accuracy across the entire extended window simultaneously.
The proposed collaborative component operates on a fundamentally different principle. Rather than extending the PH or modifying the adversarial objective, it introduces an auxiliary regressor that consumes the primary regressor’s PH predictions and uses them to forecast post-PH values. Critically, during training, the collaborative loss is backpropagated through the auxiliary regressor to the primary regressor, compelling it to embed extended temporal information within its original PH outputs. The primary regressor’s PH length and output remain unchanged at inference; the improvement arises because the training process has shaped its predictions to be more temporally coherent. This indirect enrichment of PH predictions through post-PH supervision is, to our knowledge, a novel training strategy that has not been explored in adversarial TSF literature.
3. Learning Systems Architecture
In this research, we develop and evaluate four learning frameworks for blood glucose prediction (BGP): independent learning (IL), adversarial learning (AL), collaborative learning (CL), and adversarial collaborative learning (ACL). These frameworks are built on three core components: a primary regressor, an auxiliary discriminator, and an auxiliary regressor. The novelty of the proposed framework lies in the learning strategy rather than in introducing a new sequence backbone. Standard GAN-based forecasting typically couples a predictor or generator with a discriminator so that predicted sequences better match the distribution of the target horizon. By contrast, our framework adds an auxiliary regressor that receives the predicted horizon and is trained to forecast the subsequent post-horizon window. This creates a collaborative objective that regularises the primary regressor toward predictions that are not only accurate and distributionally plausible within the prediction horizon, but also informative for what happens immediately afterwards. For clarity, this differs from ordinary direct multi-step forecasting, in which a single model is trained to predict a longer horizon directly; here, the beyond-horizon requirement is imposed indirectly through an interacting auxiliary regressor.
The primary regressor is responsible for the main task of BGP—processing a specified length of historical data and predicting future data within a predefined PH. The sequence-to-sequence forecasting capability of the primary regressor independently allows for extended insights when paired with auxiliary components. In this study, a multilayer perceptron (MLP), well-known for its efficiency in BGP [
71], is utilised as the primary regressor. This choice was deliberate. The aim of this paper is to evaluate the proposed learning framework rather than to maximise performance through increasingly complex temporal backbones. After reframing the CGM series into fixed history-to-horizon input-output pairs, the ordered lag values already encode short-term temporal structure, allowing a lightweight MLP to serve as a stable and interpretable baseline for testing the effect of adversarial and collaborative learning. In addition, simpler architectures reduce optimisation instability in adversarial settings and enable a fair like-for-like comparison across IL, AL, CL, and ACL. The architecture of the primary regressor includes an input layer, followed by a 50-unit dense layer, a 20-unit dense layer, and a final dense output layer, with the number of units in the output layer determined by the length of PH. The ReLU activation function is employed for all layers.
The auxiliary discriminator assesses the congruence between the actual sequences within the PH and those predicted by the primary regressor. This component processes a sequence that includes a specified length of genuine historical data, real or predicted PH data, and real post-PH data to determine whether the PH data is actual or predicted. A convolutional neural network (CNN), known for its classification capacity [
72], was selected for the auxiliary discriminator. This network comprises an input layer, a 20-unit Conv1D layer, a 10-unit Conv1D layer, and a single-unit dense output layer with a sigmoid activation function, using binary cross-entropy as the loss function.
The auxiliary regressor plays a critical role in extending and validating the forecasting capacity of the primary regressor. It evaluates the predicted sequences by the primary regressor and uses them to project future trends beyond the immediate PH. The post-PH period is designed to be equal in length to the PH for consistency. By optimising collaborative loss at this stage, the auxiliary regressor component enables the primary regressor to return predictions that better predict future trends. To isolate the effect of the collaborative objective from the effect of changing model family, the auxiliary regressor was also implemented as an MLP with the same architecture as the primary regressor.
The primary regressor serves as the central component in all frameworks, while the auxiliary components—the discriminator and regressor—interconnect with the primary regressor to perform supplementary tasks. When paired with the auxiliary discriminator, the primary regressor engages in an adversarial interaction, generating sequences that not only optimise prediction accuracy but also challenge the discriminator’s ability to classify data correctly. In contrast, when linked with the auxiliary regressor, the primary regressor collaborates to produce sequences that minimise prediction error while enhancing the auxiliary regressor’s forecasting performance. Depending on whether the primary regressor operates independently or in conjunction with the auxiliary components, one independent and three interdependent frameworks are formed, as detailed below.
For clarity, the four frameworks can be summarised as follows. IL uses only the primary regressor and serves as the conventional baseline. AL connects the primary regressor to an auxiliary discriminator, introducing adversarial supervision within the prediction horizon. CL connects the primary regressor to an auxiliary regressor, introducing collaborative supervision through post-horizon forecasting. ACL combines both auxiliary components so that the primary regressor is trained under both adversarial and collaborative interactions.
3.1. Independent Learning
In the IL framework, the primary regressor operates alone, without the integration of any auxiliary components; similar to conventional BGP approaches, this method involves optimising a standard regression loss given in Equation (1).
where
: system loss,
: predictive loss, E(a, b): error between a and b,
: real prediction horizon sequence,
: real history sequence,
PR: primary regressor,
PR(
a): primary regressor’s evaluation of
a.
3.2. Adversarial Learning
As shown in
Figure 1a, within the AL framework, the primary regressor is integrated with an auxiliary discriminator, and both components undergo simultaneous training. During training, the primary regressor interacts adversarially with the auxiliary discriminator. The loss functions in Equations (2) and (3) are optimised for the auxiliary discriminator and the primary regressor, respectively. These loss functions instruct the discriminator to classify real data as 0 and generated data as 1, while the primary regressor is trained to generate outputs that both enhance prediction accuracy and reduce the discriminator’s ability to classify generated data accurately. The system’s overall loss, as shown in Equation (4), is overall predictive and adversarial loss.
where
: predictive loss,
: adversarial loss,
: system loss,
AD: auxiliary discriminator,
E(
a,
b): error between
a and
b,
AD(
a): auxiliary discriminator’s evaluation of
a,
: real prediction horizon sequence,
: synthesised prediction horizon sequence,
: real history sequence,
PR: primary regressor,
PR(
a): primary regressor’s evaluation of
a.
In practice, the adversarial framework in this study was trained using a fixed and consistent optimisation procedure across all scenarios. The primary regressor and auxiliary discriminator were updated jointly under the same epoch schedule, using ADAM optimiser. We intentionally adopted lightweight and stable subnetworks for this interaction, namely an MLP-based regressor and a CNN-based discriminator, in order to reduce optimisation volatility and maintain a balanced adversarial process. No additional adversarial stabilisation mechanisms, such as gradient penalties or specialised alternating update schedules, were introduced; rather, stability was pursued through architectural simplicity, consistent optimisation settings, and repeated-run evaluation.
3.3. Collaborative Learning
In the CL framework (
Figure 1b), the primary regressor is connected to the auxiliary regressor. Both modules undergo simultaneous training, during which the primary regressor collaborates with the auxiliary regressor. The loss functions in Equations (5) and (6) are used for the auxiliary regressor and primary regressor, respectively. The primary regressor not only optimises its performance by minimising PH prediction errors but also improves the auxiliary regressor’s post-PH forecasting performance. The system optimises the predictive loss and collaborative loss as shown in Equation (7).
where
: predictive loss,
: collaborative loss,
: system loss, E(a, b): error between a and b
: real prediction horizon sequence,
: synthesised prediction horizon sequence,
AR: auxiliary regressor,
AR(
a): auxiliary regressor’s evaluation of
a,
: real history sequence,
: real post prediction horizon sequence,
PR: primary regressor,
PR(
a): primary regressor’s evaluation of
a.
3.4. Adversarial Collaborative Learning
The ACL framework (
Figure 1c) integrates the primary regressor with both the auxiliary discriminator and auxiliary regressor. In this setting, all three modules are trained together, with the primary regressor engaging in both adversarial and collaborative interactions. The loss functions in Equations (8)–(11) are used for the auxiliary discriminator, auxiliary regressor, and primary regressor, respectively. The primary regressor learns to generate sequences that not only optimise prediction accuracy but also degrade the discriminator’s performance and improve the auxiliary regressor’s forecasting ability. The system’s overall loss function is the combination of the three losses as shown in Equation (11). In the present implementation, the ACL objective is formed as an unweighted combination of predictive, adversarial, and collaborative losses, as defined in Equation (11). No additional weighting coefficients were introduced to rebalance these three components. This choice was made deliberately to preserve a controlled comparison with IL, AL, and CL and to evaluate the effect of combining the three learning signals without framework-specific loss tuning.
where
: predictive loss,
: adversarial loss,
: collaborative loss,
: system loss, AD: auxiliary discriminator, E(a, b): error between a and b, AD(a): auxiliary discriminator’s evaluation of a,
: real prediction horizon sequence,
: synthesised prediction horizon sequence,
AR: auxiliary regressor,
AR(
a): auxiliary regressor’s evaluation of
a,
: real history sequence,
: real post prediction horizon sequence,
PR: primary regressor,
PR(
a): primary regressor’s evaluation of a.
In ACL, all three components were trained under a unified optimisation schedule using the same optimiser and training hyperparameters across all experiments. This design was chosen to keep the comparison with IL, AL, and CL controlled and transparent. The objective was not to maximise adversarial sophistication through additional regularisation heuristics, but to examine whether the collaborative interaction can improve the learning dynamics and predictive utility of the primary regressor under a stable and consistent training setting.
4. Blood Glucose Prediction
To develop BGP systems, Ohio T1D datasets were utilised. The data were first preprocessed to meet the specific requirements of the problem space. Following this, the primary regressor was trained using the three proposed interdependent frameworks, alongside the conventional independent learning approach. The resulting systems underwent rigorous evaluation to assess their performance.
4.1. Dataset Description
For developing BGP systems, this work investigates two Ohio T1D datasets [
73]. The Ohio T1D datasets are a reputable benchmark in the field of BGP, known for their comprehensiveness and real-world applicability, making it a robust and conclusive dataset for evaluating models in this domain [
74,
75,
76,
77,
78,
79]. Each dataset encompasses eight weeks’ worth of diabetes-related attributes for a cohort of six individuals with T1D [
73]. The first dataset compiles data for four females and two males aged between 40 and 60 years [
73]. It was released in 2018 for the first BGP challenge [
73]. The second dataset comprises data for one female and five males within the age range of 20–80 years old [
73]. This dataset was disseminated for the second BGP challenge in 2020 [
73]. Hereafter, this paper refers to the former dataset as Ohio T1D 2018 and the latter as Ohio T1D 2020.
In line with prior univariate BGP studies, this work uses the CGM modality from the Ohio T1D datasets [
80]. CGM values were collected every five minutes using Enlite continuous glucose monitoring sensors (Medtronic MiniMed, Northridge, CA, USA) [
73]. Following the standard dataset protocol, the last 10 days of data for each individual were used as the testing set, while the preceding 46 days were used as the training set [
73]. All models were trained exclusively on the training portion, and the testing portion remained completely unseen until evaluation.
Table 1 summarises key statistical properties of the CGM data, while a fuller description is available in the original dataset documentation [
73].
Although the Ohio T1D datasets are widely used benchmarks in BGP research, the cohort size remains relatively limited. Across the two dataset releases, the present study evaluates the proposed framework on a modest number of contributors, each with an individual-specific training and testing split. Consequently, the evaluation is well suited to assessing temporal generalisation within each person, because models are trained on earlier observations and tested on later unseen observations from the same individual. However, this design does not constitute a leave-one-patient-out or cross-patient generalisation study, and the findings should therefore be interpreted primarily as evidence of subject-specific predictive effectiveness on this benchmark.
4.2. Data Preprocessing
This subsection reports the preprocessing analysis operated on the Ohio T1D datasets before proceeding with the BGP modelling phase.
4.2.1. Missing Value Imputation
In the first stage of the preprocessing, missing CGM values are handled. Linear interpolation is implemented to fill in missing values in the training set. However, missing values in the testing set are imputed utilising linear extrapolation. This technique avoids information leakage by ensuring that the systems do not use future information during evaluation. Accordingly, the resulting systems remain suitable for real-time prediction.
4.2.2. Problem Reframing
The next preprocessing stage is translating the sequence-to-sequence BGP task to a supervised ML problem. For this purpose, a window with the length of history plus PH is rolled over the CGM series, creating a set of associated vectors. Each vector is then split into pairs of input and output sequences according to the length of history and PH. This operation renders a subset of associated input and output sequences necessary for supervised ML [
81,
82,
83]. In this study, two prediction horizons (PHs) were investigated: 30 min and 60 min ahead. Considering the five-minute sampling frequency of CGM values in the Ohio T1D datasets, these correspond to forecasting 6 and 12 future timesteps, respectively. To exemplify, a rolling window with a length of 90 min forms a set of vectors for BGP 30 min in advance from 60 min of lag observations. Then, each vector is subdivided so that the first 60 min are used as the input sequence and the final 30 min as the associated output sequence. These two PHs were chosen to examine model behaviour under both a near-term clinically actionable setting and a more demanding longer-horizon forecasting setting.
4.3. Evaluation Analyses
The predictive performance of the developed systems is rigorously evaluated from both mathematical and clinical perspectives. Given the complex nature of BGP, a variety of evaluation metrics are essential for a comprehensive assessment, ensuring that both prediction accuracy and clinical relevance are thoroughly considered.
4.3.1. Mathematical Evaluation
BGP errors are measured using three widely used regression metrics: root mean square error (
RMSE), mean absolute error (
MAE), and mean absolute percentage error (
MAPE), given in Equations (12)–(14), respectively. Moreover, the coefficient of determination (
r2), given in Equation (15), is used to quantify the agreement between the reference and predicted
BG levels.
where
N denotes the number of samples in the testing set,
denotes the observed blood glucose value at the
n-th sample,
denotes the predicted blood glucose value at the
-th sample, and
denotes the mean observed blood glucose value across the testing set.
4.3.2. Clinical Evaluation
The following evaluation criteria were employed to assess the performance of the generated systems from the clinical point of view.
Surveillance Error (SE) is a metric developed to quantify the clinical risk associated with errors in BGP. It assigns a numerical value to each prediction, reflecting the potential clinical impact of any inaccuracies. The calculation of SE is comprehensive, taking into account the magnitude and direction of the prediction error, as well as its possible health implications. For a detailed breakdown of the SE calculation methodology, readers are directed to the thorough explanation provided in the original article cited as [
84]. Succinctly, 0 < SE < 0.5 reflects no clinical risk, 0.5 < SE < 1.5 slight clinical risk, 1.5 < SE < 2.5 moderate clinical risk, 2.5 < SE < 3.5 high clinical risk, and 3.5 < SE extreme clinical risk. This work utilises the percentage of predictions with no clinical risk (SE < 0.5) and the average surveillance error (ASE) for predictions across the entire testing set as evaluation metrics.
The Matthews correlation coefficient (
MCC) is a robust statistical measure employed to evaluate the quality of binary classifications. It excels in scenarios where data distribution across two classes is unbalanced, considering true and false positives and negatives to provide a balanced metric. This attribute makes
MCC particularly valuable in delivering accurate insights into classification models, especially when precision and recall are pivotal [
85]. In this work,
MCC is calculated as Equation (16) to score the fulfilment of the systems’ predictions in correctly prognosticating the occurrence of adverse glycaemic events (BG < 70 mg/dL or >180 mg/dL) as opposed to euglycaemic events (70 mg/dL < BG < 180 mg/dL).
where
TP (true positive) represents the number of adverse glycaemic events foresaw truly by the BGP system,
TN (true negative) represents the number of euglycaemic events foresaw truly,
FP (false positive) represents the number of adverse glycaemic events foresaw falsely, and
FN (false negative) represents the number of euglycaemic events foresaw falsely.
Taken together, these clinical metrics complement the mathematical error measures by indicating whether numerical improvements are likely to translate into safer and more useful predictions in practice. Lower SE and ASE values imply that prediction errors are less clinically hazardous, while a higher percentage of predictions with SE < 0.5 indicates that more forecasts fall within the no-risk region. Likewise, higher MCC values indicate improved ability to distinguish adverse glycaemic events from euglycaemic states, which is important for timely preventive action against hypo- and hyperglycaemia.
4.3.3. Statistical Analysis
A notable limitation in many AI-driven BGP techniques is the lack of rigorous statistical evaluation, which is crucial for accurate model comparison. In BGP, unique challenges arise from the diversity of evaluation metrics and glycaemic variability across individuals, underscoring the need for robust statistical methods. This work addresses these challenges by implementing a comprehensive statistical analysis framework for BGP model comparison.
We first applied the non-parametric Friedman test [
86] to globally assess model performance at a 5% significance level. The Friedman test was chosen for its suitability in comparing model outcomes across multiple scenarios without the assumption of normality [
87], which is required in parametric tests like ANOVA [
88]. The null hypothesis of the Friedman test posits that performance distributions across models are identical.
Upon detecting statistically significant differences in the global analysis, local pairwise comparisons were conducted using the Nemenyi test. To mitigate the risk of Type I errors in these multiple comparisons, we applied the Holm–Bonferroni correction. Finally, the results were visually represented using a ranking critical difference (CD) diagram, which illustrates statistically significant differences between models.
4.3.4. Experimental Protocol
For each Ohio T1D dataset, experiments were conducted for all six individuals and for two prediction horizons (30 min and 60 min). In every scenario, four learning systems—IL, AL, CL, and ACL—were evaluated using the same training/testing split and the same preprocessing pipeline, so that performance differences could be attributed primarily to the learning framework. Each configuration was repeated ten times to account for stochastic variability, and results are reported as mean ± standard deviation across runs.
4.3.5. Optimisation and Training Stability
All models were trained under a shared optimisation setting using the ADAM optimiser for 600 epochs, with batch size 128 and learning rate 0.002. The same training hyperparameters were used across all frameworks to avoid framework-specific tuning effects. These hyperparameters were selected as a common optimisation configuration and were intentionally kept fixed across all experiments. No framework-specific hyperparameter search was performed, because the objective of this study was to compare the learning strategies under controlled training conditions rather than to maximise the performance of each framework through separate tuning.
For AL and ACL, the interacting subnetworks were trained simultaneously under this common schedule. To support stable optimisation, the study employed lightweight and comparatively robust component models, namely MLP-based regressors and a CNN-based discriminator. Rather than relying on additional adversarial stabilisation techniques, stability was promoted through architectural simplicity, consistent optimisation settings, and repeated-run evaluation.
5. Results and Discussion
This section presents the outcomes of the evaluation analyses for generated BGP systems and the associated discussions. Because the developed algorithms are stochastic, each system was run ten times, and the results are reported as mean ± standard deviation (SD). In all training scenarios, the ADAM optimiser is utilised, with an epoch size of 600, a batch size of 128, and a learning rate of 0.002.
5.1. Evaluation Results
Table 2 and
Table 3 display the evaluation results for systems created using the Ohio T1D 2018 and Ohio T1D 2020 datasets, respectively, under two prediction horizons: 30 min and 60 min ahead. Each table is compartmentalised with the results of 12 scenarios, i.e., BGP modelling for six data contributors in two PHs. For each scenario, four systems are created: one using the independent learning approach and three using the proposed interdependent learning frameworks.
Monochromatic colour coding is applied in the tables to visualise intra-scenario comparisons between learning platforms. To this end, the cells are shaded using four grey colours, where the darkest to lightest colour codes the outcomes for each metric from the first to the fourth rank.
An inter-framework comparative analysis is performed, considering all evaluation metrics in all scenarios. To do so,
Figure 2 graphically summarises the rankings for each learning framework as derived from the aggregated data in
Table 2 and
Table 3. The figure visually encapsulates the performance of each framework across all evaluated metrics and scenarios, thereby forming the empirical basis for the subsequent statistical scrutiny. ACL is shown to lead in performance, with the majority of its evaluations falling in the top two rankings, yielding an average rank of 2.14. This is followed by CL, AL, and IL, with average ranks of 2.22, 2.49, and 3.04, respectively.
The results of the rank distribution, depicted in
Figure 2, were further evaluated using the Friedman test, a non-parametric alternative to ANOVA that does not assume normality in the data. The Friedman test rejected the null hypothesis that all models have an identical distribution of ranks, with a
p-value of 0.0265 at a significance level of 0.05. Consequently, we applied the Nemenyi post hoc test, complemented by critical CD analysis, as shown in
Figure 3. In the CD diagram, the ranking of each model and the critical significance interval are displayed. Based on the results, the ACL model exhibited the best overall performance. The use of the collaborative unit in both CL and ACL models resulted in statistically significant improvements compared to models without the collaborative unit, specifically IL and AL.
Overall, the results emphasise the efficacy of the proposed collaborative component in advancing BGP compared to traditional adversarial and independent learning mechanisms. While adversarial interactions between the primary regressor and the discriminator ensure that the primary regressor produces sequences adhering to data distribution within the PH, this approach focuses solely on minimising adversarial loss. It does not consider how well the predictions reflect future trends beyond the PH. Specifically, the generated sequences are optimised to reduce dissimilarity with real CGM sequences within the PH but do not account for patterns predictive of future values.
In contrast, the collaborative relationship between the primary regressor and the auxiliary regressor adds an additional layer of evaluation to the primary regressor’s outputs. This interaction directs the primary regressor to produce sequences that incorporate greater knowledge of future trends. As a result, the trained primary regressor generates sequences that not only minimise prediction errors but also possess enhanced future predictivity. Through the dual interaction with both the adversarial and collaborative components, the primary regressor learns to generate sequences with optimal prediction errors, which are more consistent with the data dynamics within the PH and hold better predictive power for future trends.
Adversarial optimisation is known to be sensitive to instability; therefore, the present study intentionally adopted a restrained training design. Rather than introducing multiple adversarial regularisation heuristics, we used lightweight MLP and CNN subnetworks, a fixed optimisation procedure with ADAM, and the same hyperparameter settings across all frameworks. In AL and ACL, the interacting modules were trained simultaneously under this shared schedule. Furthermore, each experiment was repeated ten times and summarised as mean ± SD, which helps reveal whether performance is consistently reproducible despite stochastic optimisation. Within this controlled setting, the observed results indicate that the proposed frameworks can be trained in a stable and reliable manner for the Ohio T1D prediction tasks considered here.
Although recurrent and transformer-based architectures are often attractive for time-series modelling, this study intentionally adopts MLP-based regressors to isolate the contribution of the proposed learning strategy under a controlled architectural setting. Given the fixed-window univariate CGM formulation considered here, the MLP provides a computationally efficient and stable baseline while avoiding additional architectural confounding in the comparison among IL, AL, CL, and ACL. Moreover, the benchmark comparison in
Table 4 shows that the resulting ACL framework remains competitive with an LSTM baseline. Nevertheless, more expressive temporal backbones may further strengthen the proposed framework and should be investigated in future work.
From a computational perspective, the proposed framework was intentionally designed to remain relatively lightweight. Although ACL is more demanding during training than IL, AL, or CL because it jointly optimises the primary regressor, auxiliary discriminator, and auxiliary regressor, the use of compact MLP- and CNN-based subnetworks helps keep this additional cost manageable. Importantly, the richer multi-component interaction is primarily a training-stage mechanism. At inference time, blood glucose forecasts are generated from fixed-length CGM input windows by the trained primary regressor, which supports efficient deployment. This distinction between a more involved training phase and a comparatively lightweight inference phase strengthens the practical feasibility of the proposed framework.
5.2. Benchmark Comparison with Established Baselines
To complement the controlled comparison among IL, AL, CL, and ACL, we also compare ACL against established blood glucose prediction baselines using the benchmark design reported in [
89]. This benchmark includes a naive persistence baseline, classical statistical models, machine-learning models, and neural-network baselines, thereby positioning the proposed framework relative to widely used alternatives. This external comparison is intended to contextualise the proposed learning strategy beyond the within-study framework comparison.
Table 4 reports the performance of ACL alongside nine benchmark approaches on the Ohio T1D datasets, following the experimental design described in [
89]. These baselines include a naive persistence model, classical statistical approaches, machine-learning methods, feed-forward neural networks, and an LSTM model. The comparison therefore provides an external reference point for assessing whether the proposed framework offers practical advantages beyond the within-study ablation analysis.
According to the benchmark results, ACL remains highly competitive against established BGP models and achieves the best overall average rank in the subsequent statistical comparison. To provide a more thorough evaluation, statistical analysis was conducted, similar to the approach described earlier. This involved ranking the models based on their average performance across all scenarios. Following this, a global statistical analysis was performed, using the Friedman test, to determine whether there was a significant difference in the performance of the models. The test decisively indicated the existence of statistically significant differences among the models.
Subsequently, a local pairwise statistical analysis using the Nemenyi test was conducted to further examine these differences. The outcome of the critical difference (CD) analysis is illustrated in
Figure 4 where the rankings and significance intervals of each model are shown. From the diagram, it is evident that the ACL model outperforms all other models with an average rank of 1.75. Notably, the performance of ACL is statistically significantly better than that of the other models, as shown by its separation from the others in the CD diagram.
5.3. Clinical Impact
BGP represents a critical advancement in managing T1D, enabling more informed insulin dosing and other therapeutic interventions. The strong performance of the proposed ACL system in minimising prediction errors indicates its promise for improving the reliability of BGP on the Ohio T1D benchmark. Because the evaluation covers multiple individuals with differing glycaemic characteristics, the findings suggest that the framework is adaptable across varied subject profiles within this dataset. However, given the limited cohort size and the subject-specific evaluation design, these results should be interpreted cautiously and not as definitive evidence of broad population-level generalisability. Although the Ohio T1D datasets provide a recognised and clinically relevant benchmark, broader generalisation to external clinical cohorts remains to be established. Independent datasets may differ in patient demographics, glycaemic variability, treatment routines, sensor characteristics, and missingness patterns, all of which can influence predictive performance. In addition, real-world CGM streams may present further challenges such as sensor noise, temporary signal dropouts, calibration-related variability, and workflow-related latency. Therefore, the present findings should be interpreted as strong benchmark evidence, while validation on external clinical datasets and prospective real-world CGM streams remains an important next step.
When integrated with CGM systems, these predictive models provide real-time, personalised insights into glucose trends, supporting more precise and proactive diabetes management. Improved glycaemic control, reduced frequency of complications, and ultimately a higher quality of life for T1D patients could result from this approach. Furthermore, the adaptability of the proposed ACL system to individual patient data supports its role as an AI-enabled personalised medicine approach for subject-specific glucose forecasting and more tailored diabetes management. Incorporating these advanced models into clinical practice may enable healthcare providers to enhance the safety and efficacy of diabetes management, ultimately benefiting patient outcomes and potentially reducing the healthcare burden associated with T1D.
In a practical implementation, integration with CGM systems would involve continuously updating a rolling history window as new sensor measurements arrive and applying the trained primary regressor to generate the next 30 min or 60 min forecast. This workflow is compatible with real-time monitoring because the model operates on regularly sampled CGM inputs and does not require future information at inference. Nevertheless, translation into deployed CGM-supported decision systems would require prospective validation under live streaming conditions, where issues such as sensor interruptions, delayed readings, and user-specific behavioural variability may affect operational performance.
6. Conclusions
This work examined adversarial learning for blood glucose prediction, an area that remains underexplored despite the promise of adversarial approaches in other time-series forecasting domains. Traditional adversarial forecasting frameworks rely on a predictor-discriminator interaction to improve forecast realism within the target horizon. In contrast, the proposed method introduces an additional collaborative interaction through an auxiliary regressor that evaluates whether the predicted horizon remains useful for forecasting the immediately following post-horizon segment. The core novelty of ACL therefore lies in combining adversarial within-horizon supervision with collaborative post-horizon supervision in a single interdependent framework, rather than in merely extending a standard GAN-based predictor or adopting a conventional longer-horizon forecasting objective.
Experiments were conducted using two publicly available and well-known Ohio T1D datasets. The systems were rigorously evaluated through both mathematical regression metrics and clinically relevant performance measures. The results consistently demonstrated the effectiveness of the proposed architectures, showcasing their ability to handle the complexities of BGP. In addition to the controlled comparison among the four proposed learning frameworks, the study also benchmarked ACL against established blood glucose prediction models reported in the literature, including statistical approaches, machine-learning models, feed-forward neural networks, and an LSTM baseline. This broader comparison showed that ACL remains competitive with strong existing approaches while achieving the best overall average rank in the benchmark statistical analysis. These findings strengthen the evidence that the proposed learning strategy contributes meaningfully beyond the internal framework variants examined in this work.
In summary, the findings of this study underscore the potential of the enhanced adversarial learning frameworks to provide alternative learning mechanisms that surpass traditional approaches. Beyond methodological advancement, this work contributes to personalised medicine by supporting more individualized blood glucose forecasting from CGM data, with potential value for patient-specific monitoring, glycaemic risk management, and future AI-enabled diabetes decision support
7. Future Work and Potential Improvements
Several promising avenues remain for further development and improvement of this work, each addressing opportunities to expand on current limitations and deepen the impact of the proposed approach.
Effective data fusion in TSF is complex, particularly when incorporating diverse modalities from various sources. Adapting the proposed methods in this study to handle multivariate TSF is a logical next step. Future work could expand the proposed framework by introducing a conditional ACL system, where auxiliary variables are integrated as conditional inputs. This would allow for a comprehensive understanding of complex TS patterns where multiple influencing factors are present.
This work establishes a novel learning strategy specifically tailored for BGP. Given its potential, future studies should explore its application in other TSF domains where comparable volatility and complexity are present, such as energy demand forecasting, healthcare monitoring, or financial trends. In addition, while this study intentionally prioritised evaluation of the learning strategy under stable and lightweight architectures, future research could investigate replacing the MLP-based regressors with recurrent or transformer-based backbones to determine whether stronger temporal encoders further enhance the proposed ACL framework. Identifying model variants best suited for each domain could improve both the generalisability and performance of the approach.
A further consideration is the limited cohort size of the Ohio T1D datasets. While these datasets are established and relevant benchmarks, they include a relatively small number of individuals, and the present experimental design evaluates each model on held-out future data from the same individual rather than on entirely unseen patients. Accordingly, the reported findings provide strong evidence of within-patient temporal generalisation on a recognised benchmark, but broader clinical generalisation should be confirmed in future studies using larger and more diverse cohorts and dedicated cross-patient validation protocols.
8. Data and Code Availability
Instructions on attaining the Ohio T1D datasets can be found at this address. Also, we have made our source codes accessible on this Gitlab repository. For these implementations, we scripted in Python (3.6.7) [
90]. The third-party libraries used include TensorFlow [
91], Keras [
92], Pandas [
93], NumPy [
94], Sklearn [
95], and statsmodels [
96]. Training was conducted on an Apple MacBook Pro () equipped with an Apple M1 Pro chip, 16 GB unified memory, and the integrated Apple M1 Pro GPU. Under this configuration, training a single ACL model for one subject-horizon scenario required approximately 20 min, while inference on a single input window was comparatively lightweight because deployment relies only on the trained primary regressor.
Author Contributions
H.K.: conceptualisation, methodology, software, validation, formal analysis, investigation, data curation, writing the original draft, review and editing, visualisation. H.N.: conceptualisation, methodology, software, validation, formal analysis, investigation, data curation, review and editing. J.E.: conceptualisation, project administration, resources, validation, review & editing, supervision. M.B.: conceptualisation, methodology, validation, investigation, review and editing, supervision. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Institutional Review Board Statement
Ethical review and approval were not required for this study, as it relies exclusively on the publicly available Ohio T1D Dataset, which was previously collected, anonymised, and made available for research purposes by the original data custodians. No new data were collected from human participants as part of this study.
Informed Consent Statement
Not applicable.
Data Availability Statement
No new data were created or analyzed in this study.
Conflicts of Interest
The authors declare no conflicts of interest.
References
- Maçaira, P.M.; Tavares Thomé, A.M.; Cyrino Oliveira, F.L.; Carvalho Ferrer, A.L. Time series analysis with explanatory variables: A systematic literature review. Environ. Model. Softw. 2018, 107, 199–209. [Google Scholar] [CrossRef]
- Zou, Y.; Donner, R.V.; Marwan, N.; Donges, J.F.; Kurths, J. Complex network approaches to nonlinear time series analysis. Phys. Rep. 2019, 787, 1–97. [Google Scholar] [CrossRef]
- Broomhead, D.S.; Jones, R. Time-series analysis. Proc. R. Soc. Lond. A Math. Phys. Sci. 1989, 423, 103–121. [Google Scholar] [CrossRef]
- Sibeijn, M.; Pequito, S. A time-reversed model selection approach to time series forecasting. Sci. Rep. 2022, 12, 10912. [Google Scholar] [CrossRef]
- Meisenbacher, S.; Turowski, M.; Phipps, K.; Rätz, M.; Müller, D.; Hagenmeyer, V.; Mikut, R. Review of automated time series forecasting pipelines. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 2022, 12, e1475. [Google Scholar] [CrossRef]
- Athiyarath, S.; Paul, M.; Krishnaswamy, S. A Comparative Study and Analysis of Time Series Forecasting Techniques. SN Comput. Sci. 2020, 1, 175. [Google Scholar] [CrossRef]
- Aijaz, I.; Agarwal, P. A Study on Time Series Forecasting using Hybridization of Time Series Models and Neural Networks. Recent Adv. Comput. Sci. Commun. 2019, 13, 827–832. [Google Scholar] [CrossRef]
- Semenoglou, A.-A.; Spiliotis, E.; Makridakis, S.; Assimakopoulos, V. Investigating the accuracy of cross-learning time series forecasting methods. Int. J. Forecast. 2021, 37, 1072–1084. [Google Scholar] [CrossRef]
- Aamir, M.; Shabri, A. Modelling and forecasting monthly crude oil price of Pakistan: A comparative study of ARIMA, GARCH and ARIMA Kalman model. AIP Conf. Proc. 2016, 1750, 060015. [Google Scholar] [CrossRef]
- Khan, S.; Alghulaiakh, H. ARIMA Model for Accurate Time Series Stocks Forecasting. IJACSA Int. J. Adv. Comput. Sci. Appl. 2020, 11, 524–528. Available online: www.ijacsa.thesai.org (accessed on 10 March 2023). [CrossRef]
- Ab Razak, N.H.; Aris, A.Z.; Ramli, M.F.; Looi, L.J.; Juahir, H. Temporal flood incidence forecasting for Segamat River (Malaysia) using autoregressive integrated moving average modelling. J. Flood Risk Manag. 2018, 11, S794–S804. [Google Scholar] [CrossRef]
- Das, R.; Middya, A.I.; Roy, S. High granular and short term time series forecasting of PM 2.5 air pollutant—A comparative review. Artif. Intell. Rev. 2022, 55, 1253–1287. [Google Scholar] [CrossRef]
- Zhao, X.; Chen, Y.; Xu, C.; Dharmawan, P.A.S.; Indradewi, A.A.D. Double exponential smoothing brown method towards sales forecasting system with a linear and non-stationary data trend. J. Phys. Conf. Ser. 2021, 1810, 012026. [Google Scholar] [CrossRef]
- Sulandari, W.; Suhartono; Subanar; Rodrigues, P.C. Exponential Smoothing on Modeling and Forecasting Multiple Seasonal Time Series: An Overview. Fluct. Noise Lett. 2021, 20, 2130003. [Google Scholar] [CrossRef]
- Xie, J.; Wang, Q. Benchmarking Machine Learning Algorithms on Blood Glucose Prediction for Type i Diabetes in Comparison with Classical Time-Series Models. IEEE Trans. Biomed. Eng. 2020, 67, 3101–3124. [Google Scholar] [CrossRef]
- Karthikeswaren, R.; Kayathwal, K.; Dhama, G.; Arora, A. A Survey on Classical and Deep Learning based Intermittent Time Series Forecasting Methods. In Proceedings of the 2021 International Joint Conference on Neural Networks (IJCNN), Shenzhen, China, 18–22 July 2021. [Google Scholar] [CrossRef]
- Cheng, W.; Wang, Y.; Peng, Z.; Ren, X.; Shuai, Y.; Zang, S.; Liu, H.; Cheng, H.; Wu, J. High-efficiency chaotic time series prediction based on time convolution neural network. Chaos Solitons Fractals 2021, 152, 111304. [Google Scholar] [CrossRef]
- Tealab, A. Time series forecasting using artificial neural networks methodologies: A systematic review. Future Comput. Inform. J. 2018, 3, 334–340. [Google Scholar] [CrossRef]
- Ramadevi, B.; Bingi, K. Chaotic Time Series Forecasting Approaches Using Machine Learning Techniques: A Review. Symmetry 2022, 14, 955. [Google Scholar] [CrossRef]
- Ensafi, Y.; Amin, S.H.; Zhang, G.; Shah, B. Time-series forecasting of seasonal items sales using machine learning—A comparative analysis. Int. J. Inf. Manag. Data Insights 2022, 2, 100058. [Google Scholar] [CrossRef]
- Syrgkanis, V.; Zampetakis, M. Estimation and Inference with Trees and Forests in High Dimensions. In Proceedings of the Thirty Third Conference on Learning Theory; Abernethy, J., Agarwal, S., Eds.; Proceedings of Machine Learning Research; PMLR: London, UK, 2020; Volume 125, pp. 3453–3454. Available online: https://proceedings.mlr.press/v125/syrgkanis20a.html (accessed on 10 March 2023).
- Konzen, E.; Ziegelmann, F.A. LASSO-Type Penalties for Covariate Selection and Forecasting in Time Series. J. Forecast. 2016, 35, 592–612. [Google Scholar] [CrossRef]
- Corsi, F. A Simple Approximate Long-Memory Model of Realized Volatility. J. Financ. Econom. 2009, 7, 174–196. [Google Scholar] [CrossRef]
- Masini, R.P.; Medeiros, M.C.; Mendes, E.F. Machine learning advances for time series forecasting. J. Econ. Surv. 2023, 37, 76–111. [Google Scholar] [CrossRef]
- Semenoglou, A.-A.; Spiliotis, E.; Assimakopoulos, V. Image-based time series forecasting: A deep convolutional neural network approach. Neural Netw. 2023, 157, 39–53. [Google Scholar] [CrossRef] [PubMed]
- Garg, A.; Zhang, W.; Samaran, J.; Savitha, R.; Foo, C.S. An Evaluation of Anomaly Detection and Diagnosis in Multivariate Time Series. IEEE Trans. Neural Netw. Learn. Syst. 2022, 33, 2508–2517. [Google Scholar] [CrossRef] [PubMed]
- De Oliveira, J.F.L.; Silva, E.G.; De Mattos Neto, P.S.G. A Hybrid System Based on Dynamic Selection for Time Series Forecasting. IEEE Trans. Neural Netw. Learn. Syst. 2022, 33, 3251–3263. [Google Scholar] [CrossRef]
- Cichos, F.; Gustavsson, K.; Mehlig, B.; Volpe, G. Machine learning for active matter. Nat. Mach. Intell. 2020, 2, 94–103. [Google Scholar] [CrossRef]
- Lim, B.; Zohren, S. Time-series forecasting with deep learning: A survey. Philos. Trans. R. Soc. A 2021, 379, 20200209. [Google Scholar] [CrossRef]
- Ismail Fawaz, H.; Forestier, G.; Weber, J.; Idoumghar, L.; Muller, P.A. Deep learning for time series classification: A review. Data Min. Knowl. Discov. 2019, 33, 917–963. [Google Scholar] [CrossRef]
- Torres, J.F.; Hadjout, D.; Sebaa, A.; Martínez-Álvarez, F.; Troncoso, A. Deep Learning for Time Series Forecasting: A Survey. Big Data 2021, 9, 3–21. [Google Scholar] [CrossRef]
- Lara-Benítez, P.; Carranza-García, M.; Riquelme, J.C. An Experimental Review on Deep Learning Architectures for Time Series Forecasting. Int. J. Neural Syst. 2021, 31, 2130001. [Google Scholar] [CrossRef]
- Liang, W.; Tadesse, G.A.; Ho, D.; Li, F.-F.; Zaharia, M.; Zhang, C.; Zou, J. Advances, challenges and opportunities in creating data for trustworthy AI. Nat. Mach. Intell. 2022, 4, 669–677. [Google Scholar] [CrossRef]
- Sun, A.Y.; Scanlon, B.R. How can Big Data and machine learning benefit environment and water management: A survey of methods, applications, and future directions. Environ. Res. Lett. 2019, 14, 073001. [Google Scholar] [CrossRef]
- Zhou, H.; Zhang, S.; Peng, J.; Zhang, S.; Li, J.; Xiong, H.; Zhang, W. Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting. In 35th AAAI Conference on Artificial Intelligence, AAAI 2021; Association for Computing Machinery: New York, NY, USA, 2021; Volume 12B, pp. 11106–11115. [Google Scholar] [CrossRef]
- Lazcano, A.; Herrera, P.J.; Monge, M. A Combined Model Based on Recurrent Neural Networks and Graph Convolutional Networks for Financial Time Series Forecasting. Mathematics 2023, 11, 224. [Google Scholar] [CrossRef]
- Gasparin, A.; Lukovic, S.; Alippi, C. Deep learning for time series forecasting: The electric load case. CAAI Trans. Intell. Technol. 2022, 7, 1–25. [Google Scholar] [CrossRef]
- Du, L.; Gao, R.; Suganthan, P.N.; Wang, D.Z.W. Bayesian optimization based dynamic ensemble for time series forecasting. Inf. Sci. 2022, 591, 155–175. [Google Scholar] [CrossRef]
- Alassafi, M.O.; Jarrah, M.; Alotaibi, R. Time series predicting of COVID-19 based on deep learning. Neurocomputing 2022, 468, 335–344. [Google Scholar] [CrossRef]
- Rahimilarki, R.; Gao, Z.; Jin, N.; Zhang, A. Convolutional neural network fault classification based on time-series analysis for benchmark wind turbine machine. Renew. Energy 2022, 185, 916–931. [Google Scholar] [CrossRef]
- Brophy, E.; Wang, Z.; She, Q.; Ward, T. Generative Adversarial Networks in Time Series: A Systematic Literature Review. ACM Comput. Surv. 2023, 55, 199. [Google Scholar] [CrossRef]
- Goodfellow, I.; Pouget-Abadie, J.; Mirza, M.; Xu, B.; Warde-Farley, D.; Ozair, S.; Courville, A.; Bengio, Y. Generative Adversarial Nets. In Advances in Neural Information Processing Systems; NeurIPS Proceedings: San Diego, CA, USA, 2014; pp. 2672–2680. Available online: https://arxiv.org/abs/1406.2661v1 (accessed on 9 September 2022).
- Dahl, C.M.; Sørensen, E.N. Time series (re)sampling using Generative Adversarial Networks. Neural Netw. 2022, 156, 95–107. [Google Scholar] [CrossRef]
- Yoon, J.; Jarrett, D.; van der Schaar, M. Time-series Generative Adversarial Networks. Adv. Neural Inf. Process. Syst. 2019, 32, 5508–5518. [Google Scholar]
- Cichosz, S.L.; Xylander, A.A.P. A Conditional Generative Adversarial Network for Synthesis of Continuous Glucose Monitoring Signals. J. Diabetes Sci. Technol. 2021, 16, 1220–1223. [Google Scholar] [CrossRef] [PubMed]
- Li, H.; Xu, Y.; Ke, D.; Su, K. μ-law SGAN for generating spectra with more details in speech enhancement. Neural Netw. 2021, 136, 17–27. [Google Scholar] [CrossRef] [PubMed]
- Zhou, X.; Pan, Z.; Hu, G.; Tang, S.; Zhao, C. Stock Market Prediction on High-Frequency Data Using Generative Adversarial Nets. Math. Probl. Eng. 2018, 2018, 4907423. [Google Scholar] [CrossRef]
- Rüttgers, M.; Lee, S.; Jeon, S.; You, D. Prediction of a typhoon track using a generative adversarial network and satellite images. Sci. Rep. 2019, 9, 6057. [Google Scholar] [CrossRef]
- Koochali, A.; Schichtel, P.; Dengel, A.; Ahmed, S. Probabilistic Forecasting of Sensory Data with Generative Adversarial Networks—ForGAN. IEEE Access 2019, 7, 63868–63880. [Google Scholar] [CrossRef]
- Han, J.; Wang, C. SSR-TVD: Spatial Super-Resolution for Time-Varying Data Analysis and Visualization. IEEE Trans. Vis. Comput. Graph. 2022, 28, 2445–2456. [Google Scholar] [CrossRef] [PubMed]
- Elazab, A.; Wang, C.; Gardezi, S.J.S.; Bai, H.; Hu, Q.; Wang, T.; Chang, C.; Lei, B. GP-GAN: Brain tumor growth prediction using stacked 3D generative adversarial networks from longitudinal MR Images. Neural Netw. 2020, 132, 321–332. [Google Scholar] [CrossRef]
- Chu, J.; Dong, W.; Huang, Z. Endpoint prediction of heart failure using electronic health records. J. Biomed. Inform. 2020, 109, 103518. [Google Scholar] [CrossRef]
- Cheng, C.; Sa-Ngasoongsong, A.; Beyca, O.; Le, T.; Yang, H.; Kong, Z.; Bukkapatnam, S.T. Time series forecasting for nonlinear and non-stationary processes: A review and comparative study. IIE Trans. 2015, 47, 1053–1071. [Google Scholar] [CrossRef]
- Zhu, T.; Wang, W.; Yu, M. A novel blood glucose time series prediction framework based on a novel signal decomposition method. Chaos Solitons Fractals 2022, 164, 112673. [Google Scholar] [CrossRef]
- Khadem, H.; Nemat, H.; Elliott, J.; Benaissa, M. In vitro glucose measurement from NIR and MIR spectroscopy: Comprehensive benchmark of machine learning and filtering chemometrics. Heliyon 2024, 10, e30981. [Google Scholar] [CrossRef]
- Woldaregay, A.Z.; Årsand, E.; Botsis, T.; Albers, D.; Mamykina, L.; Hartvigsen, G. Data-Driven Blood Glucose Pattern Classification and Anomalies Detection: Machine-Learning Applications in Type 1 Diabetes. J. Med. Internet Res. 2019, 21, e11030. [Google Scholar] [CrossRef] [PubMed]
- Khadem, H.; Nemat, H.; Elliott, J.; Benaissa, M. Multi-Lag Stacking for Blood Glucose Level Prediction. In Knowledge Discovery in Healthcare Data; CEUR-Workshop Proceedings: Aachen, Germany, 2020; Volume 2675, pp. 146–150. [Google Scholar]
- Khadem, H.; Nemat, H.; Elliott, J.; Benaissa, M. Blood Glucose Level Time Series Forecasting: Nested Deep Ensemble Learning Lag Fusion. Bioengineering 2023, 10, 487. [Google Scholar] [CrossRef] [PubMed]
- Marcus, Y.; Eldor, R.; Yaron, M.; Shaklai, S.; Ish-Shalom, M.; Shefer, G.; Stern, N.; Golan, N.; Dvir, A.Z.; Pele, O.; et al. Improving blood glucose level predictability using machine learning. Diabetes Metab. Res. Rev. 2020, 36, e3348. [Google Scholar] [CrossRef] [PubMed]
- Nemat, H.; Khadem, H.; Eissa, M.R.; Elliott, J.; Benaissa, M. Blood Glucose Level Prediction: Advanced Deep-Ensemble Learning Approach. IEEE J. Biomed. Health Inform. 2022, 26, 2758–2769. [Google Scholar] [CrossRef]
- Woldaregay, A.Z.; Årsand, E.; Walderhaug, S.; Albers, D.; Mamykina, L.; Botsis, T.; Hartvigsen, G. Data-driven modeling and prediction of blood glucose dynamics: Machine learning applications in type 1 diabetes. Artif. Intell. Med. 2019, 98, 109–134. [Google Scholar] [CrossRef]
- Zhu, T.; Yao, X.; Li, K.; Herrero, P.; Georgiou, P. Blood glucose prediction for type 1 diabetes using generative adversarial networks. CEUR Workshop Proc. 2020, 2675, 90–94. [Google Scholar]
- Zhang, D.; Ma, M.; Xia, L. A comprehensive review on GANs for time-series signals. Neural Comput. Appl. 2022, 34, 3551–3571. [Google Scholar] [CrossRef]
- Brophy, E.; Wang, Z.; She, Q.; Ward, T. Generative adversarial networks in time series: A survey and taxonomy. arXiv 2021, arXiv:2107.11098. [Google Scholar] [CrossRef]
- Festag, S.; Denzler, J.; Spreckelsen, C. Generative adversarial networks for biomedical time series forecasting and imputation: A systematic review. J. Biomed. Inform. 2022, 129, 104058. [Google Scholar] [CrossRef]
- Takahashi, S.; Chen, Y.; Tanaka-Ishii, K. Modeling financial time-series with generative adversarial networks. Phys. A Stat. Mech. Its Appl. 2019, 527, 121261. [Google Scholar] [CrossRef]
- Geiger, A.; Liu, D.; Alnegheimish, S.; Cuesta-Infante, A.; Veeramachaneni, K. TadGAN: Time Series Anomaly Detection Using Generative Adversarial Networks. In Proceedings of the 2020 IEEE International Conference on Big Data, Big Data 2020; IEEE: New York, NY, USA, 2020; pp. 33–43. [Google Scholar] [CrossRef]
- Luo, Y.; Cai, X.; Zhang, Y.; Xu, J.; Yuan, X. Multivariate Time Series Imputation with Generative Adversarial Networks. In 32nd Conference on Neural Information Processing Systems; NeurIPS Proceedings: San Diego, CA, USA, 2018. [Google Scholar]
- Huang, X.; Li, Q.; Tai, Y.; Chen, Z.; Liu, J.; Shi, J.; Liu, W. Time series forecasting for hourly photovoltaic power using conditional generative adversarial network and Bi-LSTM. Energy 2022, 246, 123403. [Google Scholar] [CrossRef]
- Zhou, K.; Wang, W.; Hu, T.; Deng, K. Time Series Forecasting and Classification Models Based on Recurrent with Attention Mechanism and Generative Adversarial Networks. Sensors 2020, 20, 7211. [Google Scholar] [CrossRef] [PubMed]
- Hua, Y.; Zhao, Z.; Li, R.; Chen, X.; Liu, Z.; Zhang, H. Deep Learning with Long Short-Term Memory for Time Series Prediction. IEEE Commun. Mag. 2019, 57, 114–119. [Google Scholar] [CrossRef]
- Liu, C.L.; Hsaio, W.H.; Tu, Y.C. Time Series Classification with Multivariate Convolutional Neural Network. IEEE Trans. Ind. Electron. 2019, 66, 4788–4797. [Google Scholar] [CrossRef]
- Marling, C.; Bunescu, R. The OhioT1DM Dataset for Blood Glucose Level Prediction: Update 2020. In International Workshop on Knowledge Discovery in Healthcare Data; CEUR Workshop Proceedings: Aachen, Germany, 2020; Volume 2675, pp. 71–74. [Google Scholar]
- Daniels, J.; Herrero, P.; Georgiou, P. A Multitask Learning Approach to Personalized Blood Glucose Prediction. IEEE J. Biomed. Health Inform. 2022, 26, 436–445. [Google Scholar] [CrossRef]
- Yang, T.; Yu, X.; Ma, N.; Wu, R.; Li, H. An autonomous channel deep learning framework for blood glucose prediction. Appl. Soft Comput. 2022, 120, 108636. [Google Scholar] [CrossRef]
- Zhu, T.; Li, K.; Chen, J.; Herrero, P.; Georgiou, P. Dilated Recurrent Neural Networks for Glucose Forecasting in Type 1 Diabetes. J. Healthc. Inform. Res. 2020, 4, 308–324. [Google Scholar] [CrossRef]
- Martinsson, J.; Schliep, A.; Eliasson, B.; Mogren, O. Blood Glucose Prediction with Variance Estimation Using Recurrent Neural Networks. J. Healthc. Inform. Res. 2020, 4, 1–18. [Google Scholar] [CrossRef]
- Shuvo, M.M.H.; Islam, S.K. Deep Multitask Learning by Stacked Long Short-Term Memory for Predicting Personalized Blood Glucose Concentration. IEEE J. Biomed. Health Inform. 2023, 27, 1612–1623. [Google Scholar] [CrossRef]
- Nemat, H.; Khadem, H.; Elliott, J.; Benaissa, M. Causality analysis in type 1 diabetes mellitus with application to blood glucose level prediction. Comput. Biol. Med. 2023, 153, 106535. [Google Scholar] [CrossRef]
- Jeon, J.; Leimbigler, P.J.; Baruah, G.; Li, M.H.; Fossat, Y.; Whitehead, A.J. Predicting Glycaemia in Type 1 Diabetes Patients: Experiments in Feature Engineering and Data Imputation. J. Healthc. Inform. Res. 2019, 4, 71–90. [Google Scholar] [CrossRef] [PubMed]
- Nemat, H.; Khadem, H.; Elliott, J.; Benaissa, M. Physical Activity Integration in Blood Glucose Level Prediction: Different Levels of Data Fusion. IEEE J. Biomed. Health Inform. 2024, 29, 1397–1408. [Google Scholar] [CrossRef] [PubMed]
- Souza, R.C. Practical Time Series Analysis Prediction with Statistics and Machine Learning; O’Reilly Media: Santa Rosa, CA, USA, 2001; Volume 21. [Google Scholar]
- Brownlee, J. Time Series Forecasting as Supervised Learning. 2016. Available online: https://machinelearningmastery.com/time-series-forecasting-supervised-learning (accessed on 30 June 2025).
- Klonoff, D.C.; Lias, C.; Vigersky, R.; Clarke, W.; Parkes, J.L.; Sacks, D.B.; Kirkman, M.S.; Kovatchev, B.; Panel, T.E.G.; Aurand, L.; et al. The surveillance error grid. J. Diabetes Sci. Technol. 2014, 8, 658–672. [Google Scholar] [CrossRef] [PubMed]
- Zhu, Q. On the performance of Matthews correlation coefficient (MCC) for imbalanced dataset. Pattern Recognit. Lett. 2020, 136, 71–80. [Google Scholar] [CrossRef]
- Someetheram, V.; Marsani, M.F.; Kasihmuddin, M.S.M.; Zamri, N.E.; Sidik, S.S.M.; Jamaludin, S.Z.M.; Mansor, M.A. Random Maximum 2 Satisfiability Logic in Discrete Hopfield Neural Network Incorporating Improved Election Algorithm. Mathematics 2022, 10, 4734. [Google Scholar] [CrossRef]
- Friedman, M. A Comparison of Alternative Tests of Significance for the Problem of m Rankings on JSTOR. Ann. Math. Stat. 1940, 11, 86–92. [Google Scholar] [CrossRef]
- Fisher, R. Statistical Methods and Scientific Induction. J. R. Stat. Soc. Ser. B 1955, 17, 69–78. [Google Scholar] [CrossRef]
- De Bois, M.; Yacoubi, M.A.E.; Ammi, M. GLYFE: Review and benchmark of personalized glucose predictive models in type 1 diabetes. Med. Biol. Eng. Comput. 2022, 60, 1–17. [Google Scholar] [CrossRef]
- Van Rossum, G.; Drake, F.L. Python 3 Reference Manual; CreateSpace: Scotts Valley, CA, USA, 2009. [Google Scholar]
- Abadi, M.; Barham, P.; Chen, J.; Chen, Z.; Davis, A.; Dean, J.; Devin, M.; Ghemawat, S.; Irving, G.; Isard, M.; et al. Tensorflow: A system for large-scale machine learning. In 12th Symposium on Operating Systems Design and Implementation; USENIX Association: Berkeley, CA, USA, 2016; pp. 265–283. [Google Scholar]
- Chollet, F. Keras. GitHub. 2015. Available online: https://github.com/keras-team/keras (accessed on 10 March 2024).
- McKinney, W. Data Structures for Statistical Computing in Python. In Proceedings of the 9th Python in Science Conference; SciPy: Austin, TX, USA, 2010; pp. 56–61. [Google Scholar] [CrossRef]
- Harris, C.R.; Millman, K.J.; van der Walt, S.J.; Gommers, R.; Virtanen, P.; Cournapeau, D.; Wieser, E.; Taylor, J.; Berg, S.; Smith, N.J.; et al. Array programming with {NumPy}. Nature 2020, 585, 357–362. [Google Scholar] [CrossRef]
- Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; et al. Scikit-learn: Machine learning in Python. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar]
- Seabold, S.; Perktold, J. Statsmodels: Econometric and statistical modeling with python. In Proceedings of the 9th Python in Science Conference, Austin, TX, USA, 28 June–3 July 2010. [Google Scholar]
| Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |