A Calibrated Multi-Task Ensemble Architecture for Biomedical Risk Prediction
Abstract
1. Introduction
2. Materials and Methods
2.1. Data Preparation
2.2. Hyperparameter Architecture and Configuration of the CMSE
2.3. Description of the Calibrated Multitask Stacking Ensemble
3. Results
3.1. Descriptive Analysis of Glycemic Indicators
3.2. Epidemiological Patterns in Obesity and Age
3.3. Correlation Structures Across Biochemical, Lifestyle, and Therapeutic Features
3.4. SHAP-Based Interpretability
3.5. Predictive Performance: Holdout Evaluation
4. Discussion
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Kim, S.Y. Explainable AI-Based Clinical Decision Support Systems: Frameworks, Methods, and Applications. Appl. Sci. 2024, 14, 6638. [Google Scholar] [CrossRef]
- Saee Sun, H.; Saeedi, P.; Karuranga, S.; Pinkepank, M.; Ogurtsova, K.; Duncan, B.B.; Stein, C.; Basit, A.; Chan, J.C.N.; Mbanya, J.C.; et al. IDF Diabetes Atlas: Global, regional and country-level estimates of diabetes prevalence for 2021 and projections for 2045. Diabetes Res. Clin. Pract. 2022, 183, 109119. [Google Scholar] [CrossRef] [PubMed]
- Lundberg, S.M.; Erion, G.; Chen, H.; DeGrave, A.; Prutkin, J.M.; Nair, B.; Katz, R.; Himmelfarb, J.; Bansal, N.; Lee, S.-I. From Local Explanations to Global Understanding with Explainable AI for Trees. Nat. Mach. Intell. 2020, 2, 56–67. [Google Scholar] [CrossRef] [PubMed]
- Tonekaboni, S.; Joshi, S.; McCradden, M.D.; Goldenberg, A. What Clinicians Want: Contextualizing Explainable Machine Learning for Clinical End Use. Proc. Mach. Learn. Res. 2019, 106, 359–380. [Google Scholar]
- Centers for Disease Control and Prevention (CDC). National Health and Nutrition Examination Survey (NHANES) 2017–March 2020 Pre-Pandemic Data Documentation. Updated 2023. Available online: https://www.cdc.gov/nchs/nhanes/ (accessed on 10 January 2026).
- Zhang, Y.; Liu, X.; Zhang, X.; Fei, Y.; Li, X. Machine learning-based prediction of metabolic dysfunction-associated steatotic liver disease using National Health and Nutrition Examination Survey (NHANES) data. PLoS ONE 2025, 20, e0335656. [Google Scholar] [CrossRef] [PubMed]
- Rahman, M.M.; Islam, M.R.; Hasan, M.; Ahmed, S. Machine Learning-Based Identification of Metabolic Risk Factors for Dysglycemia in Large Epidemiological Cohorts. BMC Med. Inform. Decis. Mak. 2022, 22, 310. [Google Scholar]
- Sghaireen, M.G.; Al-Smadi, Y.; Al-Qerem, A.; Srivastava, K.C.; Ganji, K.K.; Alam, M.K.; Nashwan, S.; Khader, Y. Machine Learning Approach for Metabolic Syndrome Diagnosis Using Explainable Data-Augmentation-Based Classification. Diagnostics 2022, 12, 3117. [Google Scholar] [CrossRef] [PubMed]
- Widen, E.; Raben, T.G.; Lello, L.; Hsu, S.D.H. Machine Learning Prediction of Biomarkers from SNPs and of Disease Risk from Biomarkers in the UK Biobank. Genes 2021, 12, 991. [Google Scholar] [CrossRef] [PubMed]
- Fry, A.; Littlejohns, T.J.; Sudlow, C.; Doherty, N.; Adamska, L.; Sprosen, T.; Collins, R.; Allen, N.E. Comparison of Sociodemographic and Health-Related Characteristics of UK Biobank Participants with the General Population. Am. J. Epidemiol. 2017, 186, 1026–1034. [Google Scholar] [CrossRef] [PubMed]
- Zhang, X.; Lin, S.; Zeng, Q.; Peng, L.; Yan, C. Machine learning and SHAP value interpretation for predicting cardiovascular disease risk in patients with diabetes using dietary antioxidants. Front. Nutr. 2025, 12, 1612369. [Google Scholar] [CrossRef] [PubMed]
- Ruder, S. An Overview of Multitask Learning in Deep Neural Networks. arXiv 2017, arXiv:1706.05098. [Google Scholar] [CrossRef]
- Hwang, M.; Rachim, V.P.; Yoo, J.; Lee, Y.; Park, S. Generalized Multitask Learning Framework for Glucose Forecasting and Hypoglycemia Detection Using Simulation-to-Reality Transfer. NPJ Digit. Med. 2025, 8, 994. [Google Scholar] [CrossRef]
- Lee, M.; Park, T.; Shin, J.Y.; Park, M. A Comprehensive Multitask Deep Learning Approach for Predicting Metabolic Syndrome with Genetic, Nutritional, and Clinical Data. Sci. Rep. 2024, 14, 17851. [Google Scholar] [CrossRef] [PubMed]
- Shao, J.; Pan, Y.; Kou, W.-B.; Feng, H.; Zhao, Y.; Zhou, K.; Zhong, S. Generalization of a Deep Learning Model for Continuous Glucose Monitoring–Based Hypoglycemia Prediction: Algorithm Development and Validation Study. JMIR Med. Inform. 2024, 12, e56909. [Google Scholar] [CrossRef]
- Rajkomar, A.; Dean, J.; Kohane, I. Machine Learning in Medicine. N. Engl. J. Med. 2019, 380, 1347–1358. [Google Scholar] [CrossRef]
- Guo, C.; Pleiss, G.; Sun, Y.; Weinberger, K.Q. On Calibration of Modern Neural Networks. In Proceedings of the 34th International Conference on Machine Learning (ICML), Sydney, NSW, Australia, 6–11 August 2017; pp. 1321–1330. [Google Scholar] [CrossRef]
- Van Calster, B.; McLernon, D.J.; van Smeden, M.; Wynants, L.; Steyerberg, E.W. Calibration: The Achilles Heel of Predictive Analytics. BMC Med. 2019, 17, 230. [Google Scholar] [CrossRef]
- Nathan, D.M.; Kuenen, J.; Borg, R.; Zheng, H.; Schoenfeld, D.; Heine, R.J.; A1C-Derived Average Glucose Study Group. Translating the A1C Assay Into Estimated Average Glucose Values. Diabetes Care 2008, 31, 1473–1478. [Google Scholar] [CrossRef] [PubMed]




















| Block | Parameter | Value/Setting | Note |
|---|---|---|---|
| Train-only preprocessing | Winsor quantile clip | q_low = 0.005, q_high = 0.995 (0.5–99.5%) | For continuous features only |
| StandardScaler | fit on train, apply on test | Binary features pass without scaling | |
| Gap/inf | Sanitization | NaN → 0.0 (post-imputation only), ±inf → ±1 × 106 | Applied after median imputation to handle residual numerical artifacts; does not affect missing data handling |
| Stacking (OOF) | CLS Splits | StratifiedKFold (n_splits = 5, shuffle = True, random_state = 42) | For binary classification |
| REG Splits | KFold (n_splits = 5, shuffle = True, random_state = 42) | For regressions | |
| CLS Calibration | Isotonic Regression | out_of_bounds = “clip”, per channel, on OOF | For each baseline Classifier |
| CLS | Decision threshold | 0.5 | For ACC/F1/MCC |
| eAG | Formula | eAG = 28.7 * HbA1c − 46.7 | Constants: EAG_L = 28.7, EAG_B = −46.7 |
| Intensity | λ = EAG_LAM = 0.0 | >0.0 soft glucose and HbA1c coupling | |
| Other | Random seed | 42 | Same everywhere |
| Parallelism | n_jobs = max(1, cpu_count − 1) | For ExtraTrees |
| Core Laboratory Feature | Median Imputations, n |
|---|---|
| fasting_hours | 0 |
| hdl_mgdl | 3 |
| ldl_mgdl | 11 |
| tc_mgdl | 3 |
| tg_mgdl | 3 |
| hscrp_mg_l | 17 |
| ferritin_ng_ml | 3 |
| scr_mg_dl | 0 |
| uacr_mg_g | 77 |
| na_mmol_l | 0 |
| k_mmol_l | 6 |
| cl_mmol_l | 0 |
| cotinine_ng_ml | 1 |
| Feature | Missing Before Median Imputation |
|---|---|
| age | 0 |
| height_cm | 76 |
| weight_kg | 69 |
| bmi | 81 |
| bp_sys | 454 |
| bp_dia | 454 |
| fasting_hours | 0 |
| hdl_mgdl | 0 |
| ldl_mgdl | 0 |
| tc_mgdl | 0 |
| tg_mgdl | 0 |
| hscrp_mg_l | 0 |
| ferritin_ng_ml | 0 |
| scr_mg_dl | 0 |
| uacr_mg_g | 0 |
| na_mmol_l | 0 |
| k_mmol_l | 0 |
| cl_mmol_l | 0 |
| cotinine_ng_ml | 0 |
| cotinine_ln | 0 |
| dpq_total | 824 |
| diet_kcal | 377 |
| diet_carb_g | 377 |
| diet_sugar_g | 377 |
| diet_fiber_g | 377 |
| diet_protein_g | 377 |
| diet_fat_total_g | 377 |
| diet_satfat_g | 377 |
| diet_sodium_mg | 377 |
| diet_potassium_mg | 377 |
| diet_caffeine_mg | 377 |
| diet_alcohol_g | 377 |
| Model | Key Hyperparameters | Library |
|---|---|---|
| ExtraTreesClassifier | n_estimators = 700, max_features = “sqrt”, n_jobs ≈ (#cores − 1), random_state = 42 | scikit-learn |
| HistGradientBoostingClassifier | max_iter = 450, random_state = 42 | scikit-learn |
| LogisticRegression | solver = “lbfgs”, max_iter = 4000, class_weight = “balanced”, random_state = 42 | scikit-learn |
| XGBClassifier* | n_estimators = 400, max_depth = 5, subsample = 0.9, colsample_bytree = 0.9, tree_method = “hist”, eval_metric = “logloss”, random_state = 42 | XGBoost |
| CatBoostClassifier* | depth = 6, iterations = 750, learning_rate = 0.05, loss_function = “Logloss”, auto_class_weights = “Balanced”, verbose = 0, random_seed = 42 | CatBoost |
| Model | Key Hyperparameters | Library |
|---|---|---|
| Ordinary Least Squares (OLS) | no regularization (α = 0) | scikit-learn |
| ExtraTreesRegressor | n_estimators = 900, max_features = “sqrt”, n_jobs ≈ (#cores − 1), random_state = 42 | scikit-learn |
| HistGradientBoostingRegressor | max_iter = 650, random_state = 42 | scikit-learn |
| XGBRegressor* | n_estimators = 500, max_depth = 6, subsample = 0.9, colsample_bytree = 0.9, tree_method = “hist”, objective = “reg:squarederror”, random_state = 42 | XGBoost |
| CatBoostRegressor* | depth = 6, iterations = 900, learning_rate = 0.05, loss_function = “RMSE”, verbose = 0, random_seed = 42 | CatBoost |
| Node | Model/ Operation | Hyperparameters | Meta-Feature Input |
|---|---|---|---|
| Calibration (CLS) | IsotonicRegression | out_of_bounds = “clip” | OOF scores of each base classifier |
| Meta-CLS head | LogisticRegression | solver = “lbfgs”, max_iter = 4000, class_weight = “balanced”, random_state = 42 | Concatenation of calibrated classifier outputs, summary statistics (mean, std, min, max), and cross-task regression-derived statistics. |
| Meta-REG head (Glucose) | Ridge | alpha = 0.5, random_state = 42 | Meta-features include Z_g, p*, summary statistics (mean, std, q10, q90), and polynomial terms p* and (p*)2. |
| Meta-REG head (HbA1c) | Ridge | alpha = 0.5, random_state = 42 | Similar for Z_a |
| Scenario | Threshold | ACC | F1 | Precision | Recall | MCC |
|---|---|---|---|---|---|---|
| Base (0.50) | 0.50 | 0.949615 | 0.794468 | 0.825 | 0.765 | 0.735072 |
| Maximum F1 | 0.67 | 0.951212 | 0.808927 | 0.873 | 0.753 | 0.744756 |
| High sensitivit | 0.35 | 0.933660 | 0.772083 | 0.689 | 0.879 | 0.710247 |
| Model | ACC/ACC * | F1/F1 * | ROC_AUC/ROC-AUC * | PR_AUC/PR-AUC * | MCC/MCC * | Brier/Brier * |
|---|---|---|---|---|---|---|
| CMSE | 0.801/0.950 | 0.558/0.794 | 0.865/0.975 | 0.568/0.838 | 0.469/0.735 | 0.144/0.060 |
| LogReg | 0.751/0.919 | 0.500/0.721 | 0.838/0.920 | 0.483/0.785 | 0.398/0.685 | 0.161/0.067 |
| ExtraTrees | 0.850/0.919 | 0.232/0.743 | 0.848/0.942 | 0.541/0.787 | 0.272/0.696 | 0.105/0.074 |
| HistGB | 0.858/0.896 | 0.413/0.735 | 0.848/0.953 | 0.544/0.821 | 0.376/0.684 | 0.119/0.092 |
| XGB | 0.855/0.911 | 0.396/0.720 | 0.860/0.941 | 0.559/0.778 | 0.358/0.668 | 0.103/0.072 |
| CatBoost | 0.852/0.922 | 0.542/0.757 | 0.865/0.946 | 0.574/0.813 | 0.453/0.711 | 0.103/0.063 |
| Model | R2/R2 * | RMSE/RMSE * | MAE/MAE * | MAPE/MAPE * |
|---|---|---|---|---|
| CMSE | 0.385/0.681 | 26.65/22.47 | 14.36/11.42 | 0.121/0.101 |
| Linear | 0.254/0.477 | 29.37/28.16 | 16.62/18.57 | 0.143/0.200 |
| ExtraTrees | 0.301/0.612 | 28.42/24.25 | 15.09/12.26 | 0.127/0.109 |
| HistGB | 0.308/0.574 | 28.28/25.43 | 16.37/12.66 | 0.142/0.111 |
| XGB | 0.397/0.579 | 26.40/25.27 | 15.02/12.87 | 0.129/0.112 |
| CatBoost | 0.375/0.615 | 26.88/24.17 | 14.88/12.36 | 0.127/0.112 |
| Model | R2/R2 * | RMSE/RMSE * | MAE/MAE * | MAPE/MAPE * |
|---|---|---|---|---|
| CMSE | 0.752/0.366 | 0.676/0.866 | 0.353/0.490 | 0.057/0.078 |
| Linear | 0.511/0.251 | 0.910/0.942 | 0.620/0.547 | 0.117/0.088 |
| ExtraTrees | 0.700/0.297 | 0.712/0.912 | 0.371/0.506 | 0.062/0.080 |
| HistGB | 0.693/0.305 | 0.721/0.907 | 0.386/0.532 | 0.064/0.086 |
| XGB | 0.663/0.355 | 0.754/0.874 | 0.405/0.507 | 0.066/0.082 |
| CatBoost | 0.712/0.352 | 0.698/0.876 | 0.374/0.503 | 0.062/0.081 |
| Model | ACC (Mean ± Std; 95% CI) | F1 (Mean ± Std; 95% CI) | ROC_AUC (Mean ± Std; 95% CI) | PR_AUC (Mean ± Std; 95% CI) | MCC (Mean ± Std; 95% CI) | Brier (Mean ± Std; 95% CI) |
|---|---|---|---|---|---|---|
| CMSE | 0.9198 ± 0.0111; 95%CI [0.9100, 0.9295] | 0.7456 ± 0.0401; 95%CI [0.7104, 0.7808] | 0.9324 ± 0.0144; 95%CI [0.9198, 0.9450] | 0.7672 ± 0.0392; 95%CI [0.7328, 0.8016] | 0.7000 ± 0.0451; 95%CI [0.6604, 0.7395] | 0.0666 ± 0.0062; 95%CI [0.0612, 0.0720] |
| LogReg | 0.9121 ± 0.0078; 95%CI [0.9052, 0.9189] | 0.6999 ± 0.0317; 95%CI [0.6721, 0.7277] | 0.9090 ± 0.0151; 95%CI [0.8958, 0.9223] | 0.7482 ± 0.0261; 95%CI [0.7253, 0.7711] | 0.6585 ± 0.0337; 95%CI [0.6290, 0.6880] | 0.0725 ± 0.0057; 95%CI [0.0675, 0.0775] |
| ExtraTrees | 0.9150 ± 0.0121; 95%CI [0.9043, 0.9256] | 0.7233 ± 0.0494; 95%CI [0.6800, 0.7666] | 0.9314 ± 0.0167; 95%CI [0.9168, 0.9460] | 0.7739 ± 0.0375; 95%CI [0.7410, 0.8069] | 0.6773 ± 0.0530; 95%CI [0.6309, 0.7237] | 0.0784 ± 0.0118; 95%CI [0.0681, 0.0888] |
| HistGB | 0.8924 ± 0.0057; 95%CI [0.8874, 0.8974] | 0.7111 ± 0.0144; 95%CI [0.6985, 0.7237] | 0.9395 ± 0.0091; 95%CI [0.9315, 0.9475] | 0.7909 ± 0.0379; 95%CI [0.7577, 0.8242] | 0.6524 ± 0.0178; 95%CI [0.6367, 0.6680] | 0.0914 ± 0.0051; 95%CI [0.0869, 0.0959] |
| XGB | 0.9094 ± 0.0124; 95%CI [0.8985, 0.9203] | 0.7062 ± 0.0472; 95%CI [0.6648, 0.7476] | 0.9292 ± 0.0152; 95%CI [0.9159, 0.9425] | 0.7687 ± 0.0489; 95%CI [0.7258, 0.8115] | 0.6568 ± 0.0514; 95%CI [0.6117, 0.7018] | 0.0767 ± 0.0111; 95%CI [0.0670, 0.0864] |
| CatBoost | 0.9137 ± 0.0093; 95%CI [0.9055, 0.9218] | 0.7158 ± 0.0384; 95%CI [0.6821, 0.7494] | 0.9359 ± 0.0146; 95%CI [0.9231, 0.9487] | 0.7783 ± 0.0441; 95%CI [0.7396, 0.8169] | 0.6700 ± 0.0403; 95%CI [0.6347, 0.7053] | 0.0675 ± 0.0092; 95%CI [0.0594, 0.0756] |
| Task | Metric | Hold-Out | 5-Fold CV (Mean) | Gap |
|---|---|---|---|---|
| is_diabetes_labs_only classification | ROC-AUC | 0.974510 | 0.9324 | +0.042110 |
| F1 | 0.794468 | 0.7456 | +0.048868 | |
| glucose_mgdl regression | R2 | 0.681382 | 0.6254 | +0.055982 |
| hba1c_pct regression | R2 | 0.752465 | 0.7401 | +0.012365 |
| Model | R2 (Mean ± Std; 95% CI) | RMSE (Mean ± Std; 95% CI) | MAE (Mean ± Std; 95% CI) | MAPE (Mean ± Std; 95% CI) |
|---|---|---|---|---|
| CMSE | 0.6254 ± 0.0623; 95%CI [0.5708, 0.6799] | 25.7153 ± 4.9299; 95%CI [21.3940, 30.0366] | 12.2576 ± 1.4144; 95%CI [11.0178, 13.4973] | 0.1039 ± 0.0057; 95%CI [0.0990, 0.1089] |
| Linear | 0.4445 ± 0.0269; 95%CI [0.4210, 0.4680] | 31.3222 ± 3.3380; 95%CI [28.3963, 34.2481] | 19.0658 ± 0.8155; 95%CI [18.3510, 19.7806] | 0.1950 ± 0.0048; 95%CI [0.1909, 0.1992] |
| ExtraTrees | 0.5935 ± 0.0272; 95%CI [0.5697, 0.6174] | 26.8075 ± 3.1366; 95%CI [24.0582, 29.5569] | 12.3101 ± 0.9991; 95%CI [11.4343, 13.1858] | 0.1039 ± 0.0044; 95%CI [0.1000, 0.1078] |
| HistGB | 0.5978 ± 0.0420; 95%CI [0.5610, 0.6346] | 26.5452 ± 2.0908; 95%CI [24.7125, 28.3779] | 13.0074 ± 0.7128; 95%CI [12.3826, 13.6322] | 0.1113 ± 0.0055; 95%CI [0.1065, 0.1161] |
| XGB | 0.5494 ± 0.0605; 95%CI [0.4964, 0.6024] | 28.1303 ± 3.0767; 95%CI [25.4335, 30.8272] | 13.5327 ± 0.7848; 95%CI [12.8448, 14.2206] | 0.1133 ± 0.0054; 95%CI [0.1085, 0.1180] |
| CatBoost | 0.6150 ± 0.0370; 95%CI [0.5825, 0.6474] | 26.0157 ± 2.4782; 95%CI [23.8435, 28.1879] | 12.5895 ± 0.7686; 95%CI [11.9158, 13.2632] | 0.1084 ± 0.0049; 95%CI [0.1041, 0.1127] |
| Model | R2 (Mean ± Std; 95% CI) | RMSE (Mean ± Std; 95% CI) | MAE (Mean ± Std; 95% CI) | MAPE (Mean ± Std; 95% CI) |
|---|---|---|---|---|
| CMSE | 0.7401 ± 0.0309; 95%CI [0.7130, 0.7671] | 0.6933 ± 0.0962; 95%CI [0.6089, 0.7776] | 0.3599 ± 0.0302; 95%CI [0.3334, 0.3864] | 0.0580 ± 0.0038; 95%CI [0.0547, 0.0613] |
| Linear | 0.5228 ± 0.0126; 95%CI [0.5117, 0.5338] | 0.9396 ± 0.0373; 95%CI [0.9069, 0.9723] | 0.6248 ± 0.0105; 95%CI [0.6156, 0.6340] | 0.1143 ± 0.0020; 95%CI [0.1125, 0.1160] |
| ExtraTrees | 0.7056 ± 0.0197; 95%CI [0.6883, 0.7229] | 0.7382 ± 0.0471; 95%CI [0.6969, 0.7796] | 0.3649 ± 0.0213; 95%CI [0.3462, 0.3835] | 0.0585 ± 0.0029; 95%CI [0.0560, 0.0611] |
| HistGB | 0.7257 ± 0.0322; 95%CI [0.6975, 0.7539] | 0.7114 ± 0.0487; 95%CI [0.6688, 0.7541] | 0.3746 ± 0.0209; 95%CI [0.3563, 0.3930] | 0.0605 ± 0.0028; 95%CI [0.0581, 0.0630] |
| XGB | 0.6973 ± 0.0224; 95%CI [0.6777, 0.7169] | 0.7475 ± 0.0268; 95%CI [0.7240, 0.7711] | 0.3900 ± 0.0103; 95%CI [0.3810, 0.3990] | 0.0628 ± 0.0014; 95%CI [0.0616, 0.0640] |
| CatBoost | 0.7343 ± 0.0281; 95%CI [0.7097, 0.7589] | 0.7005 ± 0.0462; 95%CI [0.6599, 0.7410] | 0.3667 ± 0.0170; 95%CI [0.3518, 0.3815] | 0.0595 ± 0.0021; 95%CI [0.0577, 0.0614] |
| λ | ACC | F1 | ROC_AUC | Brier | Glucose R2 | Glucose RMSE | HbA1c R2 | HbA1c RMSE |
|---|---|---|---|---|---|---|---|---|
| 0.00 | 0.949615 | 0.794468 | 0.974510 | 0.059521 | 0.681382 | 22.473540 | 0.752465 | 0.676296 |
| 0.10 | 0.949602 | 0.794211 | 0.974378 | 0.059620 | 0.680912 | 22.492008 | 0.548109 | 0.842163 |
| 0.25 | 0.949587 | 0.793990 | 0.974246 | 0.059714 | 0.680817 | 22.503274 | 0.545321 | 0.845127 |
| Variant | ACC | F1 | ROC_AUC | PR_AUC | MCC | Brier |
|---|---|---|---|---|---|---|
| CMSE (base) | 0.949615 | 0.794468 | 0.974510 | 0.838077 | 0.735072 | 0.059521 |
| CMSE_no-calibration | 0.944118 | 0.792031 | 0.974358 | 0.835629 | 0.729154 | 0.064029 |
| CMSE_λ = 0.10 | 0.949602 | 0.794211 | 0.974378 | 0.838004 | 0.734981 | 0.059620 |
| CMSE_λ = 0.25 | 0.949587 | 0.793990 | 0.974246 | 0.837882 | 0.734822 | 0.059714 |
| CMSE_no-medication | 0.861032 | 0.650214 | 0.904956 | 0.675082 | 0.552187 | 0.097826 |
| Variant | Glucose R2 | Glucose RMSE | HbA1c R2 | HbA1c RMSE |
|---|---|---|---|---|
| CMSE (base) | 0.681382 | 22.473540 | 0.752465 | 0.676296 |
| CMSE_no-calibration | 0.678214 | 22.552311 | 0.752823 | 0.676994 |
| CMSE_λ = 0.10 | 0.680912 | 22.492008 | 0.548109 | 0.842163 |
| CMSE_λ = 0.25 | 0.680817 | 22.503274 | 0.545321 | 0.845127 |
| CMSE_no-medication | 0.385423 | 26.649910 | 0.366000 | 0.866235 |
| Variant | ACC | F1 | ROC-AUC | PR-AUC | MCC | Brier |
|---|---|---|---|---|---|---|
| CMSE (base) | 0.949615 | 0.794468 | 0.974510 | 0.838077 | 0.735072 | 0.059521 |
| CMSE_no-medication | 0.861032 | 0.650214 | 0.904956 | 0.675082 | 0.552187 | 0.097826 |
| Variant | Glucose R2 | Glucose RMSE | HbA1c R2 | HbA1c RMSE |
|---|---|---|---|---|
| CMSE (base) | 0.681382 | 22.473540 | 0.752465 | 0.676296 |
| CMSE_no-medication | 0.385423 | 26.649910 | 0.366000 | 0.866235 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Khamitova, Z.; Omarova, G.; Akhmetzhanov, M.; Burganova, R.; Orynbassar, M.; Sabirova, U.; Bukatayeva, A.; Barakova, A.; Jiyanmuratova, G.; Yuldasheva, D. A Calibrated Multi-Task Ensemble Architecture for Biomedical Risk Prediction. Computers 2026, 15, 244. https://doi.org/10.3390/computers15040244
Khamitova Z, Omarova G, Akhmetzhanov M, Burganova R, Orynbassar M, Sabirova U, Bukatayeva A, Barakova A, Jiyanmuratova G, Yuldasheva D. A Calibrated Multi-Task Ensemble Architecture for Biomedical Risk Prediction. Computers. 2026; 15(4):244. https://doi.org/10.3390/computers15040244
Chicago/Turabian StyleKhamitova, Zhainagul, Gulmira Omarova, Madi Akhmetzhanov, Roza Burganova, Maksym Orynbassar, Umida Sabirova, Almagul Bukatayeva, Aliya Barakova, Gulnoz Jiyanmuratova, and Dilchekhra Yuldasheva. 2026. "A Calibrated Multi-Task Ensemble Architecture for Biomedical Risk Prediction" Computers 15, no. 4: 244. https://doi.org/10.3390/computers15040244
APA StyleKhamitova, Z., Omarova, G., Akhmetzhanov, M., Burganova, R., Orynbassar, M., Sabirova, U., Bukatayeva, A., Barakova, A., Jiyanmuratova, G., & Yuldasheva, D. (2026). A Calibrated Multi-Task Ensemble Architecture for Biomedical Risk Prediction. Computers, 15(4), 244. https://doi.org/10.3390/computers15040244
