Synthetic Expansion of Blood Dielectric Spectra at Microwave Frequencies Using Data-Driven Methods
Abstract
1. Introduction
- We propose a structured data-synthesis framework that performs Hb-dependent filling by generating new Hb levels together with their corresponding frequency-dependent permittivity spectra, enabling densification of sparse experimental data into a densely sampled Hb range.
- We develop a regression framework for predicting blood analyte concentrations from complex permittivity spectra, with hemoglobin (Hb) used as a representative target.
- We demonstrate that the densified dataset improves prediction accuracy and generalization across different Hb levels compared to models trained on sparse measurements alone.
2. Methods
2.1. Measured Dataset
2.2. Synthesis Methods
2.2.1. Interpolation-Based Methods
- Gaussian-Noise Synthetic DatasetResidual variability is represented using a Gaussian model. Synthetic values were obtained by adding normally distributed perturbations, scaled by the empirical residual standard deviation at each frequency, to the PCHIP-interpolated trend. This produced smooth synthetic spectra with controlled, symmetric variability.
- Bootstrap-Residual Synthetic DatasetResidual variability is captured non-parametrically using bootstrap resampling. Residuals are drawn with replacement from the observed residual set and added to the PCHIP trend. This preserved the empirical residual distribution, including potential asymmetry or heavy tails.
2.2.2. Probabilistic-Based Methods
- Conditional Gaussian Process Regressor
- II.
- Conditional Bayesian Principal Component Analysis
2.3. Validation Approaches
2.3.1. Spectral Trend Consistency Assessment
2.3.2. Variance Preservation Assessment
2.3.3. Dielectric Relaxation Behavior Assessment
2.3.4. Dielectric Spectral Similarity Assessment
2.3.5. Predictive Behavior Assessment
3. Results
3.1. Spectral Trend Consistency Assessment Results
3.2. Variance Preservation Assessment Results
3.3. Dielectric Relaxation Behavior Assessment Results
3.4. Dielectric Spectral Similarity Assessment Results
3.5. Predictive Behavior Assessment Results
4. Discussion
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Abbosh, A. Electromagnetic medical sensing. Sensors 2019, 19, 1662. [Google Scholar] [CrossRef] [PubMed]
- Schwan, H.P. Dielectric properties of biological tissue and cells at RF-and MW-frequencies. In Biological Effects and Dosimetry of Nonionizing Radiation: Radiofrequency and Microwave Energies; Springer: New York, NY, USA, 1983; pp. 195–211. [Google Scholar]
- Mobashsher, A.; Bialkowski, K.; Abbosh, A.; Crozier, S. Design and experimental evaluation of a non-invasive microwave head imaging system for intracranial haemorrhage detection. PLoS ONE 2016, 11, e0152351. [Google Scholar] [CrossRef]
- Wolf, M.; Gulich, R.; Lunkenheimer, P.; Loidl, A. Broadband dielectric spectroscopy on human blood. Biochim. Biophys. Acta (BBA)-Gen. Subj. 2011, 1810, 727–740. [Google Scholar] [CrossRef]
- Wintrobe, M.M. Wintrobe’s Clinical Hematology; Lippincott Williams & Wilkins: Waltham, MA, USA, 2009; Volume 1. [Google Scholar]
- Santorelli, A.; Abbasi, B.; Lyons, M.; Hayat, A.; Gupta, S.; O’Halloran, M.; Gupta, A. Investigation of Anemia and the Dielectric Properties of Human Blood at Microwave Frequencies. IEEE Access 2018, 6, 56885–56892. [Google Scholar] [CrossRef]
- Gabriel, S.; Lau, R.; Gabriel, C. The dielectric properties of biological tissues: II. Measurements in the frequency range 10 Hz to 20 GHz. Phys. Med. Biol. 1996, 41, 2251–2269. [Google Scholar] [CrossRef]
- Farrugia, L.; Porter, E.; Conceição, R.C.; Di Meo, S.; Godinho, D.M.; Bonello, J.; Ragulskis, M.; Ocket, I.; Farina, L.; Cavagnaro, M. The complex permittivity of biological tissues: A practical measurement guideline. IEEE Access 2024, 12, 10296–10314. [Google Scholar] [CrossRef]
- Porter, E.; Farrugia, L.; Prakash, P.; Conceição, R.C.; Shrivastava, D.; Scapaticci, R.; Mandija, S.; Cavagnaro, M.; Curto, S. Current status and emerging techniques for measuring the dielectric properties of biological tissues. ASME Open J. Eng. 2024, 3, 031005. [Google Scholar] [CrossRef]
- Villena Gonzales, W. Dielectric Spectroscopy and Significance of Blood Analytes in the Microwave and Millimetre-Wave Bands. Ph.D. Thesis, The University of Queensland, St Lucia, QLD, Australia, 2022. [Google Scholar]
- Chen, L.; Xia, C.; Zhao, Z.; Fu, H.; Chen, Y. AI-driven sensing technology. Sensors 2024, 24, 2958. [Google Scholar] [CrossRef] [PubMed]
- Naser, M. A review of machine learning with small and limited data. J. Big Data 2026, 13, 18. [Google Scholar] [CrossRef]
- Rahman, M.S.; Sultana, M. Performance of Firth-and logF-type penalized methods in risk prediction for small or sparse binary data. BMC Med. Res. Methodol. 2017, 17, 33. [Google Scholar] [CrossRef]
- Saxton, D.; Grefenstette, E.; Hill, F.; Kohli, P. Analysing mathematical reasoning abilities of neural models. arXiv 2019, arXiv:1904.01557. [Google Scholar] [CrossRef]
- Haghbin, Y.; Moradi, H.; Hosseini, R. Feature-to-Image Data Augmentation: Improving Model Feature Extraction with Cluster-Guided Synthetic Samples. arXiv 2024, arXiv:2409.17685. [Google Scholar]
- Brain, D.; Webb, G.I. On the effect of data set size on bias and variance in classification learning. In Proceedings of the Fourth Australian Knowledge Acquisition Workshop; University of New South Wales: Kensington, Australia, 1999; pp. 117–128. [Google Scholar]
- Chawla, N.V.; Bowyer, K.W.; Hall, L.O.; Kegelmeyer, W.P. SMOTE: Synthetic minority over-sampling technique. J. Artif. Intell. Res. 2002, 16, 321–357. [Google Scholar] [CrossRef]
- Won, D.-H.; Shin, K.-S.; Youm, S. Synthetic Data Augmentation for Imbalanced Tabular Data: A Comparative Study of Generation Methods. Electronics 2026, 15, 883. [Google Scholar] [CrossRef]
- Xiong, S. The reconstruction approach: From interpolation to regression. Technometrics 2021, 63, 225–235. [Google Scholar] [CrossRef]
- Chen, Y.; Chen, X.; Wang, Y.; Zu, S. The interpolation of sparse geophysical data. Surv. Geophys. 2019, 40, 73–105. [Google Scholar] [CrossRef]
- Deringer, V.L.; Bartók, A.P.; Bernstein, N.; Wilkins, D.M.; Ceriotti, M.; Csányi, G. Gaussian process regression for materials and molecules. Chem. Rev. 2021, 121, 10073–10141. [Google Scholar] [CrossRef] [PubMed]
- Liu, K.; Meng, Z.; Own, C.-M. Gaussian process regression plus method for localization reliability improvement. Sensors 2016, 16, 1193. [Google Scholar] [CrossRef]
- Kremer, F.; Schönhals, A. Broadband Dielectric Spectroscopy; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2002. [Google Scholar]
- Rubner, Y.; Tomasi, C.; Guibas, L.J. The earth mover’s distance as a metric for image retrieval. Int. J. Comput. Vis. 2000, 40, 99–121. [Google Scholar] [CrossRef]
- Pozar, D.M. Microwave Engineering; John Wiley & Sons: Hoboken, NJ, USA, 2011. [Google Scholar]
- Fritsch, F.N.; Butland, J. A method for constructing local monotone piecewise cubic interpolants. SIAM J. Sci. Stat. Comput. 1984, 5, 300–304. [Google Scholar] [CrossRef]
- Rasmussen, C.E. Gaussian processes in machine learning. In Advanced Lectures on Machine Learning; Summer School on Machine Learning; Springer: Berlin/Heidelberg, Germany, 2003; pp. 63–71. [Google Scholar]
- Tipping, M.E.; Bishop, C.M. Probabilistic principal component analysis. J. R. Stat. Soc. Ser. B Stat. Methodol. 1999, 61, 611–622. [Google Scholar] [CrossRef]
- Bishop, C. Bayesian PCA. In Advances in Neural Information Processing Systems; MIT Press: Cambridge, MA, USA, 1998; Volume 11. [Google Scholar]
- Montgomery, D.C.; Runger, G.C. Applied Statistics and Probability for Engineers; John Wiley & Sons: Hoboken, NJ, USA, 2010. [Google Scholar]
- Sasaki, K.; Wake, K.; Watanabe, S. Development of best fit Cole-Cole parameters for measurement data from biological tissues and organs between 1 MHz and 20 GHz. Radio Sci. 2014, 49, 459–472. [Google Scholar] [CrossRef]
- Cole, K.S.; Cole, R.H. Dispersion and absorption in dielectrics I. Alternating current characteristics. J. Chem. Phys. 1941, 9, 341–351. [Google Scholar] [CrossRef]
- Chen, T. XGBoost: A Scalable Tree Boosting System. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; Association for Computing Machinery: New York, NY, USA, 2016. [Google Scholar]
- Akiba, T.; Sano, S.; Yanase, T.; Ohta, T.; Koyama, M. Optuna: A next-generation hyperparameter optimization framework. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining; Association for Computing Machinery: New York, NY, USA, 2019; pp. 2623–2631. [Google Scholar]
- Carpenter, J.; Bithell, J. Bootstrap confidence intervals: When, which, what? A practical guide for medical statisticians. Stat. Med. 2000, 19, 1141–1164. [Google Scholar] [CrossRef]
- Friedman, M. The use of ranks to avoid the assumption of normality implicit in the analysis of variance. J. Am. Stat. Assoc. 1937, 32, 675–701. [Google Scholar] [CrossRef]








| Data Sets | RMSE 95% CI | MAE 95% CI | R2 95% CI |
|---|---|---|---|
| Measured | 1.813 [1.157, 2.344] | 1.169 [0.908, 1.465] | 0.941 [0.893, 0.977] |
| Interpolation + Gaussian Noise | 1.684 [1.347, 1.998] | 1.248 [1.100, 1.416] | 0.949 [0.922, 0.969] |
| Interpolation + Bootstrap Residual | 1.696 [1.347, 2.020] | 1.253 [1.100, 1.428] | 0.948 [0.920, 0.969] |
| Conditioned BPCA | 1.774 [1.447, 1.997] | 1.337 [1.129, 1.517] | 0.943 [0.917, 0.966] |
| Conditioned GPR | 1.948 [1.500, 2.500] | 1.403 [1.148, 1.754] | 0.932 [0.870, 0.963] |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Alhummada, I.; Bialkowski, A.; Guo, L.; Villena Gonzales, W.; Deriche, M.; Abbosh, A. Synthetic Expansion of Blood Dielectric Spectra at Microwave Frequencies Using Data-Driven Methods. Sensors 2026, 26, 3580. https://doi.org/10.3390/s26113580
Alhummada I, Bialkowski A, Guo L, Villena Gonzales W, Deriche M, Abbosh A. Synthetic Expansion of Blood Dielectric Spectra at Microwave Frequencies Using Data-Driven Methods. Sensors. 2026; 26(11):3580. https://doi.org/10.3390/s26113580
Chicago/Turabian StyleAlhummada, Iman, Alina Bialkowski, Lei Guo, Wilbert Villena Gonzales, Mohamed Deriche, and Amin Abbosh. 2026. "Synthetic Expansion of Blood Dielectric Spectra at Microwave Frequencies Using Data-Driven Methods" Sensors 26, no. 11: 3580. https://doi.org/10.3390/s26113580
APA StyleAlhummada, I., Bialkowski, A., Guo, L., Villena Gonzales, W., Deriche, M., & Abbosh, A. (2026). Synthetic Expansion of Blood Dielectric Spectra at Microwave Frequencies Using Data-Driven Methods. Sensors, 26(11), 3580. https://doi.org/10.3390/s26113580

