Sleep Stage Classification During CPAP Therapy from CPAP-Airflow and Wearable Fingertip Signals
Abstract
1. Introduction
- Clinical Specialization: We propose a sleep staging framework specifically tailored for the unique physiological conditions of patients undergoing CPAP therapy, addressing the challenges posed by positive airway pressure artifacts.
- Multimodal Synergy: We demonstrate that the integration of CPAP-airflow with fingertip-worn PPG and IMU signals significantly outperforms single-modality approaches, particularly in identifying REM sleep, which is critical for titration accuracy.
- Robust Fusion Architecture: We implement a late-fusion soft-voting ensemble strategy that effectively harmonizes heterogeneous data sources with different sampling resolutions and sensor characteristics.
- Practical Feasibility: Our approach leverages existing clinical hardware (CPAP) and minimally obtrusive wearables, offering a scalable solution for high-fidelity, in-home sleep monitoring without the need for full PSG.
Related Work
| Reference | FDA Approved | Signal | Method | Overall | Sensitivity | ||||
|---|---|---|---|---|---|---|---|---|---|
| Acc | F1 | Wake | REM | NREM | |||||
| Long et al., 2014 [25] | RE | LD | 0.762 | 0.45 | - | - | - | - | |
| Tataraidze et al., 2015 [26] | RIP | Bagging | 0.804 | 0.65 | - | - | - | - | |
| Yang et al., 2016 [27] | RF | Classifier | 0.74 | 0.49 | - | - | - | - | |
| Chen et al., 2023 [28] | RIP | XGB | 0.76 | 0.54 | 0.677 | 0.749 | 0.762 | 0.769 | |
| Chung et al., 2024 [29] | RF | XGB | 0.709 | 0.458 | 0.736 | 0.590 | 0.734 | - | |
| Chen et al., 2025 [17] | RF | CNN | 0.785 | 0.605 | 0.727 | 0.748 | 0.604 | 0.838 | |
| Manjunath et al., 2025 [30] | RF | XGB | - | - | - | 0.719 | 0.645 | 0.665 | |
| Wu et al., 2020 [20] | PPG | ANN + SVM | 0.78 | 0.54 | - | - | - | - | |
| Korkalainen et al., 2020 [21] | PPG | CNN + RNN | 0.801 | 0.65 | - | 0.72 | 0.70 | 0.87 | |
| Huttunen et al., 2021 [23] | PPG | CNN + RNN | 0.833 | 0.72 | 0.83 | 0.75 | 0.86 | 0.89 | |
| Strumpf et al., 2023 [24] | ✓ | PPG | Transformer + CNN | 0.80 | 0.606 | 0.741 | 0.58 | 0.70 | 0.90 |
| Chen et al., 2025 [31] | ✓ | PPG + ACTG | NN | - | - | - | - | - | - |
| Sharan et al., 2025 [32] | PPG | NN | - | 0.662 | 0.770 | 0.825 | 0.828 | 0.843 | |
| Zhang et al., 2024 [33] | ECG + PPG + ACTG + Temp | NN | 0.84 | 0.729 | 0.842 | 0.862 | 0.823 | 0.820 | |
| Kazemi et al., 2024 [34] | RF + RIP + PPG | CNN+RNN | 0.83 | 0.66 | 0.82 | 0.677 | 0.687 | 0.914 | |
| Krauss et al., 2025 [35] | ACTG + ECG + RIP | RNN | - | - | - | 0.685 | 0.511 | 0.895 | |
| Reference | Signal | Method | Overall | Sensitivity | |||||
|---|---|---|---|---|---|---|---|---|---|
| Acc | F1 | Wake | REM | Light | Deep | ||||
| Long et al., 2014 [25] | RE | LD | 0.638 | 0.38 | - | - | - | - | - |
| Chang et al., 2018 [36] | IMU + Mic | KNN | - | - | - | - | 0.700 | 0.696 | 0.598 |
| Chen et al., 2023 [28] | RIP | XGB | 0.65 | 0.458 | 0.544 | - | - | - | - |
| Wu et al., 2020 [20] | PPG | ANN + SVM | 0.62 | 0.41 | - | 0.526 | 0.659 | 0.625 | 0.596 |
| Korkalainen et al., 2020 [21] | PPG | CNN + RNN | 0.685 | 0.54 | - | 0.73 | 0.67 | 0.71 | 0.52 |
| Huttunen et al., 2021 [23] | PPG | CNN + RNN | 0.741 | 0.64 | 0.745 | 0.77 | 0.83 | 0.79 | 0.57 |
| Sharan et al., 2025 [32] | PPG | NN | - | 0.647 | 0.738 | 0.714 | 0.812 | 0.721 | 0.839 |
| N. Sridhar et al., 2020 [22] | ECG | CNN | 0.72 | 0.55 | - | 0.74 | 0.76 | 0.76 | 0.48 |
| Zhang et al., 2024 [33] | ECG + PPG + ACTG + Temp | NN | 0.753 | 0.615 | 0.741 | 0.840 | 0.798 | 0.615 | 0.733 |
| Kazemi et al., 2024 [34] | RF + RIP + PPG | CNN + RNN | 0.798 | 0.70 | 0.798 | 0.716 | 0.845 | 0.825 | 0.774 |
| Han et al., 2024 [37] | In-ear Sound | Multi-class SVM | - | - | - | - | 0.773 | 0.653 | 0.623 |
2. Materials and Dataset Statistics
2.1. Benchmark
- TipTraQ Models: Trained using PPG and IMU signals to predict sleep stages, including two-stage (wake/sleep), three-stage (wake/REM/NREM), and four-stage (wake/REM/light/deep) classification. Specifically, the four-stage labels were derived by grouping N1 and N2 as “light sleep,” with N3 representing “deep sleep”.
- CPAP Models: Trained using CPAP-related signals to predict sleep stages, including two-stage (wake/sleep), three-stage (wake/REM/NREM), and four-stage (wake/REM/light/deep) classification.
2.2. Performance Evaluation Protocol
3. Model
3.1. Feature Extraction for the TipTraQ Model
3.2. Feature Extraction for the CPAP Model
3.3. Training and Validation Overview
3.4. Ensemble Method
3.5. Post-Processing
3.6. Signal Quality Mask
4. Results
4.1. Evaluation of Two-Stage Classification
4.2. Evaluation of Three-Stage Classification
4.3. Evaluation of Four-Stage Classification
4.4. Model Robustness via Cross-Validation
4.5. Qualitative Visualization of Hypnograms
4.6. Sensitivity Analysis of the Ensemble Weight
4.7. Prediction Confidence Analysis
5. Discussion
5.1. Synergistic Potential of Multimodal Fusion
5.2. Classification Accuracy and Error Distributions
5.3. Challenges in Deep Sleep Detection and Data Sparsity
5.4. Impact of the CPAP Titration Environment
5.5. Model Robustness and Domain Adaptation
5.6. Clinical Utility: Titration Efficacy and REM Detection
5.7. Adaptive Reliability and Quality Masking
5.8. Probabilistic Estimates as a Confidence Metric
5.9. Limitations and Future Directions
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Luyster, F.S.; Strollo, P.J.; Zee, P.C.; Walsh, J.K. Sleep: A health imperative. Sleep 2012, 35, 727–734. [Google Scholar] [CrossRef]
- Berry, R.B.; Brooks, R.; Gamaldo, C.; Harding, S.M.; Lloyd, R.M.; Quan, S.F.; Troester, M.T.; Vaughn, B.V. AASM scoring manual updates for 2017 (version 2.4). J. Clin. Sleep Med. 2017, 13, 665–666. [Google Scholar] [CrossRef]
- Kyriacou, P.A.; Allen, J. Photoplethysmography: Technology, Signal Analysis and Applications; Academic Press: Cambridge, MA, USA, 2021. [Google Scholar]
- Yuan, H.; Hill, E.A.; Kyle, S.D.; Doherty, A. A systematic review of the performance of actigraphy in measuring sleep stages. J. Sleep Res. 2024, 33, e14143. [Google Scholar] [CrossRef] [PubMed]
- Topalidis, P.I.; Baron, S.; Heib, D.P.; Eigl, E.S.; Hinterberger, A.; Schabus, M. From pulses to sleep stages: Towards optimized sleep classification using heart-rate variability. Sensors 2023, 23, 9077. [Google Scholar] [CrossRef]
- Jaspan, V.N.; Greenberg, G.S.; Parihar, S.; Park, C.M.; Somers, V.K.; Shapiro, M.D.; Lavie, C.J.; Virani, S.S.; Slipczuk, L. The role of sleep in cardiovascular disease. Curr. Atheroscler. Rep. 2024, 26, 249–262. [Google Scholar] [CrossRef]
- Imtiaz, S.A. A systematic review of sensing technologies for wearable sleep staging. Sensors 2021, 21, 1562. [Google Scholar] [CrossRef]
- Long, X.; Haakma, R.; Leufkens, T.R.; Fonseca, P.; Aarts, R.M. Effects of between-and within-subject variability on autonomic cardiorespiratory activity during sleep and their limitations on sleep staging: A multilevel analysis. Comput. Intell. Neurosci. 2015, 2015, 583620. [Google Scholar] [CrossRef] [PubMed]
- Birrer, V.; Elgendi, M.; Lambercy, O.; Menon, C. Evaluating reliability in wearable devices for sleep staging. npj Digit. Med. 2024, 7, 74. [Google Scholar] [CrossRef] [PubMed]
- McDaid, C.; Griffin, S.; Weatherly, H.; Duree, K.; Van der Burgt, M.; Burgt, M.; van Hout, S.; Hout, S.; Akers, J.; Davies, R.; et al. Continuous positive airway pressure devices for the treatment of obstructive sleep apnoea-hypopnoea syndrome: A systematic review and economic analysis. Health Technol. Assess. 2009, 13, 1–146. [Google Scholar] [CrossRef]
- D’Rozario, A.L.; Hoyos, C.M.; Wong, K.K.; Unger, G.; Kim, J.W.; Vakulin, A.; Kao, C.H.; Naismith, S.L.; Bartlett, D.J.; Grunstein, R.R. Improvements in cognitive function and quantitative sleep electroencephalogram in obstructive sleep apnea after six months of continuous positive airway pressure treatment. Sleep 2022, 45, zsac013. [Google Scholar] [CrossRef] [PubMed]
- Frias, C.L.; Almeria, M.; Castejon, J.; Artero, C.; Caruana, G.; Elias-Mas, A.; Uscamaita, K.; Hawkins, V.; Ray, N.J.; Buongiorno, M.; et al. A Pilot Study: The Effect of CPAP Intervention on Sleep Architecture and Cognition in Alzheimer’s Disease Patients with Obstructive Sleep Apnea. Neurol. Int. 2025, 17, 147. [Google Scholar] [CrossRef]
- Wilckens, K.A.; Jeon, B.; Morris, J.L.; Buysse, D.J.; Chasens, E.R. Effects of continuous positive airway pressure treatment on sleep architecture in adults with obstructive sleep apnea and type 2 diabetes. Front. Hum. Neurosci. 2022, 16, 924069. [Google Scholar] [CrossRef] [PubMed]
- Li, Y.; Li, Q.; Zou, X.; Zhong, Z.; Ouyang, Q.; Zeng, Q.; Hu, Y.; Wang, M.; Luo, Y.; Yao, D. Effects of CPAP treatment on electroencephalographic activity in patients with obstructive sleep apnea syndrome during deep sleep: Preliminary findings of a cross-sectional study. Chronic Respir. Dis. 2023, 20, 14799731231215094. [Google Scholar] [CrossRef]
- Douglas, N.J.; White, D.P.; Pickett, C.K.; Weil, J.V.; Zwillich, C.W. Respiration during sleep in normal man. Thorax 1982, 37, 840–844. [Google Scholar] [CrossRef]
- Kirkness, J.P.; Schwartz, A.R.; Patil, S.P.; Pichard, L.E.; Marx, J.J.; Smith, P.L.; Schneider, H. Dynamic modulation of upper airway function during sleep: A novel single-breath method. J. Appl. Physiol. 2006, 101, 1489–1494. [Google Scholar] [CrossRef]
- Chen, H.Y.; Ahsan, M.; Zinchuk, A.V.; Husain, A.; Yaggi, H.K.; Wu, H.T.; Chen, C.Y. Automatic Sleep Staging from CPAP Airflow using a Dual Fusion Multi-Period Convolutional Neural Network. Physiol. Meas. 2026, 47, 045011. [Google Scholar] [CrossRef] [PubMed]
- Van de Perck, E.; Kazemeini, E.; Van den Bossche, K.; Willemen, M.; Verbraecken, J.; Vanderveken, O.M.; Op de Beeck, S. The effect of CPAP on the upper airway and ventilatory flow in patients with obstructive sleep apnea. Respir. Res. 2023, 24, 143. [Google Scholar] [CrossRef]
- Malik, J.; Lo, Y.L.; Wu, H.T. Sleep-wake classification via quantifying heart rate variability by convolutional neural network. Physiol. Meas. 2018, 39, 085004. [Google Scholar] [CrossRef] [PubMed]
- Wu, X.; Yang, J.; Pan, Y.; Zhang, X.; Luo, Y. Automatic sleep-stage scoring based on photoplethysmographic signals. Physiol. Meas. 2020, 41, 065008. [Google Scholar] [CrossRef]
- Korkalainen, H.; Aakko, J.; Duce, B.; Kainulainen, S.; Leino, A.; Nikkonen, S.; Afara, I.O.; Myllymaa, S.; Töyräs, J.; Leppänen, T. Deep learning enables sleep staging from photoplethysmogram for patients with suspected sleep apnea. Sleep 2020, 43, zsaa098. [Google Scholar] [CrossRef]
- Sridhar, N.; Shoeb, A.; Stephens, P.; Kharbouch, A.; Shimol, D.B.; Burkart, J.; Ghoreyshi, A.; Myers, L. Deep learning for automated sleep staging using instantaneous heart rate. npj Digit. Med. 2020, 3, 106. [Google Scholar] [CrossRef]
- Huttunen, R.; Leppänen, T.; Duce, B.; Oksenberg, A.; Myllymaa, S.; Töyräs, J.; Korkalainen, H. Assessment of obstructive sleep apnea-related sleep fragmentation utilizing deep learning-based sleep staging from photoplethysmography. Sleep 2021, 44, zsab142. [Google Scholar] [CrossRef] [PubMed]
- Strumpf, Z.; Gu, W.; Tsai, C.W.; Chen, P.L.; Yeh, E.; Leung, L.; Cheung, C.; Wu, I.C.; Strohl, K.P.; Tsai, T.; et al. Belun Ring (Belun Sleep System BLS-100): Deep learning-facilitated wearable enables obstructive sleep apnea detection, apnea severity categorization, and sleep stage classification in patients suspected of obstructive sleep apnea. Sleep Health 2023, 9, 430–440. [Google Scholar] [CrossRef]
- Long, X.; Foussier, J.; Fonseca, P.; Haakma, R.; Aarts, R.M. Analyzing respiratory effort amplitude for automated sleep stage classification. Biomed. Signal Process. Control 2014, 14, 197–205. [Google Scholar] [CrossRef]
- Tataraidze, A.; Anishchenko, L.; Korostovtseva, L.; Kooij, B.J.; Bochkarev, M.; Sviryaev, Y. Sleep stage classification based on respiratory signal. In Proceedings of the 2015 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC); IEEE: Piscataway, NJ, USA, 2015; pp. 358–361. [Google Scholar]
- Yang, J.; Keller, J.M.; Popescu, M.; Skubic, M. Sleep stage recognition using respiration signal. In Proceedings of the 2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC); IEEE: Piscataway, NJ, USA, 2016; pp. 2843–2846. [Google Scholar]
- Chen, H.Y.; Wu, H.T.; Chen, C.Y. Quality Aware Sleep Stage Classification over RIP Signals with Persistence Diagrams. In Proceedings of the 2023 IEEE 19th International Conference on Body Sensor Networks (BSN); IEEE: Piscataway, NJ, USA, 2023; pp. 1–4. [Google Scholar]
- Chung, Y.M.; Huang, W.K.; Wu, H.T. Topological data analysis assisted automated sleep stage scoring using airflow signals. Biomed. Signal Process. Control 2024, 89, 105760. [Google Scholar] [CrossRef]
- Manjunath, S.; Wu, H.T.; Sathyanarayana, A. Pediatric sleep staging from airflow signals via persistence curve approximations. Comput. Biol. Med. 2025, 195, 110513. [Google Scholar] [CrossRef] [PubMed]
- Chen, K.W.; Tseng, C.H.; Lee, H.C.; Liu, W.T.; Chou, K.T.; Wu, H.T. Validation of a fingertip home sleep apnea testing system using deep learning AI and a temporal event localization analysis. Sleep 2025, 48, zsae317. [Google Scholar] [CrossRef]
- Sharan, R.V.; Takeuchi, H.; Kishi, A.; Wang, J.; Watanabe, T.; Yamamoto, Y. Sleep staging using PPG-derived heart rate and accelerometer data from a smart ring with lightweight neural networks. IEEE Trans. Instrum. Meas. 2025, 74, 4020414. [Google Scholar] [CrossRef]
- Zhang, A.H.; He-Mo, A.; Yin, R.F.; Li, C.; Tang, Y.; Gurve, D.; van der Horst, V.; Buchman, A.S.; Ghahjaverestan, N.M.; Goubran, M.; et al. Mamba-based Deep Learning Approaches for Sleep Staging on a Wireless Multimodal Wearable System without Electroencephalography. arXiv 2024, arXiv:2412.15947. [Google Scholar]
- Kazemi, K.; Abiri, A.; Zhou, Y.; Rahmani, A.; Khayat, R.N.; Liljeberg, P.; Khine, M. Improved sleep stage predictions by deep learning of photoplethysmogram and respiration patterns. Comput. Biol. Med. 2024, 179, 108679. [Google Scholar] [CrossRef]
- Krauss, D.; Richer, R.; Küderle, A.; Jukic, J.; German, A.; Leutheuser, H.; Regensburger, M.; Winkler, J.; Eskofier, B.M. Incorporating respiratory signals for machine learning-based multimodal sleep stage classification: A large-scale benchmark study with actigraphy and heart rate variability. Sleep 2025, 48, zsaf091. [Google Scholar] [CrossRef] [PubMed]
- Chang, L.; Lu, J.; Wang, J.; Chen, X.; Fang, D.; Tang, Z.; Nurmi, P.; Wang, Z. SleepGuard: Capturing rich sleep information using smartwatch sensing data. Proc. ACM Interact. Mob. Wearable Ubiquitous Technol. 2018, 2, 1–34. [Google Scholar] [CrossRef]
- Han, F.; Yang, P.; Feng, Y.; Jiang, W.; Zhang, Y.; Li, X.Y. Earsleep: In-ear acoustic-based physical and physiological activity recognition for sleep stage detection. Proc. ACM Interact. Mob. Wearable Ubiquitous Technol. 2024, 8, 98. [Google Scholar] [CrossRef]
- Kazemi, K.; Azimi, I.; Khine, M.; Khayat, R.N.; Rahmani, A.M.; Liljeberg, P. Multimodal sleep stage and sleep apnea classification using vision transformer: A multitask explainable learning approach. In Proceedings of the 2025 47th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC); IEEE: Piscataway, NJ, USA, 2025; p. 43. [Google Scholar]
- Chriskos, P.; Frantzidis, C.A.; Nday, C.M.; Gkivogkli, P.T.; Bamidis, P.D.; Kourtidou-Papadeli, C. A review on current trends in automatic sleep staging through bio-signal recordings and future challenges. Sleep Med. Rev. 2021, 55, 101377. [Google Scholar] [CrossRef]
- Makowski, D.; Pham, T.; Lau, Z.J.; Brammer, J.C.; Lespinasse, F.; Pham, H.; Schölzel, C.; Chen, S.H.A. NeuroKit2: A Python toolbox for neurophysiological signal processing. Behav. Res. Methods 2021, 53, 1689–1696. [Google Scholar] [CrossRef]
- Bradley, G.R.; Kyriacou, P.A. Opening the envelope: Efficient envelope-based PPG denoising algorithm. Biomed. Signal Process. Control 2024, 88, 105693. [Google Scholar] [CrossRef]
- Zoccoli, G.; Amici, R. Sleep and autonomic nervous system. Curr. Opin. Physiol. 2020, 15, 128–133. [Google Scholar] [CrossRef]
- Stein, P.K.; Pu, Y. Heart rate variability, sleep and sleep disorders. Sleep Med. Rev. 2012, 16, 47–66. [Google Scholar] [CrossRef]
- Findley, L.J.; Wilhoit, S.C.; Suratt, P.M. Apnea duration and hypoxemia during REM sleep in patients with obstructive sleep apnea. Chest 1985, 87, 432–436. [Google Scholar] [CrossRef] [PubMed]
- Bakrania, K.; Yates, T.; Rowlands, A.V.; Esliger, D.W.; Bunnewell, S.; Sanders, J.; Davies, M.; Khunti, K.; Edwardson, C.L. Intensity thresholds on raw acceleration data: Euclidean norm minus one (ENMO) and mean amplitude deviation (MAD) approaches. PLoS ONE 2016, 11, e0164045. [Google Scholar] [CrossRef]
- Kiranyaz, S.; Ince, T.; Abdeljaber, O.; Avci, O.; Gabbouj, M. 1-D convolutional neural networks for signal processing applications. In Proceedings of the ICASSP 2019—2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP); IEEE: Piscataway, NJ, USA, 2019; pp. 8360–8364. [Google Scholar]
- Peever, J.; Fuller, P.M. The biology of REM sleep. Curr. Biol. 2017, 27, R1237–R1248. [Google Scholar] [CrossRef] [PubMed]
- Maragos, P.; Pessoa, L. Morphological filtering for image enhancement and detection. In The image and Video Processing Handbook; Academic Press: Cambridge, MA, USA, 1999; pp. 135–156. [Google Scholar]
- Coutrot, M.; Dudoignon, E.; Joachim, J.; Gayat, E.; Vallee, F.; Depret, F. Perfusion index: Physical principles, physiological meanings and clinical implications in anaesthesia and critical care. Anaesth. Crit. Care Pain Med. 2021, 40, 100964. [Google Scholar] [CrossRef]
- Guo, C.; Pleiss, G.; Sun, Y.; Weinberger, K.Q. On calibration of modern neural networks. In Proceedings of the International Conference on Machine Learning; PMLR: Cambridge, MA, USA, 2017; pp. 1321–1330. [Google Scholar]
- Naeini, M.P.; Cooper, G.; Hauskrecht, M. Obtaining well calibrated probabilities using bayesian binning. In Proceedings of the AAAI Conference on Artificial Intelligence, Austin, TX, USA, 25–30 January 2015; Volume 29. [Google Scholar]
- McArdle, N.; Douglas, N.J. Effect of continuous positive airway pressure on sleep architecture in the sleep apnea–hypopnea syndrome: A randomized controlled trial. Am. J. Respir. Crit. Care Med. 2001, 164, 1459–1463. [Google Scholar] [CrossRef]
- Grzęda-Hałon, M.; Poręba, M.; Gut, G.; Czerwińska, K.; Gać, P.; Martynowicz, H.; Mazur, G.; Poręba, R. The effect of CPAP therapy on heart rate variability in patients with obstructive sleep apnea. Sleep Biol. Rhythm. 2023, 21, 143–154. [Google Scholar] [CrossRef] [PubMed]
- Wang, Y.; Shi, C.; Luo, J.; Huang, R.; Xiao, Y. Rapid eye movement (REM)-related obstructive sleep apnea And hypertension: Insights from the clinical spectrum of apnea–hypopnea index ratios across REM and non-REM sleep stages in a Chinese cohort. J. Sleep Res. 2025, 34, e14418. [Google Scholar] [CrossRef] [PubMed]
- Cerina, L.; Fonseca, P.; Papini, G.B.; Vullings, R.; Overeem, S. Breathtaking dreams: Reduced REM phenotype in REM-related sleep apnea. Sleep Breath. 2025, 29, 87. [Google Scholar] [CrossRef] [PubMed]
- Ben Messaoud, R.; Baillieul, S.; Tamisier, R.; Terrail, R.; Bailly, S.; Joyeux-Faure, M.; Pépin, J.L. Digital sleep clinic: Assessing efficacy of continuous positive airway pressure through sleep staging via connected devices: A study protocol. BMC Digit. Health 2024, 2, 23. [Google Scholar] [CrossRef]
- Somiah, M.; Taxin, Z.; Keating, J.; Mooney, A.M.; Norman, R.G.; Rapoport, D.M.; Ayappa, I. Sleep quality, short-term and long-term CPAP adherence. J. Clin. Sleep Med. 2012, 8, 489–500. [Google Scholar] [CrossRef]
- Zhao, Y.Y.; Wang, R.; Gleason, K.J.; Lewis, E.F.; Quan, S.F.; Toth, C.M.; Morrical, M.; Rueschman, M.; Weng, J.; Ware, J.H.; et al. Effect of continuous positive airway pressure treatment on health-related quality of life and sleepiness in high cardiovascular risk individuals with sleep apnea: Best Apnea Interventions for Research (BestAIR) Trial. Sleep 2017, 40, zsx040. [Google Scholar] [CrossRef] [PubMed]









| Characteristic | Mean ± SD | Median (Q1–Q3) |
|---|---|---|
| Age | 52.8 ± 14.8 | 52.0 (41.0–65.25) |
| BMI | 39.4 ± 11.2 | 36.99 (30.28–44.58) |
| AHI 1A Rule | 57.0 ± 38.1 | 42.85 (32.65–62.15) |
| AHI 1B Rule | 42.1 ± 38.2 | 25.48 (19.83–49.96) |
| TST | 197.9 ± 54.6 | 196.0 (161.50–234.75) |
| Sex | N | % |
| Male | 18 | 56 |
| Female | 14 | 44 |
| Race | N | % |
| White | 23 | 72 |
| African American | 9 | 28 |
| Cardiovascular Comorbidities | N | % |
| Hypertension | 5 | 16 |
| Arrhythmia | 1 | 3 |
| Medication Use | N | % |
| Beta Blocker | 8 | 25 |
| Hypnotics | 4 | 13 |
| Antidepressants | 12 | 38 |
| Overall | Sensitivity | PPV | Specificity | F1 | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Model | Acc | F1 | Wake | Sleep | Wake | Sleep | Wake | Sleep | Wake | Sleep | |
| Direct Inference | |||||||||||
| TipTraQ | 0.823 | 0.429 | 0.713 | 0.454 | 0.930 | 0.650 | 0.855 | 0.930 | 0.454 | 0.534 | 0.891 |
| CPAP | 0.821 | 0.470 | 0.735 | 0.555 | 0.899 | 0.616 | 0.873 | 0.899 | 0.555 | 0.584 | 0.886 |
| Ensemble | 0.843 | 0.576 | 0.787 | 0.730 | 0.876 | 0.633 | 0.917 | 0.876 | 0.730 | 0.678 | 0.896 |
| 3-Fold Cross-Validation | |||||||||||
| TipTraQ | 0.757 ± 0.03 | 0.432 ± 0.02 | 0.710 ± 0.02 | 0.745 ± 0.07 | 0.758 ± 0.06 | 0.501 ± 0.02 | 0.906 ± 0.01 | 0.758 ± 0.06 | 0.745 ± 0.07 | 0.597 ± 0.01 | 0.823 ± 0.04 |
| CPAP | 0.833 ± 0.01 | 0.538 ± 0.02 | 0.769 ± 0.01 | 0.646 ± 0.04 | 0.890 ± 0.02 | 0.651 ± 0.01 | 0.890 ± 0.01 | 0.890 ± 0.02 | 0.646 ± 0.04 | 0.648 ± 0.02 | 0.890 ± 0.01 |
| Ensemble | 0.854 ± 0.01 | 0.613 ± 0.01 | 0.806 ± 0.00 | 0.753 ± 0.06 | 0.885 ± 0.03 | 0.678 ± 0.03 | 0.921 ± 0.01 | 0.885 ± 0.03 | 0.753 ± 0.06 | 0.711 ± 0.01 | 0.902 ± 0.01 |
| Overall | Sensitivity | PPV | Specificity | F1 | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Model | Acc | F1 | Wake | REM | NREM | Wake | REM | NREM | Wake | REM | NREM | Wake | REM | NREM | |
| Direct Inference | |||||||||||||||
| TipTraQ | 0.821 | 0.511 | 0.676 | 0.456 | 0.725 | 0.838 | 0.653 | 0.649 | 0.778 | 0.930 | 0.913 | 0.650 | 0.537 | 0.685 | 0.807 |
| CPAP | 0.807 | 0.452 | 0.636 | 0.551 | 0.428 | 0.859 | 0.612 | 0.702 | 0.741 | 0.897 | 0.960 | 0.566 | 0.580 | 0.532 | 0.795 |
| Ensemble | 0.843 | 0.587 | 0.730 | 0.676 | 0.692 | 0.822 | 0.641 | 0.720 | 0.829 | 0.889 | 0.940 | 0.756 | 0.658 | 0.706 | 0.826 |
| 3-Fold Cross-Validation | |||||||||||||||
| TipTraQ | 0.761 ± 0.02 | 0.416 ± 0.03 | 0.610 ± 0.02 | 0.745 ± 0.07 | 0.526 ± 0.02 | 0.631 ± 0.08 | 0.501 ± 0.02 | 0.541 ± 0.00 | 0.794 ± 0.01 | 0.758 ± 0.06 | 0.900 ± 0.01 | 0.776 ± 0.04 | 0.597 ± 0.01 | 0.533 ± 0.01 | 0.701 ± 0.05 |
| CPAP | 0.800 ± 0.01 | 0.486 ± 0.01 | 0.660 ± 0.01 | 0.646 ± 0.04 | 0.600 ± 0.00 | 0.753 ± 0.05 | 0.651 ± 0.01 | 0.541 ± 0.05 | 0.781 ± 0.01 | 0.890 ± 0.02 | 0.883 ± 0.02 | 0.713 ± 0.03 | 0.648 ± 0.02 | 0.568 ± 0.03 | 0.766 ± 0.02 |
| Ensemble | 0.827 ± 0.01 | 0.552 ± 0.01 | 0.704 ± 0.01 | 0.753 ± 0.06 | 0.585 ± 0.02 | 0.783 ± 0.06 | 0.678 ± 0.03 | 0.642 ± 0.03 | 0.803 ± 0.01 | 0.885 ± 0.03 | 0.925 ± 0.01 | 0.736 ± 0.04 | 0.711 ± 0.01 | 0.611 ± 0.01 | 0.791 ± 0.03 |
| Overall | Sensitivity | PPV | Specificity | F1 | |||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Model | Acc | F1 | W | R | L | D | W | R | L | D | W | R | L | D | W | R | L | D | |
| Direct Inference | |||||||||||||||||||
| TipTraQ | 0.782 | 0.369 | 0.539 | 0.520 | 0.669 | 0.580 | 0.421 | 0.550 | 0.667 | 0.674 | 0.270 | 0.912 | 0.923 | 0.719 | 0.814 | 0.535 | 0.668 | 0.624 | 0.329 |
| CPAP | 0.776 | 0.287 | 0.464 | 0.597 | 0.335 | 0.731 | 0.170 | 0.479 | 0.676 | 0.575 | 0.370 | 0.864 | 0.960 | 0.482 | 0.954 | 0.532 | 0.448 | 0.644 | 0.233 |
| Ensemble | 0.806 | 0.432 | 0.611 | 0.642 | 0.615 | 0.604 | 0.596 | 0.724 | 0.783 | 0.630 | 0.373 | 0.948 | 0.958 | 0.659 | 0.840 | 0.680 | 0.689 | 0.617 | 0.459 |
| 3-Fold Cross-Validation | |||||||||||||||||||
| TipTraQ | 0.801 ± 0.00 | 0.378 ± 0.02 | 0.542 ± 0.01 | 0.727 ± 0.05 | 0.322 ± 0.03 | 0.742 ± 0.07 | 0.334 ± 0.10 | 0.732 ± 0.02 | 0.640 ± 0.04 | 0.592 ± 0.01 | 0.399 ± 0.01 | 0.939 ± 0.01 | 0.954 ± 0.01 | 0.531 ± 0.06 | 0.921 ± 0.02 | 0.729 ± 0.03 | 0.427 ± 0.02 | 0.657 ± 0.04 | 0.356 ± 0.05 |
| CPAP | 0.802 ± 0.00 | 0.401 ± 0.02 | 0.497 ± 0.01 | 0.905 ± 0.03 | 0.479 ± 0.04 | 0.696 ± 0.03 | 0.045 ± 0.02 | 0.604 ± 0.04 | 0.599 ± 0.03 | 0.640 ± 0.01 | 0.160 ± 0.08 | 0.864 ± 0.01 | 0.918 ± 0.02 | 0.639 ± 0.06 | 0.963 ± 0.01 | 0.724 ± 0.03 | 0.530 ± 0.01 | 0.666 ± 0.01 | 0.069 ± 0.04 |
| Ensemble | 0.829 ± 0.00 | 0.465 ± 0.03 | 0.558 ± 0.02 | 0.877 ± 0.05 | 0.451 ± 0.07 | 0.809 ± 0.05 | 0.118 ± 0.03 | 0.744 ± 0.01 | 0.665 ± 0.02 | 0.658 ± 0.02 | 0.312 ± 0.04 | 0.929 ± 0.02 | 0.942 ± 0.01 | 0.609 ± 0.08 | 0.958 ± 0.01 | 0.804 ± 0.02 | 0.534 ± 0.04 | 0.724 ± 0.01 | 0.169 ± 0.03 |
| Overall | Deep Sleep (N3 Stage) Specific | ||||||
|---|---|---|---|---|---|---|---|
| Model | Acc | F1 | Sensitivity | PPV | Specificity | F1 | |
| Direct Inference | |||||||
| TipTraQ | 0.748 (0.693–0.793) | 0.281 (0.159–0.414) | 0.472 (0.340–0.569) | 0.490 (0.229–0.746) | 0.274 (0.141–0.402) | 0.823 (0.754–0.892) | 0.346 (0.174–0.508) |
| CPAP | 0.801 (0.773–0.828) | 0.343 (0.235–0.454) | 0.470 (0.378–0.562) | 0.171 (0.026–0.344) | 0.283 (0.103–0.492) | 0.967 (0.932–0.995) | 0.175 (0.045–0.327) |
| Ensemble | 0.810 (0.768–0.842) | 0.411 (0.286–0.539) | 0.561 (0.471–0.657) | 0.475 (0.198–0.747) | 0.287 (0.136–0.433) | 0.849 (0.788–0.912) | 0.353 (0.167–0.560) |
| 3-Fold Cross-Validation | |||||||
| TipTraQ | 0.793 (0.731–0.831) | 0.343 (0.220–0.456) | 0.458 (0.338–0.559) | 0.249 (0.054–0.498) | 0.259 (0.060–0.493) | 0.955 (0.911–0.992) | 0.235 (0.038–0.456) |
| CPAP | 0.813 (0.789–0.836) | 0.391 (0.268–0.498) | 0.488 (0.394–0.570) | 0.087 (0.017–0.188) | 0.276 (0.059–0.550) | 0.968 (0.925–1.000) | 0.119 (0.027–0.243) |
| Ensemble | 0.836 (0.815–0.857) | 0.443 (0.326–0.554) | 0.532 (0.452–0.617) | 0.143 (0.013–0.333) | 0.223 (0.017–0.453) | 0.972 (0.933–0.995) | 0.169 (0.019–0.366) |
| 0.0 | 0.1 | 0.2 | 0.3 | 0.4 | 0.5 | 0.6 | 0.7 | 0.8 | 0.9 | 1.0 | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Acc | 0.812 | 0.824 | 0.829 | 0.835 | 0.838 | 0.840 | 0.831 | 0.818 | 0.806 | 0.796 | 0.754 |
| 0.498 | 0.524 | 0.535 | 0.550 | 0.560 | 0.568 | 0.550 | 0.519 | 0.495 | 0.473 | 0.391 | |
| F1 | 0.668 | 0.685 | 0.693 | 0.703 | 0.710 | 0.712 | 0.697 | 0.676 | 0.659 | 0.645 | 0.587 |
| Confidence Level | Acc | F1 | Confidence | Epoch Count |
|---|---|---|---|---|
| High (0.8–1.0) | 0.893 | 0.837 | 0.946 | 8416 |
| Intermediate (0.6–0.8) | 0.649 | 0.647 | 0.699 | 4251 |
| Low (0.3–0.6) | 0.501 | 0.492 | 0.523 | 3502 |
| Probability Range | Acc | Confidence | Epoch Count |
|---|---|---|---|
| 0.3–0.4 | 0.337 | 0.377 | 199 |
| 0.4–0.5 | 0.430 | 0.460 | 1004 |
| 0.5–0.6 | 0.530 | 0.548 | 2498 |
| 0.6–0.7 | 0.585 | 0.650 | 2142 |
| 0.7–0.8 | 0.714 | 0.749 | 2109 |
| 0.8–0.9 | 0.819 | 0.850 | 2076 |
| 0.9–1.0 | 0.917 | 0.977 | 6340 |
| Final ECE: 0.045 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Chen, H.-Y.; Husain, A.; Zinchuk, A.V.; Yaggi, H.K.; Ahsan, M.; Chen, C.-Y.; Pokusa, S.; Wu, H.-T. Sleep Stage Classification During CPAP Therapy from CPAP-Airflow and Wearable Fingertip Signals. Sensors 2026, 26, 3720. https://doi.org/10.3390/s26123720
Chen H-Y, Husain A, Zinchuk AV, Yaggi HK, Ahsan M, Chen C-Y, Pokusa S, Wu H-T. Sleep Stage Classification During CPAP Therapy from CPAP-Airflow and Wearable Fingertip Signals. Sensors. 2026; 26(12):3720. https://doi.org/10.3390/s26123720
Chicago/Turabian StyleChen, Hsin-Yu, Aatif Husain, Andrey V. Zinchuk, Henry K. Yaggi, Muneeb Ahsan, Cheng-Yao Chen, Shirah Pokusa, and Hau-Tieng Wu. 2026. "Sleep Stage Classification During CPAP Therapy from CPAP-Airflow and Wearable Fingertip Signals" Sensors 26, no. 12: 3720. https://doi.org/10.3390/s26123720
APA StyleChen, H.-Y., Husain, A., Zinchuk, A. V., Yaggi, H. K., Ahsan, M., Chen, C.-Y., Pokusa, S., & Wu, H.-T. (2026). Sleep Stage Classification During CPAP Therapy from CPAP-Airflow and Wearable Fingertip Signals. Sensors, 26(12), 3720. https://doi.org/10.3390/s26123720

