An Adaptive Sticky Hidden Markov Model for Robust State Inference in Non-Stationary Physiological Time Series

Wang, Qizheng; Wang, Yuping; Zhao, Shuai; Wu, Yuhan; Li, Shengjie

doi:10.3390/math14071107

Open AccessFeature PaperArticle

An Adaptive Sticky Hidden Markov Model for Robust State Inference in Non-Stationary Physiological Time Series

by

Qizheng Wang

,

Yuping Wang

,

Shuai Zhao

^*

,

Yuhan Wu

and

Shengjie Li

State Key Laboratory of Network and Switching Technology, Beijing University of Posts and Telecommunications, Bejing 100876, China

^*

Author to whom correspondence should be addressed.

Mathematics 2026, 14(7), 1107; https://doi.org/10.3390/math14071107

Submission received: 23 February 2026 / Revised: 23 March 2026 / Accepted: 24 March 2026 / Published: 25 March 2026

(This article belongs to the Special Issue Machine Learning and Graph Neural Networks)

Download

Browse Figures

Versions Notes

Abstract

The accurate inference of hidden states from non-stationary physiological signals remains a significant challenge in stochastic process modeling. This paper proposes an Adaptive Sticky Hidden Markov Model (Sticky-HMM) framework designed to enhance the robustness of state decoding in noisy environments. To address the “state-flickering” issue inherent in traditional HMMs, we incorporate a “Sticky” parameter into the transition matrix, imposing a temporal penalty on spurious state switching to maintain continuity. Furthermore, we introduce a Dynamic Prior Strategy that adaptively calibrates self-transition probabilities by mapping frequency-domain features of the observed sequence to the model’s parameter space. The proposed decoding process employs a two-pass refinement strategy and the Viterbi algorithm in the logarithmic domain to ensure numerical stability. The model’s efficacy was validated using a high-fidelity dataset of simulated apnea events. This work provides a computationally efficient and mathematically rigorous approach that demonstrates strong potential for long-term respiratory health monitoring.

Keywords:

mmWave radar; hidden Markov model; sleep apnea; multi-dimensional features; non-contact sensing

MSC:

60J20

1. Introduction

Obstructive sleep apnea (OSA) is a prevalent sleep-related breathing disorder characterized by the repetitive partial or complete collapse of the upper airway during sleep. Clinical guidelines define OSA as breathing pauses lasting at least 10 s, which remains a significant global health challenge, affecting nearly 1 billion adults aged 30–69 years worldwide [1,2]. The American Heart Association (AHA) has identified OSA as an independent risk factor for several cardiovascular comorbidities, including resistant hypertension, atrial fibrillation, and heart failure [3]. Recent evidence from large-scale meta-analyses underscores that untreated OSA significantly elevates the risk of all-cause mortality and is specifically linked to an increased risk of sudden cardiac death, with a pooled odds ratio as high as 3.87 in untreated patients [4]. In addition, the condition frequently induces chronic fatigue, excessive daytime sleepiness, and cognitive impairment, which markedly degrade patients’ quality of life [5].

Currently, overnight in-laboratory multichannel polysomnography (PSG) is recognized as the established “gold standard” for the definitive diagnosis and severity grading of obstructive sleep apnea (OSA) [5]. However, the use of multiple tubes and a chest/abdomen strap in PSG places a significant physiological burden on subjects and is costly, making widespread PSG screening difficult. These limitations contribute to a high global rate of underdiagnosis, with estimates of undetected cases exceeding 80% in certain populations, rendering PSG impractical for large-scale population screening or longitudinal home monitoring [3,6]. To address the above issues, various portable monitoring solutions have been developed. For instance, dedicated wearable devices have been designed to capture respiratory pressure signals using a pressure sensor, which can calculate essential indicators such as respiratory rate, inspiratory and expiratory time, and apnea–hypopnea index (AHI) in real time. However, they still require physical contact with the subjects, which may cause discomfort and introduce signal artifacts related to sleeping posture or mattress compression [7]. Furthermore, while commercially available wearable digital devices are increasingly utilized in population-based research due to their low cost and ubiquity, issues concerning data privacy, limited data access, and low user adherence continue to hinder their effectiveness for reliable, long-term data acquisition [8]. Therefore, a non-invasive, comfortable, and efficient alternative to PSG is needed in both medical and home settings to improve cardiovascular risk stratification and increase early diagnosis rates [3,6].

In recent years, millimeter-wave (mmWave) radar technology has emerged as a transformative solution for non-invasive physiological monitoring. Unlike optical sensors that raise significant privacy concerns [9] or contact sensors requiring physical attachment, mmWave radar is capable of detecting sub-millimeter chest displacements caused by cardiorespiratory activity through clothing and bedding without disrupting the user’s natural sleep environment [10,11].

Building on the above limitations, researchers have actively explored mmWave radar for detecting sleep-related breathing disorders (SRBDs). For instance, Jung et al. [12] proposed a method to extract respiratory parameters from FMCW radar signals, achieving high efficacy in monitoring sleep apnea events. However, their work primarily focused on event detection and did not quantitatively assess the temporal accuracy of specific event occurrences. Conversely, data-driven approaches, such as the hybrid CNN–Transformer architecture introduced by Choi et al. [13], have achieved significant success in diagnosing OSA. Nevertheless, while these deep learning models offer powerful feature representation capabilities, they typically require large-scale training datasets and substantial computational resources. From a methodological perspective, the proposed Sticky-HMM offers an unsupervised and lightweight alternative; however, its relative empirical performance compared to state-of-the-art deep learning architectures remains to be quantified in future head-to-head studies.

To achieve high-precision apnea detection while maintaining a lightweight computational framework, this article proposes an Adaptive Sticky Hidden Markov Model (HMM) framework. This approach integrates multi-dimensional characteristics of distinct signal states and introduces a dynamic mechanism to adaptively calibrate the HMM transition probabilities. By fusing these physiological features with adaptive probabilistic modeling, the proposed method significantly enhances the robustness and accuracy of respiratory state inference. Our main contributions are summarized as follows:

We propose a complete apnea detection framework based on the Adaptive Sticky Hidden Markov Model (HMM) which encompasses the entire signal chain, from raw data acquisition and signal processing to multi-dimensional feature extraction and final event inference.
We construct a high-fidelity mmWave radar dataset involving 15 subjects, strictly synchronized with a respiratory belt as the ground truth (GT). The dataset covers diverse experimental scenarios, including three standoff distances (1 m, 1.5 m, 2 m) and distinct apnea durations (10 s and 15 s), providing a valuable resource for future research in non-contact physiological monitoring.
Experimental results in our dataset demonstrate that our method achieves better performance than other baseline methods and exhibits high consistency with the ground truth.

2. Methods

2.1. Signal Preprocessing

As illustrated in Stage 1 of Figure 1, the received raw radar ADC data is reorganized into a fast-time × slow-time matrix. We perform a Range-FFT on each chirp to generate the Range Profile matrix

R (k, m)

, where k is the range bin index and m is the slow-time frame index.

To robustly lock onto the chest wall and mitigate body shifts, we employ a smoothing tracker. The target range bin

k_{t a r g e t}^{(m)}

is updated dynamically:

k_{t a r g e t}^{(m)} = round (α \cdot k_{t a r g e t}^{(m - 1)} + (1 - α) \cdot \underset{k}{arg max} | R (k, m) |)

(1)

where

α

is the smoothing factor and

| R (k, m) |

denotes the magnitude at range index k of the m-th frame after Range-FFT processing.

After identifying the target range bin, the phase information is extracted from the complex signal using the arctangent function [14]. However, the raw phase is restricted to the principal range of

[- π, π)

, leading to artificial sawtooth-like discontinuities when the chest wall displacement exceeds the radar wavelength. To reconstruct the actual continuous movement of the chest, a phase unwrapping algorithm is applied. This process detects phase jumps between consecutive samples and compensates for them by adding or subtracting integer multiples of

2 π

, thereby restoring a smooth and continuous respiratory trajectory.

Finally, a 4th-order Butterworth bandpass filter with cutoff frequencies of

[0.08, 0.6] Hz

is applied to isolate the respiratory component, effectively suppressing cardiac signals (>0.8 Hz) and low-frequency drift (<0.05 Hz).

2.2. Multi-Dimensional Feature Fusion and Normalization

To capture the intricate dynamics of respiratory cessation, a three-dimensional feature vector

F_{m} = {[f_{1, m}, f_{2, m}, f_{3, m}]}^{T}

is constructed for each slow-time sample m, corresponding to the feature extraction process illustrated in Stage 2 of Figure 1:

Phase Envelope ( $f_{1}$ ): The Hilbert transform is utilized to extract the analytical signal of the phase, and its magnitude is smoothed to reflect the overall respiratory energy:

$f_{1, m} = MA (| Hilbert (ϕ_{b p} (m)) |, w_{1})$

(2)
Velocity ( $f_{2}$ ): The first derivative of the phase represents the rate of chestwall movement:

$d_{1} (m) = \frac{\partial ϕ_{b p} (m)}{\partial t}$

(3)

$f_{2, m} = MA (| d_{1} (m) |, w_{2})$

(4)
Curvature ( $f_{3}$ ): The second derivative is highly sensitive to the sudden onset and offset of apnea events:

$d_{2} (m) = \frac{\partial d_{1} (m)}{\partial t}$

(5)

$f_{3, m} = MA (| d_{2} (m) |, w_{3})$

(6)

where $MA (\cdot, w)$ denotes a moving average filter with window size w.

To ensure distance-invariant performance, features are normalized using the Median Absolute Deviation (MAD) instead of standard Z-score normalization:

z_{k, m} = \frac{f_{k, m} - median (f_{k})}{1.4826 \cdot MAD (f_{k})}

(7)

This robust scaling suppresses the influence of outliers and maintains a stable feature distribution across varying Signal-to-Noise Ratio (SNR) environments.

2.3. Adaptive Sticky-HMM-Based State Decoding

We model the respiratory process as a two-state Hidden Markov Model (HMM) with state space

S = {S_{A}, S_{B}}

, representing apnea and normal breathing, respectively.

To eliminate the fragmentation effect caused by transient noise, we incorporate a “Sticky” parameter [15] into the transition matrix

A

:

A = [\begin{matrix} p_{A A} & 1 - p_{A A} \\ 1 - p_{B B} & p_{B B} \end{matrix}]

(8)

The self-transition probabilities

p_{i i}

are derived from the expected physiological duration

τ_{i}

of each state:

p_{i i} = max (ϵ, 1 - \frac{1}{τ_{i} \cdot f_{s}}), i \in {A, B}

(9)

This formulation imposes a temporal penalty on state switching, effectively forcing the model to maintain state continuity unless significant feature evidence suggests otherwise. The observation features are assumed to follow a diagonal Gaussian distribution for each state:

P (z_{m} | S_{i}) = \frac{1}{\sqrt{{(2 π)}^{3} | Σ_{i} |}} exp (- \frac{1}{2} {(z_{m} - μ_{i})}^{T} Σ_{i}^{- 1} (z_{m} - μ_{i}))

(10)

where the parameters

{μ_{i}, Σ_{i}}

are statistically estimated from initial state masks.

To handle subject-specific variability and different apnea durations, an Adaptive Prior Strategy is implemented.

The respiratory prior is dynamically updated by identifying the peak frequency (

f_{p e a k}

) of the power spectral density (PSD) calculated via Welch’s method:

f_{p e a k} = arg max_{f \in [0.08, 0.6]} PSD (f)

(11)

τ_{B} = \frac{1}{f_{p e a k}}

(12)

The proposed decoding process employs a two-pass refinement strategy to optimize state estimation. In the initial pass, the sequence is decoded using a preliminary apnea prior,

τ_{A, i n i t}

. To enhance the precision of the temporal boundaries, a refinement stage is introduced wherein the median duration of the segments identified in the first pass is utilized to adaptively update

τ_{A}

. Subsequently, a second pass is executed; here, the transition matrix is reconstructed based on the refined prior, facilitating a final high-precision decoding of the respiratory states.

The optimal state sequence, denoted as

\hat{S}

, is inferred via the Viterbi algorithm implemented in the logarithmic domain to ensure numerical stability and mitigate potential underflow issues. The recursive update is defined as

δ_{t} (j) = max_{i} [δ_{t - 1} (i) + ln a_{i j}] + ln P (z_{t} | S_{j})

(13)

where

δ_{t} (j)

represents the highest log-probability of a state sequence ending at state

S_{j}

at time t;

a_{i j}

denotes the transition probability from state i to state j; and

P (z_{t} | S_{j})

is the emission probability of the feature vector

z_{t}

given state

S_{j}

.

To ensure the decoded sequences align with standardized clinical criteria, a postprocessing refinement stage is subsequently applied. Specifically, a bridging heuristic is employed to merge transient gaps shorter than 2 s, followed by a pruning operation to eliminate spurious segments that fail to satisfy the minimum duration constraint,

T_{a p n e a}

. This dual-filtering approach guarantees that the final output strictly adheres to the temporal definitions of clinical apnea events.

3. Experiment Settings

3.1. Hardware and Software Configuration

For raw radar data collection, the TI IWR6843ISK [16] radar sensor (Texas Instruments, Dallas, TX, USA) was utilized in conjunction with a DCA1000EVM [17] data capture card (Texas Instruments, Dallas, TX, USA). The system was configured to operate in a Single-Transmit, Four-Receive (1T4R) mode. Parameter configuration and data capture were controlled via the mmWave Studio software (v02.01.01.00). Detailed radar parameters are summarized in Table 1.

To acquire respiratory-induced chest displacement signals synchronized with the radar data, an ElasTech wireless respiratory belt (Ningbo Elastech Co., Ltd., Ningbo, China) [18] was employed to provide high-precision reference respiration waveforms. During the data acquisition process, signals from both devices were synchronized using high-precision timestamps. All signal processing tasks and algorithm implementations were conducted using the MATLAB R2025a platform.

3.2. Participants and Protocol

A total of 15 volunteers (aged 20–30 years) were recruited for the raw data acquisition process. To ensure signal integrity and minimize clutter interference, all experiments were conducted in a spacious indoor environment cleared of extraneous objects that might affect the radar backscatter. During the measurements, participants maintained a stationary posture with steady breathing. The radar sensor was aligned directly with the subject’s chest wall, as illustrated in Figure 2.

To enhance the reliability of the experimental results, data were acquired at three distinct sensing distances: 1 m, 1.5 m, and 2 m. Each subject completed two controlled apnea trials per distance condition—one of 10 s and one of 15 s duration—yielding 6 trials per subject across all distances. Across the 15 subjects, this resulted in a total of 90 apnea events (15 subjects × 2 durations × 3 distances), with 30 events per distance condition, ensuring a balanced distribution across sensing distances. Subjects were guided by computer-based visual prompts to initiate and terminate each simulated apnea event, and all radar recordings were synchronized with the respiratory belt ground truth via high-precision timestamps. The resulting dataset comprised a total of 64 min of respiratory signals, including 90 distinct apnea events after quality screening.

The feasibility of using seated, voluntary breath-holding as a proxy for obstructive sleep apnea (OSA) rests on the morphological equivalence of the chest wall motion signal. From a radar sensing perspective, the primary signature of an apnea event—whether obstructive or voluntary—is the cessation of rhythmic thoracic displacement. By simulating apnea in a controlled, high-SNR setting, we establish a rigorous baseline for validating the algorithm’s temporal boundary precision before clinical translation. As the proposed Adaptive Sticky-HMM is a training-free method whose parameters are estimated directly from each test signal, no cross-subject data splitting is required; cross-subject generalizability is inherent to the unsupervised design of the method.

4. Results

4.1. Apnea Event Detection Performance

To comprehensively evaluate the performance of our proposed method, we reproduced three baseline methods for comparison, namely the threshold-based method, the peak-based method, and the Hidden Markov Model (HMM) method [19,20,21,22].

Figure 3 illustrates the performance of various detection methods compared against the ground truth (GT) obtained from the respiratory belt. Among all evaluated approaches, our proposed method demonstrates the highest precision in event boundary estimation. Specifically, while the GT recorded two distinct apnea events during 21.25–32.20 s and 60.70–76.10 s, our proposed Adaptive Sticky-HMM successfully identified these occurrences at 21.20–31.50 s and 61.35–76.10 s, respectively. The results exhibit a precise degree of consistency with the GT, with temporal offsets remaining well within a minimal margin, thereby validating the precision of our approach.

In contrast, both the peak-based and traditional HMM-based methods suffer from significant inaccuracies and temporal fragmentation. For the first apnea event, these two methods exhibited imprecise boundaries of 19.75–26.95 s and 21.15–29.30 s, respectively, both failing to fully capture the actual duration. This instability became more pronounced during the second event, where both approaches erroneously partitioned a single apnea event into fragmented segments. Specifically, the peak-based method divided the event into 60.40–66.85 s and 70.90–80.05 s, while the traditional HMM exhibited similar instability, with boundaries at 61.35–65.80 s and 70.05–76.55 s. Although the threshold-based method correctly identified the occurrence of both events at 22.40–32.05 s and 60.50–77.80 s, it exhibited non-negligible errors in estimating exact durations. For instance, the delayed onset and offset discrepancies in the first event led to a less robust performance compared to the proposed Adaptive Sticky-HMM, which maintained superior temporal alignment across all cases.

Table 2 summarizes the Mean Absolute Error (MAE) of the four evaluated methods across sensing distances ranging from 1 m to 2 m. Our proposed Adaptive Sticky-HMM method demonstrates superior precision and robustness across all test intervals. At the initial distance of 1 m, the Adaptive Sticky-HMM method achieves a minimal MAE of 0.77 s, which is significantly lower than the traditional HMM-based (1.29 s), threshold-based (1.53 s), and peak-based (1.80 s) methods. As the range extends to the intermediate distance of 1.5 m, the Adaptive Sticky-HMM method maintains high stability with an MAE of 1.03 s, whereas the errors for the baseline methods increase notably to 1.62 s, 2.24 s, and 2.53 s, respectively. Even at the maximum test distance of 2 m, the Adaptive Sticky-HMM preserves an MAE of 1.18 s, which remains lower than the error of the traditional HMM-based method at a much closer 1 m range (1.29 s).

The performance disparity between the Adaptive Sticky-HMM method and the three baseline methods becomes more pronounced at greater distances. The peak-based method exhibits the highest sensitivity to distance, with its MAE escalating to 3.38 s at 2 m—a nearly twofold increase compared to its 1 m performance. Similarly, the threshold-based method shows substantial degradation, reaching an MAE of 2.65 s at the 2 m mark. Overall, the Adaptive Sticky-HMM method achieves an average MAE of 0.99 s, providing an error reduction of 40.7% compared to the traditional HMM-based method (1.67 s) and outperforming the threshold-based (2.14 s) and peak-based (2.57 s) approaches by a substantial margin. These results quantitatively confirm the effectiveness of the Adaptive Sticky-HMM method in mitigating environmental noise and state-switching instabilities in long-range sensing scenarios.

4.2. Statistical Analysis of Duration Error

To validate the assumption of approximate event-level independence, the Intraclass Correlation Coefficient (ICC2) was computed for each method–distance combination. ICC2 is defined under the two-way random-effects model as

ICC 2 = \frac{M S_{R} - M S_{E}}{M S_{R} + (k - 1) M S_{E} + \frac{k}{n} (M S_{C} - M S_{E})}

(14)

where

M S_{R}

,

M S_{C}

, and

M S_{E}

denote the between-subject, between-rater, and error mean squares, respectively;

n = 15

is the number of subjects; and

k = 2

is the number of repeated measurements per subject (corresponding to the 10 s and 15 s apnea trials). As shown in Table 3, all ICC values fall below 0.5, indicating that between-subject variance is negligible relative to event-level variability. Consequently, individual apnea events are treated as the unit of statistical inference, consistent with the event-based nature of clinical apnea evaluation.

To formally assess whether the proposed Adaptive Sticky-HMM achieves statistically superior performance over the baseline methods, two levels of hypothesis testing were conducted on the event-level absolute duration errors.

First, a Friedman test was applied at each sensing distance to evaluate the overall difference among the four methods. As shown in Table 4, the differences are highly significant at all three distances (all

p < 0.001

), confirming that the observed performance hierarchy is not attributable to chance.

Post-hoc Wilcoxon signed-rank tests further confirmed that the proposed Adaptive Sticky-HMM significantly outperformed all baseline methods across all distances. Specifically, for each pairwise comparison between the proposed method and baseline methods, p-values were consistently below 0.001, indicating strong statistical significance (see Table 5).

Figure 4 illustrates the comparative boxplots of absolute duration errors for the four radar-based methods against the respiratory belt ground truth (GT).

Precision and Stability at Close Range (1 m): At a distance of 1 m, the Adaptive Sticky-HMM achieves a median error of 0.80 s with a compact IQR of 0.40 s. In comparison, the traditional HMM-based method yields a median error of 1.30 s, while the threshold-based and peak-based methods exhibit significantly higher median errors of 1.58 s and 1.83 s, respectively. Notably, the Adaptive Sticky-HMM provides a 38.5% reduction in median error relative to the traditional HMM-based approach.
Resilience to Intermediate and Long Ranges (1.5 m and 2 m): As the sensing range extends, the Adaptive Sticky-HMM demonstrates remarkable resilience to the degradation of the Signal-to-Noise Ratio (SNR). At 1.5 m, it maintains a median error of 1.05 s and a stable IQR of 0.39 s, whereas the peak-based method’s median error escalates to 2.55 s. At the maximum test distance of 2 m, the Adaptive Sticky-HMM preserves a median error of 1.20 s. In contrast, all three baseline methods exhibit median errors exceeding 2.10 s, with the peak-based method reaching 3.33 s.
Consistency Analysis: The Adaptive Sticky-HMM is the only method that maintains a median error below 1.50 s across all tested scenarios. Furthermore, the 75th percentile of the Adaptive Sticky-HMM error at 2 m remains lower than the 25th percentiles of both the threshold-based and peak-based methods at the same distance.

To evaluate systematic bias and the limits of agreement (LoAs), a Bland–Altman analysis was performed for the Adaptive Sticky-HMM across all experimental scenarios, as shown in Figure 5.

Unbiased Estimation: The system demonstrates negligible systematic bias regardless of the sensing distance. The mean bias remains near zero across all ranges: −0.07 s at 1 m, 0.09 s at 1.5 m, and −0.04 s at 2 m.
Agreement Limits and Reliability: At 1 m, the 95% LoAs are tightly constrained within [−1.22, 1.09] s, with a span of 2.31 s. Although the LoAs expand as the distance increases—reaching [−1.89, 1.81] s at 2 m with a span of 3.70 s—the vast majority of detection samples remain within these confidence boundaries.
Subgroup Analysis (10 s vs. 15 s Events): The system exhibits consistent performance across different apnea durations. At 2 m, the biases for 10 s events (−0.11 s) and 15 s events (0.03 s) remain minimal, with nearly identical standard deviations (SD ≈ 0.95 s for both groups).

5. Discussion

5.1. Impact of Sensing Distance

A primary challenge in millimeter-wave radar monitoring is distance. As evidenced in Table 2 and Figure 4, the Mean Absolute Error (MAE) and median error for all methods exhibit positive correlations with sensing distance. This trend is primarily due to the decrease in the Signal-to-Noise Ratio (SNR) at longer ranges, where environmental noise and subtle body movements interfere with the extracted respiratory phase. At the 2 m mark, traditional approaches like the peak-based method showed the highest sensitivity, with the median error escalating to 3.33 s and the IQR expanding to 1.10 s. This indicates that without a robust state-transition mechanism, noise-induced fluctuations are frequently misinterpreted as breathing activity, leading to the “fragmentation” of detected events.

5.2. Effectiveness of the Sticky Mechanism in State Estimation

The superiority of the Adaptive Sticky-HMM over the traditional HMM-based method is rooted in its ability to suppress spurious state transitions. Traditional Hidden Markov Models often suffer from “state-flickering” in low-SNR environments because they lack a prior bias toward state persistence. In our results, while the traditional HMM-based method outperformed the other two baselines, its median error still increased from 1.30 s at 1 m to 2.15 s at 2 m.

In contrast, the Adaptive Sticky-HMM effectively mitigates this by favoring the current state unless significant evidence of a transition is present. This is reflected in the stable Interquartile Range (IQR), which only grew from 0.40 s at 1 m to 0.60 s at 2 m. The compact IQR confirms that the “Sticky” parameter provides a crucial stabilizing effect, ensuring that the detected apnea boundaries remain continuous and precise despite signal fluctuations.

5.3. Statistical Reliability and Clinical Versatility

The Bland–Altman analysis in Figure 5 further confirms the clinical potential of the Adaptive Sticky-HMM. The mean bias remained negligible across all distances: −0.07 s at 1 m, 0.09 s at 1.5 m, and −0.04 s at 2 m. This near-zero bias indicates that the system does not systematically over- or under-estimate the duration of apnea events.

Furthermore, the subgroup analysis (10 s vs. 15 s events) reveals that the system’s performance is independent of the apnea duration. At 2 m, the biases for 10 s events (−0.11 s) and 15 s events (0.03 s) were remarkably similar, with nearly identical standard deviations of approximately 0.95 s. This stability across different event lengths suggests that the Adaptive Sticky-HMM holds promise for varying clinical scenarios, pending further validation with diverse patient populations and clinical OSA data. By maintaining sub-second average precision (0.99 s overall average), the system represents a significant step toward bridging the gap between contact-based sensors and non-contact radar monitoring in controlled settings.

6. Conclusions

In this study, we presented a robust, non-contact framework for sleep apnea detection utilizing mmWave FMCW radar and a novel Adaptive Sticky-HMM algorithm. The proposed approach integrates multi-dimensional signal features with a dynamic sticky transition mechanism, effectively overcoming the inherent sensitivity of traditional heuristic and standard probabilistic models to environmental noise. The framework was validated through extensive experiments involving 15 subjects across three sensing distances (1 m, 1.5 m, and 2 m), achieving an overall Mean Absolute Error (MAE) of 0.99 s and significantly outperforming baseline threshold-based, peak-based, and standard HMM approaches. Formal statistical evaluations—including Wilcoxon signed-rank tests with Bonferroni correction, boxplot analysis, and Bland–Altman agreement analysis—demonstrated high concordance with ground-truth measurements and distance-invariant robustness, with minimal systematic bias even under reduced-SNR conditions at a 2 m sensing range.

Although the proposed method is benchmarked against classical signal processing baselines in this work, recent studies have explored machine learning and deep learning approaches for radar-based respiratory monitoring. For instance, Choi et al. [13] proposed a hybrid CNN–Transformer architecture that achieved strong OSA diagnostic performance from radar signals, and Jung et al. [12] demonstrated effective apnea event detection using learned features from FMCW radar data. While such data-driven models benefit from their capacity to capture complex nonlinear patterns, they typically require large-scale labeled training datasets and substantial computational resources, which limits their applicability in resource-constrained or data-scarce deployment scenarios. The proposed Adaptive Sticky-HMM occupies a complementary niche: it requires no training data, relies exclusively on physiologically motivated priors with adaptive parameter estimation, and maintains a lightweight computational footprint well-suited for edge deployment in long-term home monitoring. A direct quantitative comparison with deep learning approaches remains an important direction for future investigation, pending the availability of large-scale, publicly accessible radar-based apnea datasets.

Several limitations of the current study warrant acknowledgement. First, the evaluation is based on 15 healthy volunteers performing voluntary breath-holding in a controlled seated posture, rather than clinical OSA patients monitored during natural sleep. Although voluntary breath-holding produces the same radar-observable biomechanical signature as obstructive apnea—namely, the cessation of rhythmic thoracic displacement—real-world OSA involves additional physiological complexity, including hypopnea (partial airway obstruction with attenuated rather than absent chest motion), involuntary respiratory effort against a closed airway, and variable sleeping postures that alter the radar-to-chest geometry and signal characteristics. The controlled experimental design was deliberately chosen to establish a high-SNR validation baseline for rigorously assessing the algorithm’s temporal boundary precision, consistent with established practice in radar-based physiological monitoring research. Second, the study cohort comprises subjects within a relatively narrow age range (20–30 years), which may not fully represent the demographic and physiological diversity of clinical OSA populations. Future work will prioritize validation of the proposed framework using PSG-synchronized clinical recordings across diverse patient populations, with extensions to hypopnea detection and apnea–hypopnea index (AHI) estimation, thereby supporting scalable, non-invasive respiratory health management in real-world settings.

Author Contributions

Conceptualization, Q.W. and Y.W. (Yuping Wang); methodology, Q.W., Y.W. (Yuping Wang), S.L. and S.Z.; software, Q.W. and Y.W. (Yuhan Wu); validation, Q.W., Y.W. (Yuping Wang) and Y.W. (Yuhan Wu); formal analysis, Q.W. and Y.W. (Yuping Wang); investigation, Q.W. and Y.W. (Yuping Wang); data curation, Y.W. (Yuhan Wu); writing—original draft preparation, Q.W.; writing—review and editing, Y.W. (Yuping Wang), S.L., S.Z. and Q.W.; visualization, Q.W.; supervision, Y.W. (Yuping Wang), S.Z. and S.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Due to ethical concerns, the data can only obtained by contacting the corresponding author and signing the relevant documents.

Acknowledgments

Thank you to Texas Instruments and ElasTech for their technical support.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Kapur, V.K.; Auckley, D.H.; Chowdhuri, S.; Kuhlmann, D.C.; Mehra, R.; Ramar, K.; Harrod, C.G. Clinical practice guideline for diagnostic testing for adult obstructive sleep apnea: An American Academy of Sleep Medicine clinical practice guideline. J. Clin. Sleep Med. 2017, 13, 479–504. [Google Scholar] [CrossRef] [PubMed]
Benjafield, A.V.; Ayas, N.T.; Eastwood, P.R.; Heinzer, R.; Ip, M.S.; Morrell, M.J.; Nunez, C.M.; Patel, S.R.; Penzel, T.; Pépin, J.L.; et al. Estimation of the global prevalence and burden of obstructive sleep apnoea: A literature-based analysis. Lancet Respir. Med. 2019, 7, 687–698. [Google Scholar] [CrossRef] [PubMed]
Yeghiazarians, Y.; Jneid, H.; Tietjens, J.R.; Redline, S.; Brown, D.L.; El-Sherif, N.; Mehra, R.; Bozkurt, B.; Ndumele, C.E.; Somers, V.K. Obstructive Sleep Apnea and Cardiovascular Disease: A Scientific Statement From the American Heart Association. Circulation 2021, 144, e56–e67. [Google Scholar] [CrossRef] [PubMed]
Zou, X.; Zhou, X.; Yi, S. Obstructive sleep apnea and the risk of sudden cardiac death: A systematic review and meta-analysis. BMC Cardiovasc. Disord. 2025, 25, 751. [Google Scholar] [CrossRef] [PubMed]
Platon, A.L.; Stelea, C.G.; Boişteanu, O.; Patrascanu, E.; Zetu, I.N.; Rosu, S.N.; Trifan, V.; Palade, D.O. An Update on Obstructive Sleep Apnea Syndrome—A Literature Review. Medicina 2023, 59, 1459. [Google Scholar] [CrossRef] [PubMed]
Zappalà, P.; Lentini, M.; Ronsivalle, S.; Lavalle, S.; La Via, L.; Maniaci, A. The Global Socioeconomic Burden of Obstructive Sleep Apnea: A Comprehensive Review. Healthcare 2025, 13, 2115. [Google Scholar] [CrossRef] [PubMed]
Jacob, D.; Kokil, P.; Suriyan, S.; Jayanthi, T. Wearable Sensor Design for Real-Time Obstructive Sleep Apnea Monitoring. IEEE Sens. J. 2025, 25, 20584–20593. [Google Scholar] [CrossRef]
Jaiswal, S.J.; Owens, R.L.; Pawelek, J.B.; Quer, G.; Trieu, M.; Pandit, J.A. Using New Technologies and Wearables for Characterizing Sleep in Population-based Studies. Curr. Sleep Med. Rep. 2024, 10, 82–92. [Google Scholar] [CrossRef]
Singh, A.; Rehman, S.U.; Yongchareon, S.; Chong, P.H.J. Multi-Resident Non-Contact Vital Sign Monitoring Using Radar: A Review. IEEE Sens. J. 2021, 21, 4061–4084. [Google Scholar] [CrossRef]
Gu, C. Short-range noncontact sensors for healthcare and other emerging applications: A review. IEEE Sens. J. 2016, 16, 6609–6623. [Google Scholar] [CrossRef] [PubMed]
Santra, A.; Uysal, F. Radar for Indoor Monitoring: Detection, Classification, and Assessment; CRC Press: Boca Raton, FL, USA, 2021. [Google Scholar]
Jung, C.-W.; Yoo, Y.-K.; Shin, H.-C. Detecting Sleep-Related Breathing Disorders Using FMCW Radar. J. Electromagn. Eng. Sci. 2023, 23, 482–491. [Google Scholar] [CrossRef]
Choi, J.W.; Koo, D.L.; Kim, D.H.; Nam, H.; Lee, J.H.; Hong, S.-N.; Kim, B. A novel deep learning model for obstructive sleep apnea diagnosis: Hybrid CNN-Transformer approach for radar-based detection of apnea-hypopnea events. Sleep 2024, 47, zsae184. [Google Scholar] [CrossRef] [PubMed]
Mirhosseini, S.F.; Alaee-Kerahroodi, M.; Beltrao, G.; Schroeder, U.; Bhavani Shankar, M.R. Phase Unwrapping for Heart and Breathing Rate Estimation Using mmWave FMCW Radar. In Proceedings of the 2025 33rd European Signal Processing Conference (EUSIPCO), Palermo, Italy, 1–5 September 2025; pp. 2567–2571. [Google Scholar]
Fox, E.B.; Sudderth, E.B.; Jordan, M.I.; Willsky, A.S. A Sticky HDP-HMM with Application to Speaker Diarization. Ann. Appl. Stat. 2011, 5, 1020–1056. [Google Scholar] [CrossRef]
Texas Instruments. IWR6843ISK Single-Chip 60-GHz to 64-GHz mmWave Sensor Evaluation Module. 2022. Available online: https://www.ti.com/tool/IWR6843ISK (accessed on 7 January 2026).
Texas Instruments. DCA1000EVM. 2019. Available online: https://www.ti.com/tool/DCA1000EVM (accessed on 7 January 2026).
ElasTech. Respiratory belt-ElasTech. 2023. Available online: https://www.elas-tech.com/products/20539.html (accessed on 7 January 2026).
Kagawa, M.; Ueki, K.; Tojima, H.; Matsui, T. Noncontact Screening System with Two Microwave Radars for the Diagnosis of Sleep Apnea-Hypopnea Syndrome. In Proceedings of the 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBS), Osaka, Japan, 3–7 July 2013; pp. 2052–2055. [Google Scholar]
Xu, H.; Ebrahim, M.P.; Hasan, K.; Heydari, F.; Howley, P.; Yuce, M.R. Accurate Heart Rate and Respiration Rate Detection Based on a Higher-Order Harmonics Peak Selection Method Using Radar Non-Contact Sensors. Sensors 2022, 22, 83. [Google Scholar] [CrossRef] [PubMed]
Rabiner, L.R. A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition. Proc. IEEE 1989, 77, 257–286. [Google Scholar] [CrossRef]
Nakajima, R.; Taki, K.; Wang, H.; Ma, J. Real-time Respiratory Apnea Detection Using mmWave Radar. In Proceedings of the 2024 IEEE Smart World Congress (SWC), Denarau Island, Fiji, 2–5 December 2024; pp. 1–6. [Google Scholar]

Figure 1. The overall signal processing pipeline for apnea detection using the Adaptive Sticky-HMM. Stage 1: Signal preprocessing; Stage 2: multi-dimensional feature fusion and normalization; Stage 3: Adaptive Sticky-HMM-based state decoding.

Figure 2. Experimental hardware setup and software setup.

Figure 3. Comparison of apnea event detection across different methods for a representative case.

Figure 4. Comparative boxplots of absolute duration error across various methods at different distances.

Figure 5. Agreement analysis between Adaptive Sticky-HMM and ground truth via Bland–Altman plots at varying distances.

Table 1. MmWave chirp and frame configuration.

Parameter	Value
Start Frequency	60.25 GHz
Frequency Slope	40.845 MHz/µs
ADC Samples	256
Sample Rate	3000 ksps
Ramp End Time	91.72 µs
Number of Chirp Loops	2
Frame Periodicity	50 ms

Table 2. Comparison of MAE across different methods at various distances.

Method	1 m (s)	1.5 m (s)	2 m (s)	Overall Average (s)
HMM-based	1.29	1.62	2.11	1.67
Threshold-based	1.53	2.24	2.65	2.14
Peak-based	1.80	2.53	3.38	2.57
Adaptive Sticky-HMM	0.77	1.03	1.18	0.99

Table 3. Intraclass Correlation Coefficient for absolute duration error across all method–distance combinations.

Method	1 m	1.5 m	2 m	Mean ICC
HMM-based	0.398	0.005	0.012	0.138
Threshold-based	0.244	0.285	0.039	0.189
Peak-based	0.089	0.318	0.321	0.243
Adaptive Sticky-HMM	0.027	0.027	0.062	0.039

Table 4. Friedman test results across four methods at each sensing distance (

n = 30

events per group).

Table 4. Friedman test results across four methods at each sensing distance (

n = 30

events per group).

Distance	$χ^{2}$	p-Value
1 m	46.65	$4.12 \times 10^{- 10}$
1.5 m	52.66	$2.17 \times 10^{- 11}$
2 m	68.90	$7.34 \times 10^{- 15}$

Table 5. Pairwise Wilcoxon signed-rank test results (Adaptive Sticky-HMM vs. each baseline method) at each sensing distance.

Comparison	Distance	p-Value
Proposed vs. Threshold-Based	1 m	$4.48 \times 10^{- 6}$
Proposed vs. Peak-Based	1 m	$3.13 \times 10^{- 6}$
Proposed vs. HMM-Based	1 m	$2.06 \times 10^{- 5}$
Proposed vs. Threshold-Based	1.5 m	$1.72 \times 10^{- 6}$
Proposed vs. Peak-Based	1.5 m	$2.84 \times 10^{- 6}$
Proposed vs. HMM-Based	1.5 m	$9.48 \times 10^{- 5}$
Proposed vs. Threshold-Based	2 m	$1.72 \times 10^{- 6}$
Proposed vs. Peak-Based	2 m	$1.73 \times 10^{- 6}$
Proposed vs. HMM-Based	2 m	$8.04 \times 10^{- 6}$

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Wang, Q.; Wang, Y.; Zhao, S.; Wu, Y.; Li, S. An Adaptive Sticky Hidden Markov Model for Robust State Inference in Non-Stationary Physiological Time Series. Mathematics 2026, 14, 1107. https://doi.org/10.3390/math14071107

AMA Style

Wang Q, Wang Y, Zhao S, Wu Y, Li S. An Adaptive Sticky Hidden Markov Model for Robust State Inference in Non-Stationary Physiological Time Series. Mathematics. 2026; 14(7):1107. https://doi.org/10.3390/math14071107

Chicago/Turabian Style

Wang, Qizheng, Yuping Wang, Shuai Zhao, Yuhan Wu, and Shengjie Li. 2026. "An Adaptive Sticky Hidden Markov Model for Robust State Inference in Non-Stationary Physiological Time Series" Mathematics 14, no. 7: 1107. https://doi.org/10.3390/math14071107

APA Style

Wang, Q., Wang, Y., Zhao, S., Wu, Y., & Li, S. (2026). An Adaptive Sticky Hidden Markov Model for Robust State Inference in Non-Stationary Physiological Time Series. Mathematics, 14(7), 1107. https://doi.org/10.3390/math14071107

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

An Adaptive Sticky Hidden Markov Model for Robust State Inference in Non-Stationary Physiological Time Series

Abstract

1. Introduction

2. Methods

2.1. Signal Preprocessing

2.2. Multi-Dimensional Feature Fusion and Normalization

2.3. Adaptive Sticky-HMM-Based State Decoding

3. Experiment Settings

3.1. Hardware and Software Configuration

3.2. Participants and Protocol

4. Results

4.1. Apnea Event Detection Performance

4.2. Statistical Analysis of Duration Error

5. Discussion

5.1. Impact of Sensing Distance

5.2. Effectiveness of the Sticky Mechanism in State Estimation

5.3. Statistical Reliability and Clinical Versatility

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI