1. Introduction
The autonomic nervous system (ANS) governs involuntary physiological processes through its sympathetic and parasympathetic branches, which respectively mediate arousal (“fight or flight”) and recovery (“rest and digest”) responses to maintain homeostatic balance [
1]. Increasing parasympathetic autonomic activity is associated with numerous health benefits, including improved cardiovascular health [
2] and anti-inflammatory effects [
3], which are particularly valuable for managing chronic conditions. Chronic stress, cardiovascular risks, hypertension, and other ill health conditions are associated with sympathetic activity dominance or dysregulation [
4,
5]. Among the various yoga styles, Iyengar yoga [
6] is uniquely suited for improving parasympathetic activity of the autonomic nervous system (ANS) due to its focus on precise body alignment while performing the posture (asana), targeted engagement of muscle groups, and sustained postural engagement. In Iyengar yoga, props like blocks, blankets, and straps are used to assist with alignment, deepen poses, allowing participants to spend more time in the asana, which often leads to the yoga only being performed under the guidance of certified instructors. This limits the extensive use of such therapeutic yoga for individuals on their own, which, in turn, decreases the possibility of gathering data in such settings. This study is one such setting where data was collected with the agreement of both the yoga practitioner and participants. The analysis of physiological mechanisms of Iyengar yoga, a relatively data-scarce area, was carried out in this study using robust and interpretable machine learning models. To that end, the study was guided by the following research questions:
Can machine learning models distinguish between different physiological states (relaxed vs. parasympathetic-inducing/sympathetic poses) based on features derived from physiological signals captured by the wearable device?
Can the models differentiate between sympathetic and parasympathetic autonomic responses based on pose-induced physiological changes?
What features contribute strongly to this distinction? Are there any physiological patterns specific to each autonomic response?
2. Related Work
Yoga therapy is known to improve cardiac autonomic tone and vagal modulation [
7], and the practice of yoga therapy and breath work (pranayama) influences autonomic nervous system (ANS) function by enhancing parasympathetic activity and reducing sympathetic activation [
8]. Preliminary studies, such as [
9], have reported improvements in ambulation, mental health, and pain management, while others, such as [
10,
11], have highlighted its potential in addressing clinical depression and stress-related symptoms. Iyengar yoga’s practice of holding poses for extended durations (5–10 min), as recommended by [
12], facilitates the release of the myotatic reflex, increasing muscular range of motion and potentially eliciting distinct physiological responses.
However, these studies are limited to specific populations or conditions, and there is a need for more rigorous longitudinal research to better understand the physiological mechanisms underlying these benefits. To address this gap, a multi-case longitudinal study design is proposed through this study to investigate the physiological effects of Iyengar yoga at the pose level. Building on prior research, such as [
13], which demonstrated improvements in acute physiological measures like arterial blood pressure and heart rate during 90 min yoga sessions, these measures were analyzed in this study at the
pose-level. While the previous studies employed survey-based [
9,
11] and Holter ECG monitoring-based [
7] data collection, EmbracePlus, a wearable smart watch, was employed in this study to record physiological data and ANS activity was measured using skin temperature, electrodermal activity (EDA), heart-rate variability, and other vitals, to assess sympathetic and parasympathetic responses. Prior studies [
7] used statistical analyses and report-based methods, where as, machine learning models were implemented in this study to advance the understanding of differences in physiological responses at pose-level, lasting an average of 5–10 min, as opposed to entire 90 min sessions in previous studies [
13].
4. Results and Discussion
A multivariate analysis of variance (MANOVA) was conducted to evaluate the overall effect of the physiological state on a set of 40 physiological features. The multivariate test revealed a highly significant effect of the state on the combined physiological profile (Wilks’ Lambda = 0.111, F(80, 398) = 9.98,
0.001), with a large effect size (
= 0.889), indicating that approximately 89% of the variance in physiological responses is attributable to the underlying state. To further investigate which specific features contributed to this effect, follow-up univariate analyses were conducted on each feature. Thirty-two features were found to be significantly different across states at a stringent threshold of
0.01 and power > 0.8. For each of these features, the group means and standard deviations for baseline, parasympathetic, and sympathetic states are reported in
Table 5. To identify which physiological states differed significantly on each feature, post hoc comparisons using Tukey’s Honest Significant Difference (HSD) test were performed, which was also leveraged in other studies [
15].
These comparisons revealed consistent patterns of autonomic modulation across sensor modalities. Notably, sympathetic states were associated with significantly elevated skin conductance variability (e.g., EDA std dev, EDA peak to peak), increased cardiovascular reactivity (e.g., BVP std dev, RR interval peak to peak), and greater temperature fluctuations (temperature std dev, peak to peak), relative to both baseline and parasympathetic conditions. In contrast, parasympathetic states exhibited increased mean RR intervals and lower accelerometer-derived motion features, consistent with reduced physiological arousal and physical movement. Baseline states typically showed intermediate values but were statistically distinct from the other two states for several features. These univariate results, supported by robust post hoc evidence, provide strong validation that specific physiological features reliably differentiate between autonomic states. The consistent group-wise differences across modalities affirm the potential of multimodal sensor fusion for state classification and underscore the biological plausibility of the derived features in capturing autonomic state dynamics. Based on this, subsequent analyses focused on feature-level importance using supervised machine learning models, aiming to identify core physiological features that are predictive of each physiological state.
In this study, the feasibility and performance of various machine learning models were explored for distinguishing three physiological states: baseline, parasympathetic-dominant, and sympathetic-dominant poses from wearable sensor data in the context of Iyengar yoga. Given that this is the first known study exploring physiological signal classification within the specific practice of Iyengar yoga using machine learning techniques, we prioritized generalizability, interpretability, and robustness in our model selection process. Simpler and computationally efficient models were deliberately chosen to reduce the risk of overfitting in a low-data regime, a common concern when initiating modeling in novel application domains with no prior machine learning baselines. All models were evaluated using
10-fold stratified cross-validation to ensure performance estimates were robust and class proportions preserved across folds. This approach enhanced the reliability of our findings despite limited data availability.
Table 6 with comparative model performances reveals that ensemble learning and margin-based approaches are more suited to distinguish physiological states than probabilistic or linear models. The Random Forest model achieved the highest performance across all evaluation metrics, with an accuracy, precision, recall, and F1-score of 0.94 and an AUC of 0.99, which can be attributed to its ability to capture complex, nonlinear relationships and reduce variance through ensembling. In physiological signals, such nonlinearities are common due to individual variability, sensor noise, and the interaction of autonomic responses (e.g., heart rate, respiration, and electrodermal activity often covary under stress).
The SVM model also performed well, with an F1-score of 0.86 and AUC of 0.95, demonstrating strong performance particularly with the use of a linear kernel indicating a strong discrimination capability and balanced precision–recall tradeoff. The Decision Tree, outperforming logistic regression (F1-score = 0.70) and Naive Bayes (F1-score = 0.65), provided decent results (F1-score = 0.80) but was still susceptible to overfitting. These differences highlight the varying capacity of models to handle nonlinear patterns and interdependent physiological features. In contrast, logistic regression’s performance indicated that linear models lack the flexibility to capture interdependencies in physiological signals, despite L1 regularization aiding in sparsity and interpretability. Naive Bayes exhibited the weakest performance, reflecting the impracticality of its core assumption of feature independence. Given the biological correlations among physiological channels (e.g., respiration affecting both BVP and temperature), this assumption is violated, leading to degraded model fidelity. Despite their reduced flexibility, both models showed reasonable performance. Failing to detect a sympathetic state (false negative) may result in missed intervention opportunities, whereas false positives can lead to unnecessary feedback or false alarms. Hence, maximizing both recall and precision is essential, making F1-score a central metric for model selection. These findings suggest that, for real-world deployment in biofeedback systems, models like Random Forest and SVM offer a favorable balance of accuracy, reliability, and interpretability.
4.1. Leave-One-Group-Out (LOGO) Cross-Validation
Leave-One-Subject-Out Cross-Validation (LOGO-CV) revealed substantial heterogeneity in classification performance across participants and models as seen in
Table 7. On average, SVM achieved the highest discriminative power with an AUC of 0.89 ± 0.13, closely followed by logistic regression (LR, 0.88 ± 0.15) and Random Forest (RF, 0.82 ± 0.19), while Decision Tree (DT)’s performance was markedly less stable (0.66 ± 0.24). In terms of F1-scores, which better reflect balanced precision and recall, LR (0.62 ± 0.38) and RF (0.60 ± 0.37) provided the strongest results, with SVM (0.57 ± 0.36), DT (0.51 ± 0.37), and Naive Bayes (NB, 0.50 ± 0.36) performing less consistently. Importantly, the large standard deviations across all models highlight pronounced subject-level variability: while some participants (e.g., Subject 4, Subject 6, Subject 9) as in
Figure 4, were classified with near-perfect accuracy across models (AUC and F1 ≈ 1.0), others (e.g., Subject 1, Subject 13, Subject 15) consistently approached chance-level outcomes (AUC ≈ 0.5, F1 ≤ 0.1). In several cases, models achieved high AUC but low F1 (e.g., Subject 3, Subject 8, Subject 14), suggesting that class probabilities were ranked correctly but hard decision thresholds failed, likely due to class imbalance or weak physiological separability. These patterns reinforce that model success is not uniform—for some individuals, physiological signals aligned well with the intended sympathetic/parasympathetic labels, while for others, either signal noise, atypical autonomic responses, or limited generalizability. Overall, while population-level metrics suggested strong average discriminative ability, the LOGO-CV analysis underscores the importance of inter-individual variability and points toward the need for personalized calibration or subgroup-specific modeling approaches in future work.
4.2. Building Sparse Models
Given the variability in model performance and inherent differences in learning paradigms, model-specific feature selection and interpretation were performed. Importantly, for building sparse models, we analyzed the feature importances, coefficients, and probabilities from the 10-fold cross-validation metrics and chose top-k, where
k was the minimal number of features per model, as shown in
Figure 5; thus, when used on a hold-out test set, it gave the best performance when compared with the base models using full feature set on a single train-test split setting, shown in
Table 8. This individual top-k features per model approach avoids the risk of overinterpreting features from underperforming models and provides a more grounded understanding of physiological drivers for each classifier. Furthermore, any convergent features across models were noted as candidates for robust domain-level significance, for future investigation.
Several features appeared repeatedly in the top-k sets across models, indicating their robust relevance across different models and that they likely encode the most physiologically salient information for differentiating between relaxed, sympathetic-inducing, and parasympathetic-inducing physiological states. For instance, accelerometer-related features may reflect postural transitions, while BVP-related features relate to cardiovascular activity, which can vary under sympathetic arousal. EDA-related features capture phasic electrodermal responses and temperature-related features may reflect thermoregulatory shifts associated with relaxation or physical effort.
Random Forest emphasized both EDA dynamics (peak_rms, mean_derivative) and cardiorespiratory indicators like rr_std_dev, with its top features balancing well across modalities: EDA, BVP, temperature, RR interval, and accelerometry, indicating an ensemble’s ability to capture inter-modality patterns. Logistic regression’s top weights were assigned to features like eda_mean, peak_rms, and bvp_mean, aligning with known markers of autonomic state. Its reliance on a smaller but informative set of linearly separable features helped construct a compact and effective model. Naive Bayes highlighted more basic statistical descriptors (means, std devs), avoiding derivatives or entropy measures, making sense given NB’s independence assumptions, and confirming it can serve as a lightweight, reasonably performant baseline.
Consistent top features across the five models include acc_std_dev: a strong proxy for physical stillness vs. movement; bvp_mean: reflecting average blood flow changes tied to autonomic tone; eda_peak_rms: capturing phasic EDA response directly correlating sympathetic arousal; temp_std_dev: fluctuating skin temperature demonstrating transient autonomic reactions across conditions; and rr_std_dev: indicating variability in heart rate (HRV), a widely used marker of autonomic balance. Movement (accelerometer) and Cardiac Signals (RR interval, BVP) were consistent top contributors, reflecting the interplay between posture, movement, and cardiac autonomic regulation during Iyengar yoga. Electrodermal and Temperature Signals underline the importance of arousal and peripheral circulation as non-invasive autonomic markers. Derivative and standard deviation signals were frequently selected over simple means, reinforcing the strength of dynamic features over static values in distinguishing subtle physiological states. A small number of features (especially those related to EDA, BVP, and HRV) consistently drove the classifiers’ performance. These overlaps point to robust candidate features for autonomic state differentiation, offering stable physiological markers across multiple algorithmic viewpoints, despite their inherent modeling differences.
To assess the potential confounding impact of movement-related (accelerometer) features, we conducted an ablation experiment excluding all accelerometer-derived features and re-trained each classifier. As summarized in
Table 9, the stratified cross-validation performance remained high across models (e.g., Random Forest: accuracy 0.94, F1 0.94, AUC 0.99 without the accelerometer), closely matching the results obtained with the full feature set that included movement metrics. These observations confirm that the models are not exclusively leveraging pose-related movement differences, but also capture meaningful autonomic signal modulation reflected in the heart rate variability, EDA, temperature, and BVP.
4.3. Feature Explanation
There is notable convergence between the insights gained from the SHapely Additive exPlanations (SHAP) analysis [
17] of the top-performing Random Forest model and feature weights from classical models (logistic regression, SVM, Naive Bayes). This intersection highlights a set of physiological features that robustly predict autonomic states—sympathetic, parasympathetic, and baseline (relaxed)—within the Iyengar yoga dataset. These features are not model-specific artifacts; they encode stable, physiologically meaningful information evidenced across fundamentally different analytical approaches. From the best performing Random Forest model, the highly discriminative features were
rr_std_dev (0.098),
acc_std_dev (0.097),
bvp_mean (0.094),
eda_peak_rms (0.074), and
temp_mean (0.066). SHAP analysis on this best performing RF model formed the basis for further analysis of how each of these features contributed to each physiological states. Feature importances from other classical models such as LR, GNB and SVM were used in complement to support the physiological robustness.
4.3.1. Heart Rate Variability (RR Interval Mean and Standard Deviation)
RR Interval Mean and standard deviation are markers of heart rate variability and cardiac autonomic balance. Long RR intervals (i.e., low heart rate) and high variability (std_dev) consistently increased parasympathetic predictions. The SHAP values in
Figure 6 show strong positive effects; LR (+1.11; OR: 3.04) and SVM (+0.49; OR: 1.64) agree. This aligns well with the established physiology where parasympathetic dominance increases HRV and is strongly indicative of vagal dominance. Shorter RR intervals (lower HRV) drove sympathetic predictions in SHAP
Figure 6 and LR (−0.88; OR: 0.42), matching known associations between sympathetic arousal and cardiac acceleration. RR interval values near the mean were associated with relaxed state in SHAP
Figure 6, suggesting moderate HRV distinct from the extremes defining parasympathetic or sympathetic states. This reflects the relaxed state’s position as a physiological middle ground. Parasympathetic activity promotes HRV and slower HR; sympathetic arousal suppresses HRV and raises heart rate.
4.3.2. Movement Variability (Accelerometer Standard Deviation)
Accelerometer standard deviation reflects movement variability/postural change. Unsurprisingly, low accelerometer standard deviation was strongly predictive of parasympathetic state, reflecting restorative poses. The SHAP values in
Figure 6 were sharply negative for higher values, pushing the model away from parasympathetic classification. This was corroborated by LR (−1.99; OR: 0.14) and SVM (−1.27; OR: 0.28). Stillness aligns with the restorative, restful parasympathetic state. While high accelerometer variability increased sympathetic prediction in SHAP values in
Figure 6 and SVM (coef: +1.00; OR: 2.72), suggesting movement during challenging or arousing postures. Interestingly, high accelerometer variability also contributed to relaxed class predictions in the SHAP values in
Figure 6 and LR (coef: +2.06; OR: 7.82). This may reflect postural transitions or free movement within a calm, baseline context. Parasympathetic activation is associated with stillness, while sympathetic tone increases during exertion or challenging postures manifesting as more movement.
4.3.3. Blood Volume Pulse Amplitude (BVP Mean)
Blood volume pulse, a proxy for peripheral circulation, is a measure of cardiac and vascular activity often modulated by sympathetic tone. Low BVP amplitude was associated with parasympathetic predictions in SHAP
Figure 6, suggesting reduced cardiac output and vasoconstriction typical of a resting state. High BVP amplitude pushed predictions toward a sympathetic state. This aligns with elevated cardiac activity and pulse wave amplitude under stress or arousal. Intermediate BVP levels were associated with relaxed predictions in SHAP
Figure 6, again emphasizing the mid-range nature of this class.
4.3.4. Phasic Electrodermal Activity (EDA Peak to RMS)
EDA Peak to RMS represents the phasic, rapid component of skin conductance changes. Low peak-to-RMS values strongly promoted parasympathetic predictions in the SHAP values in
Figure 6 and LR (−2.82; OR: 0.06), reflecting the dampening of sympathetic arousal typical of restful states. High values robustly indicated sympathetic activation, the SHAP values in
Figure 6 showed a large positive push; LR (+0.77; OR: 2.17) and SVM (+0.14; OR: 1.15) supported this, reflecting a gold-standard biomarker for SNS activity. Mid-range values were weakly associated with relaxed state predictions, with the SHAP values contributions in
Figure 6 being notably smaller, suggesting that neither extreme phasic arousal nor complete flatness defines this class. Electrodermal activity is governed by the sympathetic branch; phasic EDA increases during stress or effort, not during rest-and-digest periods.
4.3.5. Skin Temperature (Temperature Mean)
The temperature mean reflects the peripheral skin temperature responding to vascular changes and showed class-distinct but partially inconsistent behavior across models. The SHAP values in
Figure 6 indicated slightly positive association, with higher skin temperature increasing parasympathetic prediction. However, LR (−0.67; OR: 0.51) and SVM (−0.18; OR: 0.83) suggested the opposite. While parasympathetic activity is classically linked to increased peripheral temperature due to vasodilation, because the parasympathetic data here is labeled in accordance with passive poses (e.g., lying down, resting), the body core temperature might drop due to less muscular activity. The data were collected through the wrist, as this region is very sensitive to peripheral vasoconstriction/vasodilation; thus, there may have been delayed or inverse changes compared to the core parasympathetic activity. The SHAP values in
Figure 6 showed a positive influence, higher temperatures pushed predictions toward the sympathetic class, and LR (+0.68; OR: 1.97) agreed. However, SVM showed the reverse (−1.12; OR: 0.33). These mixed results suggest skin temperature may reflect non-autonomic processes. The SHAP values in
Figure 6 associated mid-to-high temperatures with a relaxed state prediction. This aligns with a
comfort zone interpretation of neither cold-stressed nor vasodilated extremes.
Among these best predictors, features such as temp_mean, eda_mean, eda_std_dev, eda_peak_rms, eda_mean_deriv, rr_mean, rr_std_dev, acc_mean, acc_std_dev, and bvp_mean appear to be the key discriminators for parasympathetic and sympathetic autonomic response-eliciting poses. This feature-centric analysis reveals that EDA peak to RMS, HRV, movement variability, and, to a lesser extent, skin temperature and BVP amplitude are consistent and interpretable predictors of autonomic state classification. Parasympathetic states are defined by stillness, high HRV, and low phasic arousal. Sympathetic states arise from dynamic movement, short RR intervals, and phasic EDA surges. Relaxed states are distinguished by moderate movement and balanced physiological values across modalities.
For instance, an increase in the magnitude of the RR interval from baseline to the postural phase is indicative of parasympathetic activity, aligning with previous findings that associate elevated RR intervals with higher vagal tone [
7], can be observed in
Figure 7. Skin temperature tends to rise under parasympathetic dominance due to a reduction in sympathetic vasoconstriction, reflecting increased peripheral blood flow. In contrast, EDA typically decreases as parasympathetic activity prevails. However, it is important to note that during the initial phase of posture execution, a brief surge in EDA may occur triggered by transient sympathetic arousal in response to physical effort before it subsides as the parasympathetic state establishes itself. BVP, reflecting peripheral vascular tone, also increases under parasympathetic influence, signifying vasodilation.
The parasympathetic response differs markedly from the sympathetic response across these modalities. While parasympathetic activation is associated with relaxation and recovery manifesting as increased RR interval (slower heart rate), elevated skin temperature (due to vasodilation), decreased EDA, and increased BVP absorption, the sympathetic response produces the opposite pattern. During sympathetic activation, such as in stress or challenge, the RR interval decreases (faster heart rate), skin temperature typically drops (stemming from vasoconstriction), EDA rises sharply (from sweat gland activation), and BVP absorption decreases (indicating vasoconstriction). These physiology-driven contrasts underscore the fundamental opposition between calming (parasympathetic) and arousing (sympathetic) branches of the ANS, and highlight the nuanced, sometimes overlapping, shifts detected by multimodal physiological monitoring.
6. Conclusions
This study establishes a robust methodological framework using wearable sensor data and machine learning models to discriminate different autonomic physiological states (baseline/relaxed, parasympathetic/restorative poses, and sympathetic/stimulating poses) during Iyengar yoga practice. Differentiating between phases (baseline and postural) and ANS response (parasympathetic and sympathetic) helps in providing the user with feedback for their engagement which is an essential step towards automated and personalized practice of such therapeutic yoga, which is currently limited to being performed under the guidance of certified instructors only. While the feedback about phases (relaxed or in-pose) provides information about getting into the pose, feedback about the type of autonomic response provides information on whether the required physiological response is elicited or not after holding the poses for sometime. As poses elicit a physiological response, this provides a chance to course-correct along the way, which makes it more feasible, reliable, and personalized. This work emphasizes that not only intense bodily movements but even slow, static yoga postures can induce rapid, multidimensional changes detectable via wearable sensors. While many studies documented ANS modulation using biomarkers such as HRV, EDA, or respiratory markers, the mechanisms are often attributed to breathing, meditation, or relaxation aspects rather than the pose mechanics themselves, which is the key aspect of this study, especially in Iyengar yoga. By mitigating overfitting and improving generalization, the developed sparse models create a strong foundation for future wearable-based health monitoring systems. Interpretable feature selection and model interpretation consistently highlighted acceleration and cardiac variability features, phasic EDA measures, and vascular/thermal dynamics as core predictors of the autonomic state. These results elucidate the complex interplay among movement, cardiovascular activity, electrodermal fluctuations, and thermoregulation during yoga, providing a physiologically sound and computationally validated pathway for future real-time, adaptive biofeedback and posture-aware health interventions. The findings from this study herein offer a scalable solution for objective autonomic assessment, enabling precision yoga therapy, self-monitoring, and broader applications in digital health and behavior medicine.