1. Introduction
Heart rate variability (HRV) refers to the fluctuation in time intervals between consecutive heartbeats, measured as R–R intervals on an electrocardiogram or similar device [
1]. It serves as a non-invasive marker of autonomic nervous system function, particularly the dynamic interplay between sympathetic and parasympathetic activity [
2]. Higher levels of HRV, especially in vagally mediated metrics like the root mean square of successive differences (RMSSD), generally reflect a more adaptable, resilient autonomic state [
2,
3,
4]. Lower HRV, by contrast, is associated with stress, fatigue, and diminished physiological flexibility [
3,
4].
Accurate HRV measurement relies on consistent protocols, valid hardware, and robust signal processing—particularly for photoplethysmography (PPG)-based devices, where algorithmic filtering and artefact correction are essential [
5]. HRV is also highly sensitive to contextual factors such as posture, time of day, and psychological or physiological state, making standardised measurement conditions strongly advisable [
4,
6]. Best practice involves taking readings either during sleep or immediately upon waking, under relaxed and consistent conditions [
4,
7]. Short-term (e.g., 60 s to 5 min) recordings in a seated or supine position are commonly used for longitudinal monitoring and can yield reliable estimates of vagally mediated HRV [
8,
9]. Chest-worn heart rate monitors such as the Polar H10 have shown excellent agreement with gold-standard ECG under these conditions [
10,
11]. In contrast, wrist-worn or camera-based sensors are more prone to noise, and their HRV estimates vary considerably depending on sampling rate and movement artefacts [
8]. Within-subject variability is another important consideration. Even under controlled conditions, day-to-day fluctuations in HRV are common and may reflect changes in sleep, hydration, or behavioural and environmental stressors [
5,
12,
13]. For this reason, researchers recommend tracking HRV trends over time—often using rolling averages or coefficients of variation—to distinguish meaningful changes from random noise [
12,
14].
Numerous studies have explored how HRV corresponds with subjective experiences of stress, fatigue, recovery, and sleep quality [
12,
15,
16,
17,
18,
19]. In general, lower HRV is associated with higher perceived stress, greater fatigue, and poorer recovery [
15,
18,
19]. Among athletes, HRV tends to decrease during periods of heavy training and improve during recovery, often tracking with subjective reports of wellness [
17,
18,
19,
20]. However, findings in real-world, non-athletic populations are less consistent. For example, Martinez et al. found that HRV explained only a small fraction of the variance in daily stress ratings among information workers, despite statistically significant associations in the expected direction [
16]. Similarly, de Vries et al. reported weak and inconsistent within-person relationships between HRV and self-reported stress or fatigue in a longitudinal study of police officers [
15]. These findings suggest that while HRV is sensitive to changes in physiological state, its relationship with perceived wellness is subtle, highly individualised, and influenced by contextual factors.
In summary, while HRV is increasingly used to monitor stress, recovery, and overall wellness, its day-to-day relationship with self-reported experience remains inconsistent—especially outside athletic populations. This limitation stems from methodological variability, individual differences in physiological responses, and the limited validation of proprietary readiness scores commonly featured in consumer wearables [
21]. While studies have shown general associations between HRV and subjective wellness in patients [
22,
23] and trained athletes [
24,
25], evidence from real-world, non-clinical and/or non-athletic settings is mixed, with weak and context-dependent within-person correlations. Given the widespread adoption of wearable devices—hundreds of millions globally [
26], many of which provide HRV-based metrics—there is a pressing need for rigorous, ecologically valid studies that evaluate how well these measures reflect or predict users’ self-reported stress, fatigue, recovery, and sleep quality in everyday life.
Therefore, the aim of this study was to evaluate the relationship between short-term, morning HRV measurements and self-reported indicators of health and wellness—including perceived fatigue, stress, physical recovery, and sleep quality—across a 14-day observational period.
2. Materials and Methods
2.1. Study Design
This was a 14-day prospective cohort study designed to evaluate associations between short-term HRV and self-reported wellness in a real-world, non-clinical setting. Participants performed daily HRV recordings each morning using a chest-worn Polar H10 heart rate monitor (Polar Electro Oy, Kempele, Finland) connected to the Kubios HRV mobile application (version 1.6.6; Kubios Oy, Kuopio, Finland). HRV data were automatically uploaded to the Kubios Team Readiness platform, which facilitated centralised data management, team coordination, and real-time monitoring of data quality and adherence. The platform computed validated time-domain HRV metrics—such as RMSSD and standard deviation of normal-to-normal intervals (SDNN).
Ethical approval for the study was granted by the University College Dublin Human Research Ethics Committee (LS-23-55) in November 2023.
2.2. Study Population
Participants were recruited via convenience sampling through digital advertisements, university posters, and community outreach. Eligible participants were adults aged 18 years or older, in good general health, and without a history of cardiovascular, respiratory, or major psychiatric conditions. Individuals were excluded if they reported use of medications known to affect autonomic function.
Interested individuals received a detailed study information leaflet and completed a digital consent form prior to enrolment. Upon consent, participants completed a baseline questionnaire assessing demographics, sleep quality (Pittsburgh Sleep Quality Index), health-related quality of life (WHOQOL-BREF), and physical activity (modified International Physical Activity Questionnaire). The Pittsburgh Sleep Quality Index and WHOQOL-BREF are validated instruments with well-established internal consistency and construct validity across general populations [
27,
28].
2.3. Measurement Protocol
Participants were trained on proper use of the Polar H10 sensor and Kubios app during an onboarding session, which included installation, device pairing, and a supervised test measurement. Each morning, participants were instructed to remain supine and still while recording a five-minute HRV measurement following a one-minute relaxation period. Measurements were performed immediately after waking, under consistent conditions, and before any caffeine, physical activity, or digital device use.
Physiological data were collected via the Kubios mobile app and transferred securely to the Kubios Team Readiness platform. RMSSD and SDNN were calculated automatically using Kubios’ artefact-corrected beat-to-beat interval processing. Respiratory rate and heart rate were also derived from each session.
After the HRV measurement, participants completed a brief self-report questionnaire via SurveyMonkey (SurveyMonkey Europe UC). This included daily ratings of perceived fatigue, stress, sleep quality, and physical recovery (muscle soreness), each on a five-point Likert-type scale. Participants were also asked whether they had engaged in physical exercise in the past 24 h. If yes, they were prompted to rate their perceived exertion using the Borg Rating of Perceived Exertion (RPE) scale, and their subjective exercise experience, defined as how they felt during the session. This was rated on a five-point scale (Very Weak, Weak, Normal, Strong, Very Strong) based on perceived strength or energy during exercise. Additional contextual items included alcohol consumption (units), recent illness, and sleep quality from the previous night.
To minimise potential bias, participants were instructed to complete the self-report survey immediately after recording their HRV and before viewing any feedback or readiness scores in the Kubios app. To support adherence, participants received automated daily reminders and had access to technical support. Daily data completeness was monitored through the Kubios platform, and participants were contacted if entries were missing or incomplete.
2.4. Outcome Measures
The four primary outcomes were daily self-reported ratings of fatigue, stress, sleep quality, and body soreness (physical recovery). These were assessed using single-item, five-point Likert-type questions adapted from the Hooper Index, a tool commonly used in athlete monitoring [
29]. Each item followed a consistent scale where 1 indicated poorer states (greater impairment) and 5 indicated more favourable states (lower impairment). For example, a score of 1 on fatigue corresponded to “Extremely Tired”, and 5 to “Very Fresh”; a 1 on stress indicated “Extremely Stressed”, and 5 indicated “Very Relaxed”. Sleep quality ranged from 1 = “Very Restless” to 5 = “Very Restful”, and body soreness ranged from 1 = “Very Sore/Heavy” to 5 = “No Soreness”. While single-item measures limit the assessment of psychometric reliability, the Hooper index has demonstrated utility in repeated-measures designs where participant burden, adherence and monitoring feasibility are important [
30,
31].
The primary physiological predictor was RMSSD, chosen for its robust validation as a marker of parasympathetic activity. RMSSD values were log-transformed to improve distributional properties and facilitate model convergence. SDNN was also collected and analysed descriptively, though not retained in final predictive models.
Secondary physiological predictors included resting heart rate and respiratory rate, which were automatically derived from the Kubios platform—and Kubios’ own proprietary ‘readiness’ score. Contextual covariates were drawn from self-reported questionnaire responses and included (1) alcohol consumption, recorded in units; (2) recent illness (binary); (3) exercise-related variables, including RPE for those who exercised, subjective exercise experience, coded as an ordinal categorical variable (Very Weak to Very Strong).
2.5. Statistical Analysis
For each outcome, we fit Bayesian cumulative ordinal mixed-effects models using the Bambi package in Python (version 0.15.0) [
32]. This approach accounted for the ordinal nature of the response variables and the repeated-measures design. Each model included a participant-level random intercept to account for individual variability.
Ordinal responses were modelled as thresholds on a latent continuous variable, with a cumulative logit link relating the latent variable to the predictors. The core fixed effect was log-transformed RMSSD. Other covariates (heart rate, respiratory rate, alcohol, illness, and exercise) were included based on model fit and posterior certainty.
All fixed effects were assigned weakly informative normal priors, scaled to the standard deviation of each predictor. Random intercept variances were modelled hierarchically using half-normal priors, and threshold parameters were constrained to preserve monotonicity. Models were estimated via Markov chain Monte Carlo (MCMC) [
33] using the No-U-Turn Sampler (NUTS) [
34], with four chains of 2000 draws each (1000 discarded as burn-in), yielding 4000 posterior samples per model.
Model refinement followed an iterative process guided by expected log pointwise predictive density (ELPD), estimated via leave-one-out cross-validation (LOO) with Pareto-smoothed importance sampling (PSIS) [
35]. Competing models were compared based on ELPD, model parsimony, and parameter uncertainty. Covariates were retained if their removal led to poorer predictive performance or if their 95% highest density interval (HDI) excluded zero.
Posterior summaries reported mean effects, standard deviations, and 95% HDIs. A variable was considered to have a credible association if its HDI excluded zero. Model calibration was assessed through posterior predictive checks and confusion matrices, with classification accuracy evaluated across ordinal response categories.
All code used for model fitting and diagnostics is available upon request.
4. Discussion
This study investigated whether short-term HRV, measured each morning under standardised conditions, was associated with self-reported wellness indicators in a non-clinical adult population. Across 424 daily observations, higher RMSSD values were credibly linked to more favourable self-reports of fatigue (β = 0.281, 95% HDI: 0.020 to 0.562), stress (β = 0.353, 95% HDI: 0.059 to 0.606), and sleep quality (β = 0.510, 95% HDI: 0.239 to 0.779). These associations remained robust after adjusting for physiological and contextual covariates, including resting heart rate, respiratory rate, alcohol intake, and physical activity. Specifically, participants with higher RMSSD were more likely to report feeling rested, relaxed, and well-slept. By contrast, RMSSD was not credibly associated with body soreness (β = 0.172, 95% HDI: −0.051 to 0.396), suggesting either a weak or context-dependent relationship between autonomic recovery and perceived muscle soreness.
These findings align with a growing body of literature suggesting that HRV—particularly RMSSD—tracks with day-to-day fluctuations in subjective wellness [
24,
36,
37]. Prior repeated-measures studies in athletic and occupational settings have reported similar within-person associations. For example, Martinez et al. [
16] found that lower daytime HRV corresponded with higher perceived stress in a large sample of knowledge workers, albeit with small effect sizes. Similarly, Lundstrom et al. [
20] reported that declines in morning RMSSD were associated with elevated fatigue in collegiate swimmers during intensified training periods. These studies, like ours, highlight RMSSD’s potential to reflect real-time changes in recovery and physiological strain when measured consistently.
Our results also echo insights from N-of-1 designs in high-demand sport [
38] and professional [
15] settings. In a longitudinal study of police officers, daily HRV derived from the Oura Ring showed weak but stable associations with self-reported stress and fatigue [
15]. Notably, these effects were more apparent in subjective domains like mood and sleep than in physical recovery measures such as soreness. This pattern is consistent with the physiological basis of RMSSD: as a marker of parasympathetic tone, RMSSD may more directly reflect psychological strain and autonomic recovery than biomechanical or inflammatory processes involved in muscle soreness [
4,
24,
37]. This pattern also aligns with our own findings. While some studies have shown that HRV decreases in response to muscle damage or overtraining [
36,
39,
40], others suggest that soreness arises from peripheral mechanisms not readily detected through autonomic markers—particularly when assessed using brief Likert-type items [
41,
42].
By capturing these associations in a healthy, mixed-activity adult sample using a validated chest-worn sensor (Polar H10), this study extends HRV research beyond elite or clinical populations. It also reinforces the role of resting heart rate and respiratory rate as co-regulators of subjective wellness [
43]. Consistent with prior work, lower resting heart rate was associated with better self-reported sleep and fatigue, supporting its utility as a general marker of recovery status [
44,
45].
Overall, our results support the interpretation of RMSSD as a meaningful, though not exclusive, indicator of short-term recovery and stress in daily life. While effect sizes were modest, the associations with sleep, fatigue, and stress were directionally consistent, statistically credible, and physiologically plausible. These findings reinforce the value of HRV as a behavioural signal—one that can reflect fluctuations in subjective wellness when interpreted in context and over time [
4].
This has practical relevance in light of growing public interest in HRV monitoring. Consumer platforms such as WHOOP, Oura, Garmin, and Polar increasingly provide users with composite “readiness” scores that incorporate HRV alongside other metrics. While the rationale for these scores is intuitive—integrating multiple physiological signals to estimate recovery—they may obscure variation in individual inputs, while also serving as test beds for companies to experiment with new markers of health and wellness—often at the expense of users’ data privacy [
46]. HRV is particularly sensitive to context and personal baseline. Aggregating it into a single opaque score, without transparency or independent validation, risks diluting its interpretive value. At present, few of these readiness indices have been tested in prospective, real-world studies. Before relying on black-box recovery scores, we argue for the importance of isolating and validating core physiological signals such as RMSSD. By modelling RMSSD in relation to clearly defined self-reported outcomes, this study offers a transparent and interpretable approach to understanding biosignal dynamics in everyday life. Methodologically, several features support the credibility of our findings: the use of research-grade devices (Polar H10) and validated analysis software (Kubios HRV); a standardised morning protocol to reduce confounding from posture, circadian variation, or physical activity; and a 14-day prospective design that captured repeated within-subject measurements under real-world conditions. Finally, the use of Bayesian ordinal regression allowed us to account for uncertainty and the ordinal nature of our outcome variables, while leave-one-out cross-validation ensured robust model selection.
Conceptually, this study contributes to the field of wearable-based digital phenotyping by demonstrating that daily RMSSD can track perceived stress, fatigue, and sleep quality in a general adult population. Much of the existing literature has focused on athletes [
24,
25] or clinical groups [
22,
23]; our findings extend these insights to more diverse, non-clinical settings. HRV may therefore hold promise as a scalable, low-burden indicator of recovery and strain—particularly for self-monitoring and occupational health applications.
Still, the heterogeneity in effect strength across outcomes warrants cautious interpretation. Not all dimensions of self-reported wellness tracked equally with RMSSD. While stress, fatigue, and sleep quality showed credible associations, physical recovery (body soreness) did not. These differences likely reflect the multi-factorial nature of perceived wellness, shaped by individual autonomic responsivity, psychological framing, and contextual influences like sleep disruption, alcohol intake, or workload. Even under controlled measurement conditions, HRV may reflect some wellness domains more strongly than others.
On this basis, HRV should not be viewed as a diagnostic or standalone marker but rather as a context-sensitive physiological correlate. Its interpretive value increases when considered alongside subjective experience, behavioural patterns, and temporal trends. Our findings reinforce the view that single-day HRV readings offer limited insight, whereas multi-day trajectories—interpreted in context—provide a more robust signal of stress, recovery, and physiological adaptability. To illustrate this, we calculated the coefficient of variation (CV) of RMSSD across the 14-day period for each participant. The average CV was 0.37 (SD = 0.16), with individual values ranging from 0.14 to 0.71. This means that many participants experienced 30–40% day-to-day fluctuations in HRV relative to their own mean. Such natural variability underscores the limitations of point-in-time interpretation. A single-day dip or rise often falls within a participant’s normal range and may not reflect a meaningful change. Recognising this reinforces the case for trend-based HRV monitoring as a more reliable approach to understanding individual physiological status.
In sum, while our findings support the utility of RMSSD as a dynamic indicator of subjective wellness, they should be interpreted in light of several limitations. First, our sample was relatively small and skewed toward younger adults, many of whom were students or early-career professionals. This demographic homogeneity limits the generalisability of our results to older adults or clinically diverse populations. Second, the distributions of self-reported wellness outcomes were moderately skewed toward “normal” or “restful” responses, which may have reduced sensitivity to extreme or impaired states. Third, although the 14-day protocol was sufficient for within-subject modelling, it may not have captured longer-term physiological adaptations, recovery cycles, or responses to sustained stressors. Future studies may benefit from extended monitoring periods to assess baseline stability and detect meaningful deviations over time. Finally, contextual variables such as alcohol use and physical activity were self-reported retrospectively and may be subject to recall inaccuracies or social desirability bias.
Looking ahead, future research should adopt longer-term, longitudinal designs to examine the stability and interpretive value of HRV trends across broader life contexts. Extended tracking may clarify how RMSSD responds to changes in stress exposure, training load, illness, or recovery—and whether specific HRV patterns, such as sustained suppression or increased variability, predict downstream outcomes like burnout, injury, or mood disturbance. From a methodological standpoint, personalised or N-of-1 designs may be especially well suited to HRV monitoring. Given the strong influence of individual baselines, circadian variation, and behavioural context on autonomic measures, adaptive analytic frameworks that incorporate personal history and real-time conditions could enhance both interpretability and relevance. Coupling HRV with ecological momentary assessments of mood, performance, or behaviour may further illuminate the ways in which physiology and experience interact on a day-to-day basis.
In the applied domain, integrating HRV into mobile health platforms offers promising opportunities for real-time behavioural support. Just-in-time adaptive interventions could use HRV patterns to prompt stress management strategies, encourage rest, or nudge users toward recovery-supportive behaviours when signs of physiological strain emerge. However, these systems must be evidence-based, user-centred, and transparent—grounded in rigorous validation and attentive to the limits of what HRV can and cannot infer. Finally, there is a growing need to critically evaluate proprietary readiness scores, such as those produced by WHOOP, Oura, and Garmin. While these indices are widely used in practice, few have undergone independent validation in prospective, real-world studies [
21]. Comparative research should assess whether these composite scores provide predictive value beyond their individual components—such as RMSSD or resting heart rate—or whether they obscure meaningful physiological variation by aggregating disparate signals.