Next Article in Journal
Design and Implementation of a Novel Polarimetric Active Radar Calibrator for Gaofen-3 SAR
Previous Article in Journal
Energy Efficient Management of Pipelines in Buildings Using Linear Wireless Sensor Networks
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:

Validation of the Apple Watch for Heart Rate Variability Measurements during Relax and Mental Stress in Healthy Subjects

Biomedical Signal Interpretation & Computational Simulation (BSICoS) Group, Aragón Institute of Engineering Research (I3A), IIS Aragón, University of Zaragoza, 50018 Zaragoza, Spain
Centro de Investigación Biomédica en Red en Bioingeniería, Biomateriales y Nanomedicina (CIBER-BBN), 28029 Madrid, Spain
Communications Networks and Information Technologies (CeNIT) Group, Aragón Institute of Engineering Research (I3A), University of Zaragoza, 50018 Zaragoza, Spain
Author to whom correspondence should be addressed.
Sensors 2018, 18(8), 2619;
Received: 4 July 2018 / Revised: 2 August 2018 / Accepted: 8 August 2018 / Published: 10 August 2018
(This article belongs to the Section State-of-the-Art Sensors Technologies)


Heart rate variability (HRV) analysis is a noninvasive tool widely used to assess autonomic nervous system state. The market for wearable devices that measure the heart rate has grown exponentially, as well as their potential use for healthcare and wellbeing applications. Still, there is a lack of validation of these devices. In particular, this work aims to validate the Apple Watch in terms of HRV derived from the RR interval series provided by the device, both in temporal (HRM (mean heart rate), SDNN, RMSSD and pNN50) and frequency (low and high frequency powers, LF and HF) domain. For this purpose, a database of 20 healthy volunteers subjected to relax and a mild cognitive stress was used. First, RR interval series provided by Apple Watch were validated using as reference the RR interval series provided by a Polar H7 using Bland-Altman plots and reliability and agreement coefficients. Then, HRV parameters derived from both RR interval series were compared and their ability to identify autonomic nervous system (ANS) response to mild cognitive stress was studied. Apple Watch measurements presented very good reliability and agreement (>0.9). RR interval series provided by Apple Watch contain gaps due to missing RR interval values (on average, 5 gaps per recording, lasting 6.5 s per gap). Temporal HRV indices were not significantly affected by the gaps. However, they produced a significant decrease in the LF and HF power. Despite these differences, HRV indices derived from the Apple Watch RR interval series were able to reflect changes induced by a mild mental stress, showing a significant decrease of HF power as well as RMSSD in stress with respect to relax, suggesting the potential use of HRV measurements derived from Apple Watch for stress monitoring.

1. Introduction

Heart rate variability (HRV) analysis is a widely accepted tool for the noninvasive assessment of autonomic nervous system (ANS) [1]. Its analysis and use are becoming increasingly common, since it is sensitive to both physiological and psychological changes [2]. Altered HRV measurements have been reported in several diseases related to ANS dysregulation, including cardiovascular diseases, such as ischemia, myocardial infarction and heart failure [1,3], metabolic diseases, such as diabetes and obesity [3], and mental disorders, such as anxiety and depression [4,5]. Moreover, HRV measurements have also been used to monitor sleep [6], stress [7,8], drowsiness [9] and exercise training [10,11,12].
Over the past 10 years, the market for health related wearable devices has grown exponentially, as well as their potential use for healthcare and wellbeing applications. The most popular and accepted ones, especially among the non-patient population, are the watch-like devices, which usually provide average heart rate (HR) estimates, derived from the pulse signal recorded at the wrist. They are based on photophethysmography (PPG), including a light source for illuminating the skin and a photodetector, which measures the intensity of the reflected light. HR estimates are based on the pulsatile changes in reflected light induced by fluctuations in blood flow every heartbeat. Although technically feasible, only a small subset of these devices allows HRV measures. As pointed out in [13], these devices usually lack from proper validation, limiting its applicability for enhancing clinical care.
Different wearable devices have been evaluated and compared for HR estimation. In [13] the Apple Watch achieved the best performance estimating HR at a one minute granularity, with median error below 3% during sitting, walking, running, and cycling. However, neither of these devices has been validated for HRV estimation. Although pulse rate variability (PRV) derived from PPG has been validated as a surrogate of HRV derived from the electrocardiogram (ECG) [14], it is not the case for PRV derived from the wrist. Only a few studies have addressed this issue, such as [15], where wrist PPG-based HRV was evaluated for Emphatica E4 (Empatica Srl, Milano, Italy) and PulseOn (PulseOn Oy, Espoo, Finland) devices, yielding reliable results only during sitting condition.
The objective of this study is to validate the Apple Watch for HRV measurement. Prior studies have only validated aggregate HR estimates [13,16,17,18,19], but its outperformance over other devices [13,20] makes it a potential candidate among wrist-worn devices for HRV measurement. Moreover, its spread and acceptance are beating the wearable market [21]. The present study sets out the comparison of RR intervals derived from Apple Watch using as reference the RR intervals derived from the Polar H7 chest belt heart rate monitor (Polar Electro Oy, Kempele, Finland), which has been already validated [22,23]. Additionally, this study includes the validation of HRV measurements, derived from the RR intervals from both devices, in two scenarios with different ANS balance: relax and metal stress.

2. Materials and Methods

2.1. Experimental Setup

A total of 20 volunteers agreed to participate in the study. All of them were apparently healthy subjects. Written informed consent was obtained from each subject. The study protocol was approved by the institutional ethics committee and was in accordance with the Declaration of Helsinki for Human Research of 1974 (last modified in 2008).
Participants were asked to abstain from caffeine and smoking prior to the test and to only consume a light meal 2 h prior to testing. The Polar H7 chest belt was placed tightly just below the chest muscles and the Apple Watch was placed tightly as well in the left wrist. Both devices were cleaned up between uses to avoid possible misreading effects. Measures were taken while participants were sitting in front of a 21″ computer screen, completing relax and mental stress test protocols. Instructions were given to minimize left arm movements, where the Apple Watch was placed.
The experimental protocol consisted of two consecutive 5-min stages, namely relax and stress stages, separated by a 1-min break. During the relax stage volunteers watched a relaxing video, consisting in relaxing music and pleasant images, which was previously used in a study on emotion recognition [24]. During the stress stage, participants performed an online version of the Stroop test (, which is an attentional test in which participants see words (names of different colors) written in colored ink and they are asked to name the color of the ink while ignoring the meaning of the word.
Raw RR data was obtained from Polar H7 ( R R H 7 ) using Elite HRV app installed in an Android device. This app is able to export a text file containing the raw RR interval series with a precision of milliseconds. On the other hand, the Breath app was used to extract RR interval series from Apple Watch ( R R A W ). Breath app, developed by Apple, is the only way at this moment (watchOS 4.2 operating system) to obtain RR raw values since Apple does not include any programing method for developers to directly access the values. This app stores the raw RR values, with a precision of centiseconds, in the user’s Personal Health Record, accessible to be exported in XML format using Apple’s Health App.

2.2. Synchronization and RR Matching

Figure 1 shows an example of both RR series, R R H 7 and R R A W . It can be seen that the RR series derived from the Apple Watch has some gaps, probably corresponding to time instants where Apple Watch proprietary algorithms have not been able to reliably detect the PPG pulses. These gaps were present in almost all Apple Watch recordings, with no apparent relation with the subject or the test stage (relax or stress). Synchronization between R R H 7 and R R A W was performed using the delay that maximized their cross correlation using the first 20 RR intervals, where no gaps appeared in Apple Watch recordings. All RR intervals need to be matched from both devices: we create R R H 7 g by intentionally creating the same gaps in R R H 7 than those in R R A W . This way, both R R A W and R R H 7 g have the same number of samples and all RR intervals are matched. For each recording, the percentage of missing RR intervals missed with respect to the total number of RR intervals were noted.

2.3. Validation of RR Series

A Bland–Altman plot [25] was used to study the validity of the Apple Watch RR series, by representing both R R H 7 g and R R A W series. The bias, the limits of agreement (LOA, ±2*std values) and the percentage of paired RR measurements out of the LOA were also obtained.
To measure the interchangeability between both RR series, two reliability indices were used. First, Lin’s concordance correlation coefficient (CCC) determines how far the observed data deviate from the line of perfect concordance line at 45° on a square axis scatter plot [26]. Second, intraclass correlation coefficient (ICC) represents the ratio of between-sample variance and the total variance (between- and within-sample) to measure precision under the model of equal marginal distributions [27]. Lastly, the agreement was measured by an information-based measure of disagreement (IBMD), which is based on Shannon’s entropy [28]. This index equals 0 when the observers agree (no disagreement); i.e., there is no information in the differences between both methods. The agreement (A) can be quantified as A = 1−IBMD.

2.4. Heart Rate Variability Parameters

Heart rate variability parameters were divided in two classes: time and frequency domain indices. Since both are calculated in different ways, the gaps also are addressed differently. For the temporal domain analysis, we will derive HRV parameters from R R A W , R R H 7 and R R H 7 g , the latter created with simulated gaps as defined in Section 2.2. For the frequency domain analysis, most of the methods used for power spectral density estimation require evenly sampled series, and interpolation is usually employed to overcome the intrinsic unevenly sampled RR interval series. However, since gaps in R R A W and R R H 7 g are too large (from 3.3 to 10.4 s), we first filled the gaps with artificial RR intervals, created by linear interpolation. Note that, for simplicity, we use the same nomenclature, R R A W and R R H 7 g , for temporal and frequency domain analysis, although they are slightly different: for temporal HRV parameters, the gaps are omitted and the RR intervals are concatenated ( R R A W and R R H 7 g are shorter than R R H 7 ), while for frequency HRV parameters the gaps are filled with artificially created RR intervals ( R R A W and R R H 7 g are the same length than R R H 7 ).
For the temporal parameters, four parameters were proposed [1]: HRM, SDNN, RMSSD and pNN50. HRM is the mean heart rate, which is obtained as the inverse of the mean heart period (mean of the RR intervals). SDNN is the standard deviation of the NN intervals (or normal RR intervals), which is a measure of the total power in the analyzed segment. A dispersion measure can be obtained by calculating the root mean-square of successive differences of adjacent intervals (RMSSD). The last parameter, pNN50, represents the percentage of pairs of adjacent intervals differing by more than 50 ms. RMSSD and pNN50 describe short-time variability. Note that whenever a gap appears in R R A W , the corresponding intervals in R R H 7 are also removed, and the remaining RR intervals are concatenated. The difference of the intervals just before and after the gaps are not included in the calculation of RMSSD and pNN50.
For the frequency domain analysis, the heart rate variability signal was obtained following a method based on the integral pulse frequency modulation (IPFM) model described in [29]: from the RR interval series, we derived the instantaneous heart rate signal, d H R ( n ) , which is a continuous and evenly sampled signal (sampled at 4 Hz). Then, we obtained the mean heart rate, d H R M ( n ) , using a low pass filter up to 0.03 Hz. The variability signal, d H R V ( n ) , was obtained as d H R V ( n ) = d H R ( n ) d H R M ( n ) . Finally, the modulating signal, which is assumed to represent the ANS modulation on the sinoatrial node, was obtained as m ^ ( n ) = d H R V ( n ) / d H R M ( n ) [30].
The Welch periodogram was applied to estimate the spectral properties of the HRV signals m ^ A W ( n ) , m ^ H 7 ( n ) and m ^ H 7 g ( n ) (derived from R R A W , R R H 7 and R R H 7 g , respectively), using a Hamming window with a length of 150 s with an overlap of 120 s. Low frequency (LF) and high frequency (HF) powers, P L F and P H F respectively, were obtained integrating the power in their bands (LF: 0.04–0.15 Hz, HF: 0.15–0.4 Hz) [1].

2.5. Statistical Analysis

None of all HRV parameters followed a normal distribution (tested with a Kolmogorov test). To compare HRV parameters, a paired Wilcoxon test was applied. Then we performed 3 different comparisons: (1) we analyzed the influence of the gaps by comparing HRV parameters derived from R R H 7 and R R H 7 g ; (2) we compared HRV parameters derived from R R A W and R R H 7 g ; and (3) HRV parameters derived from R R A W were compared in relax vs. stress stage to evaluate the ability of those parameters to reflect ANS response to stress, using changes in HRV parameters derived from R R H 7 as the reference ANS response. The difference was considered to be significantly different from zero when p < 0.05.
Distribution of HRV parameters are shown using boxplots: on each box, the central mark indicates the median, and the bottom and top edges of the box indicate the 25th and 75th percentiles, respectively. The whiskers extend to the most extreme data points not considered outliers, and the outliers are plotted individually using the ‘+’ symbol. An outlier is defined as a point that falls more than 1.5 times the interquartile range (difference between 75th and 25th percentiles) above the 75th percentile or below the 25th percentile.

3. Results

3.1. Validity of RR Series

A total of 206 gaps were found in the Apple Watch recordings, equivalent to 1321 missing RR intervals (around 10% of total intervals). On average, each recording presents 5 gaps, with a length of 6 s per gap. The minimum length found was 3.3 s and the maximum length was 10.4 s. No differences in the number or length of these gaps were found between relax and stress recordings.
Figure 2 shows the Bland–Altman plot which evaluates the RR interval discrepancies between Polar H7 and Apple Watch measurements and the stability across the different values of the intervals. A total of 12,109 paired RR intervals were used from both relax and stress stages. The bias (central horizontal line) was 0.06 ms. The LOA (upper and lower lines) contained 96.21% of the intervals. No visual differences were found in the discrepancies between lower and higher RR intervals.
Table 1 shows the mean values of the RR intervals for each stage, as well as the reliability and agreement indices. While slightly lower in the stress stage, these indices show excellent reliability and agreement between the measurements in both stages. Bias and LOA are also shown for relax and stress stages, which are very similar to those obtained with the combined data. The last row shows the mean of the percentage of missing RR intervals in R R A W .

3.2. HRV Parameters: Temporal Domain

Figure 3 shows the temporal parameters for both relax and stress stages. No significant differences were found in any parameter neither between R R H 7 and R R H 7 g , nor between R R H 7 g and R R A W .

3.3. HRV Parameters: Frequency Domain

Figure 4 shows the frequency HRV parameters derived from R R H 7 , R R H 7 g and R R A W in relax and stress stages. Both P L F and P H F show significant differences (marked with an asterisk between the boxplots) in R R H 7 g when comparing to R R H 7 , being lower in the presence of the gaps. When comparing HRV parameters derived from R R A W and R R H 7 g , neither P L F nor P H F presented significant differences.

3.4. HRV Parameters: Relax vs. Stress

Figure 5 and Figure 6 display HRV parameters derived from R R A W and R R H 7 in relax vs. stress stages. All temporal HRV parameters (HRM, SDNN, RMSSD and pNN50) presented changes from relax to stress in both Apple Watch and Polar H7 recordings: an increase in HRM and a decrease in the rest of parameters. We observed a decrease in P L F in stress with respect to relax stage, being only statistically significant in Polar H7 recordings (p = 0.030 for Polar H7, p = 0.065 for Apple Watch). A significant decrease was found in both devices when comparing P H F .

4. Discussion

This is the first study to validate HRV measurements using an Apple Watch to the authors’ knowledge. Many of the wrist-worn devices aimed at assessing well-being, stress, and fitness provide average HR values, derived from a pulse signal, instead of HRV. Fitbit, Jawbone, TomTom, Garming, Withing, Samsumg, among others, commercialize such devices. However, although changes in average HR are mainly induced by ANS and can be significantly different in some physio-pathological situations, it cannot be considered a measure of autonomic function [3,31]. Moreover, there are situations where altered ANS function is manifested in HRV but not in HR changes, such as in depressed patients with respect to controls [32] or in exercise contexts [33,34].
In order to validate HRV measurements derived from an Apple Watch, in this work an experimental study was conducted on young healthy volunteers subjected to mild relax and stress. HRV measurements derived from Apple Watch were compared to those obtained from a validated heart rate monitor, in particular Polar H7 chest band. Several works in the literature have already validated, using as reference the RR interval series derived from a synchronous ECG, the RR interval series given by different models of Polar devices: S810 [35,36], RS800 [37,38], and V800 with a H7 chest belt [22,23].
Regarding the RR intervals series, which are the basis for any HRV analysis, the bias was almost 0 and most pairs of RR intervals (around 96%) were within the limits of agreement in the Bland Altman plot. These results are consistent when studying both stages separately. Moreover, the discrepancy between Apple Watch and Polar H7 is similar with no dependency of the heart rate. The RR series present excellent reliability and agreement when analyzing matched pairs of RR intervals. These results agree with other studies for Apple Watch [17,18] and other heart rate watches [13,16,19].
However, it is important to note the existence in Apple Watch measurements of missing beats. Around 10% of the beats could not be detected by the Apple Watch in both the relax and stress stages. This could be not problematic if they were isolated beats, but they usually were consecutive beats, which lead to gaps in the RR series. The shorter gaps were longer than 3 s, being easily identifiable as abnormally large RR intervals. The origin of these gaps could be varied, from bad skin contact to fast arm movement, but ultimately, we cannot conclude anything about these gaps because we do not have direct access to the algorithms used by the Apple Watch. In [18], they also reported missing values in the Apple Watch, with the proportion of heart rate values actually measured by the Apple Watch decreasing with increasing exercise intensity. They also reported that these missing heart rate values were higher in the first minute of exercise (between 20 and 40% of RR intervals), particularly at higher intensities of exercise. In our data, however, we did not find more missing beats in the stress stage compared to the relax stage, and the total missing RR intervals was about 10% of total intervals.
These missing beats are important when extracting HRV parameters in the frequency domain, since the RR interval series needs to be interpolated in the gaps. This might influence HRV frequency parameters. To study the influence of these gaps in frequency parameters P L F and P H F , we created R R H 7 g by intentionally creating gaps in the R R H 7 series, removing the missing RR intervals found in R R A W . This analysis showed that both P L F and P H F were significantly lower in the presence of simulated gaps. However, when comparing HRV frequency parameters derived from R R A W and from R R H 7 g , no significant difference was found, suggesting that differences between HRV parameters derived from RR series provided by Apple Watch and Polar H7 are indeed due to the presence of gaps in the Apple Watch series, rather than to the different signal from which they are derived or to the different time resolution. The effect of gaps on HRV temporal parameters was also investigated, but no significant differences were found, suggesting that these parameters are less sensitive to the presence of gaps.
Despite these differences, HRV measurements derived from Apple Watch were still able to reflect the ANS response induced by the stress stage, replicating most of the trends observed in HRV measurements derived from the Polar H7. Induced mental stress resulted in a significant decrease in P H F with respect to relax (in both the Apple Watch and Polar H7 measures, note that there were no gaps in the Polar H7 recordings), suggesting the inhibition of parasympathetic stimulation, as already reported in numerous studies [8,39]. A similar conclusion can be extracted with the decrease in RMSSD parameter in the stress phase, being this parameter associated to the parasympathetic activity. Although not shown in the results, the ratio between LF and HF powers (R), as well as the normalized LF power ( P L F n ), were also obtained. Comparing relax and stress, these parameters have shown in other studies a significant increase during stress [8], which means a predominance of the sympathetic activity. In this work, however, the increase found in R and P L F n was not significant even in the Polar H7 measures, possibly due to the fact that the level of stress was moderate: see the moderately low maximum HR achieved in the stress stage (around 100 bpm). Still, the significant decrease in P H F was evident using both the Polar H7 and the Apple Watch, supporting the potential use of the Apple Watch for stress monitoring.
However, one of the main limitations of using the Apple Watch to obtain and use HRV measures is that, at this moment, the RR series is only available through the use of the Apple Breath App in an indirect way (exporting the measured values using the Health App, where all Health-related data from the user is stored). Apple does not provide within the watch OS 4.2 SDK any function to access raw RR measures. Thus, developers are not able to develop an App that takes advantage of the potential shown by Apple Watch in HRV monitoring. Besides, Breath App stops when repeated motion is detected. This does not allow us to validate HRV measurements in low or moderate exercise. Nevertheless, this SDK limitation can be removed by Apple in a further SDK release. This inclusion would leverage the development of Apps able to monitoring stress conditions, among other things, by the developer’s community.

5. Conclusions

Heart rate variability parameters extracted from an Apple Watch device have been validated using a Polar H7 band as a reference during relax and mental stress in 20 healthy volunteers. Reliability and agreement coefficients were computed for the RR interval series provided by both devices in relax and stress stages, achieving very good results (reliability and agreement > 0.9). No significant differences were found when comparing temporal HRV parameters (SDNN, RMSSD, pNN50 and HRM) derived from the RR interval series provided by both devices. However, frequency HRV parameters LF and HF powers were significantly different when derived from the Apple Watch, due to the appearance of gaps in the RR series, both in relax and stress stages. Nonetheless, a decrease in the HF power (and in the RMSSD parameter) was observed in stress with respect to relax stage when using both devices, which supports the potential use of the Apple Watch for stress monitoring.

Author Contributions

Conceptualization, D.H., Á.A. and R.B.; Methodology, D.H. and R.B.; Formal Analysis, D.H.; Data Curation, S.R., J.S. and Á.A.; Writing-Original Draft Preparation, D.H., S.R., J.S., Á.A. and R.B.; Writing-Review & Editing, D.H., Á.A. and R.B.; Supervision, Á.A. and R.B.


This work was supported by Centro de Investigación Biomédica en Red–Bioingeniería, Biomateriales y Nanomedicina (CIBER-BBN) through Instituto de Salud Carlos III, by the Ministerio de Economía, Industria y Competitividad, Gobierno de España, European Regional Development Fund (TIN2016-76770-R) and FEDER “Construyendo Europa desde Aragón” (Grupo de investigación T31_17R), by Aragón Government through Grupo de Referencia BSICoS (T39_17R), by Aragón Institute of Engineering Research (I3A), IIS Aragón and European Social Fund (EU).


The computation was performed by the ICTS “NANBIOSIS”, more specifically by the High Performance Computing Unit of the CIBER-BBN at the University of Zaragoza. The authors also want to acknowledge the support from Ogilvy Paris.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, and in the decision to publish the results.


  1. Camm, A.J.; Malik, M.; Bigger, J.T.; Brethardt, G.; Cerutti, S.; Cohen, R.J.; Coumel, P.; Fallen, E.L.; Kennedy, H.L.; Kleiger, R.E.; et al. Heart rate variability: Standards of measurement, physiological interpretation, and clinical use. Circulation 1996, 93, 1043–1065. [Google Scholar]
  2. Thayer, J.; Ahs, F.; Fredrikson, M.; Sollers, J.; Wager, T. A meta-analysis of heart rate variability and neuroimaging studies: Implications for heart rate variability as a marker of stress and health. Neurosci. Biobehav. Rev. 2012, 36, 747–756. [Google Scholar] [CrossRef] [PubMed]
  3. Billman, G. Heart rate variability—A historical perspective. Front. Physiol. 2011, 2, 86. [Google Scholar] [CrossRef] [PubMed]
  4. Beauchaine, T.; Thayer, J. Heart rate variability as a transdiagnostic biomarker of psychopathology. Int. J. Psychophysiol. 2015, 98, 338–350. [Google Scholar] [CrossRef] [PubMed]
  5. Koenig, J.; Kemp, A.; Beauchaine, T.; Thayer, J.; Kaess, M. Depression and resting state heart rate variability in children and adolescents—A systematic review and meta-analysis. Clin. Psychol. Rev. 2016, 46, 136–150. [Google Scholar] [CrossRef] [PubMed]
  6. Tobaldini, E.; Nobili, L.; Strada, S.; Casali, K.; Braghiroli, A.; Montano, N. Heart rate variability in normal and pathological sleep. Front. Physiol. 2013, 4, 294. [Google Scholar] [CrossRef] [PubMed][Green Version]
  7. Kim, H.; Cheon, E.; Bai, D.; Lee, Y.; Koo, B. Stress and heart rate variability: A meta-analysis and review of the literature. Psychiatry Investig. 2018, 15, 235–245. [Google Scholar] [CrossRef] [PubMed]
  8. Hernando, A.; Lázaro, J.; Gil, E.; Arza, A.; Garzón, J.; López-Antón, R.; de la Cámara, C.; Laguna, P.; Aguiló, J.; Bailón, R. Inclusion of respiratory frequency information in heart rate variability analysis for stress assessment. IEEE J. Biomed. Health Inform. 2016, 20, 1016–1025. [Google Scholar] [CrossRef] [PubMed]
  9. Vicente, J.; Laguna, P.; Bartra, A.; Bailón, R. Drowsiness detection using heart rate variability. Med. Biol. Eng. Comput. 2016, 54, 927–937. [Google Scholar] [CrossRef] [PubMed]
  10. Kiviniemi, A.; Hautala, A.; Kinnunen, H.; Tulppo, M. Endurance training guided individually by daily heart rate variability measurements. Eur. J. Appl. Physiol. 2007, 101, 743–751. [Google Scholar] [CrossRef] [PubMed]
  11. Plews, D.; Laursen, P.; Stanley, J.; Kilding, A.; Buchheit, M. Training adaptation and heart rate variability in elite endurance athletes: Opening the door to effective monitoring. Sports Med. 2013, 43, 773–781. [Google Scholar] [CrossRef] [PubMed]
  12. Tian, Y.; He, Z.; Zhao, J.; Tao, D.; Xu, K.; Earnest, C.; McNaughton, L. Heart rate variability threshold values for early-warning nonfunctional overreaching in elite female wrestlers. J. Strength Cond. Res. 2013, 27, 1511–1519. [Google Scholar] [CrossRef] [PubMed]
  13. Shcherbina, A.; Mattsson, C.; Waggott, D.; Salisbury, H.; Christle, J.; Hastie, T.; Wheeler, M.; Ashley, E. Accuracy in wrist-worn, sensor-based measurements of heart rate and energy expenditure in a diverse cohort. J. Pers. Med. 2017, 7, 3. [Google Scholar] [CrossRef] [PubMed]
  14. Gil, E.; Orini, M.; Bailón, R.; Vergara, J.; Mainardi, L.; Laguna, P. Photoplethysmography pulse rate variability as a surrogate measurement of heart rate variability during non-stationary conditions. Physiol. Meas. 2010, 31, 1271–1290. [Google Scholar] [CrossRef] [PubMed]
  15. Eskola, H.; Väisänen, O.; Viik, J.; Hyttinen, J. Evaluation of the accuracy and reliability for photoplethysmography based heart rate and beat-to-beat detection during daily activities. In Proceedings of the Joint Conference of the European Medical and Biological Engineering Conference (EMBEC) and the Nordic-Baltic Conference on Biomedical Engineering and Medical Physics (NBC) 2017, Tampere, Finland, 11–15 June 2017. [Google Scholar]
  16. Wallen, M.; Gomersall, S.; Keating, S.; Wisløff, U.; Coombes, J. Accuracy of heart rate watches: Implications for weight management. PLoS ONE 2016, 11, e0154420. [Google Scholar] [CrossRef] [PubMed]
  17. Abt, G.; Bray, J.; Benson, A. The validity and inter-device variability of the Apple Watch for measuring maximal heart rate. J. Sports Sci. 2017, 36, 1447–1452. [Google Scholar] [CrossRef] [PubMed]
  18. Khushhal, A.; Nichols, S.; Evans, W.; Gleadall-Siddall, D.; Page, R.; O’Doherty, A.; Carroll, S.; Ingle, L.; Abt, G. Validity and reliability of the AppleWatch for measuring heart rate during exercise. Sports Med. Int. Open 2017, 1, 206–211. [Google Scholar]
  19. Wang, R.; Blackburn, G.; Desai, M.; Phelan, D.; Gillinov, L.; Houghtaling, P.; Gillinov, M. Accuracy of wrist-worn heart rate monitors. JAMA Cardiol. 2017, 2, 104–106. [Google Scholar] [CrossRef] [PubMed]
  20. Bai, Y.; Hibbing, P.; Mantis, C.; Welk, G. Comparative evaluation of heart rate-based monitors: Apple Watch vs Fitbit Charge HR. J. Sports Sci. 2017, 36, 1734–1741. [Google Scholar] [CrossRef] [PubMed]
  21. Canalys. 18 Million Apple Watches Ship in 2017, up 54% on 2016. 2018. Available online: (accessed on 4 July 2018).
  22. Giles, D.; Draper, N.; Neil, W. Validity of the Polar V800 heart rate monitor to measure RR intervals at rest. Eur. J. Appl. Physiol. 2016, 116, 563–571. [Google Scholar] [CrossRef] [PubMed][Green Version]
  23. Caminal, P.; Sola, F.; Gomis, P.; Guasch, E.; Perera, A.; Soriano, N.; Mont, L. Validity of the Polar V800 monitor for measuring heart rate variability in mountain running route conditions. Eur. J. Appl. Physiol. 2018, 118, 669–677. [Google Scholar] [CrossRef] [PubMed]
  24. Valderas, M.T.; Bolea, J.; Laguna, P.; Vallverdú, M.; Bailón, R. Human Emotion Recognition Using Heart Rate Variability Analysis with Spectral Bands Based on Respiration. In Proceedings of the 37nd Annual International Conference of the IEEE EMBS, Milan, Italy, 25–29 August 2015; pp. 6134–6137. [Google Scholar]
  25. Bland, J.; Altman, D. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet 1986, 8, 307–310. [Google Scholar] [CrossRef]
  26. Lin, L. A concordance correlation coefficient to evaluate reproducibility. Biometrics 1989, 45, 255–268. [Google Scholar] [CrossRef] [PubMed]
  27. Fisher, R. Statistical Methods for Research Workers; Oliver & Boyd: Edinburgh, UK, 1925. [Google Scholar]
  28. Costa-Santos, C.; Antunes, L.; Souto, A.; Bernardes, J. Assessment of disagreement: A new information-based approach. Ann. Epidemiol. 2010, 20, 555–561. [Google Scholar] [CrossRef] [PubMed]
  29. Mateo, J.; Laguna, P. Improved heart rate variability signal analysis from the beat occurrence times according to the IPFM model. IEEE Trans. Biomed. Eng. 2000, 47, 985–996. [Google Scholar] [CrossRef] [PubMed][Green Version]
  30. Bailón, R.; Laouini, G.; Grao, C.; Orini, M.; Laguna, P.; Meste, O. The integral pulse frequency modulation with time-varying threshold: Application to heart rate variability analysis during exercise stress testing. IEEE Trans. Biomed. Eng. 2011, 58, 642–652. [Google Scholar] [CrossRef] [PubMed]
  31. Sacha, J. Interplay between heart rate and its variability: A. prognostic game. Front. Physiol. 2014, 5, 347. [Google Scholar] [CrossRef] [PubMed]
  32. Valenza, G.; Garcia, R.; Citi, L.; Scilingo, E.; Tomaz, C.; Barbieri, R. Nonlinear digital signal processing in mental health: Characterization of major depression using instantaneous entropy measures of heartbeat dynamics. Front. Physiol. 2015, 6, 74. [Google Scholar] [CrossRef] [PubMed][Green Version]
  33. Weippert, M.; Behrens, K.; Rieger, A.; Stoll, R.; Kreuzfeld, S. Heart rate variability and blood pressure during dynamic and static exercise at similar heart rate levels. PLoS ONE 2013, 8, e83690. [Google Scholar] [CrossRef] [PubMed]
  34. Buchheit, M. Monitoring training status with HR measures: Do all roads lead to Rome? Front. Physiol. 2014, 5, 73. [Google Scholar] [CrossRef] [PubMed]
  35. Gamelin, F.X.; Baquet, G.; Berthoin, S.; Bosquet, L. Validity of the polar S810 to measure R-R intervals in children. Int. J. Sports Med. 2008, 29, 134–138. [Google Scholar] [CrossRef] [PubMed]
  36. Nunan, D.; Jakovljevic, D.G.; Donovan, G.; Hodges, L.D.; Sandercock, G.R.; Brodie, D.A. Levels of agreement for RR intervals and short-term heart rate variability obtained from the Polar S810 and an alternative system. Eur. J. Appl. Physiol. 2008, 103, 529–537. [Google Scholar] [CrossRef] [PubMed]
  37. Barbosa, M.P.; da Silva, N.T.; de Azevedo, F.M.; Pastre, C.M.; Vanderlei, L.C. Comparison of Polar® RS800G3™ heart rate monitor with Polar® S810i™ and electrocardiogram to obtain the series of RR intervals and analysis of heart rate variability at rest. Clin. Physiol. Funct. Imaging 2016, 36, 112–117. [Google Scholar] [CrossRef] [PubMed]
  38. Hernando, D.; Garatachea, N.; Almeida, R.; Casajús, J.A.; Bailón, R. Validation of Heart Rate Monitor Polar RS800 for Heart Rate Variability Analysis During Exercise. J. Strength Cond. Res. 2018, 32, 716–725. [Google Scholar] [CrossRef] [PubMed]
  39. Porges, S. Cardiac vagal tone: A physiological index of stress. Neurosci. Biobehav. Rev. 1995, 19, 225–233. [Google Scholar] [CrossRef]
Figure 1. Example of the RR series for a subject in both stages: Polar H7 (blue) and Apple Watch (red). In the stress stage, there are gaps in the Apple Watch recording where no beats are detected.
Figure 1. Example of the RR series for a subject in both stages: Polar H7 (blue) and Apple Watch (red). In the stress stage, there are gaps in the Apple Watch recording where no beats are detected.
Sensors 18 02619 g001
Figure 2. Bland-Altman plot: R R H 7 g   vs .   R R A W . Mean of the difference of the RR series ±2*std values (limits of agreement, LOA).
Figure 2. Bland-Altman plot: R R H 7 g   vs .   R R A W . Mean of the difference of the RR series ±2*std values (limits of agreement, LOA).
Sensors 18 02619 g002
Figure 3. Heart rate variability (HRV) parameters: time domain.
Figure 3. Heart rate variability (HRV) parameters: time domain.
Sensors 18 02619 g003
Figure 4. HRV parameters: frequency domain, derived from R R H 7 , R R H 7 g and R R A W . * denotes significant differences (p < 0.05) between adjacent boxplots. Adim refers to adimensional units.
Figure 4. HRV parameters: frequency domain, derived from R R H 7 , R R H 7 g and R R A W . * denotes significant differences (p < 0.05) between adjacent boxplots. Adim refers to adimensional units.
Sensors 18 02619 g004
Figure 5. HRV parameters: temporal domain (Relax vs. Stress). * denotes significant differences (p < 0.05) between adjacent boxplots.
Figure 5. HRV parameters: temporal domain (Relax vs. Stress). * denotes significant differences (p < 0.05) between adjacent boxplots.
Sensors 18 02619 g005
Figure 6. HRV parameters: frequency domain (Relax vs. Stress). * denotes significant differences (p < 0.05) between adjacent boxplots. Adim refers to adimensional units.
Figure 6. HRV parameters: frequency domain (Relax vs. Stress). * denotes significant differences (p < 0.05) between adjacent boxplots. Adim refers to adimensional units.
Sensors 18 02619 g006
Table 1. Validity of RR series from Apple Watch. CI = Confidence interval.
Table 1. Validity of RR series from Apple Watch. CI = Confidence interval.
Mean (SD) H7 RR intervals (ms)869.28 (114.01)834.78 (97.43)
Mean (SD) AW RR intervals (ms)869.23 (114.39)834.70 (97.84)
CCC (90% CI)0.989 (0.981, 0.998)0.977 (0.970, 0.985)
ICC (90% CI)0.989 (0.984, 0.996)0.982 (0.977, 0.987)
A (90% CI)0.993 (0.987, 0.999)0.983 (0.975, 0.991)
Bias (Out LOA)0.049 (3.29%)0.078 (3.93%)
Mean (SD)% missed RR intervals10.98 (9.78)9.45 (7.30)

Share and Cite

MDPI and ACS Style

Hernando, D.; Roca, S.; Sancho, J.; Alesanco, Á.; Bailón, R. Validation of the Apple Watch for Heart Rate Variability Measurements during Relax and Mental Stress in Healthy Subjects. Sensors 2018, 18, 2619.

AMA Style

Hernando D, Roca S, Sancho J, Alesanco Á, Bailón R. Validation of the Apple Watch for Heart Rate Variability Measurements during Relax and Mental Stress in Healthy Subjects. Sensors. 2018; 18(8):2619.

Chicago/Turabian Style

Hernando, David, Surya Roca, Jorge Sancho, Álvaro Alesanco, and Raquel Bailón. 2018. "Validation of the Apple Watch for Heart Rate Variability Measurements during Relax and Mental Stress in Healthy Subjects" Sensors 18, no. 8: 2619.

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop