1. Introduction
Rapid-eye-movement sleep behavior disorder (RBD) is a parasomnia characterized by the loss of normal muscle atonia during REM sleep, resulting in motor disinhibition [
1]. Motor manifestations in RBD vary widely in intensity, both across individuals and from night to night, ranging from subtle twitches to complex and occasionally violent behaviors. Prompt and accurate diagnosis of RBD is critical, as it is most often a prodromal manifestation of synucleinopathies, including Parkinson’s disease, dementia with Lewy bodies, and multiple system atrophy.
The diagnosis of RBD relies on overnight video-polysomnography (vPSG). This assessment includes electromyography (EMG) recordings from the submentalis (chin), flexor digitorum superficialis (upper extremities), and anterior tibialis muscles (lower extremities) to quantify muscle activity during REM sleep, along with synchronized infrared video recording, as outlined by International RBD Study Group guidelines [
2]. EMG provides a direct window into the pathophysiologic hallmark of RBD—excessive muscle activity during REM sleep—which reflects well-characterized disruptions in REM sleep motor control circuits, including dysfunction within pontomedullary inhibitory circuits and altered cortical–brainstem interactions [
3]. In contrast, video data are particularly valuable for evaluating RBD symptom severity [
4,
5,
6].
Importantly, EMG activity does not always correspond to overt movements, as some electrical activity may not reach a movement threshold. Conversely, not all movements detected on video are captured by EMG, as they may involve muscle groups that are not monitored. Yet, movements in video may be obscured by bed coverings, and subtle twitches may fall below the detection threshold of infrared cameras. Recent studies have explored the potential of video analysis for differentiating RBD from other sleep disorders [
7,
8].
A third modality, wrist actigraphy, has been shown to capture excessive movements in RBD compared to other sleep disorders and healthy sleep [
9,
10,
11]. More recent work has extended these approaches using automated and machine learning-based methods for RBD detection across home and ambulatory settings, with actigraphy-based classifications validated against concurrently acquired vPSG recordings [
12,
13,
14,
15,
16]. Despite a growing body of evidence supporting the development of actigraphy-based classifiers—and the potential scalability of this approach as a screening tool—two critical aspects of wrist accelerometry in RBD remain insufficiently studied: (1) the extent to which movements detected in RBD are specific to REM sleep, as opposed to non-REM sleep or wakefulness; and (2) the degree to which wrist actigraphy captures overlapping versus distinct motor phenomena of REM sleep when compared with EMG–particularly of the upper extremities, and with movements observed on video.
2. Materials and Methods
2.1. Participants
Three research cohorts were used in this study:
a. RBD cases. 17 adult participants with RBD (
n = 9) at Mount Sinai Sleep Research Center, (
n = 8) at the Stanford Sleep Research Center, (
n = 16) with idiopathic RBD, (
n = 1) with antidepressant-induced RBD, underwent vPSG with concomitant wrist actigraphy between June 2021 and November 2025. All RBD diagnoses were confirmed by overnight vPSG based on ICSD-3 criteria [
17]. Research participants signed an informed consent form, and the study was approved by the Institutional Review Boards of the Icahn School of Medicine at Mount Sinai. Participant demographics are summarized in
Table 1 below.
b. Controls. The open research Newcastle dataset [
18] (
n = 8), in which participants also wore wrist actigraphy during concomitant vPSG, was used for this study.
The Mount Sinai RBD cohort served as the primary dataset for multimodal comparisons (EMG, actigraphy, and video), as it was the only cohort with fully synchronized recordings across all three modalities. The Stanford RBD cohort was combined with the Mount Sinai cohort exclusively for analyses of actigraphy-derived movement load across sleep stages (REM vs. non-REM (NREM)) to increase statistical power. Stanford data were acquired through an ongoing collaboration with the Stanford Sleep Center and collected under standardized protocols. The Newcastle control dataset was used solely for between-group comparisons of actigraphy-derived movement load (RBD vs. controls).
Accordingly, analyses of movement load across sleep stages (Result
Section 3.1) included the combined RBD cohort (Mount Sinai + Stanford) and controls, whereas the multimodal overlap analyses (Result
Section 3.2,
Section 3.3 and
Section 3.4) were restricted to the Mount Sinai cohort to ensure consistency in multimodal acquisition and scoring.
2.2. Video PSG
All RBD participants were connected to PSG equipment 2 h prior to their habitual bedtime, which was determined by participant interview. Data (diagnostic or on a therapeutic Continuous Positive Airway Pressure (CPAP) machine) was acquired following standard American Academy of Sleep Medicine (AASM) protocols using Compumedics E-series and Grael 2 systems (Compumedics Ltd., Melbourne, Australia). Signal acquisition included 19 channel electroencephalography placed according to the International 10–20 system or a subset of 6 channels (F3, F4, C3, C4, O1, and O2), left and right electro-oculography (EOG), all sampled at 256 or 512 Hz and referenced to a common patient and ground electrode, submental bipolar, bilateral flexor digitorum superficialis, and anterior tibialis bipolar EMG, respiration by a pressure transducer and PAP device interface, effort by rib/abdomen impedance plethysmography, single-channel ECG, and oxygen saturation by pulse oximetry (SpO2). Synchronized infrared video was recorded continuously throughout the night.
Sleep stages were scored by a sleep technician in 30 s epochs according to standard criteria for sleep and EEG arousals and confirmed by a board-certified sleep specialist. Total sleep time (TST) and percentage time spent in wake, REM sleep, non-REM stage 1 (N1), non-REM stage 2 (N2), and slow-wave sleep (SWS/N3) were determined.
A summary of the Mount Sinai, Stanford, and Newcastle vPSG data analyzed in this study is presented in
Table 2,
Tables S1 and S2. REM sleep periods shorter than 5 min were excluded from analysis, in line with International RBD Study Group guidelines [
2]. Consequently, one REM period was excluded for Mount Sinai participant 6 (leaving three REM periods) and for control participants 1 and 2 (leaving two REM periods each).
2.3. Actigraphy
All RBD participants wore an AX6 triaxial accelerometer (Axivity Ltd., Newcastle, UK) on both wrists during their overnight vPSG, recorded at 50 Hz with a ±8 g dynamic range. Timestamps across EMG, infrared video, and actigraphy were synchronized by aligning the prominent movement artifacts generated by a standardized brisk arm maneuver performed at the start of the vPSG recording. An illustrative example of the study design is shown in
Figure 1.
All control participants in the Newcastle dataset wore a GENEActive actigraph [
18] (Activinsights Ltd., Kimbolton, UK). To ensure comparability across datasets, both devices were configured with the same dynamic range (±8 g), and gyroscope data were not used [
19,
20]. Raw acceleration signals were processed using harmonized pipelines, including resampling to a common frequency of 25 Hz prior to feature extraction.
Although preprocessing steps were aligned, the devices differ in native sampling frequency (85.7 Hz for GENEActive vs. 50 Hz for AX6) and resolution (12-bit vs. 16-bit, respectively). These hardware differences may influence sensitivity to low-amplitude movements and introduce subtle differences in movement quantification across devices.
2.4. Electromyographic Scoring
Bilateral flexor digitorum superficialis EMG activity during REM sleep was scored according to the AASM Manual for the Scoring of Sleep and Associated Events, Version 3 [
21]. EMG signals were filtered using a 5 Hz high-pass filter and a 500 Hz low-pass filter. Within each REM sleep period, EMG activity was manually scored in 3 s mini-epochs using a binary system. Activity was defined as an amplitude at least twice the participant’s baseline atonia level for that REM sleep cycle and lasted between 0.1 and 5.0 s. The atonia level for each REM period was defined as the lowest EMG amplitude observed during that period, or if lower, during the preceding non-REM period.
2.5. Video Scoring
Arm movements during each REM sleep cycle were manually evaluated from the video recordings and scored using a binary system. Video scoring was performed blinded to the EMG and actigraphy scores to minimize bias. Any observable arm movement within each 3 s mini-epoch was marked as activity. Movements were not differentiated by amplitude; small twitches and large, full-body movements were treated equivalently and coded as 1 (activity).
2.6. Actigraphy Scoring: Activity Counts
Activity counts from the accelerometer data were calculated following the activity count model described in Brink-Kjaer et al. (2022) [
12]. In brief, raw accelerometer data was calibrated such that the vector magnitude is equal to the gravitational acceleration (1 g) during periods of inactivity. Activity counts were calculated as the acceleration magnitude based on the Euclidean Norm Minus One formula:
where
,
, and
represent acceleration along the three orthogonal axes. The activity counts were then averaged in 1 s bins. To mitigate baseline drift, the resulting time series were forward–backward filtered using a high-pass infinite impulse response (IIR) Butterworth filter with a stopband attenuation of 80 dB at 0.1 Hz and passband ripple of 1 dB starting at 0.5 Hz. Activity counts less than 0.1 were rounded down to zero. A 3 s mini epoch was classified as active if the maximum activity count within the epoch exceeded zero.
Accelerometer data from control participants in the Newcastle dataset were processed using a modified version of the open-dataset processing code [
18,
22], and activity counts were calculated using the same algorithm described above.
2.7. Statistical and Agreement Analysis
Signal processing and statistical analyses were performed using MATLAB (R2024b, The MathWorks, Natick, MA, USA) and Python (version 3.11, Python Software Foundation, Wilmington, DE, USA). Differences in actigraphy-derived activity counts across sleep stages were first examined to evaluate whether detected movements were specific to REM sleep rather than non-REM sleep and wakefulness, and whether REM motor activity was elevated in participants with RBD relative to controls.
We quantified actigraphy-derived movement load, defined as the sum of left and right wrist activity counts per minute, for each sleep stage. Within-participant pairwise comparisons of movement load between sleep stages were conducted using Wilcoxon signed-rank tests, with Holm p value correction for multiple comparisons.
Because motor activity-event prevalence differed across participants and modalities, we prioritized agreement metrics that are more robust to marginal imbalance. Cross-modality agreement was quantified using Cohen’s κ (chance-corrected agreement) [
23]. We also calculated directional conditional probabilities (e.g., P(Actigraphy = 1∣EMG = 1) and P(Video = 1|Actigraphy = 1)) between EMG, actigraphy, and video pairs to characterize their co-detection patterns.
Agreement and conditional probability analyses were performed at the level of 3 s mini-epochs. Multi-epoch motor events spanning consecutive epochs were treated as independent active epochs for these analyses. Left and right wrist detections were combined such that an epoch was classified as active if activity was detected in either wrist. Group-level 95% confidence intervals for Cohen’s κ and conditional probabilities were estimated across participants using nonparametric bootstrap resampling.
In addition, modality-detected motor activity rates and ratios were compared to quantify the proportion of activity captured by each modality. All metrics were computed per participant and then summarized across participants.
Finally, to assess whether actigraphy movement magnitude varied with multimodal consensus, we compared accelerometer-derived activity counts across mutually exclusive multimodal agreement categories (actigraphy alone; actigraphy and video; EMG and actigraphy; and all three modalities). Specifically, REM mini-epochs were stratified by overlapping detection condition, and for each category we computed the mean actigraphy activity count across epochs. These values were then compared across conditions to assess how actigraphy-detected movement intensity changed as a function of modality co-detection.
3. Results
3.1. Motor Activity During REM vs. NREM
Bilateral wrist actigraphy activity counts demonstrated sleep–wake state-dependent variability in participants with RBD (
Figure 2,
Figures S1 and S2). Activity was highest in mean amplitude (0.233 ± 0.113) and frequency (14.0 ± 2.77) during wake and was generally lower and sparser across NREM sleep (amplitude 0.027 ± 0.013; frequency 2.88 ± 1.40). REM epochs exhibited intermittent clusters of elevated activity relative to NREM.
In control participants, actigraphy movement load was low across all sleep stages, with median values of 0.94 [IQR 1.97] in N1, 1.13 [IQR 0.56] in N2, 0.81 [IQR 1.05] in N3, and 0.90 [IQR 1.04] in REM sleep (see
Figure 3 and
Figure S5). No significant differences were observed between the NREM and REM stages. In contrast, in RBD participants, significant increase was observed in REM relative to NREM sleep. The movement load was highest during REM sleep (median 2.91 [IQR 2.20]), followed by N1 (median 1.72 [IQR 2.99]), N2 (median 0.80 [IQR 0.97]), and N3 sleep (median 0.42 [IQR 0.51]) with statistically significant differences in REM sleep compared to N2 (adjusted
p = 0.003) and N3 sleep (adjusted
p = 0.019). Movement load during REM sleep was higher in RBD participants than in controls (Mann–Whitney U test, uncorrected
p = 0.046), with a large effect size (Cliff’s δ = 0.49). However, this effect did not remain significant after false discovery rate correction across sleep stages (adjusted
p = 0.185).
3.2. Comparison of Motor Activity Detected by EMG, Actigraphy and Video
Across 12,941 3 s REM mini-epochs recorded across participants, 2797 contained motor events detected on the left, right, or both sides—captured by one, two, or all modalities. Of these, EMG, wrist actigraphy, and video detected 1703, 1613, and 811 motor events, respectively (
Figure 4). Of these motor events, 413 events were detected simultaneously by all three modalities. A total of 174 events were detected by both EMG and actigraphy, 187 events were detected by EMG and video, and 143 events were detected by actigraphy and video. Modality-specific detections were also observed, with 929 events detected by EMG alone, 883 by actigraphy alone, and 68 by video alone.
Agreement between modalities during REM sleep was assessed using Cohen’s κ (
Table 3;
Figure S3). Overall, pairwise agreement was moderate, with mean κ values increasing from EMG–actigraphy (κ = 0.27, 95% CI [0.21, 0.33]) to actigraphy–video (κ = 0.41, 95% CI [0.34, 0.49]) and EMG–video (κ = 0.45, 95% CI [0.36, 0.54]). Consistent with these results, odds ratio analyses (log scale) showed the strongest association between EMG–video (pooled OR ≈ 49, 95% CI [29.8, 80.3]), followed by actigraphy–video (pooled OR ≈ 33, 95% CI [16.6, 66.9]) (
Figure S4). EMG–actigraphy exhibited the weakest, but still robust and positive association (pooled OR ≈ 10.6, 95% CI [6.26, 17.9]).
We quantified relationships between modalities using directional conditional probabilities to characterize cross-modal concordance (
Figure 5,
Tables S3–S5). Across participants, actigraphy and video showed asymmetric patterns of agreement with EMG. The probability that actigraphy detected motor activity given concurrent EMG activity (P(Acti = 1|EMG = 1)) was modest (mean 0.49, 95% CI [0.38, 0.60]), indicating that actigraphy captured approximately half of EMG-associated motor events. Similarly, the probability that video detected activity given EMG activity (P(Video = 1|EMG = 1)) was slightly lower (0.46, 95% CI [0.34, 0.58]).
In contrast, conditional probabilities in the reverse direction were substantially lower for actigraphy than for video. The probability that EMG activity was present when actigraphy detected movement (P(EMG = 1|Acti = 1)) was 0.37, 95% CI [0.24, 0.50], whereas video detections were more frequently accompanied by EMG activity (P(EMG = 1|Video = 1) = 0.67, 95% CI [0.52, 0.80]).
The probability that actigraphy detected motor activity when video detected activity was relatively high (P(Acti = 1|Video = 1) = 0.73, 95% CI [0.66, 0.81]), indicating that most visually apparent movements were also captured by actigraphy. In contrast, the probability that video detected activity when actigraphy detected movement was substantially lower (P(Video = 1|Acti = 1) = 0.35, 95% CI [0.27, 0.43]), suggesting that a large proportion of actigraphy-detected movements were not visually apparent on infrared video.
Across all modality pairs, probabilities of concordant absence (e.g., P(Acti = 0|EMG = 0), P(Video = 0|Acti = 0)) were uniformly high (above 0.90).
3.3. REM Motor Activity Ratio and Rate Measured Across Modalities
To compare the overall burden of REM-related motor activity detected by each modality, we measured an “activity ratio” and an “activity rate” (
Table 4). Activity ratio is defined as the percentage of REM sleep mini-epochs that contain any detected arm activity. Activity rate is defined as the number of distinct motor events per hour of REM sleep. A motor event is counted once regardless of its duration: events may be brief (lasting <3 s and confined to a single mini-epoch) or prolonged (extending across multiple consecutive 3 s mini-epochs) but are considered a single event from onset to offset.
Across participants, actigraphy detected the highest overall motor activity burden by both metrics, with a mean activity ratio of 13.7 ± 6.6% and a rate of 70.0 ± 31.7 per hour. EMG detected activity during a similar proportion of REM epochs (12.0 ± 10.3%) with a comparable rate (66.8 ± 47.1 events per hour), whereas video detected activity during a smaller proportion of epochs (6.6 ± 4.4%) and at substantially lower rates (38.8 ± 22.4 events per hour). Activity rates showed high inter-individual variability.
3.4. Actigraphy Activity Counts Across Overlapping Detection Conditions
Actigraphy activity counts were highest for epochs in which motor activity was detected by all three modalities (median 0.508 [IQR 0.402]) (
Figure 6 and
Figure S6). Intermediate activity levels were observed for epochs detected by actigraphy–video pairs (median 0.197 [IQR 0.164]) and EMG–actigraphy pairs (median 0.0740 [IQR 0.141]). The lowest activity counts were observed for actigraphy-only epochs (median = 0.0958 [IQR 0.0511]).
Wilcoxon signed-rank test with Holm correction for multiple comparisons demonstrated that actigraphy activity counts for all three detections condition were significantly greater than those for actigraphy–video, EMG–actigraphy, and actigraphy-only epochs, while differences between EMG–actigraphy and actigraphy–video conditions were not statistically significant. Together, these findings indicate that higher-amplitude actigraphy signals are more likely to correspond to motor events detected by multiple modalities, whereas lower-amplitude actigraphy signals are more frequently detected in isolation.
4. Discussion
In this study, we directly compared EMG, wrist actigraphy, and infrared video within synchronized REM sleep epochs to characterize how each modality captures motor activity in patients with RBD. Actigraphy-derived movement load was significantly higher during REM sleep compared to NREM stages, a pattern that was not observed in control participants. This REM-specific elevation aligns with the loss of REM atonia characteristic of RBD and demonstrates that actigraphy can capture disease-relevant motor activity during REM sleep.
Overall, agreement between video and the other two modalities—EMG (κ = 0.45 ± 0.15) and actigraphy (κ = 0.41 ± 0.12)—was highest, consistent with the notion that movements visible on video tend to be larger motor events that are more likely to involve both muscle activation and limb displacement. Conversely, disagreement between video and the other modalities may reflect limitations of visual detection, as some movements were obscured by bedding or body position, or occurred with minimal visible displacement despite underlying muscle activation or acceleration.
Our findings show moderate agreement between actigraphy and EMG-defined motor activity, with actigraphy detecting approximately half of the epochs containing EMG activity (P(Acti = 1|EMG = 1) = 0.49). Probabilities of concordant absence were uniformly high (above 0.90) across all modalities, reflecting strong agreement during periods without detectable motor activity, likely driven by the predominance of inactive REM epochs. Actigraphy also detected motor activity in a substantial proportion of epochs without concurrent EMG activity (63% of actigraphy-detected activity), likely reflecting sensitivity to movement signals that may not be captured by the recorded EMG channels. In addition, actigraphy yielded the highest overall motor activity burden, as reflected by greater activity ratios (13.7 ± 6.6%) and event rates (70.0 ± 31.7 events per hour) compared with EMG and video. Together, these findings suggest that actigraphy captures a broader range of movement-related signals, including both multimodally concordant events and movements detected in isolation.
Modality-specific detections were observed across all three modalities and likely reflect differences in the physiological signals captured by each method. EMG-only events, which accounted for a substantial proportion of detections (929 epochs), may represent tonic flexor digitorum superficialis activation that do not result in sufficient wrist acceleration or visible displacement. In contrast, actigraphy-only detections may reflect movements involving proximal or axial musculature, or whole-body motion, that are not captured by forearm EMG recordings. Video-only events, although less frequent, may represent visually apparent movements involving muscle groups not captured by the recorded EMG channels or movements with low-amplitude or sustained acceleration that do not exceed actigraphy detection thresholds. These findings highlight that each modality captures complementary aspects of motor activity.
Further analysis showed that actigraphy activity counts were the highest (median 0.508 [IQR 0.402]) for epochs in which motor activity was detected concurrently by all three modalities and the lowest (median 0.0958 [IQR 0.0511]) for actigraphy-only epochs, indicating that activity with higher activity counts is more likely to reflect larger, overt, multimodally detectable movements. In contrast, lower activity counts were frequently detected in isolation, suggesting that actigraphy signal magnitude may serve as a useful proxy for movement salience. These findings have practical implications for actigraphy-based RBD detection algorithms, including the potential utility of incorporating amplitude-based thresholds to improve specificity for clinically meaningful motor events.
Several limitations should be considered when interpreting these results. First, the sample (Mount Sinai cohort) size was small and included only male participants, which may limit generalizability. Second, the control group wore a different type of device, which may introduce calibration or sensitivity differences that complicate comparisons with the RBD participants. Additionally, differences in participant populations and study protocols across institutions may introduce further variability. Third, motor activity was scored using binary criteria within 3 s mini-epochs, which do not capture individual movements’ duration or complexity. The use of mini-epochs may also have influenced agreement estimates, as longer movements may be fragmented across adjacent bins and small temporal misalignments (e.g., a 1 s offset) between modalities may lead to apparent disagreement within such short epochs. Incorporating event-level analyses, including movement duration and complexity, may provide a more detailed characterization of cross-modal agreement. vPSG scoring was performed manually by a single rater, which may introduce human error and limits assessment of inter-rater reliability. Future studies could incorporate automated analysis pipelines to reduce observer bias and improve reproducibility in vPSG scoring, as emerging work is actively exploring and validating such approaches [
7,
8,
15,
24,
25]. In addition, all visible arm movements were treated equivalently without differentiation by amplitude or kinematic features, limiting more detailed characterization of video-detected activity. Finally, analyses of EMG recordings were restricted to the flexor digitorum superficialis to enable direct one-to-one comparisons with wrist actigraphy, whereas clinical vPSG assessments of RBD typically incorporate both chin and limb EMG channels for diagnosis.
Despite these limitations, our work provides a quantitative comparison of EMG, actigraphy, and video within the same REM sleep periods, extending prior RBD diagnostic and actigraphy studies to the level of motor-event concordance across modalities. These findings should be interpreted as preliminary evidence, offering new framework for quantitatively understanding the complementary strengths of EMG, video, and actigraphy. This framework may help inform the potential role of actigraphy in scalable RBD screening or longitudinal monitoring outside of the laboratory. Further work is needed to validate these findings in larger and more diverse cohorts. Additional studies could compare EMG with actigraphy across various body placements (wrist, trunk, ankles) and 3D time-of-flight video, which is more sensitive than conventional infrared video systems [
8].