Next Article in Journal
Adaptive Local–Global Synergistic Perception Network for Hydraulic Concrete Surface Defect Detection
Next Article in Special Issue
Development of a Neural-Fuzzy-Based Variable Admittance Control Strategy for an Upper Limb Rehabilitation Exoskeleton
Previous Article in Journal
An In-Depth Review on Sensing, Heat-Transfer Dynamics, and Predictive Modeling for Aircraft Wheel and Brake Systems
Previous Article in Special Issue
A Preliminary Data-Driven Approach for Classifying Knee Instability During Subject-Specific Exercise-Based Game with Squat Motions
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Multimodal Evaluation of Mental Workload and Engagement in Upper-Limb Robot-Assisted Motor Tasks

1
Institute of Intelligent Industrial Technologies and Systems for Advanced Manufacturing, National Research Council, 20133 Milan, Italy
2
Department of Psychology, Catholic University of Milan, 20123 Milan, Italy
3
Institute of Intelligent Industrial Technologies and Systems for Advanced Manufacturing, National Research Council, 23900 Lecco, Italy
4
Rehabilia Technologies SRL, 20123 Milano, Italy
5
Industrial Engineering Department, University of Bologna, 40126 Bologna, Italy
*
Author to whom correspondence should be addressed.
Sensors 2026, 26(3), 922; https://doi.org/10.3390/s26030922
Submission received: 18 December 2025 / Revised: 26 January 2026 / Accepted: 29 January 2026 / Published: 31 January 2026

Abstract

Patient engagement and mental workload (MWL) are often overlooked when optimising robotic-assisted rehabilitation, despite their potential impact on its effectiveness. This study aims to propose a multimodal approach to assess MWL and engagement, using electrophysiological signals and questionnaires, to explore their modulation across different assistance modalities and engaging strategies. Thirty healthy subjects were enrolled and performed repetitive upper-limb movements with a robotic device under three assistance modalities (active, passive, semi-assisted) while listening to a 1 Hz auditory stimulus (metronome or music). Electroencephalography, Electrocardiogram, the NASA Task Load Index, and the Short Stress State Questionnaire were used to assess objective and perceived MWL and engagement. Engagement increased significantly in the music condition, whereas MWL showed no significant change. The passive modality was perceived as significantly less demanding and less engaging compared to active and semi-assisted conditions. Although EEG objective indicators did not vary across modalities, the ECG objective metric was modulated significantly in agreement with the subjective measures. Overall, the auditory stimulus significantly influenced engagement, and assistance levels affected both perceived mental demand and engagement. The proposed multimodal approach is sensitive to both engagement and MWL constructs, highlighting the potential for adaptive rehabilitation systems designed to maintain engagement, prevent overload or monotony, and ultimately support better functional outcomes over the long term of robotic training.

1. Introduction

Upper-limb motor rehabilitation is crucial for regaining functional independence after neurological diseases, as arm and hand impairments significantly limit the performance of everyday activities, including reaching, grasping, and object manipulation. In this context, robotic devices have emerged as pivotal tools in neurorehabilitation, enabling precise, intensive, repetitive, and adaptable therapy tailored to each patient’s needs. Evidence from several studies demonstrates that robot-assisted, task-specific, and intensive training can significantly enhance motor recovery in different neurological disorders [1,2,3]. Moreover, tailoring the level of robotic assistance to the patient’s residual physical capacity can help improve rehabilitation outcomes [4,5]. Such adaptive assistance strategies can also be integrated into bilateral training approaches, which have demonstrated specific neurophysiological benefits for motor recovery [6,7].
Typically, studies evaluating the effects of motor rehabilitation focus primarily on motor performance, often overlooking the broader cognitive, psychological, and mental states of the patient. Indeed, the most common studied measures of motor upper limb recovery (e.g., the Fugl–Meyer Assessment, the Functional Independence Measure, the Action Research Arm Test, the Box and Block Test, the modified Ashworth Scale and the Wolf Motor Function Test) assess different aspects of motor recovery, such as movement quality, dexterity, strength and task-related ability [8,9]. In this context, electrophysiological measures such as the electroencephalography (EEG) have also been used to evaluate [10] and predict rehabilitation outcomes, although still mainly in relation to motor recovery [11,12].
However, beyond a patient’s residual physical capacity to perform motor tasks, other factors can affect performance and clinical outcomes. Indeed, mental workload (MWL) and engagement can play a key role in facilitating neuroplastic mechanisms and promoting functional improvement [13,14]. MWL can be defined as “the degree of activation of a finite pool of resources, limited in capacity, while cognitively processing a primary task over time, mediated by external stochastic environmental and situational factors, as well as affected by definite internal characteristics of a human operator, for coping with static task demands, by devoted effort and attention [15]. In this specific context, MWL can be viewed as the user demand required by the assistive device and can have an impact on the patient’s experience [16]. For example, when the task demand during therapy is set too high, patients may experience stress or frustration, which reduces training effectiveness. Conversely, when the challenge is too low or the activity becomes overly repetitive, boredom and disengagement can occur [17,18,19]. Both cases lead to poorer performance and reduced effectiveness of the rehabilitation process. For this reason, monitoring MWL is crucial to appropriately balance task demands, ensuring that rehabilitation activities remain neither overly demanding nor insufficiently stimulating, to enhance rehabilitation success.
The optimal balance between task difficulty and the patient’s skill level leads to the ’Flow State’, characterized by maximum engagement and enjoyment [20]. This high level of engagement is a critical factor, as it increases patients’ motivation to participate in the therapeutic activity for its own sake, thereby enhancing their adherence to therapy. This leads patients to train more frequently and consistently, which in turn further enhances the likelihood of functional recovery [21]. In this direction, numerous studies have explored strategies to enhance patient engagement. These include the use of exergames, which combine visual stimuli to provide patients with interactive exercises [22,23], as well as auditory stimuli (e.g., rhythmic auditory stimuli and music). For instance, the use of rhythmic auditory stimuli to encourage movement and long-term functional improvements in patients with motor impairments has shown promising results in motor rehabilitation outcomes [24,25]. In the past years, an increasing number of clinical studies provided valuable insights into the benefits of music therapy, showing that musical rhythms can serve as a continuous time reference to patients with movement-related disorders [24], and can act as an engaging factor during rehabilitation sessions [26,27].
Since workload, motivation, and emotional factors play a crucial role in determining rehabilitation outcomes, their systematic assessment during rehabilitation sessions may be essential to personalize and optimize the effectiveness of the entire rehabilitative process. Moreover, a joint evaluation of engagement and mental workload is important for better characterizing how patients interact with assistive robots across different tasks and motivational conditions.
In this context, monitoring of physiological signals has emerged as a promising approach for objectively assessing the patient’s psychological and cognitive state during rehabilitation. Electrophysiological biomarkers can reflect underlying emotional and cognitive processes, eliminating the need for active patient cooperation [28]. Frequently used physiological measurements include electrocardiography (ECG), electrodermal activity (EDA), EEG, respiration and skin temperature, which together offer insights into autonomic regulation, engagement, and MWL. Consistent findings indicate that increased task demand or mental effort is associated with elevated heart rate and decreased average value of Normal-to-Normal Intervals (AVNN) [29,30], lower values of heart rate variability (HRV) parameters (e.g., Root Mean Square of Successive Differences (RMSSD) and Standard Deviation of Normal-to-Normal intervals (SDNN)) [31,32], higher skin conductance [32,33,34], higher values of the EEG-MWL index (MWLI) [35,36,37], faster and less variable respiration [31,38], and lower skin temperature [39,40].
Regarding the objective assessment of engagement, several studies have computed an EEG-based engagement index (EI) across various contexts and applications. For example, in video game play, there is evidence that EI can discriminate users’ engagement across different tasks or scenarios [41]. Additionally, autonomic-related signals, such as EDA and HRV, exhibited synchronized responses to specific stimuli, corresponding to higher levels of attentional engagement [42].
In the field of robot-assisted motor rehabilitation, recent studies have integrated some of these psychophysiological measures to evaluate user’s state during rehabilitation sessions. However, while engagement has received increasing attention [43,44], the assessment of MWL specifically during robotic rehabilitation remains largely unexplored.
Nevertheless, subjective variables such as perceived mental effort and engagement represent crucial aspects of the user experience that cannot be fully captured through physiological indicators alone. These dimensions play a key role in shaping how patients interact with rehabilitation tasks and, ultimately, in influencing recovery outcomes. To assess these subjective components, self-report instruments such as the NASA Task Load Index (NASA-TLX) [45] and the Short Stress State Questionnaire (SSSQ) [46] are commonly employed. These tools provide insight into how people consciously perceive the difficulty, effort, and emotional quality of an experience, including rehabilitation tasks [47]. However, self-reports are inherently influenced by subjective interpretation, awareness, and communication ability, which can be compromised in clinical populations.
For this reason, by integrating electrophysiological measurements with self-report questionnaires, recent studies have provided a more holistic representation of the patient’s experience during rehabilitation [28,43,48,49]. This complementary approach helps identify how physiological responses correspond to perceived workload and emotional states, paving the way for user-centered adaptive rehabilitation protocols that can dynamically tailor assistance based on the patient’s psychophysiological condition and feedback [50].
Within this context and to the best of our knowledge, no previous studies have examined how MWL and engagement vary across different levels of robotic assistance and sensory stimulation modalities (e.g., music during exercise) while integrating objective electrophysiological measures with validated self-reports in robot-assisted upper-limb motor rehabilitation. In fact, most existing research either focuses exclusively on assistance levels [51,52,53] or on user engagement [43,54], and others rely solely on isolated physiological signals [50,55], without jointly investigating how all these dimensions interact during motor training.
This work addresses this gap by proposing a multimodal framework to evaluate MWL and engagement modulation across different levels of robot-assisted movement and auditory-stimulation modalities, complementing objective measures (based on EEG and ECG signals) with subjective assessments.

2. Materials and Methods

2.1. Participants

In total, 30 healthy individuals were recruited (21 males, 9 females; see Table 1 for more details). Before the experimental session began, all volunteers were briefed on the protocol and signed a written informed consent form. The study was approved by the Ethical Committee of the National Research Council of Italy (Approval No. 0083017, 19 March 2025) and was conducted in accordance with the Declaration of Helsinki, ensuring adherence to ethical standards for research involving human subjects.

2.2. Set-Up

The instrumentation used for the study included an upper-limb robotic device and an EEG-ECG acquisition system. These are described in more detail in the following paragraphs.

2.2.1. Robotic Device

The robotic device used in this study is “PhiCube”, a patented compact and modular robotic device designed to combine motor training with cognitive, visual, and attentional activities [56,57]. It consists of a robotic central unit, a table-mounting mechanism, and a set of handles with different shapes and functions (Figure 1). The central unit is equipped with two motorized and sensorized interfaces and an orientation mechanism that allows the alignment of the movement axes along the main body planes (sagittal, frontal, vertical). Thanks to this modular solution, the device can support a wide range of upper-limb movements, including bilateral tasks. Three assistance modalities that differ in the level of participant motor involvement are allowed: (i) active; (ii) passive, in which the user does not voluntarily execute any movement; and (iii) semi-assisted mode, in which the movement of one limb is generated based on the contralateral one. The device is connected to a monitor displaying a dedicated graphical user interface (GUI). The GUI can be used to set the desired level of motor assistance/resistance or to activate the semi-assisted mode. Additionally, the application offers a series of training activities designed to contextualize the selected exercise (e.g., exergames).
For this specific study, PhiCube’s central unit was aligned with the participant’s sagittal plane and two independent lever accessories were installed onto the plug-and-play interfaces to allow counterphase bilateral movements of wrist, elbow, and shoulder (Figure 2).

2.2.2. Electrophysiological Signal Acquisitions

EEG was collected using a 64-channel eego™ mylab system (ANT Neuro, Hengelo, The Netherlands), equipped with a 24-bit amplifier, a standard EEG cap (Waveguard™ original, ANT Neuro, Hengelo, The Netherlands) with 64 active Ag/AgCl electrodes (Figure 2) positioned according to the international 10–10 system and the eego software (Version: 1.10.2.45890) to manage the acquisition. First, the nasion–inion distance was measured to identify the central point of the scalp. Then, the EEG cap was positioned with the Cz electrode placed at the midpoint of the nasion–inion distance. Finally, the conductive gel was applied to reduce the electrode-skin impedance below 30 kΩ throughout the recording session. The EEG signal was recorded at a sampling rate of 500 Hz across 62 channels (mastoid electrodes were excluded) and the online reference was set at the centroparietal midline electrode (CPz). The electrocardiographic (ECG) signal was acquired from the bipolar sensor input of the EEG amplifier. Two surface electrodes (30 × 24 mm, Cardinal Health™ Kendall™, Waukegan, IL, USA) were placed on the participant’s torso to capture a single bipolar lead. Specifically, the negative electrode was positioned at the center of the right clavicle, whereas the positive electrode was located on the lowest left rib, along the midaxillary line. Both signals were recorded as differential voltage, with EEG amplitudes in the µV range and ECG amplitudes in the mV range. Despite originating from ionic currents in tissue, the amplifiers’ high input impedance results in negligible current flow, and no current waveform was considered. The time needed to set the EEG and ECG sensors ranged from 15 to 45 min, depending on how easily the electrode–skin impedance could be brought below the desired value.

2.3. Protocol

The experimental protocol consisted of five phases. Firstly, a 5 min resting phase (Rest) was performed to acquire baseline data, followed by four different movement phases (Task). During the Rest, the subject had to stare at a white cross on a black background in the monitor in front of them, holding the handles of PhiCube.
The Task consisted of moving the two handles in counterphase (one arm ahead, the other backwards) repetitively following an auditory stimulus at 1 Hz. The experiment involved three robotic assistance modes cited previously (active, passive, and semi-assisted) and two types of auditory stimuli (metronome and music), which were combined in four distinct task conditions: Active with Metronome (AME), Active with Music (AMU), Passive with Metronome (PME), and Semi-assisted with Metronome (SME). In particular, the three assistance modalities differed in the level of participant motor contribution: (i) in AME/AMU the volunteer was required to autonomously perform muscular contraction in order to move both handles; (ii) in PME the participant was instructed to grasp the handles, relax both upper limbs and let PhiCube mobilize them without anticipating or resisting the movement; (iii) in SME the participant was asked to perform the movement with the dominant limb while keeping the other one passive. In SME, PhiCube performed the movement on the non-dominant side autonomously as a counterphase mirror of the dominant one. The auditory stimuli consisted of (i) a metronome sound at 1 Hz frequency (in AME, PME and SME tasks); (ii) a music track matched to the same frequency as the metronome (in AMU task).
The four modalities (i.e., AME, AMU, SME and PME) were presented in a random order across subjects, but with AME and AMU always executed randomized in close proximity to each other. This choice is motivated by the fact that switching between very different modalities (e.g., active–passive–active) can require cognitive adjustments that affect performance and might induce confusion and potential execution errors. Therefore, keeping the two versions of the active assistance together would have reduced such transition effects.
Each Task involved six rest runs (each lasting 20 s) alternating with six movement runs (each lasting 40 s) (Figure 3). A countdown of three seconds appeared on the screen before the next run began, notifying the subject of the change. During the experimental activities, the PhiCube application (i.e., the screen in front of the participant) only showed a black background during movement runs and a black background with a white cross in the center of the screen during rest runs. At the end of each Task, the participant was asked to complete the questionnaires on perceived MWL and engagement (more details of the questionnaires can be found in Section 2.5). This feedback session lasted approximately 3 min (Figure 3).
All experimental sessions were conducted in the same controlled laboratory environment, designed to minimize noise sources that could disrupt the participants’ performance. Moreover, the subject was asked to keep the head as still as possible and not to talk during Rest or Tasks to limit noise in the EEG signal.

2.4. Objective Assessment of Mental Workload and Engagement

The EEG analysis was performed using MATLAB (R2024a, The MathWorks) and EEGLAB toolbox [58], whereas the ECG signal was mainly analyzed using the Physionet toolbox [59,60]. Figure 4a,b schematically explains the steps performed for EEG.

2.4.1. EEG

The EEG signal was pre-processed as previously reported in [61]. First, the data were filtered using a Hamming-windowed sinc Finite Impulse Response band-pass filter (1–45 Hz). Secondly, artifact channels were rejected using the joint probability of the average log-power across recorded electrodes, as proposed in [62]: channels with probabilities exceeding three standard deviations from the mean were removed. After this, the Artifact Subspace Reconstruction algorithm [63] was applied to interpolate segments containing artifacts, defined as bursts with variances exceeding fifteen standard deviations. Following this, Independent Component Analysis was performed using the extended Infomax algorithm [64] to identify and remove artifacts, such as eye movements and electrocardiographic signals, which typically overlap with neural activity in EEG data. Then, ICLabel [65] was used to automatically classify components, and those with a probability below 40% of being brain sources were discarded. Finally, the bad channels were removed and interpolated using a spherical function based on neighbouring artifact-free channels, and the EEG signals were re-referenced to the average of all channels (Figure 5a).
Afterwards, the MWLI and the Engagement Index (EI) were computed to address MWL and engagement, respectively. Firstly, the Task sessions were discarded, with the six 20 s rest periods, to keep only the movement part. On each movement run, the Welch method was applied to estimate the power spectral density between 0 and 45 Hz [66]. The EEG signal was divided into 50% overlapping segments of 1 s length, and each segment was multiplied by a Hamming window. Then, the discrete Fourier transform was applied to each windowed segment, and the squared magnitude of the result was computed to obtain the periodogram. Periodograms were subsequently averaged across all segments within each movement run to derive the power spectral density corresponding to each experimental block. Next, the absolute band power for each channel was calculated by integrating the power spectrum within theta (4–8 Hz), alpha (9–12 Hz) and beta (13–30 Hz) frequency bands. Finally, the MWLI was computed, for each experimental block, as the ratio between the theta absolute power at the frontal midline electrode (θFz) and the alpha absolute power at the parietal midline electrode (αPz) [61,67]. This measure, which reflects changes in the EEG signal power spectral density in the theta and alpha frequency bands, is therefore used to assess the cognitive effort. To compute the EI, the sum of the absolute power of each frequency band at the central midline, parietal midline, left and right electrodes (Cz, Pz, P3 and P4) was calculated. Subsequently, the EI was computed as the ratio between the beta power ( β C z , P z , P 3 , P 4 ) and the sum of alpha power ( α C z , P z , P 3 , P 4 ) and theta power ( θ C z , P z , P 3 , P 4 ) [68,69]. This formulation of the EI enables the assessment of an individual’s engagement while performing a specific activity.

2.4.2. ECG

The ECG signal was also pre-processed prior to performing time-domain HR analysis. Firstly, a bandpass (0.5–40 Hz) Hamming-windowed sinc FIR filter was applied. From the clean signal, only Rest and movement blocks for each Task were kept. On these signals, R peaks were extracted using Physionet functions (Figure 5b). Then, RR interval data were processed in preparation for further analysis: noise and non-normal beats were removed and replaced with the cubic spline interpolation method, obtaining the Normal-to-Normal interval signal (NN). This processed time-series was then used to compute the AVNN, representing the mean duration of normal heartbeats, which is negatively related to MWL [29,30]. Each 40 s task window was selected to compute the AVNN, and the resulting values were averaged on each experimental session to have a single value for the Rest and for each Task. It was not possible to compute any further ECG-based parameters or perform an HRV analysis, as the movement runs in the protocol were insufficiently long.

2.5. Subjective Assessment of Mental Workload and Engagement

At the end of each Task, participants were asked to complete the NASA-TLX [45] and the Engagement subscale of the SSSQ [46] questionnaires.

2.5.1. NASA-Task Load Index Scale

The NASA-TLX scale is a widely used subjective assessment tool designed to evaluate the perceived cognitive workload during or after completing a specific task [45]. It consists of six dimensions: Mental Demand, Physical Demand, Temporal Demand, Performance, Effort, and Frustration. Participants are asked to rate each dimension using a 21-point continuous scale ranging from 0 to 20. An overall subjective score (hereafter, NASA-TLX Overall) is obtained by summing the single scores of the six dimensions.

2.5.2. Short Stress State Questionnaire

The Short Stress State Questionnaire [46] is a self-assessment instrument developed to rapidly measure subjective stress levels associated with performing cognitively or emotionally demanding tasks. The questionnaire consists of 24 items that assess three key dimensions of momentary psychological state: Engagement, Distress, and Worry. In this study, we selected the Engagement scale, which reflects the individual’s degree of positive involvement, energy, and motivation during the activity. Engagement subscale measures dimensions such as concentration, readiness for action, sense of control, and confidence in self-abilities. High scores on the Engagement scale indicate an optimal state of activation and participation, associated with adequate levels of mental energy and involvement; conversely, lower scores suggest fatigue, disengagement, or a decline in concentration. The instrument uses a 5-point Likert scale, and scores for each subscale are obtained by averaging the responses to the corresponding items.

2.6. Data Analysis

All statistical analyses were done using R (R 4.4.1, R Foundation for Statistical Computing, Vienna, Austria) within the RStudio environment (Version: 2024.12.0+467).
To allow comparison between objective and subjective indexes and to account for inter-subject variability, all objective indexes of each Task were subject-wise normalized by dividing the Task value for its corresponding Rest value (Task/Rest).
To verify normality, the Shapiro–Wilk test was applied to each index for each Task independently. If the p-value was smaller than 0.05, the null hypothesis was rejected, meaning that the data were not normally distributed. On the contrary, if the p-value was greater than 0.05, then the null hypothesis could not be rejected and graphical solutions were considered to determine whether the data were Gaussian or not. In particular, both the histogram and the Quantile–Quantile Plot were analyzed.
To separately compare the different levels of robot assistance and the effect of the different acoustic stimuli on the subject, data were grouped as follows: AME, PME, and SME (Assistance Levels), and AME and AMU (Auditory Stimuli). As all tested indices showed non-Gaussian distributions for each Task and the data were dependent due to repeated measures, the following analyses were performed. For the Assistance Levels, the Friedman rank-sum test was first performed for each objective and subjective index. If any statistically significant difference was found (p < 0.05), post hoc Nemenyi’s All-Pairs Comparisons Test was performed. Instead, for the Auditory Stimuli, only the Wilcoxon pairwise test was applied as there were just two conditions to compare.
Additionally, to quantify the relationship between physiological and subjective variables, we performed repeated measures correlation analysis [70] for the Assistance Levels and for the Auditory Stimuli, separately.

3. Results

3.1. Objective Measures

Regarding the Assistance Levels, the Friedman test showed a significant difference for the AVNN index. Post hoc pairwise comparison highlighted a significant increase from both AME to PME (p < 0.001) and from SME to PME (p < 0.01) (see Figure 6 and Table 2). The MWLI and the EI did not show any significant difference between modalities. Regarding the Auditory Stimuli, the EI highlighted a significant increase in the Task with music compared to the metronome sound (p < 0.001, see Figure 6 and Table 2). Neither the MWLI nor the AVNN revealed any significant difference across the two auditory modalities.

3.2. Subjective Measures

In the Assistance Levels, both the NASA Task Load Index (NASA-TLX) Overall and the Engagement indices revealed a significant difference in the Friedman test; according to the post hoc comparison, a significant decrease in the passive modality with respect to the active and the semi-assisted modalities was found in both measures (p < 0.05, see Figure 7 and Table 3).
Conversely, in the Auditory Stimuli, the Engagement demonstrated a significant difference (p < 0.05) between modalities (Table 3) with higher values in the music Task (Figure 7). However, NASA-TLX Overall scores were lower in the music Task than in the metronome Task (Figure 7), but this difference was not statistically significant (Table 3).

3.3. Objective and Subjective Measures Correlations

The analysis showed a significant moderate correlation between the AVNN and the NASA-TLX Overall (p = 0.010, r = −0.46) in the Auditory Stimuli (Table 4) and an almost-significant weak correlation (p = 0.062, r = −0.24) in the Assistance Levels (Table 5). All other comparisons resulted in negligible correlations, very far from achieving statistical significance, as shown in Table 4 and Table 5.

4. Discussion

In this study, we proposed a multidomain analysis combining objective and subjective measures to evaluate the modulation of MWL and engagement during upper-limb robotic-assisted movements. In particular, EEG and ECG signals were complemented with NASA-TLX and SSSQ questionnaires to comprehensively assess MWL and engagement changes at three levels of robotic assistance (active, passive and semi-assisted) and two auditory stimuli (metronome and music track).
Regarding the effects of auditory stimulation, our results showed that music-based stimuli significantly enhanced participant engagement, as indicated by both objective and subjective measures. The significant increase in EI and subjective engagement during the music task supports the idea that music enhances user involvement, even in simple, repetitive tasks, at both neurophysiological and perceptual levels. This finding is consistent with the work of Baur et al. [26], who, in a randomized study using an arm therapy robot, showed that integrating creative musical elements significantly increases intrinsic motivation, even when perceived effort or workload does not change. Similarly, Liu et al. [27] demonstrated in a pilot clinical trial that combining music therapy with robot-assisted training improves self-efficacy and positive emotion in post-stroke patients. The alignment between objective and subjective measures in our results further reinforces the idea that music should be considered as a valuable element when the goal is to enhance patient engagement during upper-limb rehabilitation.
Moreover, objective indexes did not reveal differences in MWL between music and metronome tasks; however, participants tended to perceive the music condition as slightly less demanding, as indicated by the NASA-TLX questionnaire results in Figure 7, although the decrease was not statistically significant. A plausible explanation is that more pleasant or intrinsically rewarding stimuli can support sustained involvement, thereby reducing the subjective perception of effort. This evidence suggests that motivational context can meaningfully influence how individuals experience and approach rehabilitation tasks, potentially contributing to better adherence, reduced dropout in long-term training, and ultimately enhancing the likelihood of functional recovery [21].
Although neither the NASA-TLX Overall nor the AVNN shows a significant difference between modalities, the significant negative correlation between these two measures in the Auditory Stimuli group reflects the higher perceived MWL. Accordingly, when subjects encounter situations that require higher cognitive involvement, the autonomic nervous system (ANS) modulates cardiac activity, thereby reflecting a physiological response [71].
Regarding the level of robotic assistance, we observed that subjective engagement and MWL are significantly lower in the passive modality. This result is consistent with studies showing that excessive robotic assistance can lead to “slacking” behavior, in which users reduce their effort and rely solely on the device. This can diminish effort, motivation, and perceived engagement [44,72].
Conversely, regarding the objective assessment, the EI did not reveal a statistically significant difference across modalities, although the trend was consistent with the subjective engagement. This suggests that subjective measures may capture motivational aspects that are not fully reflected in the EEG activity. This aspect, together with the non-significant and weak correlations found between objective and subjective engagement measures in Assistance Levels and Auditory Stimuli, suggests that analyzing both objective physiological signals and subjective measures provides a more complete assessment of the subject’s state, as each signal can highlight aspects that the other may miss.
Switching to the evaluation of MWL, only the AVNN was sensitive to differences in the level of assistance, revealing significantly higher values in the passive modality, indicating a decrease in ANS activity due to a higher sympathetic activation state [71]. The choice to use the AVNN was made because of time constraints as the protocol provided only 40 s of movement and HRV parameters could not be computed. Nevertheless, it was shown that this amount of time remains sufficient to assess ANS variations [73]. The discrepancy between the statistical significance among the objective MWL measures can be explained by the fact that AVNN and MWLI capture partially distinct aspects of workload. MWLI primarily reflects cortical mental effort and is therefore more closely tied to the task’s cognitive demands. In contrast, AVNN is also influenced by autonomic changes associated with motor and physical demands [74]. As a result, in the passive condition, where participants are less physically engaged, AVNN tends to increase even when cortical workload remains relatively unchanged. Therefore, its use as a pure cognitive load marker is questionable, and combining EEG- and ECG-derived markers may provide a more appropriate and comprehensive characterization of the underlying processes associated with MWL [75].
It should be noted that the selected movement (e.g., “bilateral repetitive flexo/extension of the elbow”) was chosen, among the various configurations for using the PhiCube device, as it allows bilateral action of both limbs, involving multiple joints (shoulder flexion–extension, elbow flexion–extension, wrist adduction/abduction), rather than just one joint. Moreover, this movement can be safely repeated across multiple trials, is sufficiently standardized to support meaningful physiological comparisons, is easy to learn and perform, and enables seamless integration of the three selected assistive modalities.
As concerns the auditory stimulation, it was implemented using metronome cues as a controlled baseline and music as an ecologically relevant stimulus. Indeed, music can make physical activity easier, less tiring, and more sustainable, both physically and mentally [76]. We also aimed to provide a musical stimulus that participants would likely enjoy, as musical enjoyment is a crucial moderator of music’s effects on exercise [77]. For practical reasons, it was not possible to allow each participant to choose their favorite song; therefore, we opted for a well-known song, considering that musical familiarity helps reduce interindividual variability in enjoyment [77,78]. Furthermore, the literature indicates that a stable beat reduces movement temporal variability and facilitates motor entrainment [24]. For this reason, we selected a song with a constant BPM, particularly suited to continuous rhythmic tasks. Considering all the mentioned aspects, among the various possible options, we chose “Stayin’ Alive” by the Bee Gees.
The absence of significant changes in EEG-derived MWLI across conditions may primarily reflect the characteristics of the task paradigm rather than true invariance in cognitive workload. In particular, the motor task adopted in this study is simple, highly repetitive, and performed by healthy participants; once familiarized, it is likely to rely largely on automatic sensorimotor control and therefore imposes a relatively low cognitive demand. Under these conditions, the modulation of mental workload induced by different assistance levels may be too subtle to elicit measurable changes in the selected EEG-based MWLI, or may be confined to short, transient phases (e.g., initial adaptation or occasional corrections) that are attenuated when computing indices over longer epochs. However, it should be considered that patients affected by upper limb impairment may experience considerable difficulty even in performing simple movements, potentially leading to higher MWL and clearer differences between passive, assisted and active modalities.
To the best of our knowledge, no studies have specifically evaluated the MWLI associated with the level of robot assistance in the rehabilitation context. Nevertheless, this result shows that measures derived from physiological signals, which are often assumed to capture the same dimensions, can differ in sensitivity because they reflect partially different aspects. This highlights the value of acquiring and analyzing physiological indices from multiple signals, as it provides a more complete picture of the patient’s state, without overcomplicating the setup.
Overall, our findings highlight the value of considering multiple objective physiological signals and subjective scales to characterize user experience and state during robot-assisted rehabilitation. Although simplifying acquisition protocols by reducing the number of data acquired can help minimize preparation time and, consequently, the burden on the patient, our results also show that collecting multiple physiological signals and psychological measures together provides a more comprehensive assessment of the subject’s overall state during the rehabilitation session. The combined use of EEG, ECG, and validated psychometric tools proved feasible and informative, revealing complementary facets of workload and engagement under different assistance and motivational conditions.

5. Limitations and Future Work

Some limitations should be acknowledged before discussing the future development of the findings. The first is the reduced sample size, and the second is the inclusion of only healthy participants, which limits the generalization of the findings. Although this choice ensured strict experimental control, it may also explain the absence of significant differences between the AME and SME modalities in all physiological and subjective parameters and the non-significant differences in MWLI across assistance levels. In clinical populations with motor impairments, these conditions are likely to differ more markedly in terms of required effort, and such differences may be more readily captured by the analyzed measures.
The multimodal analysis explored in our work opens promising avenues for adaptive rehabilitation systems that dynamically modulate robotic assistance and motivational content based on real-time psychophysiological feedback. This approach could help maintain engagement, prevent overload and monotony, and ultimately support better adherence and functional outcomes over the long term of training. Future research could test the proposed framework with individuals with motor impairment. This would determine whether different levels of assistance affect MWL and engagement during robotic rehabilitation in people with disabilities.
Moreover, a limited number of experimental variations was implemented to reduce session complexity and to preserve data interpretability by isolating the primary factors of interest. Therefore, future studies should also evaluate additional movement families and music characteristics (genre, tempo, personalization) to determine whether the observed effects generalize beyond the controlled conditions tested here.
To further enrich the cognitive/psychological assessment, testing longer task durations could help clarify whether prolonged execution of repetitive movements leads to decreased motivation due to boredom or fatigue, thereby negatively affecting MWL and user engagement. In this context, longer periods of movement within a single session can be exploited to compute HRV parameters (e.g., Low Frequency/High Frequency (LF/HF), RMSSD, and SDNN), which require longer recording windows to be reliably estimated. These measures could offer richer insights into autonomic modulation across conditions. Hence, exploring different durations of tasks may be necessary to identify an optimal trade-off between recordings long enough to ensure reliable physiological estimates and rehabilitation sessions that remain effective, engaging and not too demanding for the patient.

6. Conclusions

This study shows that a multimodal assessment combining objective physiological measures (EEG and ECG) with subjective reports can capture distinct aspects of MWL and engagement during upper-limb robot-assisted rehabilitation. While the EEG-derived MWLI did not show differences across assistance levels, subtle changes in MWL across conditions were detected by AVNN (ECG-derived). However, this likely reflects variations in autonomic activation associated with physical engagement, rather than solely cortical MWL, highlighting the importance of acquiring different physiological signals that can evidence distinct aspects of the same concept. In addition, EEG-derived EI did not differ significantly across assistance levels. Conversely, perceived MWL and engagement were significantly lower in the passive modality compared with the active and semi-assisted conditions despite no detectable changes in the objective EEG indexes, highlighting that self-reports can reveal motivational/experiential components not captured by EEG alone. Finally, music significantly increased perceived engagement and the EEG-derived EI without significantly affecting MWL, suggesting that auditory stimulation may improve the rehabilitation experience without increasing cognitive demand.
Overall, these results support the use of complementary objective–subjective metrics to guide the personalization of robot-assisted rehabilitation protocols. Future work should apply this framework to clinical populations with motor impairments to determine whether assistance levels and music similarly modulate MWL and engagement, and whether these factors translate into improved adherence and outcomes.

Author Contributions

Conceptualization, A.M., G.R. and M.M. (Marta Mondellini); methodology, C.Z. and A.M.; software, C.Z. and M.L.N.; formal analysis, A.M., G.R. and C.Z.; investigation, A.M., G.R., M.L.N., M.M. (Marta Mondellini) and C.Z.; resources, A.M., G.R., M.M. (Matteo Malosio), M.L.N., G.T. and M.M. (Marta Mondellini); data curation, C.Z.; writing—original draft preparation, C.Z.; writing—review and editing, C.Z., A.M., G.R., M.L.N., M.M. (Matteo Malosio), M.M. (Marta Mondellini) and G.T.; visualization, C.Z.; supervision, A.M. and G.R.; project administration, A.M.; funding acquisition, M.M. (Matteo Malosio) and A.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Italian Ministry of University and Research, under the complementary actions to the NRRP ‘Fit4MedRob-Fit for Medical Robotics’ Grant (# PNC0000007).

Institutional Review Board Statement

This study was conducted in accordance with the Declaration of Helsinki and approved by the Ethical Committee of the National Research Council of Italy.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors upon request.

Acknowledgments

The authors kindly acknowledge Alessandra Tafaro and Claudia Ceresa for their continuous support and assistance.

Conflicts of Interest

Matteo Malosio (M.M.), Matteo Lavit Nicora (M.L.N.), and Giovanni Tauro (G.T.) are members of the National Research Council of Italy (CNR) and co-founders of the company Rehabilia Technologies (RT). RT is the exclusive licensee of the industrial invention patent underlying PhiCube, developed at CNR. M.M., M.L.N. and G.T. are co-inventors of the patent. RT did not fund the study and did not participate in the analysis or interpretation of the study’s results. All the other authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
MWLMental Workload
EEGElectroencephalography
ECGElectrocardiogram
NASA-TLXNASA Task Load Index
SSSQShort Stress State Questionnaire
HRVHeart Rate Variability
MWLIMental Workload Index
GUIGraphical User Interface
AMEActive with Metronome
AMUActive with Music
PMEPassive with Metronome
SMESemi-assisted with Metronome
EIEngagement Index
AVNNAverage Value of Normal-to-Normal Intervals
ANSAutonomic Nervous System
LF/HFLow Frequency/High Frequency
RMSSDRoot Mean Square of Successive Differences
SDNNStandard Deviation of Normal-to-Normal Intervals

References

  1. Kwakkel, G.; van Peppen, R.; Wagenaar, R.C.; Wood Dauphinee, S.; Richards, C.; Ashburn, A.; Miller, K.; Lincoln, N.; Partridge, C.; Wellwood, I.; et al. Effects of augmented exercise therapy time after stroke: A meta-analysis. Stroke 2004, 35, 2529–2539. [Google Scholar] [CrossRef]
  2. Flynn, N.; Froude, E.; Cooke, D.; Kuys, S. Repetitions, duration and intensity of upper limb practice following the implementation of robot assisted therapy with sub-acute stroke survivors: An observational study. Disabil. Rehabil. Assist. Technol. 2022, 17, 675–680. [Google Scholar] [CrossRef]
  3. Perez-Marcos, D.; Chevalley, O.; Schmidlin, T.; Garipelli, G.; Serino, A.; Vuadens, P.; Tadi, T.; Blanke, O.; Millán, J.d.R. Increasing upper limb training intensity in chronic stroke using embodied virtual reality: A pilot study. J. Neuroeng. Rehabil. 2017, 14, 119. [Google Scholar] [CrossRef] [PubMed]
  4. Takebayashi, T.; Takahashi, K.; Okita, Y.; Kubo, H.; Hachisuka, K.; Domen, K. Impact of the robotic-assistance level on upper extremity function in stroke patients receiving adjunct robotic rehabilitation: Sub-analysis of a randomized clinical trial. J. NeuroEng. Rehabil. 2022, 19, 25. [Google Scholar] [CrossRef] [PubMed]
  5. Krebs, H.I.; Palazzolo, J.J.; Dipietro, L.; Ferraro, M.; Krol, J.; Rannekleiv, K.; Volpe, B.T.; Hogan, N. Rehabilitation robotics: Performance-based progressive robot-assisted therapy. Auton. Robot. 2003, 15, 7–20. [Google Scholar] [CrossRef]
  6. Neva, J.L.; Legon, W.; Staines, W.R. Primary motor cortex excitability is modulated with bimanual training. Neurosci. Lett. 2012, 514, 147–151. [Google Scholar] [CrossRef]
  7. Waller, S.M.; Whitall, J. Bilateral arm training: Why and who benefits? NeuroRehabilitation 2008, 23, 29–41. [Google Scholar] [CrossRef]
  8. Mehrabi, S.; Fleet, J.L.; Safaei-Qomi, M.R.; Dukelow, S.P.; Murie-Fernandez, M.; Harnett, A.; Teasell, R. Systematic Review of Upper Extremity Outcome Measures Assessed in Randomized Controlled Trials of Poststroke Upper Extremity Rehabilitation in Low-to-Middle-Income and High-Income Countries. Arch. Phys. Med. Rehabil. 2025, 106, 618–631. [Google Scholar] [CrossRef] [PubMed]
  9. Teasell, R.; Mehrabi, S.; Saikaley, M.; George, C.; Dukelow, S.P.; Harnett, A.; Fleet, J.L. A systematic review of upper extremity outcome measures assessed in randomized controlled trials of post stroke upper extremity rehabilitation over time. Top. Stroke Rehabil. 2025, 32, 307–322. [Google Scholar] [CrossRef]
  10. Maura, R.M.; Rueda Parra, S.; Stevens, R.E.; Weeks, D.L.; Wolbrecht, E.T.; Perry, J.C. Literature review of stroke assessment for upper-extremity physical function via EEG, EMG, kinematic, and kinetic measurements and their reliability. J. NeuroEng. Rehabil. 2023, 20, 21. [Google Scholar] [CrossRef]
  11. Milani, G.; Antonioni, A.; Baroni, A.; Malerba, P.; Straudi, S. Relation between EEG measures and upper limb motor recovery in stroke patients: A scoping review. Brain Topogr. 2022, 35, 651–666. [Google Scholar] [CrossRef]
  12. Trujillo, P.; Mastropietro, A.; Scano, A.; Chiavenna, A.; Mrakic-Sposta, S.; Caimmi, M.; Molteni, F.; Rizzo, G. Quantitative EEG for predicting upper limb motor recovery in chronic stroke robot-assisted rehabilitation. IEEE Trans. Neural Syst. Rehabil. Eng. 2017, 25, 1058–1067. [Google Scholar] [CrossRef]
  13. Danzl, M.M.; Etter, N.M.; Andreatta, R.D.; Kitzman, P.H. Facilitating neurorehabilitation through principles of engagement. J. Allied Health 2012, 41, 35–41. [Google Scholar]
  14. Teo, J.L.; Zheng, Z.; Bird, S.R. Identifying the factors affecting ‘patient engagement’in exercise rehabilitation. BMC Sport. Sci. Med. Rehabil. 2022, 14, 18. [Google Scholar] [CrossRef]
  15. Longo, L.; Wickens, C.D.; Hancock, G.; Hancock, P.A. Human mental workload: A survey and a novel inclusive definition. Front. Psychol. 2022, 13, 883321. [Google Scholar] [CrossRef]
  16. Marchand, C.; De Graaf, J.B.; Jarrassé, N. Measuring mental workload in assistive wearable devices: A review. J. NeuroEng. Rehabil. 2021, 18, 160. [Google Scholar] [CrossRef]
  17. Pan, L.; Song, A.; Wang, S.; Duan, S. Experimental Study on Upper-Limb Rehabilitation Training of Stroke Patients Based on Adaptive Task Level: A Preliminary Study. BioMed Res. Int. 2019, 2019, 2742595. [Google Scholar] [CrossRef]
  18. Gomes, E.; Alder, G.; Bright, F.A.; Signal, N. Understanding task “challenge” in stroke rehabilitation: An interdisciplinary concept analysis. Disabil. Rehabil. 2025, 47, 560–570. [Google Scholar] [CrossRef]
  19. Jaquess, K.J.; Lo, L.C.; Oh, H.; Lu, C.; Ginsberg, A.; Tan, Y.Y.; Lohse, K.R.; Miller, M.W.; Hatfield, B.D.; Gentili, R.J. Changes in mental workload and motor performance throughout multiple practice sessions under various levels of task difficulty. Neuroscience 2018, 393, 305–318. [Google Scholar] [CrossRef]
  20. Csikszentmihalyi, M.; Csikzentmihaly, M. Flow: The Psychology of Optimal Experience; Harper & Row: New York, NY, USA, 1990; Volume 1990. [Google Scholar]
  21. Maclean, N.; Pound, P. A critical review of the concept of patient motivation in the literature on physical rehabilitation. Soc. Sci. Med. 2000, 50, 495–506. [Google Scholar]
  22. Vaida, C.; Rus, G.; Tucan, P.; Machado, J.; Pisla, A.; Zima, I.; Birlescu, I.; Pisla, D. Enhancing Robotic-Assisted Lower Limb Rehabilitation Using Augmented Reality and Serious Gaming. Appl. Sci. 2024, 14, 12029. [Google Scholar] [CrossRef]
  23. Hadjipanayi, C.; Banakou, D.; Michael-Grigoriou, D. Virtual reality exergames for enhancing engagement in stroke rehabilitation: A narrative review. Heliyon 2024, 10, e37581. [Google Scholar] [CrossRef]
  24. Braun Janzen, T.; Koshimori, Y.; Richard, N.M.; Thaut, M.H. Rhythm and music-based interventions in motor rehabilitation: Current evidence and future perspectives. Front. Hum. Neurosci. 2022, 15, 789467. [Google Scholar] [CrossRef]
  25. Zhao, Y.; Xu, H.; Fu, J. Integrating rhythmic auditory stimulation in intelligent rehabilitation technologies for enhanced post-stroke recovery. Front. Bioeng. Biotechnol. 2025, 13, 1649011. [Google Scholar] [CrossRef]
  26. Baur, K.; Speth, F.; Nagle, A.; Riener, R.; Klamroth-Marganska, V. Music meets robotics: A prospective randomized study on motivation during robot aided therapy. J. Neuroeng. Rehabil. 2018, 15, 79. [Google Scholar] [CrossRef]
  27. Liu, Q.; Liu, L.; Liu, Z.; Xu, Y.; Wang, F.; Cheng, H.; Hu, X. Reminiscent music therapy combined with robot-assisted rehabilitation for elderly stroke patients: A pilot study. J. NeuroEng. Rehabil. 2024, 21, 16. [Google Scholar] [CrossRef]
  28. Novak, D.; Ziherl, J.; Olenšek, A.; Milavec, M.; Podobnik, J.; Mihelj, M.; Munih, M. Psychophysiological responses to robotic rehabilitation tasks in stroke. IEEE Trans. Neural Syst. Rehabil. Eng. 2010, 18, 351–361. [Google Scholar] [CrossRef]
  29. Charles, R.L.; Nixon, J. Measuring mental workload using physiological measures: A systematic review. Appl. Ergon. 2019, 74, 221–232. [Google Scholar] [CrossRef]
  30. Mandrick, K.; Peysakhovich, V.; Rémy, F.; Lepron, E.; Causse, M. Neural and psychophysiological correlates of human performance under stress and high mental workload. Biol. Psychol. 2016, 121, 62–73. [Google Scholar] [CrossRef]
  31. Veltman, J.; Gaillard, A. Physiological workload reactions to increasing levels of task difficulty. Ergonomics 1998, 41, 656–669. [Google Scholar] [CrossRef]
  32. Collet, C.; Averty, P.; Dittmar, A. Autonomic nervous system and subjective ratings of strain in air-traffic control. Appl. Ergon. 2009, 40, 23–32. [Google Scholar] [CrossRef]
  33. Nikula, R. Psychological correlates of nonspecific skin conductance responses. Psychophysiology 1991, 28, 86–90. [Google Scholar] [CrossRef]
  34. Haarmann, A.; Boucsein, W.; Schaefer, F. Combining electrodermal responses and cardiovascular measures for probing adaptive automation during simulated flight. Appl. Ergon. 2009, 40, 1026–1040. [Google Scholar] [CrossRef] [PubMed]
  35. Mondellini, M.; Pirovano, I.; Colombo, V.; Arlati, S.; Sacco, M.; Rizzo, G.; Mastropietro, A. A multimodal approach exploiting EEG to investigate the effects of VR environment on mental workload. Int. J. Hum.-Interact. 2024, 40, 6566–6578. [Google Scholar] [CrossRef]
  36. Caiazzo, C.; Savkovic, M.; Pusica, M.; Milojevic, D.; Leva, M.C.; Djapan, M. Development of a neuroergonomic assessment for the evaluation of mental workload in an industrial human–robot interaction assembly task: A comparative case study. Machines 2023, 11, 995. [Google Scholar] [CrossRef]
  37. Ruo, A.; Villani, V.; Sabattini, L. Use of EEG signals for mental workload assessment in human-robot collaboration. In International Workshop on Human-Friendly Robotics; Springer: Cham, Switzerland, 2022; pp. 233–247. [Google Scholar]
  38. Boiten, F. Component analysis of task-related respiratory patterns. Int. J. Psychophysiol. 1993, 15, 91–104. [Google Scholar] [CrossRef]
  39. Min, B.C.; Chung, S.C.; Park, S.J.; Kim, C.J.; Sim, M.K.; Sakamoto, K. Autonomic responses of young passengers contingent to the speed and driving mode of a vehicle. Int. J. Ind. Ergon. 2002, 29, 187–198. [Google Scholar] [CrossRef]
  40. Ohsuga, M.; Shimono, F.; Genno, H. Assessment of phasic work stress using autonomic indices. Int. J. Psychophysiol. 2001, 40, 211–220. [Google Scholar] [CrossRef]
  41. McMahan, T.; Parberry, I.; Parsons, T.D. Evaluating player task engagement and arousal using electroencephalography. Procedia Manuf. 2015, 3, 2303–2310. [Google Scholar] [CrossRef]
  42. Stuldreher, I.V.; van Erp, J.B.; Brouwer, A.M. Robustness of physiological synchrony in wearable electrodermal activity and heart rate as a measure of attentional engagement to movie clips. Sensors 2023, 23, 3006. [Google Scholar] [CrossRef]
  43. Zhang, Y.; Wang, H.; Shi, B.E. User Engagement Correlates Better with Behavioral than Physiological Measures in a Virtual Reality Robotic Rehabilitation System. In Proceedings of the 2024 IEEE International Conference on Systems, Man, and Cybernetics (SMC), Kuching, Malaysia, 6–10 October 2024; IEEE: Piscataway, NJ, USA, 2024; pp. 5189–5194. [Google Scholar]
  44. Blank, A.A.; French, J.A.; Pehlivan, A.U.; O’Malley, M.K. Current trends in robot-assisted upper-limb stroke rehabilitation: Promoting patient engagement in therapy. Curr. Phys. Med. Rehabil. Rep. 2014, 2, 184–195. [Google Scholar] [CrossRef] [PubMed]
  45. Hart, S.G.; Staveland, L.E. Development of NASA-TLX (Task Load Index): Results of empirical and theoretical research. In Advances in Psychology; Elsevier: Amsterdam, The Netherlands, 1988; Volume 52, pp. 139–183. [Google Scholar]
  46. Helton, W.S. Validation of a short stress state questionnaire. In Proceedings of the Human Factors and Ergonomics Society Annual Meeting, New Orleans, LA, USA, 20–24 September 2004; Sage Publications: Los Angeles, CA, USA, 2004; Volume 48, pp. 1238–1242. [Google Scholar]
  47. La Bara, L.M.A.; Meloni, L.; Giusino, D.; Pietrantoni, L. Assessment methods of usability and cognitive workload of rehabilitative exoskeletons: A systematic review. Appl. Sci. 2021, 11, 7146. [Google Scholar] [CrossRef]
  48. Paskett, M.D.; Garcia, J.K.; Jones, S.T.; Brinton, M.R.; Davis, T.S.; Duncan, C.C.; Cooper, J.M.; Strayer, D.L.; Clark, G.A. Improving upper-limb prosthesis usability: Cognitive workload measures quantify task difficulty. medRxiv 2022. [Google Scholar] [CrossRef]
  49. Badesa, F.J.; Diez, J.A.; Catalan, J.M.; Trigili, E.; Cordella, F.; Nann, M.; Crea, S.; Soekadar, S.R.; Zollo, L.; Vitiello, N.; et al. Physiological responses during hybrid BNCI control of an upper-limb exoskeleton. Sensors 2019, 19, 4931. [Google Scholar] [CrossRef]
  50. Novak, D.; Mihelj, M.; Ziherl, J.; Olensek, A.; Munih, M. Psychophysiological measurements in a biocooperative feedback loop for upper extremity rehabilitation. IEEE Trans. Neural Syst. Rehabil. Eng. 2011, 19, 400–410. [Google Scholar] [CrossRef]
  51. Pila, O.; Koeppel, T.; Grosmaire, A.G.; Duret, C. Comparison of active-assisted and active-unassisted robot-mediated upper limb therapy in subacute stroke. Restor. Neurol. Neurosci. 2021, 39, 1–7. [Google Scholar] [CrossRef]
  52. Park, J.H.; Park, G.; Kim, H.Y.; Lee, J.Y.; Ham, Y.; Hwang, D.; Kwon, S.; Shin, J.H. A comparison of the effects and usability of two exoskeletal robots with and without robotic actuation for upper extremity rehabilitation among patients with stroke: A single-blinded randomised controlled pilot study. J. Neuroeng. Rehabil. 2020, 17, 137. [Google Scholar] [CrossRef] [PubMed]
  53. Gonçalves, A.; Silva, M.F.; Mendonça, H.; Rocha, C.D. A Review of Robotic Interfaces for Post-Stroke Upper-Limb Rehabilitation: Assistance Types, Actuation Methods, and Control Mechanisms. Robotics 2025, 14, 141. [Google Scholar] [CrossRef]
  54. Oliver, S.; Kitago, T.; Buchwald, A.; Atashzar, S.F. EEG-Based Analysis of Brain Responses in Multi-Modal Human-Robot Interaction: Modulating Engagement. arXiv 2024, arXiv:2411.18587. [Google Scholar]
  55. Tang, C.; Zhou, T.; Zhang, Y.; Yuan, R.; Zhao, X.; Yin, R.; Song, P.; Liu, B.; Song, R.; Chen, W.; et al. Bilateral upper limb robot-assisted rehabilitation improves upper limb motor function in stroke patients: A study based on quantitative EEG. Eur. J. Med. Res. 2023, 28, 603. [Google Scholar] [CrossRef]
  56. Malosio, M.; Lavit Nicora, M.; Redaelli, D.; Tauro, G. Device for Bilateral Rehabilitation. EP4518825B1, 14 January 2026; filed 4 May 2023.
  57. Consales, A.; Biffi, E.; Nossa, R.; Pittaccio, S.; Lazzari, F.; Malosio, M.; Lavit Nicora, M.; Tauro, G.; Redaelli, D.F.; Chaudhary, A.; et al. Inclusivity is child’s play: Pilot study on usability, acceptability and user experience of a sensory-motor PC game for children with cerebral palsy (GiocAbile). Ital. J. Pediatr. 2024, 50, 263. [Google Scholar] [CrossRef] [PubMed]
  58. Delorme, A.; Makeig, S. EEGLAB: An open source toolbox for analysis of single-trial EEG dynamics including independent component analysis. J. Neurosci. Methods 2004, 134, 9–21. [Google Scholar] [CrossRef]
  59. Vest, A.N.; Da Poian, G.; Li, Q.; Liu, C.; Nemati, S.; Shah, A.J.; Clifford, G.D. An open source benchmarked toolbox for cardiovascular waveform and interval analysis. Physiol. Meas. 2018, 39, 105004. [Google Scholar] [CrossRef]
  60. Goldberger, A.L.; Amaral, L.A.; Glass, L.; Hausdorff, J.M.; Ivanov, P.C.; Mark, R.G.; Mietus, J.E.; Moody, G.B.; Peng, C.K.; Stanley, H.E. PhysioBank, PhysioToolkit, and PhysioNet: Components of a new research resource for complex physiologic signals. Circulation 2000, 101, e215–e220. [Google Scholar] [CrossRef]
  61. Mastropietro, A.; Pirovano, I.; Marciano, A.; Porcelli, S.; Rizzo, G. Reliability of mental workload index assessed by eeg with different electrode configurations and signal pre-processing pipelines. Sensors 2023, 23, 1367. [Google Scholar] [CrossRef]
  62. Gabard-Durnam, L.J.; Mendez Leal, A.S.; Wilkinson, C.L.; Levin, A.R. The Harvard Automated Processing Pipeline for Electroencephalography (HAPPE): Standardized processing software for developmental and high-artifact data. Front. Neurosci. 2018, 12, 97. [Google Scholar] [CrossRef]
  63. Mullen, T.; Kothe, C.; Chi, Y.M.; Ojeda, A.; Kerth, T.; Makeig, S.; Cauwenberghs, G.; Jung, T.P. Real-time modeling and 3D visualization of source dynamics and connectivity using wearable EEG. In Proceedings of the 2013 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Osaka, Japan, 3–7 July 2013; IEEE: Piscataway, NJ, USA, 2013; pp. 2184–2187. [Google Scholar]
  64. Bell, A.J.; Sejnowski, T.J. An information-maximization approach to blind separation and blind deconvolution. Neural Comput. 1995, 7, 1129–1159. [Google Scholar] [CrossRef] [PubMed]
  65. Pion-Tonachini, L.; Kreutz-Delgado, K.; Makeig, S. ICLabel: An automated electroencephalographic independent component classifier, dataset, and website. NeuroImage 2019, 198, 181–197. [Google Scholar] [CrossRef]
  66. Welch, P. The use of fast Fourier transform for the estimation of power spectra: A method based on time averaging over short, modified periodograms. IEEE Trans. Audio Electroacoust. 2003, 15, 70–73. [Google Scholar] [CrossRef]
  67. Holm, A.; Lukander, K.; Korpela, J.; Sallinen, M.; Müller, K.M. Estimating brain load from the EEG. Sci. World J. 2009, 9, 639–651. [Google Scholar] [CrossRef]
  68. Pope, A.T.; Bogart, E.H.; Bartolome, D.S. Biocybernetic system evaluates indices of operator engagement in automated task. Biol. Psychol. 1995, 40, 187–195. [Google Scholar] [CrossRef]
  69. Freeman, F.G.; Mikulka, P.J.; Prinzel, L.J.; Scerbo, M.W. Evaluation of an adaptive automation system using three EEG indices with a visual tracking task. Biol. Psychol. 1999, 50, 61–76. [Google Scholar] [CrossRef]
  70. Bakdash, J.Z.; Marusich, L.R. Repeated measures correlation. Front. Psychol. 2017, 8, 456, Correction in Front. Psychol. 2019, 10, 1201. [Google Scholar] [CrossRef]
  71. Malik, M.; Bigger, J.T.; Camm, A.J.; Kleiger, R.E.; Malliani, A.; Moss, A.J.; Schwartz, P.J. Heart rate variability: Standards of measurement, physiological interpretation, and clinical use. Eur. Heart J. 1996, 17, 354–381. [Google Scholar] [CrossRef]
  72. Washabaugh, E.P.; Treadway, E.; Gillespie, R.B.; Remy, C.D.; Krishnan, C. Self-powered robots to reduce motor slacking during upper-extremity rehabilitation: A proof of concept study. Restor. Neurol. Neurosci. 2018, 36, 693–708. [Google Scholar] [CrossRef] [PubMed]
  73. Bernardes, A.; Couceiro, R.; Medeiros, J.; Henriques, J.; Teixeira, C.; Simões, M.; Durães, J.; Barbosa, R.; Madeira, H.; Carvalho, P. How reliable are ultra-short-term HRV measurements during cognitively demanding tasks? Sensors 2022, 22, 6528. [Google Scholar] [CrossRef]
  74. Cheng, S.; Li, W.; Hui, D.; Ma, J.; Zhang, T.; Teng, C.; Dang, W.; Xiong, K.; Hu, W.; Cong, L. Acute combined effects of concurrent physical activities on autonomic nervous activation during cognitive tasks. Front. Physiol. 2024, 15, 1340061. [Google Scholar] [CrossRef]
  75. Wang, P.; Houghton, R.; Majumdar, A. Detecting and predicting pilot mental workload using heart rate variability: A systematic review. Sensors 2024, 24, 3723. [Google Scholar] [CrossRef]
  76. Li, C.; Jin, C.; Zhang, Z.; Shi, P. Music recharges people: Synchronized music during aerobic exercise leads to better self-regulation performance. PLoS ONE 2022, 17, e0278062. [Google Scholar] [CrossRef]
  77. Ballmann, C.G.; Rogers, R.R.; Porrill, S.L.; Washmuth, N.B. Implications for the Ergogenic Benefits of Self-Selected Music in Neurological Conditions: A Theoretical Review. Neurol. Int. 2025, 17, 106. [Google Scholar] [CrossRef] [PubMed]
  78. North, A.C.; Hargreaves, D.J. Subjective complexity, familiarity, and liking for popular music. Psychomusicol. J. Res. Music Cogn. 1995, 14, 77. [Google Scholar] [CrossRef]
Figure 1. PhiCube’s central unit and handles.
Figure 1. PhiCube’s central unit and handles.
Sensors 26 00922 g001
Figure 2. Experimental set-up including the upper-limb rehabilitation robot and the physiological data acquisition device.
Figure 2. Experimental set-up including the upper-limb rehabilitation robot and the physiological data acquisition device.
Sensors 26 00922 g002
Figure 3. Schematic of the protocol followed by each participant.
Figure 3. Schematic of the protocol followed by each participant.
Sensors 26 00922 g003
Figure 4. Schematic diagram of the pre-processing and analysis steps followed to compute the objective parameters. In panel (a) the pipeline adopted to process the EEG signal and example plots of the signals of the five used electrodes (Fz, Cz, P3, Pz, P4) at the key stages of the workflow: in the upper figure a plot of the unprocessed EEG; in the middle, the cleaned EEG; and at the bottom, the power spectral density of the electrode P3 highlighting the three investigated frequency bands (theta, alpha, beta). In panel (b) the steps taken to analyze the ECG signal supported by graphic representation of the ECG signal at the key stages of the workflow: in the upper figure, the unprocessed ECG; in the middle, the cleaned ECG; and at the bottom, the RR and the Normal-to-Normal interval (NN) signals.
Figure 4. Schematic diagram of the pre-processing and analysis steps followed to compute the objective parameters. In panel (a) the pipeline adopted to process the EEG signal and example plots of the signals of the five used electrodes (Fz, Cz, P3, Pz, P4) at the key stages of the workflow: in the upper figure a plot of the unprocessed EEG; in the middle, the cleaned EEG; and at the bottom, the power spectral density of the electrode P3 highlighting the three investigated frequency bands (theta, alpha, beta). In panel (b) the steps taken to analyze the ECG signal supported by graphic representation of the ECG signal at the key stages of the workflow: in the upper figure, the unprocessed ECG; in the middle, the cleaned ECG; and at the bottom, the RR and the Normal-to-Normal interval (NN) signals.
Sensors 26 00922 g004
Figure 5. Graphic representation of the physiological filtered signals used to compute the objective parameters evaluated. In panel (a), the EEG signal of the electrodes used in the computation of the objective parameters (Fz, Cz, P3, Pz, P4); in panel (b), the ECG signal, with R peaks highlighted in light blue dots.
Figure 5. Graphic representation of the physiological filtered signals used to compute the objective parameters evaluated. In panel (a), the EEG signal of the electrodes used in the computation of the objective parameters (Fz, Cz, P3, Pz, P4); in panel (b), the ECG signal, with R peaks highlighted in light blue dots.
Sensors 26 00922 g005
Figure 6. Scatter distributions of the objective physiological indexes Mental Workload Index (MWLI) (top row), Engagement Index (EI) (middle row), and Average Value of Normal-to-Normal intervals (AVNN) (bottom row) across Tasks for the Assistance Levels (left column) and the Auditory Stimuli (right column). Each dot represents an individual participant’s value. The white horizontal line indicates the mean, the darker shaded band represents the 95% confidence interval of the mean, and the lighter colored area corresponds to one standard deviation. Asterisks denote statistically significant differences between conditions, based on Nemenyi post hoc for the Assistance Levels and Wilcoxon pairwise comparison for the Auditory Stimuli.
Figure 6. Scatter distributions of the objective physiological indexes Mental Workload Index (MWLI) (top row), Engagement Index (EI) (middle row), and Average Value of Normal-to-Normal intervals (AVNN) (bottom row) across Tasks for the Assistance Levels (left column) and the Auditory Stimuli (right column). Each dot represents an individual participant’s value. The white horizontal line indicates the mean, the darker shaded band represents the 95% confidence interval of the mean, and the lighter colored area corresponds to one standard deviation. Asterisks denote statistically significant differences between conditions, based on Nemenyi post hoc for the Assistance Levels and Wilcoxon pairwise comparison for the Auditory Stimuli.
Sensors 26 00922 g006
Figure 7. Scatter distributions of the subjective self-assessed indexes NASA-TLX Overall (top row) and Engagement (bottom row) across Tasks for the Assistance Group (left column) and the Auditory Stimuli (right column). Each dot represents an individual participant’s value. The white horizontal line indicates the mean, the darker shaded band represents the 95% confidence interval of the mean, and the lighter colored area corresponds to one standard deviation. Asterisks denote statistically significant differences between conditions, based on Nemenyi post hoc for the Assistance Levels and Wilcoxon pairwise comparison for the Auditory Stimuli.
Figure 7. Scatter distributions of the subjective self-assessed indexes NASA-TLX Overall (top row) and Engagement (bottom row) across Tasks for the Assistance Group (left column) and the Auditory Stimuli (right column). Each dot represents an individual participant’s value. The white horizontal line indicates the mean, the darker shaded band represents the 95% confidence interval of the mean, and the lighter colored area corresponds to one standard deviation. Asterisks denote statistically significant differences between conditions, based on Nemenyi post hoc for the Assistance Levels and Wilcoxon pairwise comparison for the Auditory Stimuli.
Sensors 26 00922 g007
Table 1. Characterization by age group of the population. SD stands for the standard deviation of each group’s age in years.
Table 1. Characterization by age group of the population. SD stands for the standard deviation of each group’s age in years.
Number of SubjectsAge (Years)Mean AgeSD
1718–3025.42.3
831–5040.95.6
551–6555.25.4
Table 2. Nemenyi (for Assistance Levels) and Wilcoxon (for Auditory Stimuli) results for every objective index analyzed. The minus sign (−) indicates that no Nemenyi post hoc test was conducted, given the non-significant difference observed in the results of the Friedman test.
Table 2. Nemenyi (for Assistance Levels) and Wilcoxon (for Auditory Stimuli) results for every objective index analyzed. The minus sign (−) indicates that no Nemenyi post hoc test was conducted, given the non-significant difference observed in the results of the Friedman test.
OBJECTIVE INDEXAssistance LevelsAuditory Stimuli
AME-SMESME-PMEAME-PMEAME-AMU
MWLIp = 0.746
EIp < 0.001 *
AVNN0.8630.004 *p < 0.001 *p = 0.158
* Statistically significant ( α < 0.05).
Table 3. Nemenyi (for Assistance Levels) and Wilcoxon (for Auditory Stimuli) results for every subjective index analyzed.
Table 3. Nemenyi (for Assistance Levels) and Wilcoxon (for Auditory Stimuli) results for every subjective index analyzed.
SUBJECTIVE INDEXAssistance LevelsAuditory Stimuli
AME-SMESME-PMEAME-PMEAME-AMU
NASA-TLX Overallp = 0.991p = 0.015 *p = 0.022 *p = 0.203
Engagementp = 0.638p = 0.012 *p < 0.001 *p = 0.023 *
* Statistically significant ( α < 0.05).
Table 4. Repeated measures correlation results for Auditory Stimuli. The p-value and the correlation coefficient (r) are reported for each correlation. The minus sign (−) indicates that no correlation was performed.
Table 4. Repeated measures correlation results for Auditory Stimuli. The p-value and the correlation coefficient (r) are reported for each correlation. The minus sign (−) indicates that no correlation was performed.
NASA-TLX OverallEngagement
MWLIp = 0.573, r = 0.11
EIp = 0.571, r = 0.11
AVNNp = 0.010 *, r = −0.46
* Statistically significant ( α < 0.05).
Table 5. Repeated measures correlation results for Assistance Levels. The p-value and the correlation coefficient (r) are reported for each correlation. The minus sign (−) indicates that no correlation was performed.
Table 5. Repeated measures correlation results for Assistance Levels. The p-value and the correlation coefficient (r) are reported for each correlation. The minus sign (−) indicates that no correlation was performed.
NASA-TLX OverallEngagement
MWLIp = 0.694, r = 0.05
EIp = 0.438, r = 0.1
AVNNp = 0.062, r = −0.24
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zanco, C.; Mondellini, M.; Nicora, M.L.; Malosio, M.; Tauro, G.; Rizzo, G.; Mastropietro, A. Multimodal Evaluation of Mental Workload and Engagement in Upper-Limb Robot-Assisted Motor Tasks. Sensors 2026, 26, 922. https://doi.org/10.3390/s26030922

AMA Style

Zanco C, Mondellini M, Nicora ML, Malosio M, Tauro G, Rizzo G, Mastropietro A. Multimodal Evaluation of Mental Workload and Engagement in Upper-Limb Robot-Assisted Motor Tasks. Sensors. 2026; 26(3):922. https://doi.org/10.3390/s26030922

Chicago/Turabian Style

Zanco, Camilla, Marta Mondellini, Matteo Lavit Nicora, Matteo Malosio, Giovanni Tauro, Giovanna Rizzo, and Alfonso Mastropietro. 2026. "Multimodal Evaluation of Mental Workload and Engagement in Upper-Limb Robot-Assisted Motor Tasks" Sensors 26, no. 3: 922. https://doi.org/10.3390/s26030922

APA Style

Zanco, C., Mondellini, M., Nicora, M. L., Malosio, M., Tauro, G., Rizzo, G., & Mastropietro, A. (2026). Multimodal Evaluation of Mental Workload and Engagement in Upper-Limb Robot-Assisted Motor Tasks. Sensors, 26(3), 922. https://doi.org/10.3390/s26030922

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop