A Cloud–Robot–Wearable System for Bilateral Reaching Rehabilitation: Affected-Side Identification and Quality Quantification

Chen, Chia-Hau; Tang, Li-Hsien; Yeh, Chang-Hsin; Wu, Eric Hsiao-Kuang; Yeh, Shih-Ching

doi:10.3390/electronics15071459

Open AccessArticle

A Cloud–Robot–Wearable System for Bilateral Reaching Rehabilitation: Affected-Side Identification and Quality Quantification

by

Chia-Hau Chen

¹,

Li-Hsien Tang

²,

Chang-Hsin Yeh

³,

Eric Hsiao-Kuang Wu

²

and

Shih-Ching Yeh

^2,*

¹

Department of Civil Engineering and Environmental Informatics, Minghsin University of Science and Technology, Hsinchu 30401, Taiwan

²

Department of Computer Science and Information Engineering, National Central University, Taoyuan City 320317, Taiwan

³

School of Computer Science, Fudan University, Shanghai 200433, China

^*

Author to whom correspondence should be addressed.

Electronics 2026, 15(7), 1459; https://doi.org/10.3390/electronics15071459

Submission received: 14 February 2026 / Revised: 25 March 2026 / Accepted: 26 March 2026 / Published: 1 April 2026

(This article belongs to the Section Computer Science & Engineering)

Download

Browse Figures

Versions Notes

Abstract

Therapist shortages make home-based rehabilitation an essential component of post-stroke care, yet patients often exhibit reduced adherence when functional gains are difficult to quantify and interpret. This study presents a cloud-enabled assessment framework centered on a dynamic reaching task for upper-limb rehabilitation in individuals with mild stroke. The proposed system combines wearable sensing and Internet of Things (IoT) connectivity to stream kinematic data to the cloud for near real-time analysis, and integrates a force-feedback rehabilitation robot to deliver motion guidance during training. The pipeline proceeds in three stages. First, smoothness-related kinematic descriptors are extracted and fed into a deep multi-class classifier to discriminate the affected side (left, right, or healthy). Second, movement quality is modeled using a Gaussian Mixture Model (GMM) trained on IoT-acquired trajectories to quantify performance via probabilistic similarity. Third, a calibrated scoring function transforms GMM log-likelihood into a normalized 0–1 quality index, producing visual reports that support interpretable feedback for patients and therapists. The framework is validated using motion data collected from stroke patients at Taipei Veterans General Hospital. Experimental results demonstrate that the neural network multi-classifier achieved an F1-score of 0.95. Incorporating robot-derived interaction signals further improved classification performance by approximately 5%. For movement quality assessment, the derived scores showed a significant positive correlation (Pearson correlation = 0.632, p = 0.02) with therapist-defined gold reference standards for right-affected patients. Additionally, integrating robot force-feedback signals and AIoT-enabled dynamic streams improved score accuracy by 8% and score responsiveness by 10%. These quantitative outcomes substantiate the efficacy of combining IoT-driven sensing and robot-assisted training for objective, interpretable, and remotely deployable motor assessment.

Keywords:

stroke rehabilitation; machine learning; motor analysis; performance metrics; internet of things; rehabilitation robot

1. Introduction

Stroke is a major contributor to mortality and long-term disability in low- and middle-income countries, representing approximately 70% of global stroke incidence and 87% of stroke-related deaths and disability-adjusted life years (DALYs) [1]. It is typically caused by disrupted cerebral blood flow that leads to neuronal damage and cell death [2]. Conventional rehabilitation relies on clinicians and therapists to repeatedly stimulate relevant neural pathways through structured motor training to promote functional recovery [3]. Such scientific rehabilitation protocols require data collection by repeating standard motions multiple times, inevitably leading to repetitiveness and boredom. To counteract this, virtual reality (VR) can be leveraged to significantly increase patient engagement during these inevitable repetitive tasks [4]. In practice, however, the limited availability of therapists frequently constrains the intensity and continuity of rehabilitation services [5]. As a result, home-based rehabilitation has become an essential complement to clinical care, with therapists prescribing individualized programs that patients can perform outside the hospital. In parallel, recent advances in Internet of Things (IoT) technologies have enabled wearable-sensor-based motion capture and cloud analytics for remote monitoring and near-real-time assessment, helping reduce clinical workload, while rehabilitation robots can deliver force feedback and motion guidance to improve movement execution and training consistency in home settings [6].

Within home-based rehabilitation, patients typically follow the therapist’s instructions and self-report their progress. Prior work suggests that more than 90% of rehabilitation activities occur at home [7]. Despite its convenience, adherence is often undermined by the uncertainty of the patients about whether their movements are performed correctly, which can prolong recovery and increase overall treatment costs [8,9]. IoT-supported feedback—particularly when presented as intuitive visualized reports—can help patients interpret movement quality, while robot-assisted training can provide personalized guidance to support engagement and adherence [10]. More broadly, interactive rehabilitation systems have been shown to increase patient involvement [6,11] and are associated with meaningful functional improvements [12,13,14]. Such systems also generate rich multimodal data streams (e.g., skeletal kinematics and electromyography) through sensing technologies [15,16,17], enabling quantitative analyses of rehabilitation progress and motor performance [18,19].

For rehabilitation motion assessment, clinical scales such as the FMA [20] and WMFT [21], as well as many home-based exercise protocols, place particular emphasis on upper-limb motor function [16,22]. Furthermore, while these established clinical scales are widely used, they present an inherent limitation: potential subjectivity. Clinical assessments can vary due to different physicians’ backgrounds and are fundamentally based on discrete value ranges, which may fail to capture continuous changes in motor quality [4]. In ref. [22], the authors characterized movement performance in terms of three components—range of motion (ROM), smoothness, and compensation—captured by kinematic factors, where smoothness is especially informative for analyzing velocity-related patterns and tremor. Liao et al. [23] further categorized rehabilitation assessment approaches into discrete movement scoring, rule-based methods, and template-based modeling. To address these assessment needs, Artificial Intelligence (AI) and specifically deep learning (DL) architectures have become pivotal. Broadly, Artificial Intelligence (AI) enables machines to perform tasks requiring human-like cognition. Within this broad field, deep learning (DL) models are a powerful subset of AI techniques that can be directly compared and opposed to traditional machine learning (ML) classifiers. Specifically, rather than relying on hand-crafted features, DL training can be performed directly on complex raw data, often leveraging the hierarchical filtering effect of convolutional layers [24,25,26]. Discrete classification models (e.g., SVM and Random Forest) can effectively distinguish movement categories, but they may be less sensitive to subtle and continuous changes in motor quality [27,28]. Template-based methods compare patient trajectories with reference patterns using probabilistic density models such as GMM [29] and HMM [30] to represent variability; however, constructing robust multi-level models remains challenging. To enhance interpretability, scoring functions are often used to transform template outputs into normalized scores (e.g., 0–1 or 0–100) [23,29]. Although a recent survey highlights various DL models currently employed in remote monitoring for home-based rehabilitation [31], the specific application of DL classifiers to dynamically and explicitly discriminate the affected side in bilateral tasks remains largely unexplored.

Despite recent progress in automated rehabilitation assessment, existing methodologies exhibit two critical limitations that hinder their clinical utility in home-based settings. First, the lack of explicit affected-side identification: most current frameworks evaluate bilateral upper-limb motor function globally or assume the impaired side is predefined. In bilateral rehabilitation tasks, this oversight merges kinematic data from both the healthy and impaired limbs, diluting the sensitivity of the assessment and failing to isolate the actual motor deficit. Second, the inadequacy of continuous quality quantification: while discrete scoring models (e.g., SVM and Random Forest) can classify broad movement categories or impairment levels effectively, they suffer from coarse category boundaries. Consequently, they are less sensitive to subtle, continuous changes in motor quality, failing to capture the fine-grained daily improvements crucial for patient motivation. Quantitatively, the clinical validation of prior automated rehabilitation-assessment frameworks remains limited and highly heterogeneous. Lee et al. [22] collected Kinect recordings from 15 post-stroke survivors and 11 healthy subjects and used therapist-provided reference scores to assess exercise quality; however, their framework was designed for exercise-level scoring rather than explicit side-specific bilateral discrimination. Kim et al. [32] enrolled 41 patients with hemiplegic stroke and used Kinect-based motion capture to estimate upper-extremity Fugl–Meyer scores; however, the task formulation focused on item-wise clinical score prediction rather than continuous trajectory-quality modeling in bilateral reaching tasks. By contrast, Liao et al. [29] explicitly noted that the main validation of their deep learning framework was conducted primarily on healthy-subject rehabilitation data and that a substantial portion of the dataset lacked clinician-provided ground-truth quality labels. Therefore, among the representative studies most closely related to automated quantitative rehabilitation assessment cited here [22,29,30,32], only a limited subset incorporated real stroke patient motion data, and even fewer combined patient-specific high-dimensional kinematic measurements with therapist- or clinician-anchored reference standards for continuous quality estimation. Although the present pilot study involved a smaller cohort of five stroke patients and three healthy controls, it was specifically designed to capture synchronized multimodal bilateral reaching data and to anchor the resulting probabilistic modeling to clinically meaningful side-specific assessment. This distinction is important because, in bilateral upper-limb rehabilitation, side-specific impairment may be obscured if pathological motion data are sparse, absent, or not explicitly linked to clinically grounded scoring criteria. Conversely, while some continuous matching methods such as DTW can reveal nuanced differences, they often operate directly on raw sensor trajectories, which reduces robustness under sensor noise and natural inter-subject variability.

Addressing these existing problems is crucial because failing to isolate the affected side often results in misleading global scores that mask subtle, localized motor deficits. Consequently, therapists may prescribe inappropriate training intensities. Furthermore, without continuous and fine-grained quality quantification, patients may suffer from reduced motivation due to untracked micro-recoveries, ultimately delaying their overall functional restoration.

To address the limitations of existing rehabilitation assessment systems—such as the insensitivity of discrete scoring methods to subtle motor changes, the lack of affected-side identification in previous template-based approaches, and the tendency of current systems to emphasize training delivery over objective progress quantification —this study proposes a novel, integrated assessment framework. The core innovative contributions of this paper are summarized as follows:

Hybrid Side-Specific Assessment Pipeline: Unlike previous works that evaluate upper-limb function globally, the proposed framework introduces a sequential methodology. It first employs a deep learning multi-class classifier to explicitly identify the affected side (left, right, or healthy) using smoothness-related kinematic features. This guarantees that subsequent quality evaluations are highly targeted and side-specific.
Interpretable Continuous Quality Quantification: This study advances traditional template-based modeling by conditioning a Gaussian Mixture Model (GMM) specifically on the identified affected side. By integrating a calibrated scoring function, the framework maps complex log-likelihoods into an intuitive 0–1 index. Crucially, unlike many existing theoretical models, this quantitative index is clinically validated against therapist-defined gold standards using motion data from real stroke patients at Taipei Veterans General Hospital.
Synergistic AIoT and Robotic Architecture: Moving beyond conventional VR platforms, the proposed framework combines wearable Internet of Things (IoT) sensing, near real-time AWS cloud analytics, and a force-feedback rehabilitation robot. This multifaceted integration not only ensures movement execution fidelity via physical guidance but also fuses multimodal data streams to enhance the statistical robustness of remote motor assessment in home-based settings.

The rest of this paper is organized as follows: Section 2 reviews related work. Section 3 details the proposed system architecture and methodology. Section 4 presents the experimental results. Finally, Section 5 provides the discussion, and Section 6 concludes the paper.

2. Related Work

2.1. Technology-Assisted Stroke Rehabilitation Systems

Recently, numerous stroke rehabilitation exercises have been developed to train upper-limb motor function, and some studies have validated their effectiveness using established clinical scales, such as the Fugl–Meyer Assessment (FMA) and the Wolf Motor Function Test (WMFT). A low-cost upper-limb rehabilitation system leveraging Kinect [33] was introduced and evaluated by tracking score improvements during the rehabilitation process [34]. Mobile virtual reality (VR) programs have also been proposed and validated using FMA-UE, Brunnstrom stage, and manual muscle testing [35]. In addition, game-based VR training, such as canoe-paddling exercises, has been reported to improve upper-extremity function, with outcomes assessed by the modified functional reach test (mFRT) and manual function test (MFT) [36].

Beyond VR-based paradigms, rehabilitation platforms increasingly incorporate Internet of Things (IoT) infrastructures to support wearable-sensor motion capture and remote monitoring, thereby improving accessibility and reducing therapist workload [6]. Rehabilitation robots have likewise been adopted to deliver force feedback and individualized motion guidance to improve movement execution fidelity and training engagement [10]. However, many existing systems emphasize training delivery rather than progress quantification, leaving therapists with limited objective evidence to track recovery trajectories in home-based settings.

Specifically focusing on the studies that overlap most closely with the proposed framework, Postolache et al. [6] developed a remote physical-rehabilitation system that combines IoT connectivity, virtual reality, and wearable sensing to support stroke rehabilitation beyond the clinic. Their work demonstrates the practical value of cloud-enabled remote monitoring and multimodal sensing for tracking rehabilitation performance. However, the system is primarily oriented toward remote supervision and general progress observation, rather than explicit side-specific motor assessment during bilateral task execution.

Marchal-Crespo and Reinkensmeyer [10] reviewed robotic movement-training strategies for neurologic rehabilitation and highlighted the importance of robotic assistance, adaptive guidance, and interactive feedback for improving movement execution. Their framework strongly motivates the integration of force-feedback rehabilitation robots into upper-limb training. Nevertheless, the focus is placed on training control strategies and assistance paradigms, rather than on an AI-based assessment pipeline that automatically identifies the affected side and subsequently quantifies movement quality.

From the perspective of continuous quality modeling, Liao et al. [23] categorized rehabilitation assessment approaches into discrete scoring, rule-based evaluation, and template-based modeling, and further proposed an interpretable scoring mechanism for transforming model outputs into normalized performance indices. This line of work is highly relevant to the present study because it emphasizes interpretable quantitative feedback. However, it does not explicitly address bilateral affected-side discrimination before score generation.

In addition, Lee et al. [22] introduced a comprehensive set of kinematic descriptors for upper-limb rehabilitation assessment, including range-of-motion, smoothness, and compensation components. Their framework provides an important feature-design basis for movement analysis, particularly for smoothness-related assessment. Still, the method is centered on feature characterization rather than on a sequential framework that first performs affected-side classification and then applies side-specific probabilistic quality modeling.

Taken together, these prior studies establish the importance of remote monitoring, robotic assistance, interpretable scoring, and kinematic feature engineering in stroke rehabilitation. However, a framework that integrates all of these elements into a unified pipeline—namely, AI-based affected-side identification, side-specific continuous quality quantification, cloud-supported multimodal sensing, and robot-assisted bilateral reaching training—remains insufficiently explored. The proposed study is intended to fill this gap.

2.2. Discrete Motor Assessment Approaches

To address assessment needs, discrete movement scoring approaches have been explored, often by extracting smoothness-related features as indicators of upper-limb motor impairment. For example, SVM-based classifiers have been designed using smoothness descriptors to assess motor impairment severity [37]. Unsupervised strategies have also been investigated, such as clustering arm movements into impairment levels using smoothness features and regularized Mahalanobis distance-based k-means [38]. Other work has combined sensor-derived smoothness features with Random Forest models and anchored predictions to traditional scales (FMA, WMFT) for motor ability assessment [39]. Nevertheless, neural network-based methods have been integrated with FMA for upper-extremity function estimation [32]. IoT-enabled sensing can further strengthen these pipelines by providing continuous, multimodal streams (e.g., acceleration and angular velocity) that support richer feature construction and more stable classification in home environments [40]. Robot-assisted execution can additionally reduce trial-to-trial variability by enforcing more consistent movement patterns via force feedback, which is particularly relevant when motor differences are subtle and easily confounded by inconsistent task performance [10]. Nevertheless, discrete scoring frameworks remain intrinsically limited by coarse category boundaries, and may fail to capture fine-grained motor changes over time—an issue that can sustain patient uncertainty regarding improvement and weaken adherence in home-based rehabilitation. Furthermore, drawing inspiration from advanced feature representation techniques in broader deep learning domains—such as semantic compensated adaptive fusion networks [41] and progressive interaction with saliency-guided enhancement [42,43] designed for complex remote sensing image analysis—future rehabilitation systems could employ similar sophisticated attention and fusion strategies. These approaches could be adapted to isolate salient kinematic features and integrate multimodal IoT data (e.g., combining trajectories with electromyography) more effectively, thereby enhancing the sensitivity of motor impairment classification.

Within these discrete and continuous evaluation frameworks, various machine learning (ML) and DL architectures are commonly employed. For instance, Decision Trees (DTs) and Random Forests (RFs) operate via hierarchical decision rules and ensemble learning, respectively. Support Vector Machines (SVM) determine optimal class-separating hyperplanes to categorize impairment severity. For processing sequential motion data, Long Short-Term Memory (LSTM) networks—a specialized DL architecture—are frequently adopted to capture temporal dependencies [24]. Conversely, generative probabilistic modeling is effectively achieved using Gaussian Mixture Models (GMMs), which represent complex data distributions as weighted sums of multiple Gaussian components.

2.3. Continuous Movement Quality Modeling

To detect subtle movement changes more sensitively, several quantifiable assessment methods have been proposed. Template–data comparisons using Euclidean-distance similarity in position and velocity have been studied [4], and dynamic time warping (DTW)-based metrics have been applied via Euclidean norms of DTW differences [44]. Multi-template, multi-match DTW variants have also been developed to evaluate similarity between training sequences and reference templates [45]. However, these approaches often operate directly on raw sensor trajectories, even though they can reveal nuanced differences; this may reduce robustness under noise and inter-subject variability. In contrast, probability density function-based modeling has been explored to represent motion variability more systematically. A representative pipeline combines autoencoder-based dimensionality reduction with a Gaussian Mixture Model (GMM) to model human motion [46], and subsequent work has introduced scoring functions that transform GMM negative log-likelihood into interpretable performance indices (0–1) [29]. From a system perspective, IoT infrastructures can operationalize such modeling by enabling continuous data acquisition, cloud-based computation, and visualized reporting, thereby improving interpretability for both patients and therapists. In addition, robot-assisted execution can improve the consistency of training trajectories, potentially benefiting the accuracy of template-based modeling [10]. However, a recurring limitation is the lack of validation on real patient datasets, which constrains clinical credibility and practical deployment.

In summary, a critical review of the current literature reveals two primary research gaps that remain unaddressed. First, existing assessment pipelines predominantly evaluate bilateral upper-limb function globally or rely on predefined impairment labels. They fail to dynamically explicitly identify and isolate the affected side, which dilutes the sensitivity of subsequent quality modeling in bilateral tasks. Second, although continuous probabilistic modeling (e.g., GMM) offers theoretical advantages over discrete classification, previous implementations largely lack rigorous clinical validation against therapist-defined gold standards using actual stroke patient datasets. This absence of clinical anchoring limits their interpretability and practical deployment. The proposed framework directly bridges these gaps by chaining an automated side-identification multi-classifier with a clinically validated, continuous quality scoring function within an integrated AIoT-robotic architecture.

3. Method

3.1. Participants

To evaluate the computational feasibility and system stability of the proposed assessment framework, this research was conducted as an initial proof-of-concept pilot study. A total of 5 post-stroke participants and 3 healthy control subjects were enrolled. The experiment was conducted at the rehabilitation department of Taipei Veterans General Hospital. All participants provided written informed consent prior to the experiment.

Specifically, the system integrates a rehabilitation robot equipped with a multi-degree-of-freedom force-feedback arm. This robotic arm guides the reaching trajectories and delivers adaptive resistance or assistance tailored to the patient’s real-time motor capability. Furthermore, multiple reaching trials were administered within a structured virtual-reality-based assessment setting to obtain sufficiently representative motion data for model development [4].

The inclusion criteria for the post-stroke cohort were as follows:

Having basic cognitive ability, capable of understanding the experimental process, and following simple instructions.
Capable of partial voluntary hand movements, such as side lifting with a small angle
Able to tolerate 1 to 5 rehabilitation exercise units, depending on the individual’s motor ability.

All post-stroke participants were right-handed stroke survivors. A therapist assessed each patient’s motor performance during the dynamic reaching exercise and provided a ground-truth score, which was used for correlation analysis in the subsequent evaluation approaches. In addition, three healthy right-handed participants were recruited to perform the same task to collect reference skeletal-motion data. The demographic and clinical characteristics of all participants—including age, gender, and affected region—are summarized in a unified format in Table 1.

3.2. Rehabilitation System Design

3.2.1. System Introduction

The proposed VR-based stroke rehabilitation system integrates Kinect [33], 3D stereo glasses, a 3D projection display, a 3D graphics card, and related hardware modules to deliver upper-limb rehabilitation exercises for stroke patients. Using the Unity 3D engine, the authors implement a dynamic reaching task that targets upper-limb extension, postural balance during reaching, and hand–eye coordination. To enhance sensing fidelity and system deployability, the authors incorporate an Internet of Things (IoT) architecture (Figure 1) with wearable inertial measurement units (IMUs) that include tri-axial accelerometers and gyroscopes. The IMUs are worn on the wrist and upper arm to acquire multimodal motion signals (e.g., acceleration and angular velocity) at 100 Hz, complementing Kinect measurements by providing higher-resolution characterization of subtle movement components such as tremor. In addition, the system integrates a rehabilitation robot equipped with a force-feedback robotic arm that guides reaching trajectories and delivers adaptive resistance according to the patient’s motor capability, thereby supporting consistent and correct task execution. Motion streams from the Kinect and IMUs are transmitted via Wi-Fi to an Amazon Web Services (AWS) cloud platform for near real-time processing and storage, enabling remote monitoring and longitudinal review by therapists.

3.2.2. Task Content

The user interface of the dynamic reaching exercise is illustrated in Figure 2, and the corresponding schematic is provided in Figure 3. The task is designed as a bilateral ball-throwing-and-catching scenario, in which the participant uses both arms to repeatedly intercept a ball that follows a parabolic trajectory. During each trial, stroke patients are instructed to extend both upper limbs to catch successive balls, thereby training coordinated reaching performance. The total number of target catches is configurable by the therapist to match the patient’s rehabilitation stage and tolerance. To provide transparent feedback on task execution, the VR display reports key outcome statistics in real time, including the number of successful catches, the number of failed attempts, and the current streak of consecutive successful catches.

To support safe and consistent movement execution, the rehabilitation robot delivers haptic feedback during catching events and adapts the assistance/resistance profile according to the patient’s Fugl–Meyer Assessment Upper Extremity (FMA-UE) score, enabling individualized guidance. In addition, real-time performance metrics are computed on AWS Lambda and rendered on the VR interface as interactive bar charts, allowing patients to track immediate progress and facilitating sustained task engagement.

3.2.3. Difficulty Design Mechanism

The difficulty of the dynamic reaching exercise is configurable through the following parameters.

Horizontal falling distance of the sphere: The horizontal displacement of the falling sphere can be adjusted according to the patient’s upper-limb extension capability (i.e., reachable workspace). To accommodate different functional levels in the bilateral VR task, the system provides multiple reach-range settings, including 30–50%, 30–75%, and 30–100% of the target range.

Required number of successful catches: Therapists can specify the target count of successful catches to regulate training duration and endurance demand, thereby assessing how long the patient can sustain continuous practice.

The speed of the sphere falling: Speed is in units of gravitational acceleration (G). Therapists could set this to

\frac{1}{9} G, \frac{2}{9} G, \frac{3}{9} G

so that patients could train their reaction time and hand-eye coordination according to their impairment level.

3.2.4. Experimental Rehabilitation Platform

The experimental rehabilitation platform used in this study consisted of an integrated station comprising a display monitor, a webcam, and a host computer mounted on a dedicated support frame. Within the proposed cloud–robot–wearable framework, this station functioned as the interactive rehabilitation node for task presentation, participant supervision, and local system execution during the bilateral reaching assessment. The monitor was used to present the rehabilitation interface and task-related feedback to the participant, while the webcam enabled real-time visual monitoring throughout the experimental session. The host computer, positioned on the lower shelf of the platform, managed the execution of the rehabilitation software, local data acquisition, and synchronization with the sensing modules adopted in the proposed framework. This compact configuration provided a stable and reproducible setup for administering bilateral upper-limb assessment tasks in a controlled indoor environment.

3.3. Analysis Approach

To develop an assessment method for the dynamic reaching exercise that accounts for its bilateral nature, this study proposes a discrete movement score-based multi-class classifier that categorizes each participant as a healthy subject, a post-stroke subject with left-side impairment, or a post-stroke subject with right-side impairment. The classifier is constructed using smoothness-related features, as smoothness is a key indicator of motor control during continuous bilateral catching movements.

Prior to feature extraction, rigorous preprocessing operations were applied to the raw sensor data. Given the inherent difference in the sampling rates of the Kinect (approximately 50 Hz) and the IMU sensors (100 Hz), a resampling operation was strictly required to achieve accurate data synchronization. The raw IMU signals underwent noise-reduction filtering, normalization, and segmentation. Subsequently, all multimodal data streams were resampled and temporally aligned to ensure that the kinematic metrics extracted from different hardware sources were synchronized and unaffected by phase mismatches [24].

After identifying the affected side, the proposed pipeline further quantifies movement performance to support therapist evaluation and to provide feedback that may facilitate patient engagement during home-based training. Specifically, this study adopts the modeling strategy in [46] to compare patient motion data with healthy-subject reference data collected in the dynamic reaching task, thereby producing an objective measure of movement quality. To enhance interpretability, the authors additionally apply the scoring function proposed in [29], converting the model output into a normalized performance score that can be readily understood by clinicians and patients. The overall pipeline of the analysis approach is summarized in Figure 4.

It is important to note that although the participant cohort is small (N = 8), each participant performed multiple continuous dynamic reaching cycles across their prescribed rehabilitation units. Consequently, the feature matrices used for training the machine learning classifiers and the GMM were extracted at the segment level (i.e., individual reaching cycles) rather than being aggregated at the subject level. This approach substantially expanded the effective dataset size—yielding hundreds of independent kinematic samples—which provided sufficient data volume for model optimization and mitigated the risk of overfitting.

3.3.1. Feature Extraction

The feature set in this study is designed primarily based on three prior works [22,47,48]. Lee et al. [22] introduced a comprehensive set of kinematic descriptors to characterize three movement-performance components. In this study, the focus is placed on the smoothness-related component, and the corresponding smoothness-based kinematic features are adopted for subsequent analysis. j specifies a joint in the set J extracted from the Kinect joint data.

J ∈ {left wrist (lw), right wrist (rw)}.
c denotes a coordinate of movement joints in the set C ∈ {x, y, z}.
t denotes the frame index.
T denotes the total number of frames.
F denotes the sampling frequency.

The following formulas are basic smoothness-based features:

Relative trajectory:	$r t_{t} (b, s) = \sqrt{\sum_{c \in C} (b, c) - (s, c)}$	(1)
Speed:	$s p_{t} (j) \{\begin{matrix} F * (r t_{t} (b, j) - r t_{t - 1} (b, j)), t > 1 \\ 0 \end{matrix}$	(2)
Acceleration:	$a c_{t} (j) \{\begin{matrix} F * (s p_{t} (j) - s p_{t - 1} (j)), i f t > 1 \\ 0, o t h e r w i s e \end{matrix}$	(3)
Jerk:	$j k (j) = \{\begin{matrix} \frac{F}{∆ t} (a c_{t} (j) - a c_{t - 1} (j)), i f t > 1 \\ 0, o t h e r w i s e \end{matrix}$	(4)

The following formulas define the normalized speed and normalized jerk:

Normalized speed:	$n s p_{t} (j) = \frac{s p_{t}^{a v g} (j)}{s p_{t}^{m a x} (j)}$	(5)
Normalized jerk:	$n j k_{t} (j) = \frac{j k_{t}^{a v g} (j)}{j k_{t}^{m a x} (j)}$	(6)

The following formula, the Mean Arrest Period Ratio (MAPR), indicates the proportion of frames when the speed exceeds a target percentage (10%) of the maximum speed. Ref. [22] expects that patients would make more unnecessary movements and so attain higher MAPR values.

m a p r_{t} (f t, j) = \frac{1}{t} \sum_{s = 1}^{t} I_{A} (f t_{s} (j)),

A = \{f t_{s} (j) > f t_{t}^{m a x} (j) * 0.1, f t_{s} (j) \in {s p_{s} (j), j k_{s} (j)}\}

(7)

The following formula, which represents the zero-crossing ratio, represents the period of a motion when the sign of acceleration or jerk changes. If a participant has more trembling movements or more unnecessary movements, they would attain a higher zero-crossing ratio.

z c_{t} (f t, j) = \frac{1}{t - 1} \sum_{s = 2}^{t} I_{R_{< 0}} (f t_{s} (j) \cdot f t_{s - 1} (j)) f t_{s} (j) \in {a c_{s} (j), j k_{s} (j)}, for t > 1

(8)

Archambault et al. [47] proposed a feature called the index of curvature (IC), which estimates the straightness. The formula is as follows:

I C (j) = \frac{path length}{line of sight distance}

(9)

Balasubramanian et al. [4] evaluate the spectral arc-length metric that uses the Fourier magnitude spectrum of the movement speed profile to assess movement smoothness. Consider a movement with speed profile v(t), t ∈ [0, T] and duration T. The formula is as shown below:

η_{s a l} - \int_{0}^{ω_{c}} \sqrt{{(\frac{1}{ω_{c}})}^{2} + {(\frac{d \hat{V} (ω)}{d ω})}^{2}} d ω,

(10)

\hat{V} (ω) ≜ \frac{V (ω)}{V (0)}

where V(ω) is the Fourier magnitude spectrum of v(t), and [0, ω_c] is the frequency band occupied by the given movement. ω_c = 40 π rad/s (which corresponds to 20 Hz) covers the normal and abnormal aspects of human movements such as tremor.

3.3.2. Feature Selection

To identify an effective feature subset for the proposed analysis approach, a one-way ANOVA is applied for feature selection. The statistical significance threshold is set to 0.05, and the highly significant threshold is set to 0.01.

3.3.3. Discrete Score Movement-Based Multi-Classifier

To classify participants into three categories—healthy, post-stroke with left-side impairment, and post-stroke with right-side impairment—a set of discrete movement score-based multi-classifiers are tested using smoothness-related features. Conventional multi-class machine learning models are first evaluated, including Decision Tree, Random Forest, and SVM. For the SVM, a linear kernel is adopted with the penalty parameter set to C = 1.0.

Deep learning-based multi-classification models are also investigated, including a feed-forward neural network and an LSTM network. For the feed-forward neural network, the hyperparameters are set as follows: epochs = 10, batch size = 5, and learning rate = 0.005. The network contains five hidden layers with 16, 32, 64, 32, and 16 units, respectively. A softmax output layer is used for multi-class prediction, and the model is trained using binary cross-entropy as the loss function.

Because participant motion signals are inherently sequential, the authors further employ a recurrent architecture (LSTM) to capture temporal dependencies and extract latent states from time-series movement data. For the LSTM model, the hyperparameters are set to epochs = 40, batch size = 3, and learning rate = 0.0001. The LSTM backbone consists of three recurrent layers with 128, 256, and 512 hidden units, followed by three fully connected feed-forward layers with 128, 256, and 512 units. The LSTM network uses ReLU as the activation function and binary cross-entropy as the loss function. The LSTM architecture is illustrated in Figure 5.

All multi-classifiers are trained using the same feature set. For the sequential model (LSTM), features are organized as a time-by-feature matrix spanning the full set of timestamps, where each feature corresponds to a vector over time. For non-sequential models, a feature vector is constructed using values at the final timestamp, with each feature represented as a scalar.

3.3.4. Template-Based Assessment Approach

After participant classification, each patient’s movement quality is further quantified by comparing patient trajectories with those of healthy subjects. Specifically, a performance metric based on the GMM log-likelihood proposed in [30] is adopted. Probabilistic modeling is well suited for rehabilitation motion analysis because it can represent the inherent variability and stochasticity in human movement patterns. The overall workflow of the proposed quality assessment is illustrated in Figure 6.

A GMM is a probabilistic mixture model composed of multiple Gaussian probability density functions [30]. Owing to its flexibility in capturing multimodal distributions, GMM-based modeling has been widely applied to represent movement data in rehabilitation exercises [22]. The architecture of the adopted GMM and the subsequent scoring function are illustrated in Figure 7. For a GMM consisting of C Gaussian components, the corresponding probability density function is given by the following equations.

P (x_{f}| λ) = \sum_{C = 1}^{C} π_{c} N (x_{f}| μ_{c}, Σ c)

(11)

where x_f represents a healthy subject’s feature data, and λ = {π_c, u_c, Σ_c} are the mixing coefficient, mean, and covariance of the Gaussian component. Therefore, the negative log-likelihood is used as a performance metric. The log-likelihood formula is given by

P (y_{f}| λ) = - \sum_{m = 1}^{M} \log (\sum_{c = 1}^{C} π_{c} N (y_{f}| μ_{c}, Σ c))

(12)

where Y_f represents the patient’s feature data, and MMM denotes the number of features. To compare movement quality in specific body regions between healthy subjects and patients, two GMMs were established for different sides of movement. One GMM was established by using healthy subjects’ left-side movement data, and the other GMM was established by using healthy subjects’ right-side movement data. The number of Gaussian components was set to three in each GMM.

3.3.5. Scoring Function

To make the GMM log-likelihood-based performance metric interpretable for both therapists and patients, the authors adopt the scoring function proposed by Liao et al. [23] to transform log-likelihood values into a normalized score within the range 0–1. Let

x = (x_1, x_2, \dots, x_L)

denote the sequence of performance-metric values computed from healthy-subject movements, and let

y = (y_1, y_2, \dots, y_N)

denote the corresponding sequence computed from patient movements, where

L

is the total number of healthy-subject movement samples and

N

is the total number of patient movement samples. The scoring function is defined by the following equations.

{\bar{x}}_{k} = {(1 + e^{\frac{x_{k}}{u + 3 δ} - a_{1}})}^{- 1}

(13)

{\bar{y}}_{k} = {(1 + e^{\frac{x_{k}}{u + 3 δ} - a_{1}} + \frac{y_{k} - \bar{x}}{a_{2} (u + 3 δ)})}^{- 1}

(14)

where

k \in N, μ = \frac{1}{L} \sum_{k = 1}^{L} |x_{k}|, \bar{x} = \frac{1}{L} \sum_{k = 1}^{L} x_{k}

,

δ

is the standard deviation of x.

Compared with the formulation in [27], we replace

x_k

with the mean value

\bar{x}

in the scoring function when computing patient scores. This modification is adopted because the healthy-subject cohort and the patient cohort differ in both sample size and participant identity (i.e., they are not paired observations). In addition, the proposed scoring scheme is defined conditioned on the affected side. Specifically, the scoring function for left-affected patients uses

\bar{x}

computed from the healthy subjects’ left-side performance metrics, while that for right-affected patients uses

\bar{x}

computed from the healthy subjects’ right-side performance metrics. Therefore, patients are always normalized against the corresponding side-specific

\bar{x}

associated with their impairment category. A higher score indicates that the patient’s movement quality is closer to the healthy-subject reference.

3.4. Amazon Web Services

The game platform is built on Amazon Web Services (AWS), leveraging its scalability, security, and cloud computing power to handle game data processing, analysis, and storage efficiently (Figure 8). To ensure seamless data flow, game data is directly transmitted to Amazon S3, which triggers an AWS Lambda function once the data is uploaded. This function executes computational tasks such as data transformation, performance analysis, and predictive modeling.

Once the data is processed, the refined results are stored in relational databases like Amazon RDS and DynamoDB, depending on the type of data. For images, which cannot be stored in relational databases, they are kept in Amazon S3, with URLs stored in the database to enable retrieval by EC2 Web Services. EC2 retrieves the processed data and presents it through a web-based platform, allowing professionals such as therapists and clinicians to analyze player performance metrics, response times, and cognitive indicators.

The platform utilizes Amazon CloudWatch to continuously monitor the performance of the system, logging key metrics and triggering alerts to ensure optimal operation. With auto-scaling mechanisms in place, the platform can dynamically adjust resource allocation based on demand, ensuring efficiency and cost-effectiveness. This AWS-powered architecture guarantees a secure, real-time, and scalable system, enabling professionals to make data-driven decisions, generate detailed reports, and enhance therapeutic or analytical outcomes through precise game-based assessments.

3.5. Statistical Analysis

All statistical evaluations were conducted to rigorously identify significant kinematic features and validate the quality assessment models. Prior to employing a one-way ANOVA for feature selection, a Shapiro–Wilk test was conducted to evaluate the normality of the data distributions, which is a fundamental prerequisite for parametric testing, especially given the restricted sample size [49]. Following the confirmation of normal distributions across the critical feature sets, the one-way ANOVA was utilized. While some deep learning pipelines rely solely on removing highly correlated features, ANOVA was specifically selected in this study to explicitly quantify the variance between the healthy and affected sides, providing a statistically interpretable and clinically meaningful baseline for the multi-classifier. Additionally, Pearson’s correlation analysis was employed to evaluate the linear relationship between the GMM-derived quality scores and the therapist-defined gold reference standard. Statistical significance was set at a threshold of p < 0.05.

4. Results

4.1. One-Way ANOVA for Feature Selection

To determine an appropriate feature subset for training the proposed multi-classifiers, a one-way ANOVA is employed for feature selection. For statistical comparability, the authors evaluate differences between healthy subjects and patients on the same skeletal side, matched to the patient’s affected region (i.e., left-side features for left-affected patients and right-side features for right-affected patients). The ANOVA results are summarized in Table 2. Features satisfying the significance criterion (p < 0.05) are retained and used to establish the multi-classifiers.

4.2. Machine Learning-Based Multi-Classification

Based on the feature selection procedure described above, an effective feature subset is identified for training multiple machine learning-based multi-classifiers. The classification performance is summarized in Table 3. Among conventional machine learning methods, the Decision Tree and Random Forest achieve F1-scores of 0.77, while the SVM attains an F1-score of 0.83. Among deep learning models, the feed-forward neural network yields the best overall performance with an F1-score of 0.95, while the sequential model (LSTM) achieves an F1-score of 0.85.

The contribution of feature sets adopted from multiple prior studies is also examined using the neural network classifier, which attains the highest F1-score in Table 3. The comparison across different combinations of literature-derived features is reported in Table 4. Using the full feature set aggregated from all three referenced studies produces the highest neural network F1-score.

To rigorously evaluate the classification performance, the F1-score was selected as the primary metric. Although accuracy is a common metric, it can be misleading in medical datasets that often exhibit class imbalance. The F1-score computes the harmonic mean of Precision and Recall, offering a more balanced and robust evaluation of the capability of the model to correctly identify the affected side without bias toward the majority class. The comparative classification performance of different models is illustrated in Figure 9. The F1-score is calculated as follows:

F 1 = 2 \times \frac{P r e c i s i o n \times R e c a l l}{P r e c i s i o n + R e c a l l}

(15)

4.3. Quality Assessment Framework

Using the proposed quality assessment pipeline (GMM with an interpretable scoring function), the movement data of patients who participated in the dynamic reaching exercise is evaluated. Quality scores are computed for all participants to examine whether score variations align with therapist judgments. To validate this relationship, the verification protocol described in [30] was adopted. To quantify the correlation between the therapist-defined gold reference standard and the quality scores produced by the proposed method, Pearson correlation analysis was performed. Table 5 summarizes the gold reference standard metrics defined by the therapist. Although the therapist-defined metric incorporates performance components (e.g., range of motion) that are not explicitly captured by the proposed analysis approach, it still serves as a clinically meaningful reference for upper-limb motor function assessment. Table 6 reports the Pearson correlation between the gold reference standard and the derived quality scores obtained using static Kinect data alone.

5. Discussion

This study proposed an integrated home-based rehabilitation assessment framework for post-stroke upper-limb training. The framework first identifies the affected side by using a deep learning multi-class classifier trained on smoothness-related kinematic features, and then performs side-specific movement quality evaluation through Gaussian Mixture Model (GMM)-based scoring. By combining Kinect skeletal data, wearable IMU signals, and rehabilitation-robot force-feedback information within an AWS-enabled AIoT architecture, the proposed system supports both affected-side discrimination and continuous motion-quality quantification. The results indicate that the proposed framework can effectively distinguish left-affected, right-affected, and healthy participants, while the side-specific scoring mechanism shows promising agreement with therapist-defined reference standards, particularly for right-affected patients. These findings support the feasibility of the proposed framework as an interpretable tool for remote and home-based stroke rehabilitation assessment.

According to Table 2, the significant features extracted from the left-side and right-side skeletal data share the same selected feature combination. This consistency allows us to use a unified feature subset to train the multi-classifier for identifying whether a patient’s affected side is left or right. The same feature subset is also used to establish two side-specific GMMs for subsequent quality assessment. In addition, incorporating AIoT-derived IMU signals (processed in near real time via AWS IoT Core) enriches the feature space with higher-resolution dynamics (e.g., angular velocity) and is associated with an approximate 5% increase in F1-score. Rehabilitation-robot force-feedback signals integrated through AWS EC2 further complement kinematic features by capturing corrective interaction patterns, thereby improving neural network classification accuracy.

Based on Table 3, the neural network achieves the highest F1-score; however, the conventional machine learning models underperform relative to the deep learning approach in this setting. Although the LSTM explicitly models temporal dependencies, its F1-score is approximately 10% lower than that of the neural network. A plausible explanation is that the adopted smoothness-based descriptors function primarily as summary statistics rather than sequence representations, and therefore do not benefit substantially from sequential modeling; a similar observation is reported in [22]. Notably, robot-assisted adaptive guidance informed by AIoT inputs may help stabilize motion execution, which can partially mitigate variability that would otherwise complicate sequence-based classification.

Regarding Table 4, combining features adopted from all three prior studies yields the highest neural network F1-score, suggesting that a more diverse feature representation improves characterization of movement behavior. This result implies that incorporating additional feature designs from the literature may further strengthen multi-classifier performance.

According to Table 6, the correlation between the therapist-defined gold reference standard and the proposed quality assessment scores is supported for patients with right-side impairment. Robot-based real-time corrections, which help minimize trajectory deviations during task execution, and AIoT-enabled dynamic score updates visualized via AWS Amplify likely contribute to this outcome by stabilizing movement patterns and providing immediate, intuitive feedback. These findings suggest that, for right-affected patients, the dynamic reaching task can serve as a viable home-based rehabilitation exercise with interpretable quality feedback that approximates clinical evaluation utility. However, the corresponding correlation is not confirmed for left-affected patients. Given that the cohorts consist of only 2 left-affected and 3 right-affected patients, the sample sizes are comparably small; thus, sample size alone cannot fully explain this discrepancy. Instead, this divergence is likely driven by two key clinical and biomechanical factors. First, there may be greater heterogeneity in the impairment severity and recovery stages among the left-affected patients. With such a restricted sample, extreme inter-subject variability can easily obscure statistical correlations. Second, the dynamic reaching task—which involves rapid target interception—may be inherently more sensitive to assessing the dominant side. Because all participants in this study (both healthy and post-stroke) were right-handed, the kinematic baselines established by the healthy right hands reflect dominant-limb motor control, which naturally exhibits superior coordination and responsiveness. Consequently, assessing the non-dominant (left) side using a highly dynamic task might introduce baseline variance related to natural non-dominance rather than strictly stroke-induced impairment. Future work must recruit a balanced cohort of left- and right-handed individuals and develop handedness-adjusted scoring models to decouple stroke impairment from natural limb dominance.

Limitations

However, several critical limitations of the proposed framework must be acknowledged, despite its demonstrated technical feasibility in side-specific modeling and IoT-integrated robotic assessment. First, the sample size of this pilot study is extremely limited (N = 5 patients; N = 3 healthy controls.) Consequently, the statistical power is constrained, and the generalizability of the clinical conclusions should be interpreted with caution. Second, there is a substantial age discrepancy between the young healthy control group and the older stroke patient cohort. Because advancing age independently influences motor kinematics—such as decreasing movement speed and smoothness—the current healthy reference templates may overestimate the degree of stroke-induced impairment. The present findings primarily validate the computational pipeline and system architecture. Future full-scale clinical trials must recruit larger, age-matched healthy control groups to establish rigorous and unbiased normative baselines for the GMM scoring function.

Regarding the reliability of the deep learning models and GMM, it is acknowledged that training complex architectures on small clinical cohorts poses a risk of limited generalizability. To address this within the present experimental design, segment-level kinematic data from high-frequency IoT sensor streams were utilized, effectively multiplying the training instances. Furthermore, the GMM was specifically adopted for the quality assessment stage because probability density functions are relatively robust in modeling multi-modal variability even when the data scale is moderate. Nevertheless, it is emphasized that the current machine learning models are presented to validate the computational feasibility of the proposed AIoT pipeline. Expanding the dataset through multi-center trials remains a critical future step to guarantee the generalizability of the trained network weights.

6. Conclusions

This paper presents a stroke rehabilitation exercise, termed the dynamic reaching task, for upper-limb motor training, and proposes an analysis framework tailored to the task characteristics. The framework emphasizes movement smoothness, first applying a multi-class classifier to identify the affected side (left-affected, right-affected, or healthy), and then quantifying movement quality using a Gaussian Mixture Model (GMM) with an interpretable scoring function. Experimental results show that the classifier achieves an F1-score of 0.95, and the quality scores for right-affected patients are supported by correlation analysis against therapist-defined gold standards, indicating feasibility for mild-stroke home-based rehabilitation with interpretable feedback.

Future work will recruit left-handed participants to improve generalizability, and will collect additional skeletal-joint data beyond the wrists to capture broader motion components and support richer probabilistic modeling of human-movement variability. Dimension-reduction methods such as autoencoder-based encoding [46] will also be explored to derive latent representations for more robust assessment. System extensions will integrate additional sensors (e.g., EMG) and refine robot-adaptive guidance to enhance personalization and practicality in home settings.

Author Contributions

Conceptualization, E.H.-K.W.; methodology, S.-C.Y.; software, C.-H.Y.; validation, C.-H.Y.; formal analysis, L.-H.T.; investigation, S.-C.Y.; resources, C.-H.C.; writing—original draft preparation, L.-H.T.; writing—review and editing, C.-H.C.; supervision, E.H.-K.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

Johnson, W.; Onuma, O.; Owolabi, M.; Sachdev, S. Stroke: A global response is needed. Bull. World Health Organ. 2016, 94, 634–634A. [Google Scholar] [CrossRef] [PubMed]
Doyle, K.P.; Simon, R.P.; Stenzel-Poore, M.P. Mechanisms of ischemic brain damage. Neuropharmacology 2008, 55, 310–318. [Google Scholar] [CrossRef] [PubMed]
Wolf, S.L.; Winstein, C.J.; Miller, J.P.; Taub, E.; Uswatte, G.; Morris, D.; Giuliani, C.; Light, K.E.; Nichols-Larsen, D. Effect of constraint-induced movement therapy on upper extremity function 3 to 9 months after stroke: The EXCITE randomized clinical trial. JAMA 2006, 296, 2095–2104. [Google Scholar] [CrossRef] [PubMed]
Suglia, V.; Brunetti, A.; Pasquini, G.; Caputo, M.; Marvulli, T.M.; Sibilano, E.; Della Bella, S.; Carrozza, P.; Beni, C.; Naso, D.; et al. A Serious Game for the Assessment of Visuomotor Adaptation Capabilities during Locomotion Tasks Employing an Embodied Avatar in Virtual Reality. Sensors 2023, 23, 5017. [Google Scholar] [CrossRef]
Lu, W.S.; Wang, C.H.; Lin, J.H.; Sheu, C.F.; Hsieh, C.L. The minimal detectable change of the simplified stroke rehabilitation assessment of movement measure. J. Rehabil. Med. 2008, 40, 615–619. [Google Scholar] [CrossRef]
Postolache, O.; Hemanth, D.J.; Alexandre, R.; Gupta, D.; Geman, O.; Khanna, A. Remote monitoring of physical rehabilitation of stroke patients using IoT and virtual reality. IEEE J. Sel. Areas Commun. 2021, 39, 562–573. [Google Scholar] [CrossRef]
Pollock, A.S.; Legg, L.; Langhorne, P.; Sellars, C. Barriers to achieving evidence-based stroke rehabilitation. Clin. Rehabil. 2000, 14, 611–617. [Google Scholar] [CrossRef]
Hendricks, H.T.; van Limbeek, J.; Geurts, A.C.; Zwarts, M.J. Motor recovery after stroke: A systematic review of the literature. Arch. Phys. Med. Rehabil. 2002, 83, 1629–1637. [Google Scholar] [CrossRef]
Jack, K.; McLean, S.M.; Moffett, J.K.; Gardiner, E. Barriers to treatment adherence in physiotherapy outpatient clinics: A systematic review. Man. Ther. 2010, 15, 220–228. [Google Scholar] [CrossRef]
Marchal-Crespo, L.; Reinkensmeyer, D.J. Review of control strategies for robotic movement training after neurologic injury. J. Neuroeng. Rehabil. 2009, 6, 20. [Google Scholar] [CrossRef]
Rodríguez-Hernández, M.; Criado-Álvarez, J.-J.; Corregidor-Sánchez, A.-I.; Martín-Conty, J.L.; Mohedano-Moriano, A.; Polonio-López, B. Effects of virtual reality-based therapy on quality of life of patients with subacute stroke: A three-month follow-up randomized controlled trial. Int. J. Environ. Res. Public Health 2021, 18, 2810. [Google Scholar] [CrossRef] [PubMed]
de Rooij, I.J.; van de Port, I.G.; Meijer, J.-W.G. Effect of Virtual Reality Training on Balance and Gait Ability in Patients With Stroke: Systematic Review and Meta-Analysis. Phys. Ther. 2016, 96, 1905–1918. [Google Scholar] [CrossRef] [PubMed]
Laver, K.E.; Lange, B.; George, S.; Deutsch, J.E.; Saposnik, G.; Crotty, M. Virtual reality for stroke rehabilitation. Cochrane Database Syst. Rev. 2017, 11, CD008349. [Google Scholar] [CrossRef] [PubMed]
Calabrò, R.S.; Naro, A.; Russo, M.; Leo, A.; De Luca, R.; Balletta, T.; Buda, A.; La Rosa, G.; Bramanti, A.; Bramanti, P. The role of virtual reality in improving motor performance as revealed by EEG: A randomized clinical trial. J. Neuroeng. Rehabil. 2017, 14, 53. [Google Scholar] [CrossRef]
Hayashi, Y.; Nagai, K.; Ito, K.; Nasuto, S.J.; Loureiro, R.C.; Harwin, W.S. A feasible study of EEG-driven assistive robotic system for stroke rehabilitation. In Proceedings of the IEEE RAS EMBS International Conference on Biomedical Robotics and Biomechatronics (BioRob), Rome, Italy, 24–27 June 2012; pp. 1733–1739. [Google Scholar] [CrossRef]
Lee, S.I.; Adans-Dester, C.P.; Grimaldi, M.; Dowling, A.V.; Horak, P.C.; Black-Schaffer, R.M.; Bonato, P.; Gwin, J.T. Enabling stroke rehabilitation in home and community settings: A wearable sensor-based approach for upper-limb motor training. IEEE J. Transl. Eng. Health Med. 2018, 6, 2100411. [Google Scholar] [CrossRef]
Panwar, M.; Biswas, D.; Bajaj, H.; Jöbges, M.; Turk, R.; Maharatna, K.; Acharyya, A. Rehab-Net: Deep learning framework for arm movement classification using wearable sensors for stroke rehabilitation. IEEE Trans. Biomed. Eng. 2019, 66, 3026–3037. [Google Scholar] [CrossRef]
Phienphanich, P.; Tankongchamruskul, N.; Akarathanawat, W.; Chutinet, A.; Nimnual, R.; Tantibundhit, C. Stroke screening feature selection for arm weakness using a mobile application. IEEE Access 2020, 8, 170898–170914. [Google Scholar] [CrossRef]
Lu, L.; Tan, Y.; Klaic, M.; Galea, M.P.; Khan, F.; Oliver, A.; Mareels, I.; Oetomo, D.; Zhao, E. Evaluating rehabilitation progress using motion features identified by machine learning. IEEE Trans. Biomed. Eng. 2021, 68, 1417–1428. [Google Scholar] [CrossRef]
Sanford, J.; Moreland, J.; Swanson, L.R.; Stratford, P.W.; Gowland, C. Reliability of the Fugl-Meyer Assessment for Testing Motor Performance in Patients Following Stroke. Phys. Ther. 1993, 73, 447–454. [Google Scholar] [CrossRef]
Wolf, S.L.; Lecraw, D.E.; Barton, L.A.; Jann, B.B. Forced use of hemiplegic upper extremities to reverse the effect of learned nonuse among chronic stroke and head-injured patients. Exp. Neurol. 1989, 104, 125–132. [Google Scholar] [CrossRef]
Lee, M.H.; Siewiorek, D.P.; Smailagic, A.; Bernardino, A.; Badia, S.B. Learning to assess the quality of stroke rehabilitation exercises. In Proceedings of the 24th International Conference on Intelligent User Interfaces (IUI ’19), Marina del Rey, CA, USA, 17–20 March 2019; pp. 218–228. [Google Scholar] [CrossRef]
Liao, Y.; Vakanski, A.; Xian, M.; Paul, D.; Baker, R. A review of computational approaches for evaluation of rehabilitation exercises. Comput. Biol. Med. 2020, 119, 103687. [Google Scholar] [CrossRef] [PubMed]
LeCun, Y.; Bengio, Y.; Hinton, G. Deep learning. Nature 2015, 521, 436–444. [Google Scholar] [CrossRef] [PubMed]
Suglia, V.; Palazzo, L.; Bevilacqua, V.; Passantino, A.; Pagano, G.; D’Addio, G. A Novel Framework Based on Deep Learning Architecture for Continuous Human Activity Recognition with Inertial Sensors. Sensors 2024, 24, 2199. [Google Scholar] [CrossRef] [PubMed]
Palazzo, L.; Suglia, V.; Grieco, S.; Buongiorno, D.; Brunetti, A.; Carnimeo, L.; Amitrano, F.; Coccia, A.; Pagano, G.; D’Addio, G.; et al. A Deep Learning-Based Framework Oriented to Pathological Gait Recognition with Inertial Sensors. Sensors 2025, 25, 260. [Google Scholar] [CrossRef]
Taylor, P.E.; Almeida, G.J.; Kanade, T.; Hodgins, J.K. Classifying human motion quality for knee osteoarthritis using accelerometers. In Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Buenos Aires, Argentina, 31 August–4 September 2010; pp. 339–343. [Google Scholar] [CrossRef]
Zhang, Z.; Fang, Q.; Wang, L.; Barrett, P. Template matching based motion classification for unsupervised post-stroke rehabilitation. In Proceedings of the International Symposium on Bioelectronics and Bioinformatics, Suzhou, China, 3–5 November 2011; pp. 199–202. [Google Scholar] [CrossRef]
Liao, Y.; Vakanski, A.; Xian, M. A deep learning framework for assessing physical rehabilitation exercises. IEEE Trans. Neural Syst. Rehabil. Eng. 2020, 28, 468–477. [Google Scholar] [CrossRef]
Capecci, M.; Ceravolo, M.G.; Ferracuti, F.; Iarlori, S.; Kyrki, V.; Monteriu, A.; Romeo, L.; Verdini, F. A hidden semi-Markov model based approach for rehabilitation exercise assessment. J. Biomed. Inform. 2018, 78, 1–11. [Google Scholar] [CrossRef]
Sassi, M.; Villa Corta, M.; Pisani, M.G.; Nicodemi, G.; Schena, E.; Pecchia, L.; Longo, U.G. Advanced Home-Based Shoulder Rehabilitation: A Systematic Review of Remote Monitoring Devices and Their Therapeutic Efficacy. Sensors 2024, 24, 2936. [Google Scholar] [CrossRef]
Kim, W.S.; Cho, S.; Baek, D.; Bang, H.; Paik, N.J. Upper Extremity Functional Evaluation by Fugl-Meyer Assessment Scoring Using Depth-Sensing Camera in Hemiplegic Stroke Patients. PLoS ONE 2016, 11, e0158640. [Google Scholar] [CrossRef]
Webster, D.; Celik, O. Experimental evaluation of Microsoft Kinect’s accuracy and capture rate for stroke rehabilitation applications. In Proceedings of the 2014 IEEE Haptics Symposium (HAPTICS), Houston, TX, USA, 23–26 February 2014; pp. 455–460. [Google Scholar] [CrossRef]
Pastor, I.; Hayes, H.A.; Bamberg, S.J.M. A feasibility study of an upper limb rehabilitation system using Kinect and computer games. In Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), San Diego, CA, USA, 28 August–1 September 2012; pp. 1286–1289. [Google Scholar] [CrossRef]
Choi, Y.-H.; Paik, N.-J. Mobile game-based virtual reality program for upper extremity stroke rehabilitation. J. Vis. Exp. 2018, 133, e56241. [Google Scholar] [CrossRef]
Lee, M.-M.; Lee, K.; Song, C. Game-based virtual reality canoe paddling training to improve postural balance and upper extremity function: A preliminary randomized controlled study of 30 patients with subacute stroke. Med. Sci. Monit. 2018, 24, 2590–2598. [Google Scholar] [CrossRef]
Otten, P.; Kim, J.; Son, S.H. A framework to automate assessment of upper-limb motor function impairment: A feasibility study. Sensors 2015, 15, 20097–20114. [Google Scholar] [CrossRef] [PubMed]
Biswas, D.; Cranny, A.; Gupta, N.; Maharatna, K.; Achner, J.; Klemke, J.; Jöbges, M.; Ortmann, S. Recognizing upper limb movements with wrist worn inertial sensors using k-means clustering classification. Hum. Mov. Sci. 2015, 40, 59–76. [Google Scholar] [CrossRef] [PubMed]
Del Din, S.; Patel, S.; Cobelli, C.; Bonato, P. Estimating Fugl-Meyer clinical scores in stroke survivors using wearable sensors. In Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Boston, MA, USA, 30 August–3 September 2011; pp. 5839–5842. [Google Scholar] [CrossRef]
Islam, S.M.R.; Kwak, D.; Kabir, M.H.; Hossain, M.; Kwak, K.-S. The Internet of Things for Health Care: A Comprehensive Survey. IEEE Access 2015, 3, 678–708. [Google Scholar] [CrossRef]
Zhang, Y.; Zhen, J.; Sun, S.; Liu, T.; Huo, L.; Wang, T. SCAFNet: A Semantic Compensated Adaptive Fusion Network for Remote Sensing Images Change Detection. IEEE Geosci. Remote Sens. Lett. 2026, 23, 6003405. [Google Scholar] [CrossRef]
Zhang, Y.; Wang, T.; Xue, L.; Lian, W.; Tao, R. ORSI Salient Object Detection via Progressive Interaction and Saliency-Guided Enhancement. IEEE Geosci. Remote Sens. Lett. 2026, 23, 6002105. [Google Scholar] [CrossRef]
Benettazzo, F.; Iarlori, S.; Ferracuti, F.; Giantomassi, A.; Ortenzi, D.; Freddi, A.; Monteriù, A.; Innocenzi, S.; Capecci, M.; Ceravolo, M.G.; et al. Low Cost RGB-D Vision Based System to Support Motor Disabilities Rehabilitation at Home. In Ambient Assisted Living; Andò, B., Siciliano, P., Marletta, V., Monteriù, A., Eds.; Springer: Cham, Switzerland, 2015; pp. 449–461. [Google Scholar] [CrossRef]
Antón, D.; Goñi, A.; Illarramendi, A. Exercise recognition for Kinect-based telerehabilitation. Methods Inf. Med. 2015, 54, 145–155. [Google Scholar] [CrossRef]
Yurtman, A.; Barshan, B. Automated evaluation of physical therapy exercises using multi-template dynamic time warping on wearable sensor signals. Comput. Methods Programs Biomed. 2014, 117, 189–207. [Google Scholar] [CrossRef]
Vakanski, A.; Ferguson, J.M.; Lee, S. Mathematical Modeling and Evaluation of Human Motions in Physical Therapy Using Mixture Density Neural Networks. J. Physiother. Phys. Rehabil. 2016, 1, 118. [Google Scholar] [CrossRef]
Archambault, P.; Pigeon, P.; Feldman, A.G.; Levin, M.F. Recruitment and sequencing of different degrees of freedom during pointing movements involving the trunk in healthy and hemiparetic subjects. Exp. Brain Res. 1999, 126, 55–67. [Google Scholar] [CrossRef]
Balasubramanian, S.; Melendez-Calderon, A.; Burdet, E. A robust and sensitive metric for quantifying movement smoothness. IEEE Trans. Biomed. Eng. 2012, 59, 2126–2136. [Google Scholar] [CrossRef]
Palazzo, L.; Suglia, V.; Grieco, S.; Buongiorno, D.; Pagano, G.; Bevilacqua, V.; D’Addio, G. Optimized Deep Learning-Based Pathological Gait Recognition Explored Through Network Analysis of Inertial Data. In Proceedings of the 2025 IEEE Medical Measurements & Applications (MeMeA), Chania, Greece, 28–30 May 2025; pp. 1–5. [Google Scholar] [CrossRef]

Figure 1. Design and context of the proposed system.

Figure 2. User interface of the dynamic reaching exercise. The VR interface displays the bilateral ball-catching task used for upper-limb rehabilitation. It provides real-time visual feedback, including the number of successful catches, failed attempts, and consecutive successful performances, allowing both patients and therapists to monitor task execution and short-term progress during training.

Figure 3. Schematic illustration of the dynamic reaching exercise. The participant performs a bilateral reaching movement to intercept a virtual ball following a parabolic trajectory. The task is designed to train upper-limb extension, postural control during reaching, and hand–eye coordination under configurable difficulty settings.

Figure 4. Overall workflow of the proposed analysis approach. After multimodal motion acquisition and preprocessing, smoothness-related kinematic features are extracted from synchronized Kinect and IMU data. These features are first used by a multi-class classifier to identify whether the participant is healthy, left-affected, or right-affected. The identified affected side then determines the corresponding side-specific GMM used for continuous movement quality quantification.

Figure 5. Architecture of LSTM network.

Figure 6. Flow diagram of the quality assessment approach.

Figure 7. Architecture of the side-specific Gaussian Mixture Model (GMM) and the subsequent scoring function for movement quality quantification. Healthy-subject feature data from the left or right side are used to train separate GMMs that model side-specific movement distributions. Patient feature data are then evaluated by negative log-likelihood with respect to the corresponding GMM, and the resulting values are transformed into a normalized 0–1 score through the scoring function, where higher scores indicate movement patterns closer to the healthy reference.

Figure 8. AWS-based cloud architecture for rehabilitation data processing and reporting. Motion data collected during the dynamic reaching exercise are uploaded to Amazon S3 and processed through AWS Lambda for transformation, analysis, and predictive modeling. The processed outputs are stored in Amazon RDS, DynamoDB, and S3, and are retrieved through EC2-based web services for therapist review, remote monitoring, and visualization of rehabilitation performance.

Figure 9. Comparison of F1-scores across conventional machine learning and deep learning models for affected-side identification. The figure compares the classification performance of DT, RF, SVM, NN, and LSTM models trained on the selected smoothness-related kinematic features. Among all evaluated models, the feed-forward neural network achieved the highest F1-score, indicating the best overall discrimination of healthy, left-affected, and right-affected participants.

Table 1. Demographic characteristics of participants.

Group	N	Age (Mean ± SD)	Gender (M/F)	Affected Region (L/R)	Mean Rehab Time (Months)
Healthy Control	3	24.6 ± 2.1	N/A	N/A	N/A
Stroke Patients	5	47.6 ± 21.7	0/5	2/3	6.6 ± 6.0

Note—Values are presented as mean ± standard deviation (SD).

Table 2. ANOVA-based feature selection results for left- and right-side kinematic features.

Metric Category	Kinematic Feature	Left Hand (p-Value)	Right Hand (p-Value)
Speed	Avg. Speed	<0.01 **	<0.01 **
	Max Speed	<0.01 **	<0.01 **
	Normalized Speed	<0.01 **	<0.01 **
	MAPR Speed	<0.01 **	<0.01 **
Acceleration	Avg. Acceleration	<0.01 **	<0.01 **
	Max Acceleration	<0.01 **	<0.01 **
	ZCR Acceleration	<0.01 **	<0.01 **
Jerk	Avg. Jerk	0.988	0.798
	Max Jerk	0.02 *	<0.01 **
	Normalized Jerk	0.218	0.869
	MAPR Jerk	<0.01 **	<0.01 **
	ZCR Jerk	<0.01 **	<0.01 **
Smoothness	IC	<0.01 **	<0.01 **
Smoothness	Spectral Arc-Length	<0.01 **	<0.01 **

Significance Level = 0.05, * p < 0.05 (significant) ** p < 0.01 (highly significant).

Table 3. Classification performance of multiple classifiers.

	DT	RF	SVM	NN	LSTM
F1-score	0.77	0.77	0.83	0.95	0.85

Table 4. The comparison across different combinations of literature-derived features, adapted from Lee et al. [22], Archambault et al. [47], and Balasubramanian et al. [48], is reported in Table 4.

Research Adoption	Lee et al. [22]	Lee et al. [22], Archambault et al. [47]	Lee et al. [22], Balasubramanian et al. [48]	Lee et al. [22], Archambault et al. [47], Balasubramanian et al. [48]
F1-score	0.9	0.88	0.93	0.95

Table 5. Correlation between the gold reference standard and quality scores.

Movement Performance	Gold Reference Standard
The affected region of the hand could be raised to 45 degrees	45
The affected region of the hand could be raised to 60 degrees	60
The affected region of the hand could be raised to a horizontal position	90
Improved muscle endurance and could be continuously active for 10 min	120
Improved muscle endurance and could be continuously active for 20 min	150
The affected region of the hand could be raised horizontally and stably	180
The affected region of the hand could naturally swing beyond the horizontal position	210

Table 6. The correlation between the gold reference standard and quality score.

Conditions	Correlation Coefficient	p-Value
Patients with the left affected region	0.167	0.623
Patients with the right affected region	0.632	0.02 *

Significance Level = 0.05, * p < 0.05 (significant).

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Chen, C.-H.; Tang, L.-H.; Yeh, C.-H.; Wu, E.H.-K.; Yeh, S.-C. A Cloud–Robot–Wearable System for Bilateral Reaching Rehabilitation: Affected-Side Identification and Quality Quantification. Electronics 2026, 15, 1459. https://doi.org/10.3390/electronics15071459

AMA Style

Chen C-H, Tang L-H, Yeh C-H, Wu EH-K, Yeh S-C. A Cloud–Robot–Wearable System for Bilateral Reaching Rehabilitation: Affected-Side Identification and Quality Quantification. Electronics. 2026; 15(7):1459. https://doi.org/10.3390/electronics15071459

Chicago/Turabian Style

Chen, Chia-Hau, Li-Hsien Tang, Chang-Hsin Yeh, Eric Hsiao-Kuang Wu, and Shih-Ching Yeh. 2026. "A Cloud–Robot–Wearable System for Bilateral Reaching Rehabilitation: Affected-Side Identification and Quality Quantification" Electronics 15, no. 7: 1459. https://doi.org/10.3390/electronics15071459

APA Style

Chen, C.-H., Tang, L.-H., Yeh, C.-H., Wu, E. H.-K., & Yeh, S.-C. (2026). A Cloud–Robot–Wearable System for Bilateral Reaching Rehabilitation: Affected-Side Identification and Quality Quantification. Electronics, 15(7), 1459. https://doi.org/10.3390/electronics15071459

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Cloud–Robot–Wearable System for Bilateral Reaching Rehabilitation: Affected-Side Identification and Quality Quantification

Abstract

1. Introduction

2. Related Work

2.1. Technology-Assisted Stroke Rehabilitation Systems

2.2. Discrete Motor Assessment Approaches

2.3. Continuous Movement Quality Modeling

3. Method

3.1. Participants

3.2. Rehabilitation System Design

3.2.1. System Introduction

3.2.2. Task Content

3.2.3. Difficulty Design Mechanism

3.2.4. Experimental Rehabilitation Platform

3.3. Analysis Approach

3.3.1. Feature Extraction

3.3.2. Feature Selection

3.3.3. Discrete Score Movement-Based Multi-Classifier

3.3.4. Template-Based Assessment Approach

3.3.5. Scoring Function

3.4. Amazon Web Services

3.5. Statistical Analysis

4. Results

4.1. One-Way ANOVA for Feature Selection

4.2. Machine Learning-Based Multi-Classification

4.3. Quality Assessment Framework

5. Discussion

Limitations

6. Conclusions

Author Contributions

Funding

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI