1. Introduction
Spinocerebellar ataxia (SCA) is characterized by motor dysfunction caused by degenerative changes in the cerebellum [
1]. SCA-related motor dysfunction often manifests as abnormal limb coordination, speech difficulties, oculomotor abnormalities, and gait disorder, with impaired postural control during walking or standing [
1,
2,
3,
4]. Over time, several motor tasks were designed and standardized to evaluate these motor disturbances in terms of accuracy, timing, rhythmicity, and stability, and corresponding impairments were termed dysmetria, dyssynergia, and dysrhythmia [
5]. Among current methods to assess SCA, the Scale for the Assessment and Rating of Ataxia (SARA) is widely used and includes the finger chasing test (FCT), finger-to-nose test (FNT), and alternating hand movement assessment for dysdiadochokinesia (DDKT) [
2,
6]. In addition to the SARA, other assessment scales, including the Brief Ataxia Rating Scale (BARS) and the International Cooperative Ataxia Rating Scale, can effectively determine specific aspects of disease severity. However, these scales must be administered by a neurologist with particular expertise in ataxia and are subjective [
5,
7,
8]. For instance, the SARA stipulates the overshoot or undershoot distance between the patient’s finger and the clinician’s finger in the FCT. This evaluation and scoring system is prone to bias and inaccurate assessment owing to its subjective evaluation, and it is potentially compounded if the same clinician does not consistently evaluate the patient. The objective measurements can provide more consistent evaluations and are crucial when specialists are unavailable.
Moreover, objective measures may reveal patterns of motor control dysfunction below the threshold of detection even by subspecialty-trained clinical ataxiologists. More specifically, quantitative assessments in pre-symptomatic or early disease stages in neurodegenerative diseases may detect subtle changes in motor abilities that would not be sufficient to change the ordinal rating scale score from one severity level to the next on available clinical rating scales [
9,
10,
11]. Therefore, such an approach may facilitate repeated assessments that can more accurately and reliably detect changes over time. This can lead to improved outcome measures and compensates for the impact of biological variability (i.e., random fluctuations in motor response within the subject) in single time-point assessments. Several studies have proposed the objective assessment of ataxia [
12,
13]. For instance, a push-button system to evaluate the variation in the timing of ataxic movements was considered for the finger tapping test [
14], or using optoelectronic devices (e.g., VICON, Kinect, or video camera) to assess delay in initiating movement and accuracy in reaching the target. Recently, inertial measurement units (IMUs) were used to quantify the FNT [
3,
4] and DDKT [
15] performance. The working principle of these IMUs is based on estimating movement using onboard sensors, including an accelerometer, magnetometer, or gyroscope. The benefit of IMUs over other sensing modalities is the precise measurement of angular acceleration and velocity with minimal preparation and no interference with the motor examination.
Although these systems can provide the intended information for specific disease severity aspects, they have their limitations. For instance, using a system consisting of special video cameras, touch screens, and a full-size computer to sync various components to track a patient’s finger trajectory in the FCT may provide precise quantification of movement dynamics. However, it still requires the patients to travel to a clinic because such a data acquisition system is neither widely accessible nor easy to operate for non-specialists. The other challenge, especially in remote assessments, is to devise an automated technique in all its subcomponents. For instance, the average cycle time for a cyclic test such as the FNT, which is the focus of this study, is a widely used feature to analyze the results of this test. An objective assessment algorithm should detect cycles to extract relevant features from the associated data within each cycle and predict the severity score. Such a comprehensive procedure represents an important gap in the literature.
The objectives of this study were to develop a wearable-based solution that can objectively assess upper-extremity motor symptoms using wearable sensors. In addition, we investigated possible redundancies in the upper-extremity SARA assessments. In doing so, we made the following contributions: First, we developed an algorithm for detecting/extracting the cycles of the FNT. We also extracted multiple features from the detected cycles, investigated their correlation with the SARA scores, and developed models to predict the severity of symptoms based on the FNT. Second, we studied the dependency between the upper-extremity SARA tests through correlation analyses and building predictive models to estimate the severity of symptoms based on the FCT and DDKT using the extracted features from the FNT. Our findings pave the way for the enhancement of the utility of objective measures of SCA assessments using wearable sensors and complement our previous study focused on objective assessments of gait and balance in SCA [
11].
2. Materials and Methods
2.1. Participants
This study protocol was approved by the Partners HealthCare System Institutional Review Board. All participants signed the informed consent form before participating in the study. Seventeen participants were recruited from the Massachusetts General Hospital (MGH) Ataxia Center and studied at the MGH Neurological Clinical Research Institute (NCRI). Fourteen patients with a genetically confirmed diagnosis of SCA (SCA1: n = 1, SCA2: n = 3, SCA3: n = 5, SCA6: n = 5) and three healthy controls were enrolled. A subspecialty ataxiologist (JDS) determined the eligibility for the study participation. Exclusion criteria were less than 18 or more than 75 years of age, inability to walk independently even with a walker (SARA or BARS gait score > 6), inability to comply with all study activities, or unwillingness to provide informed consent. The inability to walk was an exclusion criterion because this study was part of a comprehensive investigation of both upper and lower limb dysfunction and gait.
2.2. Clinical Assessments
During the clinical visit, participants underwent a standardized ataxia evaluation using the SARA and BARS version 2 (BARS2, St. Petersburg, Russia) which utilizes half-points according to designated descriptors included in the scale [
7,
9]. All participants were equipped with 2 IMUs (LEGSys
TM, BioSensics, Newton, MA, USA) on their wrists during the assessments (
Figure 1). Each sensor consists of a triaxial accelerometer and a triaxial gyroscope with a sampling frequency of 100 Hz. The dynamic ranges of the accelerometer and gyroscope are, respectively, ±2 g and ±2000 deg/s, where g is the gravitational acceleration. The sensors were placed with the help of the research staff, and two clinical assessments of SARA and BARS2 were performed, with a duration of 30 min in-between. The sensors were secured over the wrist using elastic Velcro to avoid any shift in the sensor placement. The assessments were also video recorded. The SARA and BARS2 scores were conducted and rated in person by a neurologist (J.D.S). The video recordings were later reviewed and scored according to the BARS2 and SARA by two additional ataxia specialists (C.D.S, A.S.G) to examine inter-rater reliability. Each rater was blinded to the scores given by other raters.
This study only assessed the upper-extremity SARA assessment of the FNT, FCT, and DDKT. The FNT measures how smooth and coordinated the upper-extremity movement is by having the subject perform a cyclic task. In this task, the clinician holds his/her finger at 90% of their arm’s length from the patient, and the patient is instructed to touch the clinician’s finger and then his/her nose in several consecutive trials. For the FCT, the clinician performs five consecutive sudden and fast pointing movements in unpredictable directions in a frontal plane. The amplitude and frequency of the movements are, respectively, about 30 cm and 1 movement per 2 s. The patient is asked to follow the movements with his/her index finger as fast and precisely as possible, and the average performance of the last three iterations is rated. Finally, the DDKT measures how quickly and accurately one can repeat a rapid alternating pronation/supination task by asking the patient to tap the palm of one hand to his/her thigh, raise and flip the hand, tap the back of the hand to the thigh, raise and flip the hand, and repeat this cycle ten times with the regularity of the movement and time taken to perform 10 iterations recorded.
2.3. FNT Cycle Detection Algorithm
In this study, we focused on the cyclic FNT task. The objective of the proposed algorithm was to extract each cycle of the FNT from the sensor-derived kinematic signal. The proposed algorithms detect each cycle and identify different phases within each cycle as follows:
Phase 1 (Decline): from the time instant that the patient’s finger starts moving from the clinician’s finger until reaching their nose.
Phase 2 (Pause at the nose): the period between the instant that the patient’s finger reaches their nose until they start moving towards the clinician’s finger.
Phase 3 (Rise): from the instant that the patient’s finger starts moving from their nose until they reach the clinician’s finger.
Phase 4 (Pause at finger): the period between the instant that the patient’s finger reaches the clinician’s finger until they start moving towards their nose.
The duration of phases 2 and 3 may be zero if the participant does not pause when they reach their nose or the clinician’s finger. The segmented signal is then used to extract features dependent on the disease severity.
We used the Euler angles to extract the four phases of each cycle derived from the quaternion values provided by the sensors. First, the sensor quaternion is calibrated for the local fixed frame of the clinician’s finger, and then the Euler angles are estimated using the calibrated quaternion. A calibrated time series for the right hand of a healthy individual and a participant with SCA is shown in
Figure 2a. Given that the quaternions were calibrated at the clinician’s finger and assuming that the finger does not move during the test, the angle takes its maximum and minimum values when the participant reaches his nose and the clinician’s finger, respectively. These local extrema are easy to capture for the healthy participant because each cycle contains only one maximum and one minimum. However, multiple local extrema may exist in the angle time history of each FNT cycle for the participant with SCA.
To further clarify this, consider the angle signal of one FNT cycle for an SCA participant in
Figure 2a, which is magnified in
Figure 2b. Instead of a single maximum or minimum, we observe peaks and valleys with multiple local extrema. The valley starts at point A where the participant is at the clinician’s finger. There is no significant change in angle from point A to B, where the valley ends. This part is associated with the pause at the clinician’s finger, i.e., phase 4, and the local extrema between A and B are due to tremor. A rapid change in angle starts at B and ends at C, corresponding to the movement between the clinician’s finger and the participant’s nose, i.e., phase 1. The CD segment is similar to AB but occurs at the participant’s nose, i.e., phase 2.
Finally, there is a rapid change in angle from D, which corresponds to phase 3, until the participant reaches the clinician’s hand again at E. Therefore, by detecting the A, B, C, D, and E points in each cycle of the signal, the four phases of the FNT motion can be detected. Note that the direction of rotation from finger to nose is counterclockwise for the right hand, while it is clockwise for the left hand; thus, the left-hand measurements can be similarly analyzed after changing the sign of the Euler angles.
2.4. Signal Processing
2.4.1. Data Pre-Processing
The measured signals were filtered with a sixth-order band-pass Butterworth filter with cut-off frequencies of 0.2 and 20 Hz. The lower bound cut-off was used to minimize the drift effects, and the exclusion of high-frequency components was due to the restriction in the maximum frequency that can be generated by human movements [
16].
2.4.2. Feature Extraction
Two types of features are extracted from the sensor data. The first type is extracted from each FNT cycle and was extracted using the cycle detection algorithm, and the second type of features is extracted from the entire signal of a complete FNT task. We only considered the clinically relevant features and categorized them into five phenotypes: timing, speed, variability, rhythmicity, and steadiness. All features considered in this study are summarized in
Table 1. In this table, |.| is the absolute value; motion intensity (MI) of the angular velocity (AV) and linear acceleration (LA) are
where
x,
y, and
z are displacement components along the sensor’s local axes, the double dot is the second time derivative, (
θ) is the angular velocity, its subscripts represent the axis of rotation, and a single dot is the first time derivative. The speed metric is defined as the mean value of speed divided by the maximum speed [
17], where speed can be any member of
. The jerk index is the root-mean-square value of the derivative of acceleration data, i.e., the jerk, normalized by the maximum value of the velocity data [
1,
15].
Extracting these features from the angular velocity and linear acceleration in all directions and the motion intensities results in 140 features. Note that there are two types of features in general; the first type has a single value for an entire FNT, such as the maximum and range of AV, and the second type has a single value for each cycle and, hence, multiple values for each test, such as the cycle time. Herein, we call the second type of features cycle-based features. To combine the two types of features, first, an interquartile range (IQR) outlier detection was used to remove the outliers from the cycle-based features. The IQR, denoted by , is the range between the first and the third quartiles, namely, and , i.e., . The data points which fall outside of interval were considered outliers and removed. The average of the remaining values, along with the features extracted from the entire signal, are considered the feature vector for each test.
4. Discussion
We have developed a wearable-based solution for the objective assessment of the upper-extremity SARA test. Seventeen participants were recruited from the MGH Ataxia Center and studied in the MGH NCRI. Participants underwent a standardized ataxia evaluation using the SARA and BARS2 while wearing a single sensor on each wrist. We observed excellent inter-rater reliability for the SARA and BARS2 for all participants among the three raters, one of whom rated the participants in person and the others rated based on videos (
Table 2). These results show consistency in the target class ratings.
A cycle detection algorithm was developed for extracting the cycles of the FNT from the sensor measurements and segmenting each cycle data into four main phases, i.e., decline, pause at the nose, rise, and pause at the finger. The algorithm extracted 140 features from the segmented and full sensor measurements. The estimated features were correlated with the SARA scores and could accurately discriminate the control and SCA severity groups related to the FNT. The benefit of the proposed objective assessment over conventional subjective evaluation includes the precise measurement of movement. Due to high sensitivity in tracking hand movement and detecting smaller vibrations (i.e., tremors), the proposed solution may identify subtle motor symptoms, which are often challenging to detect visually.
Unlike previous studies that used wearable-based validated gait assessment tools to assess SCA severity [
11,
21], the present study focused on upper-extremity motor examination by breaking down arm movement into discrete subcomponents using the wearable. This is a novel and valuable approach for representing and quantifying the ataxia phenotype, as gait assessment can only be administered for ambulatory individuals and requires dedicated facilities (e.g., an uninterrupted walkway or trained staff to ensure safety). The present study would contribute to validating the wearable-based upper-extremity motor assessment to assess SCA severity.
Furthermore, the dependency of the upper-extremity SARA tests was investigated via training machine learning algorithms to predict the SARA scores related to the FCT and DDKT using the features obtained from the FNT. The high accuracy of the SVC with the nonlinear RBF kernel implies that the upper-extremity SARA tests are dependent, although their linear correlation may be insignificant. The results suggest that the number of motor tasks could be potentially reduced while estimating the upper-extremity motor symptoms with acceptable accuracy. A reduced number of motor tasks may improve the feasibility of using the platform in a non-clinical setting and for remote patient monitoring. Digital health technologies are poised to become an integral part of modern health care, expedited by the surge in remote care necessitated by the COVID-19 pandemic [
22,
23,
24,
25,
26,
27,
28,
29,
30,
31]. The proposed solution paves the way to integrating digital health technology (i.e., wearables) to enhance the utility of objective measures of ataxia.
Previously, Kasyap et al. (2020) proposed a comprehensive approach to the evaluation of cerebellar ataxia by objectively assessing five domains (speech, upper limb, lower limb, gait, and balance) through the instrumented versions of nine bedside neurological tests using wearable IMUs and microphones [
3]. Similarly, Tran et al. (2020) also proposed a comprehensive scheme for objective upper-body assessments of subjects with cerebellar ataxia [
5]. Tran et al. used an IMU and a Kinect camera system to evaluate the FNT, DDKT, and FCT performance. While our results were comparable to these studies, we also addressed the redundancy of the upper-extremity assessment tasks and primarily focused on parameters estimated from the FNT. In addition, Oubre et al. extracted the movement elements from the participants’ accelerometry data during the FNT and employed machine learning algorithms. The classification models distinguished between ataxia and healthy controls, and ataxia and Parkinsonism phenotypes with areas under the receiver–operating curve of 0.96 and 0.86, respectively [
13]. These findings support the evolving recognition that wearable-based systems may be helpful in managing movement disorders in the clinic or in home settings and potentially reduce observer bias in clinical trials. Consequently, the frequency of assessments may increase, which may compensate for the biological variability at single time points (same day, different times of day, weeks/months apart) in clinical care and clinical trial design.
A major limitation of the present study is the small sample size. While the high-performance accuracy of the current models is encouraging, the results need to be confirmed in a larger dataset. Furthermore, the models were trained on trial 1 and tested on trial 2. Since we have the same participants in trial 1 and trial 2, the model might have learned individual characteristics in addition to disease characteristics. A further limitation was the absence of late-stage severe SCA patients in whom somatosensory impairment, vestibular involvement, or other central nervous systems (CNS) lesions may contribute to the overall disability [
5]. It is important to establish whether the factors differentiating SCA associated with other neural lesions might differ from “pure” cerebellar ataxia to produce a more precise means of assessing ataxia. This would be a subject of future studies, which can also compare the platform’s accuracy in differentiating ataxia from other neurodegenerative diseases (e.g., Parkinson’s disease). Similarly, more severe ataxia reflected by SARA FNT scores > 3 would be important in future studies. The benefit of including individuals with a low to moderate level of SCA clinical severity is to highlight the finding that objective assessment using the proposed platform can perhaps detect SCA in the early stages.