A Non-Invasive Gait-Based Screening Approach for Parkinson’s Disease Using Time-Series Analysis
Abstract
1. Introduction
1.1. Research Gap and Objectives
- To extract time-series data of key body joint movements from videos of individuals walking.
- To apply and compare several well-established time-series analysis techniques—Dynamic Time Warping (DTW), Bag-of-Patterns (BoP), Symbolic Aggregate Approximation (SAX), and Shapelet-based Classification—for distinguishing between healthy and PD gait patterns.
- To evaluate the classification performance of these techniques using both a simple distance-based classifier (K-Nearest Neighbors) and a more complex temporal model (Long Short-Term Memory network), thereby assessing the trade-offs between model complexity and performance on this specific task.
1.2. Related Work
1.2.1. Sensor-Based Approaches
1.2.2. Vision-Based Approaches
2. Materials and Methods
2.1. Data Collection
2.1.1. Self-Collected Dataset (MMU Parkinson Disease Dataset)
- Ethical Approval and Informed Consent: The data collection protocol received full ethical approval from the Multimedia University Research Ethics Committee (Approval Number: EA0422022). All participants, including healthy controls and patients with PD, provided written informed consent prior to their involvement. The research objectives, procedures, and voluntary nature of participation were thoroughly explained, and any questions were addressed before the recording session.
- Participant Recruitment and Demographics: A total of 43 video instances were collected. The cohort included 35 healthy individuals recruited at Multimedia University (MMU) and 8 patients with a formal clinical diagnosis of PD from the University of Malaya Medical Centre (UMMC). The participant pool was diverse, aged between 20 and 80 years, and included individuals from the primary ethnic groups in the region (Malay, Chinese, and Indian). The healthy group was further divided into a young group (n = 21) and an elderly group (n = 22) to account for age-related variations in gait.
- Data Collection Protocol: The data acquisition setting is depicted in Figure 2. All participants performed the Timed Up and Go (TUG) test [27], a standardized clinical assessment of mobility and fall risk. Participants were instructed to rise from a standard armchair, walk a distance of 3 m at a comfortable pace, turn around, walk back to the chair, and sit down (refer Figure 3). The entire sequence was recorded using a high-quality camera (1080p resolution and 30 frames per second) from both frontal and side perspectives to ensure a comprehensive capture of movement dynamics. Some example captures of the TUG test are illustrated in Figure 4.
- Data Collection Challenges: Several challenges were encountered during data collection. Some healthy participants expressed hesitation due to shyness, while some families required reassurance regarding the study’s legitimacy. Among the elderly cohort, pre-existing health conditions were a common reason for non-participation. Additionally, logistical challenges such as space constraints in participants’ homes required cooperation from families to create a suitable environment for the TUG test.
2.1.2. Online Dataset Curation and Ethical Considerations
- Dataset Sourcing and Labeling: To test model robustness on highly variable data, an online dataset was compiled by sourcing 294 videos from YouTube using search terms such as “Parkinson’s walk,” “PD gait”, and “Parkinson’s patient walking.” The videos were labeled as ‘PD’ (n = 144, subdivided into mild, moderate, and severe where specified) or ‘healthy’ (n = 150) based on information explicitly provided by the original uploaders in the video title, description, or on-screen text. Some example captures of the online dataset are depicted in Figure 5.
- Screening and Preprocessing: From the initial 294 videos, a significant portion was excluded during a rigorous screening process. Exclusion criteria included
- Poor video quality (e.g., low resolution, excessive motion blur, or unstable camera).
- Presence of confounding factors that would interfere with gait analysis (e.g., subjects receiving physical assistance and use of walking aids like canes or walkers).
- Video segments being too short to capture a full gait cycle.
- Obscured views of the lower limbs. After this screening, 59 videos were deemed suitable for the automated analysis pipeline.
- Ethical Considerations: The use of publicly available data from platforms like YouTube for medical research requires careful ethical consideration [28]. While the data are public, individuals may not have explicitly consented to their videos being used for research purposes. In accordance with ethical guidelines, we have taken the following steps:
- No attempt was made to identify the individuals in the videos.
- All data were anonymized during analysis.
- We only report aggregated results to protect privacy.
- We acknowledge that this approach has significant limitations, including the lack of verified clinical diagnoses, unknown medication status, and an inherent self-selection bias in the data. These limitations are discussed further in Section 4.1. This portion of the study is intended as an exploratory analysis to test model robustness on highly variable data, rather than for drawing clinical conclusions.
2.2. Video Enhancement
2.3. Body Key Points Estimation
2.4. Time-Series Data Preprocessing
2.4.1. Dynamic Time Warping (DTW)
2.4.2. Shapelet-Based Classification
2.4.3. Bag-of-Patterns (BoP)
2.4.4. Symbolic Aggregate ApproXimation (SAX)
- represent the symbolic representation of the series;
- represent the threshold;
- represents the corresponding symbol.
2.5. Classifier Implementation and Hyperparameter Tuning
3. Results
3.1. Performance on the Self-Collected Dataset
3.2. Online Dataset
3.3. Computational Performance
4. Discussion
- The Efficacy of Feature Engineering vs. End-to-End Learning: A central finding of this study is the superior performance of the KNN classifier over the LSTM network, especially on the more challenging online dataset (e.g., 71.19% accuracy for KNN-DTW vs. 57.63% for LSTM-DTW). This result is somewhat counterintuitive, as LSTMs are specifically designed to capture complex temporal dependencies. We hypothesize this discrepancy stems from two primary factors. First, the limited size and high noise level of our datasets were likely insufficient for the LSTM to learn robust, generalizable patterns, leading to overfitting. Second, methods like DTW and BoP perform an implicit and powerful form of feature engineering. DTW’s alignment process effectively normalizes temporal variations (e.g., walking speed), while BoP’s symbolic representation quantifies the frequency of characteristic sub-patterns. This preprocessed, feature-rich input appears to be more suitable for a simple distance-based classifier like KNN than the raw (or minimally processed) time series fed to the LSTM. This aligns with recent findings suggesting that, for smaller, noisy clinical datasets, models built on carefully engineered, interpretable features can outperform complex “black box” models.
- Performance Discrepancy, Overfitting, and False Positives: The marked drop in performance from the self-collected dataset to the online dataset highlights the critical challenge of model generalization and suggests significant overfitting on the training data. The high accuracy on the self-collected test set, coupled with low precision and recall, indicates that the models likely memorized patterns from the small and highly imbalanced training set (35 healthy vs. 8 PD). This led to a high number of false positives, as the models struggled to generalize to the wider variety of gaits present in the unseen healthy controls. The BoP-KNN combination was a notable exception on the online dataset, likely because the BoP method abstracts the time series into a simpler, noise-resistant histogram of patterns, which is less susceptible to overfitting than methods that analyze the full temporal sequence.
- Comparison with State of the Art: While our approach is not directly comparable to all studies due to different datasets and tasks, our best-performing models achieve accuracies that are in a similar range to some other non-deep-learning and early deep learning approaches reported in the literature. However, recent studies using more advanced deep learning architectures, such as Spatio-Temporal Graph Convolutional Networks (ST-GCNs), have reported accuracies exceeding 93% on gait classification tasks [5]. This suggests that, while our classical time-series approach is a feasible proof-of-concept, significant improvements in accuracy would likely require more sophisticated models that can better leverage the spatiotemporal structure of the human skeleton.
4.1. Limitations
- Dataset Size and Composition: The most significant limitation is the small size and imbalanced nature of our datasets. The self-collected dataset included only 8 PD patients, which severely limits the statistical power and generalizability of the findings. The online dataset, while larger initially, was reduced to 59 usable videos after screening, and suffered from a lack of clinical verification and demographic information.
- Validation Method: The use of a simple 80/20 train–test split, while acceptable for a preliminary study, is not as robust as methods like k-fold cross-validation, especially given the small sample size. This may have contributed to the observed overfitting.
- Ethical and Qualitative Constraints of Public Data: The use of uncontrolled YouTube data introduces significant ethical and qualitative issues, including the absence of informed consent and verified diagnoses. This dataset should be viewed as a tool for testing model robustness against noise, not for clinical validation.
- Pose Estimation Errors: While AlphaPose is a state-of-the-art tool, errors in key point estimation (e.g., misplacements and jitter) can occur, especially in videos with poor lighting, unusual camera angles, or occlusions. These errors introduce noise into the kinematic time series, which can negatively impact classification performance.
- Simplified Model Architectures: The classifiers used (KNN and a basic LSTM) are relatively simple. We did not explore more advanced architectures or optimization techniques that might yield higher performance.
4.2. Future Research
- Dataset Expansion and Robust Validation: The highest priority is the collection of a larger, more diverse, and well-annotated dataset through multi-site collaborations. Future analyses must also incorporate more robust validation techniques, such as k-fold cross-validation, to ensure the generalizability and statistical significance of the findings.
- Data Augmentation: We will explore data augmentation techniques to synthetically increase the size and variability of our training data. Simple yet effective methods for time-series augmentation include jittering (adding random noise), scaling (altering signal magnitude), and time warping (stretching or compressing the time axis) [31]. For greater biomechanical fidelity, future work could investigate advanced physics-based simulation frameworks like OpenSIM to generate realistic, novel gait sequences [32,33].
- Exploration of Advanced Models: Future work should evaluate more complex and domain-specific models. Spatio-Temporal Graph Convolutional Networks (ST-GCNs) are particularly promising, as they operate directly on skeletal graph data [5]. Additionally, benchmarking against state-of-the-art architectures like Vision Transformers (ViTs), which have shown excellent performance in pattern recognition, will be crucial [34].
- Multimodal Data Fusion: To create a more comprehensive diagnostic model, we plan to integrate our gait analysis with other data modalities. Combining video-based gait features with data from voice analysis, handwriting analysis, or wearable sensors could provide a more holistic and robust assessment of PD motor symptoms.
- Clinical Validation: Ultimately, any proposed model must be validated in a real-world clinical setting against neurologist ratings on a large, prospective patient cohort. This is the critical next step to translate this proof-of-concept research into a clinically meaningful tool.
4.3. Clinical Significance and Translational Potential
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Goetz, C.G.; Tilley, B.C.; Shaftman, S.R.; Stebbins, G.T.; Fahn, S.; Martinez-Martin, P.; Poewe, W.; Sampaio, C.; Stern, M.B.; Dodel, R.; et al. Movement Disorder Society-sponsored Revision of the Unified Parkinson’s Disease Rating Scale (MDS-UPDRS): Scale Presentation and Clinimetric Testing Results. Mov. Disord. 2008, 23, 2129–2170. [Google Scholar] [CrossRef]
- Rizzo, G.; Copetti, M.; Arcuti, S.; Martino, D.; Fontana, A.; Logroscino, G. Accuracy of Clinical Diagnosis of Parkinson Disease. Neurology 2016, 86, 566–576. [Google Scholar] [CrossRef]
- Franco, A.; Russo, M.; Amboni, M.; Ponsiglione, A.M.; Di Filippo, F.; Romano, M.; Amato, F.; Ricciardi, C. The Role of Deep Learning and Gait Analysis in Parkinson’s Disease: A Systematic Review. Sensors 2024, 24, 5957. [Google Scholar] [CrossRef]
- Alarfaj, M.; Al Madini, A.; Alsafran, A.; Farag, M.; Chtourou, S.; Afifi, A.; Ahmad, A.; Al Rubayyi, O.; Al Harbi, A.; Al Thunaian, M. Wearable Sensors Based on Artificial Intelligence Models for Human Activity Recognition. Front. Artif. Intell. 2024, 7, 1424190. [Google Scholar] [CrossRef] [PubMed]
- Wu, J.; Su, N.; Li, X.; Yao, C.; Zhang, J.; Zhang, X.; Sun, W. Insights into Gait Performance in Parkinson’s Disease via Latent Features of Deep Graph Neural Networks. Front. Neurol. 2025, 16, 1567344. [Google Scholar] [CrossRef] [PubMed]
- Alzubaidi, L.; Zhang, J.; Humaidi, A.J.; Al-Dujaili, A.; Duan, Y.; Al-Shamma, O.; Santamaría, J.; Fadhel, M.A.; Al-Amidie, M.; Farhan, L. Review of Deep Learning: Concepts, CNN Architectures, Challenges, Applications, Future Directions. J. Big Data 2021, 8, 53. [Google Scholar] [CrossRef] [PubMed]
- Amann, J.; Vetter, D.; Blomberg, S.N.; Christensen, H.C.; Coffee, M.; Gerke, S.; Gilbert, T.K.; Hagendorff, T.; Holm, S.; Livne, M.; et al. To Explain or Not to Explain?—Artificial Intelligence Explainability in Clinical Decision Support Systems. PLOS Digit. Health 2022, 1, e0000016. [Google Scholar] [CrossRef]
- Iwana, B.K.; Uchida, S. An Empirical Survey of Data Augmentation for Time Series Classification with Neural Networks. PLoS ONE 2021, 16, e0254841. [Google Scholar] [CrossRef]
- Ashfaq, N.; Khan, M.H.; Nisar, M.A. Identification of Optimal Data Augmentation Techniques for Multimodal Time-Series Sensory Data: A Framework. Information 2024, 15, 343. [Google Scholar] [CrossRef]
- Di Biase, L.; Di Santo, A.; Caminiti, M.L.; De Liso, A.; Shah, S.A.; Ricci, L.; Di Lazzaro, V. Gait Analysis in Parkinson’s Disease: An Overview of the Most Accurate Markers for Diagnosis and Symptoms Monitoring. Sensors 2020, 20, 3529. [Google Scholar] [CrossRef]
- Tahir, A.M.; Chowdhury, M.E.H.; Khandakar, A.; Al-Hamouz, S.; Abdalla, M.; Awadallah, S.; Reaz, M.B.I.; Al-Emadi, N. A Systematic Approach to the Design and Characterization of a Smart Insole for Detecting Vertical Ground Reaction Force (vGRF) in Gait Analysis. Sensors 2020, 20, 957. [Google Scholar] [CrossRef]
- Escamilla-Luna, O.; Wister, M.A.; Hemandez-Torruco, J. Classification algorithms for analyzing parkinson’s disease patient. In Proceedings of the 2022 30th International Conference on Software, Telecommunications and Computer Networks SoftCOM, Split, Croatia, 22–24 September 2022. [Google Scholar] [CrossRef]
- Balaji, E.; Brindha, D.; Elumalai, V.K.; Vikrama, R. Automatic and Non-Invasive Parkinson’s Disease Diagnosis and Severity Rating Using LSTM Network. Appl. Soft Comput. 2021, 108, 107463. [Google Scholar] [CrossRef]
- Hssayeni, M.D.; Adams, J.L.; Ghoraani, B. Deep Learning for Medication Assessment of Individuals with Parkinson’s Disease Using Wearable Sensors. In Proceedings of the 2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Honolulu, HI, USA, 18–21 July 2018; pp. 1–4. [Google Scholar] [CrossRef]
- Prince, J.; De Vos, M. A Deep Learning Framework for the Remote Detection of Parkinson’s Disease Using Smart-Phone Sensor Data. In Proceedings of the 2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Honolulu, HI, USA, 18–21 July 2018; pp. 3144–3147. [Google Scholar] [CrossRef]
- Ajay, J.; Song, C.; Wang, A.; Langan, J.; Li, Z.; Xu, W. A Pervasive and Sensor-Free Deep Learning System for Parkinsonian Gait Analysis. In Proceedings of the 2018 IEEE EMBS International Conference on Biomedical & Health Informatics (BHI), Las Vegas, NV, USA, 4–7 March 2018; pp. 108–111. [Google Scholar] [CrossRef]
- Panda, A.; Bhuyan, P. Gait Data-Driven Analysis of Parkinson’s Disease Using Machine Learning. ResearchGate 2025, 10. [Google Scholar] [CrossRef]
- Abujrida, H.; Agu, E.; Pahlavan, K. DeepaMed: Deep Learning-Based Medication Adherence of Parkinson’s Disease Using Smartphone Gait Analysis. Smart Health 2023, 30, 100430. [Google Scholar] [CrossRef]
- Deng, D.; Ostrem, J.L.; Nguyen, V.; Cummins, D.D.; Sun, J.; Pathak, A.; Little, S.; Abbasi-Asl, R. Interpretable Video-Based Tracking and Quantification of Parkinsonism Clinical Motor States. NPJ Park. Dis. 2024, 10, 122. [Google Scholar] [CrossRef]
- Yu, H.; Park, J.; Kang, K.; Jeong, S. SMAGNet: Scaled Mask Attention Guided Network for Vision-Based Gait Analysis in Multi-Person Environments. IEIE Trans. Smart Process. Comput. 2024, 13, 23–32. [Google Scholar] [CrossRef]
- Dibenedetto, G.; Sotiropoulos, S.; Polignano, M.; Cavallo, G.; Lops, P. Comparing Human Pose Estimation through Deep Learning Approaches: An Overview. Comput. Vis. Image Underst. 2025, 252, 104297. [Google Scholar] [CrossRef]
- Urcuqui, C.; Castano, Y.; Delgado, J.; Navarro, A.; Diaz, J.; Munoz, B.; Orozco, J. Exploring Machine Learning to Analyze Parkinson’s Disease Patients. In Proceedings of the 2018 14th International Conference on Semantics, Knowledge and Grid (SKG), Guangzhou, China, 12–14 September 2018; pp. 160–166. [Google Scholar] [CrossRef]
- Li, T.; Chen, J.; Hu, C.; Ma, Y.; Wu, Z.; Wan, W.; Huang, Y.; Jia, F.; Gong, C.; Wan, S.; et al. Automatic Timed Up-and-Go Sub-Task Segmentation for Parkinson’s Disease Patients Using Video-Based Activity Classification. IEEE Trans. Neural Syst. Rehabil. Eng. 2018, 26, 2189–2199. [Google Scholar] [CrossRef]
- Liu, Y.; Chen, J.; Hu, C.; Ma, Y.; Ge, D.; Miao, S.; Xue, Y.; Li, L. Vision-Based Method for Automatic Quantification of Parkinsonian Bradykinesia. IEEE Trans. Neural Syst. Rehabil. Eng. 2019, 27, 1952–1961. [Google Scholar] [CrossRef]
- Seo, J.S.; Chen, Y.; Kwon, D.Y.; Wallraven, C. Single-View, Video-Based Diagnosis of Parkinson’s Disease Based on Arm and Leg Joint Tracking. In Proceedings of the 2022 International Conference on Mechanical, Automation and Electrical Engineering (CMAEE), Chengdu, China, 16–18 December 2022; pp. 172–176. [Google Scholar] [CrossRef]
- Reyes, J.F.; Steven Montealegre, J.; Castano, Y.J.; Urcuqui, C.; Navarro, A. LSTM and Convolution Networks Exploration for Parkinson’s Diagnosis. In Proceedings of the 2019 IEEE Colombian Conference on Communications and Computing (COLCOM), Barranquilla, Colombia, 5–7 June 2019. [Google Scholar] [CrossRef]
- Timed Up and Go Test (TUG). Available online: https://www.physio-pedia.com/Timed_Up_and_Go_Test_(TUG) (accessed on 6 August 2025).
- Howe, E.G., III; Elenberg, F. Ethical Challenges Posed by Big Data. Innov. Clin. Neurosci. 2020, 17, 24–30. [Google Scholar]
- Olafenwa, A. Simplifying Object Segmentation with PixelLib Library. Available online: https://vixra.org/pdf/2101.0122v1.pdf (accessed on 13 August 2025).
- Fang, H.S.; Li, J.; Tang, H.; Xu, C.; Zhu, H.; Xiu, Y.; Li, Y.L.; Lu, C. AlphaPose: Whole-Body Regional Multi-Person Pose Estimation and Tracking in Real-Time. IEEE Trans. Pattern Anal. Mach. Intell. 2023, 45, 7157–7173. [Google Scholar] [CrossRef]
- Iglesias, G.; Talavera, E.; González-Prieto, Á.; Mozo, A.; Gómez-Canaval, S. Data Augmentation Techniques in Time Series Domain: A Survey and Taxonomy. Neural Comput. Appl. 2023, 35, 10123–10145. [Google Scholar] [CrossRef]
- (PDF) Gait Data Augmentation Using Physics-Based Biomechanical Simulation. Available online: https://www.researchgate.net/publication/372416921_Gait_Data_Augmentation_using_Physics-Based_Biomechanical_Simulation (accessed on 4 August 2025).
- Seth, A.; Sherman, M.; Reinbolt, J.A.; Delp, S.L. OpenSim: A Musculoskeletal Modeling and Simulation Framework for in Silico Investigations and Exchange. Procedia IUTAM 2011, 2, 212–232. [Google Scholar] [CrossRef]
- Xu, Y.; Zhang, J.; Zhang, Q.; Tao, D. ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation. arXiv 2022, arXiv:2204.12484. [Google Scholar] [CrossRef]
Author | Input Type | Method | Datasets | Class | Accuracy | Pros | Cons |
---|---|---|---|---|---|---|---|
Escamilla-Luna et al. (2022) [12] | sensor | SVM | Dataset created by using iPhone 5s accelerometers (60 participants; 53 with PD) | Change in velocity | 97.5% | High accuracy, convenient, and cost-effective by using only smartphone as sensor | Limited sample size (60 participants) may reduce statistical power and limit generalizability across broader PD populations |
Balaji et al. (2021) [13] | sensor | Long Short-Term Memory (LSTM) | Vertical ground reaction force (VGRF) (8 force-resistive sensors on foot for 2 min at regular pace, with 93 PD and 73 healthy) | Gait in Parkinson’s disease | Multi-class: 96.60% Binary: 98.60% | High accuracy; allows for the detection of medication states during a variety of activities that individuals routinely carry out in their daily lives | The computational time was significantly prolonged during the analysis |
Hssayeni et al. (2018) [14] | sensor | LSTM | Motion data from Parkinson’s disease individuals using a triaxial gyroscope sensor on the most affected ankle. Dataset 1: seven daily living activities in OFF and ON states with a 128 Hz sampling rate Dataset 2: symptoms during tasks in a home-like setting, recording continuously for about two hours per subject at a 64 Hz sampling rate | Gait in PD | Dataset 1: 73% Dataset 2: 77% | High accuracy; allows for the detection of medication states during a variety of activities that individuals routinely carry out in their daily lives | Dataset size and setting (home-based recordings from a small sample) may reduce statistical robustness and generalization to diverse real-world conditions |
Prince & De Vos (2018) [15] | sensor | DNN, CNN | 866 and 949 real healthy control and PD volunteer subjects used iPhones to conduct speaking, walking, memory, and tapping tests remotely | Gait in Parkinson’s disease | DNN: 61.2% CNN: 62.1% | Convenient and cost-effective by using only smartphone; easy for data collection | Low accuracy |
Ajay et al. (2018) [16] | camera | Binary decision tree | A total of 49 gait videos were obtained from YouTube, which were captured using ubiquitous devices, including webcams, smartphones, and surveillance cameras | Gait in PD | 93.75% | High accuracy; convenient and cost effective by using only camera | Limited number of YouTube-sourced gait videos (49 total) may hinder model robustness and pose risks of bias and overfitting |
Panda et al. (2024) [17] | sensor | Ensemble model (DL, NN, SVM, GBT) | VGRF signals sampled at 100 Hz over 2–5 min from PD patients and healthy controls, containing 19 attributes representing time, left/right foot pressures, and total foot pressures | Gait in PD | 93.21% | Achieves highest accuracy (92.31%) and robust PD detection from gait data | Marginal accuracy gain and limited insight into generalizability and validation |
Abujrida et al. (2023) [18] | sensor | DeePaMed (A multilayer CNN) | Gait sensor data from 152 PD patients and 304 healthy controls | Gait in PD | 98.2% | DeePaMed enables real-time detection of medication nonadherence, supports personalized treatment with privacy-preserving continuous monitoring, and outperforms other models on noisy mPower data | Performance depends on sufficient gait strides, is limited by comorbidities and dataset imbalance, and requires improvement for subject-level classification and overfitting risks |
Urcuqui et al. (2018) [22] | camera | RF, LR, NB | Recordings of the walking patterns of 30 individuals with PD and 30 individuals without any health issues | Gait in PD | RF: 82%, NB: 64%, LR: 76% | High accuracy; convenient and cost effective by using only RGBD camera | Dataset size (30 PD and 30 controls) may not adequately capture population variability, limiting external validity |
Li et al. (2018) [23] | camera | LTSM, SVM | A regular 2D video camera was used to record 127 TUG tests from 24 Parkinson patients in a semi-controlled setting | Gait in Parkinson’s disease | I + S: 91.9% I + L: 92.7% O + S: 92.8% O + L: 93.1% | High accuracy; convenient and cost effective by using only camera; deep learning-based human pose estimation allows for accurate subtask segmentation | Limited number of subjects (24 PD patients with 127 recordings) restricts statistical power and generalizability, especially in extreme cases |
Liu et al. (2019) [24] | camera | RF, KNN, L-SVM, RBF-SVM | The MPII dataset and author self-collected dataset, which consists of self-collected datasets from 60 people, each with varying age, gender, and levels of symptom severity | Hand movements in PD | KNN: 72.7%, RF: 80.4%, L-SVM: 83.5%, RDF-SVM: 89.7% | High accuracy; convenient and cost-effective by using only camera and laptop | Computationally intensive, with the authors noting the need for powerful GPUs for analysis (specific time not quantified in the original paper) |
Seo et al. (2022) [25] | camera | Neural Network, SVM, Ensemble classifier | 150 control walking sequences are randomly selected from the CASIA dataset, which consists of data from 18 individuals. The PD dataset includes 19 patients with PD, and it has a total of 82 full, non-assisted walking cycles | Arm and leg joint movements in PD | Neural Network: 93%, SVM: 92%, Ensemble classifier: 97% | High accuracy; convenient and cost effective by using only camera and laptop | Evaluation based on a small, controlled cohort limits applicability to diverse clinical populations; further validation on large-scale, real-world datasets is needed |
Reyes et al. (2019) [26] | camera | LSTM, Conv1D, Conv LSTM | 94 healthy control patients and 88 PD patients were collected using a Kinect®-based system; 3 gait features are used | Gait in PD | LSTM: 53%, Conv1D: 82%, Conv LSTM: 83.1% | High accuracy; convenient and cost effective by using only camera | Dataset (94 healthy and 88 PD) lacks diversity in subject conditions and movement variations, which may affect model generalization |
Deng et al. (2024) [19] | camera | SVM, LR, LDA, RF, AB, KNN, GNB | Video recordings from 31 parkinsonism patients, using 3–7 s single-view clips captured with consumer-grade devices | Gait in PD | SVM:0.72%, LR:0.72%, LDA:0.68%, RF:0.61%, AB:0.58%, KNN:0.68%, GNB:0.68% | Accurate, interpretable, and scalable framework using short, real-world videos from consumer devices | Analysis based on a small, biased cohort without healthy controls may introduce model overfitting and limit applicability to broader clinical settings |
Method | Classifier | Dataset | Optimal Parameters |
---|---|---|---|
DTW | KNN | Self-Collected | k = 3 |
DTW | LSTM | Self-Collected | epochs = 50, optimizer = ‘Adam’, learning_rate = 0.001 |
DTW | KNN | Online | k = 5 |
DTW | LSTM | Online | epochs = 50, optimizer = ‘Adam’, learning_rate = 0.001 |
BoP | KNN | Self-Collected | k = 3, window_size = 10, word_size = 4 |
BoP | LSTM | Self-Collected | epochs = 50, optimizer = ‘Adam’, learning_rate = 0.001 |
BoP | KNN | Online | k = 5, window_size = 12, word_size = 4 |
BoP | LSTM | Online | epochs = 50, optimizer = ‘Adam’, learning_rate = 0.001 |
SAX | KNN | Self-Collected | k = 3, window_size = 10, alphabet_size = 4 |
SAX | LSTM | Self-Collected | epochs = 50, optimizer = ‘Adam’, learning_rate = 0.001 |
SAX | KNN | Online | k = 5, window_size = 12, alphabet_size = 4 |
SAX | LSTM | Online | epochs = 50, optimizer = ‘Adam’, learning_rate = 0.001 |
Shapelet | - | Self-Collected | max_len = 15, min_len = 5 |
Shapelet | - | Online | max_len = 20, min_len = 8 |
Method | Classifier | Accuracy (%) | Precision (%) | Recall (%) | F1-Score (%) |
---|---|---|---|---|---|
DTW | KNN | 88.89 | 44.44 | 50.00 | 47.06 |
LSTM | 88.89 | 44.44 | 50.00 | 47.06 | |
Shapelet | - | 88.89 | 44.44 | 50.00 | 47.06 |
BoP | KNN | 85.71 | 42.86 | 50.00 | 46.15 |
LSTM | 71.43 | 35.71 | 50.00 | 41.67 | |
SAX | KNN | 88.89 | 44.44 | 50.00 | 47.06 |
LSTM | 88.89 | 44.44 | 50.00 | 47.06 |
Method | Classifier | Accuracy (%) | Precision (%) | Recall (%) | F1-Score (%) |
---|---|---|---|---|---|
DTW | KNN | 71.19 | 72.88 | 72.88 | 71.19 |
LSTM | 57.63 | 28.81 | 50.00 | 36.56 | |
Shapelet | - | 42.37 | 21.19 | 50.00 | 29.76 |
BoP | KNN | 65.91 | 69.20 | 68.11 | 65.75 |
LSTM | 47.73 | 48.69 | 48.75 | 47.48 | |
SAX | KNN | 42.37 | 21.19 | 50.00 | 29.76 |
LSTM | 57.63 | 28.81 | 50.00 | 36.56 |
Method | Classifier | Dataset | Avg Time (s) |
---|---|---|---|
DTW | KNN | Self-Collected | 0.0990 |
LSTM | Self-Collected | 0.1874 | |
KNN | Online | 0.0853 | |
LSTM | Online | 0.2613 | |
BoP | KNN | Self-Collected | 0.0037 |
LSTM | Self-Collected | 2.6633 | |
KNN | Online | 0.0046 | |
LSTM | Online | 2.7850 | |
SAX | KNN | Self-Collected | 0.6426 |
LSTM | Self-Collected | 2.2002 | |
KNN | Online | 0.5814 | |
LSTM | Online | 2.4547 | |
Shapelet | N/A | Self-Collected | 503.78 * |
N/A | Online | 345.62 * |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Chen, H.; Connie, T.; Tan, V.W.S.; Goh, M.K.O.; Saedon, N.I.; Al-Khatib, A.; Farfoura, M. A Non-Invasive Gait-Based Screening Approach for Parkinson’s Disease Using Time-Series Analysis. Symmetry 2025, 17, 1385. https://doi.org/10.3390/sym17091385
Chen H, Connie T, Tan VWS, Goh MKO, Saedon NI, Al-Khatib A, Farfoura M. A Non-Invasive Gait-Based Screening Approach for Parkinson’s Disease Using Time-Series Analysis. Symmetry. 2025; 17(9):1385. https://doi.org/10.3390/sym17091385
Chicago/Turabian StyleChen, Hui, Tee Connie, Vincent Wei Sheng Tan, Michael Kah Ong Goh, Nor Izzati Saedon, Ahmad Al-Khatib, and Mahmoud Farfoura. 2025. "A Non-Invasive Gait-Based Screening Approach for Parkinson’s Disease Using Time-Series Analysis" Symmetry 17, no. 9: 1385. https://doi.org/10.3390/sym17091385
APA StyleChen, H., Connie, T., Tan, V. W. S., Goh, M. K. O., Saedon, N. I., Al-Khatib, A., & Farfoura, M. (2025). A Non-Invasive Gait-Based Screening Approach for Parkinson’s Disease Using Time-Series Analysis. Symmetry, 17(9), 1385. https://doi.org/10.3390/sym17091385