Pathology-Informed Personalized Exoskeleton Assistance for Post-Stroke Gait Rehabilitation via Simulation-to-Real Reinforcement Learning
Abstract
1. Introduction
- Clinically Interpretable Pathological Gait Modeling: We introduce Neuromuscular-Inspired Parametric Augmentation (NIPA), a mechanism-driven method that synthesizes diverse pathological gait trajectories by explicitly modeling stroke-related impairment mechanisms, including weakness, stiffness, and abnormal synergies. Unlike unstructured perturbation strategies, NIPA preserves interpretable links between impairment mechanisms and kinematic deviations.
- Data-Efficient Simulation-to-Real Personalization: We develop a partial transfer learning strategy that preserves the pretrained feature extractor while adapting only lightweight task-specific output layers to individual patients. This design improves data efficiency under limited clinical samples and mitigates catastrophic forgetting during patient-specific adaptation.
- Quantitative Evaluation on Clinical Gait Data: We evaluate the proposed framework on a public clinical gait dataset [43]. Results show improved tracking performance, smoothness, generalization, and few-shot adaptation relative to representative baselines, while supporting quantitative analysis of heterogeneous post-stroke gait patterns.
2. Materials and Methods
2.1. Overall Workflow
2.2. Human–Exoskeleton Coupled Dynamics
2.3. Reinforcement Learning Formulation
2.4. Neuromuscular Inspired Parametric Augmentation
2.5. Simulation-to-Real Transfer Learning
2.6. Clinical Dataset, Baselines, and Training Setup
- Zero Assistance (Zero): Simulates a transparent or disabled exoskeleton (action ). It serves as a lower bound to quantify the patient’s raw performance and calculate improvement gains.
- Phase-based Heuristic Rule (Rule): Adjusts assistance based on gait phase: low (0.1) during stance and high (1.0) during swing. It represents traditional heuristic control and provides a reference for adaptive methods.
- Standard RL from Scratch (Scratch): Trains PPO directly on the target task without pretraining. It serves as a reference for the gains associated with the transfer learning strategy.
- Bounded PD Tracking (PD-B): Uses a conventional proportional-derivative tracking controller to follow the bounded command trajectory defined within the corridor between the stroke baseline and the healthy reference. This baseline does not use reinforcement learning or source-domain pretraining and provides a stronger model-based control comparator than Zero and Rule.
- Feature-Alignment Adaptation (FeatAlign): Uses the same source-domain pretraining setting as Ours but adds a feature-distribution alignment objective during target-domain adaptation to reduce the discrepancy between source and target latent representations. This baseline represents an explicit domain-adaptation strategy for evaluating whether pathology-informed pretraining with frozen feature extraction offers advantages beyond generic feature alignment.
2.7. Statistical Analysis
2.8. Evaluation Protocol and Metrics
3. Results
3.1. Quantitative Validation of NIPA-Generated Pathological Trajectories
3.2. Quantitative Evaluation of Patient-Specific Assistance Performance
3.3. Statistical Analysis of Repeated Experiments
3.4. Ablation Study of Pathology-Informed Pretraining and Freezing Strategy
3.5. Robustness Across Unseen Pathological Gait Patterns
3.6. Stability of Patient-Specific Adaptation During Training
3.7. Performance Under Limited Clinical Data
4. Discussion
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- World Health Organization. Stroke, Cerebrovascular Accident. WHO Fact Sheets. 2023. Available online: https://www.who.int/news-room/fact-sheets/detail/stroke-cerebrovascular-accident (accessed on 16 March 2026).
- GBD 2021 Diseases and Injuries Collaborators. Global incidence, prevalence, years lived with disability (YLDs), disability-adjusted life-years (DALYs), and healthy life expectancy (HALE) for 371 diseases and injuries in 204 countries and territories and 811 subnational locations, 1990–2021: A systematic analysis for the Global Burden of Disease Study 2021. Lancet 2024, 403, 2133–2161. [Google Scholar] [CrossRef]
- Carmo, A.; Kleiner, A.; Costa, P.; Barros, R. Three-dimensional kinematic analysis of upper and lower limb motion during gait of post-stroke patients. Braz. J. Med. Biol. Res. 2012, 45, 537–545. [Google Scholar] [CrossRef]
- Emken, J.; Benitez, R.; Reinkensmeyer, D. Human–robot cooperative movement training: Learning a novel sensory motor transformation during walking with robotic assistance-as-needed. J. Neuroeng. Rehabil. 2007, 4, 8. [Google Scholar] [CrossRef] [PubMed]
- Ploumis, A.; Gkatziani, P.; Tsingeli, P.; Ntritsos, G.; Dimopoulos, D.; Athanasiou, A.; Kefalas, A.; Varvarousis, N.D. Assessment of the robotic devices for overground gait training in poststroke patient: A systematic review and meta-analysis. Am. J. Phys. Med. Rehabil. 2026, 105, 12–19. [Google Scholar] [CrossRef]
- Marchal-Crespo, L.; Reinkensmeyer, D. Review of control strategies for robotic movement training after neurologic injury. J. Neuroeng. Rehabil. 2009, 6, 20. [Google Scholar] [CrossRef]
- Shi, D.; Zhang, W.; Zhang, W.; Ding, X. A review on lower limb rehabilitation exoskeleton robots. Chin. J. Mech. Eng. 2019, 32, 74. [Google Scholar] [CrossRef]
- Hasan, S.; Alam, N. Comprehensive comparative analysis of lower limb exoskeleton research: Control, design, and application. Actuators 2025, 14, 342. [Google Scholar] [CrossRef]
- Proietti, T.; Crocher, V.; Roby-Brami, A.; Jarrassé, N. Upper-limb robotic exoskeletons for neurorehabilitation: A review on control strategies. IEEE Rev. Biomed. Eng. 2016, 9, 4–14. [Google Scholar] [CrossRef]
- Xu, J.; Huang, K.; Zhao, M.; Liu, J. Human-in-the-loop modeling and bilateral skill transfer control of soft exoskeleton. Sensors 2024, 24, 7845. [Google Scholar] [CrossRef]
- Mnih, V.; Kavukcuoglu, K.; Silver, D.; Rusu, A.A.; Veness, J.; Bellemare, M.G.; Graves, A.; Riedmiller, M.; Fidjeland, A.K.; Ostrovski, G.; et al. Human-level control through deep reinforcement learning. Nature 2015, 518, 529–533. [Google Scholar] [CrossRef] [PubMed]
- Yu, C.; Liu, J.; Nemati, S.; Yin, G. Reinforcement learning in healthcare: A survey. ACM Comput. Surv. 2023, 55, 1–36. [Google Scholar] [CrossRef]
- Kapsalymov, A.; Brown, N.; Goecke, R.; Jamwal, P.; Hussain, S. Velocity control of a Stephenson III six-bar linkage-based gait rehabilitation robot using deep reinforcement learning. Neural Comput. Appl. 2025, 37, 5671–5682. [Google Scholar] [CrossRef]
- Wu, Y.; Qiao, S.; Yao, D. A hybrid chaotic controller integrating hip stiffness modulation and reinforcement learning-based torque control to stabilize passive dynamic walking. Proc. Inst. Mech. Eng. Part C J. Mech. Eng. Sci. 2023, 237, 673–691. [Google Scholar] [CrossRef]
- Du, Y.; Wang, H.; Qiu, S.; Yao, W.; Xie, P.; Chen, X. An advanced adaptive control of lower limb rehabilitation robot. Front. Robot. AI 2018, 5, 116. [Google Scholar] [CrossRef]
- Helian, Q.; Chang, J.; He, Z.; Sun, T. A Novel Robotic-Assisted Rehabilitation System for Elbow Fracture: Design and Full-Cycle Rehabilitation Strategy. IEEE Trans. Neural Syst. Rehabil. Eng. 2025, 33, 2404–2415. [Google Scholar] [CrossRef]
- Hamaya, M.; Matsubara, T.; Noda, T.; Teramae, T.; Morimoto, J. Learning assistive strategies from a few user–robot interactions: Model-based reinforcement learning approach. IEEE Robot. Autom. Lett. 2016, 1, 942–949. [Google Scholar] [CrossRef]
- Zhang, W.; Zhang, H.; Jiang, Z.; Servati, A.; Servati, P. Real-time forecasting of pathological gait via IMU navigation: A few-shot and generative learning framework for wearable devices. Discov. Electron. 2025, 2, 51. [Google Scholar] [CrossRef]
- Dulac-Arnold, G.; Levine, N.; Mankowitz, D.; Li, J.; Paduraru, C.; Gowal, S.; Hester, T. Challenges of real-world reinforcement learning: Definitions, benchmarks and analysis. Mach. Learn. 2021, 110, 2419–2468. [Google Scholar] [CrossRef]
- Rv, M.; Rakshit, S. Deep reinforcement learning based control of lower limb exoskeleton. In Proceedings of the 2024 International Joint Conference on Neural Networks (IJCNN); IEEE: Piscataway, NJ, USA, 2024; pp. 1–6. [Google Scholar] [CrossRef]
- Todorov, E.; Erez, T.; Tassa, Y. MuJoCo: A physics engine for model-based control. In Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Vilamoura-Algarve, Portugal, 7–12 October 2012; IEEE: Piscataway, NJ, USA, 2012; pp. 5026–5033. [Google Scholar] [CrossRef]
- Tobin, J.; Fong, R.; Ray, A.; Schneider, J.; Zaremba, W.; Abbeel, P. Domain randomization for transferring deep neural networks from simulation to the real world. In Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Vancouver, BC, Canada, 24–28 September 2017; IEEE: Piscataway, NJ, USA, 2017; pp. 23–30. [Google Scholar] [CrossRef]
- Peng, X.; Andrychowicz, M.; Zaremba, W.; Abbeel, P. Sim-to-real transfer of robotic control with dynamics randomization. In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), Brisbane, QLD, Australia, 21–25 May 2018; IEEE: Piscataway, NJ, USA, 2018; pp. 3803–3810. [Google Scholar] [CrossRef]
- Yu, W.; Tan, J.; Liu, C.; Turk, G. Preparing for the unknown: Learning a universal policy with online system identification. In Proceedings of the Robotics: Science and Systems (RSS), Cambridge, MA, USA, 12–16 July 2017. [Google Scholar]
- Zhu, J.; Park, T.; Isola, P.; Efros, A. Unpaired image-to-image translation using cycle-consistent adversarial networks. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 22–29 October 2017; IEEE: Piscataway, NJ, USA, 2017; pp. 2223–2232. [Google Scholar] [CrossRef]
- Zhang, Y.; Xiong, J.; Xian, H.; Chen, C.; Chen, X.; Liang, H.; Fu, C.; Leng, Y. Joint moment estimation for hip exoskeleton control: A generalized moment feature generation method. Biomim. Intell. Robot. 2025, 5, 100246. [Google Scholar] [CrossRef]
- OpenAI; Andrychowicz, M.; Baker, B.; Chociej, M.; Jozefowicz, R.; McGrew, B.; Pachocki, J.; Petron, A.; Plappert, M.; Powell, G.; et al. Learning dexterous in-hand manipulation. Int. J. Robot. Res. 2020, 39, 3–20. [Google Scholar] [CrossRef]
- Triess, L.T.; Dreissig, M.; Rist, C.B.; Zöllner, J.M. A survey on deep domain adaptation for lidar perception. In Proceedings of the 2021 IEEE Intelligent Vehicles Symposium Workshops (IV Workshops); IEEE: Piscataway, NJ, USA, 2021; pp. 350–357. [Google Scholar] [CrossRef]
- Wang, R.; Lee, K.; Zhu, X.; Winterbottom, L.; Nilsen, D.M.; Stein, J.; Ciocarlie, M. ReactEMG Stroke: Healthy-to-Stroke Few-shot Adaptation for sEMG-Based Intent Detection. arXiv 2026, arXiv:2601.22090. [Google Scholar] [CrossRef]
- Moon, J.; Le, N.A.; Minaya, N.H.; Choi, S.I. Multimodal few-shot learning for gait recognition. Appl. Sci. 2020, 10, 7619. [Google Scholar] [CrossRef]
- Finn, C.; Abbeel, P.; Levine, S. Model-agnostic meta-learning for fast adaptation of deep networks. In Proceedings of the International Conference on Machine Learning (ICML); JMLR: Cambridge, MA, USA, 2017; pp. 1126–1135. [Google Scholar]
- Hospedales, T.; Antoniou, A.; Micaelli, P.; Storkey, A. Meta-learning in neural networks: A survey. IEEE Trans. Pattern Anal. Mach. Intell. 2022, 44, 5149–5169. [Google Scholar] [CrossRef]
- Kueper, J.; Terry, A.; Zwarenstein, M. Artificial intelligence in primary care: A scoping review. Ann. Fam. Med. 2020, 18, 250–258. [Google Scholar] [CrossRef]
- Goodfellow, I.; Pouget-Abadie, J.; Mirza, M.; Xu, B.; Warde-Farley, D.; Ozair, S.; Courville, A.; Bengio, Y. Generative adversarial nets. In Proceedings of the Advances in Neural Information Processing Systems (NeurIPS), Montreal, QC, Canada, 8–13 December 2014; pp. 2672–2680. [Google Scholar]
- Holden, D.; Komura, T.; Saito, J. Phase-functioned neural networks for character control. ACM Trans. Graph. 2017, 36, 1–13. [Google Scholar] [CrossRef]
- Trottet, C.; Schürch, M.; Mollaysa, A.; Allam, A.; Krauthammer, M. Generative Time Series Models with Interpretable Latent Processes for Complex Disease Trajectories. In Deep Generative Models for HealthWorkshop, NeurIPS. 2023. Available online: https://neurips.cc/virtual/2023/workshop/66495 (accessed on 13 April 2026).
- Figueiredo, J.; Santos, C.P.; Moreno, J.C. Automatic recognition of gait patterns in human motor disorders using machine learning: A review. Med. Eng. Phys. 2018, 53, 1. [Google Scholar] [CrossRef]
- Delp, S.; Anderson, F.; Arnold, A.; Loan, P.; Habib, A.; John, C.; Guendelman, E.; Thelen, D. OpenSim: Open-source software to create and analyze dynamic simulations of movement. IEEE Trans. Biomed. Eng. 2007, 54, 1940–1950. [Google Scholar] [CrossRef]
- Steele, K.; Seth, A.; Hicks, J.; Schwartz, M.; Delp, S. Muscle contributions to support and progression during single-limb stance in crouch gait. J. Biomech. 2010, 43, 2099–2105. [Google Scholar] [CrossRef]
- Adeli, V.; Mehraban, S.; Mirmehdi, M.; Whone, A.; Filtjens, B.; Dadashzadeh, A.; Fasano, A.; Iaboni, A.; Taati, B. GAITGen: Disentangled motion-pathology impaired gait generative model–Bringing motion generation to the clinical domain. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV); IEEE: Piscataway, NJ, USA, 2026; pp. 3150–3161. [Google Scholar] [CrossRef]
- Pan, S.; Yang, Q. A survey on transfer learning. IEEE Trans. Knowl. Data Eng. 2010, 22, 1345–1359. [Google Scholar] [CrossRef]
- van der Krogt, M.; Delp, S.; Schwartz, M. How robust is human gait to muscle weakness? Gait Posture 2012, 36, 113–119. [Google Scholar] [CrossRef] [PubMed]
- Van Criekinge, T.; Saeys, W.; Truijen, S.; Vereeck, L.; Sloot, L.H.; Hallemans, A. A full-body motion capture gait dataset of 138 able-bodied adults across the life span and 50 stroke survivors. Sci. Data 2023, 10, 852. [Google Scholar] [CrossRef] [PubMed]
- Schulman, J.; Wolski, F.; Dhariwal, P.; Radford, A.; Klimov, O. Proximal policy optimization algorithms. arXiv 2017, arXiv:1707.06347. [Google Scholar] [CrossRef]
- Yang, J.; Zhu, Y.; Li, H.; Wang, K.; Li, D.; Qi, Q. Effect of robotic exoskeleton training on lower limb function, activity and participation in stroke patients: A systematic review and meta-analysis of randomized controlled trials. Front. Neurol. 2024, 15, 1453781. [Google Scholar] [CrossRef]







| Joint | Impairment | Operator | Prob. | Range | Clinical Meaning |
|---|---|---|---|---|---|
| Ankle | Drop Foot | Weakness | Tibialis anterior weakness, insufficient dorsiflexion | ||
| PF Contracture | Stiffness | Achilles tendon contracture, dorsiflexion shift | |||
| Push-off Deficit | Weakness | Plantarflexor weakness, reduced propulsion | |||
| Knee | Hyperextension | Weakness | Knee hyperextension during stance | ||
| Stiffness | Stiffness | Reduced range of motion, stiff gait | |||
| Hip | Flexion Deficit | Weakness | Hip flexor weakness, reduced step length | ||
| Extension Deficit | Stiffness | Hip flexion contracture, limited extension | |||
| Knee & Ankle | Extensor Synergy | Synergy | Hip extension triggers knee extension/ankle PF |
| Parameter | Scratch | Finetune |
|---|---|---|
| Total Timesteps | 500,000 | 500,000 |
| Learning Rate | ||
| Batch Size | 128 | 128 |
| Discount Factor (Gamma) | 0.995 | 0.995 |
| Clip Range | 0.2 | 0.1 |
| GAE Lambda | 0.95 | 0.95 |
| Entropy Coef | 0.01 | 0.01 |
| N_steps | 2048 | 2048 |
| Metric | Dataset | Hip | Knee | Ankle | Overall |
|---|---|---|---|---|---|
| Peak-to-peak range | NIPA stroke | 30.096 [29.829, 30.360] | 51.089 [50.756, 51.474] | 18.576 [18.433, 18.726] | 33.254 [33.109, 33.415] |
| Real stroke | 35.634 [33.259, 37.888] | 44.684 [41.597, 47.685] | 19.476 [17.758, 21.755] | 33.265 [30.978, 35.548] | |
| Mean joint angle | NIPA stroke | 18.046 [17.838, 18.235] | 29.380 [29.172, 29.596] | 7.975 [7.807, 8.117] | 18.467 [18.345, 18.585] |
| Real stroke | 15.582 [13.740, 17.864] | 24.314 [22.449, 25.933] | 6.448 [5.278, 7.664] | 15.448 [14.151, 16.801] |
| Subject | MSE (↓) | Reward (↑) | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ours | Scratch | Rule | Zero | PD-B | FeatAlign | Ours | Scratch | Rule | Zero | PD-B | FeatAlign | |
| Sub16 | 7.38 | 12.47 | 15.46 | 21.04 | 13.23 | 11.99 | −8.2900 | −19.5000 | −29.4800 | −39.8800 | −20.5201 | −18.6072 |
| Sub22 | 35.35 | 42.40 | 56.14 | 78.61 | 27.81 | 24.04 | −14.9700 | −30.7500 | −34.1500 | −49.6300 | −24.8762 | −16.1279 |
| Sub28 | 24.00 | 27.59 | 38.69 | 50.44 | 31.65 | 29.85 | −11.1100 | −23.9100 | −34.7100 | −49.3800 | −33.0837 | −24.7822 |
| Sub37 | 11.40 | 12.81 | 19.91 | 30.92 | 12.58 | 10.29 | −8.6100 | −10.7400 | −23.0900 | −38.5600 | −15.2173 | −11.4828 |
| Sub46 | 15.64 | 16.35 | 20.37 | 29.00 | 20.51 | 18.45 | −9.9400 | −11.4000 | −20.1400 | −39.1700 | −29.1230 | −23.4824 |
| Sub38 | 8.39 | 9.70 | 16.16 | 22.75 | 11.52 | 9.19 | −8.8200 | −12.2500 | −30.8000 | −43.8900 | −16.1222 | −12.8955 |
| Sub07 | 11.19 | 12.58 | 17.41 | 26.57 | 13.61 | 11.72 | −9.5600 | −13.8200 | −20.1000 | −31.2800 | −14.1214 | −12.2019 |
| Sub10 | 22.68 | 24.80 | 31.32 | 40.73 | 29.31 | 27.65 | −11.4600 | −12.8500 | −26.7300 | −44.3100 | −22.8050 | −18.6480 |
| Sub03 | 27.37 | 29.00 | 48.31 | 59.93 | 32.37 | 30.70 | −13.3200 | −20.7800 | −36.3800 | −48.7000 | −32.0430 | −23.0302 |
| Sub36 | 28.77 | 32.15 | 48.72 | 59.42 | 25.46 | 25.12 | −14.4100 | −22.4000 | −35.4200 | −49.5400 | −38.9763 | −34.9449 |
| Sub47 | 11.05 | 12.49 | 20.52 | 27.99 | 11.58 | 9.39 | −10.6300 | −16.7400 | −32.1700 | −47.9000 | −16.8812 | −13.0657 |
| Sub25 | 34.66 | 40.41 | 43.66 | 57.35 | 34.34 | 34.49 | −20.1100 | −30.1500 | −34.1500 | −49.5900 | −27.4416 | −21.8431 |
| Sub30 | 17.89 | 18.87 | 23.13 | 28.76 | 22.45 | 20.73 | −10.8100 | −14.4100 | −20.5700 | −36.3200 | −23.4442 | −18.7374 |
| Sub45 | 16.32 | 19.63 | 28.13 | 33.65 | 17.01 | 15.23 | −9.7000 | −22.2900 | −33.9200 | −48.4600 | −24.7264 | −18.8631 |
| Sub39 | 7.86 | 14.21 | 23.21 | 31.00 | 17.55 | 16.37 | −7.5700 | −18.2400 | −35.6900 | −46.8700 | −38.2363 | −33.1955 |
| Sub41 | 25.22 | 28.51 | 32.37 | 46.46 | 30.27 | 28.09 | −15.5400 | −27.8500 | −32.6700 | −48.7900 | −29.8998 | −21.6876 |
| Sub35 | 13.32 | 16.15 | 28.32 | 34.36 | 16.86 | 14.93 | −9.8700 | −20.7600 | −35.6700 | −48.8100 | −26.4565 | −19.5712 |
| Sub17 | 7.07 | 8.41 | 12.92 | 16.62 | 10.88 | 9.64 | −7.9100 | −10.9400 | −20.5400 | −27.1500 | −15.9567 | −13.6174 |
| Sub32 | 12.57 | 18.38 | 23.39 | 33.40 | 16.89 | 15.17 | −10.3900 | −21.6900 | −33.4600 | −44.9900 | −17.8512 | −15.0290 |
| Sub23 | 16.22 | 16.49 | 19.90 | 26.44 | 18.93 | 17.91 | −15.2500 | −16.8900 | −22.5300 | −36.4900 | −20.7057 | −19.2157 |
| Sub18 | 17.96 | 19.97 | 24.78 | 30.58 | 18.44 | 17.42 | −10.7300 | −17.8500 | −26.3900 | −38.5900 | −14.6950 | −14.3062 |
| Sub13 | 9.72 | 14.14 | 28.85 | 39.03 | 18.01 | 14.70 | −9.9800 | −19.5200 | −35.8100 | −49.0600 | −27.8731 | −17.9618 |
| Sub05 | 24.80 | 27.29 | 38.17 | 45.63 | 26.14 | 24.69 | −12.2000 | −22.1600 | −32.9400 | −46.4000 | −22.2265 | −16.9284 |
| Sub44 | 7.76 | 11.61 | 11.40 | 17.80 | 15.16 | 12.92 | −7.7900 | −17.2000 | −23.1400 | −36.0500 | −27.3538 | −21.6972 |
| Sub01 | 26.72 | 28.18 | 35.08 | 46.16 | 26.75 | 24.94 | −11.7400 | −19.3100 | −33.6500 | −49.1600 | −20.5735 | −16.8944 |
| Mean | 17.65 | 20.58 | 28.25 | 37.39 | 20.77 | 19.02 | −11.2284 | −18.9760 | −29.7720 | −43.5588 | −24.0484 | −19.1527 |
| Metric | Ours | Scr | Rule | PD-B | FeatAlign | Zero | p (Ours vs. Scr) |
|---|---|---|---|---|---|---|---|
| MSE Hip | 4.0143 ± 1.3136 | 5.9544 ± 2.4457 | 8.3693 ± 5.1398 | 9.6776 ± 4.7966 | 9.4481 ± 4.8333 | 11.5103 ± 6.9183 | |
| MSE Knee | 5.4507 ± 1.7431 | 6.5507 ± 2.6541 | 6.9683 ± 3.4544 | 6.2857 ± 2.1345 | 5.7793 ± 2.0833 | 10.7025 ± 4.1805 | |
| MSE Ankle | 2.4719 ± 1.7180 | 2.3630 ± 1.3550 | 5.9244 ± 4.7532 | 4.2786 ± 2.0278 | 4.1340 ± 1.9635 | 9.1827 ± 6.5866 | 0.9578 |
| Total MSE | 11.9369 ± 3.4081 | 14.8681 ± 4.8668 | 21.2621 ± 9.6127 | 20.2420 ± 6.6225 | 19.3614 ± 6.5181 | 31.3955 ± 13.3157 | |
| Reward | −18.4798 ± 7.0654 | −21.2264 ± 5.5398 | −28.3779 ± 7.3077 | −24.3079 ± 7.7734 | −21.8160 ± 7.1105 | −41.4144 ± 7.2752 | |
| Jerk | 0.0051 ± 0.0117 | 0.0069 ± 0.0155 | 0.0009 ± 0.0005 | 0.0027 ± 0.0021 | 0.0019 ± 0.0020 | 0.0004 ± 0.0002 |
| Metric | No Pretraining | Random + Full FT | Random + Frozen FE | NIPA + Full FT | NIPA + Frozen FE |
|---|---|---|---|---|---|
| NIPA | No | No | No | Yes | Yes |
| Pretraining | No | Yes | Yes | Yes | Yes |
| Freezing | No | No | Yes | No | Yes |
| MSE Hip | 5.9544 ± 2.4457 | 5.1050 ± 2.0716 | 6.3523 ± 2.2888 | 5.5257 ± 1.4637 | 4.0143 ± 1.3136 |
| MSE Knee | 6.5507 ± 2.6541 | 5.5735 ± 2.4782 | 5.6449 ± 2.4408 | 4.6894 ± 0.2408 | 5.4507 ± 1.7431 |
| MSE Ankle | 2.3630 ± 1.3550 | 3.6280 ± 0.6404 | 3.6405 ± 0.6600 | 2.6312 ± 0.6702 | 2.4719 ± 1.7180 |
| Total MSE | 14.8681 ± 4.8668 | 14.3065 ± 3.2659 | 15.6376 ± 3.4189 | 12.8463 ± 1.7676 | 11.9369 ± 3.4081 |
| Reward | −21.2264 ± 5.5398 | −21.4454 ± 6.6561 | −20.5479 ± 6.5645 | −20.3965 ± 2.1160 | −18.4798 ± 7.0654 |
| Jerk | 0.0069 ± 0.0155 | 0.0317 ± 0.0246 | 0.0117 ± 0.0108 | 0.0305 ± 0.0464 | 0.0051 ± 0.0117 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Ou, C.; Peng, Y.; Zhang, F. Pathology-Informed Personalized Exoskeleton Assistance for Post-Stroke Gait Rehabilitation via Simulation-to-Real Reinforcement Learning. Healthcare 2026, 14, 1523. https://doi.org/10.3390/healthcare14111523
Ou C, Peng Y, Zhang F. Pathology-Informed Personalized Exoskeleton Assistance for Post-Stroke Gait Rehabilitation via Simulation-to-Real Reinforcement Learning. Healthcare. 2026; 14(11):1523. https://doi.org/10.3390/healthcare14111523
Chicago/Turabian StyleOu, Chuyi, Yinbin Peng, and Furong Zhang. 2026. "Pathology-Informed Personalized Exoskeleton Assistance for Post-Stroke Gait Rehabilitation via Simulation-to-Real Reinforcement Learning" Healthcare 14, no. 11: 1523. https://doi.org/10.3390/healthcare14111523
APA StyleOu, C., Peng, Y., & Zhang, F. (2026). Pathology-Informed Personalized Exoskeleton Assistance for Post-Stroke Gait Rehabilitation via Simulation-to-Real Reinforcement Learning. Healthcare, 14(11), 1523. https://doi.org/10.3390/healthcare14111523
