Self-Supervised Gait Event Detection from Smartphone IMUs for Human Performance and Sports Medicine
Abstract
1. Introduction
2. Materials and Methods
2.1. Data and Harmonization
2.2. SSL Encoder (Backbone)
- 1D-CNN (dilated): the first was a one-dimensional dilated convolutional neural network (1D-CNN), configured with 6→64→128→256 channels, kernel sizes of 7/7/5, dilations of 1/2/4, and a stride of 2 in the first block. GELU activations and residual connections were applied throughout, followed by global average pooling to obtain a 256-dimensional embedding.
- Tiny Transformer: the second backbone was a lightweight Transformer. Input windows were patchified with stride 2 (patch length 4) and processed through four encoder blocks with hidden dimension 256, 4 attention heads, and dropout of 0.1. A [CLS] token representation was pooled into a 256-dimensional embedding. A two-layer MLP (256→256→128) with batch normalization then projected embeddings into the contrastive space.
2.3. Sensor-Aware Augmentations
2.4. Downstream Heads and Labels
2.5. Baselines
2.6. Evaluation Protocol and Statistics
- a linear probe, in which the backbone was frozen and only a logistic or linear head was trained;
- few-shot fine-tuning, where 1%, 5%, or 10% of labeled windows from D were used with subject-stratified sampling.
2.7. Computing Environment and Reproducibility
- Hardware. Experiments were performed on high-performance workstations equipped with NVIDIA RTX 3090/4090 and A6000 GPUs (24–48 GB VRAM), 64–128 GB RAM, and ≥1 TB SSD storage.
- Software. The implementation used Python 3.10 and PyTorch 2.x with CUDA 12.x/cuDNN. Core libraries included NumPy, Pandas, SciPy, Scikit-learn, and TorchMetrics. Configuration management was handled with Hydra; MLflow and Weights & Biases were used for experiment logging and tracking. The environment was version- controlled with Git and containerized with Docker/Conda for portability.
- Data pipeline. Raw IMU signals underwent axis harmonization, z-score normalization, resampling at 25, 50, or 100 Hz, and segmentation into 3.2 s windows. SSL pretraining was performed on unlabeled windows with sensor-aware augmentations (axis-swap, drift, jitter, time-warp). Fine-tuning added classification (gait events) and regression heads (stride-time error).
- Evaluation protocol. Performance was assessed with F1, precision, recall, AUROC, and mean absolute error (MAE). Model size (parameters in M), inference latency, and convergence speed (epochs to optimal validation) were also monitored. Each condition was repeated in triplicate with fixed random seeds.
- Statistical analysis. All evaluations employed non-parametric bootstrap confidence intervals (5000 iterations), paired Wilcoxon signed-rank tests with Benjamini–Hochberg FDR control (α = 0.05), and dual reporting of standardized effect sizes (Cohen’s d) together with absolute gains (ΔF1, ΔMAE), providing robust, reproducible, and practically interpretable inference.
- Reproducibility. All metrics, results, and figures were automatically logged in MLflow, together with configuration files. Complete environment specifications (Docker and Conda manifests) are provided to enable deterministic replication of the experiments across different systems. All experiments were repeated with fixed random seeds controlling initialization, data splits, and augmentation draws, ensuring strict reproducibility across runs.
- Inference feasibility. On typical GPU hardware (RTX 3090/4090), inference latency ranged between 6 ms (CNN) and 15 ms (Transformer) per 3.2 s input window. On-edge deployment tests on a high-end smartphone (Snapdragon 8 Gen 2, 12 GB RAM) yielded average inference times below 60 ms per window, equivalent to near-real-time operation (<0.1 s delay). The computational footprint (~1–7 M parameters; <108 FLOPs) indicates feasibility for real-time gait analysis on modern mobile processors.
2.8. Experiments
3. Results
3.1. RQ1—Cross-Dataset Transfer (Linked to H1/E1)
3.2. RQ2—Label Efficiency (Linked to H2/E2)
3.3. RQ3—Augmentation Ablations (Linked to H3/E3)
3.4. RQ4—Device and Placement Shift (Linked to H4/E4)

3.5. RQ5—Sampling Rate Sensitivity (Linked to H5/E5)
3.6. RQ6—Backbone Comparison (Linked to H6/E6)

3.7. Summary of Results
4. Discussion
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
| Abbreviation | Definition |
| SSL | Self-Supervised Learning |
| IMU | Inertial Measurement Unit |
| CNN | Convolutional Neural Network |
| TCN | Temporal Convolutional Network |
| BiLSTM | Bidirectional Long Short-Term Memory |
| MAE | Mean Absolute Error |
| AUROC | Area Under the Receiver Operating Characteristic Curve |
| F1 | F1-score (harmonic mean of precision and recall) |
| HS | Heel Strike |
| TO | Toe Off |
| Δ | Delta (change relative to baseline) |
| CI | Confidence Interval |
| FDR | False Discovery Rate |
References
- Young, F.; Mason, R.; Morris, R.E.; Stuart, S.; Godfrey, A. IoT-Enabled Gait Assessment: The Next Step for Habitual Monitoring. Sensors 2023, 23, 4100. [Google Scholar] [CrossRef]
- Prisco, G.; Pirozzi, M.A.; Santone, A.; Esposito, F.; Cesarelli, M.; Amato, F.; Donisi, L. Validity of Wearable Inertial Sensors for Gait Analysis: A Systematic Review. Diagnostics 2025, 15, 36. [Google Scholar] [CrossRef] [PubMed]
- Boutaayamou, M.; Azzi, S.; Desailly, E.; Dumas, R. Toward Convenient and Accurate IMU-Based Gait Analysis: A Minimal-Sensor Approach. Sensors 2025, 25, 1267. [Google Scholar] [CrossRef] [PubMed]
- Lin, S.; Evans, K.; Hartley, D.; Morrison, S.; McDonald, S.; Veidt, M.; Wang, G. A Review of Gait Analysis Using Gyroscopes and Inertial Measurement Units. Sensors 2025, 25, 3481. [Google Scholar] [CrossRef] [PubMed]
- Gastaldi, L.; Pastorelli, S.; Rosati, S. Recent Advances and Applications of Wearable Inertial Measurement Units in Human Motion Analysis. Sensors 2025, 25, 818. [Google Scholar] [CrossRef]
- Mănescu, A.M.; Grigoroiu, C.; Smîdu, N.; Dinciu, C.C.; Mărgărit, I.R.; Iacobini, A.; Mănescu, D.C. Biomechanical Effects of Lower Limb Asymmetry During Running: An OpenSim Computational Study. Symmetry 2025, 17, 1348. [Google Scholar] [CrossRef]
- Liu, Y.; Liu, X.; Zhu, Q.; Chen, Y.; Yang, Y.; Xie, H.; Wang, Y.; Wang, X. Adaptive Detection in Real-Time Gait Analysis through the Dynamic Gait Event Identifier. Bioengineering 2024, 11, 806. [Google Scholar] [CrossRef]
- Gouda, A.; Andrysek, J. Rules-Based Real-Time Gait Event Detection Algorithm for Lower-Limb Prosthesis Users during Level-Ground and Ramp Walking. Sensors 2022, 22, 8888. [Google Scholar] [CrossRef]
- Romijnders, R.; Warmerdam, E.; Hansen, C.; Schmidt, G.; Maetzler, W. A Deep Learning Approach for Gait Event Detection from a Single Shank-Worn IMU: Validation in Healthy and Neurological Cohorts. Sensors 2022, 22, 3859. [Google Scholar] [CrossRef]
- Prasanth, H.; Caban, M.; Keller, U.; Courtine, G.; Ijspeert, A.; Vallery, H.; von Zitzewitz, J. Wearable Sensor-Based Real-Time Gait Detection: A Systematic Review. Sensors 2021, 21, 2727. [Google Scholar] [CrossRef]
- Tao, W.; Liu, T.; Zheng, R.; Feng, H. Gait Analysis Using Wearable Sensors. Sensors 2012, 12, 2255–2283. [Google Scholar] [CrossRef]
- Mannini, A.; Sabatini, A.M. Machine Learning Methods for Classifying Human Physical Activity from On-Body Accelerometers. Sensors 2010, 10, 1154–1175. [Google Scholar] [CrossRef]
- Mănescu, D.C. Big Data Analytics Framework for Decision-Making in Sports Performance Optimization. Data 2025, 10, 116. [Google Scholar] [CrossRef]
- Badau, D.; Badau, A.; Teodor, D.F.; Dinciu, C.C.; Dulceata, V.; Mănescu, D.C.; Mănescu, C.O.; Litoi, M.F.; Stoica, A.-M. Multidimensional Assessment of Athletic and Non-Athletic Female Students Through Analysis of BMI, Body Perception, Objectification, and Attitudes Towards the Ideal Body. Behav. Sci. 2025, 15, 1454. [Google Scholar] [CrossRef]
- Ren, J.; Wang, A.; Li, H.; Yue, X.; Meng, L. A Transformer-Based Neural Network for Gait Prediction in Lower Limb Exoskeleton Robots Using Plantar Force. Sensors 2023, 23, 6547. [Google Scholar] [CrossRef]
- Wang, J.; Ma, H.; Zhou, J.; Liu, H.; Yan, Y.; Xiong, W. A Multimodal CNN–Transformer Network for Gait Pattern Recognition. Electronics 2025, 14, 1537. [Google Scholar] [CrossRef]
- Mogan, J.N.; Lee, C.P.; Lim, K.M.; Ali, M.; Alqahtani, A. Gait-CNN-ViT: Multi-Model Gait Recognition with Convolutional Neural Networks and Vision Transformer. Sensors 2023, 23, 3809. [Google Scholar] [CrossRef]
- Jung, D.; Kim, J.; Park, Y.; Lee, H.; Choi, M.; Seo, K. Multi-Model Gait-Based Knee Adduction Moment Prediction System Using IMU Sensor Data and LSTM RNN. Appl. Sci. 2024, 14, 10721. [Google Scholar] [CrossRef]
- Shi, Y.; Ying, X.; Yang, J. Deep Unsupervised Domain Adaptation with Time Series Sensor Data: A Survey. Sensors 2022, 22, 5507. [Google Scholar] [CrossRef] [PubMed]
- Powell, K.; Amer, A.; Glavcheva-Laleva, Z.; Williams, J.; O’Flaherty Farrell, C.; Harwood, F.; Bishop, P.; Holt, C. MoveLab®: Validation and Development of Novel Cross-Platform Gait and Mobility Assessments Using Gold Standard Motion Capture and Clinical Standard Assessment. Sensors 2025, 25, 5706. [Google Scholar] [CrossRef] [PubMed]
- Li, C.; Wang, B.; Li, Y.; Liu, B. A Lightweight Pathological Gait Recognition Approach Based on a New Gait Template in Side-View and Improved Attention Mechanism. Sensors 2024, 24, 5574. [Google Scholar] [CrossRef]
- Mănescu, D.C. Computational Analysis of Neuromuscular Adaptations to Strength and Plyometric Training: An Integrated Modeling Study. Sports 2025, 13, 298. [Google Scholar] [CrossRef]
- Hwang, S.; Kim, J.; Yang, S.; Moon, H.-J.; Cho, K.-H.; Youn, I.; Sung, J.-K.; Han, S. Machine Learning Based Abnormal Gait Classification with IMU Considering Joint Impairment. Sensors 2024, 24, 5571. [Google Scholar] [CrossRef]
- Pang, L.; Li, Y.; Liao, M.-X.; Qiu, J.-G.; Li, H.; Wang, Z.; Sun, G. A Feasibility Study of Domain Adaptation for Exercise Intensity Recognition Based on Wearable Sensors. Sensors 2025, 25, 3437. [Google Scholar] [CrossRef]
- Chang, Y.; Mathur, A.; Isopoussu, A.; Song, J.; Kawsar, F. A Systematic Study of Unsupervised Domain Adaptation for Robust Human-Activity Recognition. Proc. ACM Interact. Mob. Wearable Ubiquitous Technol. 2020, 4, 3380985. [Google Scholar] [CrossRef]
- Mu, F.; Gu, X.; Guo, Y.; Lo, B. Unsupervised Domain Adaptation for Position-Independent IMU Based Gait Analysis. In Proceedings of the IEEE SENSORS 2020, Rotterdam, The Netherlands, 25–28 October 2020; pp. 1–4. [Google Scholar] [CrossRef]
- Guo, Y.; Gu, X.; Yang, G.-Z. MCDCD: Multi-Source Unsupervised Domain Adaptation for Abnormal Human Gait Detection. IEEE J. Biomed. Health Inform. 2021, 25, 4017–4028. [Google Scholar] [CrossRef] [PubMed]
- Mănescu, D.C.; Mănescu, A.M. Artificial Intelligence in the Selection of Top-Performing Athletes for Team Sports: A Proof-of-Concept Predictive Modeling Study. Appl. Sci. 2025, 15, 9918. [Google Scholar] [CrossRef]
- Haresamudram, H.; Essa, I.; Plötz, T. Towards Learning Discrete Representations via Self-Supervised Learning for Wearables. Sensors 2024, 24, 1238. [Google Scholar] [CrossRef]
- Sheng, T.; Huber, M. Reducing Label Dependency in Human Activity Recognition with Wearables: From Supervised Learning to Novel Weakly Self-Supervised Approaches. Sensors 2025, 25, 4032. [Google Scholar] [CrossRef] [PubMed]
- Stenger, R.; Hozhabr Pour, H.; Teich, J.; Hein, A.; Fudickar, S. Gait Event Detection and Gait Parameter Estimation from a Single Waist-Worn IMU Sensor. Sensors 2025, 25, 6463. [Google Scholar] [CrossRef]
- Geissinger, J.H.; Asbeck, A.T. Motion Inference Using Sparse Inertial Sensors, Self-Supervised Learning, and a New Dataset of Unscripted Human Motion. Sensors 2020, 20, 6330. [Google Scholar] [CrossRef] [PubMed]
- Das, A.M.; Tang, C.I.; Kawsar, F.; Malekzadeh, M. PRIMUS: Pretraining IMU Encoders with Multimodal Self-Supervision. arXiv 2024. [Google Scholar] [CrossRef]
- He, Y.; Chen, Y.; Tang, L.; Zhang, X.; Li, P.; Wang, Q. Accuracy Validation of a Wearable IMU-Based Gait Analysis in Healthy Female. BMC Sports Sci. Med. Rehabil. 2024, 16, 2. [Google Scholar] [CrossRef] [PubMed]
- Zhang, M.; Huang, Y.; Liu, R.; Sato, Y. Masked Video and Body-worn IMU Autoencoder for Egocentric Action Recognition. arXiv 2024. [Google Scholar] [CrossRef]
- Hasan, M.A.; Li, F.; Gouverneur, P.; Piet, A.; Grzegorzek, M. A comprehensive survey and comparative analysis of time series data augmentation in medical wearable computing. PLoS ONE 2025, 20, e0315343. [Google Scholar] [CrossRef] [PubMed]
- Manescu, D.C. Inteligencia Artificial en el entrenamiento deportivo de élite y perspectiva de su integración en el deporte escolar. Retos 2025, 73, 128–141. [Google Scholar] [CrossRef]
- Ashfaq, N.; Anwar, A.; Khan, F.; Cheema, M.U. Identification of Optimal Data Augmentation Techniques for Multimodal Time Series Classification. Information 2024, 15, 343. [Google Scholar] [CrossRef]
- Tu, Y.-C.; Lin, C.-Y.; Liu, C.-P.; Chan, C.-T. Performance Analysis of Data Augmentation Approaches for Improving Wrist- Based Fall Detection System. Sensors 2025, 25, 2168. [Google Scholar] [CrossRef]
- Liu, Z.; Alavi, A.; Li, M.; Zhang, X. Self-Supervised Contrastive Learning for Medical Time Series: A Systematic Review. Sensors 2023, 23, 4221. [Google Scholar] [CrossRef]
- Chen, H.; Gouin-Vallerand, C.; Bouchard, K.; Gaboury, S.; Couture, M.; Bier, N.; Giroux, S. Contrastive Self-Supervised Learning for Sensor-Based Human Activity Recognition: A Review. IEEE Access 2024, 12, 152511–152531. [Google Scholar] [CrossRef]
- Montero Quispe, K.G.; Utyiama, D.M.S.; dos Santos, E.M.; Oliveira, H.A.B.F.; Souto, E.J.P. Applying Self-Supervised Representation Learning for Emotion Recognition Using Physiological Signals. Sensors 2022, 22, 9102. [Google Scholar] [CrossRef]
- Kapoor, S.; Narayanan, A. Leakage and the Reproducibility Crisis in Machine-Learning-Based Science. Patterns 2023, 4, 100804. [Google Scholar] [CrossRef] [PubMed]
- Ding, L.; Zhang, C.; Yue, Y.; Yao, C.; Li, Z.; Hu, Y.; Yang, B.; Ma, W.; Yu, L.; Gao, R.; et al. Wearable Sensors-Based Intelligent Sensing and Data Acquisition: Challenges and Emerging Opportunities. Sensors 2025, 25, 4515. [Google Scholar] [CrossRef]
- Kwapisz, J.R.; Weiss, G.M.; Moore, S.A. Activity Recognition Using Cell Phone Accelerometers. ACM SIGKDD Explor. Newsl. 2011, 12, 74–82. [Google Scholar] [CrossRef]
- Reiss, A.; Stricker, D. Introducing a New Benchmarked Dataset for Activity Monitoring. In Proceedings of the 2012 16th International Symposium on Wearable Computers (ISWC), Newcastle, UK, 18–22 June 2012; IEEE: Newcastle, UK, 2012; pp. 108–109. [Google Scholar] [CrossRef]
- Sikder, N.; Al Nahid, A. KU-HAR: An Open Dataset for Heterogeneous Human Activity Recognition. Pattern Recognit. Lett. 2021, 146, 46–54. [Google Scholar] [CrossRef]
- Baños, O.; García, R.; Saez, A. MHEALTH [Dataset]; UCI Machine Learning Repository; UC Irvine Machine Learning Repository: Irvine, CA, USA, 2014. [Google Scholar] [CrossRef]
- Chavarriaga, R.; Sagha, H.; Calatroni, A.; Digumarti, S.T.; Tröster, G.; Millán, J.d.R.; Roggen, D. The Opportunity Challenge: A Benchmark Database for On-Body Sensor-Based Activity Recognition. Pattern Recognit. Lett. 2013, 34, 2033–2042. [Google Scholar] [CrossRef]
- Sztyler, T.; Stuckenschmidt, H. On-Body Localization of Wearable Devices: An Investigation of Position-Aware Activity Recognition. In Proceedings of the 2016 IEEE International Conference on Pervasive Computing and Communications (PerCom), Sydney, Australia, 14–19 March 2016; IEEE: Sydney, Australia, 2016; pp. 1–9. [Google Scholar] [CrossRef]
- Liao, Y.; Cao, J.; Yu, L.; Xiang, J.; Zhao, Y. A Multi-sensor Gait Dataset Collected Under Non-standardized Dual-Task Conditions. Sci. Data 2025, 12, 1121. [Google Scholar] [CrossRef]
- Griškevičius, J.; Marčiulionis, K.; Pavalkis, D.; Kriščiukaitis, A.; Lukoševičius, M. Retrospective Frailty Assessment in Older Adults Using Deep Learning on IMU-Based Gait Data. Sensors 2025, 25, 3351. [Google Scholar] [CrossRef]
- Voisard, C.; Ross, A.; Robertson, J.; Le Roux, A.; Wilson, S. Automatic Gait Events Detection with Inertial Measurement Units (IMUs). J. NeuroEng. Rehabil. 2024, 21, 104. [Google Scholar] [CrossRef]
- Simonetti, E.; Vacherand, E.; van den Noort, J.; Stoll, F.; Porter-Armstrong, A.; Hamel, J.; Kirtley, C.; Leardini, A.; Picerno, P. Gait Event Detection Using Inertial Measurement Units in People with Transfemoral Amputation: A Comparative Study. Med. Biol. Eng. Comput. 2020, 58, 461–470. [Google Scholar] [CrossRef] [PubMed]
- Dion, G.; Reed, E.; Bartlett, M.; Fukuda, K.; Lamarche, C.; Delgado, P. In-Sensor Human Gait Analysis with Machine Learning in a Wearable System. Nat. Mach. Intell. 2024, 6, 667–678. [Google Scholar] [CrossRef]
- Panteli, N.; Hadjicharalambous, M.; Zaras, N. Delayed Potentiation Effect on Sprint, Power and Agility Performance in WellTrained Soccer Players. J. Sci. Sport Exerc. 2023, 6, 131–139. [Google Scholar] [CrossRef]
- Hadjicharalambous, M.; Chalari, E.; Zaras, N. Influence of puberty stage in immune-inflammatory parameters in well-trained adolescent soccer-players, following 8-weeks of pre-seasonal preparation training. Explor. Immunol. 2024, 4, 822–836. [Google Scholar] [CrossRef]
- Zaras, N.; Stasinaki, A.-N.; Spiliopoulou, P.; Mpampoulis, T.; Hadjicharalambous, M.; Terzis, G. Effect of Inter-Repetition Rest vs. Traditional Strength Training on Lower Body Strength, Rate of Force Development, and Muscle Architecture. Appl. Sci. 2021, 11, 45. [Google Scholar] [CrossRef]
- Manupibul, U.; Tanthuwapathom, R.; Jarumethitanont, W.; Kaimuk, P.; Limroongreungrat, W.; Charoensuk, W. Integration of Force and IMU Sensors for Developing Low-Cost Portable Gait Measurement System in Lower Extremities. Sci. Rep. 2023, 13, 10653. [Google Scholar] [CrossRef]
- Baek, J.-E.; Jung, J.-H.; Kim, H.-K.; Cho, H.-Y. Smartphone Accelerometer for Gait Assessment: Validity and Reliability in Healthy Adults. Appl. Sci. 2024, 14, 11321. [Google Scholar] [CrossRef]
- Teufl, W.; Lorenz, M.; Miezal, M.; Taetz, B.; Fröhlich, M.; Bleser, G. Towards Inertial Sensor Based Mobile Gait Analysis: Event-Detection and Spatio-Temporal Parameters. Sensors 2019, 19, 38. [Google Scholar] [CrossRef] [PubMed]
- Larsen, A.G.; Sadolin, L.Ø.; Thomsen, T.R.; Oliveira, A.S. Accurate Detection of Gait Events Using Neural Networks and IMU Data Mimicking Real-World Smartphone Usage. Comput. Methods Biomech. Biomed. Engin. 2024, 27, 1–11. [Google Scholar] [CrossRef]





| Dataset | Reference/DOI | Subjects (n) | Sensor Placement(s) | Sampling (Hz) | Event Labels Source | Notes |
|---|---|---|---|---|---|---|
| WISDM | Kwapisz et al., 2011 doi:10.1145/1964897.1964918 | 36 | smartphone at waist/hip | 50 | manual annotation | scripted walking |
| PAMAP2 | Reiss & Stricker, 2012 doi:10.1109/ISWC.2012.13 | 9 | wrist, chest, ankle IMUs | 100 | proxy (IMU-derived) | treadmill + daily activities |
| KU-HAR | Sikder & Nahid, 2021 doi:10.1016/j.patrec.2021.02.024 | 90 | smartphone at waist | 100 | manual annotation | scripted activities |
| mHealth | Banos et al., 2014 doi:10.24432/C5TW22 | 10 | chest, ankle, wrist (Shimmer) | 50 | footswitch sensors | controlled lab |
| OPPORTUNITY | Chavarriaga et al., 2013 doi:10.1016/j.patrec.2012.12.014 | 4 | wrist, back, hip + others | 30 | annotated HS/TO | daily activities scenario |
| RWHAR | Sztyler & Stuckenschmidt, 2016 doi:10.1109/PERCOM.2016.7456521 | 15 | smartphone in pocket | 50 | manual annotation | real-world free living |
| Hypothesis (H) | Experiment (E) | Research Question (RQ) |
|---|---|---|
| H1. Self-supervised pretraining improves gait event detection compared to supervised learning from scratch. | E1. Pretraining vs. supervised baseline within datasets. | RQ1. Does SSL provide consistent gains on event detection accuracy? |
| H2. SSL models transfer better across heterogeneous datasets than supervised models. | E2. Cross-dataset transfer evaluation. | RQ2. Does pretraining improve generalization across datasets? |
| H3. Larger unlabeled datasets for SSL pretraining yield stronger downstream performance. | E3. Scaling pretraining corpus size. | RQ3. What is the effect of unlabeled dataset size on downstream performance? |
| H4. SSL gains hold even with limited labeled data for fine-tuning. | E4. Fine-tuning with reduced labeled fractions. | RQ4. How does SSL behave when labeled data availability is scarce? |
| H5. SSL models improve detection of temporal gait phases beyond discrete events. | E5. Phase-level analysis of gait cycles. | RQ5. Does SSL improve accuracy in estimating stride, stance, and swing? |
| H6. SSL provides statistically significant improvements robust to evaluation method. | E6. Statistical testing across metrics and datasets. | RQ6. Are improvements statistically reliable across conditions? |
| Source → Target | Method | F1 (±95% CI) | AUROC (±95% CI) | MAE (ms ± 95% CI) | Gain F1 (%) | Gain MAE (%) | Cohen’s d | p-Value |
|---|---|---|---|---|---|---|---|---|
| Dataset A → B | Supervised baseline | 0.73 [0.71–0.75] | 0.81 [0.79–0.83] | 28 [26–30] | Ref. | Ref. | Ref. | Ref. |
| SSL Linear Probe | 0.80 [0.77–0.83] | 0.87 [0.84–0.90] | 18 [15–22] | +9.6 | −35.7 | 1.2 | <0.001 | |
| SSL Few-shot (10%) | 0.83 [0.81–0.85] | 0.89 [0.87–0.91] | 15 [13–17] | +13.7 | −46.4 | 1.5 | <0.001 | |
| Dataset A → C | Supervised baseline | 0.75 [0.73–0.77] | 0.83 [0.81–0.85] | 27 [25–29] | Ref. | Ref. | Ref. | Ref. |
| SSL Linear Probe | 0.81 [0.79–0.83] | 0.88 [0.86–0.90] | 19 [17–21] | +8.0 | −29.6 | 1.1 | <0.001 | |
| SSL Few-shot (10%) | 0.84 [0.82–0.86] | 0.90 [0.88–0.92] | 16 [14–18] | +3.4 | −40.7 | 0.39 | <0.072 (ns) | |
| Dataset B → C | Supervised baseline | 0.74 [0.72–0.76] | 0.82 [0.80–0.84] | 29 [27–31] | Ref. | Ref. | Ref. | Ref. |
| SSL Linear Probe | 0.79 [0.76–0.82] | 0.86 [0.84–0.88] | 20 [17–24] | +4.1 | −31.0 | 1.0 | <0.018 | |
| SSL Few-shot (10%) | 0.83 [0.81–0.85] | 0.89 [0.87–0.91] | 16 [14–18] | +12.2 | −44.8 | 1.4 | <0.001 |
| Label Fraction | Method | F1 (±95% CI) | AUROC (±95% CI) | MAE (ms ± 95% CI) | Gain F1 (%) vs. Supervised | Gain MAE (%) vs. Supervised | Cohen’s d | p-Value |
|---|---|---|---|---|---|---|---|---|
| 1% | Supervised baseline | 0.62 [0.59–0.65] | 0.71 [0.68–0.74] | 29 [25–33] | Ref. | Ref. | Ref. | Ref. |
| SSL Pretrained | 0.73 [0.70–0.76] | 0.82 [0.79–0.85] | 20 [17–24] | +17.7 | −31.0 | 1.3 | <0.001 | |
| 5% | Supervised baseline | 0.72 [0.70–0.74] | 0.80 [0.77–0.83] | 24 [21–27] | Ref. | Ref. | Ref. | Ref. |
| SSL Pretrained | 0.81 [0.79–0.83] | 0.87 [0.85–0.90] | 18 [15–21] | +12.5 | −25.0 | 1.1 | <0.001 | |
| 10% | Supervised baseline | 0.77 [0.75–0.79] | 0.84 [0.82–0.87] | 22 [19–25] | Ref. | Ref. | Ref. | Ref. |
| SSL Pretrained | 0.83 [0.81–0.85] | 0.89 [0.87–0.91] | 17 [14–20] | +7.8 | −22.7 | 1.0 | <0.001 | |
| 25% | Supervised baseline | 0.80 [0.78–0.82] | 0.87 [0.85–0.89] | 20 [17–23] | Ref. | Ref. | Ref. | Ref. |
| SSL Pretrained | 0.84 [0.82–0.86] | 0.90 [0.88–0.92] | 18 [15–21] | +5.0 | −10.0 | 0.7 | <0.05 | |
| 50% | Supervised baseline | 0.83 [0.81–0.85] | 0.89 [0.87–0.91] | 18 [16–20] | Ref. | Ref. | Ref. | Ref. |
| SSL Pretrained | 0.85 [0.83–0.87] | 0.91 [0.89–0.93] | 17 [15–19] | +2.4 | −5.6 | 0.4 | n.s. | |
| 100% | Supervised baseline | 0.86 [0.84–0.88] | 0.92 [0.90–0.94] | 16 [14–18] | Ref. | Ref. | Ref. | Ref. |
| SSL Pretrained | 0.87 [0.85–0.89] | 0.92 [0.90–0.94] | 16 [14–18] | +1.2 | −0.0 | 0.1 | n.s. |
| Augmentation Removed | ΔF1 (95% CI) | ΔAUROC | ΔAUROC % | ΔMAE (ms) | Cohen’s d | p-Value | ΔEpochs to Convergence | Relative Importance |
|---|---|---|---|---|---|---|---|---|
| Full augmentation set (baseline) | Ref. | Ref. | Ref. | Ref. | Ref. | Ref. | Ref. | Ref. |
| Axis-swap | −0.06 [−0.09, −0.04] | −0.05 | −6% | +8 [6, 11] | 1.1 | <0.01 | +5 | 0.95 |
| Sensor-drift | −0.04 [−0.06, −0.02] | −0.03 | −4% | +10 [7, 13] | 1.2 | <0.01 | +7 | 0.90 |
| Jitter | −0.02 [−0.04, 0.00] | −0.01 | −1% | +3 [1, 6] | 0.6 | 0.04 | +2 | 0.55 |
| Magnitude scaling | −0.01 [−0.03, 0.00] | −0.01 | −1% | +2 [0, 5] | 0.4 | 0.07 | +1 | 0.45 |
| Time-warp | −0.03 [−0.05, −0.01] | −0.02 | −3% | +12 [9, 15] | 1.0 | <0.01 | +6 | 0.70 |
| Train/Test Config | Model | F1 ↑ | Precision ↑ | Recall ↑ | MAE (ms) ↓ | ΔF1 | ΔMAE (ms) | Rel. ΔF1 (%) | p-Value | Cohen’s d |
|---|---|---|---|---|---|---|---|---|---|---|
| Wrist → Thigh | Supervised (Ref.) | 0.62 | 0.64 | 0.61 | 39 | 0.00 | 0 | 0% | N/A | N/A |
| SSL- pretrained | 0.74 | 0.76 | 0.73 | 26 | +0.12 | –13 | +19% | 0.002 | 1.25 | |
| Pocket → Hand | Supervised (Ref.) | 0.65 | 0.66 | 0.64 | 36 | 0.00 | 0 | 0% | N/A | N/A |
| SSL- pretrained | 0.78 | 0.79 | 0.77 | 24 | +0.13 | –12 | +20% | 0.001 | 1.40 | |
| Wrist → | Supervised (Ref.) | 0.60 | 0.62 | 0.59 | 42 | 0.00 | 0 | 0% | N/A | N/A |
| SSL- pretrained | 0.72 | 0.74 | 0.71 | 27 | +0.12 | –15 | +20% | 0.003 | 1.31 | |
| Hand → Thigh | Supervised (Ref.) | 0.63 | 0.65 | 0.62 | 37 | 0.00 | 0 | 0% | N/A | N/A |
| SSL- pretrained | 0.76 | 0.77 | 0.75 | 25 | +0.13 | –12 | +21% | 0.002 | 1.28 | |
| Thigh → | Supervised (Ref.) | 0.61 | 0.62 | 0.60 | 40 | 0.00 | 0 | 0% | N/A | N/A |
| SSL- pretrained | 0.73 | 0.74 | 0.72 | 28 | +0.12 | –12 | +20% | 0.004 | 1.22 | |
| Pocket → Thigh | Supervised (Ref.) | 0.64 | 0.65 | 0.63 | 38 | 0.00 | 0 | 0% | N/A | N/A |
| SSL- pretrained | 0.77 | 0.78 | 0.76 | 25 | +0.13 | –13 | +20% | 0.001 | 1.36 |
| Sampling Rate | Eval Regime | F1 (±95% CI) | AUROC (±95% CI) | Step MAE (ms) | Stride MAE (ms) | ΔF1 SSL–Sup | Δ vs. 50 Hz SSL (F1) | Epochs |
|---|---|---|---|---|---|---|---|---|
| 25 Hz | Supervised | 0.75 [0.73–0.77] | 0.83 [0.81–0.85] | 19 | 25 | N/A | N/A | 42 |
| 25 Hz | SSL Linear Probe | 0.80 [0.78–0.82] | 0.87 [0.85–0.89] | 16 | 22 | +0.05 | −0.04 | 31 |
| 25 Hz | SSL Few-shot (10%) | 0.82 [0.80–0.84] | 0.88 [0.86–0.90] | 15 | 20 | +0.07 | −0.02 | 30 |
| 50 Hz | Supervised | 0.78 [0.76–0.80] | 0.85 [0.83–0.87] | 17 | 23 | N/A | N/A | 40 |
| 50 Hz | SSL Linear Probe | 0.84 [0.82–0.86] | 0.90 [0.88–0.92] | 14 | 19 | +0.06 | Ref. | 28 |
| 50 Hz | SSL Few-shot (10%) | 0.86 [0.84–0.88] | 0.91 [0.89–0.93] | 13 | 18 | +0.08 | Ref. | 27 |
| 100 Hz | Supervised | 0.79 [0.77–0.81] | 0.86 [0.84–0.88] | 16 | 22 | N/A | N/A | 41 |
| 100 Hz | SSL Linear Probe | 0.85 [0.83–0.87] | 0.91 [0.89–0.93] | 13 | 18 | +0.06 | +0.01 | 29 |
| 100 Hz | SSL Few-shot (10%) | 0.87 [0.85–0.89] | 0.92 [0.90–0.94] | 12 | 17 | +0.08 | +0.01 | 28 |
| Backbone | Training Type | F1 (±95% CI) | AUROC (±95% CI) | MAE (ms ±95% CI) | Params (M) | Latency (ms) | Cohen’s d | p-Value |
|---|---|---|---|---|---|---|---|---|
| CNN | Supervised baseline | 0.78 [0.76–0.80] | 0.86 [0.84–0.88] | 21 [19–23] | 1.2 | 6 | Ref. | Ref. |
| SSL Pretrained | 0.84 [0.82–0.86] | 0.90 [0.88–0.92] | 16 [14–18] | 1.2 | 6 | 1.0 | <0.01 | |
| TCN | Supervised baseline | 0.80 [0.78–0.82] | 0.87 [0.85–0.89] | 20 [18–22] | 2.5 | 9 | Ref. | Ref. |
| SSL Pretrained | 0.86 [0.84–0.88] | 0.91 [0.89–0.93] | 15 [13–17] | 2.5 | 9 | 1.1 | <0.01 | |
| BiLSTM | Supervised baseline | 0.79 [0.77–0.81] | 0.86 [0.84–0.88] | 22 [20–24] | 3.1 | 11 | Ref. | Ref. |
| SSL Pretrained | 0.85 [0.83–0.87] | 0.90 [0.88–0.92] | 17 [15–19] | 3.1 | 11 | 1.0 | <0.01 | |
| Transformer | Supervised baseline | 0.82 [0.80–0.84] | 0.88 [0.86–0.90] | 19 [17–21] | 6.8 | 15 | Ref. | Ref. |
| SSL Pretrained | 0.88 [0.86–0.90] | 0.92 [0.90–0.94] | 14 [12–16] | 6.8 | 15 | 1.2 | <0.001 |
| Scenario | Model | F1 (HS) Median [IQR] | F1 (TO) Median [IQR] | MAE (HS, ms) Median [IQR] | MAE (TO, ms) Median [IQR] | ΔF1 (HS) (95% CI) | ΔF1 (TO) (95% CI) | ΔMAE (HS, ms) (95% CI) | ΔMAE (TO, ms) (95% CI) | p adj |
|---|---|---|---|---|---|---|---|---|---|---|
| Overall (all datasets) | Supervised | 0.78 [0.72–0.82] | 0.75 [0.70–0.80] | 29 [25–33] | 32 [27–36] | N/A | N/A | N/A | N/A | N/A |
| SSL (ours) | 0.90 [0.86–0.93] | 0.84 [0.79–0.88] | 19 [16–22] | 24 [21–27] | +0.12 (0.09–0.15) | +0.09 (0.06–0.12) | −10 (−12, −8) | −8 (−10, −6) | <0.001 | |
| Device shift | Supervised | 0.76 [0.71–0.81] | 0.73 [0.68–0.78] | 31 [27–36] | 34 [29–39] | N/A | N/A | N/A | N/A | N/A |
| SSL (ours) | 0.88 [0.84–0.91] | 0.82 [0.77–0.86] | 21 [18–25] | 26 [23–30] | +0.12 (0.08–0.15) | +0.09 (0.05–0.12) | −10 (−13, −7) | −8 (−11, −6) | <0.001 | |
| Placement shift | Supervised | 0.74 [0.69–0.79] | 0.71 [0.66–0.76] | 33 [29–38] | 36 [31–41] | N/A | N/A | N/A | N/A | N/A |
| SSL (ours) | 0.86 [0.82–0.90] | 0.80 [0.75–0.84] | 23 [20–27] | 28 [24–32] | +0.12 (0.08–0.15) | +0.09 (0.05–0.12) | −10 (−13, −8) | −8 (−10, −6) | <0.001 | |
| Sampling-rate shift | Supervised | 0.77 [0.72–0.81] | 0.74 [0.69–0.78] | 30 [26–35] | 33 [28–37] | N/A | N/A | N/A | N/A | N/A |
| SSL (ours) | 0.89 [0.85–0.92] | 0.83 [0.78–0.87] | 20 [17–23] | 25 [21–28] | +0.12 (0.09–0.15) | +0.09 (0.06–0.12) | −10 (−12, −8) | −8 (−10, −6) | <0.001 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Mănescu, A.M.; Mănescu, D.C. Self-Supervised Gait Event Detection from Smartphone IMUs for Human Performance and Sports Medicine. Appl. Sci. 2025, 15, 11974. https://doi.org/10.3390/app152211974
Mănescu AM, Mănescu DC. Self-Supervised Gait Event Detection from Smartphone IMUs for Human Performance and Sports Medicine. Applied Sciences. 2025; 15(22):11974. https://doi.org/10.3390/app152211974
Chicago/Turabian StyleMănescu, Andreea Maria, and Dan Cristian Mănescu. 2025. "Self-Supervised Gait Event Detection from Smartphone IMUs for Human Performance and Sports Medicine" Applied Sciences 15, no. 22: 11974. https://doi.org/10.3390/app152211974
APA StyleMănescu, A. M., & Mănescu, D. C. (2025). Self-Supervised Gait Event Detection from Smartphone IMUs for Human Performance and Sports Medicine. Applied Sciences, 15(22), 11974. https://doi.org/10.3390/app152211974
