Next Article in Journal
A Wearable Inertial Sensor Approach for Locomotion and Localization Recognition on Physical Activity
Previous Article in Journal
Real-Time Walk Error Compensation Method Using Echo Signal Magnitude Measurement in ToF Laser Scanners
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Estimation of Gait Parameters for Adults with Surface Electromyogram Based on Machine Learning Models

1
Department of Computer Science and Information Engineering, Chaoyang University of Technology, Taichung City 41349, Taiwan
2
Department of Biomedical Engineering, I-Shou University, Kaohsiung 82445, Taiwan
3
Department of Golden-Ager Industry Management, Chaoyang University of Technology, Taichung City 41349, Taiwan
4
Division of Information Systems, School of Computer Science and Engineering, The University of Aizu, Aizu-Wakamatsu City 965-8580, Fukushima, Japan
*
Authors to whom correspondence should be addressed.
Sensors 2024, 24(3), 734; https://doi.org/10.3390/s24030734
Submission received: 16 October 2023 / Revised: 18 January 2024 / Accepted: 22 January 2024 / Published: 23 January 2024
(This article belongs to the Section Biomedical Sensors)

Abstract

:
Gait analysis has been studied over the last few decades as the best way to objectively assess the technical outcome of a procedure designed to improve gait. The treating physician can understand the type of gait problem, gain insight into the etiology, and find the best treatment with gait analysis. The gait parameters are the kinematics, including the temporal and spatial parameters, and lack the activity information of skeletal muscles. Thus, the gait analysis measures not only the three-dimensional temporal and spatial graphs of kinematics but also the surface electromyograms (sEMGs) of the lower limbs. Now, the shoe-worn GaitUp Physilog® wearable inertial sensors can easily measure the gait parameters when subjects are walking on the general ground. However, it cannot measure muscle activity. The aim of this study is to measure the gait parameters using the sEMGs of the lower limbs. A self-made wireless device was used to measure the sEMGs from the vastus lateralis and gastrocnemius muscles of the left and right feet. Twenty young female subjects with a skeletal muscle index (SMI) below 5.7 kg/m2 were recruited for this study and examined by the InBody 270 instrument. Four parameters of sEMG were used to estimate 23 gait parameters. They were measured using the GaitUp Physilog® wearable inertial sensors with three machine learning models, including random forest (RF), decision tree (DT), and XGBoost. The results show that 14 gait parameters could be well-estimated, and their correlation coefficients are above 0.800. This study signifies a step towards a more comprehensive analysis of gait with only sEMGs.

1. Introduction

Examination and comprehension of human movement have been enduring areas of scientific investigation. Gait parameters, which encompass intricate patterns and exhibit distinctive characteristics during the locomotion of walking and running, assume a pivotal position within a wide array of disciplines [1]. These disciplines span the domains of biomechanics and rehabilitation medicine, extending to sports science and human–computer interaction [2]. The precise prediction and assessment of these parameters have the potential to assess the identification of deviations from normal movement patterns in people with neurological diseases (like Parkinson’s), cardiopathies, sequelae from stroke, and diseases caused by aging and provide the treating physicians with the particular treatment programs [3,4,5]. Additionally, such predictions and assessments can contribute to the development of more productive assistive devices and the development of therapeutic interventions [6].
A surface electromyogram (sEMG) is used to record the electrical activity produced by muscle contraction, examine neuromuscular activity, and assist in the interpretation of kinematic and kinetic information [7]. In the intricacies between the neuromuscular system and motion, sEMG offers significant data pertaining to activation patterns, muscular strength, and the phenomenon of fatigue [8]. In the past, researchers have investigated the complex association between sEMG characteristics and gait measures using conventional statistical techniques, which have generally provided only partial or fragmented understanding [9,10]. Numerous studies have substantiated that the onset of sarcopenia can be discerned through the use of sEMG techniques or via the analysis of gait parameters [11]. However, it is pertinent to note that the examination of gait parameters primarily elucidates the repercussions of lower extremity muscle malfunction. It shows limited insights into the comprehensive gait effects precipitated by muscle dysfunction. In contrast, analyzing the lower extremity through sEMG predominantly offers data pertaining to the muscle activation status during dysfunction without offering substantial information on the consequential impacts on gait [12].
The traditional gait analysis uses charge-coupled device (CCD) cameras to capture the three-dimensional kinematic dynamics in temporal and spatial domains while a subject walks on a walkway [13,14]. sEMG is usually measured to determine whether the muscle is active or inactive during walking and to help in the evaluation of its dynamic power and condition. However, this system can only be used in a motion laboratory, and subjects cannot naturally walk on a general road. Agostini et al. proposed the problems for sEMG applied to gait analysis [9]. The limitations of sEMG are a lack of reference patterns and poor repeatability, reproducibility, and reliability [11]. Now, the shoe-worn GaitUp Physilog® (Gait Up SA., CH-1015, Renens, Switzerland) wearable inertial sensors are a powerful alternative to traditional gait assessment in the motion laboratory [15,16]. Subjects wearing the shoe-worn sensors on the shoes can naturally walk on the general ground, and the temporal–spatial parameters of gaits are measured in real time.
Recent advances in machine learning (ML) have shown promising results in healthcare analytics, offering a more efficient approach for predicting and understanding complex medical conditions [17,18]. Moreover, ML models have been widely used in physiological measurements for estimating physiological parameters such as blood pressure [19,20], muscle mass [21], calories [22,23], glucose [24], and left ventricular ejection fraction [25]. In the classification applications, ML models performed the signal qualities of an electrocardiogram [26] and a photoplethysmogram [27,28], arrythmia detection [29], and risky activities in daily life [30]. When using an ML method to process the regression or classification problem, searching for the major features and finding the appropriate ML algorithms will depend on the collected data [31,32]. The processing of features is an important issue that can directly affect the performance of the ML model. The more accurate the features, the higher the performance of the ML model. Although some traditional statistical analysis methods have good results for clinical prediction in some cases, ML methods reignite interest in exploiting these fields [33,34].
As mentioned previously, the gait analysis not only includes the temporal and spatial parameters of the lower limbs but also needs the dynamic information of these muscles. Although shoe-worn wearable sensors measure the parameters of gaits, they cannot measure the electrical activities of muscles. A 3D motion camera system, although the most accurate, is bulky and costly. It is not compatible with daily life activities. Now, some textile integrations are replaced with surface electrodes for long-term bioelectronic potential measurement. Thus, the sEMG has the chance to be compatible with daily life activities.
Thus, the aim of this study is to explore the temporal and spatial parameters of gaits measured by sEMGs of the lower limbs with the ML methods. The sEMGs were measured from two major muscles of two feet, the vastus lateralis and the gastrocnemius muscles, and the commercial gait analysis system, GaitUp Physilog® 5, measures the gait parameters as the reference. A wireless sEMG device was designed in this study to measure the sEMGs when subjects were running on the treadmill. Six features of sEMG were used to estimate 23 gait parameters by 3 ML models, namely decision tree (DT), random forest (RF), and XGBoost. Twenty female subjects participated in this study whose skeletal mass index (SMI) was below 5.7 kg/m2. The results showed that the performances of RF and XGBoost are almost equal. The Pearson correlation coefficients (PCCs) of fourteen gait parameters are above 0.800. The proposed method has the potential advantage of serving as a wearable gait analysis system with only sEMGs.

2. Materials and Methods

The layout of the proposed system is shown in Figure 1. The designed sEMG measurement system uses the XBee module (Digi International XBee®, Hopkins, MD, USA) to transmit four-channel sEMG signals to the personal computer (PC). The GaitUp Physilog® 5 sensors are fixed on the shoes. Initially, the captured sEMG signals are processed through a digital filter to remove the noise as the raw sEMG (Raw_sEMG), followed by a one-order discrete wavelet transform to extract the high-frequency component (DWTH_sEMG). Then, six features are, respectively, extracted from the Raw_sEMG and DWTH_sEMG. The gait parameters are estimated by the ML models. Moreover, the models with better performances would be optimized to enhance their performances. Finally, the results are analyzed by the statistical evaluation methods.

2.1. Data Measurement Devices

The devices utilized in this research include a self-built sEMG apparatus and a pair of shoe-worn GaitUp Physilog® wearable inertial sensors.

2.1.1. GaitUp Physilog® 5 System

Figure 2 shows the GaitUp Physilog® analysis tool, including the shoe-worn GaitUp, specifically the Physilog® 5 wearable inertial sensor (Figure 2a), and the analysis software (Figure 2b) [35]. This supplemental tool aids in monitoring walking patterns by garnering precise quantitative data through pre- and post-gait evaluations. This facilitates medical practitioners and therapists in identifying areas of gait weakness and potential hazards associated with the subjects. Within the scope of this study, the GaitUp Lab gait analysis system was employed to measure the individual 23 gait parameters of the left and right feet, which include 9 temporal parameters, 9 spatial parameters, 4 ground clearance analyses, and a singular turn analysis. Table 1 shows the descriptions of 23 gait parameters.

2.1.2. sEMG Measuring Device

We developed a multi-channel wireless sEMG monitoring device, as depicted in Figure 3. Two boards (slave and master) employ the XBEE S2C modules for data exchange between them with a transmission rate of 1000 bytes/s. Additionally, the master board incorporates the HC-05 Bluetooth module to forward the acquired sEMG data to a personal computer (PC) with a transmission rate of 500 bytes/s. The board has two channels of two sEMGs and an accelerometer (not used in this study). The sEMG circuit inherits the study of Liu et al. [36]. The device’s microcontroller is based on the Texas Instruments (TI) MSP430F5438A, whose analog-to-digital resolution (ADC) is 12 bits. Thus, the package transmitted to the PC terminal has 2 head bytes and 20 data bytes. In Figure 4, the time sequence of the four sEMG signals measured by the device during walking is presented, spanning a duration of 5 s. These sEMGs show the biomechanical movements associated with ambulation, explicitly focusing on the contracting strengths of the thigh and calf muscles for both the left and right lower extremities.

2.2. Data Processing

Figure 5 depicts a flowchart of the data processing architecture, illustrating the process of handling data in the study. Initially, the recoded sEMG data were subjected to digital filtering to eliminate noise, resulting in Raw_sEMG. Subsequently, the decomposition of DWT yielded DWTH_sEMG. Data segmentation was then performed on both the Raw_sEMG and DWTH_sEMG signals, followed by the calculation of time and frequency domain parameters. The data from shoe-worn GaitUp Physilog® 5 wearable inertial sensors were segmented and processed via GaitUp Lab (Gait Up SA., CH-1015, Renens, Switzerland) software to extract gait parameters as the reference.

2.2.1. Digital Filtering

A median filter was deployed to neutralize the spike disturbances originating from wireless communication packet discrepancies. The Raw_sEMG was then processed by a 4th-order Butterworth bandpass filter with a 20–240 Hz passband to remove the wandering baseline and high-frequency noise.

2.2.2. Discrete Wavelet Transform

Existing studies highlighted that the high-frequency component of sEMG exhibited enhanced sensitivity for monitoring muscle fatigue under dynamic contraction [8]. The DWT is a widely utilized technique for processing data and is noted for its ability to separate high- and low-frequency components from a discrete signal [37]. Based on these findings, this study employed the primary DWT to extract the high-frequency component, DWTH_sEMG.

2.2.3. Data Segmentation

The recorded data was 6 min in time, or 180,000 points for both Raw_sEMG and DWTH_sEMG. The initiation of data segmentation was aligned with the inception of intentional muscle contraction noted in the sEMG recordings, coupled with the commencement of activity noted within the GaitUp Lab, serving as the benchmark for the onset of data cutting. To mitigate the effects of gait variations at the experiment’s start and end, this study excluded 7.5 s from both the beginning and the end of the signal. This led to a total elimination of 15 s from the signal, resulting in a post-deletion signal length of 172,500 points. Regarding data segmentation, both the sEMGs and the shoe-worn GaitUp Physilog® 5 wireless sensors were configured to segment the data into batches, each lasting 30 s, with a window shift of 15 s. This setup allowed for the extraction of 22 sample sets from every experiment, as shown in Figure 6. The data recorded by GaitUp Physilog® 5 were segmented and processed using the GaitUp Lab software to yield gait parameters.

2.2.4. sEMG Parameters

This study incorporated the common sEMG parameters, as defined in previous studies [38,39,40,41]. The mean frequency (MF) was derived from the product of the signal’s power spectrum and its frequency. This resultant value was then divided by the cumulative sum of the power spectrum, as shown in Equation (1):
M F = j = 1 M f j p j / j = 1 M p j
where f j represents the jth frequency of sEMG relative to the jth power spectrum, p j , and M is the number of discrete frequency harmonics.
The median frequency (MDF) separates the signal’s power spectrum into two equal portions, as shown in Equation (2).
j = 1 M D F p j = j = M D F 1 M p j = 1 2 j = 1 M p j .
The standard deviation (STD) formula is shown in Equation (3).
S T D = 1 2 i = 1 N ( x i x ¯ ) 2 ,
where x is sampled data and N is the number of samples.
The root mean square (RMS) is given in Equation (4), and the average power (AP) in Equation (5).
R M S = i = 1 n x i 2 n .
A P = 1 T j = 1 T x j 2 ,
Sample entropy (SampEn) is a parameter used to assess the regularity and complexity of a time series. A higher entropy value indicates increased complexity in the time series. A signal of length N is denoted as:
X = { x 1 , x 2 , x 3 , , x N } .
Computing sample entropy necessitates specifying the embedding dimension m and the tolerance value r. Once the dimension m is established, the signal X can be divided into several segments, denoted by X m :
X m i = { x i , x i + 1 , x i + 2 , , x i + m 1 } .
Equation (8) represents the sample entropy.
S a m p E n = l n A B ,
where
  • A = d X m + 1 i ,   X m + 1 j < r ,
  • B = d X m i ,   X m j < r .
  • d […] indicates the Euclidean distance or Manhattan distance?
In this study, we set m = 2 and r = STD(sEMG) × 0.2. Thus, Raw_sEMG and DWTH_sEMG signals from both the thigh and calf were extracted, resulting in a total of 24 sEMG parameters for a foot.

2.3. Experiment Protocol

In the study, all participants were voluntary and selected from adults diagnosed with sarcopenia symptoms, possessing healthy limbs, and exhibiting standard standing postures. The cohort consisted of 20 female subjects aged from 19 to 23 years, with a mean ± standard deviation of 20 ± 1 years. The average height and weight were 156 ± 4.6 cm (ranging from 149 to 164 cm) and 45.9 ± 5.7 kg (spanning 31 to 56 kg), respectively. The muscle mass of the right foot was 4.95 ± 0.68 kg (ranging from 3.31 to 6.15 kg), the muscle mass of the left foot was 4.94 ± 0.68 kg (ranging from 3.24 to 6.03 kg), and the skeletal muscle index (SMI) was 5.07 ± 0.49 kg/m2 (ranging from 3.5 to 5.7 kg/m2). Furthermore, the subjects had an average shoe size of 23.9 ± 0.6, with sizes ranging from 23 to 25 cm. This experiment was approved by the Research Ethics Committee of Chung Shan University Hospital (No. CS2-22210) in Taichung City, Taiwan.
In the beginning, the participants’ muscle masses in their feet and their SMI were measured using the InBody270 body composition analyzer (InBody Co. Ltd. Korea). The participants were recruited as subjects when their SMIs were below 5.7 kg/m2 for females and 7.0 kg/m2 for males [42]. The study employed a self-designed multi-channel wireless sEMG device for sEMG recordings in conjunction with the shoe-worn GaitUp Physilog® 5 wearable inertial sensors to gather gait metrics. Prior to partaking in the experimental procedures, subjects were required to self-evaluate their overall physical health. This research applied the following procedures to gather data for the study:
  • In the sEMG measurement, there were four distinct sets of electrodes. These are individually adhered to the vastus lateralis and gastrocnemius muscle dips and plateau regions related to the presence of an innervation zone on each foot [11]. As shown in Figure 7, the reference electrode is positioned on the knee-facing side of the rectus femoris muscle (shown in Figure 7a,b). We avoided the belly position of the muscle and shifted the electrodes to a higher position. The surface electrodes used for the sEMG recording were Ag/AgCl with a 10 mm diameter on self-adhesive supports. The positions of the electrodes for each subject were recorded, and the electrodes were placed at the same position in the subsequent experiments. The shoe-worn GaitUp Physilog® 5 wearable inertial sensors were placed on the tongue area of both shoes, as shown in Figure 7c. It was essential to ascertain that the electrodes and sensors remained secure, preventing any potential loosening from movement that could compromise data integrity.
  • The subjects ran on the treadmill at a speed of 5 km per hour for a duration of 6 min.
  • The subjects would evaluate their physical state and make necessary modifications during each experiment, all ensuring at least a 10 min break.
  • Subjects were measured four times. The interval between any two measurement sessions is at least a week.

2.4. Regression Model

In this study, the regression model training was facilitated using a system outfitted with an i7-10700 processor, an NVIDIA GeForce RTX3070 (NVIDIA, Santa Clara, CA, USA) graphics card, and 64 GB of memory at 2933 MHz. To predict the gait parameters, the study employed three prevalent nonlinear regression models, such as DT, RF, and XGBoost, all of which were developed by Python.

2.4.1. Decision Tree

A decision tree (DT) [43,44] is a model characterized by a hierarchical tree framework. Each node within the tree signifies various choices, leading sequentially to anticipated outcomes. Due to their distinctive configuration, DTs are markedly clear and facilitate easy comprehension of the model’s predictive guidelines. Consequently, DT has emerged as one of the most renowned nonlinear regression models. DT is noted for its proficiency in nonlinear regression tasks, adeptly modeling intricate relationships between inputs and outputs without necessitating a linear correlation.

2.4.2. Random Forest

The random forest tree (RF) [45] model is an advanced method of the conventional DT technique, leveraging the Random Subspace Learning algorithm to formulate trees characterized by low inter-correlations. Consequently, the RF [46] strategy amalgamates a multitude of low-correlation DTs, culminating in a substantial decision forest. Due to the enhanced randomness introduced within the model structure, it is observed that models conceived through the RF tree approach exhibit heightened robustness, thereby mitigating the propensity for overfitting, which underscores the efficacy of this methodology in complex predictive analytics.

2.4.3. XGBoost

XGBoost (Extreme Gradient Boosting) [47] stands as an improved version of the Gradient Boosting technique. Bearing structural resemblances to a DT, it amalgamates numerous weak DTs to craft a potent predictive model. When compared with conventional classification and regression techniques, XGBoost typically exhibits superior accuracy [48]. Given its robust adaptive learning capabilities, XGBoost retains its prominence as a frequently employed classification and regression model in contemporary research and on competitive platforms.

2.5. Statistical Analysis

The quantitative data are expressed as the mean ± standard deviation. A computation of the PCC, r, is performed to establish the relationship between the predicted values and the target values found in the test data. This correlation coefficient is delineated in Equation (9). Here, m x represents the average of the x-vectors and m y denotes the average of the y-vectors.
r = x m x y m y x m x 2 y m y 2

3. Results

Following the experiment, the total number of experiments executed by subjects was 68. The number of samples extracted from the sEMGs was 1496 for the left and right legs, respectively. Some samples missed the target output extracted by the GaitUp Lab software. There were 1477 viable samples for the left and right legs, respectively. A sample has 24 sEMG parameters: 12 parameters for Raw_sEMG and DWTH_sEMG of calf muscle and 12 for thing muscle. We methodically structured the samples into training and testing sets, adhering to a proportion of 8:2. Therefore, the numbers of training and testing samples were 1181 and 296, respectively. There were two models, left and right legs, for each gait parameter. Thus, the number of total models was 46.
Table 2 presents the statistical analysis of gait parameters for all samples, illustrating the minimum value, which indicates the lowest recorded point, and the maximum value, which signifies the highest point observed. Moreover, it also delineates the mean value, which is the average of all the collected data, along with the standard deviation, which showcases the extent of variation or dispersion of the set of values within the samples.

3.1. Selection of Gait Parameters

We hypothesized that the sEMG signals could not simply estimate all gait parameters. Thus, the XGBoost regression model was used to estimate the gait parameters of two feet. Table 3 shows the PCCs of all gait parameters. The PCCs between 0.7 and 1.0 (−0.7 and −1.0) indicate a strong positive (negative) linear relationship through the regression model [49]. Table 3 shows the correlation coefficients of all gait parameters. We defined two rules to determine what gait parameters could be estimated. The first rule was that the coefficients’ difference between two feet should not exceed 0.05. The second rule was that both the left and right feet must exhibit coefficients greater than 0.8 simultaneously. In Table 3, the gait parameters fitting these rules are marked with bold asterisks. There are six temporal parameters: gait cycle time, double-foot stance period, stride frequency, weight-bearing reaction period, mid-foot landing period, and propulsion period. Additionally, there are six spatial parameters: step length, stride length, maximum swing speed, toe-off ground angle, swing width, and three-dimensional walking length. The ground clearance analysis includes two parameters: the minimum toe clearance height and the maximum toe clearance height. In the initial phase of the study, the gait parameters requiring estimation are delineated.

3.2. Optimal sEMG Parameters

Normalizing feature importance can correct the bias in feature importance [50]. The method repeats permutations of the outcome vector to estimate the distribution of measured importance for each parameter. The p-value of the observed importance provides a corrected measure of parameter importance. In Table 4, we define the significance attributions associated with each sEMG parameter when it pertains to the efficacy of the established regression model. Remarkably, the statistical analysis reveals an absence of correlation, evidenced by a zero score, between the root mean square value and the average power. Thus, in the second phase, the MF, MDF, STD, and SampEn would be selected for the following assessment of the three ML models.

3.3. Regression Models

The DT, RF, and XGBoost were used to estimate the gait parameters. Table 5 shows the coefficients of 14 gait parameters for the left and right feet estimated by the three models. The RF model has the best performance, coefficients are 0.920 ± 0.016 and 0.916 ± 0.018 for left and right feet, respectively. The DT has the worst performance, and coefficients are 0.806 ± 0.033 and 0.774 ± 0.040 for left and right feet. The coefficients of XGBoost are 0.920 ± 0.016 and 0.916 ± 0.018 for the left and right feet, respectively. We find the performances of the RF and XGBoost models very close. Thus, the two models would be evaluated in the next phases.

3.4. Optimize Hyperparameters

In order to optimize DT, RF, and XGBoost models, the Optuna algorithm was used to fine-tune important hyperparameters [51]. The optimal hyperparameter configurations by Optuna and the respective range values established for these regression models are shown in Table 6.

3.5. Cross-Validation

In this phase, a five-fold cross-validation was used to examine the potential predictive bias presented by constant input samples during the model training phase [52]. This method facilitated the evaluation of the influences of varying training and test samples on the predictive outcomes generated by the trio of models under study. As explained in Table 7, the statistical analysis of the DT, RF, and XGBoost models shows the RF model demonstrating superior efficacy (0.915 ± 0.006), while the DT model lagged, exhibiting the worst performance (0.796 ± 0.017).

3.6. Training and Prediction Time

In this study, the efficiency of the optimized regression model was not only evaluated by the correlation coefficient but also considered both the training and prediction times, with the detailed findings showcased in Table 8. A notable observation is that the DT model exhibits a considerably shorter training time, 0.02 s, when compared with the RF and XGBoost models, 0.34 and 2.10 s, respectively. Furthermore, a closer time for the prediction underscores a distinct variance in performance across the models. Remarkably, both the RF and DT models manage to complete predictions in less than 0.01 s, presenting a favorable outlook in terms of speed and efficiency. In contrast, the XGBoost model lags behind, consuming a more substantial amount of time to finalize predictions compared to its counterparts.

3.7. Model Size

The other important point for this study is the memory space of the model because the proposed system will be deployed on an edge computing platform in the future. Table 9 shows the sizes of three models. A discernible difference is observed in the memory space occupied by each model. The empirical findings indicate that the DT model occupies a minimal memory space of 56 KB, which is considerably smaller than that required by the other two models. RF needs the most memory space, 11,045 kB, which is difficult to implement in the edge computing environment.

4. Discussion

The use of gait analysis in clinical practices began three decades ago. Inman et al. proposed the complete study of gait kinematics in 1981 [53]. Muro-de-la-Herran et al. explored a number of articles based on gait analysis methods and clinical applications from 2012 and 2013. The total number was 32; 40% of these articles were related to traditional CCD cameras systems, 37.5% presented inertial sensor-based systems, and the remaining 22.5% corresponded to other wearable sensor systems [5]. The number of nonwearable systems was less than that of wearable systems because the potential benefit of wearable systems would be that they can be operated in nonmedical environments, like the home and care center for the elderly. Now the wearable systems for the gait analysis are all off-line systems. The wearable devices only record the signals, and the parameters of gaits are calculated by their algorithms in the computer. Moreover, Liu et al. have proposed a sEMG patch [36]. In this study, we used the ML models to calculate these parameters. If the ML model is implemented in a microcontroller with the edge computing technique, the wearable system shows these parameters in real time. Therefore, the treating physicians can adjust the walking posture of the patient or modify the treatments.
The sEMG records muscle electrical activity, which has been identified as a potential biomarker for numerous neuromuscular disorders [54]. By capturing the subtle nuances in muscle activation and deactivation phases, sEMG provides a granular snapshot of muscle function. Moreover, the gait analysis belongs to the outcome of mechanic activity, which includes the temporal and spatial gait parameters and the information of joint kinematics measured by the accelerometer and gyroscope [55]. Thus, the gait analysis using the sEMG information will add value to the traditional gait analysis in clinical practice [56,57]. In this study, we successfully predicted the 14 gait parameters with the sEMGs of two feet using ML models. The best correlation coefficient approached 0.915 ± 0.006, which was regressed by the RF model. Thus, the sEMG signals could support both information about gait and muscle activity.
In this study, we focused on the collection and analysis of raw sEMG (Raw_sEMG) and DWT1 sEMG_H (DWTH_sEMG) signals from the thigh and calf of the left and right feet. Our efforts and methodologies allowed us to successfully derive four distinct sEMG parameters, MF, MDF, STD, and SampEn, for a single foot shown in Table 4. These parameters can potentially provide insights into muscle activity, function, and other relevant biomechanical aspects. But the RMS and AP parameters did not make any contribution. They all belong to the “energy” content of sEMG, which is hard to link to the muscle force, reflecting the degree of activation of skeletal muscles [58]. Thus, they did not have a relationship with the gait parameters.
In Table 3, the lowest PCC is 0.702. This result should represent that the sEMGs used to estimate the gait parameters are a potential research. Thus, the sEMGs have the potential benefit of approaching gait analysis with the ML technique. Moreover, we analyzed the reasons for the best and worst performances for estimating gait parameters according to the histograms of gait parameters. First, the more normal the distribution, the better the performance. Figure 8 shows the histograms of the “Cadence” and “Stance” parameters, whose coefficients are 0.913 and 0.909 vs. 0.917 and 0.780. The distribution of the “Stance” parameter is a disturbance, but for the “Cadence” parameter, it is the normal distribution. Second, the greater the difference between the gait-parameter distributions of left and right feet, the larger the difference between the estimating correlation coefficients. Figure 9 shows the histograms of the “DS” parameter and the “PAVS” parameter, whose coefficients are 0.931 and 0.893 vs. 0.883 and 0.793. The distributions of the left and right feet have a significant difference. Moreover, the palpability of the “PAVS” parameter of the right foot in the 10~20% range is significantly larger than the other ranges. Thus, the coefficient of the left foot is larger than the right foot, 0.883 vs. 0.793. However, the “Speed” parameter is an exception. Figure 10 shows the histograms of its left and right feet, which have normal distributions and are very close. But the coefficients of the “Speed” parameter are 0.776 and 0.798. The reason is that walking speed directly relates to the length and frequency of steps. The sEMG parameters do not have any length and frequency information. Thus, the “Speed” parameter could be difficult to estimate well.
This study not only compares the performance of RF, XGBoost, and DT models on several metrics, including PCC, training time, and prediction time, but also considers the model size. After combining all the results, this study observed that the RF model performed well in terms of PCCs for the individual gait parameters. However, in terms of model size, the RF model took up the most memory space, which shows that it may encounter limitations on lower configuration devices or edge computing environments. In the case of applying ML models to edge computing devices, the DT model may be more suitable, although its accuracy would need to be sacrificed. On the other hand, the XGBoost model achieved intermediate performance between the other two models in terms of correlation coefficient and model size. However, according to our analysis of the training and prediction times, it can be observed that the XGBoost model requires a longer training time, and its size is also larger, needing a memory space of 5805 kB, which is hardly enough to perform in edge computing settings.
The limitation of the current study pertains to the confined size of subject’s group, which restricts the broad applicability of our findings. Larger and more diverse samples in future studies could provide results that are both statistically more significant and more generalizable across different populations. Because the subject’s SMI should be below 5.7 kg/m2 for females and 7.0 kg/m2 for males, the gender of all subjects was female. This is also another limitation of this study. Although this study encompassed several ML models and sEMG parameters, the potential of unexplored or newly developed algorithms remains untapped. These could offer superior prediction accuracy and efficiency. Looking ahead, there’s a pressing need for future studies to incorporate a broader spectrum of sEMG feature mining and motion-sensing assistant. The final aim of this study will be to focus on the elderly population in the future because the elderly have a higher risk of dysfunctional walking. When elders have an abnormal walking pattern, the quality of their lives will decrease. Thus, the real-time gait information could allow the treating physicians to modify the treatment.

5. Conclusions

The study applied three ML models to effectively estimate the gait parameters using sEMGs measured from the lower limbs. These findings present a promising step towards more personalized and effective interventions for sarcopenia adults. Because the proposed method uses only sEMG signals to estimate the gait parameters, it has the potential to develop a wearable device based on edge computing in the future. If one combines MEC (mobile edge computing) with URLLC (ultrareliable low-latency communication) in 5G/6G, instantaneous data exchange and immediate patient feedback can be realized. Moreover, despite the progress made, future research should aim to expand the dataset and explore integration with other predictive technologies for more comprehensive and reliable gait analysis in elders with walking dysfunction.

Author Contributions

Conceptualization, S.-H.L.; methodology, S.-H.L. and C.-E.T.; software, C.-E.T. and A.K.S.; validation, J.-J.W. and C.-J.C.; formal analysis, S.-H.L. and C.-E.T.; investigation, S.-H.L. and C.-E.T.; resources, S.-H.L.; data curation, C.-J.C.; writing—original draft preparation, S.-H.L. and A.K.S.; writing—review and editing, W.C. and A.K.S.; supervision, S.-H.L.; project administration, S.-H.L.; funding acquisition, S.-H.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Science and Technology Council, Taiwan, grant numbers: NSTC 111-2221-E-324-003-MY3 and NSTC 112-2221-E-214-013.

Institutional Review Board Statement

The study was conducted according to the guidelines of the Declaration of Helsinki and approved by the Research Ethics Committee of Chung-Shan University Hospital (No. CS2-22210), Taichung City, Taiwan.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

Data are contained within the article.

Acknowledgments

The authors would like to acknowledge Chaoyang University of Technology for the administrative support and thank Yu-Hsuan Huang and Christie Chang for their instances in the experiment.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Dorschky, E.; Nitschke, M.; Martindale, C.F.; van den Bogert, A.J.; Koelewijn, A.D.; Eskofier, B.M. CNN-Based Estimation of Sagittal Plane Walking and Running Biomechanics from Measured and Simulated Inertial Sensor Data. Front. Bioeng. Biotechnol. 2020, 8, 604. [Google Scholar] [CrossRef] [PubMed]
  2. Kim, J.-K.; Bae, M.-N.; Lee, K.B.; Hong, S.G. Identification of Patients with Sarcopenia Using Gait Parameters Based on Inertial Sensors. Sensors 2021, 21, 1786. [Google Scholar] [CrossRef] [PubMed]
  3. Dubois, A.; Charpillet, F. A Gait Analysis Method Based on a Depth Camera for Fall Prevention. In Proceedings of the 2014 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, Chicago, IL, USA, 26–30 August 2014; pp. 4515–4518. [Google Scholar]
  4. Prince, F.; Corriveau, H.; Hébert, R.; Winter, D.A. Gait in the Elderly. Gait Posture 1997, 5, 128–135. [Google Scholar] [CrossRef]
  5. Muro-de-la-Herran, A.; Garcia-Zapirain, B.; Mendez-Zorrilla, A. Gait Analysis Methods: An Overview of Wearable and Non-Wearable Systems, Highlighting Clinical Applications. Sensors 2014, 14, 3362–3394. [Google Scholar] [CrossRef] [PubMed]
  6. Landi, F.; Liperoti, R.; Russo, A.; Giovannini, S.; Tosato, M.; Capoluongo, E.; Bernabei, R.; Onder, G. Sarcopenia as a Risk Factor for Falls in Elderly Individuals: Results from the IlSIRENTE Study. Clin. Nutr. 2012, 31, 652–658. [Google Scholar] [CrossRef] [PubMed]
  7. Clancy, E.A.; Morin, E.L.; Hajian, G.; Merletti, R. Tutorial. Surface Electromyogram (SEMG) Amplitude Estimation: Best Practices. J. Electromyogr. Kinesiol. 2023, 72, 102807. [Google Scholar] [CrossRef] [PubMed]
  8. Liu, S.-H.; Chang, K.-M.; Cheng, D.-C. The Progression of Muscle Fatigue During Exercise Estimation with the Aid of High-Frequency Component Parameters Derived From Ensemble Empirical Mode Decomposition. IEEE J. Biomed. Health Inform. 2014, 18, 1647–1658. [Google Scholar] [CrossRef] [PubMed]
  9. Agostini, V.; Ghislieri, M.; Rosati, S.; Balestra, G.; Knaflitz, M. Surface Electromyography Applied to Gait Analysis: How to Improve Its Impact in Clinics? Front. Neurol. 2020, 11, 994. [Google Scholar] [CrossRef]
  10. Wei, W.; Tan, F.; Zhang, H.; Mao, H.; Fu, M.; Samuel, O.W.; Li, G. Surface Electromyogram, Kinematic, and Kinetic Dataset of Lower Limb Walking for Movement Intent Recognition. Sci. Data 2023, 10, 358. [Google Scholar] [CrossRef]
  11. Habenicht, R.; Ebenbichler, G.; Bonato, P.; Kollmitzer, J.; Ziegelbecker, S.; Unterlerchner, L.; Mair, P.; Kienbacher, T. Age-Specific Differences in the Time-Frequency Representation of Surface Electromyographic Data Recorded during a Submaximal Cyclic Back Extension Exercise: A Promising Biomarker to Detect Early Signs of Sarcopenia. J. Neuroeng. Rehabil. 2020, 17, 8. [Google Scholar] [CrossRef]
  12. Cheng, Y.; Li, G.; Li, J.; Sun, Y.; Jiang, G.; Zeng, F.; Zhao, H.; Chen, D. Visualization of Activated Muscle Area Based on SEMG. J. Intell. Fuzzy Syst. 2020, 38, 2623–2634. [Google Scholar] [CrossRef]
  13. Chambers, H.G.; Sutherland, D.H. A Practical Guide to Gait Analysis. J. Am. Acad. Orthop. Surg. 2002, 10, 222–231. [Google Scholar] [CrossRef] [PubMed]
  14. Chester, V.L.; Biden, E.N.; Tingley, M. Gait Analysis. Biomed. Instrum. Technol. 2005, 39, 64–74. [Google Scholar] [PubMed]
  15. Carroll, K.; Kennedy, R.A.; Koutoulas, V.; Bui, M.; Kraan, C.M. Validation of Shoe-Worn Gait Up Physilog®5 Wearable Inertial Sensors in Adolescents. Gait Posture 2022, 91, 19–25. [Google Scholar] [CrossRef] [PubMed]
  16. Mariani, B.; Rouhani, H.; Crevoisier, X.; Aminian, K. Quantitative Estimation of Foot-Flat and Stance Phase of Gait Using Foot-Worn Inertial Sensors. Gait Posture 2013, 37, 229–234. [Google Scholar] [CrossRef] [PubMed]
  17. Shehab, M.; Abualigah, L.; Shambour, Q.; Abu-Hashem, M.A.; Shambour, M.K.Y.; Alsalibi, A.I.; Gandomi, A.H. Machine learning in medical applications: A review of state-of-the-art methods. Comput. Biol. Med. 2022, 145, 105458. [Google Scholar] [CrossRef] [PubMed]
  18. Magoulas, G.D.; Prentza, A. Machine Learning in Medical Applications; SpringLink: Berlin/Heidelberg, Germany, 2001; pp. 300–307. [Google Scholar]
  19. Liu, S.-H.; Liu, L.-J.; Pan, K.-L.; Chen, W.; Tan, T.-H. Using the Characteristics of Pulse Waveform to Enhance the Accuracy of Blood Pressure Measurement by a Multi-Dimension Regression Model. Appl. Sci. 2019, 9, 2922. [Google Scholar] [CrossRef]
  20. Xu, Z.; Liu, J.; Chen, X.; Wang, Y.; Zhao, Z. Continuous Blood Pressure Estimation Based on Multiple Parameters from Eletrocardiogram and Photoplethysmogram by Back-Propagation Neural Network. Comput. Ind. 2017, 89, 50–59. [Google Scholar] [CrossRef]
  21. Wu, X.; Park, S. An Inverse Relation between Hyperglycemia and Skeletal Muscle Mass Predicted by Using a Machine Learning Approach in Middle-Aged and Older Adults in Large Cohorts. J. Clin. Med. 2021, 10, 2133. [Google Scholar] [CrossRef]
  22. Pouladzadeh, P.; Kuhad, P.; Peddi, S.V.B.; Yassine, A.; Shirmohammadi, S. Food Calorie Measurement Using Deep Learning Neural Network. In Proceedings of the 2016 IEEE International Instrumentation and Measurement Technology Conference, Taipei, Taiwan, 23–26 May 2016; pp. 1–6. [Google Scholar]
  23. Ruede, R.; Heusser, V.; Frank, L.; Roitberg, A.; Haurilet, M.; Stiefelhagen, R. Multi-Task Learning for Calorie Prediction on a Novel Large-Scale Recipe Dataset Enriched with Nutritional Information. In Proceedings of the 2020 25th International Conference on Pattern Recognition (ICPR), Milan, Italy, 10–15 January 2021; pp. 4001–4008. [Google Scholar]
  24. Mhaskar, H.N.; Pereverzyev, S.V.; van der Walt, M.D. A Deep Learning Approach to Diabetic Blood Glucose Prediction. Front. Appl. Math. Stat. 2017, 3, 14. [Google Scholar] [CrossRef]
  25. Liu, S.-H.; Yang, Z.-K.; Pan, K.-L.; Zhu, X.; Chen, W. Estimation of Left Ventricular Ejection Fraction Using Cardiovascular Hemodynamic Parameters and Pulse Morphological Characteristics with Machine Learning Algorithms. Nutrients 2022, 14, 4051. [Google Scholar] [CrossRef] [PubMed]
  26. Satija, U.; Ramkumar, B.; Manikandan, M.S. A Review of Signal Processing Techniques for Electrocardiogram Signal Quality Assessment. IEEE Rev. Biomed. Eng. 2018, 11, 36–52. [Google Scholar] [CrossRef] [PubMed]
  27. Chowdhury, M.H.; Shuzan, M.N.I.; Chowdhury, M.E.H.; Mahbub, Z.B.; Uddin, M.M.; Khandakar, A.; Reaz, M.B.I. Estimating Blood Pressure from the Photoplethysmogram Signal and Demographic Features Using Machine Learning Techniques. Sensors 2020, 20, 3127. [Google Scholar] [CrossRef] [PubMed]
  28. Liu, S.-H.; Wang, J.-J.; Chen, W.; Pan, K.-L.; Su, C.-H. Classification of Photoplethysmographic Signal Quality with Fuzzy Neural Network for Improvement of Stroke Volume Measurement. Appl. Sci. 2020, 10, 1476. [Google Scholar] [CrossRef]
  29. Sraitih, M.; Jabrane, Y.; Hajjam El Hassani, A. An Automated System for ECG Arrhythmia Detection Using Machine Learning Techniques. J. Clin. Med. 2021, 10, 5450. [Google Scholar] [CrossRef]
  30. Dubois, A.; Bihl, T.; Bresciani, J.-P. Identifying Fall Risk Predictors by Monitoring Daily Activities at Home Using a Depth Sensor Coupled to Machine Learning Algorithms. Sensors 2021, 21, 1957. [Google Scholar] [CrossRef]
  31. Louridas, P.; Ebert, C. Machine Learning. IEEE Softw. 2016, 33, 110–115. [Google Scholar] [CrossRef]
  32. Xin, Y.; Kong, L.; Liu, Z.; Chen, Y.; Li, Y.; Zhu, H.; Gao, M.; Hou, H.; Wang, C. Machine Learning and Deep Learning Methods for Cybersecurity. IEEE Access 2018, 6, 35365–35381. [Google Scholar] [CrossRef]
  33. Beam, A.L.; Kohane, I.S. Big Data and Machine Learning in Health Care. JAMA 2018, 319, 1317. [Google Scholar] [CrossRef]
  34. Chen, J.H.; Asch, S.M. Machine Learning and Prediction in Medicine—Beyond the Peak of Inflated Expectations. N. Engl. J. Med. 2017, 376, 2507–2509. [Google Scholar] [CrossRef]
  35. Gait Up SA Physilog—Digital Motion Analysis Platform. Available online: https://physilog.com (accessed on 2 April 2023).
  36. Liu, S.-H.; Lin, C.-B.; Chen, Y.; Chen, W.; Huang, T.-S.; Hsu, C.-Y. An EMG Patch for the Real-Time Monitoring of Muscle-Fatigue Conditions During Exercise. Sensors 2019, 19, 3108. [Google Scholar] [CrossRef] [PubMed]
  37. Mallat, S.G. A Theory for Multiresolution Signal Decomposition: The Wavelet Representation. IEEE Trans. Pattern Anal. Mach. Intell. 1989, 11, 674–693. [Google Scholar] [CrossRef]
  38. Phinyomark, A.; Thongpanja, S.; Hu, H.; Phukpattaranont, P.; Limsakul, C. The Usefulness of Mean and Median Frequencies in Electromyography Analysis. In Computational Intelligence in Electromyography Analysis—A Perspective on Current Applications and Future Challenges; InTech: London, UK, 2012. [Google Scholar]
  39. Aghamohammadi-Sereshki, A.; Bayazi, M.-J.D.; Ghomsheh, F.T.; Amirabdollahian, F. Investigation of Fatigue Using Different EMG Features. In Proceedings of the 2019 IEEE 16th International Conference on Rehabilitation Robotics (ICORR), Toronto, ON, Canada, 24–28 June 2019; pp. 115–120. [Google Scholar]
  40. Zhang, X.; Zhou, P. Sample Entropy Analysis of Surface EMG for Improved Muscle Activity Onset Detection against Spurious Background Spikes. J. Electromyogr. Kinesiol. 2012, 22, 901–907. [Google Scholar] [CrossRef] [PubMed]
  41. Crenshaw, A.G.; Karlsson, S.; Gerdle, B.; Fridén, J. Differential Responses in Intramuscular Pressure and EMG Fatigue Indicators during Low- vs. High-level Isometric Contractions to Fatigue. Acta Physiol. Scand. 1997, 160, 353–361. [Google Scholar] [CrossRef]
  42. Czartoryski, P.; Garcia, J.; Manimaleth, R.; Napolitano, P.; Watters, H.; Weber, C.; Alvarez-Beaton, A.; Nieto, A.C.; Patel, A.; Peacock, C.; et al. Body Composition Assessment: A Comparison of the DXA, InBody 270, and Omron. J. Exerc. Nutr. 2020, 3, 1–6. [Google Scholar]
  43. Kotsiantis, S.B. Decision Trees: A Recent Overview. Artif. Intell. Rev. 2013, 39, 261–283. [Google Scholar] [CrossRef]
  44. Kingsford, C.; Salzberg, S.L. What Are Decision Trees? Nat. Biotechnol. 2008, 26, 1011–1013. [Google Scholar] [CrossRef]
  45. Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
  46. Martínez-Gramage, J.; Albiach, J.P.; Moltó, I.N.; Amer-Cuenca, J.J.; Huesa Moreno, V.; Segura-Ortí, E. A Random Forest Machine Learning Framework to Reduce Running Injuries in Young Triathletes. Sensors 2020, 20, 6388. [Google Scholar] [CrossRef]
  47. Pan, S.; Zheng, Z.; Guo, Z.; Luo, H. An Optimized XGBoost Method for Predicting Reservoir Porosity Using Petrophysical Logs. J. Pet. Sci. Eng. 2022, 208, 109520. [Google Scholar] [CrossRef]
  48. Chen, T.; Guestrin, C. XGBoost: A Scalable Tree Boosting System. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, New York, NY, USA, 13–17 August 2016; ACM: New York, NY, USA, 2016; Volume 42, pp. 785–794. [Google Scholar]
  49. Ratner, B. The Correlation Coefficient: Its Values Range Between +1/−1, or Do They? J. Target. Meas. Anal. Mark. 2009, 17, 139–142. [Google Scholar] [CrossRef]
  50. Altmann, A.; Toloşi, L.; Sander, O.; Lengauer, T. Permutation Importance: A Corrected Feature Importance Measure. Bioinformatics 2010, 26, 1340–1347. [Google Scholar] [CrossRef] [PubMed]
  51. Akiba, T.; Sano, S.; Yanase, T.; Ohta, T.; Koyama, M. Optuna: A Next-Generation Hyperparameter Optimization Framework. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, Anchorage, AK, USA, 4–8 August 2019; ACM: New York, NY, USA, 2019; pp. 2623–2631. [Google Scholar]
  52. Browne, M.W. Cross-Validation Methods. J. Math. Psychol. 2000, 44, 108–132. [Google Scholar] [CrossRef] [PubMed]
  53. Pheasant, S.T. A Review of: “Human Walking”. By V.T. Inman, H.J. Ralston and F. Todd. (Baltimore, London: Williams & Wilkins, 1981.) [Pp.154.]. Ergonomics 1981, 24, 969–976. [Google Scholar] [CrossRef]
  54. Ismail, M.; Batool, S.; Haider, Z.; Khan, M.H.; Ali, I.; Haider, Z. Review on the Detection of Multiple Neuromuscular Disorder Using Electromyography. South. J. Eng. Technol. 2023, 1, 46–68. [Google Scholar]
  55. Lanovaz, J.L.; Oates, A.R.; Treen, T.T.; Unger, J.; Musselman, K.E. Validation of a Commercial Inertial Sensor System for Spatiotemporal Gait Measurements in Children. Gait Posture 2017, 51, 14–19. [Google Scholar] [CrossRef] [PubMed]
  56. Agostini, V.; Lo Fermo, F.; Massazza, G.; Knaflitz, M. Does Texting While Walking Really Affect Gait in Young Adults? J. Neuroeng. Rehabil. 2015, 12, 86. [Google Scholar] [CrossRef]
  57. Castagneri, C.; Agostini, V.; Rosati, S.; Balestra, G.; Knaflitz, M. Asymmetry Index in Muscle Activations. IEEE Trans. Neural Syst. Rehabil. Eng. 2019, 27, 772–779. [Google Scholar] [CrossRef]
  58. Disselhorst-Klug, C.; Schmitz-Rode, T.; Rau, G. Surface Electromyography and Muscle Force: Limits in SEMG–Force Relationship and New Approaches for Applications. Clin. Biomech. 2009, 24, 225–235. [Google Scholar] [CrossRef]
Figure 1. Overall system architecture diagram.
Figure 1. Overall system architecture diagram.
Sensors 24 00734 g001
Figure 2. (a) The photo of the shoe-worn GaitUp Physilog® 5 wearable inertial sensor and (b) the interface of the GaitUp Lab gait analysis system.
Figure 2. (a) The photo of the shoe-worn GaitUp Physilog® 5 wearable inertial sensor and (b) the interface of the GaitUp Lab gait analysis system.
Sensors 24 00734 g002
Figure 3. A self-made multi-channel wireless sEMG measurement device.
Figure 3. A self-made multi-channel wireless sEMG measurement device.
Sensors 24 00734 g003
Figure 4. Four sEMG signals (with additive offsets in the vertical axis) during walking. The first row is the sEMG of the left thing, the second row is the sEMG of the left calf, the third row is the sEMG of the right thing, and the fourth row is the sEMG of the right calf.
Figure 4. Four sEMG signals (with additive offsets in the vertical axis) during walking. The first row is the sEMG of the left thing, the second row is the sEMG of the left calf, the third row is the sEMG of the right thing, and the fourth row is the sEMG of the right calf.
Sensors 24 00734 g004
Figure 5. Data processing flowchart.
Figure 5. Data processing flowchart.
Sensors 24 00734 g005
Figure 6. Data segmentation diagram.
Figure 6. Data segmentation diagram.
Sensors 24 00734 g006
Figure 7. (a) The electrodes are attached to the vastus lateralis muscle, (b) the electrodes are adhered to the gastrocnemius muscle, and (c) the GaitUp sensor is tied to the shoe tongue.
Figure 7. (a) The electrodes are attached to the vastus lateralis muscle, (b) the electrodes are adhered to the gastrocnemius muscle, and (c) the GaitUp sensor is tied to the shoe tongue.
Sensors 24 00734 g007
Figure 8. (a) The histograms of the “Cadence” parameter of the left and right feet and (b) the histograms of the “Stance” parameter of the left and right feet.
Figure 8. (a) The histograms of the “Cadence” parameter of the left and right feet and (b) the histograms of the “Stance” parameter of the left and right feet.
Sensors 24 00734 g008
Figure 9. (a) The histograms of the “DS” parameter of the left and right feet and (b) the histograms of the “Peak Swing” parameter of the left and right feet.
Figure 9. (a) The histograms of the “DS” parameter of the left and right feet and (b) the histograms of the “Peak Swing” parameter of the left and right feet.
Sensors 24 00734 g009
Figure 10. The histograms of the “Speed” parameter of the left and right feet.
Figure 10. The histograms of the “Speed” parameter of the left and right feet.
Sensors 24 00734 g010
Table 1. The categories and abbreviations of 23 gait parameters.
Table 1. The categories and abbreviations of 23 gait parameters.
TypeGait
Parameter
NameDescription
Temporal
parameter
HSHeel StrikeThe point is when the heel hits the ground.
CDCycle DurationThe time spent in one gait cycle is the time difference from the heel strike of the same foot to the next heel strike.
DLSDouble Leg SupportThe bipedal stance period is the proportion of the gait cycle when both feet are on the ground.
CadenceCadenceNumber of steps taken per minute.
StanceStance RatioThe stance period is the portion of the gait cycle during which the foot is in contact with the ground.
SRSwing RatioThe swing period is the proportion of the gait cycle spent with the foot off the ground.
LRLoad RatioThe proportion of the stance period from the time when the heel strikes to when the sole of the foot touches the ground.
FFRFoot Flat RatioThe proportion of the stance period in which the foot is completely flat on the ground.
PRPush RatioThe proportion of the stance period from when the soles of the feet are flat to when the toes are off the ground.
Spatial
parameters
Step LengthStep LengthThe distance between one foot and the other foot on the ground.
Strike LengthStride LengthDistance from one heel strike to the next, equating one gait cycle.
SpeedGait SpeedAverage forward walking speed.
PAVSMax. Angular Velocity During SwingMaximum angular velocity in swing between the heel’s maximum height and the toe’s minimum height.
SMTCFoot Speed Norm at Minimal Toe ClearanceMaximum swing forward speed, typically over 3 times the pace, occurs near the minimum toe-off height.
SAFoot Pitch Angle at Heel Strike (Strike Angle)The angle of the foot to the ground when it hits the ground.
LoAFoot Pitch Angle at Toe-Off (Lift-Off Angle)Angle of toes to ground at the end of propulsion before lift-off.
SWSwing WidthThe maximum lateral offset in the swing period equals the greatest lateral distance in the trajectory.
Path Length3D Path LengthDescribes a 3D gait cycle trajectory scaled by stride length.
Turn analysisTATurning AngleDeflection angle for two consecutive flat steps of the same foot.
Ground
clearance
analysis
maxHCMax. Heel ClearanceHeel reaches peak height off ground in each gait cycle.
maxTC1Max.1 Toe ClearanceAfter the heel’s peak height, the toes reach maximum height off the ground.
minTCMin. Toe ClearanceMinimum toe-off height during swing.
maxTC2Max.2 Toe ClearanceToes reach maximum height off the ground before heels are ready to strike.
Table 2. The range, mean, and standard deviation of the two-foot gait parameters.
Table 2. The range, mean, and standard deviation of the two-foot gait parameters.
ParameterUnitMINMAXAVESTD
HS_LMillisecond9.0315.8314.760.71
HS_RMillisecond9.3015.8614.770.69
CD_LSecond0.851.171.020.06
CD_RSecond0.861.171.020.06
DS_L% of cycle duration8.0526.9620.353.25
DS_R% of cycle duration8.0526.9620.353.25
Cadence_Lsteps/minute103.00141.18117.506.67
Cadence_Rsteps/minute102.56140.35117.486.67
Stance_L% of cycle duration53.4664.0460.281.90
Stance_R% of cycle duration54.0765.3060.221.73
SR_L% of gait cycle35.9646.5439.721.90
SR_R% of gait cycle34.7045.9339.781.73
LR_L% of stance8.0020.8012.742.53
LR_R% of stance7.8121.4912.882.60
FFR_L% of stance33.7756.0347.324.67
FFR_R% of stance28.7057.8546.455.26
PR_L% of stance31.7653.5539.854.22
PR_R% of stance30.6854.0840.504.90
Step Length_LMeter0.530.740.640.04
Step Length_RMeter0.490.720.620.04
Strike Length_LMeter1.011.431.250.07
Strike Length_RMeter1.011.431.240.07
Speed_Lmeter/s1.181.281.230.02
Speed_Rmeter/s1.171.271.220.02
PAVS_Ldegree/s327.96546.44412.2039.17
PAVS_Rdegree/s334.39846.93408.6933.76
SMTC_Lmeter/s3.404.133.800.15
SMTC_Rmeter/s3.204.083.740.13
SA_LDegree10.3534.3323.024.79
SA_RDegree9.3133.6723.244.56
LoA_LDegree−97.11−63.32−81.297.11
LoA_RDegree−93.12−66.14−81.916.56
SW_LMeter0.010.080.040.02
SW_RMeter−0.09−0.01−0.050.01
Path Length_L% of stride length102.75107.92105.271.09
Path Length_R% of stride length102.78109.14105.341.04
TA_LDegree−1.611.700.210.50
TA_RDegree−2.441.14−0.540.53
maxHC_LMeter0.220.360.260.02
maxHC_RMeter0.220.330.250.02
maxTC1_LMeter0.040.170.060.02
maxTC1_RMeter0.030.140.060.01
minTC_LMeter0.000.050.030.01
minTC_RMeter0.000.060.030.01
maxTC2_LMeter0.070.180.120.02
maxTC2_RMeter0.060.180.120.02
Table 3. Pearson correlation coefficients of gait parameters estimated by the XGBoost model.
Table 3. Pearson correlation coefficients of gait parameters estimated by the XGBoost model.
Gait ParameterLeft SideRight Side
HS0.8010.854
* CD0.8970.899
* DLS0.9310.893
* Cadence0.9130.909
Stance0.9170.780
SR0.9260.808
* LR0.8770.862
* FFR0.9170.906
* PR0.9070.907
* Step length0.8620.836
* Strike length0.8880.883
Speed0.7760.798
PAVS0.8830.793
* SMTC0.9190.913
SA0.8810.941
* LoA0.9320.913
* SW0.9030.874
* Path length0.9040.898
TA0.7480.723
maxHC0.908 0.809
maxTC10.918 0.702
* minTC0.8510.812
* maxTC20.8840.881
* The PCCs of both the left and right feet are greater than 0.8, and their difference is not greater than 0.05.
Table 4. Feature importance score.
Table 4. Feature importance score.
FeatureRaw_sEMGDWTH_sEMG
ThighCalfThighCalf
MF0.080.170.070.05
MDF0.110.070.020.03
STD0.060.130.250.21
SampEn0.110.140.090.23
RMS0000
AP0000
Table 5. Pearson correlation coefficients of 14 gait parameters for left and right feet estimated by DT, RF, and XGBoost models.
Table 5. Pearson correlation coefficients of 14 gait parameters for left and right feet estimated by DT, RF, and XGBoost models.
NameLeft FootRight Foot
DTRFXGBoostDTRFXGBoost
CD0.840.930.900.810.930.90
DLS0.830.930.930.710.910.89
Cadence0.840.930.910.800.930.91
LR0.750.900.880.750.880.86
FFR0.810.940.920.820.930.91
PR0.780.920.910.840.930.91
Step length0.800.910.860.760.910.84
Strike length0.810.920.890.780.920.88
SMTC0.830.940.920.810.930.91
SA0.840.930.930.770.930.91
SW0.820.930.900.760.910.87
Path length0.780.910.900.790.930.90
minTC0.740.880.850.740.880.81
maxTC20.820.920.880.700.910.88
Mean0.8060.9200.8990.7740.9160.884
STD0.0330.0160.0300.0400.0180.030
Table 6. Optuna’s Setting Values for hyperparameters of DT, RF, and XGBoost models.
Table 6. Optuna’s Setting Values for hyperparameters of DT, RF, and XGBoost models.
ModelHyperparameterRangeValue
DTmax_depth10~50010
min_simples_split1~322
min_samples_leaf1~101
RFn_estimators100~50050
max_featuressqrt|log2sqrt
max_depth5~1515
min_simples_split1~322
min_samples_leaf1~101
XGBoostn_estimators500~100200
learning_rate0.1~1.00.1
max_depth5~1513
subsample0.1~1.00.8
min_chilld_weight0~30030
Table 7. Five-fold cross-validation of RF and XGBoost models.
Table 7. Five-fold cross-validation of RF and XGBoost models.
ModelsMeanSTD
DT0.7960.017
RF0.9150.006
XGBoost0.8940.007
Table 8. Training and predicting times of DT, RF, and XGBoost.
Table 8. Training and predicting times of DT, RF, and XGBoost.
ModelsTraining Time (s)Predicting Time (s)
DT0.02≥0.01
RF0.34≥0.01
XGBoost2.100.02
Table 9. The memory space occupied by the DT, RF, and XGBoost models.
Table 9. The memory space occupied by the DT, RF, and XGBoost models.
Model Size (KB)
DT56
RF11,045
XGBoost 5805
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Liu, S.-H.; Ting, C.-E.; Wang, J.-J.; Chang, C.-J.; Chen, W.; Sharma, A.K. Estimation of Gait Parameters for Adults with Surface Electromyogram Based on Machine Learning Models. Sensors 2024, 24, 734. https://doi.org/10.3390/s24030734

AMA Style

Liu S-H, Ting C-E, Wang J-J, Chang C-J, Chen W, Sharma AK. Estimation of Gait Parameters for Adults with Surface Electromyogram Based on Machine Learning Models. Sensors. 2024; 24(3):734. https://doi.org/10.3390/s24030734

Chicago/Turabian Style

Liu, Shing-Hong, Chi-En Ting, Jia-Jung Wang, Chun-Ju Chang, Wenxi Chen, and Alok Kumar Sharma. 2024. "Estimation of Gait Parameters for Adults with Surface Electromyogram Based on Machine Learning Models" Sensors 24, no. 3: 734. https://doi.org/10.3390/s24030734

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop