1. Introduction
The human lower limb is crucial to performing various daily movement tasks including locomotion. Aging or illness can impose physical and functional constraints on lower limb mobility [
1]. Additionally, repetitive tasks or those involving heavy loads, such as lifting or carrying objects, can strain the lower limbs, and result in fatigue and injuries [
2]. To overcome mobility limitations and mitigate muscle fatigue, powered wearable assistive technologies, such as lower limb exoskeletons, have been developed [
3,
4]. Previous work exploring various exoskeleton hardware and control concepts [
4,
5] clearly supports personalized assistance strategies in order to maximize benefits for users [
6]. For example, Quinlivan et al. employed a single control strategy for all users, which led to inconsistent metabolic cost reductions, ranging from 15% to 36% during walking [
7].
To personalize assistance, human-in-the-loop (HITL) optimization has shown considerable promise [
8,
9]. HITL optimization relies on both a clear optimization objective, such as minimizing walking effort, and a real-time feedback metric that is related to the objective. Thus far, metabolic cost has served as the predominant feedback metric to reduce movement effort. However, while studies have shown significant reductions in walking effort using metabolic cost, this approach is hindered by prolonged optimization periods and its impracticality for everyday scenarios, primarily due to the necessity of wearing a face mask for data acquisition. About two minutes are used for every iteration for an ankle exoskeleton [
9], a hip exosuit [
10], single and multi-joint exoskeleton configurations [
11], a hip exoskeleton [
12] and a prosthetic foot with stiffness adaptation [
13] during walking, as well as an ankle exoskeleton in running [
14]. Most of these approaches used in addition a first-order dynamical model to estimate the metabolic cost within this time window [
15].
Considering these limitations, an alternative feedback metric that provides a measure of effort is necessary. One promising candidate is lower limb muscle activity measured via surface electromyography (EMG). While EMG does not measure effort directly, changes in EMG amplitude have been widely used as a practical surrogate for relative changes in muscular demand during locomotion. Further, EMG signals have been increasingly used as real-time feedback in human-in-the-loop (HIL) optimization of lower-limb exoskeletons [
12,
16,
17]. In such closed-loop systems, EMG signals from key lower limb muscles are continuously measured and processed to quantify muscle effort. These measurements are used to construct a cost function, which is then minimized by an optimization algorithm that iteratively adjusts exoskeleton assistance parameters. In this way, the EMG signals directly inform the control system about the human physiological response, completing a closed loop from human effort to device adjustment.
However, the use of EMG as a feedback signal is not without challenges. Due to slow drift, EMG amplitude may change over time independently of movement effort. The drift can arise from multiple physiological and measurement-related factors, including sensor–skin motion early in an experiment [
18], sweat accumulation altering electrode impedance [
19,
20], and increases in skin and muscle temperature during warmup [
21,
22,
23]. Fatigue has also been proposed as a contributor, although its influence on EMG amplitude remains inconsistent and task dependent [
24,
25,
26]. Still, previous research has demonstrated the utility of EMG in HITL optimization and has shown notable benefits and quicker optimization compared to methods that rely on metabolic cost [
9,
10,
11,
13,
14]. Specifically, Han et al. [
16], Xu et al. [
17], and Zhao et al. [
12] demonstrated that using EMG as feedback substantially accelerates optimization, enhancing efficiency. With only 25–60 s allocated per exploration phase, the number of testable parameter sets per hour can be up to four times higher compared to approaches relying on metabolic cost as the feedback metric. Additionally, Grimmer et al. [
27] anticipate further improvements by leveraging combinations of EMG data from multiple strides and various muscles across both limbs. By averaging over a larger dataset, this approach mitigates one of the common challenges associated with EMG, the signal variability, thus paving the way for even more robust optimization processes. In addition to reducing short-term signal variability, EMG-based approaches can also remain effective in the presence of slow signal drift. In earlier work [
27], we showed that although EMG amplitude can exhibit substantial drift over extended walking durations, relative comparisons between temporally adjacent conditions remain robust, enabling identification of changes in movement effort.
In Grimmer et al. [
27], the mean average value of rectified EMG, also known as mean absolute value (MAV), served as the primary EMG feature for analysis. Han et al. [
16], Xu et al. [
17] and Zhao et al. [
12] used the root mean square (RMS). However, it is important to acknowledge the vast array of time-, frequency-, and spatial-domain features, as well as their combinations, that are available for EMG analysis [
28,
29,
30,
31]. According to a review by Toledo et al. [
31], mean absolute value (MAV), slope sign change (SSC), zero-crossing (ZC), and wavelength (WL) emerge as the most prevalent EMG features. These features are typically employed in pattern recognition to classify upper- and lower-limb movements for robotic control. In such applications, the features are not required to scale monotonically with muscle activity or walking effort, as the primary objective is movement classification rather than effort minimization. Consequently, changes in EMG feature amplitude may reflect altered coordination strategies or compensatory movements induced by external assistance, rather than a reduction in metabolic effort. This distinction is critical for human-in-the-loop optimization, where the feedback signal must reliably track effort-related changes.
Despite growing interest in EMG-based HITL optimization, it remains largely unexplored which EMG features are best suited to robustly reflect changes in muscle effort across different walking conditions despite inherent EMG signal variability.
The aim of this study is to systematically evaluate and compare a broad set of commonly used EMG features with respect to their ability to detect effort-related changes during walking, using two complementary experimental paradigms: a controlled load manipulation to elicit graded changes in muscle effort, and an exoskeleton-assisted walking experiment that reflects a realistic human-in-the-loop optimization scenario.
In the first experiment (Load), muscle effort was manipulated by having participants walk under an unloaded condition and three loaded conditions, during which they carried dumbbells totaling 2, 4, and 8 kg [
27]. EMG features were evaluated based on their ability to detect changes in walking effort during transitions between loaded and unloaded conditions and to differentiate between the three load levels. Particular emphasis was placed on the most challenging 2 kg condition, which represents the smallest change in external load.
We hypothesize that the effectiveness of EMG-based features in distinguishing varying levels of muscle effort differs substantially across features. This effectiveness is quantified using the detection rate, defined as the percentage of observed changes in EMG that align with the expected direction of effort modulation (e.g., increased effort from unloaded to loaded walking). Furthermore, we propose that a key determinant of feature performance is the noise-to-signal-change ratio, defined as the ratio of intra-participant variability to the absolute magnitude of the effort-related signal change.
The second experiment (Exo) examined walking with a hip exoskeleton [
12,
32,
33], where walking assistance was provided under ten different conditions, each featuring varying assistance timings. Additionally, an unassisted condition was included as a baseline for comparison. This exoskeleton experiment offers a valuable opportunity to validate the results concerning feature performance of the Load experiment. This validation specifically pertains to detection rates of changes in walking effort when comparing conditions with and without assistance, as well as the noise-to-signal ratios in a context that closely resembles real-world applications.
2. Materials and Methods
For the analysis of EMG-based features intended to reflect changes in walking effort, experimental conditions must induce systematic variations in effort. Previous studies have shown that walking effort decreases with reduced body weight [
34] and increases with additional carried load [
35]. These load-induced changes affect not only whole-body metabolic cost but also the activation of individual lower-limb muscles, which can be observed in corresponding EMG signals [
36]. Consequently, experimental paradigms that manipulate external load are well suited for evaluating EMG-based indicators of walking effort.
Many load-modulation experiments apply additional weight using backpacks or weights attached to the lower limbs. For the present study, however, a setup was required that allowed rapid and repeated changes in load within a single experimental session. Miller and Stamford [
37] demonstrated that carrying additional weight in the hands increases oxygen consumption, and thus walking effort, by approximately 1.3% per 100 g added. Based on this finding, a previously published study on EMG-based human-in-the-loop optimization [
27] employed an experimental design in which participants walked on a treadmill and, guided by a metronome, repeatedly picked up or released a pair of dumbbells. This dataset is particularly well suited for the present analysis, as it includes three distinct load levels of 2 kg, 4 kg, and 8 kg. This allows assessment of whether EMG features systematically increase with increasing load, which is a key requirement for their use in HITL optimization.
Nevertheless, this load-carrying experiment (Load experiment) does not fully capture the EMG adaptations that occur during walking with a lower-limb exoskeleton. To address this limitation, we additionally included a dataset of walking with a hip exoskeleton (Exo experiment) using different assistance timings, which was originally recorded in preparation for a separate HITL optimization study [
12]. In this dataset, both lower-limb EMG and metabolic cost of walking were measured. For our analysis, we selected all assistance conditions that resulted on average in a reduction in metabolic cost, assuming that for most muscles this would be accompanied by a corresponding reduction in EMG activity.
2.1. Participant Information
The Load experiment was originally conducted as part of a previous study [
27] and included 13 participants (27 ± 5 years, 1.82 ± 0.07 m, 81 ± 11 kg). The Exo experiment was likewise performed in a prior study [
12] and involved 11 participants (21 ± 1 years, 1.75 ± 0.08 m, 71.7 ± 8 kg). Both study protocols were approved by the institutional review board of the Technical University of Darmstadt, Germany. All participants provided written informed consent in accordance with the Declaration of Helsinki and reported no gait-related impairments.
2.2. Experimental Setup
In the Load experiment, EMG data was collected from several lower limb muscles for each leg, including the rectus femoris (RF), gluteus maximus (GM), vastus lateralis (VL), tibialis anterior (TA), gastrocnemius lateralis (GAS), and soleus (SOL). In the Exo experiment, EMG data was recorded from the rectus femoris (RF), gluteus maximus (GM), biceps femoris (BF), gastrocnemius lateralis (GAS), and soleus (SOL). Both studies utilized a wireless EMG system (Trigno Avanti, Delsys, Natick, MA, USA). For the Load experiment, the entire acquisition process (1926 Hz) was conducted using this hardware. However, in the Exo experiment, the EMG system was only used for measurement, with data storage synchronized with the exoskeleton data at 1000 Hz on a real-time off-board computer.
EMG electrode placement followed the recommendations of SENIAM (seniam.org). Before sensor placement, the skin was prepared by removing hair and cleaning the area with alcohol. To minimize the risk of sensor displacement due to movement or sweating, sensors were further secured with adhesive non-woven fabric tape (Rudavlies). Each EMG sensor also included a 3D gyroscope (148 Hz). In the Load experiment, the gyroscope data of the EMG sensors attached to both the muscles and vinyl-coated dumbbells were used to identify individual strides and to differentiate between loaded and unloaded walking, respectively. Both studies were conducted on an instrumented treadmill (ADAL-WR, HEF Tecmachine, Andrezieux-Boutheon, France).
In the Exo experiment, the hip exoskeleton GuroX (3.1 kg) was controlled in real time using an off-board computer running Matlab Simulink xPC (Matlab R2020a, Mathworks, Natick, MA, USA) at 1000 Hz [
12,
33]. The exoskeleton’s extension and flexion torques were governed by a sinusoidal curve with one full cycle corresponding to a single stride. The peak torque amplitudes were set to 10 Nm for extension and 8 Nm for flexion. The peak flexion torque occurred with a fixed delay of 50% of the stride duration following the peak extension torque. The peak extension torque timing varied from 0% to 45% of the stride time in 5% increments, resulting in a total of 10 timing conditions. During the experiments, the peak extension torque timings were randomly assigned. Heel strikes were detected using a vertical ground reaction force (GRF) threshold, and the timing of two consecutive heel strikes of the same limb was used to calculate stride time. The average stride time from the last five strides combined with the elapsed time since the most recent heel strike was used to assess the current stride progress. This information was then used to regulate the exoskeleton’s torque amplitude throughout each stride.
2.3. Experimental Protocol
The Load experiment consisted of four walking sessions at 1.3 m/s, each lasting 14 min. After performing the first session without load for warmup, three loaded sessions were performed in a balanced and randomized order where each session required participants to carry two vinyl-coated dumbbells with a total mass of 2, 4, or 8 kg shared equally across both hands. Within each loaded session and at every 40 s, participants were instructed by an acoustic metronome to either pick up or release the pair of dumbbells thus resulting in 11 intervals without loads and 10 intervals with loads over the course of 14 min.
The Exo experiment began with a three-minute standing session used to determine net metabolic cost. Subsequently, a five-minute walking phase without exoskeleton assistance served as a warmup and familiarization period. Then, a four-minute walking session was conducted to gauge metabolic cost without exoskeleton assistance. Ten four-minute walking sessions were then performed with exoskeleton assistance where peak extension and flexion torque timings were varied in a randomized order. All treadmill walking sessions were performed at 1.25 m/s. For all sessions including standing and warmup, participants were wearing the exoskeleton.
2.4. Data Processing
Data from both experiments were exported to Matlab (Mathworks, Natick, MA, USA) for further processing.
2.4.1. Stride Identification
In the Load experiment, strides were identified using the zero-crossing of the shank’s angular velocity from the gyroscope data, which occurs near the heel strike, as described in [
27]. For the Exo experiment, strides were also identified based on heel strike timing; however, this was achieved using vertical GRFs recorded by two force plates from the treadmill, each covering one leg (left and right). To reduce noise, a zero-lag second-order Butterworth filter with a cutoff frequency of 40 Hz was applied to the GRF data. Force thresholds were then used to detect heel strikes and identify individual strides for each leg [
38].
Stride times were calculated from consecutive heel strike timings from the same limb. In both experiments, for each participant and condition, a total of 40 consecutive strides, equally divided between both limbs, were selected for stride time and EMG analyses. These 40 strides excluded the ten strides immediately prior to and following any change in condition.
2.4.2. Metabolic Cost of Exoskeleton Study
In the Exo experiment, metabolic power was calculated from oxygen consumption and carbon dioxide production using a modified Brockway equation [
39]. Metabolic cost was determined for a standing condition (including three minutes of data), a single unassisted walking condition (four minutes), and all assisted walking conditions (four minutes each) using the dynamic model described in [
9]. To obtain the net metabolic cost, the average metabolic power from the standing condition was subtracted from each walking condition. Relative changes in net metabolic power between each assisted condition and the unassisted walking condition were then computed following the approach outlined in [
40].
2.4.3. EMG-Basic Processing
To eliminate potential EMG offsets, the mean EMG signal was subtracted from the data of each sensor. Subsequently, the EMG data was bandpass filtered using a zero-lag fourth-order Butterworth filter with a frequency range of 40–450 Hz. The 40 Hz cutoff, which is twice the typical value, proved highly effective in reducing EMG drift during prolonged experimental sessions [
27]. For calculating the average EMG across a stride, the EMG data was additionally rectified and filtered using a zero-lag second-order Butterworth filter with a 6 Hz cutoff frequency.
Three strategies were employed to remove strides with EMG measurement artifacts [
27]. First, phases with significant and prolonged changes in the EMG signal, such as those caused by a loose sensor, were manually excluded. Second, any distinct EMG peaks with amplitudes exceeding three times the average signal amplitude (based on mean peak height across the session), along with the 1 s periods before and after these peaks, were removed. Third, strides with artifacts that exhibited similar EMG amplitudes to expected signals but differed in duration or frequency were identified using the mean absolute value (MAV) of each stride under the same condition. Strides were excluded if their MAV was more than three times greater or less than 0.4 times the average MAV. In total, 10% of the 17,600 analyzed strides in the Exo experiment (11 participants, 7 assisted and 1 unassisted session, 2 limbs, 5 muscles, 20 strides) and 6% of the 374,400 analyzed strides in the Load experiment (13 participants, 3 loaded and 3 unloaded phases, 2 limbs, 6 muscles, 20 strides, 10 transitions, 2 directions of transitions) were excluded.
2.5. EMG Features
The computed EMG features (
Table 1) were primarily based on those provided in [
29,
30]. Both studies listed EMG features commonly used in pattern recognition and motor control applications, as they capture amplitude, frequency, and time–frequency characteristics of the signal, providing a comprehensive representation of muscle activity. In our analysis, we further organized these features by domain for clarity, including a broad range spanning amplitude-, model-, energy-, complexity-, variation-, mean-, and spectral-based features. Features were selected to provide comprehensive coverage of commonly used EMG representations while remaining applicable to cyclic locomotion tasks and suitable for effort-based human-in-the-loop optimization, where features must reflect changes across the entire stride rather than isolated portions of the movement, ensuring that activation changes induced by the exoskeleton are captured throughout the cycle. Equations and Matlab code can be found in the
Supplementary Information.
We excluded features that emphasize only specific portions of the movement cycle because they focus on a limited time window (e.g., 25% to 75% of the movement). Examples of these features includes the Absolute Value of Summation in a given window and its Mean (ASM) [
41] and the Modified Mean Absolute Value (MMAV, MMAV2) [
28,
42]. Similarly, features like the Enhanced Wavelength (EWL) and Enhanced Mean Absolute Value (EMAV) were excluded as they disproportionately increase the contribution of data from 20% to 80% of the movement cycle to the overall feature value [
30]. The Multiple Hamming Window (MHW) and Mean Absolute Value Slope (MAVSLP) features were also omitted due to their bias toward specific times within the movement cycle [
28]. While these excluded features may be advantageous for pattern recognition by minimizing data with a high noise-to-signal ratio at the beginning and end of non-cyclic movements, they are problematic in repetitive cyclic movements like walking where phases of high muscular activity can occur throughout the entire movement cycle. In particular, changes in activation due to an exoskeleton that can occur in every phase of a stride will not be reflected.
Table 1.
List of EMG-based features including identification number (ID), full name, abbreviation, domain, group, and source. Further details of each feature, including any parameters, the equation (
Appendix A) and Matlab code, can be found in the
Supplementary Information.
Table 1.
List of EMG-based features including identification number (ID), full name, abbreviation, domain, group, and source. Further details of each feature, including any parameters, the equation (
Appendix A) and Matlab code, can be found in the
Supplementary Information.
| ID | Feature | Abbreviation | Domain | Group | Source |
|---|
| 1 | Average Amplitude Change | AAC | T | Amplitude | [28] |
| 2 | Auto-Regressive Model/Coefficients | AR | T | Model | [43,44,45] |
| 3 | Absolute Value of Summation of Square Root | ASS | T | Energy | [41] |
| 4 | Cardinality | CARD | T | Complex | [46] |
| 5 | Cepstral Coefficients | CC | T | Model | [28,44,47] |
| 6 | Coefficient of Variation | COV | T | Variation | [48] |
| 7 | Difference Absolute Mean Value | DAMV | T | Amplitude | [49] |
| 8 | Difference Absolute Standard Deviation Value | DASDV | T | Variation | [28,49] |
| 9 | Difference Variance Value | DVARV | T | Variation | [50] |
| 10 | Energy | EN | T | Energy | [51] |
| 11 | Frequency Ratio | FR | F | Spectral | [28,52] |
| 12 | Integrated EMG | IEMG | T | Amplitude | [28] |
| 13 | Interquartile Range | IQR | T | Variation | [53] |
| 14 | Kurtosis | KURT | T | Variation | [54] |
| 15 | Log Coefficient of Variation | LCOV | T | Variation | [55] |
| 16 | Log Detector | LD | T | Energy | [28] |
| 17 | Log Difference Absolute Mean Value | LDAMV | T | Amplitude | [50] |
| 18 | Log Difference Absolute Standard Deviation | LDASD | T | Variation | [50] |
| 19 | Log Teager Kaiser Energy Operator | LTKEO | T | Amplitude | [55] |
| 20 | Mean Absolute Deviation | MAD | T | Mean | [53] |
| 21 | Mean Absolute Value | MAV | T | Mean | [28,47,51,56,57] |
| 22 | Maximum Amplitude | MAX | T | Amplitude | [57] |
| 23 | Median Frequency | MDF | F | Spectral | [28,58] |
| 24 | Median Absolute Value | MED | T | Mean | by authors |
| 25 | Maximum Fractal Length | MFL | T | Complex | [59,60] |
| 26 | Modified Median Frequency | MMDF | F | Spectral | [61] |
| 27 | Modified Mean Frequency | MMNF | F | Spectral | [61] |
| 28 | Mean Energy | MnE | T | Mean | [62,63] |
| 29 | Mean Frequency | MNF | F | Mean | [28,58] |
| 30 | Mean Power | MNP | F | Mean | [28] |
| 31 | Mean Value of Square Root | MSR | T | Mean | [41] |
| 32 | Myopulse Percentage Rate | MYOP | T | Amplitude | [64,65] |
| 33 | Peak Frequency | PKF | F | Spectral | [28] |
| 34 | Power Spectrum Ratio | PSR | F | Energy | [28,66] |
| 35 | Root Mean Square | RMS | T | Mean | [28,49,56,57] |
| 36 | Standard Deviation | SD | T | Variation | [51,56,57] |
| 37 | Sample Entropy | SE | T | Complex | [67,68] |
| 38 | Skewness | SKEW | T | Variation | [69] |
| 39 | Spectral Moment 1 | SM1 | F | Spectral | [28] |
| 40 | Spectral Moment 2 | SM2 | F | Spectral | [28] |
| 41 | Spectral Moment 3 | SM3 | F | Spectral | [28] |
| 42 | Slope Sign Change | SSC | T | Complex | [28,51,56,70] |
| 43 | Temporal Moment | TM | T | Complex | [71] |
| 44 | Total Power | TTP | F | Energy | [28] |
| 45 | Threshold Zero Crossing | TZC | T | Complex | [72] |
| 46 | Variance | VAR | T | Variation | [28,47,56] |
| 47 | V-Order | VO | T | Complex | [28,47] |
| 48 | Willison Amplitude | WA | T | Amplitude | [28,47,56] |
| 49 | Waveform Length | WL | T | Amplitude | [28,47,51,70] |
| 50 | Zero Crossing | ZC | T | Complex | [28,47,51,64,70] |
We also excluded features referenced by multiple names where the equation or numeric result was that same as that provided by other features. The Energy (EN) [
51] feature was selected over the equal Simple Square Integral (SSI) [
28]. Furthermore, based on [
28], Total Power (TTP) and the Zero Spectral Moment (SM0) result in a similar outcome, and only TTP was analyzed.
The features listed in
Table 1 can be organized into subgroups based on their functional roles and performance in specific tasks. Amplitude-based features (WA, AAC, IEMG, WL, DAMV, LTKEO, MYOP, LDAMV, MAX) capture the overall magnitude or strength of the EMG. Mean-based features (MAV, MAD, MED, MNP, MNF, MnE, MSR, RMS) describe the central tendency of the signal also capturing variable muscle activity. Variation-based features (SD, COV, LCOV, DASDV, DVARV, LDASD, KURT, IQR, SKEW, VAR) quantify changes and variability in muscle activity making them suitable for analyzing dynamic or inconsistent movement patterns. Spectral features (SM1, SM2, SM3, MDF, MMDF, MMNF, PKF, FR) represent higher-order frequency-domain properties of the signal and are valuable for tasks with rhythmic components or complex frequency content. Energy-based features (TTP, LD, EN, PSR, ASS) quantify the overall signal power and are useful for assessing the intensity of muscle activation. Complex features (SE, SSC, ZC, TZC, TM, VO, CARD, MFL) reflect advanced properties of the signal, such as nonlinearity or temporal patterns, which may offer unique insights into specific tasks. Model-based features (AR, CC) represent the underlying dynamics of the signal and may be beneficial in tasks requiring an understanding of signal behavior over time.
Feature processing methods, including adaptations, are detailed in the
Supplementary Information along with their corresponding equations (
Appendix A). Below, we provide two general guidelines on feature usage.
First, several features require threshold values for their determination. Although some studies offer recommendations for these thresholds, using them is critical, as thresholds depend on individual participants and muscles. EMG amplitudes can vary due to factors like tissue thickness between the muscle and sensor, skin temperature, and moisture levels. Therefore, we opted to use normalized signal thresholds based on each participant’s EMG sensor data amplitudes. To achieve this, we calculated the median of the rectified EMG for each stride in both the unloaded condition and the condition without exoskeleton assistance. These medians were then averaged across all strides to obtain a basic threshold. Using this basic threshold, thresholds for individual features were manually defined based on visual inspection. For the features WA and MYOP, the basic threshold was used directly. For Zero Crossing (ZC) and Slope Sign Change (SSC), the basic threshold was divided by 10, and for Cardinality (CARD), it was divided by 100.
Second, unlike all other features that produced reasonable outcomes, Maximum Fractal Length (MFL), Log Difference Absolute Standard Deviation (LDASD), Log Teager-Kaiser Energy Operator (LTKEO), and Skewness (SKEW) showed decreasing feature values when transitioning from unloaded to loaded conditions, which is contrary to our expectations. This led to a decrease in detection rate values as the load increased from 2 kg to 8 kg. To address this, for these features, the detection rate was mirrored around the 50% axis.
2.5.1. EMG Outcome Measures and Averaging
EMG features were calculated for each stride in both experiments. We then extracted feature values from specific sets of strides to compute averages for different conditions: loaded vs. unloaded, and with vs. without exoskeleton assistance. These averages were calculated separately for each leg, muscle, participant, and load or assistance condition.
For the Load experiment, feature values from 20 consecutive strides of each leg were extracted from each interval as outlined in [
27]. In the Exo experiment, values for 20 consecutive strides were similarly extracted from the beginning of each of the conditions. To account for adaptation to a new condition, values from the first ten strides of each condition preceding these 20 strides were excluded. The average of each 20-stride set was then calculated.
Data were normalized by scaling each feature average for the unloaded conditions in the Load experiment and for the without exoskeleton assistance condition in the Exo experiment to 100%, and all other conditions were expressed relative to this baseline. The intra-participant standard deviation was then calculated for each normalized 20-stride set. Signal-change was determined by subtracting the respective averages of unloaded from loaded conditions and with-assistance from without-assistance conditions. To avoid infinite values when calculating the intra-participant standard deviation to signal-change ratio, signal-change was set to NaN in cases where it was zero. This occurred in 85 out of 468,000 conditions (13 participants, 3 load sessions, 2 limbs, 6 muscles, 20 transitions, 50 features) in the Load experiment and 4 out of 25,410 conditions (11 participants, 7 assistance sessions, 2 limbs, 5 muscles, 1 comparison, 33 features) in the Exo experiment.
Following this, for the Load experiment, the individual participant averages of the 10 transitions for the standard deviation and signal-change were determined. Combined limb averages for standard deviation and signal-change were then calculated for both experiments. Additionally, the detection rate was calculated to indicate the percentage of feature values that changed in the expected direction across all analyzed transitions. For instance, a detection rate of 80% means that 80% of the analyzed changes aligned with expectations: an increase for unloaded to loaded, a decrease for loaded to unloaded, and a decrease for unassisted to assisted conditions. Equal feature outcomes when comparing conditions were considered failed detections, as they did not align with expectations. Eight cases occurred for the Load and one for the Exo experiment. In total, for the Load experiment, 10 loaded-to-unloaded transitions and 10 unloaded-to-loaded transitions were included for each participant and load. Subsequently, participant averages were determined for the signal-change during loaded-to-unloaded and unloaded-to-loaded transitions; participant averages were also determined for the intra-participant standard deviation at each load for the loaded condition. Finally, group averages for each load condition were determined based on the 13 participants for these variables.
The method for determining variables and their group averages in the Exo experiment was comparable. However, when determining the detection rate, multiple assisted conditions were compared to a single unassisted condition for each participant. Based on the metabolic cost reductions (
Figure 1), we selected conditions with peak extension torque timings between 10% and 40% as these seven individual samples presumed to reduce muscle activity. These conditions showed the greatest reductions in metabolic cost and were evenly distributed around the maximum metabolic cost reduction of −12.5 ± 4.7%, which occurred with a peak extension torque timing of 25% and a peak flexion torque timing of 75%.
The absolute value of the feature signal-change was used to determine the intra-participant standard deviation to absolute signal-change ratio. This approach was chosen because individual strides, muscles and participants could all induce both positive and negative signal-change values, potentially resulting in a near-zero average signal-change and thus extreme ratio values. Since the amplitude of change was our primary criterion, using the absolute value was deemed appropriate.
2.5.2. Statistics
To compare the performance of individual EMG features, each feature was statistically evaluated against the overall best-performing reference feature at the participant level. In the Load experiment (2 kg condition), Waveform Length (WL) served as the reference feature, whereas in the Exo experiment, Log Difference Absolute Mean Value (LDAMV) was used. For each participant, feature values were first averaged across strides, muscles, limbs, and condition, yielding a single value per feature and participant.
Paired comparisons between each feature and the reference feature were performed using the Wilcoxon signed-rank test, which does not assume normality, to assess whether the distribution of participant-level feature values differed significantly from that of the reference feature.
Because multiple features were tested simultaneously, raw p-values were corrected to account for multiple comparisons within each study. All valid raw p-values for a given experiment were treated as a single family of tests and adjusted using the Benjamini–Hochberg false discovery rate (FDR) procedure. This approach controls the expected proportion of false positives among the tested features within each experiment. Features with FDR-adjusted p-values below 0.05 were considered statistically significant.
3. Results
For the Load experiment participants had similar stride times (about 1.1 ± 0.05 s) for the loaded and unloaded walking conditions when carrying 2, 4 and 8 kg. In contrast, in the Exo experiment, depending on the peak torque timing, the stride times varied between 1.05 ± 0.04 s and 1.11 ± 0.06 s with the without assistance condition lasting 1.07 ± 0.04 s (
Table 2).
For the Load experiment, as expected, the normalized EMG of all analyzed muscles increased as the carried load increased (
Figure 2). In contrast, in the Exo experiment, the normalized EMG primarily decreased for the GAS and SOL, whereas both increases and decreases were seen for the RF, GM, and BF depending on the peak extension torque timing (
Figure 3). The largest EMG fluctuations were observed at the early (0% and 5%) and late (45%) peak extension torque timings. These timings were excluded from further analyses, as they consistently showed either increases or no changes in metabolic cost on average. Consequently, they were unsuitable for evaluating the relationship between EMG reduction and corresponding decreases in metabolic cost.
Upon analyzing the data from the Load experiment, it became evident that a total of 17 EMG features, representing both time and frequency domains, failed to effectively differentiate between loaded and unloaded conditions (
Figure 4). Their detection rates hovered around 50%, akin to random chance. Furthermore, none of these features, nor the signal entropy (SE), exhibited a consistent pattern in detection rates relative to the total carried mass. There was no discernible trend indicating higher detection rates for larger changes in mass as was found for all other features. Therefore, these 17 EMG features were excluded from further analyses.
The greatest detection rates at the most challenging condition for detecting a change between carrying no weight and carrying 2 kg was found for the Waveform Length (WL, 77%). This result was significantly larger compared to all other features (
Table 3) but the Absolute Value of Summation of Square Root (ASS), the Sample Entropy (SE) and the Willison Amplitude (WA). On average, for the qualified features, the 2 kg, 4 kg, and 8 kg conditions had detection rates of 70 ± 4%, 82 ± 5%, and 92 ± 5%, respectively.
For the Exo experiment, the greatest average detection rates were found for the Log Difference Absolute Mean Value (LDAMV) at 71.1% closely followed by Root Mean Square (RMS), Standard Deviation (SD), Mean Absolute Deviation (MAD) and Mean Absolute Value (MAV) at 70.8%. Also while only including qualified features form the Load experiment in the Exo analyses, several features exhibited detection rates that hovered around random chance (50%) including some of those that performed best during the Load experiment. While the features TM, SM1, LTKEO, IEMG, SM2, MFL, TZC, SM3, WL, ASS, and WA showed significant differences compared to the best performing feature LDAMV before correction, none remained significant after FDR adjustment (
Table 4).
The average detection rate of all muscles (MEAN) is defined by the average of individual muscles with the SOL and GAS showing greatest and the RF showing lowest detection rates for the Load and Exo experiments (
Figure 5A and
Figure 6A).
Table 4.
FDR-adjusted
p-values from paired Wilcoxon signed-rank tests for the reduced set of 33 features of the Exo experiment when comparing to the best performing feature Log Difference Absolute Mean Value (LDAMV). The order is with respect to the order of
Figure 6. While the features TM, SM1, LTEKO, IEMG, SM2, MFL, TZC, SM3, WL, ASS and WA were significant without correction, no features reached significance after FDR correction.
Table 4.
FDR-adjusted
p-values from paired Wilcoxon signed-rank tests for the reduced set of 33 features of the Exo experiment when comparing to the best performing feature Log Difference Absolute Mean Value (LDAMV). The order is with respect to the order of
Figure 6. While the features TM, SM1, LTEKO, IEMG, SM2, MFL, TZC, SM3, WL, ASS and WA were significant without correction, no features reached significance after FDR correction.
| Feature | | Feature | | Feature | | Feature | |
|---|
| LDAMV | – | DAMV | 0.75 | MED | 0.39 | MFL | 0.10 |
| MAD | 1.0 | MSR | 0.72 | IQR | 0.39 | TZC | 0.10 |
| MAV | 1.0 | DASDV | 0.54 | EN | 0.15 | SM3 | 0.10 |
| RMS | 1.0 | LDASD | 0.52 | TTP | 0.15 | WL | 0.10 |
| SD | 1.0 | CARD | 0.74 | TM | 0.10 | ASS | 0.10 |
| MnE | 1.0 | DVARV | 0.61 | SM1 | 0.10 | WA | 0.10 |
| MNP | 1.0 | LD | 0.33 | LTEKO | 0.10 | | |
| VAR | 1.0 | MAX | 0.15 | IEMG | 0.10 | | |
| AAC | 0.75 | MYOP | 0.61 | SM2 | 0.10 | | |
The intra-participant standard deviation and the absolute signal-change show high variability across features in both experiments. Nonetheless, there exists a strong correlation between the magnitudes of these characteristics. Consequently, when calculating the noise-to-signal-change ratio, the feature averages demonstrate much narrower dispersion, ranging from 2.9 to 9.7 for the Load experiment and 1.3 to 4.1 for the Exo experiment (
Figure 5B,C and
Figure 6B,C).
In the Load experiment, a strong correlation (R
2 = 0.69) was observed between the detection rate and the noise-to-signal-change ratio. Conversely, no correlation (R
2 = 0.0) was identified between these variables in the Exo experiment (
Figure 5D and
Figure 6D).
When comparing the detection rates of the Load and Exo experiment, it becomes obvious that there are several features that performed well in one but not the other experiment. In addition, there is a group of fourteen features including CARD (4), DAMV (7), DASDV (8), DVARV (9), LDAMV (17), LDASD (18), MAD (20), MAV (21), MnE (28), MNP (30), MSR (31), RMS (35), SD (36), and VAR (46) that had an average detection rate close or above 70% for both experiments (
Figure 7). These include one complex-based, two amplitude-based, five variation-based and six mean-based features.
4. Discussion
In this study, data from two experiments was used to identify promising EMG-based features capable of distinguishing varying levels of muscle effort for use in human-in-the-loop optimization. Using the Load experiment, 17 out of 50 analyzed features were unable to differentiate increases in muscle activity associated with increasing load. The remaining features showed a clear trend, with higher detection rates as loads increased. Detection rates for the 2 kg session ranged from 60% to 77%, demonstrating the existence of a significant difference and thus favorable features for this task. We validated these findings with data from the Exo experiment and found that some of the top-performing features in the Load experiment failed in a context more closely resembling real-world exoskeleton applications.
In the Load experiment, Waveform Length (WL) emerged as the best-performing feature with the highest detection rate of 77% for the 2 kg condition. Oskoei et al. were also highlighting the accuracy and stability of WL in EMG analysis [
42]. Features such as the Absolute Value of Summation of Square Root (ASS), Willison Amplitude (WA), Integrated EMG (IEMG), and Average Amplitude Change (AAC) also showed high detection rates. These amplitude-based cumulative features (e.g., WL, ASS, WA, IEMG) were most effective due to the consistent increases in muscle activation from carrying loads, which captured gradual changes in muscle effort throughout the stride across all analyzed muscles under the steady, predictable conditions of the Load experiment (
Figure 5).
However, in the Exo experiment, these amplitude-based cumulative features like Waveform Length (WL) performed poorly. The dynamic nature of muscle activation in the presence of exoskeleton assistance, where both increases and decreases in EMG were observed depending on assistance timing, challenged these features. Amplitude-based cumulative features such as WL, Absolute Summation of Square Root (ASS), and Willison Amplitude (WA) sum all sample-to-sample changes without accounting for the direction of the fluctuations. Consequently, when muscle activation increases in some phases and decreases in others within a stride, these features accumulate bidirectional fluctuations without reflecting the net muscle effort, effectively “confusing” the signal. For instance, muscles such as the rectus femoris (RF), gluteus maximus (GM), and biceps femoris (BF) showed detection rates well below 50% for WL, indicating that this feature, which performed well in the Load experiment, struggled with the variable, phase-specific activation patterns introduced by the exoskeleton. This suggests that WL and similar features are less suited to scenarios where EMG exhibits phase-dependent bidirectional variations within each stride (
Figure 6).
In contrast, mean-based (e.g., MAD, MAV, RMS) and variation-based features (e.g., DASDV, DVARV, LDASD, VAR, SD) performed better during the Exo experiment, effectively capturing the dynamic changes in muscle activation introduced by exoskeleton assistance. These features are more adaptable to the fluctuations in muscle activity and stride-time variability introduced by the exoskeleton’s assistance. While amplitude-based cumulative features like Waveform Length (WL) depend on consistent muscle activation, the increased variability in the Exo condition favors features that can capture dynamic changes within a stride. This underscores the importance of context-specific feature selection, especially for tasks involving complex and variable biomechanical interactions, such as those involving exoskeleton assistance (
Figure 6).
The best-performing features in the Exo experiment was the Log Difference Absolute Mean Value (LDAMV). Unlike simple amplitude-based cumulative features (e.g., WL), LDAMV and DAMV (both close to 70% detection rate for both studies
Figure 7) calculate the mean of absolute differences between consecutive EMG samples, effectively capturing the magnitude of fluctuations without canceling out increases and decreases. This allows them to reliably reflect dynamic, phase-specific changes in muscle activation within each stride, making them particularly suited to the variable and timing-dependent assistance provided by the exoskeleton.
We acknowledge that no significant differences were observed between the best- and worst-performing features after correcting for multiple comparisons in the Exo experiment. Nevertheless,
Figure 5 and
Figure 6, as well as
Table 3 and
Table 4, reveal a clear trend: a plateau of high-performing features, including CARD (4), DAMV (7), DASDV (8), DVARV (9), LDAMV (17), LDASD (18), MAD (20), MAV (21), MnE (28), MNP (30), MSR (31), RMS (35), SD (36), and VAR (46) for the combination of Load and Exo experiment. We attribute the absence of significant differences in the Exo experiment primarily to three factors, all of which likely increased data variability. First, there was substantial inter-participant variability in EMG responses to different exoskeleton assistance timings. Second, time-related effects may have contributed, as the reference trial without assistance was performed at the beginning of the study, introducing individual differences in potential EMG signal drift in subsequent conditions as well as individual adaptation or training effects associated with walking with the exoskeleton. Third, the Exo experiment involved fewer comparisons (seven per participant) than the Load experiment (20 per participant), which may have increased the influence of noise and thus limited the ability to detect systematic effects.
The similar performance of several features for the Load and Exo experiment suggests that certain feature types yield comparable outcomes. Therefore, when combining features to differentiate movements or gestures, it may be beneficial to select those that speak to different or distinct EMG qualities to maximize diversity. Toledo et al. showed that it’s not necessary to include a large number of features to differentiate movement conditions [
31]. We anticipate similar performance outcomes for feature combinations when distinguishing movement effort.
In general, seven out of twelve frequency domain features were dismissed after the initial analysis. Additionally, among the top-ranking features from the Load and Exo datasets, the highest-ranking frequency domain feature ranked only seventh out of all features. This suggests that frequency domain features may be less effective at capturing distinguishing characteristics of movement effort based on EMG data.
So far, RMS has been used in previous EMG-based HITL studies [
12,
16,
17]. Reasons for its selection might include: it provides a direct measure of overall signal amplitude, which correlates with muscle effort; its calculation averages out small fluctuations, offering robustness against random noise and a more stable signal than raw EMG; it is simple to compute and interpret; and its longstanding use in biomechanics and prosthetics research makes it a convenient choice. Our results confirm that RMS seems to be one of the most effective: it ranks among the top 14 features when combining results of the Load and Exo experiments (
Figure 7), reliably capturing variations in muscle activation across different conditions.
Our feature analysis highlights that there is no universally best EMG feature, as their effectiveness depends on the task context and how muscle activity changes during the movement. Therefore, feature selection must be carefully adapted to the specific demands of each task. Based on our results, for the application of human-in-the-loop optimization of hip exoskeleton assistance during walking we would currently recommend to explore the use of the top performing features of the Exo experiment that are primarily mean-based and variation-based features (e.g., LDAMV, RMS, SD, MAD, MAV). For unknown conditions with respect to changes in muscle activity, most robust features of both studies could be selected based on
Figure 7. The highest detection rates were consistently observed for the gastrocnemius (GAS) and soleus (SOL) muscles across both experiments, suggesting that changes in the movement tasks had a significant impact on the calf muscles. This aligns with their known biomechanical function during walking: the SOL and GAS are primary contributors to push-off, generating the ankle plantarflexion work that drives forward movement [
73]. Consequently, changes in activation due to load carrying or exoskeleton assistance directly reflect variations in mechanical effort, making EMG from these muscles particularly informative. Given that the noise-to-signal-change ratio was also lowest for these muscles, it may be especially beneficial to focus on SOL and GAS within applications of human-in-the-loop optimization with a hip exoskeleton.
We hypothesized that one reason contributing to a high detection rate is the noise-to-signal-change ratio. This assumption was validated for the Load experiment data. However, this was not the case for the Exo experiment despite many of the best-performing features and the muscles with the highest detection rates having the lowest noise-to-signal-change ratios. Several factors could explain this discrepancy. First, features such as Waveform Length (WL), Absolute Value of Summation of Square Root (ASS), and Willison Amplitude (WA), which performed well in the Load experiment, still exhibited low noise-to-signal-change ratios but had very low detection rates in the Exo experiment. This indicates that their inability to detect changes in movement effort for the Exo experiment disrupts the expected relationship. Second, in the Exo experiment only seven timing conditions that resulted in metabolic reductions were included. Consequently, the effective sample size was approximately one third of that in the Load experiment, where 20 transitions per load condition were analyzed for each participant. A larger sample size helps mitigate the influence of signal noise and variability in EMG measurements [
27]. Third, the reference condition in the Exo experiment (without assistance) was measured only once at the beginning of the experiment, whereas in the Load experiment (unloaded), it was recorded immediately before and after each 40 s load interval. This reduces the impact of signal drift in the Load experiment, a known issue with EMG data [
27]. Finally, although the selected Exo conditions resulted in an average reduction in metabolic cost across participants, individual responses likely varied across muscles and assistance timings. Such participant- and muscle-specific adaptations may further reduce detection rates and obscure the expected relationship between noise-to-signal-change ratio and feature performance. In summary, our results suggest that the noise-to-signal-change ratio is an important factor in determining the effectiveness of an EMG feature for detecting changes in walking effort. However, the nature of individual features also plays a crucial role in regards to handling varying EMG signal qualities, such as during phases of increasing and decreasing activity within a stride.
4.1. Methodological Considerations
In this study, we analyzed a selection of 50 EMG-related features, though we did not conduct a systematic review to identify all potential features used in EMG analysis. It’s possible that other features could offer superior performance. Additionally, combining multiple features, exploring alternative feature options, or applying different feature processing methods could improve detection accuracy. For instance, as suggested by [
50], exploring the derivative from the EMG rather than the EMG directly may enhance performance. Using rectified EMG could also yield benefits for certain features. However, for individual features, we do not anticipate performance outcomes that significantly exceed the range of those already analyzed.
To better understand the limitations and implications of the selected features, it is important to consider how preprocessing choices influence their behavior. The preprocessing applied to the EMG data, particularly the mean subtraction for offset removal, caused certain features to become numerically similar. For instance, Mean Absolute Value (MAV) and Mean Absolute Deviation (MAD), as well as Average Amplitude Change (AAC) and Difference Absolute Mean Value (DAMV), yielded the same results since these metrics measure similar properties once the signal mean has been removed. Likewise, Variance (VAR) and Mean Energy (MnE) became nearly identical as subtracting the mean makes VAR effectively measure the average squared signal values, which aligns it with MnE when data is normalized.
These similarities also extend to frequency-domain features, as highlighted by Parseval’s theorem, which states that a signal’s total energy in the time domain equals its total power in the frequency domain. This theoretical foundation explains why energy- and power-related features, such as Energy (EN) and Total Power (TTP), as well as Mean Power (MNP) and MnE, produced comparable values, since they assess the same signal strength [
74]. Recognizing these overlaps is essential for interpreting the results of feature-based analyses and for identifying opportunities to refine feature selection.
In addition, readers should note that while muscle activity generally increased with load in the Load experiment and decreased with exoskeleton assistance in the Exo experiment, this trend was not consistent across all analyzed conditions for every participant. Therefore, detection rates less than 100% are not unusual.
Further, in both experiments the stride times of the reference and the comparison conditions was relatively close. We can imagine that large variation in stride times could benefit features that accumulate signal amplitude over time, such as IEMG, WL, EN, ASS, or WA, since these features scale directly with stride duration.
Finally, as mentioned in the Introduction, EMG signals are inherently sensitive to factors such as electrode movement, perspiration, muscle fatigue, and changes in skin impedance, which may affect feature magnitudes during extended HITL sessions. While our analysis does not explicitly quantify the resilience of individual features to these effects, we mitigated inter-individual and sensor-related variability by avoiding fixed, absolute amplitude thresholds.
Instead, all features requiring thresholds were normalized using participant- and muscle-specific EMG amplitudes derived from reference walking conditions. This normalization strategy reduces sensitivity to absolute mV-level differences across users and sensors, supporting generalization without requiring extensive per-user calibration. Nevertheless, although this approach improves robustness across participants under comparable walking conditions, feature sensitivity to larger changes in gait dynamics, such as walking speed, slope, load magnitude, or assistance level, was not systematically evaluated and remains an important topic for future experimental validation.
4.2. Future Directions and Experimental Validation
While this study identified EMG features with high detection rates for varying muscle effort, these findings are based on retrospective analysis of two experimental datasets. The effectiveness of the identified features for human-in-the-loop (HITL) optimization of hip exoskeleton assistance still needs to be verified experimentally. Future studies should implement the top-performing features in real-time HITL algorithms to determine whether they improve optimization outcomes, including reducing metabolic cost or achieving faster adaptation.
Additional experiments are necessary to evaluate whether these features generalize to other exoskeleton configurations, assistance patterns, or movement tasks, including upper-limb applications or more variable walking conditions, and to systematically assess their robustness to common EMG signal perturbations such as electrode shifts, sweat accumulation, fatigue, and changes in skin impedance, ensuring reliable performance during extended HITL optimization sessions. Such studies could further investigate how EMG features respond to compensatory strategies or co-contraction induced by exoskeleton assistance, for example by combining EMG with joint-level biomechanics or antagonist muscle analysis. Prospective testing would also allow direct comparison between features under controlled HITL conditions, providing empirical evidence of feature performance and relative effectiveness, which cannot be concluded from the current observational dataset. Identifying muscle synergies can help reduce the number of EMG sensors required. Conversely, combining signals from multiple synergistic muscles may mitigate noise and enhance signal quality, thereby improving the robustness of HITL optimization. While this study focused on healthy participants, the translation of these features to clinical populations with altered neuromuscular control (e.g., stroke, spinal cord injury, cerebral palsy) remains uncertain. Altered muscle recruitment and compensatory strategies may affect feature performance, making the selection of both muscles and features critical for reliably capturing changes in walking effort. Future work should also evaluate the consistency of these features across multiple sessions and days to ensure their reliability for long-term, real-world exoskeleton use.
While EMG-based features provide a direct and sensitive measure of muscle activation, relying on EMG alone as an optimization objective may not fully capture higher-level aspects of locomotor quality. In particular, minimizing muscle activation does not necessarily guarantee preservation of natural gait patterns, balance, or stability, as reductions in EMG amplitude may also arise from altered coordination strategies or redistribution of effort across muscles. Consequently, future HITL implementations may benefit from combining EMG-based objectives with complementary biomechanical or gait-level metrics, such as kinematic symmetry, joint moments, or stability-related measures, to ensure that reductions in muscle effort do not come at the expense of functional or natural locomotion.
From a real-time implementation perspective, computational efficiency remains a relevant consideration. In an initial estimation using a laptop running at up to 2.4 GHz, feature computation times for a single stride were determined in Matlab. RMS was used as a reference feature, as previous studies have demonstrated its effectiveness for EMG-based HITL optimization [
12,
16,
17]. In our implementation, it achieved a computation time of 0.0021 s. When RMS was set to 100%, eight features required 200–400% (e.g., AAC, FR, LCOV, MSR, MED, TTP), seven features required 400–800% (e.g., COV, IQR, KURT, MMDF, SKEW, SM1, SSC), and four features (AR, MDF, MNF, SKEW) exceeded 800% of RMS computation time, with MDF being the most computationally expensive at 0.074 s (approximately 1500%). All other 32 features required less than 200% of the the time. With stride times lasting about 1.1 s, even the more computationally demanding features remain feasible when considering that EMG-based feedback is evaluated over multiple strides to reduce the effects of noise.
Importantly, recent advances in embedded hardware architectures and algorithm deployment techniques substantially reduce traditional constraints related to memory footprint, inference time, and power consumption. For example, Just et al. (2024) [
75] demonstrated that neural network quantization strategies combined with modern microcontroller architectures can reduce flash memory requirements by more than an order of magnitude while simultaneously improving inference speed and energy efficiency in self-contained prosthetic systems. Their comparison of state-of-the-art microcontrollers further showed substantial gains in computational performance relative to previous generations, enabling real-time motor intent decoding on resource-constrained wearable hardware. These findings indicate that embedded platforms are rapidly approaching the computational capabilities required for increasingly sophisticated signal processing and machine learning pipelines. Beyond computational complexity, EMG signal artifacts (e.g., caused by sensor movement or transient loss) may pose additional challenges for real-time implementation. Future work could focus on improving signal robustness and developing methods to automatically detect and exclude corrupted data.
Clarifying these limitations ensures that readers understand the scope of the findings. The results presented here provide a foundation for EMG-based feature selection in HITL optimization, but the ultimate performance and comparative efficacy of these features must be demonstrated in future experimental studies.