Kinematics, Speed, and Anthropometry-Based Ankle Joint Torque Estimation: A Deep Learning Regression Approach

: Powered Assistive Devices (PADs) have been proposed to enable repetitive, user-oriented gait rehabilitation. They may include torque controllers that typically require reference joint torque trajectories to determine the most suitable level of assistance. However, a robust approach able to automatically estimate user-oriented reference joint torque trajectories, namely ankle torque, while considering the effects of varying walking speed, body mass, and height on the gait dynamics, is needed. This study evaluates the accuracy and generalization ability of two Deep Learning (DL) regressors (Long-Short Term Memory and Convolutional Neural Network (CNN)) to generate user-oriented reference ankle torque trajectories by innovatively customizing them according to the walking speed (ranging from 1.0 to 4.0 km/h) and users’ body height and mass (ranging from 1.51 to 1.83 m and 52.0 to 83.7 kg, respectively). Furthermore, this study hypothesizes that DL regressors can estimate joint torque without resourcing electromyography signals. CNN was the most robust algorithm (Normalized Root Mean Square Error: 0.70 ± 0.06; Spearman Correlation: 0.89 ± 0.03; Coefﬁcient of Determination: 0.91 ± 0.03). No statistically signiﬁcant differences were found in CNN accuracy ( p -value > 0.05) whether electromyography signals are included as inputs or not, enabling a less obtrusive and accurate setup for torque estimation. the kinematic-, speed-, and anthropometry-driven DL model may output reliable ankle joint torque trajectories without using EMG signals towards simplifying the current setup for real joint torque estimations. Thus, this study proposes an innovative analysis by investigating the statistical relevance and effect of including EMG signals as input in the DL model performance.


Introduction
There is an extreme need to recover the motor function of people with lower limb impairments so they can independently perform their daily living activities, improving their quality of life [1][2][3][4][5][6]. Physical rehabilitation has been pointed out as the most appropriate strategy to face the long-term motor disabilities of neurologically injured patients [7]. Nonetheless, lower limb rehabilitation specialists have in recent years acknowledged the need to deal with: (i) the disadvantages associated with the inter-and intra-therapist variances; and (ii) the absence of precise, user-, and daily-task oriented repeatable movements during therapy.
Robotics-based gait assistance and rehabilitation interventions managed by Powered Assistive Devices (PADs), such as active orthosis and exoskeletons, have been recommended for subjects with long-term locomotor disabilities and impairments [8,9]. According to [10], rehabilitation therapies driven by PADs may improve the patient's muscular strength, movement coordination, and balance control, fostering the patient's ambulation and performance for successful locomotion.
Typically, these PADs are controlled by torque controllers ( [11][12][13][14][15]) that require joint torque reference trajectories to (i) automatically adapt the assistance that the patient should receive, providing an Assist-As-Needed (AAN) training; or (ii) impose a desired joint motion considering healthy conditions [16]. Generally, these joint torque reference trajectories are obtained from multimodal walking datasets available in the literature. They contain pre-recorded joint trajectories from healthy subjects walking at self-selected speeds (varying between slow, normal, and fast) above force platforms or instrumented treadmills [17][18][19][20].
However, considering the findings reported in [16,[21][22][23][24][25][26], there is evidence that lower limb biomechanical parameters (e.g., joint kinematics, torques, and muscle activations) are highly dependent on the subject's walking speed and anthropometric information (such as body height and mass). Additionally, it is being verified that the typical slower walking speeds of healthy subjects (2.8 km/h) may be higher than the preferred walking speeds of subjects with lower limb impairments (1.6-2.5 km/h) [27]. Thus, the employment of joint torque reference trajectories obtained from available datasets may not be the most suitable method to impose the desired motion, nor to tailor the assistance according to patients' needs due to the potential bias that can introduce in the gait dynamics given the differences in walking speed.
It is necessary to develop a tool that is able to automatically generate healthy joint torque reference trajectories while avoiding the need to collect joint trajectories for many walking speeds and anthropometric data combinations. Nonetheless, the available methods for joint torque generation still present some issues, such as: (i) they only predict the peak of the joint torque in specific gait cycle phases, not generating the entire trajectory for the full gait motion [22,23,28]; (ii) they do not consider the effects introduced by the subject's anthropometry [22,[28][29][30] and speed [23,[29][30][31]; (iii) they depend on several muscle parameters that need to be calibrated to the user's anthropometric characteristics [31]; (iv) they rely on an expensive and complex sensor setup composed by electromyography (EMG) [29,30] or cameras and force plates systems (in case of data collections [17][18][19][20]).
To the authors' best knowledge, there is no available automatic method dedicated to the generation of reference joint torque trajectories oriented to the user's anthropometry and walking speed. Given the successful performance of Long-Short Term Memory (LSTM) and Convolutional Neural Network (CNN) for time-series data and their prominent capacity to model nonlinear walking motion data relationships [23,[32][33][34][35][36], this study tackles the challenges mentioned above, extending our previous work [37]. It proposes an automatic, accurate Deep Learning (DL) approach for generating reference ankle joint torque trajectories under healthy conditions according to the user's walking speed, joint kinematics, and anthropometry, considering a stratified anthropometric distribution. This advance enables the achievement of a user-oriented ankle joint torque reference trajectory by innovatively customizing the trajectories according to the speed, gender, age, body mass, height, and shank and foot length of each subject. The creation of a generalized model is of utmost importance to estimate lower limb joint torque trajectories while avoiding the need for regular data collections every time a user-oriented joint torque is required and facilitating the PAD's setup and use. The ankle joint was chosen as it is the lower limb joint that is commonly affected by neurological diseases [37,38].
Moreover, most studies focused on AAN training by adjusting the PADs' assistance according to the user's joint torque that is estimated in real-time. Most of the algorithms for joint torque estimation fuse EMG data ( [29,30]). EMG-based approaches have been left behind since the EMG sensing is prone to fade during long-term use due to (i) movements between the skin and the electrodes; (ii) temperature variations; and (iii) sweating [39][40][41][42]. These phenomena can cause incorrect joint torque estimations, which may compromise the PADs' assistance efficacy. This motivates the development of joint torque estimation approaches non-dependent on EMG. For this reason, this study aims at a less obtrusive approach for real-time joint torque estimation, versatile to both human motion analysis and robotics-based gait assistance. We hypothesize that the kinematic-, speed-, and anthropometry-driven DL model may output reliable ankle joint torque trajectories without using EMG signals towards simplifying the current setup for real joint torque estimations. Thus, this study proposes an innovative analysis by investigating the statistical relevance and effect of including EMG signals as input in the DL model performance.

Participants
To build the DL models, a single data collection was needed. The study involved thirteen healthy adult participants (6 males and 7 females) with a mean age of 24.2 ± 1.85 years, a mean body mass of 65.2 ± 10.3 kg, and a mean body height of 1.68 ± 0.12 m), as presented in Table 1. Balanced gender distribution was tackled considering possible biomechanical gender differences [43,44]. Before undergoing the experiments, an explanation of the study's purpose was provided, and each participant gave written and informed consent according to the ethical conduct of the University of Minho Committee (CICVS 006/2020). The eligibility criteria consider that the participants must: (i) sign the consent form; (ii) not present any evidence of physical or physiological disorders that could perturb his/her walking pattern; (iii) be older than 18 years old; (iv) present a body mass ranging from 45.0 to 90.0 kg; and (v) present a body height ranging from 1.50 to 1.90 m, covering the mean anthropometric data of adult men and women across countries [45].
A twelve-camera motion-capture system (Oqus; Qualisys-Motion Capture System, Göteborg, Sweden) and five force platforms embedded on the floor (FP4060; Bertec, Ohio, OH, USA) were utilized to collect the 3D kinematic motion data and the ground reaction force (GRF) data at 200 Hz, respectively.
The Newington-Hayes marker-set was adopted to determine the kinematic data from the lower limb joints [47]. A total of 12 pairs of retro-reflective markers were used, as presented in Figure 1.

Experimental Protocol
In the beginning, the gender, age, body mass, height, and shank and foot length of each subject was registered. Then, the subjects were asked to perform a standing anatomic-calibration trial during 3 s with both arms crossed in front of the chest, looking ahead, and with both feet aligned with the shoulders, to adjust all the recordings to the subject's anthropometry. Subsequently, each subject was instructed to perform ten forward walking trials, walking sequentially at seven controlled walking speeds (1.0, 1.5, 2.0, 2.5, 3.0, 3.5, and 4.0 km/h) on a 10 m-flat surface, including five force platforms, as depicted in Figure 2. The speeds were controlled with a metronome. The participants were asked to look ahead and walk naturally according to the beats of the metronome. Further details of the protocol are presented in [48].

Experimental Protocol
In the beginning, the gender, age, body mass, height, and shank and foot length of each subject was registered. Then, the subjects were asked to perform a standing anatomiccalibration trial during 3 s with both arms crossed in front of the chest, looking ahead, and with both feet aligned with the shoulders, to adjust all the recordings to the subject's anthropometry. Subsequently, each subject was instructed to perform ten forward walking trials, walking sequentially at seven controlled walking speeds (1.0, 1.5, 2.0, 2.5, 3.0, 3.5, and 4.0 km/h) on a 10 m-flat surface, including five force platforms, as depicted in Figure 2. The speeds were controlled with a metronome. The participants were asked to look ahead and walk naturally according to the beats of the metronome. Further details of the protocol are presented in [48].

Experimental Protocol
In the beginning, the gender, age, body mass, height, and shank and foot length of each subject was registered. Then, the subjects were asked to perform a standing anatomic-calibration trial during 3 s with both arms crossed in front of the chest, looking ahead, and with both feet aligned with the shoulders, to adjust all the recordings to the subject's anthropometry. Subsequently, each subject was instructed to perform ten forward walking trials, walking sequentially at seven controlled walking speeds (1.0, 1.5, 2.0, 2.5, 3.0, 3.5, and 4.0 km/h) on a 10 m-flat surface, including five force platforms, as depicted in Figure 2. The speeds were controlled with a metronome. The participants were asked to look ahead and walk naturally according to the beats of the metronome. Further details of the protocol are presented in [48].

Data Processing
The EMG signals were filtered using a band-pass fourth-order zero-lag Butterworth filter with cutoff frequencies of 20 and 450 Hz. Furthermore, the signals' envelope was obtained by using the Root Mean Square value with a 300 ms movable window [49].
The 3D marker trajectories registered with the motion-capture system were filtered by the same filter, using a cutoff frequency of 6 Hz [47]. Hereinafter, the filtered 3D marker trajectories, along with the GRF data, were processed in Visual3D software to compute the lower limb joint angles and torques. This computed torque corresponds to the real torque data that were used as ground truth to build DL regression models. The Automatic Gait Events function from the Visual3D software was also used to automatically detect the heel-strike event and segment the data in gait cycle-normalized data. This function considers that the heel strike event corresponds to the instant where the participants' foot strikes the force platforms. Moreover, all heel-strike detections were visually inspected against GRF's Z component data to confer robustness to the data segmentation. Every time that the GRF's Z data component did not start to increase at the heel-strike event by the Visual3D software, the heel-strike instant was manually corrected. Further details are presented in [48]. The raw and the processed data used in this study are available at [48]. Study [48] shows the dataset variability across subjects and speeds.

Data Preparation
We implemented two approaches to predict the ankle joint torques. The first one considered ankle joint angles, angular velocities, angular accelerations, walking speed, body height and mass, foot and shank length, gender, and age, pursuing the evidence that this anthropometric information improves the model performance [23]. In the second approach, we added the EMG signals from the TA and GAL muscles to the above-mentioned inputs. These two muscles were chosen since they are the most responsible muscle for ankle joint motion [50].
Out of the thirteen subjects that participated in this study, one participant was randomly selected to test the final model. A leave-one-subject-out cross-validation (LOSOCV) method among the remaining twelve subjects was implemented to evaluate the model generalization and optimize the model hyperparameters.

Implementation of the Regression Models
In this study, LSTM and CNN were applied, tested, and compared by using Matlab ® (2021a, The Mathworks, MA, USA).
Regarding the LSTM neural network, the data were organized by sequences. Each sequence, which represents a single gait cycle, was composed of X lines (X can take the values 10 or 12, representing the 10 or 12 inputs with or without EMG signals, respectively) and 250 columns (representing the 250 samples that form a single normalized gait cycle). We conducted an empirical analysis to select (i) the number of neurons per LSTM layer (from 10 to 200); and (ii) the number of LSTM layers (from 1 to 2), based on the findings of [23,36,51]. According to [36,51], the number of neurons should be 5 times the number of output responses (1). In this study, we explore higher values to create a model with enough capacity, while controlling overfitting. The final layer corresponds to a fully connected dense (feedforward) layer, and its number was set to the number of responses to predict, i.e., 1 ( [36,51]).
In CNN, we performed a data transformation, treating the inputs as several images [35], being organized into 3D matrices, as follows: width-number of each gait cycle's samples (250); height-number of inputs (10 or 12); depth-number of gait cycles. In this neural network, we also conducted an empirical analysis to select (a) the kernel size (2 × 2, 5 × 5, and 10 × 10); (b) the number of filters per convolutional layer (8,16,32,64); and (c) the number of convolutional layers (1,2,3). This empirical analysis was based on the findings of [52].
In both models, an empirical analysis was applied to select (i) the normalization method (considering the max-min, z-score, and robust normalization methods); (ii) the optimal batch size (from 1 to the maximum number of gait cycles); and (iii) the dropout percentage. In addition, both neural networks' weights and biases were updated according to an adaptive moment estimation optimization algorithm (ADAM) considering the mean square error (MSE) ( [36,51]). Moreover, the activation function used for LSTM and CNN layers was the rectified rectilinear unit (ReLU) considering [36,52].

Model Evaluation Metrics
The regression model performance was evaluated during the LOSOCV and final model testing procedures. Three metrics were determined, namely Normalized Root Mean Square Error (NRMSE), Spearman correlation (SC), and Coefficient of Determination (R2) between two variables: the predicted and the real ankle joint torque trajectory.
The predictions performed by the final model were also evaluated using the Bland-Altman Plot [53]. Furthermore, we computed the average of the real ankle joint torque data of all gait cycles belonging to the test dataset, resulting in a mean gait cycle and its standard deviation (SD). The same procedure was performed for the predicted ankle joint torques. Additionally, we assessed the computational time (in ms) to predict a single sample. All predictions were performed in a Hewlett-Packard computer with an Intel ® Core™ i7-4710MQ CPU @ 2.50 GHz processor and a Random Access Memory with 16.0 GB. Table 2 presents the best results achieved for the two explored DL algorithms, for both validation and test conditions, without considering the EMG signals as input. The results presented by the LOSOCV method (depicted in Figure 3) suggest that CNN appears to be the best-fitted regression model to estimate the ankle joint torque trajectories. This DL regressor revealed a higher generalization capability when compared to LSTM since it presents a higher mean and a lower SD in the evaluation metrics (NRMSE, SC, and R 2 ). Regarding the performance achieved in the test dataset (presented in Table 2), both results are within the mean ± SD exhibited in LOSOCV method, which suggests that both models are generalized. CNN revealed a smaller computational time in comparison with LSTM (0.51 ms < 3.7 ms), which suggests its promising application, even in real-time conditions.

Comparative Analysis of Regression Models without Using EMG Signals
Overall, there is a consistency across all findings that the ankle joint torque trajectories predicted by the CNN exhibit a high level of similarity with real trajectories when compared to the predictions made by LSTM.

Detailed Analysis of CNN Performance
Given its prominent capacity to estimate the ankle joint torque trajectories, we compared the predictions performed by CNN to the real ankle joint torque trajectories of the test dataset using the Bland-Altman Plot. The results depicted in Figure 4a show that the predictions made by CNN are closer to the real ankle joint torque trajectories since the majority of the measures are within the limits of agreement, the bias is close to 0 N.m, and the limits of agreement are small. Furthermore, Figure 4b illustrates the mean and SD values of the real and CNN-based predicted ankle joint torque for the test dataset. These results show that the CNN produced reference ankle joint torque trajectories closely similar to the real ones.

Detailed Analysis of CNN Performance
Given its prominent capacity to estimate the ankle joint torque trajectories, we compared the predictions performed by CNN to the real ankle joint torque trajectories of the test dataset using the Bland-Altman Plot. The results depicted in Figure 4a show that the predictions made by CNN are closer to the real ankle joint torque trajectories since the majority of the measures are within the limits of agreement, the bias is close to 0 N.m, and the limits of agreement are small.

Detailed Analysis of CNN Performance
Given its prominent capacity to estimate the ankle joint torque traject compared the predictions performed by CNN to the real ankle joint torque traje the test dataset using the Bland-Altman Plot. The results depicted in Figure 4a

Walking Speed Versus Body Mass and Height Analysis
We investigated the performance of CNN under variations of walking speed, body mass, and body height, as presented in Figure 5. Visually, the results seem to indicate that the performance of CNN tends to improve as the walking speed increases. Nonetheless, this improvement is minor for each body height or body mass interval since all metrics present values closer to 1 (the desired value to achieve in each evaluation metric). The lower performances were more pronounced for lower values of walking speed. The Friedman test (using a significance level of 5%) was conducted to analyze if the performance in terms of NRMSE, SC, and R 2 presented statistically significant differences between the seven walking speeds. This non-parametric test was chosen since the assumptions of normality, homoscedasticity, and the existence of outliers were not achieved for all evaluation metrics. The results, presented in Table A1, show that there are statistically significant differences (p-value < 0.02) in the CNN performance when the participants walked at the lower speed (1.0 km/h) comparatively to the other speeds (2.0, 2.5, 3.0, 3.5, and 4.0 km/h). No significant differences (p-value > 0.05) were found among the other conditions.  Regarding the body height, the results of Figure 5 show that the model exhibited (i) lower performances for subjects with a body height ranging from 1.60 to 1.70 m; and (ii) higher performances for subjects with a body height between 1.80 and 1.90 m. However, based on the results of the Kruskal-Wallis H test (Table A2), there are no statistically significant differences between different body heights for all evaluation metrics and walking speeds (p-value > 0.05). The same finding was reached for the body mass since no statistically significant differences were found between different body mass for all evaluation metrics and walking speeds (p-value > 0.05).

EMG Inclusion for Ankle Joint Torque Prediction
We investigated the effect of introducing EMG signals from TA and GAL in the ankle joint torque trajectory prediction. The results presented in Table 3 indicate that CNN exhibited a higher performance (a higher mean and a lower SD of NRMSE, SC, and R 2 ) when compared to the results presented by LSTM.
Additionally, we compared the best-fitted CNN model obtained when EMG signals were not used/used as inputs (presented in Tables 2 and 3, respectively). Figure 6 depicts the results of this comparative analysis. Pursuing the hypothesis that the ankle joint torque trajectories can be accurately estimated without using EMG signals, we performed a statistical analysis to evaluate if the differences presented in Figure 6 were statistically significant. The Shapiro-Wilk normality test showed that all data are parametric, and the assumptions of homoscedasticity and the existence of outliers were accomplished. Thus, a two-tailed and paired t-test was conducted with a level of confidence of 95%. The results indicate that there are no significant differences between both approaches, with a p-value > 0.05 having been achieved for all metrics (NRMSE: p-value = 0.10, SC: p-value = 0.17, and R 2 : p-value = 0.28).

EMG Inclusion for Ankle Joint Torque Prediction
We investigated the effect of introducing EMG signals from TA and GAL i joint torque trajectory prediction. The results presented in Table 3 indicate exhibited a higher performance (a higher mean and a lower SD of NRMSE, S when compared to the results presented by LSTM. Additionally, we compared the best-fitted CNN model obtained when EM were not used/used as inputs (presented in Tables 2 and 3, respectively). Figur the results of this comparative analysis. Pursuing the hypothesis that the torque trajectories can be accurately estimated without using EMG signals, we a statistical analysis to evaluate if the differences presented in Figure 6 were s significant. The Shapiro-Wilk normality test showed that all data are parametr assumptions of homoscedasticity and the existence of outliers were accomplis a two-tailed and paired t-test was conducted with a level of confidence of 95%. indicate that there are no significant differences between both approaches, wit > 0.05 having been achieved for all metrics (NRMSE: p-value = 0.10, SC: p-value R 2 : p-value = 0.28).

Discussion
This study was developed under the scope of robotics-based gait assistance and rehabilitation, where reference ankle joint torque trajectories are required to provide an AAN training to the patient or to impose a desired healthy joint motion. In parallel, the proposed DL regression models are also useful for a less obtrusive torque estimation, contributing to the human motion analysis field.
The comprehension of the walking speed, body mass, and height effects on lower limb biomechanical parameters is fundamental to determine changes in the gait pattern [16,21,22,24,26,54]. Contributions to the state-of-the-art were centered on the benchmarking analysis of two DL methods to find an accurate and time-effective approach to automatically generate reference ankle joint torque trajectories according to the speed and anthropometric information of the user. This avoids the need for repeated data collections by research teams dedicated to robotic gait assistance.
In this study, we presented a proof-of-concept of DL regressors' applicability to achieve an accurate and generalized method for estimating ankle torque, considering walking speeds ranging from 1.0 to 4.0 km/h and for subjects with body height and mass varying from 1.51 to 1.83, and from 52.0 to 83.7 kg, respectively. This enables the reference ankle joint torque's estimation for a widespread population.
The achieved results indicated that CNN yielded the best performances (NRMSE: 0.70 ± 0.06; SC: 0.89 ± 0.03; R2: 0.91 ± 0.03) when compared to those presented by LSTM (NRMSE: 0.58 ± 0.20; SC: 0.84 ± 0.08; R2: 0.79 ± 0.22). In [16], the lowest Correlation Coefficient (R = 0.69) was verified for lower walking speeds. Our study achieved the same results as the ones stated in [16] since the lower performances of CNN were achieved for slower speeds, in which statistically significant differences were found between the performance at 1.0 km/h and the remaining walking speeds (p-value < 0.05). This phenomenon may be associated with greater difficulty in maintaining the equilibrium at slow walking speeds (below 2.0 km/h), as reported by the subjects during the data acquisition since these speeds are less comfortable for healthy humans [55]. Consequently, the walking dynamics suffered modifications, increasing intersubject variability. The inclusion of more subjects might enhance the CNN robustness and generalization to deal with intersubject variabilities, boosting the neural network performance, especially at slow speeds. Nonetheless, according to [27], the preferred walking speeds of individuals with lower limb impairments range from 1.6 km/h to 2.5 km/h. Since no statistically significant differences were found in the CNN performance for this range of walking speeds and higher (p-value > 0.05), its use for generating the reference ankle joint torque trajectories desired for impaired subjects is plausible. Further, no statistically significant differences were found (p-value > 0.05) in the obtained performance across different body heights and mass, which allows us to conclude that the implemented CNN is robust in relation to the body height and body mass variation. This finding empowers the notion that the proposed approach is a promising choice for generating ankle joint torque trajectories for individuals with a body height and mass ranging from 1.51 to 1.83 m and 52.0 to 83.7 kg, respectively.
The results achieved in this study are encouraging when considering the findings presented in previous studies [22,23,[28][29][30]. Study [22] reached a R 2 of 0.48 for predicting the peak kinetic ankle dorsiflexion torque using linear and quadratic equations. In [28], peak ankle, knee, and hip joint angles and torques were estimated using linear, quadratic second order, and quadratic third order equations. The highest value of R 2 for peak ankle joint torque estimation was 0.93. Study [23] estimated ankle, knee, and hip angles and torques for the stance phase using feedforward and LSTM neural networks. A R of 0.98 was achieved for ankle joint torque estimation. Despite presenting different datasets and study conditions, our proposal appears to progress the results achieved in [22,23,28], predicting ankle joint torque trajectories for the full gait cycle with a R 2 of 0.91 ± 0.03 (R of 0.95 ± 0.18). The estimation performed for the complete gait cycle is advantageous over the estimation for specific gait events because the torque curve is characterized not only in magnitude but also temporally [15]. Other studies ( [29,30]) have reported that a Multilayer Perceptron can estimate knee joint torque trajectories with a R of 0.97 by combining EMG signals with kinematic sensors. Our study predicts ankle joint torque trajectories, achieving a R of 0.95 ± 0.18, based on fewer input data and without the need for complex data acquisitions related to EMG.
Despite offering promising performances, the use of EMG signals can present some practice-related drawbacks, such as (a) the difficulty to maintain the sensors in the same position during the user's locomotion, which can affect the joint torque estimation over time; (b) the need for an expert-based setup; (c) skin reactions; (d) the measures are affected by temperature variations and sweating [39][40][41][42]56]. From a comparative analysis between the estimation of the ankle joint torque trajectories with and without considering EMG signals as input, significant differences between both approaches were not found (p-value > 0.05). Moreover, the low computational load demonstrated by CNN suggests its promising application, even in real-time conditions. These findings revealed that the proposed speed, kinematic-, speed-, and anthropometry-dependent approach corresponds to a more practical solution than the ones proposed in [22,23,[28][29][30] to estimate ankle joint torque trajectories for the full gait cycle and it may simplify the current sensor setup for estimating real joint torque estimations.
However, there is still room for extending the CNN applicability to other conditions by including more participants per each anthropometric category and a wider range of ages, ensuring a balanced body height and mass distribution, which may increase the model performance. Moreover, introducing different locomotion modes (such as ramps or stairs) may be useful for PADs that address personalized assistance in daily living scenarios. The reference joint torque estimation for the remaining lower limb joints, covering an wide range of walking speeds is another future challenge that could be addressed. Special attention should be paid to lower walking speeds, such as 1.0 km/h, since the intra-and inter-variability across individuals is higher and the lower limb joint biomechanics is highly modified, jeopardizing the model's learning.

Conclusions
This study tackles a gap in the state-of-the-art by presenting a DL-based benchmark approach to generate user-oriented healthy reference ankle joint torque trajectories for walking speeds ranging from 1.0 to 4.0 km/h and for subjects with body height and mass varying from 1.51 to 1.83, and from 52.0 to 83.7 kg, respectively. CNN is demonstrated to be the most accurate and time-effective method from a benchmarking analysis with two DL algorithms (LSTM and CNN). The proposed system for joint torque generation avoids the need for repeated data collection by research teams dedicated to robotic gait assistance. From a comparative analysis between the estimation of ankle joint torque trajectories with and without considering EMG signals as input, significant differences between both approaches were not found. This finding revealed that the proposed speed-, kinematic-, and anthropometry-dependent approach corresponds to a more practical solution for ankle joint torque estimation in several applications of human motion analysis.

Institutional Review Board Statement:
The study was conducted according to the guidelines of the Declaration of Helsinki, and approved by the Institutional Review Board (or Ethics Committee) of University of Minho (CICVS 006/2020).

Informed Consent Statement:
Informed consent was obtained from all subjects involved in the study.
Data Availability Statement: Not applicable.

Acknowledgments:
We thank Pedro Fonseca for the assistance in data collection in LABIOMEP-Porto Biomechanics Laboratory, University of Porto, Porto, Portugal.

Conflicts of Interest:
The authors declare no conflict of interest. Table A1 presents the results of the Friedman test (using a significance level of 5%) conducted to analyze the performance of the kinematic-, speed-, and anthropometry-driven CNN in terms of NRMSE, SC, and R 2 among the seven walking speeds.   Table A2 presents the results of the Kruskal-Wallis H test (using a significance level of 5%) conducted to analyze the performance of the kinematic-, speed-, and anthropometrydriven CNN in terms of NRMSE, SC, and R 2 among the different body heights and mass for each walking speed.