1. Introduction
Locomotion is a universal behavior that animals and humans use to efficiently translocate and navigate between places. Particularly in humans, the central pattern generator, a complex network located in the spinal cord, is responsible for the generation of rhythmic motor behaviors such as walking. The brain stem and motor cortex supply this network with inputs and motor commands, while the various joints, muscles, and skin provide it with sensory feedback. This network then produces different patterns of bipedal gait [
1]. Furthermore, musculoskeletal/neurological disorders and the overall health status of a person can affect their gait, hence producing a unique walking pattern (gait) [
2].
Gait analysis is highly demanded in the medical field, which is mainly adopted for precise patient monitoring, pathological gait treatment assessment, movement abnormality identification, and surgical outcome evaluation [
3]. Its importance in the health area has been discussed in various studies. These studies cover areas such as knee and hip osteoarthritis [
4], falling risk [
5], spinal damage-level determination [
6], Parkinson’s disease diagnosis [
7], and facilitating interactive rehabilitation and predictive diagnostics [
8,
9]. Moreover, it can also be crucial in sports and robotics applications [
10], virtual reality, and character animation applications [
11]. Gaits are interpreted by first quantifying them using representative parameters that are easier to understand. These parameters mainly fall into two categories, either spatiotemporal (e.g., speed and stride/step length), kinematic (e.g., hip extension/flexion), or kinetic (e.g., moments and ground reaction forces) parameters [
3]. In this paper, the focus is the estimation of the kinematic parameters for the lower half of the body. Leg joints are the key source of degrees of freedom for walking locomotion. Hence, accurately computing the joint angles is vital in understanding human gaits during walking. To do so, the type of sensor employed for data acquisition plays a key role in the accuracy of joint-angle computation. Hence, several data-collection techniques have been investigated over the past few years. Generally, they can be categorized as wearable and nonwearable sensor systems.
Nonwearable sensor methods mostly employ a 3D motion capture system using special markers attached to the bodies of subjects. The 3D human pose is captured in a specialized indoor setting, such as laboratories and studios, using a high level of position accuracy optical motion capture systems [
12]. These methods have long been considered the industry standard methods. Another type of nonwearable system which is a pressure-sensing carpet was proposed by the Massachusetts Institute of Technology. It is used to estimate the 3D human pose using the pressure data acquired from the tactile carpet. The system includes a carpet of 36 ft
2 areas with 9216 sensors, readout circuits, and two cameras [
13]. Moreover, vision-based methods by [
14,
15,
16] developed a 3D reconstruction of a human pose from 2D still images and movies while [
17] computed walking speed and stride length from a Kinect camera depth data. Despite their excellent performance, nonwearable systems only operate inside controlled laboratory settings, which makes them difficult for physiotherapists and sports scientists who are looking to bridge the lab-to-field gap. On top of that, such systems are expensive and demand longer setup time and substantial skill.
These limitations are currently being eased owing to the technological advancement of wearable sensor miniaturization. Inertial measurement units (IMUs), electromyography, and other wearable sensors have opened the way for practical indoor/outdoor motion capture systems for long-term use. The continuous digitization progress and the high demand for motion analysis in various fields such as rehabilitation centers have made inertial sensors to be the center of the topic over the last few years. Even though they enable us to assess movements in a real-world setting with easier portability, wearable sensors are not yet a standard practice in motion analysis because of a lack of examination related to accuracy and reliability. However, recent works by [
18,
19] performed an investigation on the reliability and validity of the commercially available inertial sensors called Xsens inertial sensors. They evaluated them for different activities including walking, squatting, and jumping. As a result, they concluded reliability and validity were fair to excellent in the sagittal plane for hip, knee, and ankle joint angles and the system can be used by a clinician to quantify leg-joint angles. For their convenient accompanying software, these inertial sensors were used in this study as well. However, many of the inertial capture systems vary in terms of sensor quantity, sensor positioning, and estimation method [
20,
21,
22]. The study by [
20] adopted an extended Kalman filter method for lower-limb segment position and orientation estimation from two (fixed only to the feet) and three (attached to the pelvis and the feet) sensor sets. For the three-sensor set, they achieved an overall root mean square error (RMSE) of 5.0 ± 1.0, 8.2 ± 2.2, and 5.9 ± 1.6 for the hip, knee, and ankle, respectively. A study by [
21] developed a microcontroller with two inertial sensors mounted to the thigh and the shank for the computation of the knee joint angle. Their system claimed to have achieved an RMSE of 0.04° with a mean average percentage error of 2.95% compared to a Vicon motion capture system. Similarly, [
22] used one inertial sensor fixed to the thigh to target the knee joint angle and two inertial sensors fixed to the shank and thigh to target the ankle joint angles during walking. They have achieved an MAE of 1.69 ± 1.43°, 1.29 ± 1.0°, and 0.82 ± 0.69° for the knee, talocrural joint, and subtalar joint, respectively. In the existing systems, there is a lack of information on how many inertial sensors are enough to correctly estimate the lower-limb-joint angles during walking locomotion. Certainly, multiple inertial sensors would make the subject uncomfortable and the system complex and thus expensive to run. Therefore, any method which employs a reduced sensor quantity while not sacrificing the performance of the system is favorable. Additionally, considering the fact that each person has a unique gait makes it challenging for implementing gait-analysis systems for any random subject. However, a walking motion is comprised of cyclic leg motions where the bone segments move in a correlated way with each other. Hence, the walking motion can be mapped or reconstructed from the motion of a single bone segment. The nonlinear relation that exists among the bone segments could be possibly approximated by neural networks.
Various algorithms have been used to estimate human poses. However, with the ability to reconstruct human poses from fewer sensor quantities and the ability to generalize across subjects, neural networks have been the center of attention in recent years. This has been demonstrated by our previous study, where we investigated the estimating leg joints from only one IMU sensor fixed onto the pelvis of a subject using a neural network [
23]. Another data-driven technique by [
24] gathered data from five people with one IMU sensor unit fixed on the shank of the right leg to train a recurrent neural network (RNN) that approximates the gaits of construction workers. They made a special rectangular wooden frame to perform data measurement experimentation. Then subjects were instructed to walk on top of it while carrying all the computing equipment. Similarly, [
25] also used a shank-mounted single IMU sensor to estimate the sagittal-plane lower-limb-joint angles. Their data collection was performed by instructing subjects to walk in a straight line of a 5-m distance inside a laboratory.
The existing methods explained above proved one or two sensors can be enough to estimate the leg-joint angles with good accuracy. This is possible due to the periodicity and kinematically constrained biomechanical walking of humans. Reduced sensor quantity not only helps reduce the complexity but also contributes to a more natural gait performed by subjects. Despite increased research in this field, there is a paucity of information investigating the most suitable single IMU placement for leg-joint estimation. As the need for portable and simple wearable sensors for motion analysis is growing, identifying the best possible sensor-fixing body locations is the critical part. The position of the fixed single inertial sensor highly affects the estimation result of the neural networks. There is no consensus regarding the position of the sensors on the body as previous studies fix inertial sensors on the pelvis [
20,
23], thigh [
21,
22], shank [
21,
22,
24,
25], and foot [
20]. Hence, in this study, the placement of a single sensor on different parts of the body for joint-angle estimation of both legs will be investigated by employing various neural-network algorithms. This is essential to understand the optimal inertial sensor placement on the lower half of the body when reduced inertial sensors are needed for lower-body motion analysis. This study will contribute to healthcare physiotherapists and motion analysts in the sports field. The most dominant sensor positions in many of the existing studies will be the potential candidates for the inertial sensor placement to estimate two lower-limb-joint angle sets. These include the pelvis, thigh, shank, and foot. According to [
26], CNN is a better candidate for only prediction tasks while LSTM is desired for sagittal-plane joint-angle prediction and real-time joint-angle estimation over multilayer perceptron networks. Hence, four neural networks including convolution-based ones and LSTM networks were selected. These include a unidirectional LSTM, a bidirectional long short-term memory (BLSTM), a convolutional neural network (CNN), and a wavelet neural network (WNN). For the neural-network training, walking data were collected from 16 subjects. The data measurement was performed in an outdoor setting where subjects were told to walk freely and naturally. This study was accomplished with easier mounting labor and significantly lower sensor setup cost.
Therefore, the main contributions of this research are: (i) the use of a single IMU sensor to estimate the lower-limb joint rotation angles from data collected outdoors; (ii) the investigation of an optimal body position for a single inertial sensor placement to estimate the lower-limb-joint angles; (iii) to show the promising future of reduced wearable sensors in addressing gait analysis and pose estimation problems; and (iv) to give physiotherapists and sports scientists insight regarding how good a single inertial sensor can be in estimating lower-limb-joint angles in an outdoor setting. Therefore, this could be further extended for daily activity pose-tracking which could be crucial in rehabilitation and assistive robot applications.
  2. Data Acquisition
  2.1. IMU Sensor
The sensors used in this study are called MTw Awinda (hereafter referred to as Awinda sensors), manufactured by Movella Inc., which is headquartered in Henderson, NV, USA. These sensors are wireless and easy to integrate small microelectromechanical system inertial sensors that are convenient for real-time human motion tracking. Awinda sensors ensure accurate and well-synchronized data among all connected sensors, which is vital in human pose estimation. The sensors are accompanied by a free software named MT Manager, which has the functionality of recording and exporting raw inertial data and orientation data of each sensor.
Since IMU sensors suffer from drifting errors and environmental magnetism, validating and evaluating their performance is a necessary step before their usage. A study by [
27] compared the Awinda sensor system and an 8-camera Qualisys optical motion capture system for walking and static poses. The minimum and maximum average root mean square error (RMSE) results for 18 lower-limb joints were 3.2° and 10.1° for walking and 3.7° and 8.0° for the static pose, respectively. Additionally, the effectiveness of the Awinda sensor system was evaluated in a study by [
28] in comparison to the Optotrak motion capture system using three activities namely walking, descending stairs, and ascending stairs. Resultantly, a mean estimation error of the joint angles ranged from a minimum of 1.38° to a maximum of 6.69°. However, since experiment environments affect the performance of the Awinda inertial sensors, the sensors were tested in our optical motion capture indoor experiment. In particular, verifying the performance of the Awinda inertial sensors’ orientation is the main goal as their orientation is used to compute the joint angles. To do so, five-minute data were collected using a rectangular rigid frame with markers and an Awinda sensor mounted on it. Resultantly, the orientation deviation of the Awinda sensor system from the Optotrack motion capture system was 1.45°, 1.66°, and 0.67° corresponding to the x, y, and z axes. On top of the lower results, our data-collection experiments were conducted for a shorter period, 10 min, to avoid any possible long-term error. However, more importantly, our actual data-collection experimentation was carried out in a barely magnetized outdoor space. The magnetization of the site was verified by the magnetic norm of the sensors as recommended by the manufacturer, which hardly varies. This is because there are no big man-made structures in the outdoor experimental site. Therefore, the Awinda sensor system data are sufficient to rely on for this study’s experimental and analytical needs.
  2.2. Data Measurement
To compute the ground-truth joint-angle values of the lower limb, seven individual Awinda sensors were mounted to the lower half of each subject’s body. As depicted in 
Figure 1, one sensor unit per each lower-body bone segment was fixed. The bone segments include the pelvis, the thighs, the shanks, and the upper parts of the feet. To reduce the effect of skin motion artifacts, sensors are mounted in places with less skin movement. These include the pelvis bone at the height of the anterior superior iliac spine, the middle of the lateral thighs, the upper parts of the tibiae, and the front upper parts of the feet.
Here the objective is to estimate the leg kinematics (joint angles, particularly) from any of the sensors fixed to the body as summarized in 
Figure 2. As the right leg is dominant for most people, the three sensors on the right leg in addition to the waist sensor were investigated and compared in this study. A study by [
29] suggested that human locomotor muscle synergies are decoded from slow cortical waves of the brain. They claimed to have formulated a relationship between brain signals and leg kinematics. However, in this study, a noninvasive method with only a single sensor is used to mimic the function of spinal cord signals during locomotion. This is possible because the movement of our leg is manifested in our pelvis motion, presuming the subject always maintains contact with the ground. The pelvis moves forward/backward and sideways during normal walking. Due to maintaining continuous ground contact, the leg motion directly drives the trunk body depending on the speed and direction. This creates a repetitive rhythmic motion. This makes it easier to estimate the repetitive poses of the lower half of the body from various bone segments’ inertial data. As an example, 
Figure 3, shows the inertial data of the pelvis for a single gait leg pose.
After sensor synchronization, sensor calibration was performed before every experiment by orienting the sensors in one direction on a level surface. Next, sensors were carefully attached to subjects by Velcro tape straps in a similar direction as recommended by the manufacturer. Then, subjects were instructed so that they walk naturally, in any direction, by switching their paces to slow, normal, or fast at their convenience. Hence, diverse data were collected during our experimentation from the 16 subjects. The Awinda station, which is connected directly to an LG Gram 11th Gen Intel
® Core™ i7 computer, receives the synchronized data from the seven sensors via a wireless transmission. The Awinda station antenna supports wireless communication up to 50 m range in an outdoor area. This made the data-collection process a lot easier. The data collection was made at a sampling rate of 100 Hz for approximately 10 min per subject. Sixteen subjects comprised 13 males and 3 females; an age group of 28 ± 7.2 years old; a weight group of 63.3 ± 12.2 [Kg]; and a height group of 169.3 ± 8.1 [cm]. In this study, the data were collected from walking activity only. The experiment was carried out in a level, open space field which does not have any structures that could pose magnetic interference to the sensor. A Google map of the experimental site is shown in 
Figure 4.
  2.3. Data Preparation
The first step of dataset preparation is the ground-truth joint-angle computation. The MT Manager software exports the collected raw motion data from the seven sensors as a text file. However, only three quantities, a 3-axis accelerometer, a 3-axis gyroscope, and a quaternion orientation, were extracted. The MT Manager software calculates each sensor’s orientation in both Euler angles and unit quaternions and outputs it with reference to a global coordinate system. After the raw data are exported and saved as a text file, the next step is to compute the leg-joint angles which will be used as target values during the supervised neural-network training. The joint-angle calculation, dataset preparation, training, and inferencing steps were computed and programmed on the PyCharm IDE using Python 3.7.
Since each sensor is firmly attached to each bone segment of the body, it is assumed that the sensor’s orientation corresponds to the orientation of the associated body segment. The orientation difference between the distal and proximal segments then defines the joint rotation angle that connects them. This is mathematically expressed in Equation (1). All attached sensors are aligned to face the same direction.
In other words, if a subject stands upright, making his shank and thigh perpendicular to the flat ground, the extension/flexion angle of the knee and hip will be 0°.
        
        where 
qdis_prox denotes the distal and proximal bone segments orientation difference, 
qdis is the distal bone segment orientation, and 
qprox is the proximal bone segment orientation. Both the later quantities are measured in reference to the global frame. The ‘⨂’ symbol denotes quaternion multiplication while ‘*’ indicates quaternion complex conjugate. For instance, the rotation angle of the knee joint is computed from the orientations of the distal (thigh) and the proximal (shank) bone segments. This is illustrated in 
Figure 5. Subsequently, the quaternion result from Equation (1) was transformed to Euler angles format from which relevant Euler angles corresponding to the extension/flexion of hip and knee joints were taken as the ground-truth values. The size of the computed joint angles is the same in size as the original raw data collected.
Two sets of target leg-joint angles were investigated. The first set is comprised of four joint angles, namely the extension/flexion joint angles of both the hip and knee of both legs. The second set contains the ankle dorsiflexion/plantarflexion and hip abduction and adduction joint angles of both legs in addition to the first leg-joint angle set. From the collected data, the rotation of hip, knee, and ankle joints ranges from −40° (flexion) to 20° (extension) and 0° (extension) to 80° (flexion), and −18° (dorsiflexion) to 40° (plantarflexion), respectively.
Datasets preparation is the second step during the data preparation stage. Datasets are the input arrays for neural networks during deep learning. These are created by cutting the raw time-series data into smaller-sized data pieces. To prepare the datasets, a sampling window of 100 samples-wide (equivalent to 1 s) with an overlap of 80% was employed to cut the time-series raw data as shown in 
Figure 6. The resultant dataset becomes an array of size 100 × 6 inertial data. This method was implemented on all the inertial data of the pelvis, thigh, shank, and foot. The target labels for the neural networks are the joint angles that correspond to the last frame of the shifting window. The target joint angles which correspond to the input inertial datasets are shown with the vertical lines in 
Figure 6. The target (label) joint-angle data were then organized into 4 × 1 and 8 × 1 arrays for both sets.
For deeper analysis, three varieties of input datasets were created. One dataset has only inertial data of one of the four sensor positions on either leg, which is shaped into a 100 × 6 array. Another dataset consists of inertial data of both feet (bFID) and pelvis inertial data (PID). The resultant dataset was then structured into a 100 × 18 array. The 18 columns are the 6-axis inertial data of the pelvis and both feet. This set was created to examine the estimation performance improvement by combining the inertial data of the pelvis and both feet. The last one adds the subjects’ biometric information to the PID. Each person has a distinctive gait, step size, walking speed, and range of motion. Age, gender, weight, and height are among the factors that could affect these variations. Hence, adding this information to the training process could improve the estimation accuracy. Except for gender, the other quantities are expressed numerically. Hence, gender was represented with a binary quantity that 1 indicates male participants while 0 is for female participants. As a result, the last dataset will have two separate inputs: a 100 × 6 PID and 4 × 1 biometric information data (BID). A total of 50,973 datasets were prepared for deep learning. First, it was divided into three categories as follows: 84.5% of the datasets for training, 14% of the datasets for validation, and the rest 1.5% of the datasets for testing. The testing dataset was collected from a separate subject whose data are not included in the training. The testing data from the 16th subject, which is less than 10 min data, is a new and unencountered dataset for the trained model.
  5. Conclusions
The gait-analysis research area is expanding quickly due to its fast-growing demand in areas such as health services and robotics. Due to the rapid advancement in sensing technology and artificial intelligence, gait analysis has become possible using only a few wearable sensors. However, there is less consensus on the sensor quantity and placement for better lower-leg pose estimation. Therefore, in this study, the placement of a single inertial sensor on the lower half of the body for the leg-joint angle estimation using neural networks was investigated. Four neural-network models were compared using walking-motion data collection from 16 multiracial subjects. Among the neural networks, BLSTM networks performed better with MAE ranging from 3.02° to 4.33° for the four dominant sagittal-plane leg-joint angles. The results were improved with the increment of sensors and the introduction of biometric information. From the investigation of single senor placement, it was found that the shank or thigh is the optimal position for leg-joint angle estimation. Both achieve similar results with an overall average error of 3.84° and 3.65° for the thigh and shank, respectively. Others positions such as the pelvis would not be close enough to capture whole-leg kinematics from the hip to the toe. Furthermore, it was confirmed from the estimation results that a single inertial sensor can be enough to estimate the extension/flexion angles of the hip and knee joints. However, it was challenging to accurately estimate the coronal-plane joint angles of the lower limb and ankle joints owing to the inherent small lateral movement during walking foot–ground impact during heal strike.
Hence, adding low-dimensional sensors, such as pressure sensors, could potentially improve the obtained result. However, this study has achieved a promising result that could serve as a springboard for the further extension of the study to other human activities. If a robust estimation mechanism for various human activities is developed, it can be implemented to solve real-world issues, particularly in healthcare services, assistive robotics, and collaborative robotics.