A Method for Pedestrian Trajectory Prediction Using INS-GNSS Wearable Devices

Pang, Shengli; Wang, Zhe; Xu, Shiji; Long, Weichen; Pan, Ruoyu; Wang, Honggang

doi:10.3390/s26041309

Open AccessArticle

A Method for Pedestrian Trajectory Prediction Using INS-GNSS Wearable Devices

by

Shengli Pang

,

Zhe Wang

^*

,

Shiji Xu

,

Weichen Long

,

Ruoyu Pan

and

Honggang Wang

College of Communication and Information Engineering, Xi’an University of Posts and Telecommunications, Xi’an 710121, China

^*

Author to whom correspondence should be addressed.

Sensors 2026, 26(4), 1309; https://doi.org/10.3390/s26041309

Submission received: 8 January 2026 / Revised: 16 February 2026 / Accepted: 16 February 2026 / Published: 18 February 2026

(This article belongs to the Section Wearables)

Download

Browse Figures

Versions Notes

Abstract

Driven by advancements in artificial intelligence technology, pedestrian trajectory prediction is shifting from traditional machine learning methods toward autonomous decision-making frameworks based on neural networks. However, the spatiotemporal uncertainty of pedestrian movement results in low accuracy of existing prediction models. To address this issue, we propose a multi-source perception fusion system based on INS-GNSS wearable devices. By integrating high-precision inertial measurement units (IMUs) and multi-mode global navigation satellite systems (GNSS), we enhance localization and prediction accuracy. For localization, we introduce a Gait Adaptive UKF (Gait-AUKF) that identifies pedestrian gait patterns and motion states by fusing multi-sensor data. An adaptive algorithm effectively suppresses trajectory drift and improves tracking accuracy. For trajectory prediction, we propose a pedestrian trajectory prediction framework based on a multi-source fusion attention mechanism. A GRU encoder extracts pedestrian trajectory features from historical motion data. An attention mechanism assigns varying weights to trajectory features across different scales. An LSTM decoder and A* path planning algorithm constrain spatiotemporal paths to generate future pedestrian trajectories. Experimental results demonstrate that compared to UKF and AKF, the Gait-AUKF reduces eastward error by 30%, northward error by 26.27%, and vertical error by 49.08%. The complete prediction framework achieves a 68.54% reduction in average position error (APE) and a 70.42% reduction in direction error (DE) compared to LSTM and Transformer models. Ablation experiments demonstrate that the integrated Gait-AUKF algorithm and A* path planning algorithm enhance model decision performance. After incorporating these algorithms, the model’s ADE decreased by 68.49% and FDE by 71.86%.

Keywords:

pedestrian trajectory prediction; INS-GNSS integration; wearable sensors; sensor fusion; adaptive Kalman filter

1. Introduction

In the fields of urban navigation and personal safety, reliable and precise pedestrian localization and tracking systems play an extremely critical role. Pedestrian trajectory prediction (PTP) [1] serves as a key technical prerequisite for ensuring the safety of vulnerable road users (VRUs) [2] in autonomous driving systems. Its core task is to model a pedestrian’s past and current movement data to infer their historical motion states, thereby predicting their future spatiotemporal path [3]. However, pedestrian motion is inherently highly random and multimodal due to intentional uncertainty [4], making accurate and robust trajectory prediction a persistent challenge, particularly in complex urban environments.

In pedestrian navigation, high-precision inertial measurement units (IMUs) can provide high-frequency motion data, but errors in IMU sensors accumulate over time [5,6,7]. Multi-global navigation satellite systems (Multi-GNSS) [8] can improve positioning accuracy, but signals are prone to loss in complex environments such as urban high-rise buildings and tunnels [9]. Therefore, the INS/GNSS integrated navigation system maintains accuracy in the event of signal loss through complementary advantages, while eliminating IMU accumulated errors through satellite positioning. The key lies in effectively integrating the two types of information [10,11]. To suppress IMU drift, IMUs are often placed on the feet or legs and zero velocity update (ZUPT) technology is used. Traditional ZUPT relies on a fixed threshold and is only suitable for uniform motion, making it difficult to adapt to different gaits. Therefore, in previous studies, Johan et al. [12] uased Bayesian adaptive thresholding method to select a separate threshold for each type of motion pattern, but this method overly relied on the number of motion patterns. Cho et al. [13] developed a threshold-free algorithm that detects zero speed through signal shape but is limited to walking and brisk walking motion patterns. This article proposes an adaptive unscented Kalman filter localization algorithm based on gait constraint model. By establishing a pedestrian gait phase constraint model and detecting zero velocity intervals, the heading and step size are optimized according to different motion states. Meanwhile, utilizing an adaptive Kalman filter to fuse motion data and correct position drift during gait effectively improves positioning accuracy and robustness in complex environments.

With the rapid development of intelligent transportation, urban planning, and public safety, trajectory prediction technology has gradually become one of the research hotspots. Pedestrian trajectory prediction is widely used in scenarios such as autonomous driving, crowd behavior analysis, and intelligent monitoring [14]. Due to human sociality, uncertainty in movement, and environmental factors, pedestrian trajectory prediction is a challenging task. In pedestrian trajectory prediction methods, there are mainly data-driven methods and motion model establishment. Firstly, most prediction methods rely on observable external stimuli [15], including historical trajectories, kinematic attributes (such as position, velocity, angular velocity), and contextual information such as road geometry and pedestrian vehicle interaction [16]. Secondly, modeling methods include parameterized methods based on kinematics and dynamics, as well as shallow and deep learning techniques [17,18]. These methods are optimized through various loss functions to generate outputs such as Gaussian distributions, multimodal trajectories, or probabilistic grids.

In recent years, the development of deep learning has significantly improved the accuracy and robustness of pedestrian trajectory prediction, making it increasingly important in practical applications such as autonomous driving and robot navigation. However, accurately modeling of the spatiotemporal relationships in pedestrian motion-especially when facing complex scenes and multimodal future behavior-remains a challenge. Early works such as the Social LSTM model proposed by Alahi et al. [19] integrated the hidden states of neighboring pedestrians through a grid based pooling mechanism, achieving preliminary modeling of pedestrian interaction. Gupta et al. [20] introduced generative adversarial networks (GANs) to handle the multimodal characteristics of trajectory prediction. However, these methods often rely on predefined interaction features or fixed neighborhood structures, making it difficult to explain the complex relationships between pedestrian trajectories. Li et al. [21] used an adaptive spatiotemporal graph construction algorithm to calculate edge weights based on dynamic features such as velocity and direction and combined them with temporal characteristics to generate more accurate trajectory predictions. The SHENet framework proposed by Meng et al. [22] uses a memory bank to store historical trajectories and establishes trajectory prediction based on the relationship between individuals and their surrounding environment. Li, R. et al. [23] combined multi-scale graph based spatial transformers and trajectory smoothing algorithms to predict multiple paths of historical trajectories. Although these advances have been made, there are still two issues with pedestrian trajectory prediction at present. Firstly, methods based on spatiotemporal graphs overly rely on interaction graphs with a single scale, ignoring the long-term trajectory relationships of pedestrians. Secondly, generative models perform well in terms of diversity but often lack clear mechanisms to ensure temporal consistency, resulting in high-frequency turns in generated paths and often ignoring pedestrian dynamic behavior.

Pedestrian trajectory prediction relies on current and historical trajectory information, detecting pedestrian motion intentions and states through motion behavior to enhance the accuracy of future trajectory prediction. This article proposes a multi-source data fusion behavioral attention mechanism framework for pedestrian trajectory prediction and path planning. Using INS and GNSS data to locate pedestrian trajectories, while capturing pedestrian gait motion information as guidance data for motion features. The framework uses a gated recurrent unit (GRU) encoder to extract key features of pedestrian motion and construct an adaptive fusion mechanism guided by physical constraints. Introducing a memory module to store pedestrian historical trajectories and assigning different attention weights to trajectory features at different scales through an attention mechanism addressable device. The LSTM decoder combines a spatiotemporal constraint path planning coordination mechanism to decode future pedestrian motion trajectories, achieving accurate prediction of pedestrian travel paths. The main contributions of this article are summarized as follows:

1. Existing gait assistance algorithms typically use ZUPT or PDR algorithms to estimate the step size and direction of two-dimensional plane displacement localization. The gait constraint localization method proposed in this article can distinguish gait changes and use ZUPT during the standing phase, which can not only update speed but also dynamically adjust the noise statistical characteristics of the filter. At the same time, a motion model for the swing phase was constructed to constrain and expand trajectory positioning in three-dimensional space. The method provides more detailed pedestrian gait decomposition and action model analysis, improving the accuracy of pedestrian trajectory localization.

2. Unlike visual or radar based prediction methods, the limitations of static images make it difficult to understand the spatiotemporal connections between pedestrian movements. By using wearable solutions, the complex spatial relationships of pedestrian movements can be easily captured for predicting low-cost pedestrian positions.

3. The method integrates pedestrian gait information into a unified framework for target tracking and future trajectory prediction, achieving end-to-end sharing of information and effectively addressing noise issues in practical scenarios.

4. The historical memory module is introduced into the attention mechanism, which ensures the smoothness and temporal consistency of predicted trajectories based on retrieval features and trajectory embedding.

The structure of this article is as follows: Section 2 introduces the design framework of the wearable devices used in the study. Section 3 introduces the gait constrained trajectory localization algorithm and pedestrian trajectory prediction algorithm framework proposed in this study. Section 4 introduces the results of algorithm simulation environment and experimental scenario testing. Section 5 is the research conclusion.

2. Wearable Device Design

To accurately capture the dynamic behavior of pedestrians, this study designed a wearable data acquisition hardware device based on the INS-GNSS multi-mode integrated navigation system. The system is powered by a centralized rechargeable 3.7 V lithium battery. The hardware platform adopts STM32H7 series microcontrollers based on Cortex-M7 core, responsible for processing and parsing data from various sensors. The selected IMU is the JY-901S nine-axis inertial sensor, which integrates high-precision three-axis accelerometers, three-axis gyroscopes, and three-axis magnetometers and can collect pedestrian gait data at high frequencies. The selected GNSS module is E108-GN04D, which supports multi-system joint localization (BDS/GPS/GLONASS/Galileo) to provide accurate pedestrian position information. This device integrates multiple communication protocol modules, allowing for flexible information transmission based on specific scenarios. In addition, the collected pedestrian posture data and GNSS data can be stored on the SD memory card in the terminal device for subsequent analysis. The hardware architecture and physical diagram are shown in Figure 1, and the key parameters of IMU and GNSS are shown in Table 1 and Table 2.

3. Methodology

3.1. System Overview

The framework proposed in this article is an integrated framework that includes data perception, pedestrian gait analysis and localization, and pedestrian trajectory prediction. Its core is the principle of multi-source sensor fusion: fusing high-frequency IMU data with low-frequency but high-precision GNSS data to achieve stable and accurate pedestrian positioning and motion prediction. The main modules of the system are shown in Figure 2.

The system collects data obtained from wearable devices, filters it, and inputs it into the Gait-AUKF algorithm for high-precision 3D positioning by integrating IMU and GNSS data. Detailed gait analysis is also performed to obtain pedestrian gait action data, provide the data output by the algorithm to the prediction module, capture the dynamic characteristics of pedestrian motion through gate controlled loop units, and establish spatiotemporal correlations by compressing historical trajectories with memory enhanced attention mechanisms to guide future predictions. Based on a long short-term memory network, a future trajectory with reasonable behavior is generated, and the A* path planning algorithm is added to physically constrain the motion trajectory. The final predicted trajectory generated is both in line with the intention and feasible.

3.2. Pedestrian Dead Reckoning

Pedestrian dead reckoning (PDR) [24] is a sensor-based localization technique that estimates a pedestrian’s trajectory in real-time by detecting steps, estimating stride length, and determining heading angle. In this paper, wavelet transform is employed to distinguish gait phases, and the derived step frequency and stride length are incorporated into the PDR algorithm to enhance the accuracy of trajectory estimation. Given an initial position

(X_{0}, Y_{0})

, subsequent positions are computed recursively using the step length

L_{k}

and the heading angle

Ψ_{k}

:

\{\begin{matrix} X_{k} = X_{k - 1} + L_{k} \cos Ψ_{k} \\ Y_{k} = Y_{k - 1} + L_{k} \sin Ψ_{k} \end{matrix}

(1)

where

X_{k}

and

Y_{k}

denote the current position coordinates, k represents the step index, and

k \geq 1

. During walking, the magnitude of acceleration exhibits periodic fluctuations. For heading estimation, angular velocity

ω = {[ω_{x}, ω_{y}, ω_{z}]}^{T}

acquired from a triaxial gyroscope is integrated numerically to obtain a short-term heading estimate. Observations from a triaxial magnetometer

m = {[m_{x}, m_{y}, m_{z}]}^{T}

are projected onto the horizontal plane using pitch and roll angles, thereby compensating for the accumulated integration error of the gyroscope through magnetometer-based correction. Equation (3) converts the magnetometer from the device coordinate system to the horizontal navigation coordinate system, where

H_{x}

represents the X-direction magnetic field component on the horizontal plane, and

H_{y}

represents the Y-direction magnetic field component on the horizontal plane. An adaptive weighting factor

α_{w} \in [0, 1]

is introduced to balance the contributions of the two sensors:

Ψ_{k} = Ψ_{k - 1} + \int_{t_{k - 1}}^{t_{k}} ω_{z} (t) d t

(2)

\{\begin{matrix} H_{x} = m_{x} \cos ϕ + m_{y} \sin θ \sin ϕ + m_{z} \cos θ \sin ϕ \\ H_{y} = m_{y} \cos θ - m_{z} \sin θ \end{matrix}

(3)

Ψ_{mag} = \arctan (\frac{H_{y}}{H_{x}})

(4)

Ψ_{k} = (1 - α_{w}) \cdot Ψ_{k} + α_{w} \cdot Ψ_{mag}

(5)

3.3. Adaptive Unscented Kalman Filter Algorithm

The Kalman Filter is an optimal recursive estimation algorithm based on Bayesian estimation theory, suitable for linear Gaussian systems. Its core concept lies in a predictionupdate cycle that integrates a system dynamics model with noisy measurement data to achieve minimum mean square error estimation of the state variables. The state and observation equations are given by Equations (6) and (7), respectively:

x_{k} = F_{k ∣ k - 1} x_{k - 1} + B_{k} u_{k} + w_{k}

(6)

z_{k} = H_{k} x_{k} + v_{k}

(7)

\{\begin{matrix} E [w_{k}] = 0, cov (w_{k}, w_{j}) = Q_{k} δ_{k j} \\ E [v_{k}] = 0, cov (v_{k}, v_{j}) = R_{k} δ_{k j} \\ cov (w_{k}, v_{j}) = 0 \end{matrix}

(8)

where

x_{k}, z_{k}

denote the state vector and observation vector at time k,

F_{k ∣ k - 1}

is the state transition matrix, and

H_{k}

is the observation matrix. The process noise

w_{k} \sim N (0, Q_{k})

and the measurement noise

v_{k} \sim N (0, R_{k})

, where

Q_{k}

and

R_{k}

represent the process and measurement noise covariance matrices, respectively. Their statistical properties are specified as Formula (8).

The unscented Kalman filter addresses the nonlinear propagation of mean and covariance through the unscented transform (UT), which employs a deterministic sampling strategy [25]. A set of sigma points is generated and propagated through the nonlinear function, after which the mean and covariance of the transformed points are computed to approximate the output statistics. For a discrete-time nonlinear system, the UKF is formulated as follows:

\{\begin{matrix} X_{k} = f (X_{k - 1}, u_{k}, W_{k}) \\ Z_{K} = H_{k} X_{k} + V_{k} \end{matrix}

(9)

where

f (X)

is a nonlinear state transition function,

H_{k}

is the observation function, and

W_{k}

,

V_{k}

are uncorrelated zero-mean Gaussian white noise processes. Here,

q_{k}

is a non-negative definite matrix and

r_{k}

is a positive definite matrix, representing the covariance matrices of

W_{k}

and

V_{k}

, respectively. The Kronecker delta function is denoted by

δ_{k j}

.

\{\begin{matrix} E [W_{k}] = 0, cov (W_{k}, W_{j}) = q_{k} δ_{k j} \\ E [V_{k}] = 0, cov (V_{k}, V_{j}) = r_{k} δ_{k j} \\ cov (W_{k}, V_{j}) = 0 \end{matrix}

(10)

Algorithms estimate the state of nonlinear dynamic systems through a series of structured processes. The process begins with the initialization of state estimation and covariance matrix:

{\hat{x}}_{0} = E [x_{0}], P_{0} = E [(x_{0} - {\hat{x}}_{0}) {(x_{0} - {\hat{x}}_{0})}^{T}]

(11)

where

{\hat{x}}_{0}

is the initial state vector and

P_{0}

is a positive symmetric definite covariance matrix. Subsequently, a set of

2 n + 1

sigma points is generated, where denotes the dimension of the state vector. Given the state mean

{\hat{x}}_{k - 1}

and covariance

P_{k - 1}

at time

k - 1

, the sigma points are computed as

\{\begin{matrix} X_{k - 1}^{(0)} = {\hat{x}}_{k - 1} \\ X_{k - 1}^{(i)} = X_{k - 1}^{(0)} + γ {(\sqrt{P_{k - 1}})}_{i} i = 1, 2, \dots, n \\ X_{k - 1}^{(i)} = X_{k - 1}^{(0)} - γ {(\sqrt{P_{k - 1}})}_{i} i = n + 1, n + 2, \dots, 2 n \end{matrix}

(12)

Here,

γ = \sqrt{n + λ}

,

λ = α^{2} (n + ζ) - n

.

α

,

ζ

is a scaling parameter that controls the spread of the sigma points around the mean. During the time update, each sigma point is propagated through the nonlinear process model:

X_{k | k - 1}^{*} = f (X_{k - 1}, u_{k})

(13)

The predicted state mean

{\hat{x}}_{k}^{-}

and covariance

P_{k}^{-}

are then computed as

{\hat{x}}_{k}^{-} = \sum_{i = 0}^{2 n} W_{i}^{(m)} X_{i, k | k - 1}^{*}

(14)

P_{k}^{-} = \sum_{i = 0}^{2 n} W_{i}^{(c)} (X_{i, k | k - 1}^{*} - {\hat{x}}_{k}^{-}) {(X_{i, k | k - 1}^{*} - {\hat{x}}_{k}^{-})}^{T} + Q_{k - 1}

(15)

where Q is the process noise covariance matrix, and

W_{i}^{(m)}

and

W_{i}^{(c)}

are weights assigned to the mean and covariance calculations, respectively.

\begin{matrix} W_{0}^{(m)} = \frac{λ}{n + λ} \end{matrix}

(16)

\begin{matrix} W_{0}^{(c)} = \frac{λ}{n + λ} + (1 - α^{2} + β) \end{matrix}

(17)

\begin{matrix} W_{i}^{(m)} = W_{i}^{(c)} = \frac{1}{2 (n + λ)}, i = 1, \dots, 2 n \end{matrix}

(18)

For the measurement update, the observation sigma points are generated and transformed using the measurement model:

Z_{k | k - 1} = h (X_{k | k - 1}^{*})

(19)

The predicted measurement mean

{\hat{z}}_{k}^{-}

, innovation covariance

P_{z z, k}

, and cross-covariance

P_{x z, k}

are calculated as follows:

{\hat{z}}_{k}^{-} = \sum_{i = 0}^{2 n} W_{i}^{(m)} Z_{i, k | k - 1}

(20)

ν_{k} = z_{k} - {\hat{z}}_{k}^{-}

(21)

P_{z z, k} = \sum_{i = 0}^{2 n} W_{i}^{(c)} (Z_{i, k | k - 1} - {\hat{z}}_{k}^{-}) {(Z_{i, k | k - 1} - {\hat{z}}_{k}^{-})}^{T} + R_{k - 1}

(22)

P_{x z, k} = \sum_{i = 0}^{2 n} W_{i}^{(c)} (X_{i, k | k - 1}^{*} - {\hat{x}}_{k}^{-}) {(Z_{i, k | k - 1} - {\hat{z}}_{k}^{-})}^{T}

(23)

Finally, the Kalman gain

K_{k}

is computed, and the state estimate and covariance are updated:

K_{k} = P_{x z, k} P_{z z, k}^{- 1}

(24)

{\hat{x}}_{k} = {\hat{x}}_{k}^{-} + K_{k} ν_{k}

(25)

P_{k} = P_{k}^{-} - K_{k} P_{z z, k} K_{k}^{T}

(26)

This formulation ensures an efficient and accurate mechanism for state estimation in nonlinear systems through sigma point propagation and statistical linearization.

The adaptive unscented Kalman filter (AUKF) [26] incorporates a mechanism for adaptive tuning of the noise covariance matrices. The algorithm proposed in this paper further integrates an adaptive adjustment strategy based on the statistical characteristics of the innovation sequence, enabling dynamic optimization of filtering parameters in response to real time changes in pedestrian motion. The adaptive factor is computed using the norm of the normalized innovation sequence and implemented via a piecewise function for gradual adjustment. The normalized innovation sequence is defined as

v_{k} = z_{k} - {\hat{z}}_{k | k - 1}, {\bar{v}}_{k} = P_{z z}^{- 1 / 2} v_{k}

(27)

where

v_{k}

is the original innovation sequence and

{\bar{v}}_{k}

is its normalized form. A piecewise adaptive factor is designed based on the norm of the normalized innovation sequence:

α_{k} = \{\begin{matrix} 1, & if | | {\bar{v}}_{k} | | \leq c_{0} \\ \frac{c_{0}}{| | {\bar{v}}_{k} | |} \cdot \frac{c_{1} - | | {\bar{v}}_{k} | |}{c_{1} - c_{0}}, & if c_{0} < | | {\bar{v}}_{k} | | \leq c_{1} \\ 0, & if | | {\bar{v}}_{k} | | > c_{1} \end{matrix}

(28)

This function facilitates progressive adjustment of the filtering gain: when the innovation sequence exhibits normal statistical characteristics, standard filtering performance is maintained; when minor model mismatch is detected, the weight of measurement information is reduced proportionally; and in the presence of severe anomalies, the filter relies entirely on predicted information, effectively mitigating the impact of abnormal observations on localization accuracy. The process noise covariance is updated adaptively as:

{\hat{Q}}_{k} = \frac{1}{N} \sum_{j = k - N + 1}^{k} K_{j} v_{j} v_{j}^{T} K_{j}^{T}

(29)

Similarly, the measurement noise covariance is adjusted via

{\hat{R}}_{k} = \frac{1}{N} \sum_{j = k - N + 1}^{k} (v_{j} v_{j}^{T} - P_{z z, j})

(30)

3.4. The Proposed Gait-AUKF Algorithm for Localization

This paper proposes a gait phase-constrained adaptive unscented Kalman filter (UKF) [27] localization algorithm that achieves high precision pedestrian localization by fusing INS data, GNSS measurements, and human gait characteristics [28]. First, a 16-dimensional state space model is established, encompassing position, velocity, attitude, and sensor biases. Second, a gait phase detection method based on acceleration signals is designed to accurately identify the stance and swing phases. Then, an adaptive UKF framework is constructed, where process and measurement noise covariances are dynamically adjusted by monitoring innovation sequences. Finally, a phase dependent constraint weight adjustment mechanism is introduced to apply velocity update constraints during different gait phases. Experimental results demonstrate that the proposed algorithm effectively suppresses drift errors from IMU sensors and maintains high localization accuracy even during GNSS signal outages. Figure 3 depicts the three-axis acceleration changes that transition between dynamic and static during gait.

The algorithm employs a 16-dimensional state vector

x = {[p^{T}, v^{T}, q^{T}, b_{a}^{T}, b_{g}^{T}]}^{T}

to represent the pedestrian’s motion state, including the position vector

p = {[p_{x}, p_{y}, p_{z}]}^{T}

, velocity vector

v = {[v_{x}, v_{y}, v_{z}]}^{T}

, attitude quaternion

q = {[q_{w}, q_{x}, q_{y}, q_{z}]}^{T}

, accelerometer bias

b_{a} = {[b_{a x}, b_{a y}, b_{a z}]}^{T}

, and gyroscope bias

b_{g} = {[b_{g x}, b_{g y}, b_{g z}]}^{T}

.

The measurement inputs include INS data, GNSS position data, and acceleration based gait phase detection. The GNSS observation model measurement equation is as follows:

z_{G} = [\begin{matrix} P_{GNSS}^{n} \\ v_{GNSS}^{n} \end{matrix}] = H_{G} x_{k} + R_{G}

(31)

Gait discrimination is divided into two types: standing phase and swinging phase. The zero velocity update (ZUPT) mechanism is activated when the standing phase is detected, constraining the current velocity state to a zero vector and effectively reducing velocity drift. The gait phase function is defined as

ϕ (t) = \arctan 2 (\int_{0}^{t} ∥ a (τ) ∥ \sin (ω_{s} τ) d τ, \int_{0}^{t} ∥ a (τ) ∥ \cos (ω_{s} τ) d τ)

(32)

ϕ (t)

is the gait phase at time t,

a (τ)

is the acceleration at time

τ

, and

ω_{s}

is the gait angular frequency. When the standing phase is detected, the zero velocity constraint is applied:

0 = H_{Z} x_{k} + R_{Z}

(33)

[\begin{matrix} α_{g} \cdot {\hat{v}}_{x}^{b} \\ β \cdot {\hat{v}}_{x}^{b} + γ_{g} \end{matrix}] = S \cdot C_{n}^{b} (q) v^{n} + R_{Gait}

(34)

During the swing phase, use gait periodicity to establish a motion model related to forward speed or gait phase. The model is shown in the following formula.

{\hat{v}}_{x}^{b}

indicate the estimated velocity in the x-axis direction in the carrier coordinate system,

S

used to select or combine specific components from three-dimensional velocity vectors,

α_{g}

,

γ_{g}

and

β

is gait related model parameters.

C_{n}^{b} (q)

is the rotation matrix from the navigation coordinate system to the carrier coordinate system. After incorporating the subsequent process into the PDR and AUKF algorithms for multi-source observation fusion, it is fed back to the motion mechanics to form a closed loop for pedestrian position localization.

In contrast to the conventional UKF and even the standard AUKF, the proposed Gait-AUKF algorithm introduces a gait phase constraint mechanism that integrates PDR to estimate pedestrian trajectories. This allows the system to maintain localization accuracy even during GNSS outages. By incorporating gait phase detection, the algorithm provides additional reference information for trajectory prediction. It adaptively adjusts the noise statistical model to accommodate the diversity of pedestrian motion patterns, effectively handling uncertainties in complex scenarios such as gait transitions and turning maneuvers, thereby significantly enhancing localization accuracy in challenging environments.

3.5. A Multi-Source Attention Framework for Pedestrian Trajectory Prediction and Planning

3.5.1. Framework Overview

This study proposes a hybrid framework for pedestrian trajectory prediction in complex dynamic environments, which integrates multi-source fusion attention mechanism, long short-term memory (LSTM) network, and A* path planning algorithm. This model is based on the front-end Gait-AUKF algorithm for real time estimation of pedestrian trajectory and gait information. By delving into the intrinsic correlation between leg movement features and advanced behavioral intentions, pedestrian trajectory prediction is achieved.

In terms of architecture design, this model fully utilizes the advantages of attention mechanism in capturing long-distance spatiotemporal dependencies, as well as the strengths of long short-term memory (LSTM) network in maintaining motion state memory. By analyzing the temporal evolution characteristics such as gait cycle and step size, the system can simulate the probability distribution of pedestrian intention. On this basis, an A* path planning algorithm for collaborative reasoning is introduced: the A* path planning algorithm guides the pedestrian direction provided by the front-end, performs heuristic search in the environmental map, and generates the optimal or suboptimal geometric path that conforms to physical constraints such as obstacles and road structures. The environmental map is a map data from the experimental area, obtained by vectorizing and annotating information such as road boundaries, pedestrian crossings, and fixed obstacles. The output of the prediction module will undergo dynamic collaborative evaluation with the planned path, ultimately generating a trajectory that combines behavioral authenticity and physical feasibility. The pedestrian trajectory prediction architecture and data flow diagram are shown in Figure 4.

According to architecture Figure 4, it can be seen that the data flow of the algorithm begins with multi-source perception data from wearable devices, which is received and processed by the Gait-AUKF module to generate a 16-dimensional state vector and gait feature vector, including three-dimensional position, three-dimensional velocity, quaternion pose, and sensor deviation estimation. Sensor error estimation is used for module adaptive parameter adjustment. The original 16-dimensional state undergoes dimension and format conversion as a time series input to the GRU encoder, where acceleration is calculated from velocity difference. The GRU encoder compresses the temporal feature sequence into a fixed length encoding vector

h_{T}

as the query state and sends it to the attention mechanism module. It interacts with the key value encoder in the historical memory to output a weighted context

c

. The LSTM decoder obtains the data stream and outputs the final predicted trajectory under the physical constraints of the A* path planner.

3.5.2. Gated Recurrent Unit Encoding Mechanism

The motion feature encoder is the temporal feature extraction module of this prediction model, which takes the real-time multi-dimensional pedestrian state sequence output by Gait-AUKF as input. Its core is a cyclic encoding network composed of gated recurrent units (GRUs). This encoder utilizes the gating mechanism of GRU to adaptively integrate the short-term dynamics and long-term trends of pedestrian trajectories, thereby constructing a hierarchical motion feature representation. The input data is the feature vector

x_{t} = [p_{t}, v_{t}, a_{t}, g_{t}]

for each time step, and GRU controls the flow of information through reset and update gates, as shown in Figure 5 where

p_{t}

denotes the 2D planar coordinates;

v_{t}

and

a_{t}

represent the instantaneous velocity and acceleration vectors, respectively; and

g_{t}

corresponds to the gait feature vector.

Resetting the gate can reduce the influence of historical states when gait mutations are detected, making candidate states more dependent on the current input and responding to instantaneous changes. When the motion trend of the update gate is stable, it tends to retain most of the historical state, thereby maintaining the continuity of the motion direction and supporting long-term path modeling. The encoder outputs the hidden state of the last time step as the representation of the entire sequence. This vector

h_{T}

integrates multi-level information from subtle gait adjustments to macroscopic motion trends, which will be used as input for subsequent multi-source fusion attention modules for further spatiotemporal correlation and future trajectory inference.

3.5.3. Design of the Attention-Based Addressing Mechanism

The memory enhanced attention addressing module is the core module for implementing historical experience retrieval in this framework. Its function is to establish semantic similarity mapping between the currently observed features and compressed historical memory features, thereby providing interpretable historical references for prediction. This module receives two input sources: first, the feature vector

q_{m} = h_{T}

output by the motion feature encoder that fuses the current time period’s motion and intent, and second, the key value set

M = {\{(k_{i}, v_{i})\}}_{i = 1}^{N}

constructed by compressing and encoding the features of all historical time steps. The key vector

k_{i}

is used for similarity addressing, and the value vector

v_{i}

stores the corresponding trajectory context information.

This module adopts a dual encoder architecture to map the target pedestrian’s historical trajectory set and pedestrian action features to the same semantic space and combines the spatiotemporal relationship between pedestrian action patterns and historical trajectories to predict the pedestrian’s future trajectory. The query encoder takes pedestrian motion feature vectors as input and establishes a nonlinear mapping based on pedestrian motion features. The key value encoder takes pedestrian trajectory data as input, learns the periodic motion patterns of pedestrians in the trajectory, and captures long-term temporal dependencies. Query encoder and key value encoder are multi-layer perceptrons with parameter sharing, where

θ_{q}

,

θ_{k}

and

θ_{v}

are learnable parameters,

s (q_{m}, k_{i})

represents the similarity score. In the calculation, since both

q^{'}

and

k_{i}^{'}

are normalized, Formula (35) can be optimized as a dot product.

q^{'} = f_{q} (q_{m}; θ_{q}), k_{i}^{'} = f_{k} (k_{i}; θ_{k}) v_{i}^{'} = f_{v} (v_{i}; θ_{v})

(35)

s (q_{m}, k_{i}) = \frac{q^{'} \cdot k_{i}^{'}}{∥ q^{'} ∥ ∥ k_{i}^{'} ∥} = q^{'} \cdot k_{i}^{'}

(36)

Normalize the similarity score to attention weights to achieve soft addressing of memory,

α_{i}

represents the attention weight of the i memory.

\begin{matrix} α_{i} & = \frac{\exp (s (q_{m}, k_{i}) / \sqrt{d_{h}})}{\sum_{j = 1}^{N} \exp (s (q_{m}, k_{j}) / \sqrt{d_{h}})} . \end{matrix}

(37)

The context vector

c

retrieved is a weighted aggregation of memory values.

\begin{matrix} c & = \sum_{i = 1}^{N} α_{i} \cdot v_{i} . \end{matrix}

(38)

The obtained vector

c

integrates the historical experience most relevant to the current state. This vector will be concatenated with the output of the GRU encoder to form an enhanced fusion feature.

3.5.4. Collaboration Between LSTM Decoder Trajectory Prediction and A* Path Planning

This framework adopts a collaborative mechanism of LSTM decoder and A* path planning [29] to generate trajectory predictions that conform to pedestrian behavior patterns while ensuring physical feasibility and intention orientation. The LSTM decoder [30] takes the feature representation obtained in the encoding stage as the initial state input

h_{t} = [h_{T}, c]

, and gradually generates the future trajectory sequence through recursion. The schematic diagram of recursive calculation is shown in Figure 6.

The A* path planning algorithm provides physically feasible path guidance for the LSTM decoder through heuristic search. The algorithm takes the current pedestrian position

P_{t}

as the starting point, determines the target point

d_{t} = (\cos θ_{t}, \sin θ_{t})

based on orientation information

P_{g} = P_{t} + L \cdot d_{t}

, and searches for the optimal path to avoid obstacles in the environment grid map. The core evaluation function balances the actual cost with heuristic estimation:

f (n) = g (n) + h (n)

. Where

g (n)

is the actual cumulative cost from the starting point to node n, and

h (n)

is the Euclidean distance heuristic function. The algorithm outputs a sequence of path points

P^{astar}

as the intention guidance for pedestrians to move towards the target implicit in the path. The trajectory prediction

P^{lstm}

generated by the LSTM decoder is fused with the physically feasible path

P^{astar}

planned by the A* path planning algorithm through a collaborative weighting mechanism as follows.

P_{t + k} = σ (θ_{k}) \cdot P_{t + k}^{l s t m} + (1 - σ (θ_{k})) \cdot P_{t + k}^{astar}, k = 1, \dots, K θ_{k} = w_{1} \cdot u_{k} + w_{2} \cdot e^{- d_{k}} + b

(39)

The final trajectory coordinate

P_{t + k}

is obtained by adaptively fusing the coordinates predicted by the LSTM decoder,

P_{t + k}^{lstm}

, and the coordinates planned by the A* path planning algorithm,

P_{t + k}^{astar}

, through weight fusion. The weights are calculated by the sigmoid function

σ (θ_{k})

,

u_{k}

is the variance based on prediction, and

d_{k}

is the difference between prediction and planning.

4. Results and Discussion

4.1. Gait-AUKF Pedestrian Trajectory Localization Algorithm

In order to verify the rationality and feasibility of the proposed Gait-AUKF pedestrian trajectory localization algorithm, simulation experiments and real data tests were conducted. Real world data is collected using independently developed wearable devices worn by pedestrians. The placement of the equipment is shown in Figure 7. The simulation experiment replicated the real pedestrian movement trajectory, and the visual results are shown in Figure 8.

The simulation system was configured with an SINS data sampling rate of 200 Hz and a GNSS position update rate of 10 Hz. Detailed parameters are provided in Table 3. The initial geographic location was set to 34.0929° north latitude and 108.5374° east longitude, using a north-east-up (NEU) navigation coordinate system with zero initial velocity. Typical measurement errors from both INS and GPS were superimposed on the predefined reference pedestrian trajectory.

The experiment was designed to include comprehensive motion patterns such as uniform linear motion, ascent, descent, acceleration, and turning in order to evaluate the algorithm’s accuracy in estimating pedestrian position and quantify the resulting localization error.

To validate the robustness of the proposed Gait-AUKF algorithm in complex environments, a simulation experiment was designed with artificially intensified measurement noise to simulate real world scenarios involving sudden noise variations and interference. Under these abnormal noise conditions, a comparative performance test was conducted among three filtering algorithms: UKF, AUKF, and Gait-AUKF, with a focus on analyzing the accuracy of the output navigation trajectories and positional error characteristics. This evaluation aims to assess the adaptability and filtering effectiveness of each algorithm under noisy conditions. Root mean square error (RMSE) and maximum position error (MPE) were adopted as evaluation metrics. The RMSE is defined as:

R M S E = \sqrt{\frac{1}{N} \sum_{i = 1}^{N} {({\hat{X}}_{i}^{k} - X_{i}^{k})}^{2}}

(40)

where N denotes the total number of time points, and

{\hat{X}}_{i}^{k} - X_{i}^{k}

represents the error between the algorithm output and the ground truth at the i time point. The MPE describes the maximum deviation observed under worst case conditions. A comparison of the trajectories estimated by each algorithm is shown in Figure 9, and the distance errors in the east, north, and up directions are presented in Figure 10. The colored diamond markings in Figure 9 represent significant trajectory changes that are about to occur at this time. The blue diamond pattern represents a left turn, the orange diamond pattern represents a climb after a right turn, and the orange diamond pattern represents a descent. The colored dashed lines in Figure 10 also represent the same meaning.

Figure 10 presents a performance comparison of the algorithms in the East-North-Up (ENU) navigation coordinate system. The proposed Gait-AUKF algorithm incorporates gait phase constraints and an adaptive correction mechanism for the measurement noise covariance matrix during iteration, resulting in superior estimation accuracy compared to the conventional UKF and AUKF methods. The results in Table 4 further verify the effectiveness of the proposed algorithm.

The real world data collection environment is shown in Figure 11. In this study, a high precision RTK device was used to provide ground truth localization trajectories. The performance of each algorithm was evaluated using RMSE and MPE. Experimental results demonstrate that the Gait-AUKF algorithm maintains better localization accuracy and reliability in real world scenarios.

Figure 12 shows the navigation trajectories obtained from real world measurements, where data collected by the RTK device serve as the ground truth trajectory. The proposed Gait-AUKF algorithm is represented by the orange trajectory, while the blue and yellow lines correspond to the AUKF and UKF algorithms, respectively. As illustrated, the trajectories estimated by the algorithms exhibit certain deviations from the ground truth, which are further quantified in Figure 13. This figure displays the deviations of the three localization algorithms relative to the ground truth in the East-North-Up directions.

In the 3D plane, both the AUKF and Gait-AUKF trajectories align more closely with the ground truth compared to the UKF method, which exhibits noticeable drift over time. Although all three algorithms show deviations in altitude estimation, Gait-AUKF demonstrates significantly smaller errors, indicating superior performance in trajectory localization and tracking. These observations are further supported by the quantitative RMSE and MPE metrics presented in Table 5.

4.2. Multi-Source Fusion and Attention-Based Framework for Pedestrian Trajectory Prediction and Planning Decision

This study aims to evaluate the accuracy and practicality of the proposed pedestrian trajectory prediction decision framework. Based on real world road scenarios, we designed a data collection protocol to capture human pose data and location information in real time, thereby constructing a dataset specifically for model training and optimization. This approach enhances the algorithm’s adaptability to diverse and complex environments, using the proposed Gait-AUKF algorithm to filter and fuse pedestrian trajectories collected by wearable devices. The coordinate data generated by GNSS is combined with INS measurement values, and pedestrian gait information is added to form the final input trajectory prediction algorithm dataset. During the training process of SHENet and STMR models, the frame numbers in the models are based on the absolute time obtained from GNSS. Convert the two dimensional coordinates relative to the static image to the absolute position output by Gait-AUKF, and then convert them to plane coordinates based on the relative motion of pedestrians. The ST-MR model outputs multimodal predicted trajectories. When compared with other models, the predicted trajectory with the smallest error from the original trajectory is selected from the multimodal predicted trajectories. Table 6 summarizes the parameter settings used during the training period.

The performance of the trajectory prediction model was evaluated using average displacement error (ADE) and final displacement error (FDE), both inversely related to prediction accuracy, i.e., lower values indicate better performance. ADE measures the average Euclidean distance between the predicted and ground truth trajectories over all time steps:

ADE = \frac{1}{T} \sum_{t = 1}^{T} \sqrt{{(x_{t} - {\hat{x}}_{t})}^{2} + {(y_{t} - {\hat{y}}_{t})}^{2}}

(41)

Among them, T is the total number of time steps,

(x_{t}, y_{t})

and

({\hat{x}}_{t}, {\hat{y}}_{t})

respectively represent the ground truth and predicted position in the coordinate system established with the predicted point as the origin at time t. FDE computes the Euclidean distance specifically at the final predicted time step:

FDE = \sqrt{{(x_{t} - {\hat{x}}_{t})}^{2} + {(y_{t} - {\hat{y}}_{t})}^{2}}

(42)

During the process of pedestrians walking in a straight line, the range of trajectory changes is not large, and the quality of model prediction cannot be evaluated. Therefore, this article chooses turning intersections with drastic changes to verify the model. As shown in Figure 14, in the nonlinear motion of pedestrians, the predicted trajectories of different methods were compared with the original trajectories. Specifically, the SHENet [22], ST-MR model [21], and proposed method (MSF-TrajPlan) perform more similarly, while the trajectory predicted by the Transformer model shows significant deviation. At the beginning of trajectory prediction, the SHENet model showed good response, but as the time length increased, the ST-MR model became more prominent in long-term trajectories. In contrast, the method proposed in this article performs more stably throughout the entire prediction process. This is because the motion feature encoding mechanism and historical trajectory memory mechanism not only effectively handle the instantaneous changes in pedestrian actions but also dynamically remember trajectory time information, thereby enhancing the smoothness and rationality of the trajectory. Table 7 provide quantitative performance indicators.

This study models a crosswalk intersection with a pedestrian overpass in an urban area, where start and end points are defined based on satellite imagery to simulate streetcrossing behavior. Since the ground crosswalks are blocked by railings, pedestrians must use the overpass to reach their destinations. A satellite view of the urban intersection is presented in Figure 15.

In the validation experiment of pedestrian movement through the overpass intersection, the trajectory prediction and planning decision framework produced the results shown in Figure 16. To facilitate interpretation, the trajectories are visualized with red circles indicating the start points, green diamonds denoting the end points, and blue lines representing the predicted paths, allowing clear comparison among trajectories with different endpoints.

As shown in Figure 16, the model generates different planned routes based on the destination, with each route starting from the same starting point. Both suggested paths involve crossing pedestrian overpasses and comply with realistic movement restrictions and physical accessibility. Figure 17 shows the difference in trajectory between pure GNSS data without map restrictions and actual roads responsible for urban roads. The GNSS data exhibits certain deviations, while the model constrains the trajectory correctly to the overpass structure and generates stable predicted trajectory points under the guidance of the model.

The ablation experiment results in Table 8 reveal the different effects of the two core modules Gait-AUKF module and A* path planning module on prediction error in this framework. The analysis found that the core function of the Gait-AUKF module is to provide the initial state of the system and the historical trajectory of pedestrians, while the core function of the A* path planning module is to impose physical feasibility and scene structure constraints.

The accuracy of the current state provided by the Gait-AUKF module determines the initial conditions for recursive prediction. Without this module, errors in the decoding process are continuously amplified and accumulated by the recursive model, resulting in severe offset of the predicted endpoint. The function of the A* path planning module is to limit the trajectory provided by A* to physically feasible paths through fusion mechanisms when the decoder generates trajectory segments that may cross obstacles or deviate from feasible areas based on behavioral patterns. This enhances the overall rationality of the trajectory and has a good effect on improving ADE.

5. Conclusions

This study proposed a multi-source fusion and attention-based framework for pedestrian trajectory prediction and planning, supported by a novel Gait-AUKF algorithm that adaptively integrates gait phase detection with nonlinear motion modeling. Experimental results demonstrate that the proposed method achieves higher accuracy and robustness compared to UKF, AUKF, LSTM, and Transformer baselines, particularly in turning scenarios and complex urban settings such as overpass intersections. The incorporation of Gait-AUKF and A* path planning significantly improved performance, reducing ADE and FDE by over 68% and 71%, respectively. The framework provides a reliable solution for trajectory prediction in dynamic pedestrian environments.

Author Contributions

Conceptualization, Z.W. and S.P.; methodology, Z.W.; software, Z.W.; validation, Z.W., S.X. and R.P.; formal analysis, Z.W.; investigation, W.L.; resources, Z.W.; data curation, Z.W.; writing—original draft preparation, Z.W.; writing—review and editing, Z.W.; visualization, H.W.; supervision, Z.W.; project administration, Z.W. All authors have read and agreed to the published version of the manuscript.

Funding

This work is supported by the Key Research and Development Plan of Shaanxi Province (No. 2024GX-ZDCYL-01-33, No. 2024PT-ZCK-25, No. 2024CY2-GJHX-63), the Key Industry Innovation Chain Project of Shaanxi Province (No. 2021ZDLGY07-10, No. 2021ZDLNY03-08), the Science and Technology Plan Project of Shaanxi Province (No. 2022GY-045), Scientific Research Program Funded by Shaanxi Provincial Education Department (Program No. 21JC030), the Science and Technology Plan Project of Xi’an (No. 22GXFW0124).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data supporting the reported results in this study are not publicly available due to privacy restrictions.

Conflicts of Interest

The authors declare no conflict of interest.

References

Wang, X.; Yang, X.; Zhou, D. Goal-CurveNet: A pedestrian trajectory prediction network using heterogeneous graph attention goal prediction and curve fitting. Eng. Appl. Artif. Intell. 2024, 133, 108323. [Google Scholar] [CrossRef]
Ahmed, S.; Huda, M.N.; Rajbhandari, S.; Saha, C.; Elshaw, M.; Kanarachos, S. Pedestrian and Cyclist Detection and Intent Estimation for Autonomous Vehicles: A Survey. Appl. Sci. 2019, 9, 2335. [Google Scholar] [CrossRef]
Wang, R.; Hu, Z.; Song, X.; Li, W. Trajectory Distribution Aware Graph Convolutional Network for Trajectory Prediction Considering Spatio-Temporal Interactions and Scene Information. IEEE Trans. Knowl. Data Eng. 2024, 36, 4304–4316. [Google Scholar] [CrossRef]
Zhang, Z.; Ding, Z.; Tian, R. Decouple Ego-View Motions for Predicting Pedestrian Trajectory and Intention. IEEE Trans. Image Process. 2024, 33, 4716–4727. [Google Scholar] [CrossRef] [PubMed]
Chen, C.; Wu, X.; Bo, Y.; Chen, Y.; Liu, Y.; Alsaadi, F.E. SARSA in extended Kalman Filter for complex urban environments positioning. Int. J. Syst. Sci. 2021, 52, 3044–3059. [Google Scholar] [CrossRef]
Hsu, Y.L.; Wang, J.S.; Chang, C.W. A Wearable Inertial Pedestrian Navigation System With Quaternion-Based Extended Kalman Filter for Pedestrian Localization. IEEE Sens. J. 2017, 17, 3193–3206. [Google Scholar] [CrossRef]
Li, Y.; Guo, Z.; Wang, Q.; Cui, X. An advanced adaptive algorithm driven by online blind noise level estimation for pedestrian positioning. Measurement 2024, 235, 115028. [Google Scholar] [CrossRef]
Wang, X.; Liu, J.; Liu, Z.; Dan, H. Multi-GNSS and INS data fusion enhancement algorithm combined with ANT colony particle filtering. J. Nonlinear Convex Anal. 2020, 21, 8. [Google Scholar]
Wang, F.; Xu, L.; Zhuang, W.; Yin, G.; Pi, D.; Liang, J.; Liu, Y.; Lu, Y. Geometry-Based Cooperative Localization for Connected Vehicle Subject to Temporary Loss of GNSS Signals. IEEE Sens. J. 2021, 21, 23527–23536. [Google Scholar] [CrossRef]
Hu, G.; Wang, W.; Zhong, Y.; Gao, B.; Gu, C. A new direct filtering approach to INS/GNSS integration. Aerosp. Sci. Technol. 2018, 77, 755–764. [Google Scholar] [CrossRef]
Basso, M.; Galanti, M.; Innocenti, G.; Miceli, D. Triggered INS/GNSS Data Fusion Algorithms for Enhanced Pedestrian Navigation System. IEEE Sens. J. 2020, 20, 7447–7459. [Google Scholar] [CrossRef]
Wahlström, J.; Skog, I.; Gustafsson, F.; Markham, A.; Trigoni, N. Zero-velocity detection—A Bayesian approach to adaptive thresholding. IEEE Sens. Lett. 2019, 3, 1–4. [Google Scholar] [CrossRef]
Cho, S.Y.; Park, C.G. Threshold-less Zero-Velocity Detection Algorithm for Pedestrian Dead Reckoning. In Proceedings of the 2019 European Navigation Conference (ENC), Warsaw, Poland, 9–12 April 2019; pp. 1–5. [Google Scholar] [CrossRef]
Lefevre, S.; Vasquez, D.; Laugier, C. A survey on motion prediction and risk assessment for intelligent vehicles. ROBOMECH J. 2014, 1, 1. [Google Scholar] [CrossRef]
Ellis, D.; Sommerlade, E.; Reid, I. Modelling pedestrian trajectory patterns with Gaussian processes. In Proceedings of the 2009 IEEE 12th International Conference on Computer Vision, Kyoto, Japan, 29 September–2 October 2009; pp. 1229–1234. [Google Scholar] [CrossRef]
Wu, H.; Chen, Z.Y.; Sun, W.; Zheng, B.; Wang, W. Modeling trajectories with recurrent neural networks. In Proceedings of the 26th International Joint Conference on Artificial Intelligence, Melbourne, Australia, 19–25 August 2017; pp. 3083–3090. [Google Scholar]
Karatzolou, A.; Jablonski, A.; Beigl, M. A Seq2Seq learning approach for modeling semantic trajectories and predicting the next location. In Proceedings of the 26th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, Seattle, WA, USA, 6–9 November 2018; pp. 528–531. [Google Scholar] [CrossRef]
Amirian, J.; Hayet, J.B.; Pettré, J. Social ways: Learning multi-modal distributions of pedestrian trajectories with GANs. arXiv 2019, arXiv:1904.09507. [Google Scholar] [CrossRef]
Alahi, A.; Goel, K.; Ramanathan, V.; Robicquet, A.; Fei-Fei, L.; Savarese, S. Social LSTM: Human Trajectory Prediction in Crowded Spaces. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 961–971. [Google Scholar] [CrossRef]
Gupta, A.; Johnson, J.; Fei-Fei, L.; Savarese, S.; Alahi, A. Social GAN: Socially Acceptable Trajectories with Generative Adversarial Networks. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–23 June 2018; pp. 2255–2264. [Google Scholar] [CrossRef]
Li, L.; Pagnucco, M.; Song, Y. Graph-based Spatial Transformer with Memory Replay for Multi-future Pedestrian Trajectory Prediction. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, LA, USA, 18–24 June 2022; Volume 1, pp. 2221–2231. [Google Scholar] [CrossRef]
Meng, M.; Wu, Z.; Chen, T.; Cai, X.; Zhou, X.S.; Yang, F.; Shen, D. Forecasting Human Trajectory from Scene History. arXiv 2022, arXiv:2210.08732. [Google Scholar] [CrossRef]
Li, R.; Qiao, T.; Katsigiannis, S.; Zhu, Z.; Shum, H.P.H. Unified Spatial-Temporal Edge-Enhanced Graph Networks for Pedestrian Trajectory Prediction. IEEE Trans. Circuits Syst. Video Technol. 2025, 35, 7047–7060. [Google Scholar] [CrossRef]
Wu, B.; Ma, C.; Poslad, S.; Selviah, D.L. An Adaptive Human Activity-Aided Hand-Held Smartphone-Based Pedestrian Dead Reckoning Positioning System. Remote Sens. 2021, 13, 2137. [Google Scholar] [CrossRef]
Jiang, Y.; Gao, H.; Zhang, P.; Hu, Q. An enhanced mobile localization algorithm integrating multiple AUKF models for mixed indoor environments. Meas. Sci. Technol. 2024, 36, 016317. [Google Scholar] [CrossRef]
Xia, G.; Wang, G. INS/GNSS Tightly-Coupled Integration Using Quaternion-Based AUPF for USV. Sensors 2016, 16, 1215. [Google Scholar] [CrossRef]
Kulikov, G.Y.; Kulikova, M.V. Hyperbolic-SVD-Based Square-Root Unscented Kalman Filters in Continuous-Discrete Target Tracking Scenarios. IEEE Trans. Autom. Control 2022, 67, 366–373. [Google Scholar] [CrossRef]
Masum, H.; Chattopadhyay, S.; Ray, R.; Bhaumik, S. Measurement of Walking Speed from Gait Data using Kurtosis and Skewness based Approximate and Detailed Coefficients. IET Sci. Meas. Technol. 2018, 12, 521–527. [Google Scholar] [CrossRef]
Costa, M.M.; Silva, M.F. A Survey on Path Planning Algorithms for Mobile Robots. In Proceedings of the 2019 IEEE International Conference on Autonomous Robot Systems and Competitions, Porto, Portugal, 24–26 April 2019; pp. 1–7. [Google Scholar]
Zhang, P.; Ouyang, W.; Zhang, P.; Xue, J.; Zheng, N. SR-LSTM: State refinement for lstm towards pedestrian trajectory prediction. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA, 15–20 June 2019; pp. 12077–12086. [Google Scholar] [CrossRef]

Figure 1. Wearable hardware design and physical diagram.

Figure 2. Overview of pedestrian trajectory prediction method.

Figure 3. Pedestrian gait differentiation diagram.

Figure 4. A multi-source attention framework for pedestrian trajectory prediction and planning.

Figure 5. Structure of the gated recurrent unit (GRU).

Figure 6. Schematic diagram of the LSTM structure.

Figure 7. Wearable device placement and RTK equipment. (a) Placement of wearable device. (b) RTK equipment diagram.

Figure 8. 3D simulated pedestrian trajectory. (a) 3D simulated trajectories a. (b) 3D simulated trajectories b.

Figure 9. Comparison of 3D simulated trajectories obtained by UKF, AUKF, and Gait-AUKF algorithms. (a) 3D simulated trajectories a. (b) 3D simulated trajectories b.

Figure 10. Distance errors and empirical CDF in the East-rth-Up directions for UKF, AUKF, and Gait-AUKF algorithms applied to simulated data.

Figure 11. Road scenario for real world data collection.

Figure 12. Comparison of 3D real world trajectories obtained by UKF, AUKF, and Gait-AUKF algorithms.

Figure 13. The distance error and empirical CDF of UKF, AUKF, and Gait AUKF algorithms applied to real data in the northeast upward direction.

Figure 14. Comparison of actual and predicted pedestrian trajectories. (a) Turning path a. (b) Turning path b.

Figure 15. Satellite map of the overpass intersection.

Figure 16. Path planning results for pedestrian overpass crossing. (a) Starting point and ending point facing each other. (b) Starting point and ending point in the same direction.

Figure 17. Model based trajectory localization and prediction results in real intersection scenarios.

Table 1. JY901S parameters.

Unit	Dimensions (Axis)	Dynamic Range	Sensitivity	Word Length
Accelerometer	3	±16 g	0.0005 (g/LSB)	16 bits
Gyroscope	3	±2000°/s	0.061 (°/s)/(LSB)	16 bits
Magnetometer	3	±2 Gaituss	6.67 nT/LSB	16 bits

Table 2. E108-GN04D parameters.

Unit	Cold Start	Hot Start
localization time	28 s	1 s
Sensitivity	−148 dBm	−159 dBm
Accuracy	2 m-CEP, 0.05 m/s-RMS	1.5 m-CEP, 0.05 m/s-RMS

Table 3. Simulation sensor parameter settings.

Simulation Parameters (Unit)	Parameter Settings
SINS Gyroscope Bias (°/s)	0.05
SINS Gyroscope Random Walk (°/ $\sqrt{h}$ )	0.65
SINS Accelerometer Bias (mg)	40
SINS Accelerometer Random Walk (ug/ $\sqrt{Hz}$ )	600
SINS Sampling Rate (Hz)	200
GPS Ranging Error (m)	2.0
GPS Velocity Error (m/s)	0.1
GPS Sampling Rate (Hz)	10

Table 4. Evaluation metrics of different algorithms for simulated data.

Method	Direction	RMSE (m)	MPE (m)
UKF	East	2.3734	2.8428
	North	2.6592	2.9859
	Up	3.2435	7.4907
AUKF	East	2.3259	2.8134
	North	2.4349	2.6039
	Up	3.7291	6.8660
Gait-AUKF	East	2.2368	2.4380
	North	2.0628	2.2753
	Up	3.1792	6.3275

Table 5. Evaluation metrics of different algorithms for real world data.

Method	Direction	RMSE (m)	MPE (m)
UKF	East	2.0396	2.1107
	North	1.1675	1.8332
	Up	6.9158	0.7143
AUKF	East	0.4796	3.1584
	North	0.4148	1.6458
	Up	0.2288	0.7295
Gait-AUKF	East	0.3551	1.7461
	North	0.3058	1.0275
	Up	0.1165	0.3938

Table 6. Training model parameters.

Parameter	Value
Epochs	1000
Batch Size	16
Initial LR (Feature)	$10^{- 3}$
Initial LR (Memory)	$10^{- 4}$
Initial LR (Trajectory)	$10^{- 3}$
Final LR (All Modules)	$10^{- 6}$
Optimizer	ADAM

Table 7. Performance of trajectory prediction of various models in actual road scenarios.

Model	ADE	FDE
ST-MR	0.97	1.20
Transformer	1.16	1.39
SHENet	1.01	1.24
MSF-TrajPlan	0.93	1.16

Table 8. Ablation study in the overpass intersection scenario (unit: meters).

Model	ADE	FDE
Without A* Planning	1.61	2.49
Without Gait-AUKF	1.36	3.74
Without Gait-AUKF and A*	2.92	5.19
With Gait-AUKF and A*	0.93	1.16

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Pang, S.; Wang, Z.; Xu, S.; Long, W.; Pan, R.; Wang, H. A Method for Pedestrian Trajectory Prediction Using INS-GNSS Wearable Devices. Sensors 2026, 26, 1309. https://doi.org/10.3390/s26041309

AMA Style

Pang S, Wang Z, Xu S, Long W, Pan R, Wang H. A Method for Pedestrian Trajectory Prediction Using INS-GNSS Wearable Devices. Sensors. 2026; 26(4):1309. https://doi.org/10.3390/s26041309

Chicago/Turabian Style

Pang, Shengli, Zhe Wang, Shiji Xu, Weichen Long, Ruoyu Pan, and Honggang Wang. 2026. "A Method for Pedestrian Trajectory Prediction Using INS-GNSS Wearable Devices" Sensors 26, no. 4: 1309. https://doi.org/10.3390/s26041309

APA Style

Pang, S., Wang, Z., Xu, S., Long, W., Pan, R., & Wang, H. (2026). A Method for Pedestrian Trajectory Prediction Using INS-GNSS Wearable Devices. Sensors, 26(4), 1309. https://doi.org/10.3390/s26041309

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Method for Pedestrian Trajectory Prediction Using INS-GNSS Wearable Devices

Abstract

1. Introduction

2. Wearable Device Design

3. Methodology

3.1. System Overview

3.2. Pedestrian Dead Reckoning

3.3. Adaptive Unscented Kalman Filter Algorithm

3.4. The Proposed Gait-AUKF Algorithm for Localization

3.5. A Multi-Source Attention Framework for Pedestrian Trajectory Prediction and Planning

3.5.1. Framework Overview

3.5.2. Gated Recurrent Unit Encoding Mechanism

3.5.3. Design of the Attention-Based Addressing Mechanism

3.5.4. Collaboration Between LSTM Decoder Trajectory Prediction and A* Path Planning

4. Results and Discussion

4.1. Gait-AUKF Pedestrian Trajectory Localization Algorithm

4.2. Multi-Source Fusion and Attention-Based Framework for Pedestrian Trajectory Prediction and Planning Decision

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI