Next Article in Journal
REEV SENSE IMUs for Gait Analysis in Stroke: A Clinical Study on Lower Limb Kinematics
Previous Article in Journal
Security for the Internet of Vehicles with Integration of Sensing, Communication, Computing, and Intelligence: A Comprehensive Survey
Previous Article in Special Issue
Application of Improved YOLOv8 Image Model in Urban Manhole Cover Defect Management and Detection: Case Study
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Virtual Observation Using Location-Dependent Statistical Information of Cyclists’ Movement for Estimation of Position and Uncertainty

by
Kento Suzuki
and
Takuma Ito
*
Graduate School of Engineering, The University of Tokyo, 7-3-1 Hongo, Tokyo 113-8656, Japan
*
Author to whom correspondence should be addressed.
Sensors 2025, 25(16), 5122; https://doi.org/10.3390/s25165122
Submission received: 15 July 2025 / Revised: 10 August 2025 / Accepted: 13 August 2025 / Published: 18 August 2025
(This article belongs to the Special Issue Artificial Intelligence and Sensors Technology in Smart Cities)

Abstract

Crossing collisions between cyclists and automobiles around nonsignalized intersections on community roads, where visibility around the intersection is poor due to occlusions caused by house walls, is a social issue related to traffic safety in Japan. Because available observation information for collision prevention is limited on community roads, utilizing the accumulated data is useful to compensate for the lack of observation information. Given these motivations, we propose a movement estimation method of cyclists by combining information from roadside sensors with location-dependent statistical information. First, we develop a method for analyzing the location-dependent statistical information of cyclists on a certain road from accumulated GNSS data using the Kalman smoother. Then, we develop a method for stochastically predicting the movement of cyclists even outside the observation range of a roadside sensor by using the concept of “virtual observation” based on location-dependent statistical information. To evaluate the proposed method, we conduct an experiment to accumulate GNSS data from cyclists using smartphones. As a result of comparison with a conventional method, we confirm that our proposed method can reduce the uncertainty of the estimated position; further, the reduction in the uncertainty will contribute to traffic safety by future advanced driver assistance systems.

1. Introduction

Recently, the number of fatal traffic accidents in Japan has basically been declining [1]. However, the declining trend in the number of fatal traffic accidents on community roads is slow compared with that on non-community roads [2]. Figure 1 shows an example of a nonsignalized intersection on a community road. As shown in Figure 1, visibility around intersections on community roads is limited by narrow roads and building walls. Therefore, the risk of crossing collisions around such intersections is high. In fact, 45% of traffic accidents involving cyclists in Japan occur around intersections [3]. In addition, the velocity of cyclists is relatively high among vulnerable road users (VRUs), and cyclist-related traffic accidents are associated with a higher risk of severe damage. In summary, countermeasures are required to prevent cyclist-related collisions on community roads. Furthermore, this is not a problem specific to Japan, but a problem found in other regions as well. For example, cyclist fatalities are decreasing only slightly in the EU [4], and a high percentage of cyclist fatalities occur at intersections compared to other traffic modes [4]. Additionally, the authors of [5] pointed out that traffic accidents related to cyclists and electric bike riders are a current social problem in China. Thus, the countermeasures are also required in other regions.
To prevent cyclist-related crossing collisions, it is important to grasp the position of the cyclists moving on intersecting roads, as shown in Figure 2. Because current advanced driver assistance systems (ADASs) for commercial vehicles mainly use onboard sensors, they may not work adequately in road environments with occlusions, such as community roads. In such environments, the placement of roadside sensors is an effective approach for reliably preventing traffic accidents. However, the placement of roadside sensors at every intersection is not practical from a cost perspective. In another approach, although using real-time position information provided by portable devices, such as smartphones, is also effective, not all cyclists provide their position information. Thus, the challenge in preventing collisions on community roads is that the available real-time information is limited.
Figure 3 shows a conceptual schematic of the traffic environment assumed in this study. An important point is that roadside sensors are placed sparsely; some intersections have roadside sensors, whereas others do not. More precisely, we assume a situation where a cyclist moves from the first intersection, which has a roadside sensor, to the second intersection, which does not have roadside sensors. Although the distance between intersections varies depending on the town, we assume 100 m as a baseline of the relatively long distance between intersections in Japan. To prevent crossing collisions around the second intersection, it is necessary to estimate the position and uncertainty of the cyclist via a stochastic position estimation method using previously observed information around the first intersection. From the viewpoint of the prevention of crossing collisions, smaller uncertainty of the estimated position is desirable because connected ADASs can behave more adequately using the estimated position with less uncertainty [6]. In the usual stochastic position estimation method, the uncertainty of the position estimation increases if the estimation system lacks observation information. Thus, the challenge in the assumed situation of this study is that the uncertainty is large because of the lack of real-time observations. However, although the movement of cyclists between intersections cannot be observed in the assumed situation, we can roughly estimate their movement based on statistical information. With these motivations, in our previous study [7], we proposed a method for compensating insufficient information of real-time observation by the concept of “virtual observation” based on literature-based statistical information, such as average velocity, and confirmed that this approach could contribute to adequate position estimation by reducing the uncertainty of the position estimation.
However, because literature-based statistical information describes only the average movement, we assumed only static movement without acceleration/deceleration in the previous study. Consequently, the generality of the proposed method was limited. Thus, considering dynamic motion is necessary to extend the method further. Because acceleration/deceleration of cyclists occurs due to the elements of the road environment, such as intersections and crosswalks, such dynamic movements depend on the location. Thus, if we accumulate and analyze the movement data of cyclists related to a certain road, we can estimate location-dependent movement on the road, even under the assumption that cyclists accelerate or decelerate. From the viewpoint of stochastic motion estimation methods, such as the Kalman filter, such dynamic motion information is usually described as a control input. Thus, we call the assumed location-dependent dynamic movement related to acceleration and yawing rate as “virtual control input.” Figure 4 summarizes a conceptual schematic of our newly proposed method in this study.
Given the above motivations, in this study, we propose a method for analyzing such location-dependent statistical information (LDSI) by considering the differences among locations and individuals, as well as a method for predicting the position and uncertainty of cyclists based on LDSI. First, we propose a Kalman-smoother-based method for analyzing movement characteristics using GNSS data which was obtained and accumulated from cyclists’ smartphones. Then, we cluster the data by considering the movement characteristics that depend on the location and individual characteristics. These processes allow us to prepare LDSI from accumulated data. Subsequently, we propose a method for estimating the position and uncertainty of cyclists based on LDSI. The main contributions of this study are as follows:
  • We propose a method for analyzing LDSI from accumulated low-level GNSS data.
  • We propose a method for predicting the position and uncertainty of cyclists via virtual observation and virtual control input based on the LDSI.
In this study, we verified the proposed methods by the simulation using the data obtained in the experiments under the control conditions. In other words, the position estimation method was not verified by the actual experiment in the real-world community roads, but only by the simulation.
The remainder of this paper is organized as follows. Section 2 presents related works on traffic safety technologies. The proposed method is described in detail in Section 3. Section 4 describes the experiments for data accumulation and the analysis results of LDSI preparation. Section 5 describes the simulation experiment of movement prediction using LDSI. Finally, Section 6 concludes the study and describes future work.

2. Related Works

This study aims to develop technologies related to cooperative safety systems to prevent traffic accidents around nonsignalized intersections with occlusions. Thus, Section 2.1 organizes existing studies related to cooperative perception methods. Additionally, cooperative safety systems require the position estimation of other traffic participants, considering the movement characteristics that depend on individual and location characteristics. Therefore, Section 2.2 organizes the existing studies related to the position estimation method considering movement characteristics based on accumulated data. Finally, Section 2.3 organizes existing studies on position estimation using technologies related to virtual observations and virtual control inputs.

2.1. Cooperative Perception Methods for Traffic Safety of Vulnerable Road Users

Estimating the positions of other traffic participants is necessary to prevent traffic accidents in road environments with occlusions. Existing studies related to the position estimation of VRUs are roughly classified into two approaches: methods using roadside units (RSUs) that are equipped with various sensors, and methods using portable devices such as smartphones.
As for the methods using RSUs, various types of sensors have been utilized. Bo-Bo et al. [8] proposed a traffic participant tracking method that uses multiple cameras installed on RSUs. Zhang et al. [9] proposed an improved method for vehicle velocity observation using roadside LiDAR. Zhang et al. [10] installed fisheye cameras in a roundabout and verified their performance in terms of vehicle perception.
As for the methods using portable devices, smartphones have been frequently utilized. Sugimoto et al. [11] developed a collision warning system which utilizes V2P (Vehicle to Pedestrian) communication via a cellular phone and car navigation system. Liu et al. [12] developed Vehicle to Infrastructure (V2I)/Pedestrian to Infrastructure (P2I) communication devices and a collision warning application that uses observation information from a roadside radar. Hussein et al. [13] proposed a collision warning application and evaluated its usability.
Both approaches are useful in preventing traffic accidents. However, because placing many RSUs so densely that all traffic participants can always be observed is costly, this approach is difficult, especially on community roads. Additionally, not all traffic participants are willing to provide their position information from their portable devices. Thus, we need to design a new method based on the assumption that information only from sparsely placed RSU and information from portable devices of a limited number of cyclists can be used. Therefore, we design a new method that prepares LDSI from data which is obtained and accumulated from a limited number of cyclists, and estimates the position of cyclists by combining information from sparsely placed RSUs with LDSI.

2.2. Position Estimation Methods Considering Movement Characteristics Based on Accumulated Data

The movement characteristics of cyclists depend on various factors such as traffic volume, road width, and the existence of crosswalks. Thus, the use of LDSI prepared from accumulated data on each road is effective for predicting the positions of cyclists. Related to data-driven prediction methods, existing studies have proposed two approaches: methods using machine learning (ML), which do not clearly define the motion model, and methods that use motion models with adjusting the model parameters.
As an example of the methods using ML, Li et al. [14] utilized Gaussian process regression models to predict the trajectories of traffic participants which were observed by roadside sensors. Zhou et al. [15] proposed a method using long short-term memory (LSTM) to estimate pedestrian trajectories and collision risk. Deo et al. [16] proposed an LSTM encoder–decoder model that uses convolutional social pooling to predict vehicle trajectories, considering the interaction between vehicles. Liu et al. [17] proposed a Transformer-based method that predicts vehicle trajectories multimodally. Liu’s method used previous trajectories of observed vehicles and map information for prediction.
On the other hand, methods using motion models based on accumulated data contrast with the above ones. Such methods analyzed the accumulated data and then adjusted parameters related to the estimation model, such as the process noise matrix and measurement noise matrix in the extended Kalman filter (EKF). For instance, Boulkroune et al. [18] proposed a tuning method for the process noise matrix and measurement noise matrix of the EKF. They tuned the matrices by minimizing the cost function in the sample data. Jouaber et al. [19] tuned the process noise matrix of the EKF in real-time estimation using a recurrent neural network (RNN). In Jouaber’s method, they trained an RNN-based model, which utilizes the innovation as an input and calculates the process noise matrix, using accumulated trajectories. Additionally, some studies clustered accumulated data and utilized the results for the position estimation and intention prediction of traffic participants. For instance, Jeong et al. [20] collected driving data and clustered the data by velocity. Subsequently, they defined different EKF models for each cluster and utilized them for real-time trajectory prediction. Yi et al. [21] collected vehicles’ trajectory data at a certain intersection and conducted a polynomial regression mixture clustering algorithm. Subsequently, they trained the classification and regression tree algorithm and predicted the drivers’ behavior.
Because the former approach using ML does not explicitly model the motion of the targets, it cannot use the observation information from sensors whose information is not learned by the ML model. Thus, we select the latter approach, which defines the motion model and adjusts its parameters.

2.3. Virtual Observations and Virtual Control Inputs for Kalman Filter

As methods similar to our proposed method in this study, we organize existing studies related to the Kalman filter and virtual observation. Tahk et al. [22], and Alouani et al. [23] proposed methods for estimating the position of airplanes using virtual observations. In their method, movement constraints were utilized as virtual observations. Because the movement of ground vehicles is constrained by the road, virtual observations derived from such constraints have been utilized in some research. Zhou et al. [24] utilized the trajectory shape constraint as a virtual observation. In addition, Zhang et al. [25] proposed a position estimation method that utilizes heading direction constraints derived from road geometry.
Furthermore, some studies have derived virtual observations using ML-based methods. Vargas-Meléndez et al. [26] estimated the roll angle of vehicles by using virtual observation derived by a neural network (NN) module. Kim et al. [27] proposed a method that uses virtual observations derived by an LSTM-based network to estimate the sideslip angle of a vehicle. In the methods above, virtual observations are calculated from actual sensor observations using ML methods.
In addition to virtual observations, some researchers have investigated the utilization of virtual control inputs. Jiang et al. [28] utilized an NN-based module to derive virtual control input and process noise covariance and estimated the position of the aircraft using the EKF. Kim et al. [27] utilized the angular velocity derived by an LSTM-based network as a virtual control input, in addition to virtual observation.
The above methods improve the estimation accuracy by additionally inputting supplemental information, such as virtual observation and virtual control input, only when the methods obtain some observation information. Thus, these methods cannot compensate for the continuous lack of real-time information. Based on this background, in our previous study, we proposed a method that assumes virtual observation information from literature-based statistical information to improve movement estimation [7]. As a result of numerical simulations and experiments in the real world, we confirmed that our previous method adequately estimated the position using virtual observations. However, in our previous study, we used only literature-based statistical information and did not discuss how to prepare data-driven statistical information. In addition, we assumed that the virtual observation information was always constant and did not discuss the locationally adaptive method using LDSI. Thus, in this study, we propose new methods that prepares LDSI and predicts the position of cyclists using LDSI. Furthermore, we discuss a new approach that uses a virtual control input, which was not used in our previous study.

3. Method

3.1. Investigated Situation

In this study, we analyze the movement characteristics of cyclists from the accumulated data of low-level GNSS obtained by smartphones, and then prepare the LDSI. After that, we utilize the LDSI for position estimation. Figure 5 shows a conceptual schematic of the proposed method. The mean and standard deviation of the movement state variables at each location are utilized as the LDSI. LDSI enables us to improve the accuracy of movement prediction of a cyclist observed by an RSU, even if real-time GNSS data of the cyclist cannot be obtained. Here, the important point is that the data from cyclists who agree to provide their GNSS data improves the estimation accuracy for cyclists who do not provide GNSS data.

3.2. Conceptual Design

To use location-dependent information, we must associate the accumulated information with digital map systems. Although many existing studies have proposed various digital map systems, we assume a relatively simple digital map system consisting of nodes and links that correspond to intersections and roads between intersections in the real world. Figure 6 shows a conceptual schematic of the digital map system. In such a digital map system, links contain waypoints that are placed at a certain interval along the road, and the position of each waypoint is expressed by the distance along the road. In this study, because we assume the interval of waypoints to be 1 m, the spatial resolution of the LDSI associated with each waypoint is also 1 m. Additionally, we call the distance as “offset” by referring to the digital map system that we developed in our previous study [29]. Similarly, we call the lateral deviation perpendicular to the road direction as “LD” (lateral deviation).
Figure 7 shows an overview of the proposed method. As shown in Figure 7, the proposed method is divided into two parts: preprocessing and real-time estimation. The preprocessing part analyzes LDSI from accumulated GNSS data, and the real-time estimation part by EKF combines LDSI with roadside sensor observations for movement estimation of cyclists.
In the preprocessing part, we first accumulate GNSS data from portable devices such as smartphones on a certain road. By analyzing the accumulated GNSS data with a Kalman smoother [30], we estimate the movement state of cyclists with their uncertainties. Although the Kalman smoother estimates the time-series movement state, the movement characteristics of cyclists depend on each location. Thus, we spatially resample the estimated state at each waypoint. Subsequently, we cluster the series of resampled states by considering the estimated states and their uncertainties. Finally, we derive the LDSI for each cluster from the resampled states.
In real-time estimation part, the process consists of three components: classification of observed cyclist, derivation of virtual control input and virtual observation, and EKF for position estimation. First, a classification method of the cyclists observed by the roadside sensor for an appropriate cluster, which are predefined in the preprocessing step, is required. However, the main purpose of this study is to propose a method to utilize LDSI. Thus, in this study, we assume an ideally accurate classification method and focus on the following part. As for the assumption of the classification method, we will develop the method which classifies cyclists by comparing the cyclists’ movement states, such as velocity and spatial difference of velocity, estimated by roadside sensor with LDSI in future studies.
Next, the current estimated position is used to obtain the reference waypoint for the derivation of the virtual observation and virtual control input from the LDSI. Here, it is important to note that the current position is estimated with its uncertainty. Therefore, the choice of reference waypoints also requires consideration of uncertainty. Thus, we design a method to derive virtual observations and virtual control inputs, considering such uncertainty. The detailed method is described in Section 3.4.3.
Finally, by integrating the information of the virtual observation and virtual control input with the real roadside sensor’s observation, we estimate the cyclist’s movement state. Here, the LDSI related to the directional angle and velocity are utilized as the virtual observation, and the LDSI related to the yawing rate and acceleration are utilized as virtual control inputs. In addition, the variance of these states in clustered data is utilized as the uncertainty of the prediction and observation of EKF.

3.3. Preparation of LDSI

3.3.1. Movement State Estimation Using Kalman Smoother

As mentioned in the previous section, we utilize the accumulated GNSS data of cyclists in the preprocessing part. Because the position obtained by GNSS contains an error, a method for handling this error is required. The movement state of cyclists are not analyzed in real time, but after accumulation. Thus, we utilize the Kalman smoother, which can be used only in post analysis and can estimate the movement state with smaller uncertainty than the Kalman filter. More precisely, in this study, we utilize a fixed-interval smoothing algorithm [30] for the Kalman smoother. The forward-direction filter is first applied in the fixed-interval smoothing algorithm. Subsequently, a backward-direction filter is applied. The forward-direction filter consists of a normal EKF.
Although the Kalman smoother and EKF are widely used in existing studies, we explain the equations of the Kalman smoother and EKF in the following paragraphs to clarify the state variables and motion models used in this study and their relations. In the usual EKF for estimating a cyclist’s movement state, the planar positions, directional angle, and velocity are often expressed as state variables. On the other hand, yawing rate and acceleration are often utilized as control inputs. However, the analysis for LDSI needs to estimate the yawing rate and acceleration in addition to other variables. Thus, in the analysis using the Kalman smoother, the yawing rate and acceleration are also expressed as state variables instead of control inputs, and the state variable x S and estimation covariance matrix P S are as follows:
x S = x y θ v ω a T
P S = σ x 2 σ x y σ x θ σ x v σ x ω σ x a σ x y σ y 2 σ y θ σ y v σ y ω σ y a σ x θ σ y θ σ θ 2 σ θ v σ θ ω σ θ a σ x v σ y v σ θ v σ v 2 σ v ω σ v a σ x ω σ y ω σ θ ω σ v ω σ ω 2 σ ω a σ x a σ y a σ θ a σ v a σ ω a σ a 2
where x and y denote the planar positions in the Cartesian coordinate system based on the East and North directions, θ denotes the directional angle, v denotes the velocity, ω denotes the yawing rate, and a denotes the acceleration.
The EKF equations in the forward-direction filter are expressed as follows:
x S t k = f S ( x S t k 1 , n S t k 1 )
z S t k = h S ( x S t k )
where t k denotes the discretized time, n S denotes the noise, f S denotes the state transition function, z S denotes the observation, and h S denotes the observation function. The EKF consists of two parts: prediction and update. As for the prediction of EKF, the formulas are expressed as follows:
x ^ S t k t k 1 = f S x ^ S t k 1 t k 1 , 0
P S t k t k 1 = F S t k 1 P S t k 1 t k 1 F S T t k 1 + Γ S t k 1 Q S Γ S T t k 1
where x ^ S denotes the filtered state, F S denotes the matrix of the partial derivatives of f S by x S , Γ S denotes the matrix of the partial derivatives of f S by n S , and Q S denotes the process noise matrix. As for the update of EKF, the formulas are expressed as follows:
S S t k t k 1 = H S P S t k t k 1 H S T + R S
K S t k = P S t k t k 1 H S T S S 1 t k t k 1
x ^ S t k t k = x ^ S t k t k 1 + K S t k z S t k h S x ^ S t k t k 1
P S t k t k = I K S t k H S P S t k t k 1   I K S t k H S T + K S t k R S K S T t k
where S S denotes the system uncertainty matrix, R S denotes the measurement noise matrix, K S denotes the Kalman gain, and H S denotes the matrix of the partial derivatives of h S by x S . Here, we assume that the noise n S is applied to the yawing rate and acceleration. Therefore, the state transition function f S is expressed as follows:
f S x S t k 1 , n S t k 1 = x t k 1 + v t k 1 t cos θ t k 1 + 1 2 a t k 1 t 2 cos θ t k 1 + 1 2 a n o i s e t 2 cos θ t k 1 y t k 1 + v t k 1 t sin θ t k 1 + 1 2 a t k 1 t 2 sin θ t k 1 + 1 2 a n o i s e t 2 sin θ t k 1 θ t k 1 + ω t k 1 t + ω n o i s e t v t k 1 + a t k 1 t + a n o i s e t ω t k 1 + ω n o i s e a t k 1 + a n o i s e
where t denotes the time step between the predictions. Moreover, the noise n S and the process noise matrix Q S are as follows:
n S = ω n o i s e a n o i s e T
Q S = σ ω _ n o i s e 2 0 0 σ a _ n o i s e 2
where ω n o i s e denotes the noise of the yawing rate, a n o i s e denotes the noise of acceleration, σ ω _ n o i s e denotes the standard deviation of ω n o i s e , and σ a _ n o i s e denotes the standard deviation of a n o i s e . From the above discussions, F S and Γ S are derived as follows:
F S t k 1 = f S x S , n S x S | x S = x S t k 1 , n S = 0 = 1 0 v t k 1 t sin θ t k 1 1 2 a t k 1 t 2 sin θ t k 1 t cos θ t k 1 0 1 2 t 2 cos θ t k 1 0 1 v t k 1 t cos θ t k 1 + 1 2 a t k 1 t 2 cos θ t k 1 t sin θ t k 1 0 1 2 t 2 sin θ t k 1 0 0 1 0 t 0 0 0 0 1 0 t 0 0 0 0 1 0 0 0 0 0 0 1
Γ S t k 1 = f S x S , n S n S | x S = x S t k 1 = 0 1 2 t 2 cos θ t k 1 0 1 2 t 2 sin θ t k 1 t 0 0 t 1 0 0 1
As for Q S , we derive the parameters referring to the assumed maximum yawing rate and maximum acceleration. We assume that the maximum yawing rate is achieved when cyclists change their direction at a small intersection with an average velocity. In addition, we assume that maximum acceleration is achieved when the cyclists start moving. We derive these values from the road structure and cyclists’ general movements in Japan. From the standard value of corner cut length at low traffic intersections 3.0 m [31], and cyclists’ average velocity 15 km/h [32], the maximum yawing rate is calculated as follows:
ω m a x = π 2 ÷ 3.0 × π 2 15   ×   1000 3600 = 1.38   r a d / s
From the maximum yawing rate above and the maximum acceleration of 1.95 m/s [33], Q S is determined as follows:
Q S = ω m a x 2 2 0 0 a m a x 2 2 = 0.7 2 0 0 1.0 2
The remaining undetermined variables in the forward-direction filtering are z S , H S , and R S . Because these variables depend on the sensor characteristics, we consider them in Section 3.3.2.
After the forward-direction filtering, the backward filtering is conducted. Similarly to the forward-direction filtering, the backward filtering consists of two parts: prediction and update. The prediction part is expressed as follows:
P S t k + 1 t k = F S t k P S t k t k F S T t k + Γ S t k Q S Γ S T t k
where P S denotes the covariance matrix of smoothed state x ^ S . As for the update part, the formula is expressed as follows:
K S t k = P S t k t k F S T t k P S 1 t k + 1 t k
x ^ S t k t k = x ^ S t k t k + K S t k x ^ S t k + 1 t k + 1 f S x ^ S t k t k , 0
P S t k t k = P S t k t k + K S t k P S t k + 1 t k + 1 P S t k + 1 t k K S T t k
where K S denotes the smoother gain.

3.3.2. Implementation of Observation

In this study, GNSS data are mainly assumed to be a candidate for accumulated data. However, observations by other sensors, such as roadside LiDAR, can be utilized when available. Because not all types of sensors can directly observe direction and velocity, we alternatively calculate the direction and velocity from the position difference. Thus, we utilize the planar positions, directional angle, and velocity as sensor observations for the Kalman smoother. More precisely, z S , H S , and R S are expressed as follows:
θ o b s , S t = a t a n 2 ( y o b s , S t + N S t o b s , S 2 y o b s , S t N S t o b s , S 2 ,   x o b s , S t + N S t o b s , S 2 x o b s , S t N S t o b s , S 2 )
v o b s , S t = x o b s , S t + N S t o b s , S 2 x o b s , S t N S t o b s , S 2 2 + y o b s , S t + N S t o b s , S 2 y o b s , S t N S t o b s , S 2 2 N S t o b s , S
z S = x o b s , S y o b s , S θ o b s , S v o b s , S T
H S = 1 0 0 0 0 0 0 1 0 0 0 0 0 0 1 0 0 0 0 0 0 1 0 0
R S = σ p o s , S 2 0 0 0 0 σ p o s , S 2 0 0 0 0 σ θ , S 2 0 0 0 0 σ v , S 2
where x o b s , S and y o b s , S denote the positions, θ o b s , S denotes the directional angle, v o b s , S denotes the velocity, t o b s , S denotes the observation interval of the sensor, and N S denotes the number of time steps for stabilizing the calculation of the directional angle and the velocity. Here, because the directional angle and velocity are derived from the position difference, σ θ , S and σ v , S depend on N S , the accuracy of the position observation, and the cyclists’ actual velocity. Thus, these parameters are determined in Section 4.2.

3.3.3. Clustering of Smoothed Data

Similarly to the variation in movement characteristics depending on the location, movement characteristics also vary among individuals. In the assumption of this study, individual optimization of movement characteristic parameters is impossible because the target cyclists who do not provide movement data are different from the cyclists who provide their movement data. However, if a cyclist observed by roadside sensors can be classified in the group whose characteristics are similar to those of the observed cyclist, the proposed system can select relatively adequate parameters. Thus, we cluster the smoothed data that have similar movement characteristics and derive the LDSI of each cluster.
Considering the above discussion, we designed a clustering method. As mentioned earlier, we first spatially resample the time-series smoothed data to the data linked with waypoints that have 1 m intervals between each other. Then, we cluster the resampled estimated data into multiple clusters. Because the movement data estimated by the Kalman smoother contain the uncertainty of each variable, consideration of the uncertainty is necessary for selecting the clustering method. Thus, we utilize the Wasserstein distance [34] as the metric between multiple data for clustering. More precisely, we select the 2-Wasserstein distance. The 2-Wasserstein distance between multivariate normal distributions P W and Q W , whose means are μ P _ W and μ Q _ W , and covariance matrices are Σ P _ W and Σ Q _ W , respectively, is expressed as follows [34]:
D W 2 P W ,   Q W = μ P _ W μ Q _ W 2 + t r a c e Σ P _ W + t r a c e Σ Q _ W 2 t r a c e Σ P _ W Σ Q _ W Σ P _ W 1 2
In this study, we calculate the 2-Wasserstein distance of the distribution P W and Q W of estimated movement state variables on each waypoint between the different data. Here, the above one indicates only the distance on the same waypoint of two different data. Thus, we then calculate the average 2-Wasserstein distance for all waypoints between each pair of data, and use the average distance for the clustering. As for the clustering algorithm, we use hierarchical clustering. In addition, from the viewpoint of the calculation method for the distance between clusters, we use unweighted average linkage clustering.
To calculate the Wasserstein distance, we must consider the variables used for clustering. From the viewpoint of collision prevention applications, it is important to estimate the offset, which is the position of the cyclist along the road. At this point, the velocity is directly useful for estimating the offset. In contrast, because the directional angle and yawing rate of cyclists differ little on a straight road, we omitted these variables for the clustering criteria. Additionally, because we confirmed that acceleration did not work well for clustering in the preliminary analysis, we also omitted acceleration. Consequently, we calculate the Wasserstein distance based only on the estimated velocity.
After the calculation of the distance between each pair of data, we divide all the data into a predetermined number of clusters based on the dendrogram. In each cluster, we prepare LDSI by calculating the mean values and standard deviations of the estimated variables at each waypoint.

3.4. Movement States Estimation Using LDSI

3.4.1. Real-Time Movement States Estimation Using Kalman Filter

In the real-time estimation of the movement state using the EKF, we define the state variables x F , the estimation covariance matrix P F , and the control input u F as follows:
x F = x y θ v T
P F = σ x 2 σ x y σ x θ σ x v σ x y σ y 2 σ y θ σ y v σ x θ σ y θ σ θ 2 σ θ v σ x v σ y v σ θ v σ v 2
u F = ω a T
The equations for the EKF are as follows:
x F t k = f F ( x F t k 1 , u F t k 1 )
z F t k = h F ( x F t k )
where f F denotes the state transition function, and h F denotes the observation function. The prediction part is expressed as follows:
x ^ F t k t k 1 = f F x ^ F t k 1 t k 1 , u F t k 1
P F t k t k 1 = F F t k 1 P F t k 1 t k 1 F F T t k 1 + B F t k 1 Q F t k 1 B F T t k 1
where x ^ F denotes the filtered state, F F denotes the matrix of the partial derivatives of f F by x F , B F denotes the matrix of the partial derivatives of f F by u F , and Q F denotes the process noise matrix. The update part is expressed as follows:
S F t k t k 1 = H F P F t k t k 1 H F T + R F t k
K F t k = P F t k t k 1 H F T S F 1 t k t k 1
x ^ F t k t k = x ^ F t k t k 1 + K F t k z F t k h F x ^ F t k t k 1
P F t k t k = I K F t k H F P F t k t k 1 I K F t k H F T + K F t k R F t k K F T t k
where S F denotes the system uncertainty matrix, H F denotes the matrix of the partial derivatives of h F by x F , R F denotes the measurement noise matrix, and K F denotes the Kalman gain.
Unlike the Kalman smoother in the preprocessing part, we assume that only the uncertainty of the control input affects the estimation covariance matrix P F in real-time movement state estimation using the Kalman filter. Thus, f F is expressed as follows:
f F x F t k 1 , u F t k 1 = x t k 1 + v t k 1 t cos θ t k 1 + 1 2 a t k 1 t 2 cos θ t k 1 y t k 1 + v t k 1 t sin θ t k 1 + 1 2 a t k 1 t 2 sin θ t k 1 θ t k 1 + ω t k 1 t v t k 1 + a t k 1 t
where t denotes the time step between the predictions. Moreover, the process noise matrix Q F is as follows:
Q F t k 1 = σ ω 2 t k 1 0 0 σ a 2 t k 1
From the above discussion, F F and B F are derived as follows:
F F t k 1 = f F x F , u F x F | x F = x F t k 1 , u F = u F t k 1 = 1 0 v t k 1 t sin θ t k 1 1 2 a t k 1 t 2 sin θ t k 1 t cos θ t k 1 0 1 v t k 1 t cos θ t k 1 + 1 2 a t k 1 t 2 cos θ t k 1 t sin θ t k 1 0 0 1 0 0 0 0 1
B F t k 1 = f F x F , u F u F | x F = x F t k 1 = 0 1 2 t 2 cos θ t k 1 0 1 2 t 2 sin θ t k 1 t 0 0 t
Here, we first assume that the control input u F is 0, and the process noise Q F has the same value in the preprocessing part when there is no available information. When the virtual observation is used, virtual control input u v and corresponding process noise Q v based on LDSI are assigned to u F and Q F , respectively. The specific calculation method is described in Section 3.4.3.
The remaining undetermined variables are z F , H F , and R F . Because we utilize the virtual observation to compensate for the lack of actual sensor observation, the virtual observation is utilized only when actual sensor observation is not available. Thus, if actual sensor observation is available, corresponding values of actual sensors are assigned to z F , H F , and R F . Otherwise, virtual observation z v , H v , and R v based on LDSI are assigned to z F , H F , and R F . Additionally, because these variables depend on the sensor characteristics, the variables of the actual sensor observation and virtual observation are different from each other. Thus, we determine the variables of the actual sensor observation and the virtual observation in Section 3.4.2 and Section 3.4.3, respectively.

3.4.2. Implementation of Actual Sensor Observation

In existing studies, various types of sensors, such as cameras, LiDARs, and radars, have been utilized as roadside sensors. Because not all types of sensors can directly observe directional angle and velocity, we assume to calculate the directional angle and velocity from the position difference when the sensors do not provide such information. Thus, similar to the observation in the preprocessing part, we utilize the position, directional angle, and velocity as sensor observations. More precisely, z F , H F , and R F . are expressed as follows:
θ o b s , F t = a t a n 2 y o b s , F t y o b s , F t N F t o b s , F , x o b s , F t x o b s , F t N F t o b s , F
v o b s , F t = x o b s , F t x o b s , F t N F t o b s , F 2 + y o b s , F t y o b s , F t N F t o b s , F 2 N F t o b s , F
z F = x o b s , F y o b s , F θ o b s , F v o b s , F T
H F = 1 0 0 0 0 1 0 0 0 0 1 0 0 0 0 1
R F = σ p o s , F 2 0 0 0 0 σ p o s , F 2 0 0 0 0 σ θ , F 2 0 0 0 0 σ v , F 2
where t o b s , F denotes the observation interval of a sensor, and N F denotes the number of time steps for stabilizing the calculation of the directional angle and velocity. Here, t o b s , F and N F are different from the corresponding values in the preprocessing part. On the contrary, as discussed in Section 3.3.2, σ θ , F and σ v , F depend on N F , the accuracy of the position observation, and the cyclists’ actual velocity. Thus, these parameters are determined in Section 4.3.

3.4.3. Implementation of Virtual Control Input and Virtual Observation

Because both the virtual observation and virtual control input are calculated from the LDSI that is associated with waypoints, the estimated current position is required for referring to the LDSI. However, the coordinate system of the EKF is different from that of the LDSI; the EKF is based on the Cartesian coordinate system, and the LDSI is based on the Offset-LD coordinate system, as shown in Figure 6. Thus, the transformation of coordinate systems is necessary. Figure 8 shows the pipeline of movement estimation related to the transformation of the coordinate systems. The transformation of the coordinate system is expressed as follows:
M ψ = cos ψ sin ψ sin ψ cos ψ
o ^ l ^ = M ψ 1 x ^ x C y ^ y C + o C l C
where ψ denotes the angle from the x-axis to the offset-axis with counterclockwise rotation as positive, M ψ denotes the rotation matrix, x C and y C denote the planar positions of the closest waypoint from the current position, o ^ and l ^ denote the estimated offset and LD in the Offset-LD coordinate system, and o C and l C denote the offset and LD of the closest waypoint from the current position, respectively. The transformation of the coordinate system for the estimation covariance matrix is expressed as follows:
σ o 2 σ o l σ o l σ l 2 = M ψ 1 σ x 2 σ x y σ x y σ y 2 M ψ
where σ o and σ l denote the estimation uncertainties of the offset and LD positions, respectively, and σ o l denotes the covariance of the offset and LD positions.
In addition, we must consider the uncertainty of the estimated current position. Thus, we calculate the probability of the current position of the cyclist for each waypoint based on the estimated movement state. Based on the probability, we calculate stochastically weighted location-dependent statistical information (SWLDSI). Figure 9 shows a conceptual schematic of this process.
First, the existence probability α i of a cyclist for the i-th waypoint at time step t is calculated from the probability distribution function of the position of a cyclist along the road. To be more detailed, α i t is approximately calculated as follows:
α i t o i 0.5 o i + 0.5 1 2 π σ o 2 t exp o o ^ t 2 2 σ o 2 t d o
where o i denotes offset of the i-th waypoint, o ^ t denotes the estimated offset at time step t along the road, and σ o t denotes its uncertainty.
Next, we need to design a method to calculate SWLDSI based on the existence probability for each waypoint. The LDSI is linked with each waypoint and consists of the mean values and standard deviations of the movement state. Thus, we calculate the SWLDSI by weighting the LDSI with the above existence probability α i . To be more detailed, we assume weighted average of probability distribution of LDSI related to velocity v at each way point as probability distribution p S W v ,   t of SWLDSI, as shown in Equation (52).
p S W v , t i = 1 n α i t 1 2 π σ ~ L D S I v , i 2 exp v μ ~ L D S I v , i 2 2 σ ~ L D S I v , i 2
where n denotes the number of waypoint, and μ ~ L D S I v , i and σ ~ L D S I v , i denote the LDSI at the i-th waypoint. Here, we assume probability distribution of LDSI at each way point as the normal distribution. In practice, the mean μ ~ v t and the standard deviation σ ~ v t of the SWLDSI at time step t are calculated as follows:
μ ~ v t = i = 1 n α i t μ ~ L D S I v , i
σ ~ v 2 t = i = 1 n α i t σ ~ L D S I v , i 2 + i = 1 n α i t μ ~ L D S I v , i 2 i = 1 n α i t μ ~ L D S I v , i 2
The detailed derivations are given in the Appendix A. From Equation (53), μ ~ v is derived by the weighted average of μ ~ L D S I v , i . Additionally, from Equation (54), σ ~ v 2 is derived by the summation of the weighted average of σ ~ L D S I v , i 2 and the variance of μ ~ L D S I v , i . Similarly to velocity, the SWLDSI of the directional angle, yawing rate, and acceleration are calculated as follows:
μ ~ θ t = i = 1 n α i t μ ~ L D S I θ , i
σ ~ θ 2 t = i = 1 n α i t σ ~ L D S I θ , i 2 + i = 1 n α i t μ ~ L D S I θ , i 2 i = 1 n α i t μ ~ L D S I θ , i 2
μ ~ ω t = i = 1 n α i t μ ~ L D S I ω , i
σ ~ ω 2 t = i = 1 n α i t σ ~ L D S I ω , i 2 + i = 1 n α i t μ ~ L D S I ω , i 2 i = 1 n α i t μ ~ L D S I ω , i 2
μ ~ a t = i = 1 n α i t μ ~ L D S I a , i
σ ~ a 2 t = i = 1 n α i t σ ~ L D S I a , i 2 + i = 1 n α i t μ ~ L D S I a , i 2 i = 1 n α i t μ ~ L D S I a , i 2
Owing to the above SWLDSI, even if the current estimated position contains large uncertainty and is located around the area where LDSI changes greatly, we can adequately reflect the uncertainty of the estimated movement state.
Additionally, to stably estimate moving states that contain relatively outliers among the clusters, we introduce safety factors s f Q and s f R to the process noise and observation noise, respectively. SWLDSI with safety factors is expressed as follows:
σ ~ θ t k = 1 + s f R σ ~ θ t k
σ ~ v t k = 1 + s f R σ ~ v t k
σ ~ ω t k = 1 + s f Q σ ~ ω t k
σ ~ a t k = 1 + s f Q σ ~ a t k
s f Q and s f R are not the analyzed parameters but the design parameters, and we assigned 0.3 to these parameters in this study.
SWLDSI is utilized for the calculation of the virtual control input u v , process noise Q v , virtual observation z v , and observation noise R v , respectively. Figure 10 shows a conceptual schematic of the calculation process. The virtual control input u v and process noise Q v at time step t k 1 are calculated from SWLDSI based on the results of the EKF update at time step t k 1 , and used for the prediction of the EKF from time step t k 1 . The virtual observation z v and observation noise R v are calculated from SWLDSI based on the latest results of the EKF prediction. The movement state at time step t k are updated using the above virtual observations and virtual control inputs.
As for the virtual control input u v , virtual observation z v , and process noise Q v , SWLDSI is directly applied to EKF. In other words, each value for EKF is expressed as follows:
u v t k 1 = μ ~ ω t k 1 μ ~ a t k 1 T
Q v t k 1 = σ ~ ω 2 t k 1 0 0 σ ~ a 2 t k 1
z v t k = μ ~ θ t k μ ~ v t k T
H v = 0 0 1 0 0 0 0 1
As for the observation noise R v , we need to consider the time-series effect of R v . In general, if the EKF process is repeated, the element of the estimation covariance matrix P becomes smaller than the corresponding value of the observation noise R v . Thus, to avoid making the estimation covariance matrix P excessively small, the observation noise R v must be adjusted so that the estimation covariance matrix P via repeated virtual observations converges to SWLDSI, similar to our previous study [7]. Here, the uncertainty of the directional angle and the velocity are not affected by the position uncertainty. Thus we calculate the modified R v by solving the algebraic Riccati Equations [35] for the directional angle and velocity. R v is expressed as follows:
σ θ _ v t k = σ ~ θ 4 t k + σ ~ ω 2 t k 1 t 2 σ ~ θ 2 t k σ ~ ω t k 1 t
σ v _ v t k = σ ~ v 4 t k + σ ~ a 2 t k 1 t 2 σ ~ v 2 t k σ ~ a t k 1 t
R v t k = σ θ _ v 2 t k 0 0 σ v _ v 2 t k
where t denotes the observation cycle time. Here, we need to design an adequate cycle time for the prediction of the EKF, which is also equivalent to the observation cycle. Because the values of the virtual control input u v and process noise Q v are estimated from the accumulated GNSS data, these values are based on the assumption of constant acceleration and constant yawing motion within the observation cycle time of GNSS. Thus, it is desirable to match the cycle time of the EKF prediction, which uses virtual control input u v and process noise Q v , to the GNSS observation cycle time. Based on the above consideration, we determine the cycle time for virtual observation and virtual control input as 1.0 s, which is equivalent to the cycle time of GNSS used in the experiment. Using variables derived by the above discussion, we estimate the cyclists’ movement states with the virtual control input and virtual observation when actual sensor observation is not available, as mentioned in Section 3.4.1.

4. Experiment for Preparing LDSI

4.1. Experimental Setup

We conducted an experiment to accumulate GNSS data in the Hongo campus of the University of Tokyo. Although the roads in the campus of the university are not actual community roads, the traffic characteristics of them are similar to those of community roads; pedestrians and cyclists move, and the visibility around nonsignalized intersections is bad. Thus, we considered the road in the university campus as pseudo community roads, and we conducted the experiment in the university campus. The study was conducted in accordance with the Declaration of Helsinki, and the protocol for this study was approved by Research Ethics Committee, School of Engineering, The University of Tokyo (KE24-52) on 21 October 2024. We recruited 10 experimental participants with ages ranged from twenties to thirties. We obtained written informed consent from the participants. The participants rode on a bicycle and moved along the experimental course three times. Figure 11 shows the appearances of the experimental course [36,37]. The course length was approximately 840 m. During the experiment, a smartphone was attached to the bicycle, and it accumulated GNSS data. Although we recorded 30 data in total, one participant took the wrong course once. Thus, we finally obtained a total of 29 valid data.

4.2. Parameters Related to GNSS Observation for Kalman Smoother

The observation frequency of GNSS was 1 Hz. Because we used a typical smartphone to obtain GNSS data, the observation accuracy of the GNSS was not very high. Thus, we assumed that σ p o s ,   S for the GNSS was 4.25 m. In addition, we calculated the directional angle and velocity via Equations (22) and (23). As for the σ θ ,   S and σ v ,   S , we determined the values from the preliminary numerical simulation. More precisely, we added noise, whose average was zero and the standard deviation was σ p o s ,   S , to the ground truth value of the position on the assumption that the average velocity of cyclists was 4.2 m/s. Then, we calculated the distribution of θ and v , and determined the N S , σ θ ,   S , and σ v ,   S as follows:
N S = 2
σ θ , S = 0.88   r a d
σ v , S = 2.8   m / s
As a result, R S was expressed as follows:
R S = σ p o s , S 2 0 0 0 0 σ p o s , S 2 0 0 0 0 σ θ , S 2 0 0 0 0 σ v , S 2 = 4.25 2 0 0 0 0 4.25 2 0 0 0 0 0.88 2 0 0 0 0 2.8 2

4.3. Clustering Result

We analyzed the movement characteristics of the cyclists using the accumulated GNSS data. We focused on the movement data on a straight road from the start point to intersection 2, as shown in Figure 11.
Figure 12, Figure 13, Figure 14 and Figure 15 show the directional angle, velocity, yawing rate, and acceleration, respectively, estimated by the Kalman smoother. The horizontal axis in each figure indicates the offset, which is the distance from the start point along the road. In addition, a directional angle of 0 rad indicates the East direction.
Figure 13 indicates that the estimated velocity in almost all trials decreased around stop signs 1 and 2, which corresponds to the areas whose offset are approximately 175 m and 315 m, respectively. Additionally, Figure 13 indicates that the estimated velocity in some trials decreased around intersection 1 and the crosswalk, which corresponds to the areas whose offset are approximately 60 m and 270 m, respectively. Such location-dependent movement characteristics were also confirmed for the other state variables. Thus, the LDSI can be calculated from the estimated values.
Next, we clustered the accumulated data based on the estimated velocities. Figure 16 shows the dendrogram of the clustering results. As an initial step in the approach proposed in this study, we divided the obtained data into three groups based on the dendrogram. The top, middle, and bottom graphs in Figure 17 show the estimated velocity in each cluster, average velocity with 2 σ range in each cluster, and standard deviation of the velocity in each cluster, respectively. The data in Cluster 1 tended to show a large velocity change. The data in Cluster 2 tended to show high velocity. The data in Cluster 3 tended to show low velocity. Additionally, the standard deviation in each cluster was smaller than that of all the data without clustering. Therefore, the clustering process enables us to understand the movement characteristics in more detail.
Based on the above clustering, we also classified the directional angle data, yawing rate data, and acceleration data. Figure 18, Figure 19 and Figure 20 show the results. We calculated the LDSI related to the virtual observation and virtual control input based on the distribution of these variables, and utilized the LDSI for position estimation in the following section.

5. Simulation Experiment of Position Estimation Using LDSI

5.1. Data for Simulation Experiment

In this simulation experiment, we used the same data explained in Section 4. Thus, we used the values estimated by the Kalman smoother as the ground truth for position estimation. To be more precise, we linearly interpolated the estimated values, which were estimated using the Kalman smoother at 1 Hz, to values at 10 Hz, and used the interpolated values as the ground truth.
As shown in Figure 4, the observation of the cyclist by the roadside sensor at the first intersection enables our proposed system to continue estimating the position even after the cyclist moves out from the observation range of the roadside sensor. To confirm the effectiveness of this function, we set the observation range of the virtual roadside sensor, and utilized the ground truth values of the position as observation values by the roadside sensor. Additionally, although the classification function of the observed cyclist into a predetermined cluster corresponding to the LDSI was necessary to realize the proposed system, the main purpose of this study was to confirm the effectiveness of using LDSI. Thus, we assumed an ideal classification function and evaluated the effectiveness of the position estimation using LDSI.

5.2. Parameters of Roadside Sensor Observation

We assumed LiDAR as the roadside sensor for this simulation. Although LiDAR obtains point clouds with precise accuracy, the clustering process for detecting traffic participants produces positioning errors to a certain extent. Thus, we assumed the σ p o s ,   F of LiDAR observation as 0.1 m. In addition, we set the observation range as a circular area with a radius of 10 m, and observation cycle as 10 Hz. As for the σ θ ,   F and σ v ,   F , we preliminarily determined the values from the numerical simulation, similar to the case in Section 4.2. More precisely, we added noise, whose average was zero and standard deviation was σ p o s ,   F , to the ground truth value of the position on the assumption that the average velocity of cyclists was 4.2 m/s. Then, we calculate the distribution of θ and v , and determined the N F , σ θ ,   F , and σ v ,   F as follows:
N F = 5
σ θ ,   F = 0.067   r a d
σ v ,   F = 0.28   m / s
The above values are different from those in the Kalman smoother because the observation cycle and the observation accuracy are different from those of the GNSS in the smartphone. As a result, R F was expressed as follows:
R F = σ p o s , F 2 0 0 0 0 σ p o s , F 2 0 0 0 0 σ θ , F 2 0 0 0 0 σ v , F 2 = 0.1 2 0 0 0 0 0.1 2 0 0 0 0 0.067 2 0 0 0 0 0.28 2

5.3. Conventional Method as a Baseline for Comparison

Although comparisons with existing methods are necessary for evaluating the proposed method in this study, only a limited number of studies were conducted in a similar situation to this study. Among the existing studies, a method that utilizes virtual observation derived from literature-based statistical information [7] was the most related and competitive to this study. Thus, we set up a conventional virtual observation method using literature-based statistical information as the baseline for comparison. Literature-based statistical information provides an average velocity that is universal for various roads and does not provide location-dependent information related to the yawing rate and acceleration. Thus, the conventional method does not use the virtual control input. Additionally, the values of the process noise matrix Q were the same as those used for the Kalman smoother. Figure 21 shows a comparison between the conventional method used in our previous study and the method proposed in this study.
As for the velocity of the conventional virtual observation, we calculated the average value and standard deviation based on the literature. Reference [32] describes that the average velocity of cyclists in Japan is approximated as 15 km/h, and the distribution of cyclists’ velocity ranges from 2.8 m/s to 6.9 m/s. Because this is not symmetrical distribution, we approximated this distribution as a normal distribution whose average velocity is 4.2 m/s and 95% confidence interval ranges from 1.5 m/s to 6.9 m/s. Consequently, the values related to the velocity of the conventional virtual observation were calculated as follows:
μ ~ v = 4.2   m / s
σ ~ v = 1.4   m / s
As for the directional angle, there is no literature-based information. Thus, we assumed that 95% of the directional angle was distributed within a range of ±15 deg. As a result, the values related to the directional angle of the virtual observation are calculated as follows:
μ ~ θ = θ l i n k
σ ~ θ = 0.13   r a d
where θ l i n k denotes the directional angle of the link. Based on the above values, we set z v as follows:
z v = μ ~ θ t k μ ~ v t k T = θ l i n k 4.2 T
Additionally, we calculated the observation noise matrix R v for the conventional method based on Equations (69)–(71) with σ ~ v and σ ~ θ . The conventional method estimates the position of cyclists in an observation cycle of 1.0 Hz using virtual observation.

5.4. Results and Discussions

5.4.1. Position Estimation Results for Representative Cases

Figure 22, Figure 23, Figure 24 and Figure 25 show the estimation results of data 22 by the conventional method, those by the proposed method, the estimation results of data 11 by the conventional method, and those by the proposed method, respectively. In each figure, the left graph shows the estimated variables, 95% confidence interval, and ground truth values. The orange areas around 0.0 s indicate the duration during which the roadside sensor observed the cyclists. Red lines indicate the estimated variables, and red areas indicate 95% confidence intervals. Blue lines indicate ground truth values. The green dots indicate the ground truth values that were outside the 95% confidence interval. The graph on the right shows the time-series uncertainty related to each variable. As for the characteristics of the data, data 22 branched out in the relatively lower area of the dendrogram shown in Figure 16, indicating that the data of this trial was located in the relatively central area of the cluster. In contrast, data 11 branched out in the relatively upper area of the dendrogram, indicating that the data of this trial was located in the relatively outlier area of the cluster.
In the results of the conventional method using literature-based statistical information, as shown in Figure 22, the estimated values of the directional angle and velocity converged to the average values that were used for virtual observation. In addition, their uncertainty also converged to the uncertainty of the virtual observation.
In contrast, in the result of the proposed method using LDSI, as shown in Figure 23, the estimated velocity changed at each location. Additionally, the uncertainty of the estimated directional angle and velocity changed depending on the uncertainty of the LDSI. However, because the uncertainty of position estimation affected the uncertainty of the virtual observation, as described in Section 3.4.3, the uncertainty of the other estimated variables increased according to the increase in the uncertainty of the position estimation because of the long time since the roadside observation. Furthermore, in the case where the ground truth of velocity was outside the 95% confidence interval, as shown in Figure 25, the ground truth of position was also outside the 95% confidence interval. Because data 11 was located in the relatively outlier area of the cluster, the ground truth values might be outside the 95% confidence interval.

5.4.2. Evaluation Results for All Cases

Because the final goal of this study is to prevent crossing collisions, estimation of the offset, which directly contributes to the collision prediction, is most important. From this viewpoint, although the directional angle and LD are not directly related to the offset, velocity is directly related to the offset. Therefore, in the following analyses, we focus only on velocity and offset.
Table 1 and Table 2 show the average ratios of the ground truth velocity and offset within the 95% confidence interval, respectively. First, we calculated the ratio of time-series variables inside the confidence interval from the time when the cyclist exited the observation range of the roadside sensor to the time when the cyclist arrived at the point 100 m away from the center of intersection 1 in each trial. We then calculated the average ratio from all the results. As shown in Table 1 and Table 2, although the ratio of the conventional method was better than that of the proposed method, appropriateness of the estimated values by the conventional method was not the reason; rather, the large uncertainty of the conventional method appeared to be the reason for this result.
Figure 26 and Figure 27 show the comparison of the error of each variable between the ground truth values and estimated values when the cyclist reached a point 100 m ahead of the center of intersection 1. Here, e v and e o indicate the errors in velocity and offset, respectively. From Figure 26 and Figure 27, we can confirm that the errors in the estimated velocity and offset in the proposed method were smaller than those in the conventional method. Additionally, a comparison of the corresponding results in each cluster indicates that the median values in the proposed method existed around zero, and those in the conventional method were far from zero.
Similarly, Figure 28 and Figure 29 show the comparison of the uncertainty of estimated variables at the same moment. σ v and σ o indicate the uncertainties in the estimated velocity and offset, respectively. In the conventional method, the uncertainties of the estimated velocity converged to the same values as in the literature-based statistical information. By contrast, because the uncertainty of the estimated velocity depended on the uncertainty of the estimated position in the proposed method, the uncertainty of the velocity differed from each trial. In addition, the uncertainties of the offset in the proposed method were smaller than those in the conventional method.

5.4.3. Discussion

To evaluate the position estimation methods, discussions from the perspectives of adequacy and performance are necessary. Regarding adequacy, the average ratio of the ground truth variables inside the 95% confidence interval were 93% and 91% in the proposed method with ideal classification; they were less than 95%. As shown in Figure 25, the cases in which acceleration/deceleration was more intense than average show a low ratio of the ground truth value inside the confidence interval. Such trials are basically clustered in the upper area of the dendrogram shown in Figure 16, which indicates that such trials are relatively outliers in the cluster. Thus, if we accumulate much more data and increase the number of clusters, such trials will belong to the new clusters; further, it will increase the average ratio inside the confidence interval and improve the adequateness.
As for the performance, although the estimated values in the conventional method converged to constant values based on literature-based statistical information, those in the proposed method changed according to the location. As a result, the proposed method could decrease the errors of the estimated values when the cyclist arrived at the point that was 100 m away from the roadside sensor in intersection 1. Here, the median errors of each cluster estimated by the proposed method were better than those obtained by the conventional method. Thus, consideration of movement characteristics from the viewpoint of group differences may contribute to the improvement of the results. Furthermore, the smaller uncertainty of the estimated velocity in the proposed method also contributes to the improvement in the uncertainty of the offset, which will contribute to the adequate behaviors of connected ADASs [6]. From these results, we can confirm that the proposed method improved the performance of movement estimation owing to the use of LDSI.

5.5. Position Estimation Results with Miss Classification

In the above estimation results, we assumed an ideally accurate classification method, as mentioned in Section 5.1. However, realization of such ideal classification method is practically hard, and miss-classifications are inevitable in the real conditions to a certain degree. Therefore, we also simulated position estimation with wrong classifications, and analyzed the average ratio within the 95% confidence interval, similar to Section 5.4.2. Table 3 shows the comparison of the average ratio between estimation with correct classifications and wrong classifications. In general, the average ratios within the 95% confidence interval with wrong classification are low compared to those with correct classification. In detail, because the movement characteristics of each cluster are different from each other, the degree of degrading the ratio is various.
Because the results indicate the importance of correct classification, development of an accurate classification method will be required. Additionally, not only the high accuracy of the classification method but also consideration of classification uncertainty are important. In particular, to prevent inappropriate position estimation with large errors caused by wrong classification, position estimation method considering the classification uncertainty is important. For instance, when the uncertainty of the classification is large, utilization of the literature-based statistical information instead of LDSI would contribute to the improvement of estimation robustness. Such method, which switches the estimation method to a conservative one based on classification uncertainty can be one of the solutions to improve robustness of proposed method.

5.6. Limitations and Future Prospects

Because this study was conducted as a first step of the proposed approach, we assumed many ideal conditions. First, we conducted position estimation only with the simulation. Thus, the feasibility validation of proposed method was limited. It is desirable to use the data obtained by actual roadside sensors for validation. Additionally, it is also desirable to use the data with precise ground truth value. Therefore, an experiment in the real-world using actual roadside sensors as well as real time kinematic GNSS for obtaining ground truth values is necessary. Such actual data will enable us to validate proposed method in detail.
Second, we used the estimated trajectories by Kalman smoother as the ground truth in the simulation. Utilization of the estimated trajectories, which are the original sources of LDSI, as the ground truth may increase the risk of inflating the performance in the simulation. However, virtual observation and virtual control input are not calculated directly from the ground truth data. They are calculated from LDSI, which is not a value directly related to a single data, but a representative value of multiple data. Thus, the risk of inflating the performance was not crucial. However, although the risk was limited, additional data is necessary to separate data for preparing LDSI and those for simulation to conduct more appropriate verification.
Third, we assumed static environmental conditions, and did not consider the change in the cyclists’ movement caused by changes in the environmental conditions, such as time of day, weather, and so on. Thus, robustness of proposed method to the changes in environmental conditions was limited. To overcome such limitation, a detection method for the changes in environmental conditions and an adaptation method of LDSI to such changes are required. In the future smart city, some cyclists are assumed to be real-time connected to the system. The real-time information including movement and density from such cyclists enables the system to detect the changes in environmental conditions. When the changes in environmental conditions are detected, the system can adapt LDSI to them. Position estimation with such adaptive LDSI will improve the robustness of proposed method.
Finally, in this study, only 10 participants were recruited to the experiment, and the generality of data was limited. To overcome such limitation, utilization of GNSS data of smartphones collected by telecommunication companies or smartphone-application companies is effective. A study using such data for analysis of pedestrians’ travel characteristics in macro scale [38] was already conducted. Although existing study [38] conducted macroscopic people-flow analysis, applying the proposed method in this study to such data will contribute to the microscopic people-flow analysis. In addition, because the study [38] demonstrated that the large amount of GNSS data is available from such data sources, sufficient data for constructing general LDSI would be available. Furthermore, considering the high penetration ratio of smartphones, GNSS data are collected from wide range of people. Therefore, we think that sufficient data from various cyclists will reflect the actual characteristics of the cyclists on the roads where corresponding data are obtained. Thus, the utilization of such data enables a more general validation of the proposed method.

6. Conclusions

In this study, we developed a method for preparing LDSI from accumulated GNSS data of cyclists using the Kalman smoother. We then developed a movement estimation method using virtual observation and virtual control input based on LDSI. To evaluate the proposed methods, we conducted an experiment to accumulate the GNSS data of cyclists. From the experimental data, we confirmed that our proposed method adequately prepared the LDSI. In addition, we confirmed that our proposed method using LDSI estimated the movement state of cyclists better than the conventional method. These methods will contribute to the prevention of cyclist-related crossing collisions around nonsignalized intersections with occlusions on community roads, where roadside sensors are sparsely placed.
However, further studies are still needed. In this study, because this study aimed to develop the methods as a first step, the number of the experimental participants and the variations was limited. Thus, increasing the number of participants for data accumulation and the variations in traffic environments is necessary. Additionally, we simulated a numerical experiment of the position estimation of cyclists with the assumption of an ideal classification system that classifies cyclists into appropriate clusters related to LDSI. Thus, we need to develop a classification system based on information from roadside sensors. Additionally, real-world experiments on actual community roads need to be conducted using actual roadside sensors. After completing the above future works, we will try to implement the proposed method in this study into the connected ADASs including motion control.

Author Contributions

Conceptualization, K.S. and T.I.; methodology, K.S. and T.I.; software, K.S.; validation, K.S.; formal analysis, K.S. and T.I.; investigation, K.S. and T.I.; data curation, K.S.; writing—original draft preparation, K.S. and T.I.; writing—review and editing, K.S. and T.I.; supervision, T.I.; project administration, T.I.; funding acquisition, T.I. All authors have read and agreed to the published version of the manuscript.

Funding

This study was supported by the Suzuki Foundation. The authors would like to thank them for providing financial support.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki, and the protocol for this study was approved by Research Ethics Committee, School of Engineering, The University of Tokyo (KE24-52) on 21 October 2024. We obtained written informed consent from the participants.

Informed Consent Statement

Informed consent was obtained from all the subjects involved in the study.

Data Availability Statement

The original contributions presented in this study are included in the article.

Acknowledgments

The authors would like to acknowledge Gen Yamamoto for helping conduct experiments of data collection.

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

Abbreviations

The following abbreviations are used in this manuscript:
GNSSGlobal navigation satellite system
VRUVulnerable road user
ADASAdvanced driver-assistance system
LDSILocation-dependent statistical information
RSURoadside unit
V2PVehicle to pedestrian
V2IVehicle to infrastructure
P2IPedestrian to infrastructure
MLMachine learning
LSTMLong short-term memory
EKFExtended Kalman filter
RNNRecurrent neural network
NNNeural network
LDLateral deviation
SWLDSIStochastically weighted location-dependent statistical information

Appendix A

The derivations of Equation (53) in Section 3.4.3 are as follows:
μ ~ v t = E v = v p S W v , t d v = v i = 1 n α i t 1 2 π σ ~ L D S I v , i 2 exp v μ ~ L D S I v , i 2 2 σ ~ L D S I v , i 2 d v
Here, the first two derivations are from definition, and the last derivation is obtained by Equation (52). The rest of the derivation is as follows:
μ ~ v t = i = 1 n α i t v 1 2 π σ ~ L D S I v , i 2 exp v μ ~ L D S I v , i 2 2 σ ~ L D S I v , i 2 d v = i = 1 n α i t μ ~ L D S I v , i
The first derivation can be obtained because α i t is not a function of v . The last derivation is obtained from the definition of normal distribution.
The derivations of Equation (54) in Section 3.4.3 are as follows:
σ ~ v 2 t = V a r v = E v 2 E v 2 = v 2 p S W v ,   t d v i = 1 n α i t μ ~ L D S I v , i 2 = v 2 i = 1 n α i t 1 2 π σ ~ L D S I v , i 2 exp v μ ~ L D S I v , i 2 2 σ ~ L D S I v , i 2 d v i = 1 n α i t μ ~ L D S I v , i 2
Here, the first two derivations are from definition, and the third derivation is obtained by using Equation (A2). The last derivation is obtained by Equation (52). The rest of the derivations are as follows:
σ ~ v 2 t = i = 1 n α i t v 2 1 2 π σ ~ L D S I v , i 2 exp v μ ~ L D S I v , i 2 2 σ ~ L D S I v , i 2 d v i = 1 n α i t μ ~ L D S I v , i 2 = i = 1 n α i t σ ~ L D S I v , i 2 + μ ~ L D S I v , i 2 i = 1 n α i t μ ~ L D S I v , i 2 = i = 1 n α i t σ ~ L D S I v , i 2 + i = 1 n α i t μ ~ L D S I v , i 2 i = 1 n α i t μ ~ L D S I v , i 2
The first derivation can be obtained because α i t is not a function of v . The second derivation is obtained by using the definition of normal distribution.

References

  1. Cabinet Office. White Paper on Traffic Safety in Japan 2023. Available online: https://www8.cao.go.jp/koutu/taisaku/r05kou_haku/english/pdf/wp2023.pdf (accessed on 29 January 2025).
  2. Cabinet Office. White Paper on Traffic Safety in Japan 2021. Available online: https://www8.cao.go.jp/koutu/taisaku/r03kou_haku/english/pdf/wp2021-1.pdf (accessed on 7 January 2025).
  3. National Police Agency. Annual Report by Traffic Accidents 2024 Trends in Traffic Accidents Involving Pedal Cyclist (Primary or Secondary Party) by Type of Accident. Available online: https://www.npa.go.jp/publications/statistics/koutsuu/toukeihyo_e.html (accessed on 2 July 2025).
  4. European Road Safety Observatory. European Commission (2024) Facts and Figures Cyclists. Available online: https://road-safety.transport.ec.europa.eu/document/download/5efc920c-99be-4870-9e4a-8f9c3029f9cb_en?filename=ff_cyclists_14082024.pdf (accessed on 28 July 2025).
  5. Zhang, G.; Tan, Y.; Jou, R.-C. Factors influencing traffic signal violations by car drivers, cyclists, and pedestrians: A case study from Guangdong, China. Transp. Res. Part F Traffic Psychol. Behav. 2016, 42, 205–216. [Google Scholar] [CrossRef]
  6. Ito, T. Connected Collision Avoidance System via Stochastic Localization on Community Roads. Int. J. Automot. Eng. 2025, 16, 1–11. [Google Scholar] [CrossRef] [PubMed]
  7. Suzuki, K.; Watanabe, K.; Yang, J.; Ito, T. Virtual Observation Using Statistical Information of Cyclist’s Velocity for Estimation of Position and Uncertainty. Int. J. Automot. Eng. 2025, 16, 54–65. [Google Scholar] [CrossRef] [PubMed]
  8. Bo Bo, N.; Slembrouck, M.; Veelaert, P.; Philips, W. Distributed Multi-Class Road User Tracking in Multi Camera Network for Smart Traffic Applications. Adv. Concepts Intell. Vis. Syst. 2020, 12002, 517–528. [Google Scholar] [CrossRef]
  9. Zhang, J.; Xiao, W.; Coifman, B.; Mills, J.P. Vehicle Tracking and Speed Estimation From Roadside Lidar. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2020, 13, 5597–5608. [Google Scholar] [CrossRef]
  10. Zhang, R.; Zou, Z.; Shen, S.; Liu, H.X. Design, Implementation, and Evaluation of a Roadside Cooperative Perception System. Transp. Res. Rec. 2022, 2676, 273–284. [Google Scholar] [CrossRef]
  11. Sugimoto, C.; Nakamura, Y.; Hashimoto, T. Prototype of pedestrian-to-vehicle communication system for the prevention of pedestrian accidents using both 3G wireless and WLAN communication. In Proceedings of the 2008 3rd International Symposium on Wireless Pervasive Computing, Santorini, Greece, 7–9 May 2008; pp. 764–767. [Google Scholar] [CrossRef]
  12. Liu, W.; Muramatsu, S.; Okubo, Y. Cooperation of V2I/P2I Communication and Roadside Radar Perception for the Safety of Vulnerable Road Users. In Proceedings of the 2018 16th International Conference on Intelligent Transportation Systems Telecommunications (ITST), Lisboa, Portugal, 15–17 October 2018; pp. 1–7. [Google Scholar] [CrossRef]
  13. Hussein, A.; García, F.; Armingol, J.M.; Olaverri-Monreal, C. P2V and V2P communication for Pedestrian warning on the basis of Autonomous Vehicles. In Proceedings of the 2016 IEEE 19th International Conference on Intelligent Transportation Systems (ITSC), Rio de Janeiro, Brazil, 1–4 November 2016; pp. 2034–2039. [Google Scholar] [CrossRef]
  14. Li, P.; Guo, H.; Bao, S.; Kusari, A. A Probabilistic Framework for Estimating the Risk of Pedestrian Vehicle Conflicts at Intersections. IEEE Trans. Intell. Transp. Syst. 2023, 24, 14111–14120. [Google Scholar] [CrossRef]
  15. Zhou, S.; Xu, H.; Zhang, G.; Ma, T.; Yang, Y. Deep learning based pedestrian trajectory prediction and risk assessment at signalized intersections using trajectory data captured through roadside LiDAR. J. Intell. Transp. Syst. 2023, 28, 793–805. [Google Scholar] [CrossRef]
  16. Deo, N.; Trivedi, M.M. Convolutional Social Pooling for Vehicle Trajectory Prediction. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Salt Lake City, UT, USA, 18–22 June 2018; pp. 1549–15498. [Google Scholar] [CrossRef]
  17. Liu, Y.; Zhang, J.; Fang, L.; Jiang, Q.; Zhou, B. Multimodal Motion Prediction with Stacked Transformers. In Proceedings of the 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA, 20–25 June 2021; pp. 7573–7582. [Google Scholar] [CrossRef]
  18. Boulkroune, B.; Geebelen, K.; Wan, J.; van Nunen, E. Auto-tuning extended Kalman filters to improve state estimation. In Proceedings of the 2023 IEEE Intelligent Vehicles Symposium (IV), Anchorage, AK, USA, 4–7 June 2023; pp. 1–6. [Google Scholar] [CrossRef]
  19. Jouaber, S.; Bonnabel, S.; Velasco-Forero, S.; Pilté, M. NNAKF: A Neural Network Adapted Kalman Filter for Target Tracking. In Proceedings of the ICASSP 2021—2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toronto, ON, Canada, 6–11 June 2021; pp. 4075–4079. [Google Scholar] [CrossRef]
  20. Jeong, Y.; Yi, K. Target Vehicle Motion Prediction-Based Motion Planning Framework for Autonomous Driving in Uncontrolled Intersections. IEEE Trans. Intell. Transp. Syst. 2021, 22, 168–177. [Google Scholar] [CrossRef]
  21. Yi, D.; Su, J.; Liu, C.; Chen, W.-H. Trajectory Clustering Aided Personalized Driver Intention Prediction for Intelligent Vehicles. Trans. Ind. Inform. 2019, 15, 3693–3702. [Google Scholar] [CrossRef]
  22. Tahk, M.; Speyer, J.L. Target tracking problems subject to kinematic constraints. IEEE Trans. Autom. Control. 1990, 35, 324–326. [Google Scholar] [CrossRef]
  23. Alouani, A.T.; Blair, W.D. Use of a kinematic constraint in tracking constant speed, maneuvering targets. IEEE Trans. Autom. Control. 1993, 38, 1107–1111. [Google Scholar] [CrossRef]
  24. Zhou, G.; Li, K.; Kirubarajan, T.; Xu, L. State Estimation With Trajectory Shape Constraints Using Pseudomeasurements. IEEE Trans. Aerosp. Electron. Syst. 2019, 55, 2395–2407. [Google Scholar] [CrossRef]
  25. Zhang, Z.; Li, K.; Zhou, G. State Estimation with Heading Constraints for on-Road Vehicle Tracking. IEEE Trans. Intell. Transp. Syst. 2022, 23, 13614–13635. [Google Scholar] [CrossRef]
  26. Vargas-Meléndez, L.; Boada, B.L.; Boada, M.J.L.; Gauchía, A.; Díaz, V. A Sensor Fusion Method Based on an Integrated Neural Network and Kalman Filter for Vehicle Roll Angle Estimation. Sensors 2016, 16, 1400. [Google Scholar] [CrossRef] [PubMed]
  27. Kim, D.; Kim, G.; Choi, S.; Huh, K. An Integrated Deep Ensemble-Unscented Kalman Filter for Sideslip Angle Estimation with Sensor Filtering Network. IEEE Access 2021, 9, 149681–149689. [Google Scholar] [CrossRef]
  28. Jiang, Y.; Nong, X. A Radar Filtering Model for Aerial Surveillance Base on Kalman Filter and Neural Network. In Proceedings of the 2020 IEEE 11th International Conference on Software Engineering and Service Science (ICSESS), Beijing, China, 16–18 October 2020; pp. 57–60. [Google Scholar] [CrossRef]
  29. Ito, T.; Mio, M.; Tohriyama, K.; Kamata, M. Novel Map Platform based on Primitive Elements of Traffic Environments for Automated Driving Technologies. Int. J. Automot. Eng. 2016, 7, 143–151. [Google Scholar] [CrossRef] [PubMed]
  30. Brown, R.G.; Hwang, P.Y.C. Introduction to Random Signals and Applied Kalman Filtering; John Wiley and Sons: Hoboken, NJ, USA, 2012; pp. 208–209. [Google Scholar]
  31. Japan Road Association. Explanation and Operation of Government Order on Road Design Standards; Japan Road Association: Tokyo, Japan, 2021; p. 517. (In Japanese) [Google Scholar]
  32. Japan Society of Traffic Engineers. Road Traffic Technology Handbook 2024; Japan Society of Traffic Engineers: Tokyo, Japan, 2024; p. 44. (In Japanese) [Google Scholar]
  33. Yuuki, K.; Hideo, Y.; Kensuke, K.; Shunsuke, Y. An analysis on characteristics of cycling behavior by elderly people at slopes and in starting to move. In Proceedings of the 39th Infrastructure Planning Conference, Tokushima, Japan, 13–14 January 2009. (In Japanese). [Google Scholar]
  34. Givens, C.R.; Shortt, R.M. A class of Wasserstein metrics for probability distributions. Mich. Math. J. 1984, 31, 231–240. [Google Scholar] [CrossRef]
  35. Kučera, V. The discrete Riccati equation of optimal control. Kybernetika 1972, 8, 430–447. [Google Scholar]
  36. Geospatial Information Authority of Japan. GSI Maps. Available online: https://maps.gsi.go.jp/ (accessed on 13 July 2025).
  37. Geospatial Information Authority of Japan. The Aerial Photograph by Geospatial Information Authority of Japan; Geospatial Information Authority of Japan: Ibaraki, Japan, 2019. [Google Scholar]
  38. Eom, S.; Kim, H.; Hasegawa, D.; Yamada, I. Pedestrian movement with large-scale GPS records and transit-oriented development attributes. Sustain. Cities Soc. 2024, 102, 105223. [Google Scholar] [CrossRef]
Figure 1. Appearance of a nonsignalized intersection on a community road in Japan.
Figure 1. Appearance of a nonsignalized intersection on a community road in Japan.
Sensors 25 05122 g001
Figure 2. Conceptual schematic of a cyclist moving behind occlusions.
Figure 2. Conceptual schematic of a cyclist moving behind occlusions.
Sensors 25 05122 g002
Figure 3. Conceptual schematic of assumed situation and our previously proposed method.
Figure 3. Conceptual schematic of assumed situation and our previously proposed method.
Sensors 25 05122 g003
Figure 4. Conceptual schematic of the proposed method in this study.
Figure 4. Conceptual schematic of the proposed method in this study.
Sensors 25 05122 g004
Figure 5. Conceptual schematic of LDSI preparation and application. The details of analysis in the data accumulation and analysis in the real-time estimation are explained in Section 3.3 and Section 3.4, respectively. The details of the experiment related to the data accumulation and the numerical simulation related to the real-time estimation are explained in Section 4 and Section 5, respectively.
Figure 5. Conceptual schematic of LDSI preparation and application. The details of analysis in the data accumulation and analysis in the real-time estimation are explained in Section 3.3 and Section 3.4, respectively. The details of the experiment related to the data accumulation and the numerical simulation related to the real-time estimation are explained in Section 4 and Section 5, respectively.
Sensors 25 05122 g005
Figure 6. Conceptual schematic of the node-link map assumed in this study.
Figure 6. Conceptual schematic of the node-link map assumed in this study.
Sensors 25 05122 g006
Figure 7. Overview of the proposed method. The red cross signs in the top left figure indicate the position of cyclists. The top middle graphs indicate the velocity along offset axis. The colors of the graph indicate the corresponding clusters. The top right vectors indicate LDSI of clusters. The methods related to preprocessing and real-time estimation are explained in Section 3.3 and Section 3.4, respectively.
Figure 7. Overview of the proposed method. The red cross signs in the top left figure indicate the position of cyclists. The top middle graphs indicate the velocity along offset axis. The colors of the graph indicate the corresponding clusters. The top right vectors indicate LDSI of clusters. The methods related to preprocessing and real-time estimation are explained in Section 3.3 and Section 3.4, respectively.
Sensors 25 05122 g007
Figure 8. Pipeline of movement estimation using LDSI.
Figure 8. Pipeline of movement estimation using LDSI.
Sensors 25 05122 g008
Figure 9. Conceptual schematic of the SWLDSI calculation. The red line in the left graph indicates stochastic position distribution of the target cyclist along offset axis.
Figure 9. Conceptual schematic of the SWLDSI calculation. The red line in the left graph indicates stochastic position distribution of the target cyclist along offset axis.
Sensors 25 05122 g009
Figure 10. Conceptual schematic of the calculation process for virtual observation and virtual control input. The colors in the figure indicate the time step of estimated variables. Blue indicates the estimated variables at t k 1 estimated from t k 1 . Yellow indicates the estimated variables at t k estimated from t k 1 . Green indicates the estimated variables at t k estimated from t k .
Figure 10. Conceptual schematic of the calculation process for virtual observation and virtual control input. The colors in the figure indicate the time step of estimated variables. Blue indicates the estimated variables at t k 1 estimated from t k 1 . Yellow indicates the estimated variables at t k estimated from t k 1 . Green indicates the estimated variables at t k estimated from t k .
Sensors 25 05122 g010
Figure 11. (a) Map of the experiment site (created by editing the digital map [36]). (b) Aerial photo of the experiment site (created by editing the aerial photograph [37]). The Chinese characters on the left side of (a) indicate a “post office”, and the Chinese characters on the right side of (a) indicate the “Hongo campus of the University of Tokyo”.
Figure 11. (a) Map of the experiment site (created by editing the digital map [36]). (b) Aerial photo of the experiment site (created by editing the aerial photograph [37]). The Chinese characters on the left side of (a) indicate a “post office”, and the Chinese characters on the right side of (a) indicate the “Hongo campus of the University of Tokyo”.
Sensors 25 05122 g011
Figure 12. Accumulated directional angle data along offset axis.
Figure 12. Accumulated directional angle data along offset axis.
Sensors 25 05122 g012
Figure 13. Accumulated velocity data along the offset axis.
Figure 13. Accumulated velocity data along the offset axis.
Sensors 25 05122 g013
Figure 14. Accumulated yawing rate data along the offset axis.
Figure 14. Accumulated yawing rate data along the offset axis.
Sensors 25 05122 g014
Figure 15. Accumulated acceleration data along the offset axis.
Figure 15. Accumulated acceleration data along the offset axis.
Sensors 25 05122 g015
Figure 16. Dendrogram of clustering results. The blue lines, the green lines, and the red lines indicate the data in Cluster 1, data in Cluster 2, and the data in Cluster 3, respectively. The orange dashed line indicates the cut-off line.
Figure 16. Dendrogram of clustering results. The blue lines, the green lines, and the red lines indicate the data in Cluster 1, data in Cluster 2, and the data in Cluster 3, respectively. The orange dashed line indicates the cut-off line.
Sensors 25 05122 g016
Figure 17. Velocity in each cluster.
Figure 17. Velocity in each cluster.
Sensors 25 05122 g017
Figure 18. Acceleration in each cluster.
Figure 18. Acceleration in each cluster.
Sensors 25 05122 g018
Figure 19. Directional angle in each cluster.
Figure 19. Directional angle in each cluster.
Sensors 25 05122 g019
Figure 20. Yawing rate in each cluster.
Figure 20. Yawing rate in each cluster.
Sensors 25 05122 g020
Figure 21. Comparison between the conventional method in our previous study and the method proposed in this study.
Figure 21. Comparison between the conventional method in our previous study and the method proposed in this study.
Sensors 25 05122 g021
Figure 22. Estimated results of data 22 in Cluster 2 using the conventional method.
Figure 22. Estimated results of data 22 in Cluster 2 using the conventional method.
Sensors 25 05122 g022
Figure 23. Estimated results of data 22 in Cluster 2 using the proposed method.
Figure 23. Estimated results of data 22 in Cluster 2 using the proposed method.
Sensors 25 05122 g023
Figure 24. Estimated results of data 11 in Cluster 2 using the conventional method.
Figure 24. Estimated results of data 11 in Cluster 2 using the conventional method.
Sensors 25 05122 g024
Figure 25. Estimated results of data 11 in Cluster 2 using the proposed method.
Figure 25. Estimated results of data 11 in Cluster 2 using the proposed method.
Sensors 25 05122 g025
Figure 26. Comparison of estimated velocity error between conventional and proposed methods. The blue boxes in the left graph indicate the interquartile range, and the red lines inside the blue box indicate the median. The red plus signs in the graphs indicate the outliers.
Figure 26. Comparison of estimated velocity error between conventional and proposed methods. The blue boxes in the left graph indicate the interquartile range, and the red lines inside the blue box indicate the median. The red plus signs in the graphs indicate the outliers.
Sensors 25 05122 g026
Figure 27. Comparison of estimated offset error between conventional and proposed methods. The blue boxes in the left graph indicate the interquartile range, and the red lines inside the blue box indicate the median. The red plus signs in the graphs indicate the outliers.
Figure 27. Comparison of estimated offset error between conventional and proposed methods. The blue boxes in the left graph indicate the interquartile range, and the red lines inside the blue box indicate the median. The red plus signs in the graphs indicate the outliers.
Sensors 25 05122 g027
Figure 28. Comparison of velocity uncertainty between conventional and proposed methods. The blue boxes in the left graph indicate the interquartile range, and the red lines inside the blue box indicate the median. The red plus signs in the graphs indicate the outliers.
Figure 28. Comparison of velocity uncertainty between conventional and proposed methods. The blue boxes in the left graph indicate the interquartile range, and the red lines inside the blue box indicate the median. The red plus signs in the graphs indicate the outliers.
Sensors 25 05122 g028
Figure 29. Comparison of offset uncertainty between conventional and proposed methods. The blue boxes in the left graph indicate the interquartile range, and the red lines inside the blue box indicate the median.
Figure 29. Comparison of offset uncertainty between conventional and proposed methods. The blue boxes in the left graph indicate the interquartile range, and the red lines inside the blue box indicate the median.
Sensors 25 05122 g029
Table 1. Average ratio within the 95% confidence interval in velocity estimation.
Table 1. Average ratio within the 95% confidence interval in velocity estimation.
MethodsAll Data (29 Data)Cluster 1 (5 Data)Cluster 2 (9 Data)Cluster 3 (15 Data)
Conventional1.001.001.001.00
Proposed0.930.900.920.94
Table 2. Average ratio within the 95% confidence interval in offset estimation.
Table 2. Average ratio within the 95% confidence interval in offset estimation.
MethodsAll Data (29 Data)Cluster 1 (5 Data)Cluster 2 (9 Data)Cluster 3 (15 Data)
Conventional0.971.001.000.95
Proposed0.910.770.930.93
Table 3. Average ratio within 95% confidence interval in the state estimation with wrong classification.
Table 3. Average ratio within 95% confidence interval in the state estimation with wrong classification.
GT ClusterCluster 1 (5 Data)Cluster 2 (9 Data)Cluster 3 (15 Data)
Classified Cluster123123123
CorrectWrongWrongWrongCorrectWrongWrongWrongCorrect
Average ratio within the confidence intervalVelocity0.900.720.370.610.920.890.400.460.94
Offset0.770.650.360.540.930.820.240.150.93
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Suzuki, K.; Ito, T. Virtual Observation Using Location-Dependent Statistical Information of Cyclists’ Movement for Estimation of Position and Uncertainty. Sensors 2025, 25, 5122. https://doi.org/10.3390/s25165122

AMA Style

Suzuki K, Ito T. Virtual Observation Using Location-Dependent Statistical Information of Cyclists’ Movement for Estimation of Position and Uncertainty. Sensors. 2025; 25(16):5122. https://doi.org/10.3390/s25165122

Chicago/Turabian Style

Suzuki, Kento, and Takuma Ito. 2025. "Virtual Observation Using Location-Dependent Statistical Information of Cyclists’ Movement for Estimation of Position and Uncertainty" Sensors 25, no. 16: 5122. https://doi.org/10.3390/s25165122

APA Style

Suzuki, K., & Ito, T. (2025). Virtual Observation Using Location-Dependent Statistical Information of Cyclists’ Movement for Estimation of Position and Uncertainty. Sensors, 25(16), 5122. https://doi.org/10.3390/s25165122

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop