Trajectory Data Preprocessing: Methods and Models
Abstract
1. Introduction
2. Overview of Research Methods
2.1. PRISMA Declaration
2.2. Retrieval Strategy
- IEEE Xplore;
- ACM Digital Library;
- Scopus;
- Web of Science;
- Transport Research International Documentation.
2.3. Inclusion and Exclusion Criteria
2.4. Literature Screening and Data Extraction
2.5. Literature Screening Process
2.6. Substantiation of Comprehensive Coverage
3. Trajectory Data Preprocessing Methods
3.1. Data Cleaning
- Removing outliers: Outliers are data points that are significantly different from the rest of the data. They can be caused by errors in the GPS tracking device or environmental factors. Removing these outliers can improve the accuracy of the data.
- Interpolating missing data: Sometimes, GPS tracking devices may lose signal or malfunction, resulting in gaps in the data. Interpolation generally estimates missing data points based on neighboring data points.
- Smoothing data: Smoothing involves averaging out small variations in the data to remove noise. This can be carried out using a moving average or a low-pass filter.
- Aligning data: If the GPS device and the data processing software are not synchronized, there may be time delays or offsets in the data. Aligning the data involves adjusting the timestamps to ensure that they are accurate.
3.1.1. Removing Outliers and Duplicates
3.1.2. Data Interpolating
3.1.3. Data Smoothing
3.1.4. Data Aligning
3.2. Trajectory Compression
3.2.1. Line-Simplification-Based Trajectory Compression
- Offline compression is a static compression based on global trajectory characteristics, which compresses the entire trajectory data after it has been read. A classic example of this approach is the Douglas–Peucker algorithm [46]. The algorithm uses the segmented and simplified lines to replace the original trajectory (as shown in Figure 3a–c). This splitting process repeats recursively for each new segment until all deviations fall below the threshold or only endpoints remain. The final simplified trajectory consists of key points such as T1, T5, T10, and T14. The Douglas–Peucker algorithm greatly recognizes the data information of special nodes and has a better performance in both compression rate and accuracy. However, the disadvantage of this algorithm is the high time complexity.
- 2.
- Online compression is a dynamic compression based on local trajectory features, which compresses trajectory data in real time. The most representative algorithm for online compression is Sliding Window [51] and Opening Window [52]. The main idea of the Sliding Window algorithm is to compress trajectories through increasing the size of the window until the error for the potential segment is greater than the threshold. A sliding window with a unit of 1 is defined from the start point of the trajectory, and the size of the window is gradually increased until there exists a trajectory point within the window whose distance to the approximated line segment is greater than the error threshold. Then, the trajectory within the previous window is compressed, as shown in Figure 3d–f. Due to the Sliding Window algorithm’s inability to look ahead and lack of a global view like offline algorithms, the compression accuracy is somewhat lacking.
3.2.2. Road Network Constrained Trajectory Compression
3.2.3. Semantic Trajectory Compression
3.3. Trajectory Segmentation
3.3.1. Supervised Trajectory Segmentation
3.3.2. Unsupervised Trajectory Segmentation
- Clustering-based segmentation. In earlier research, unsupervised trajectory segmentation algorithms were mostly based on clustering, such as clustering based on distance, speed, acceleration, density, direction, etc. Among them, TRACLUS is one of the representative algorithms for unsupervised trajectory segmentation methods, which is a distance-based clustering algorithm [84]. It proposed three distance properties: perpendicular distance, parallel distance, and angle distance. Approximate trajectory partitioning is achieved by using the Minimum Description Length (MDL) to partition trajectories. SMoT is a time-based clustering algorithm, which extracts stop points and move points from a trajectory based on the stop time in the trajectory, and then divides a trajectory into multiple stop and move segments [85]. CB-SMoT is a speed-based clustering algorithm [86], and its most significant difference from the SMoT algorithm is that CB-SMoT adds speed as a segmentation criterion on the basis of SMoT, which not only considers the spatial and temporal relationships between trajectory points, but also takes into account the influence of speed. Therefore, the CB-SMoT algorithm can identify stop points and move points more accurately. DB-SMoT is a direction-based clustering algorithm [87]. It clusters trajectories by computing the magnitude of the direction changes. SMoT, CB-SMoT, and DB-SMoT do not require any labels or prior knowledge to perform clustering operations. Time, speed, and direction are only the metrics they use to calculate clustering, not labels for clustering. Leiva et al. proposed a well-known Warped K-Means model to achieve unsupervised trajectory segmentation through sequential clustering algorithm [88]. By minimizing the criterion function with the Sum of Quadratic Error (SQE) which incorporates rigid temporal constraints in the trajectory segmentation step, WKM achieves a more efficient and accurate trajectory segmentation. As is known, K-means algorithm has the advantages of low complexity and high robustness that can quickly converge to local minima. However, it requires the number of priori clusters k to be an input parameter, which limits its application and development in multiple scenarios. Buchin et al. presented an algorithmic framework that can efficiently segment trajectories according to feature combination analysis [89,90]. This framework uses a greedy strategy for segmentation to obtain an optimal solution for monotone criteria, which is similar to the principle of clustering algorithms. It ensures that the trajectory is divided into as few segments as possible and maximizes the length of each individual piece. In addition, they proposed two routines, TEST and FURTHEST, to segment three-dimensional trajectories, respectively. In univariate attribute criteria, it segments trajectories based on the criteria of location, velocity, and heading. In combinations of attribute criteria, they present two different ways of combining criteria, namely, Boolean combinations and linear combinations. In addition, three more complex criteria have been added on the basis of the univariate attribute criteria, which are curvature, sinuosity, and curviness. The proposed method can improve the robustness of the trajectory segmentation algorithm.
- Cost-function-based segmentation. The cost function is commonly used in unsupervised trajectory segmentation, mainly to calculate the distance, correlation, and similarity of trajectories. Yan et al. proposed an unsupervised trajectory construction platform, namely, SeTraStream [91]. This method uses a sliding window strategy to achieve online trajectory segmentation. Firstly, the feature vectors of the new incoming batch of raw trajectories are extracted to form the corresponding matrix, and the new batch of trajectories is buffered into a segmentation queue, waiting for segmentation processing. Secondly, the earliest trajectory that completes segmentation is dequeued and a candidate division point is placed at the end of the original batch as shown in Figure 4. Then, the new batch of trajectory Wr is compared with the previous batch of trajectory matrix Wl to calculate the correlation between these two feature vectors with an RV-coefficient function which is a generalization of the correlation coefficient for matrix data. This process is called short-term changes seeking. If the short-term changes do not trigger the threshold of the segmentation algorithm, the comparison window is doubled and long-term changes are sought to find a division point. If no long-term changes are detected, segmentation stops until the next new batch of trajectories is buffered into the queue and the trajectory segmentation process is restarted.
- 3.
- Interpolation-based segmentation. Trajectory interpolation is one of the commonly used methods for trajectory segmentation. Etemad defined an octal window (Sow), which is a sub-trajectory with seven trajectory points, and proposed a trajectory segmentation algorithm based on interpolation called Octal Window Segmentation (OWS) [94]. In octal window, new estimated trajectory points can be created using interpolation techniques, and an error signal can be created by comparing the actual position and estimated position of the moving object. By evaluating the error signal, we can determine whether there has been a heterogeneous change in the trajectory of the moving object. The advantage of this algorithm is that it can adapt to different application scenarios by selecting different interpolation algorithms, which can improve the flexibility of trajectory segmentation. After OWS, Etemad extended Sow to a sliding window, which modified the window size to a configurable parameter with a minimum value of 7, and named the algorithm Sliding Window Segmentation Algorithm (SWS) [95]. The author analyzed the segmentation performance of the SWS algorithm under different window sizes. The generation of error signals and trajectory segmentation are realized through different interpolation techniques, such as linear interpolation, cubic interpolation, random walk interpolation, and kinematic interpolation. At the same time, the robustness of different interpolation techniques is also analyzed. Highlighted that SWS can endure more noise to some extend by having the forward and backward extrapolation mechanisms. Therefore, the algorithm has a high robustness to noise.
- 4.
- Semantic-based segmentation. Apart from speed, acceleration, direction, and trajectory similarity, the metric for trajectory segmentation also includes transportation semantic and mode. Inspired by the deep semantic image segmentation in computer vision applications, deep semantic segmentation has been applied to trajectory segmentation. By extracting the motion feature sequence information of GPS raw trajectories, such as geographic location, traffic patterns, user behavior, trajectory relationships, and other feature vectors, the trajectory is segmented and assigned corresponding semantics for each segment. One of the representative methods is Convolutional Neural Network (CNN) schemes for trajectory segmentation, firstly, with an extraction of continuous, overlapping, and equidistant trajectory segments for segmentation, and extracting features such as position, velocity, and acceleration for each segment. Then, transportation modes are recognized through a CNN framework. This method has high segmentation efficiency, but, in cases where traffic patterns are more complex, the uniformly fixed segmentation method will lead to a decrease in accuracy. To address this issue, a Bayesian Temporal Convolutional Network (BTCN) is proposed for unsupervised, uncertainty-aware GPS trajectory segmentation [96]. The BTCN algorithm can capture the uncertainty of different traffic patterns in the trajectory segmentation process. Specifically, this method uses a convolutional neural network to map GPS trajectory points into a one-dimensional space and uses a variational autoencoder to capture the latent distribution between trajectory points, improving the representation ability of trajectory points. Then, Bayesian inference is used to estimate the posterior distribution of trajectory points, and Monte Carlo dropout sampling is used for variational inference. This method can more accurately capture the uncertainty in the trajectory segmentation process, improving the accuracy of segmentation. However, this method requires a large amount of computing resources and time and is not suitable for online trajectory segmentation applications.
3.3.3. Semi-Supervised Trajectory Segmentation
3.4. Map Matching
3.4.1. Geometric-Based Methods
3.4.2. Topology-Based Method
3.4.3. Probability-Statistics-Based Model
3.4.4. Advanced Model
- The HMM model performs well in sequential modeling and road network connectivity, which is why it is widely used. In the HMM-based map-matching algorithm, the observation sequence represents the trajectory sample points, and the state sequence represents the potential matching path points. The key of this algorithm is to transform the map-matching problem into the decoding problem of HMM, to find the state sequence with the maximum joint probability, which means selecting the maximum probability value and its corresponding previous state at each time step, and, finally, obtaining the optimal state sequence corresponding to the maximum joint probability value. The most representative research in this field is the algorithm proposed by Newson et al. [118]. They assumed that the absolute difference between the Euclidean distance of adjacent sampling points and the path distance between two matched points follows an exponential distribution, and designed the transition probability based on this assumption. The smaller the difference, the higher the transition probability. This method uses the Viterbi algorithm to calculate the sequence of vehicle travel paths.
- 2.
- The maximum-weight-based map-matching algorithm converts the map-matching problem into a problem of minimizing distance weights. By establishing a scoring system, this system evaluates each candidate path based on multiple features and selects the optimal path with the highest score. Compared to the HMM model, the advantage of this method is its higher flexibility in handling complex paths. However, a drawback of weight-based map-matching algorithms is that they require higher data quality for the collected point data. Therefore, data preprocessing is needed to improve the data quality.
- 3.
- Deep-learning-based map-matching algorithms have become popular in recent years. The map-matching algorithms based on the HMM model or weight-based methods do not take into account the potential value of historical trajectories, including the historical trajectories of the same vehicle and the trajectories of other vehicles passing through similar road segments. With the rise of deep learning algorithms, researchers have approached the problem from a data-driven perspective. By using deep learning methods, the large-scale trajectory data can be maximally utilized in the map-matching process. This approach reduces the impact of sampling frequency and trajectory noise on the matching results, greatly improving the accuracy of map matching. However, a drawback of these algorithms is that they require a large amount of labeled data to train the model parameters, and obtaining labeled training data is not always easy.
3.5. Interplay and Boundaries Between Preprocessing Tasks
4. Public Dataset
A Framework for Comparative Evaluation and Benchmarking
- Datasets: Utilizing diverse public datasets to cover various scenarios (e.g., GeoLife for urban pedestrian and vehicle trajectories, T-Drive for taxi data, HighD for highway driving, and UAV datasets for free-moving objects).
- Unified Metrics: Adopting a core set of metrics for each task. For example, evaluating compression algorithms should always report the Compression Ratio and Synchronous Euclidean Distance (SED). Map-matching algorithms should be compared on Accuracy and Running Time under different conditions.
- Testing Dimensions: Crucially, methods must be evaluated across dimensions as follows:
- (1)
- Varying Sampling Rates: From high-frequency (1 s) to low-frequency (5 min).
- (2)
- Induced Noise Levels: Adding synthetic Gaussian noise or outliers to test robustness.
- (3)
- Road-Network Density: Comparing performance in sparse rural networks versus dense urban grids.
5. Discussion
- (1)
- Efficient Computation for Large-Scale Trajectory Processing
- (2)
- Deep-Learning-based Preprocessing
- (3)
- Semantic Trajectory Tagging with Context Awareness
- (4)
- Trajectory Data Privacy in the Era of Cloud and AI
6. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Abbreviations
| TD-TR | Top-Down Time-Ratio |
| DPTS | Direction Preserving Trajectory Simplification |
| CTEV | Compression Technique Enhanced Version |
| POI | Points Of Interest |
| TS | Time-Synchronous |
| BOPW | Before Opening Window |
| NOPW | Normal Opening Window |
| STTrace | Sampling Trajectory Threshold Race |
| SQUISH | Spatial Quality Simplification Heuristic |
| SQUISH-E | Enhanced Spatial Quality Simplification Heuristic |
| BQS | Bounded Quadrant System |
| ROPW | Reverse Order Processing Window |
| SCG | Snapping Configuration Graph |
| FPA | Following Path Algorithm |
| MMTC | Map-Matched Trajectory Compression |
| MDL | Minimal Description Length |
| PRESS | Paralleled Road Network Based Trajectory Compression |
| COMPRESS | Comprehensive Paralleled Road Network Based Trajectory Compression |
| CiNCT | Compressed-Index for Network Constrained Trajectories |
| VTracer | Vehicle Tracing |
| STC | Semantic Trajectory Compression |
| EHSTC | Enhanced Semantic Trajectory Compression |
| SATC | Semantic-Aware Trajectory Compression |
| GR-B | Gpstrajectory Compression |
| STSS | Semantics Based Trajectory Segmentation Simplification |
| ROCE | Region Based Online Trajectory Compression with Error Bounded |
| SPD | Stay Point Detection |
| WS-II | Wise Sliding Window Segmentation |
| TRACLUS | Trajectory Clustering |
| SMoT | Simplification Method of Single Stop Segment of Trajectory |
| CB-SMoT | Clustering-Based Simplification Method of Single Stop Segment of Trajectory |
| DB- SMoT | Direction-Based Simplification Method of Single Stop Segment of Trajectory |
| SQE | Sum Of Quadratic Error |
| WKM | Warped K-Means Model |
| SeTraStream | Semantic-aware trajectory construction over streaming movement |
| GRASP-UTS | Greedy Randomized Adaptive Search Procedure for Unsupervised Trajectory Segmentation |
| TS-TF | Trajectory Segmentation Based on Multiple Motion Features |
| OWS | Octal Window Segmentation |
| SWS | Sliding Window Segmentation |
| CNN | Convolutional Neural Network |
| BTCN | Bayesian Temporal Convolutional Network |
| RGRASP-SemTS | Reactive Greedy Randomized Search Strategy for Semi-Supervised Semantic Trajectory Segmentation |
| SECA | Semi-Supervised Convolutional Autoencoder |
| MM | Map Matching |
| MHT | Multiple Hypothesis Tracking |
| ATMM | Advanced Topological Map Matching |
| HFTMM | High-Frequency Trajectory Map Matching |
| MHT-MM | Map-Matching Method Using the Multiple Hypothesis Technique |
| MDP-MM | Map-Matching Algorithm for Large-Scale Low-Frequency Floating Car Data |
| ST-CRF | Spatial And Temporal Conditional Random Field |
| MCM | Multiple Candidate Matching |
| PMHT-MM | Probabilistic Multiple Hypotheses Tracking Map Matching |
| MCMC | Markov Chain Monte Carlo |
| HMM | Hidden Markov Map Matching |
| OHMM | Online Map-Matching Based on Hidden Markov Model |
| FMM | Fast Map Matching, An Algorithm Integrating Hidden Markov Model |
| OM2 | Off-Line Map-Matching |
| INC-RB | Incremental Route Inference Algorithm With Rollback |
| ST-Matching | Spatial Temporal Map-Matching |
| IVMM | Interactive Voting-Based Map Matching |
| STP-IWC | Spatio-Temporal Proximity and Improved Weighted Circle |
| AMM | Adaptive Online Map Matching |
| HRIS | History-Based Route Inference System |
| DMM | Deep-Learning-Based Map-Matching |
| TMM | Transformer-Based Map-Matching |
| L2MM | Learning To Map Matching with Deep Models |
References
- Wang, S.; Li, L.; Ma, W.; Chen, X. Trajectory analysis for on-demand services: A survey focusing on spatial-temporal demand and supply patterns. Transp. Res. Part C Emerg. Technol. 2019, 108, 74–99. [Google Scholar] [CrossRef]
- Xiong, W.; Wang, X.; Li, H. Efficient large-scale GPS trajectory compression on spark: A pipeline-based approach. Electronics 2023, 12, 3569. [Google Scholar] [CrossRef]
- Guo, P. Optimized Unsupervised Semantic Trajectory Mining for Personalized Tourism Recommendations. Informatica 2025, 49. [Google Scholar] [CrossRef]
- Zhang, P. Distributed Computing and Unsupervised Deep Learning for Analyzing Human Travel Behaviors Using Big Trajectory Data. University of Maryland, College Park. 2025. Available online: https://www.proquest.com/dissertations-theses/distributed-computing-unsupervised-deep-learning/docview/3250258686/se-2 (accessed on 22 November 2025).
- Sheng, H.; Wang, T.; Luo, Y.; Liang, H. A review of trajectory data preprocessing and mining technology research. In Proceedings of the 2024 4th International Conference on Big Data, Artificial Intelligence and Risk Management, Shanghai, China, 19–21 January 2024; pp. 45–50. [Google Scholar] [CrossRef]
- Chen, J.; Zhang, H.; Li, W.; Shibasaki, R. Spatio-temporal data preprocessing technologies. In Big Data and Mobility as a Service; Elsevier: Amsterdam, The Netherlands, 2022; pp. 25–75. [Google Scholar] [CrossRef]
- Feng, Z.; Zhu, Y. A survey on trajectory data mining: Techniques and applications. IEEE Access 2016, 4, 2056–2067. [Google Scholar] [CrossRef]
- Mazimpaka, J.D.; Timpf, S. Trajectory data mining: A review of methods and applications. J. Spat. Inf. Sci. 2016, 13, 61–99. [Google Scholar] [CrossRef]
- Ribeiro de Almeida, D.; de Souza Baptista, C.; Gomes de Andrade, F.; Soares, A. A survey on big data for trajectory analytics. ISPRS Int. J. Geo Inf. 2020, 9, 88. [Google Scholar] [CrossRef]
- Wang, D.; Miwa, T.; Morikawa, T. Big trajectory data mining: A survey of methods, applications, and services. Sensors 2020, 20, 4571. [Google Scholar] [CrossRef] [PubMed]
- Wang, S.; Bao, Z.; Culpepper, J.S.; Cong, G. A survey on trajectory data management, analytics, and learning. ACM Comput. Surv. 2021, 54, 1–36. [Google Scholar] [CrossRef]
- Zheng, Y. Trajectory data mining: An overview. ACM Trans. Intell. Syst. Technol. 2015, 6, 1–41. [Google Scholar] [CrossRef]
- Lee, W.-C.; Krumm, J. Trajectory preprocessing. In Computing with Spatial Trajectories; Springer: Berlin/Heidelberg, Germany, 2011; pp. 3–33. [Google Scholar] [CrossRef]
- Amigo, D.; Sánchez Pedroche, D.; García, J.; Molina, J.M. Review and classification of trajectory summarisation algorithms: From compression to segmentation. Int. J. Distrib. Sens. Netw. 2021, 17, 15501477211050729. [Google Scholar] [CrossRef]
- Sun, P.; Xia, S.; Yuan, G.; Li, D. An overview of moving object trajectory compression algorithms. Math. Probl. Eng. 2016, 5, 1–13. [Google Scholar] [CrossRef]
- Muckell, J.; Olsen, P.W.; Hwang, J.-H.; Lawson, C.T.; Ravi, S. Compression of trajectory data: A comprehensive evaluation and new approach. GeoInformatica 2014, 18, 435–460. [Google Scholar] [CrossRef]
- Chao, P.; Xu, Y.; Hua, W.; Zhou, X. A survey on map-matching algorithms. In Proceedings of the Databases Theory and Applications: 31st Australasian Database Conference, ADC 2020, Melbourne, VIC, Australia, 3–7 February 2020; pp. 121–133. [Google Scholar] [CrossRef]
- Huang, Z.; Qiao, S.; Han, N.; Yuan, C.A.; Song, X.; Xiao, Y. Survey on vehicle map matching techniques. CAAI Trans. Intell. Technol. 2021, 6, 55–71. [Google Scholar] [CrossRef]
- Hashemi, M.; Karimi, H.A. A critical review of real-time map-matching algorithms: Current issues and future directions. Comput. Environ. Urban Syst. 2014, 48, 153–165. [Google Scholar] [CrossRef]
- Sousa, R.S.D.; Boukerche, A.; Loureiro, A.A. Vehicle trajectory similarity: Models, methods, and applications. ACM Comput. Surv. 2020, 53, 1–32. [Google Scholar] [CrossRef]
- Li, H.; Ma, D.; Yan, Z.; Fu, J.; Zeng, M.; Bao, W. Algorithm of Vehicle’s Data Cleaning and Monitoring. In Journal of Physics: Conference Series; IOP Publishing: Bristol, UK, 2021; Volume 1828, p. 012052. [Google Scholar] [CrossRef]
- Xia, X.; Meng, Z.; Han, X.; Li, H.; Tsukiji, T.; Xu, R.; Zheng, Z.; Ma, J. An automated driving systems data acquisition and analytics platform. Transp. Res. Part C Emerg. Technol. 2023, 151, 104120. [Google Scholar] [CrossRef]
- Zhu, S.; Yue, H.; Suzuki, T.; Kim, I.; Yu, L.; Lan, Q. A UWB/INS Trajectory Tracking System Application in a Cycling Safety Study. Sensors 2023, 23, 3629. [Google Scholar] [CrossRef]
- Khodarahmi, M.; Maihami, V. A review on Kalman filter models. Arch. Comput. Methods Eng. 2023, 30, 727–747. [Google Scholar] [CrossRef]
- Liu, Y.; Yang, Z. Trajectory Smoothing Algorithm Based on Kalman Filter. In Proceedings of the 2023 7th International Conference on Machine Vision and Information Technology (CMVIT), Guangzhou, China, 24–26 February 2023; pp. 52–56. [Google Scholar] [CrossRef]
- Yuan, G.; Zhu, M.; Qiao, S.; Wang, Z.; Zhang, L. Sparse high-noise GPS trajectory data compression and recovery based on compressed sensing. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 2018, 101, 811–821. [Google Scholar] [CrossRef]
- Feng, D.; Zhang, H.; Song, X. Noise filter method for mobile trajectory data. In Handbook of Mobility Data Mining; Elsevier: Amsterdam, The Netherlands, 2023; pp. 35–50. [Google Scholar] [CrossRef]
- Wu, R.; Luo, G.; Shao, J.; Tian, L.; Peng, C. Location prediction on trajectory data: A review. Big Data Min. Anal. 2018, 1, 108–127. [Google Scholar] [CrossRef]
- Marczak, F.; Buisson, C. New filtering method for trajectory measurement errors and its comparison with existing methods. Transp. Res. Rec. 2012, 2315, 35–46. [Google Scholar] [CrossRef]
- Vaibhav, M.; Rahul, K. Trajectory prediction and tracking using a multi-behaviour social particle filter. Appl. Intell. 2022, 52, 7158–7200. [Google Scholar] [CrossRef]
- Zhang, J.; Sun, Y. An Automatic Data Cleaning Method for GPS Trajectory Data on Didi Chuxing GAIA Open Dataset Using Machine Learning Algorithms. In Proceedings of the 2019 6th International Conference on Systems and Informatics (ICSAI), Shanghai, China, 2–4 November 2019; pp. 1522–1526. [Google Scholar] [CrossRef]
- Li, L.; Chen, X.; Liu, Q.; Bao, Z. A data-driven approach for GPS trajectory data cleaning. In Proceedings of the Database Systems for Advanced Applications: 25th International Conference, DASFAA 2020, Jeju, Republic of Korea, 24–27 September 2020; Springer: Cham, Switzerland, 2020; pp. 3–19. [Google Scholar] [CrossRef]
- Xie, Y.; Xu, F.; Wang, Q.; Han, W. Data Construction Method of Unmanned Underwater Vehicle Test Scene Based on Linear Interpolation. In Proceedings of the 2021 IEEE International Conference on Unmanned Systems (ICUS), Beijing, China, 15–17 October 2021; pp. 260–265. [Google Scholar] [CrossRef]
- Early, J.J.; Sykulski, A.M. Smoothing and interpolating noisy GPS data with smoothing splines. J. Atmos. Ocean. Technol. 2020, 37, 449–465. [Google Scholar] [CrossRef]
- Ambrósio, J.; Antunes, P.; Pombo, J. On the requirements of interpolating polynomials for path motion constraints. In Interdisciplinary Applications of Kinematics, Proceedings of the International Conference, Lima, Peru, 9–11 September 2014; Springer: Cham, Switzerland; pp. 179–197. [CrossRef]
- Markovsky, I.; Dörfler, F. Data-driven dynamic interpolation and approximation. Automatica 2022, 135, 110008. [Google Scholar] [CrossRef]
- Guo, S.; Mou, J.; Chen, L.; Chen, P. Improved kinematic interpolation for AIS trajectory reconstruction. Ocean. Eng. 2021, 234, 109256. [Google Scholar] [CrossRef]
- Venthuruthiyil, S.P.; Chunchu, M. Vehicle path reconstruction using Recursively Ensembled Low-pass filter (RELP) and adaptive tri-cubic kernel smoother. Transp. Res. Part C Emerg. Technol. 2020, 120, 102847. [Google Scholar] [CrossRef]
- Zhao, J.; Yang, X.; Zhang, C. Vehicle trajectory reconstruction for intersections: An integrated wavelet transform and Savitzky-Golay filter approach. Transp. A Transp. Sci. 2023, 20, 2163207. [Google Scholar] [CrossRef]
- Aftab, W.; Mihaylova, L. A learning Gaussian process approach for maneuvering target tracking and smoothing. IEEE Trans. Aerosp. Electron. Syst. 2020, 57, 278–292. [Google Scholar] [CrossRef]
- Li, L.; Pagnucco, M.; Song, Y. Graph-based spatial transformer with memory replay for multi-future pedestrian trajectory prediction. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 18–22 June 2023; Vancouver, BC, Canada; IEEE: New York, NY, USA, 2022; pp. 2231–2241. Available online: https://openaccess.thecvf.com/content/CVPR2022/html/Li_Graph-Based_Spatial_Transformer_With_Memory_Replay_for_Multi-Future_Pedestrian_Trajectory_CVPR_2022_paper.html (accessed on 22 November 2025).
- Lasota, P.A.; Shah, J.A. Bayesian estimator for partial trajectory alignment. In Proceedings of the Robotics: Science and Systems, Freiburg, Germany, 22–26 June 2019. [Google Scholar] [CrossRef]
- Taylor, J.; Zhou, X.; Rouphail, N.M.; Porter, R.J. Method for investigating intradriver heterogeneity using vehicle trajectory data: A dynamic time warping approach. Transp. Res. Part B Methodol. 2015, 73, 59–80. [Google Scholar] [CrossRef]
- Chen, L.; Shang, S.; Feng, S.; Kalnis, P. Parallel subtrajectory alignment over massive-scale trajectory data. In Proceedings of the International Joint Conferences on Artificial Intelligence Organization, Montreal, QC, Canada, 19–27 August 2021. [Google Scholar] [CrossRef]
- Reyes Zambrano, G. GPS trajectory compression algorithm. In Proceedings of the Computer and Communication Engineering: First International Conference, ICCCE 2018, Guayaquil, Ecuador, 25–27 October 2018; Springer: Berlin/Heidelberg, Germany, 2019; pp. 57–69. [Google Scholar] [CrossRef]
- Douglas, D.H.; Peucker, T.K. Algorithms for the reduction of the number of points required to represent a digitized line or its caricature. Cartogr. Int. J. Geogr. Inf. Geovisualization 1973, 10, 112–122. [Google Scholar] [CrossRef]
- Hershberger, J.E.; Snoeyink, J. Speeding Up the Douglas-Peucker Line-Simplification Algorithm; University of British Columbia: Vancouver, BC, Canada, 1992. [Google Scholar]
- Meratnia, N.; de By, R.A. Spatiotemporal compression techniques for moving point objects. In Proceedings of the Advances in Database Technology-EDBT 2004: 9th International Conference on Extending Database Technology, Heraklion, Crete, Greece, 14–18 March 2004; Springer: Berlin/Heidelberg, Germany, 2004; pp. 765–782. [Google Scholar] [CrossRef]
- Long, C.; Wong, R.C.-W.; Jagadish, H. Direction-preserving trajectory simplification. Proc. VLDB Endow. 2013, 6, 949–960. [Google Scholar] [CrossRef]
- Bashir, M.; Ashraf, J.; Habib, A.; Muzammil, M. An intelligent linear time trajectory data compression framework for smart planning of sustainable metropolitan cities. Trans. Emerg. Telecommun. Technol. 2022, 33, e3886. [Google Scholar] [CrossRef]
- Keogh, E.; Chu, S.; Hart, D.; Pazzani, M. An online algorithm for segmenting time series. In Proceedings of the 2001 IEEE international conference on data mining, San Jose, CA, USA, 29 November–2 December 2001; pp. 289–296. [Google Scholar] [CrossRef]
- Potamias, M.; Patroumpas, K.; Sellis, T. Sampling trajectory streams with spatiotemporal criteria. In Proceedings of the 18th International Conference on Scientific and Statistical Database Management (SSDBM’06), Vienna, Austria, 3–5 July 2006; pp. 275–284. [Google Scholar] [CrossRef]
- Muckell, J.; Hwang, J.-H.; Patil, V.; Lawson, C.T.; Ping, F.; Ravi, S. SQUISH: An online approach for GPS trajectory compression. In Proceedings of the 2nd International Conference on Computing for Geospatial Research & Applications, Washington, DC, USA, 23–25 May 2011; pp. 1–8. [Google Scholar] [CrossRef]
- Liu, J.; Zhao, K.; Sommer, P.; Shang, S.; Kusy, B.; Jurdak, R. Bounded quadrant system: Error-bounded trajectory compression on the go. In Proceedings of the 2015 IEEE 31st International Conference on Data Engineering, Seoul, Republic of Korea, 13–17 April 2015; pp. 987–998. [Google Scholar] [CrossRef]
- Li, S.; Zhang, K.; Yin, H.; Yin, D.; Zu, H.; Gao, H. ROPW: An online trajectory compression algorithm. In Database Systems for Advanced Applications, Proceedings of the DASFAA 2021 International Workshops: BDQM, GDMA, MLDLDSA, MobiSocial, and MUST, Taipei, Taiwan, 11–14 April 2021; Springer: Cham, Switzerland, 2021; pp. 16–28. [Google Scholar] [CrossRef]
- Lin, X.; Ma, S.; Jiang, J.; Hou, Y.; Wo, T. Error bounded line simplification algorithms for trajectory compression: An experimental evaluation. ACM Trans. Database Syst. 2021, 46, 1–44. [Google Scholar] [CrossRef]
- Makris, A.; Silva, C.L.d.; Bogorny, V.; Alvares, L.O.; Macedo, J.A.; Tserpes, K. Evaluating the effect of compressing algorithms for trajectory similarity and classification problems. GeoInformatica 2021, 25, 679–711. [Google Scholar] [CrossRef]
- Cao, H.; Wolfson, O. Nonmaterialized motion information in transport networks. In Proceedings of the Database Theory-ICDT 2005: 10th International Conference, Edinburgh, UK, 5–7 January 2005; pp. 173–188. [Google Scholar] [CrossRef]
- Lerin, P.M.; Yamamoto, D.; Takahashi, N. Encoding travel traces by using road networks and routing algorithms. In Intelligent Interactive Multimedia: Systems and Services, Proceedings of the 5th International Conference on Intelligent Interactive Multimedia Systems and Services (IIMSS 2012), Gifu, Japan, 25–27 July 2012; Springer: Berlin/Heidelberg, Germany, 2012; pp. 233–243. [Google Scholar] [CrossRef]
- Kellaris, G.; Pelekis, N.; Theodoridis, Y. Trajectory compression under network constraints. In Proceedings of the Advances in Spatial and Temporal Databases: 11th International Symposium, SSTD 2009, Aalborg, Denmark, 8–10 July 2009; Springer: Berlin/Heidelberg, Germany, 2009; pp. 392–398. [Google Scholar] [CrossRef]
- Kellaris, G.; Pelekis, N.; Theodoridis, Y. Map-matched trajectory compression. J. Syst. Softw. 2013, 86, 1566–1579. [Google Scholar] [CrossRef]
- Song, R.; Sun, W.; Zheng, B.; Zheng, Y. PRESS: A novel framework of trajectory compression in road networks. arXiv 2014, arXiv:1402.1546. [Google Scholar] [CrossRef]
- Han, Y.; Sun, W.; Zheng, B. COMPRESS: A comprehensive framework of trajectory compression in road networks. ACM Trans. Database Syst. 2017, 42, 1–49. [Google Scholar] [CrossRef]
- Koide, S.; Tadokoro, Y.; Xiao, C.; Ishikawa, Y. CiNCT: Compression and retrieval for massive vehicular trajectories via relative movement labeling. In Proceedings of the 2018 IEEE 34th International Conference on Data Engineering (ICDE), Paris, France, 16–19 April 2018; pp. 1097–1108. [Google Scholar] [CrossRef]
- Zhao, P.; Zhao, Q.; Zhang, C.; Su, G.; Zhang, Q.; Rao, W. CLEAN: Frequent pattern-based trajectory compression and computation on road networks. China Commun. 2020, 17, 119–136. [Google Scholar] [CrossRef]
- Chen, Q.; Cao, J.; Xia, Y. Physics-enhanced pca for data compression in edge devices. IEEE Trans. Green Commun. Netw. 2022, 6, 1624–1634. [Google Scholar] [CrossRef]
- Li, T.; Chen, L.; Jensen, C.S.; Pedersen, T.B. TRACE: Real-time compression of streaming trajectories in road networks. Proc. VLDB Endow. 2021, 14, 1175–1187. [Google Scholar] [CrossRef]
- Chen, C.; Ding, Y.; Xie, X.; Zhang, S.; Wang, Z.; Feng, L. TrajCompressor: An online map-matching-based trajectory compression framework leveraging vehicle heading direction and change. IEEE Trans. Intell. Transp. Syst. 2019, 21, 2012–2028. [Google Scholar] [CrossRef]
- Chen, C.; Ding, Y.; Wang, Z.; Zhao, J.; Guo, B.; Zhang, D. VTracer: When online vehicle trajectory compression meets mobile edge computing. IEEE Syst. J. 2019, 14, 1635–1646. [Google Scholar] [CrossRef]
- Schmid, F.; Richter, K.-F.; Laube, P. Semantic trajectory compression. In Proceedings of the Advances in Spatial and Temporal Databases: 11th International Symposium, SSTD 2009, Aalborg, Denmark, 8–10 July 2009; Springer: Berlin/Heidelberg, Germany, 2009; pp. 411–416. [Google Scholar] [CrossRef]
- Richter, K.-F.; Schmid, F.; Laube, P. Semantic trajectory compression: Representing urban movement in a nutshell. J. Spat. Inf. Sci. 2012, 4, 3–30. [Google Scholar] [CrossRef]
- Feng, S.; Xu, J.; Xu, M.; Zheng, N.; Zhang, X. EHSTC: An enhanced method for semantic trajectory compression. In Proceedings of the 4th ACM SIGSPATIAL International Workshop on GeoStreaming, Orlando, FL, USA, 5–8 November 2013; pp. 43–49. [Google Scholar] [CrossRef]
- Su, H.; Zheng, K.; Zeng, K.; Huang, J.; Zhou, X. STMaker: A system to make sense of trajectory data. Proc. VLDB Endow. 2014, 7, 1701–1704. [Google Scholar] [CrossRef]
- Su, H.; Zheng, K.; Zeng, K.; Huang, J.; Sadiq, S.; Yuan, N.J.; Zhou, X. Making sense of trajectory data: A partition-and-summarization approach. In Proceedings of the 2015 IEEE 31st International Conference on Data Engineering, Seoul, Republic of Korea, 13–17 April 2015; pp. 963–974. [Google Scholar] [CrossRef]
- Liu, M.; He, G.; Long, Y. A semantics-based trajectory segmentation simplification method. J. Geovisualization Spat. Anal. 2021, 5, 19. [Google Scholar] [CrossRef]
- Zhang, K.; Zhao, D.; Liu, W. Online vehicle trajectory compression algorithm based on motion pattern recognition. IET Intell. Transp. Syst. 2022, 16, 998–1010. [Google Scholar] [CrossRef]
- Chen, H.; Chen, X. A trajectory ensemble-compression algorithm based on finite element method. ISPRS Int. J. Geo-Inf. 2021, 10, 334. [Google Scholar] [CrossRef]
- Zhao, Y.; Shang, S.; Wang, Y.; Zheng, B.; Nguyen, Q.V.H.; Zheng, K. Rest: A reference-based framework for spatio-temporal trajectory compression. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, London, UK, 19–23 August 2018; pp. 2797–2806. [Google Scholar] [CrossRef]
- Zheng, K.; Zhao, Y.; Lian, D.; Zheng, B.; Liu, G.; Zhou, X. Reference-based framework for spatio-temporal trajectory compression and query processing. IEEE Trans. Knowl. Data Eng. 2019, 32, 2227–2240. [Google Scholar] [CrossRef]
- Ta, N.; Li, G.; Chen, B.; Feng, J. Semantic-aware trajectory compression with urban road network. In Proceedings of the Web-Age Information Management: 17th International Conference, WAIM 2016, Nanchang, China, 3–5 June 2016; Springer: Cham, Switzerland, 2016; pp. 124–136. [Google Scholar] [CrossRef]
- Yin, H.; Gao, H.; Wang, B.; Li, S.; Li, J. Efficient trajectory compression and range query processing. World Wide Web 2022, 25, 1259–1285. [Google Scholar] [CrossRef]
- Zheng, Y.; Zhang, L.; Ma, Z.; Xie, X.; Ma, W.-Y. Recommending friends and locations based on individual location history. ACM Trans. Web 2011, 5, 1–44. [Google Scholar] [CrossRef]
- Etemad, M.; Etemad, Z.; Soares, A.; Bogorny, V.; Matwin, S.; Torgo, L. Wise sliding window segmentation: A classification-aided approach for trajectory segmentation. In Proceedings of the Advances in Artificial Intelligence: 33rd Canadian Conference on Artificial Intelligence, Canadian AI 2020, Ottawa, ON, Canada, 13–15 May 2020; Springer: Cham, Switzerland, 2020; pp. 208–219. [Google Scholar] [CrossRef]
- Lee, J.-G.; Han, J.; Whang, K.-Y. Trajectory clustering: A partition-and-group framework. In Proceedings of the 2007 ACM SIGMOD International Conference on Management of Data, Beijing, China, 11–14 June 2007; pp. 593–604. [Google Scholar] [CrossRef]
- Alvares, L.O.; Bogorny, V.; Kuijpers, B.; de Macedo, J.A.F.; Moelans, B.; Vaisman, A. A model for enriching trajectories with semantic geographical information. In Proceedings of the 15th Annual ACM International Symposium on Advances in Geographic Information Systems, Seattle, WA, USA, 7–9 November 2007; pp. 1–8. [Google Scholar] [CrossRef]
- Palma, A.T.; Bogorny, V.; Kuijpers, B.; Alvares, L.O. A clustering-based approach for discovering interesting places in trajectories. In Proceedings of the 2008 ACM Symposium on Applied Computing, Fortaleza, Brazil, 16–20 March 2008; pp. 863–868. [Google Scholar] [CrossRef]
- Rocha, J.A.M.; Times, V.C.; Oliveira, G.; Alvares, L.O.; Bogorny, V. DB-SMoT: A direction-based spatio-temporal clustering method. In Proceedings of the 2010 5th IEEE International Conference Intelligent Systems, London, UK, 7–9 July 2010; pp. 114–119. [Google Scholar] [CrossRef]
- Leiva, L.A.; Vidal, E. Warped k-means: An algorithm to cluster sequentially-distributed data. Inf. Sci. 2013, 237, 196–210. [Google Scholar] [CrossRef]
- Buchin, M.; Driemel, A.; Kreveld, M.V.; Sacristán Adinolfi, V. Segmenting trajectories: A framework and algorithms using spatiotemporal criteria. J. Spat. Inf. Sci. 2011, 3, 33–63. [Google Scholar] [CrossRef]
- Buchin, M.; Driemel, A.; Van Kreveld, M.; Sacristán, V. An algorithmic framework for segmenting trajectories based on spatio-temporal criteria. In Proceedings of the 18th SIGSPATIAL International Conference on Advances in Geographic Information Systems, San Jose, CA, USA, 2–5 November 2010; pp. 202–211. [Google Scholar] [CrossRef]
- Yan, Z.; Giatrakos, N.; Katsikaros, V.; Pelekis, N.; Theodoridis, Y. SeTraStream: Semantic-aware trajectory construction over streaming movement data. In Proceedings of the Advances in Spatial and Temporal Databases: 12th International Symposium, SSTD 2011, Minneapolis, MN, USA, 24–26 August 2011; Springer: Berlin/Heidelberg, Germany, 2011; pp. 367–385. [Google Scholar] [CrossRef]
- Soares Júnior, A.; Moreno, B.N.; Times, V.C.; Matwin, S.; Cabral, L.d.A.F. GRASP-UTS: An algorithm for unsupervised trajectory segmentation. Int. J. Geogr. Inf. Sci. 2015, 29, 46–68. [Google Scholar] [CrossRef]
- Xu, W.; Dong, S. Application of artificial intelligence in an unsupervised algorithm for trajectory segmentation based on multiple motion features. Wirel. Commun. Mob. Comput. 2022, 2022, 9540944. [Google Scholar] [CrossRef]
- Etemad, M.; Júnior, A.S.; Hoseyni, A.; Rose, J.; Matwin, S. A Trajectory Segmentation Algorithm Based on Interpolation-Based Change Detection Strategies; EDBT/ICDT Workshops: Lisbon, Portugal, 2019; p. 58. [Google Scholar] [CrossRef]
- Etemad, M.; Soares, A.; Etemad, E.; Rose, J.; Torgo, L.; Matwin, S. SWS: An unsupervised trajectory segmentation algorithm based on change detection with interpolation kernels. GeoInformatica 2021, 25, 269–289. [Google Scholar] [CrossRef]
- Markos, C.; James, J.; Da Xu, R.Y. Capturing uncertainty in unsupervised GPS trajectory segmentation using Bayesian deep learning. In Proceedings of the AAAI Conference on Artificial Intelligence, Online, 2–9 February 2021; pp. 390–398. [Google Scholar] [CrossRef]
- Dabiri, S.; Heaslip, K. Inferring transportation modes from GPS trajectories using a convolutional neural network. Transp. Res. Part C: Emerg. Technol. 2018, 86, 360–371. [Google Scholar] [CrossRef]
- Junior, A.S.; Times, V.C.; Renso, C.; Matwin, S.; Cabral, L.A. A semi-supervised approach for the semantic segmentation of trajectories. In Proceedings of the 2018 19th IEEE International Conference on Mobile Data Management (MDM), Aalborg, Denmark, 25–28 June 2018; pp. 145–154. [Google Scholar] [CrossRef]
- Dabiri, S.; Lu, C.-T.; Heaslip, K.; Reddy, C.K. Semi-supervised deep learning approach for transportation mode identification using GPS trajectory data. IEEE Trans. Knowl. Data Eng. 2019, 32, 1010–1023. [Google Scholar] [CrossRef]
- Quddus, M.A.; Ochieng, W.Y.; Noland, R.B. Current map-matching algorithms for transport applications: State-of-the art and future research directions. Transp. Res. Part C Emerg. Technol. 2007, 15, 312–328. [Google Scholar] [CrossRef]
- Wei, H.; Wang, Y.; Forman, G.; Zhu, Y. Map matching: Comparison of approaches using sparse and noisy data. In Proceedings of the 21st ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, Orlando, FL, USA, 5–8 November 2013; pp. 444–447. [Google Scholar] [CrossRef]
- Bernstein, D.; Kornhauser, A. An introduction to map matching for personal navigation assistants. Transp. Res. Board 1998, 122, 1082–1083. [Google Scholar] [CrossRef]
- White, C.E.; Bernstein, D.; Kornhauser, A.L. Some map matching algorithms for personal navigation assistants. Transp. Res. Part C: Emerg. Technol. 2000, 8, 91–108. [Google Scholar] [CrossRef]
- Taylor, G.; Blewitt, G.; Steup, D.; Corbett, S.; Car, A. Road reduction filtering for GPS--GIS navigation. Trans. GIS 2001, 5, 193–207. [Google Scholar] [CrossRef]
- Brakatsoulas, S.; Pfoser, D.; Salas, R.; Wenk, C. On map-matching vehicle tracking data. In Proceedings of the 31st International Conference on Very Large Data Bases, Trondheim, Norway, 30 August–2 September 2005; pp. 853–864. Available online: https://dl.acm.org/doi/epdf/10.5555/1083592.1083691 (accessed on 22 November 2025).
- Alt, H.; Efrat, A.; Rote, G.; Wenk, C. Matching planar maps. J. Algorithms 2003, 49, 262–283. [Google Scholar] [CrossRef]
- Quddus, M.A.; Ochieng, W.Y.; Zhao, L.; Noland, R.B. A general map matching algorithm for transport telematics applications. GPS Solut. 2003, 7, 157–167. [Google Scholar] [CrossRef]
- Chawathe, S.S. Segment-based map matching. In Proceedings of the 2007 IEEE Intelligent Vehicles Symposium, Istanbul, Turkey, 13–15 June 2007; pp. 1190–1197. [Google Scholar] [CrossRef]
- Zhao, X.; Cheng, X.; Zhou, J.; Xu, Z.; Dey, N.; Ashour, A.S.; Satapathy, S.C. Advanced topological map matching algorithm based on D–S theory. Arab. J. Sci. Eng. 2018, 43, 3863–3874. [Google Scholar] [CrossRef]
- Yu, Q.; Hu, F.; Ye, Z.; Chen, C.; Sun, L.; Luo, Y. High-frequency trajectory map matching algorithm based on road network topology. IEEE Trans. Intell. Transp. Syst. 2022, 23, 17530–17545. [Google Scholar] [CrossRef]
- Pyo, J.-S.; Shin, D.-H.; Sung, T.-K. Development of a map matching method using the multiple hypothesis technique. In Proceedings of the ITSC 2001. 2001 IEEE Intelligent Transportation Systems. Proceedings (Cat. No. 01TH8585), Oakland, CA, USA, 25–29 August 2001; pp. 23–27. [Google Scholar] [CrossRef]
- Marchal, F.; Hackney, J.; Axhausen, K.W. Efficient map matching of large global positioning system data sets: Tests on speed-monitoring experiment in Zürich. Transp. Res. Rec. 2005, 1935, 93–100. [Google Scholar] [CrossRef]
- Schuessler, N.; Axhausen, K.W. Map-Matching of GPS Traces on High-Resolution Navigation Networks Using the Multiple Hypothesis Technique (MHT); Arbeitsberichte Verkehrs-und Raumplanung; ETH Zurich, Institute for Transport Planning and Systems: Zurich, Switzerland, 2009; Volume 568. [Google Scholar] [CrossRef]
- Chen, B.Y.; Yuan, H.; Li, Q.; Lam, W.H.; Shaw, S.-L.; Yan, K. Map-matching algorithm for large-scale low-frequency floating car data. Int. J. Geogr. Inf. Sci. 2014, 28, 22–38. [Google Scholar] [CrossRef]
- Liu, X.; Liu, K.; Li, M.; Lu, F. A ST-CRF map-matching method for low-frequency floating car data. IEEE Trans. Intell. Transp. Syst. 2016, 18, 1241–1254. [Google Scholar] [CrossRef]
- Li, W.; Wang, Y.; Li, D.; Xu, X. A robust map matching method by considering memorized multiple matching candidates. Theor. Comput. Sci. 2023, 941, 104–120. [Google Scholar] [CrossRef]
- Ning, S.; Liu, H.; Zhang, S.; Jiang, Y.; Han, J.; Liu, S.; Fang, J.; Tan, N.; Chai, H.; Zhang, B. Estimation and Prediction of Road Free Flow Speed with More Efficient DNN Map Matching Results. 2022. Available online: http://urban-computing.com/urbcomp2022/file/UrbComp2022_paper_0498.pdf (accessed on 22 November 2025).
- Newson, P.; Krumm, J. Hidden Markov map matching through noise and sparseness. In Proceedings of the 17th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, Seattle, WA, USA, 4–6 November 2009; pp. 336–343. [Google Scholar] [CrossRef]
- Goh, C.Y.; Dauwels, J.; Mitrovic, N.; Asif, M.T.; Oran, A.; Jaillet, P. Online map-matching based on hidden markov model for real-time traffic sensing applications. In Proceedings of the 2012 15th International IEEE Conference on Intelligent Transportation Systems, Anchorage, AK, USA, 16–19 September 2012; pp. 776–781. [Google Scholar] [CrossRef]
- Yang, C.; Gidofalvi, G. Fast map matching, an algorithm integrating hidden Markov model with precomputation. Int. J. Geogr. Inf. Sci. 2018, 32, 547–570. [Google Scholar] [CrossRef]
- Xie, Y.; Zhou, K.; Miao, F.; Zhang, Q. High-Accuracy off-line map-matching of trajectory network division based on weight adaptation HMM. IEEE Access 2020, 8, 7256–7266. [Google Scholar] [CrossRef]
- Luo, L.; Hou, X.; Cai, W.; Guo, B. Incremental route inference from low-sampling GPS data: An opportunistic approach to online map matching. Inf. Sci. 2020, 512, 1407–1423. [Google Scholar] [CrossRef]
- Lou, Y.; Zhang, C.; Zheng, Y.; Xie, X.; Wang, W.; Huang, Y. Map-matching for low-sampling-rate GPS trajectorie. In Proceedings of the 17th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, Seattle, WA, USA, 4–6 November 2009; pp. 352–361. [Google Scholar] [CrossRef]
- Yuan, J.; Zheng, Y.; Zhang, C.; Xie, X.; Sun, G.-Z. An interactive-voting based map matching algorithm. In Proceedings of the 2010 Eleventh International Conference on Mobile Data Management, Kansas City, MO, USA, 23–26 May 2010; pp. 43–52. [Google Scholar] [CrossRef]
- Teng, W.; Wang, Y. Real-time map matching: A new algorithm integrating spatio-temporal proximity and improved weighted circle. Open Geosci. 2019, 11, 288–297. [Google Scholar] [CrossRef]
- Hu, H.; Qian, S.; Ouyang, J.; Cao, J.; Han, H.; Wang, J.; Chen, Y. AMM: An Adaptive Online Map Matching Algorithm. IEEE Trans. Intell. Transp. Syst. 2023, 24, 5039–5051. [Google Scholar] [CrossRef]
- Zheng, K.; Zheng, Y.; Xie, X.; Zhou, X. Reducing uncertainty of low-sampling-rate trajectories. In Proceedings of the 2012 IEEE 28th International Conference on Data Engineering, Washington, DC, USA, 1–5 April 2012; pp. 1144–1155. [Google Scholar] [CrossRef]
- Jagadeesh, G.R.; Srikanthan, T. Online map-matching of noisy and sparse location data with hidden Markov and route choice models. IEEE Trans. Intell. Transp. Syst. 2017, 18, 2423–2434. [Google Scholar] [CrossRef]
- Zhao, K.; Feng, J.; Xu, Z.; Xia, T.; Chen, L.; Sun, F.; Guo, D.; Jin, D.; Li, Y. DeepMM: Deep learning based map matching with data augmentation. In Proceedings of the 27th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, Chicago, IL, USA, 5–8 November 2019; pp. 452–455. [Google Scholar] [CrossRef]
- Jin, Z.; Kim, J.; Yeo, H.; Choi, S. Transformer-based map-matching model with limited labeled data using transfer-learning approach. Transp. Res. Part C Emerg. Technol. 2022, 140, 103668. [Google Scholar] [CrossRef]
- Jiang, Z.; Huang, A.; Qi, G.; Guan, W. A Framework of Travel Mode Identification Fusing Deep Learning and Map-Matching Algorithm. IEEE Trans. Intell. Transp. Syst. 2023, 24, 6401–6415. [Google Scholar] [CrossRef]
- Wang, X.; Gilliam, C.; Kealy, A.; Close, J.; Moran, B. Probabilistic map matching for robust inertial navigation aiding. Navig. J. Inst. Navig. 2023, 70, navi.583. [Google Scholar] [CrossRef]
- Jiang, L.; Chen, C.-X.; Chen, C. L2mm: Learning to map matching with deep models for low-quality gps trajectory data. ACM Trans. Knowl. Discov. Data 2023, 17, 1–25. [Google Scholar] [CrossRef]
- Harder, D.; Shoushtari, H.; Sternberg, H. Real-Time Map Matching with a Backtracking Particle Filter Using Geospatial Analysis. Sensors 2022, 22, 3289. [Google Scholar] [CrossRef]
- Peker, A.U.; Tosun, O.; Acarman, T. Particle filter vehicle localization and map-matching using map topology. In Proceedings of the 2011 IEEE Intelligent Vehicles Symposium (IV), Baden-Baden, Germany, 5–9 June 2011; pp. 248–253. [Google Scholar] [CrossRef]
- Yu, Z.; Wu, H.; Yin, Z.; Liu, K.; Zhang, R. Vessel trajectory segmentation: A survey. In Proceedings of the International Conference on Database Systems for Advanced Applications, Tianjin, China, 17–20 April 2023; Springer Nature: Cham, Switzerland, 2023; pp. 166–180. [Google Scholar] [CrossRef]
- Ma, X.; Zhou, P.; He, X. Advances in multi-source navigation data fusion processing methods. Mathematics 2025, 13, 1485. [Google Scholar] [CrossRef]
- Miranda-Pascual, À.; Guerra-Balboa, P.; Parra-Arnau, J.; Forné, J.; Strufe, T. An overview of proposals towards the privacy-preserving publication of trajectory data. Int. J. Inf. Secur. 2024, 23, 3711–3747. [Google Scholar] [CrossRef]




| Survey | Data Cleaning | Data Compression | Data Segmentation | Map Matching | Datasets |
|---|---|---|---|---|---|
| (Lee and Krumm, 2011) [13] | √ | √ | |||
| (Amigo et al., 2021) [14] | √ | √ | √ | ||
| (Sun et al., 2016) [15] | √ | ||||
| (Muckell et al., 2014) [16] | √ | ||||
| (Chao et al., 2020) [17] | √ | ||||
| (Huang et al., 2021) [18] | √ | ||||
| (Hashemi and Karimi, 2014) [19] | √ | ||||
| (Sousa et al., 2020) [20] | √ | √ | |||
| This Survey | √ | √ | √ | √ | √ |
| Classification | Algorithm | Time Complexity | Error Metric | Article | |
|---|---|---|---|---|---|
| Compression based on line segment simplification | Offline Compression | Douglas–Peucker | Perpendicular Euclidean distance | (Douglas and Peucker, 1973) [46] | |
| Path Hull | Perpendicular Euclidean distance | (Hershberger and Snoeyink, 1992) [47] | |||
| TD-TR | Time-ratio distance | (Meratnia and de By, 2004) [48] | |||
| DPTS | Euclidean distance, direction, speed | (Long et al., 2013) [49] | |||
| CTEV | Euclidean distance | (Bashir et al., 2022) [50] | |||
| Online Compression | Sliding Window | Perpendicular Euclidean distance | (Keogh et al., 2001) [51] | ||
| Opening Window | Time-synchronous distance | (Meratnia and de By, 2004) [48] | |||
| STTrace | Time-synchronous distance, direction, speed | (Potamias et al., 2006) [52] | |||
| SQUISH | Time-synchronous distance, direction, speed | (Muckell et al., 2011) [53] | |||
| SQUISH-E | Time-synchronous distance, direction, speed | (Muckell et al., 2014) [16] | |||
| BQS | Euclidean distance | (Liu et al., 2015) [54] | |||
| ROPW | Perpendicular Euclidean distance | (Li et al., 2021) [55] | |||
| Road-network-constrained compression | Nonmaterialized | Perpendicular Euclidean distance | (Cao and Wolfson, 2005) [58] | ||
| Shortest Path | Euclidean distance | (Lerin et al., 2012) [59] | |||
| MMTC | A weighted average of network distance and time distance | (Kellaris et al., 2009, 2013) [60,61] | |||
| PRESS | Time synchronized network distance, network synchronized time difference | (Song et al., 2014) [62] | |||
| COMPRESS | Time synchronized network distance, network synchronized time difference | (Han et al., 2017) [63] | |||
| CiNCT | Bit-wise rank value | (Koide et al., 2018) [64] | |||
| CLEAN | Time synchronized network distance, network synchronized time difference | (Zhao et al., 2020) [65] | |||
| TrajCompressor | Perpendicular Euclidean distance | (Chen et al., 2019) [68] | |||
| VTracer | Perpendicular Euclidean distance | (Chen et al., 2019) [69] | |||
| Semantic Compression | STC | Average spatio-temporal distance | (Richter et al., 2012; Schmid et al., 2009) [70,71] | ||
| EHSTC | Perpendicular Euclidean distance | (Feng et al., 2013) [72] | |||
| STMaker | N/A | (Su et al., 2014) [73] | |||
| STSS | Homomorphic distance | (Liu et al., 2021) [75] | |||
| SATC | Synchronous Euclidean distance | (Ta et al., 2016) [80] | |||
| ROCE | Point-to-segment Euclidean distance | (Yin et al., 2022) [81] | |||
| Classification | Method | Time Complexity | Segment Metric | Article |
|---|---|---|---|---|
| Supervised trajectory segmentation | SPD | Time and distant threshold | (Zheng et al., 2011) [82] | |
| WS-II | Error threshold | (Etemad et al., 2020) [83] | ||
| Unsupervised trajectory segmentation | TRACLUS | Distance | (Lee et al., 2007) [84] | |
| SMoT | Time | (Alvares et al., 2007) [85] | ||
| CB-SMoT | Speed | (Palma et al., 2008) [86] | ||
| DB-SMoT | Direction | (Rocha et al., 2010) [87] | ||
| Warped K-Means | Criterion function | (Leiva and Vidal, 2013) [88] | ||
| Greedy segmentation | Location, heading, speed, velocity, curvature, sinuosity, and curviness. | (Buchin et al., 2011; Buchin et al., 2010) [89,90] | ||
| SeTraStream | Correlation of features | (Yan et al., 2011) [91] | ||
| GRASP-UTS | Homogeneity of features | (Soares Júnior et al., 2015) [92] | ||
| TS-MF | Similarity of multiple motion features | (Xu and Dong, 2022) [93] | ||
| OWS | Error signal | (Etemad et al., 2019) [94] | ||
| SWS | Error signal | (Etemad et al., 2021) [95] | ||
| BTCN | Transportation mode | (Markos et al., 2021) [96] | ||
| Semi-supervised trajectory segmentation | RGRASP-SemTS | Homogeneity of features | (Junior et al., 2018) [97] | |
| SECA | Homogeneity of features | (Dabiri et al., 2019) [98] |
| Classification | Method | Time Complexity | Article |
|---|---|---|---|
| Geometric-based model | PTP, PTC, CTC | (Bernstein and Kornhauser, 1996) [102] | |
| PTP, PTC, CTC | (White et al., 2000) [103] | ||
| RRF | (Taylor et al., 2001) [104] | ||
| Global Map-Matching | for weak Fréchet dist | (Brakatsoulas et al., 2005) [105] | |
| Topology-based model | MPM | (Alt et al., 2003) [106] | |
| MM | (Quddus et al., 2003) [107] | ||
| GeoTrackMapper | (Chawathe, 2007) [108] | ||
| ATMM | (Zhao et al., 2018a) [109] | ||
| HFTMM | (Yu et al., 2022) [110] | ||
| Probability-statistics-based model | MHT-MM | N/A | (Pyo et al., 2001) [111] |
| Adaptation MHT-MM | N/A | (Marchal et al., 2005; Schuessler and Axhausen, 2009) [112,113] | |
| MDP-MM | (Chen et al., 2014) [114] | ||
| ST-CRF | (Liu et al., 2016) [115] | ||
| MCM | (Li et al., 2023) [116] | ||
| PMHT-MM | N/A | (Wang et al., 2023) [117] | |
| Advanced model | HMM | (Newson and Krumm, 2009) [118] | |
| OHMM | (Goh et al., 2012) [119] | ||
| FMM | (Yang and Gidofalvi, 2018) [120] | ||
| OM2 | (Xie et al., 2020) [121] | ||
| INC-RB | (Luo et al., 2020) [122] | ||
| ST-Matching | (Lou et al., 2009) [123] | ||
| IVMM | (Yuan et al., 2010) [124] | ||
| STP-IWC | (Teng and Wang, 2019) [125] | ||
| AMM | (Hu et al., 2023) [126] | ||
| HRIS | (Zheng et al., 2012) [127] | ||
| HMM + RCM | (Jagadeesh and Srikanthan, 2017) [128] | ||
| DeepMM | N/A | (Zhao et al., 2019) [129] | |
| TMM | N/A | (Jin et al., 2022) [130] | |
| L2MM | N/A | (Jiang et al., 2023a) [131] |
| Dataset | Download Address | Object | Geography | Classification | Field | Format | Source | Year |
|---|---|---|---|---|---|---|---|---|
| GeoLife | https://www.microsoft.com/en-us/download/details.aspx?id=52367 (accessed on 22 November 2025) | Vehicle, Pedestrian | China, USA, South Korea, Japan | Urban activity trajectory data | Position coordinates, time, transportation mode, etc. | plt | Sensors, phones | 2007–2012 |
| T-Drive | https://www.microsoft.com/en-us/research/publication/t-drive-trajectory-data-sample/ (accessed on 22 November 2025) | Vehicle | Beijing, China | Taxi trajectory data | Position coordinates, time, etc. | txt | Sensors | 2011 |
| NYC-Taxi | https://www1.nyc.gov/site/tlc/about/tlc-trip-record-data.page (accessed on 22 November 2025) | Vehicle | New York, NY, USA | Taxi trajectory data | Position coordinates, time, etc. | parquet | Sensors | 2009–present |
| Mirror-Traffic | http://www.scenarios.cn (accessed on 22 November 2025) | Vehicle, Pedestrian | China | Vehicle trajectory data on highway ramps, straight roads, intersections, etc. | Mileage, time, lane, etc. | video | Camera, UAV | 2019–2020 |
| NGSIM | https://catalog.data.gov/dataset/next-generation-simulation-ngsim-vehicle-trajectories (accessed on 22 November 2025) | Vehicle | USA | Highway vehicle trajectory dataset | Position coordinates, time, etc. | txt | Sensors | 2004 |
| HighD | https://www.highd-dataset.com/ (accessed on 22 November 2025) | Vehicle | Cologne, Germany | Highway vehicle trajectory dataset | Position coordinates, velocity, acceleration, lane, etc. | csv | UAV | 2017–2018 |
| MAGIC | https://magic.tongji.edu.cn/kycg/MAGICsjj.htm (accessed on 22 November 2025) | Vehicle | Shanghai, China | Urban expressway vehicle trajectory data | Position coordinates, velocity, acceleration, lane, etc. | csv | UAV | 2020 |
| BikeShare | https://open.toronto.ca/dataset/bike-share-toronto-ridership-data/ (accessed on 22 November 2025) | Shared bike | Toronto, UK | Shared bicycle ride data | Origin–destination, time, etc. | csv | APP | 2017–2020 |
| OpenSky Network | https://opensky-network.org/ (accessed on 22 November 2025) | aircraft | Global | Aircraft trajectory data | Origin–destination, time, etc. | csv | ADS-B | 2012–present |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Li, P.; Tian, Z.; Yang, Y.; Lin, Y. Trajectory Data Preprocessing: Methods and Models. Electronics 2025, 14, 4694. https://doi.org/10.3390/electronics14234694
Li P, Tian Z, Yang Y, Lin Y. Trajectory Data Preprocessing: Methods and Models. Electronics. 2025; 14(23):4694. https://doi.org/10.3390/electronics14234694
Chicago/Turabian StyleLi, Peiyu, Zhao Tian, Yanfang Yang, and Yusong Lin. 2025. "Trajectory Data Preprocessing: Methods and Models" Electronics 14, no. 23: 4694. https://doi.org/10.3390/electronics14234694
APA StyleLi, P., Tian, Z., Yang, Y., & Lin, Y. (2025). Trajectory Data Preprocessing: Methods and Models. Electronics, 14(23), 4694. https://doi.org/10.3390/electronics14234694

