Next Article in Journal
Morphological and Mechanical Characterization of the Limpet Shells from the Korean Intertidal Coast
Previous Article in Journal
Coupled Responses and Performance Assessment of Mooring-Connection Systems for Floating Photovoltaic Arrays in Shallow Waters
Previous Article in Special Issue
Development of an Initial Burial Rate Estimation Simulator for Bottom-Contact Mines and a Reinforcement Learning-Based Mine-Laying Route Optimization Method
error_outline You can access the new MDPI.com website here. Explore and share your feedback with us.
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

BAARTR: Boundary-Aware Adaptive Regression for Kinematically Consistent Vessel Trajectory Reconstruction from Sparse AIS

by
Hee-jong Choi
1,
Joo-sung Kim
2 and
Dae-han Lee
3,*
1
Department of Maritime Transportation System, Mokpo National Maritime University, Mokpo 58628, Republic of Korea
2
Division of Navigation Science, Mokpo National Maritime University, Mokpo 58628, Republic of Korea
3
Graduate School of Maritime Transportation System, Mokpo National Maritime University, Mokpo 58628, Republic of Korea
*
Author to whom correspondence should be addressed.
J. Mar. Sci. Eng. 2026, 14(2), 116; https://doi.org/10.3390/jmse14020116
Submission received: 5 December 2025 / Revised: 29 December 2025 / Accepted: 5 January 2026 / Published: 7 January 2026
(This article belongs to the Special Issue Advanced Research on Path Planning for Intelligent Ships)

Abstract

The Automatic Identification System (AIS) frequently suffers from data loss and irregular report intervals in real maritime environments, compromising the reliability of downstream navigation, monitoring, and trajectory reconstruction tasks. To address these challenges, we propose BAARTR (Boundary-Aware Adaptive Regression for Kinematically Consistent Vessel Trajectory Reconstruction), a novel kinematically consistent interpolation framework. Operating solely on time, latitude, and longitude inputs, BAARTR explicitly enforces boundary velocities derived from raw AIS data. The framework adaptively selects a velocity-estimation strategy based on the AIS reporting gap: central differencing is applied for short intervals, while a hierarchical cubic velocity regression with a quadratic acceleration constraint is employed for long or irregular gaps to iteratively refine endpoint slopes. These boundary slopes are subsequently incorporated into a clamped quartic interpolation at a 1 s resolution, effectively suppressing overshoots and ensuring velocity continuity across segments. We evaluated BAARTR against Linear, Spline, Hermite, Bezier, Piecewise cubic hermite interpolating polynomial (PCHIP) and Modified akima (Makima) methods using real-world AIS data collected from the Mokpo Port channel, Republic of Korea (2023–2024), across three representative vessels. The experimental results demonstrate that BAARTR achieves superior reconstruction accuracy while maintaining strictly linear time complexity ( O ( N ) ). BAARTR consistently achieved the lowest median Root Mean Square Error (RMSE) and the narrowest Interquartile Ranges (IQR), producing visibly smoother and more kinematically plausible paths-especially in high-curvature turns where standard geometric interpolations tend to oscillate. Furthermore, sensitivity analysis shows stable performance with a modest training window (n ≈ 16) and minimal regression iterations (m = 2–3). By reducing reliance on large training datasets, BAARTR offers a lightweight, extensible foundation for post-processing in Maritime Autonomous Surface Ship (MASS) and Vessel Traffic Service (VTS), as well as for accident reconstruction and multi-sensor fusion.

1. Introduction

AIS functions as a cornerstone of maritime technology, facilitating the real-time exchange of critical navigational data—including vessel position, speed, and course—to mitigate collision risks and enhance the efficiency of maritime operations. To strengthen navigational safety and ensure effective traffic management, the International Maritime Organization has mandated the installation and operation of AIS on all passenger ships and vessels engaged in international voyages, establishing it as an essential component of the global maritime safety framework [1,2]. By providing dynamic information (e.g., position, speed, and course), static information (e.g., vessel name and Maritime Mobile Service Identity (MMSI)), and voyage-related information (e.g., destination and estimated time of arrival), AIS significantly improves situational awareness among ships and contributes to the systematic management of maritime traffic flow [1].
However, AIS data acquired in real-world maritime environments is frequently compromised by data loss and errors. The primary causes include packet collisions arising from congestion and interference in Very High Frequency (VHF) communication channels [3], signal attenuation or blockage due to adverse weather conditions and topographic obstructions [4], as well as equipment failures or deliberate signal manipulation (spoofing) [5]. These data irregularities extend beyond intermittent gaps, significantly undermining the overall reliability of maritime traffic systems. Recent comprehensive reviews on AIS track anomaly detection and studies on base station credibility monitoring under spoofing scenarios have highlighted these vulnerabilities [6,7]. To address these data integrity challenges, this study proposes an interpolation framework designed to ensure kinematic consistency, even in the presence of missing or erroneous AIS segments.
Notably, with the emergence of the MASS era, the quality of AIS data has become paramount. AIS serves as a primary information source that enables MASS to maintain situational awareness of surrounding vessels and the maritime environment, forming the foundation for critical navigational decisions such as voyage planning and collision avoidance. However, when AIS data is compromised by losses or errors due to the aforementioned factors, autonomous systems are forced to rely on inaccurate or incomplete information. This reliance can precipitate erroneous decision-making processes, potentially resulting in catastrophic maritime accidents. Therefore, beyond the existing challenges—such as reduced accuracy in VTS monitoring [7] and diminished reliability in maritime accident investigations and route analyses [5,8]—guaranteeing the continuity and accuracy of AIS data is indispensable for ensuring the safe and efficient operation of MASS.
To mitigate these practical limitations of AIS data quality and safeguard mission-critical systems dependent on data-driven decision-making—such as MASS—the application of interpolation methods is indispensable. Interpolation functions as a fundamental mechanism for reconstructing fragmented trajectories by estimating missing navigational parameters, including vessel position, speed, and course. This process ensures data continuity and generates information that approximates the vessel’s actual dynamic state with high fidelity [8,9].
Recent studies have highlighted the deficiencies of conventional interpolation methods, such as linear and cubic splines, in adequately capturing the complex non-linear dynamics of vessel motion [10,11]. Consequently, alternative approaches have been proposed, including Improved Kinematic Interpolation (IKI) and machine learning (ML)-based methods such as Long Short-Term Memory (LSTM) networks and Random Forests [4,8]. While these methods demonstrate improved performance compared to traditional approaches, the direct application of kinematic interpolation to real-world AIS data—characterized by frequent losses and errors—can lead to error propagation. Moreover, ML-based techniques face significant challenges due to their reliance on large-scale training datasets, which are often scarce or computationally expensive to acquire in the maritime domain.
To address these challenges, this study proposes a boundary-aware, data-efficient interpolation framework (BAARTR) that robustly reconstructs vessel trajectories under realistic AIS loss and noise conditions, leveraging only time, latitude, and longitude inputs. The core innovation involves estimating the endpoint velocities of each missing segment and enforcing them as boundary constraints during interpolation, thereby preserving kinematic consistency and suppressing overshoots at curvature transitions. The reconstruction strategy adapts to the AIS reporting interval: for short gaps (defined as those shorter than the nominal reporting period), vessel speed is estimated from positions using a central-difference scheme; for long or irregular gaps, trajectory vectors are decomposed, and a complementary hierarchical regression model is used to predict endpoint velocities. These velocities are subsequently integrated into a clamped spline interpolation to produce kinematically plausible paths, all while maintaining a lightweight computational structure (local windows with ≤16 samples and a few iterations, no pre-training).

2. Related Work

A significant body of research has addressed vessel trajectory reconstruction using AIS data to ensure safe and efficient navigation. Existing literature can be broadly categorized into three streams: statistical and rule-based approaches, kinematic model-based approaches, and deep learning-based approaches.
Early research primarily utilized statistical characteristics and heuristic rules derived from navigation patterns. Sang et al. [12] established a reconstruction procedure based on stepwise outlier removal and segment connection rules. In a subsequent study, Sang et al. [13] refined this method by incorporating three heuristic rules and combining linear, curvilinear, and arc segments to improve reconstruction accuracy in inland waterways. Zhang et al. [14] leveraged vector analysis for anomaly detection, differentiating between straight and curved segments to apply linear and cubic spline interpolation, respectively. Addressing Unmanned Surface Vehicle scenarios, Shi et al. [15] integrated outlier removal, Empirical Mode Decomposition for noise reduction, and curve fitting with Fermat’s spiral, effectively demonstrating the feasibility of AIS-based approaches in real-world waters. To mitigate uncertainty in sparse AIS data, Zhang et al. [16] employed an ant colony optimization algorithm with weighted nodes and edges, successfully reconstructing optimal paths in open-sea environments. More recently, Deng et al. [17] utilized Graph Signal Variation Detection to eliminate false connections in complex crossing trajectories, thereby enhancing the stability of downstream interpolation and prediction. Furthermore, Chen and Huang [18] proposed a clustering algorithm, which is based on trajectory reconstruction, an unsupervised clustering method that aggregates similar trajectory patterns without manual labeling, facilitating a novel reconstruction approach. Finally, Liang et al. [19] developed the AISClean framework, which integrates statistical detection, polynomial interpolation, and Dynamic Time Warping, verifying the capability to reconstruct long-term missing segments.
Concurrently with statistical approaches, research incorporating the explicit physical motion characteristics of vessels has gained significant traction. Hedger et al. [20] presented a position–velocity optimal interpolation framework for hydrophone array data, highlighting the criticality of interpolation design in reconstructing trajectories from sparse and irregular observations. Similarly, Hintzen et al. [21] applied cubic Hermite splines to Vessel Monitoring System (VMS) data; by incorporating derivative values (i.e., speed) at each data point, this method significantly improved trajectory accuracy compared to standard linear interpolation. Long [22] introduced a kinematic interpolation scheme that directly integrates velocity and acceleration into the interpolation process, demonstrating its effectiveness in bridging AIS data gaps. Building on this foundation, Du et al. [23] enhanced linear interpolation by incorporating quadratic polynomials constrained by speed, heading, and estimated acceleration. Guo et al. [10] further advanced this line of inquiry by introducing IKI, which refined acceleration estimation accuracy and boosted overall trajectory reconstruction performance. Addressing the challenge of irregular AIS sampling intervals, Zaman et al. [24] employed a two-stage approach comprising trajectory reconstruction followed by waypoint detection, which outperformed conventional methods in identifying turning points. More recently, Liu et al. [25] combined navigation state recognition with Bi-Directional Kinematic Interpolation (BDKI), leveraging both forward and backward boundary conditions to improve reconstruction accuracy. This approach aligns with the boundary-aware methodology proposed in our study, further validating the empirical effectiveness of such kinematic constraints. Wang et al. [26] corroborated these insights by reporting that piecewise cubic hermite interpolating polynomial (Hermite) preprocessing improved trajectory classification and prediction in real-world confluence waterways, a finding consistent with the comparative analysis between Hermite and clamped interpolation presented herein.
In recent years, deep learning paradigms have been increasingly deployed to capture complex temporal patterns in AIS data for trajectory reconstruction. Zhong et al. [27] leveraged a Bi-directional LSTM Recurrent Neural Networks model, achieving high accuracy with an average RMSE of approximately 10 m. Adapting architectures from computer vision, Li et al. [28] applied the U-Net model—originally designed for image segmentation—to trajectory reconstruction, demonstrating superior performance in handling curved segments. To address data stability, Murray and Perera [29] utilized dual autoencoders to compress and reconstruct spatiotemporal features. Further enhancing accuracy, Chen et al. [30] proposed an integrated framework that combines outlier removal (via moving average models) with neural network prediction, effectively mitigating AIS noise while reconstructing realistic trajectories. Similarly, Wu et al. [31] introduced a unified solution merging low-rank tensor completion with outlier separation to handle missing data and noise simultaneously. Advancing multi-step prediction capabilities, Ye et al. [32] integrated feature-correlation-based variable selection with a Seq2Seq model. Taking a hybrid approach, Li et al. [33] proposed a Graph Attention Network–LSTM model capable of jointly learning spatiotemporal features, which reduced the mean error by 44–56% compared to existing methods. Expanding the domain to fisheries, Zhao et al. [34] introduced HiTrip, a deep learning-based framework fusing fishing vessel VMS data with oceanographic data; this method overcame challenges associated with irregular vessel movements and low-resolution datasets to accurately reconstruct missing segments of historical trajectories. Most recently, Chen et al. [35] presented a framework integrating advanced denoising and interpolation, demonstrating the potential to simultaneously address real-world noise and data gaps.
Collectively, these studies have consistently advanced the accuracy of vessel trajectory reconstruction through statistical, kinematic, and deep learning approaches. However, significant limitations persist. First, despite the high performance of deep learning models, they entail substantial computational costs and data dependency; furthermore, their generalization capability often diminishes when encountering anomalous navigation patterns absent from the training dataset. Second, although kinematic models effectively capture vessel dynamics, they tend to decouple velocity estimation from position interpolation or apply a uniform interpolation strategy regardless of the missing interval duration.
To bridge these gaps, this study proposes the BAARTR framework. BAARTR employs an adaptive strategy that selectively applies central differencing or regression analysis contingent on the length of missing intervals, while precisely estimating velocity in data gaps through iterative polynomial regression (assuming variable acceleration) with limited inputs. Crucially, the predicted velocities are directly enforced as boundary conditions in a clamped spline interpolation, thereby establishing a unified process between velocity estimation and position reconstruction. Consequently, BAARTR minimizes reliance on complex deep learning models while capturing vessel dynamics with high fidelity, enabling reliable trajectory reconstruction even from sparse AIS data.

3. Research Method

3.1. Overall Framework

As illustrated in Figure 1, the proposed BAARTR framework comprises three main stages: (1) data preprocessing, (2) trajectory segmentation and velocity estimation, and (3) trajectory reconstruction.
In the preprocessing stage, raw AIS data are decoded and cleaned. Data acquisition was performed using a custom receiver system integrating a Raspberry Pi 4 with a dAISy HAT, capturing AIS messages in ASCII format. The study dataset encompasses vessels navigating the Mokpo Port waterway in the Republic of Korea, recorded from June 2023 to June 2024 (an overview is provided in Table 1).
In real-world scenarios, while raw AIS data typically includes vessel position, MMSI, and other static/dynamic parameters, frequent data anomalies are observed. Specifically, some records lack critical attributes (e.g., vessel dimensions, Speed Over Ground (SOG), Course Over Ground (COG)), and significant discrepancies often exist between the reported speed data and the actual positional displacement. To address these irregularities, this study developed the BAARTR framework to robustly reconstruct vessel trajectories even when input data is compromised.
The study area encompasses the Mokpo Port channel, which operates under regulations enforced by the West Sea Regional Coast Guard. Vessels navigating the 3.2 km-long and 450 m-wide channel between Dalli Island and the Hwawon Peninsula are subject to a maximum speed limit of 20 knots. Consequently, AIS data records indicating a SOG exceeding 20 knots were identified as anomalies and filtered out.
In the trajectory segmentation and velocity estimation stage, the reconstruction strategy is adaptively determined based on the AIS reporting interval, which varies according to vessel speed as summarized in Table 2. To ensure kinematic consistency rather than relying on potentially unreliable raw SOG values reported in AIS, vessel velocity is independently recalculated using the central difference method. Based on this computed velocity, the reconstruction procedure bifurcates into two distinct pathways: when the reporting interval is short, velocity is estimated using the central difference method; conversely, when the interval is long, a regression-based velocity estimation model is applied.
The BAARTR framework employs an adaptive velocity estimation strategy contingent on the AIS reporting interval. When the transmission interval is relatively short—ensuring that distance variations between consecutive observations are reliably captured—vessel speed is computed using the central difference method. Specifically, the velocity vector v i at the i -th position, defined with respect to the distance d between two adjacent positions, is expressed as follows:
v i = d i + 1 d i 1 t i + 1 t i 1
Conversely, when the AIS reporting interval extends beyond one minute or exhibits significant irregularity, the accuracy of the central difference method diminishes due to discretization errors. To mitigate this, our approach employs a regression-based velocity prediction framework. In this scheme, position and time data for each segment serve as inputs to iteratively estimate the continuous time-series function of vessel speed.
In this study, a vessel’s trajectory is modeled as a time-series sequence within a two-dimensional vector space, where position is defined by latitude and longitude coordinates. The position vector at time step t is denoted as r ( t )   =   [ λ t , ϕ ( t ) ] . The velocity vector, defined as the first derivative of position with respect to time, is decomposed into orthogonal components along the longitude and latitude axes to facilitate independent prediction and interpolation. This decoupling strategy ensures the stability of velocity and acceleration estimation while minimizing error propagation arising from cross-axis correlations during the reconstruction process.
A vessel can be kinematically characterized as a moving object whose acceleration, velocity, and position vary over time. These changes arise from factors such as waves, wind, encounters with other vessels, obstacles, engine operations depending on the route, and maneuvers such as turning, all of which influence acceleration. To describe these dynamics, this study formulates acceleration using a non-uniform acceleration equation of motion. Although such equations can be expressed using polynomial forms ranging from quadratic to higher orders, increasing the polynomial degree tends to cause overfitting. While it is recognized that a rigorous description of vessel motion necessitates hydrodynamic modeling (e.g., MMG or Nomoto models) that accounts for mass, drag, and external forces, applying such dynamic models to sparse AIS data is often infeasible due to the unavailability of specific hydrodynamic coefficients and force inputs.
Consequently, this study adopts a parametric kinematic approach, modeling the time-variant acceleration as a quadratic polynomial. This formulation serves as a robust kinematic approximation designed to capture variable acceleration trends without overfitting, rather than attempting to simulate a fully grounded dynamic vessel model. Through successive integration with respect to time, we derive a cubic velocity model and a quartic position model for each trajectory segment.
To enhance the accuracy of velocity estimation in missing segments, this study performs regression analysis iteratively, incorporating the estimated values from the previous step as inputs for the subsequent regression. This iterative approach ensures both the continuity and physical plausibility of segment-wise predictions, while the numbers of training samples and iterations are optimized across the entire vessel trajectory based on RMSE to prevent overfitting.
Once velocity has been estimated, BAARTR applies adaptive clamped quartic polynomial interpolation to reconstruct positions within each segment. Clamped interpolation employs the slopes at the segment endpoints as boundary constraints, thereby maximizing the continuity and smoothness of the trajectory in accordance with the estimated velocities. The interpolation polynomial on the i -th segment is defined as d ( t ) .
The core of this study lies in a double-anchoring strategy, whereby missing segments are reconstructed using clamped interpolation while the boundary slopes (velocities) are reliably estimated through adaptive hierarchical regression. First, the velocity estimation pathway is chosen according to the AIS reporting interval. For short intervals (AIS reporting interval   t ), the central difference method is employed to minimize noise, whereas for long intervals (AIS reporting interval   t ), boundary slopes are refined through iterative feedback ( m times) of cubic velocity regression with acceleration constrained to a quadratic form. The resulting endpoint slopes are imposed as boundary conditions in the clamped quartic polynomial interpolation, effectively suppressing oscillatory artifacts in high-curvature segments while ensuring velocity continuity. The number of training samples ( n ) and iterations ( m ) is determined via RMSE-based sensitivity analysis across the entire trajectory, thereby achieving a balance between overfitting suppression and continuity assurance.

3.2. Boundary-Aware Adaptive Regression

Before regression, raw AIS trajectories were subjected to a light low-pass stabilization to suppress transient positional jitter caused by Global Positioning System noise while preserving the original trend. This step aimed to ensure numerical stability during the regression process rather than to smooth or reshape the data. The cut-off frequency was empirically determined as 0.1 Hz, which effectively removed short-term fluctuations (<10 s) without affecting vessel-scale motion patterns.
When the reporting interval is long, errors increase if trajectory reconstruction relies solely on the central difference method. To address this, the present study employs the BAAR technique to calculate velocity while accounting for vessel dynamics. For the i -th data point, BAAR performs m stages of hierarchical regression based on n training samples, differing from conventional one-time regression models by incorporating the predicted values from the previous stage into the subsequent learning process. The number of training samples ( n ) and the number of regression iterations ( m ) are empirically predetermined through RMSE-based analysis across the entire trajectory.
Step 1: Initial Regression ( m = 1 ) .
The velocity at the i -th point, v ^ i ( 1 ) , is estimated using the previously described velocity model v ( t ) , with the training dataset X i ( 1 ) defined as follows:
X i ( 1 ) = v i n ,   v i n + 1 ,   ,   v i 1 ,   v i + 1 ,   ,   v i + n 1 ,   v i + n
In the same manner, v ^ i + 1 ( 1 ) is also estimated.
Step 2: Complementary Hierarchical Regression.
At the m -th iterative regression, the training process incorporates the predicted values obtained from the previous stage.
v ^ i ( m ) = R e g r e s s ( v i n ,   v i n + 1 ,   ,   v i 1 , v ^ i + 1 m 1 , v i + 2 ,   v i + n ) v ^ i + 1 ( m ) = R e g r e s s ( v i n ,   v i n + 1 ,   ,   v i 1 , v ^ i m 1 , v i + 2 ,   v i + n )
The iterative regression aims at empirical stabilization, not a formal convergence proof. Preliminary experiments were performed with n   { 8 ,   12 ,   16 ,   20 ,   24 } and m   { 1 ,   2 ,   3 ,   4 } . The results indicated that n = 16 provided the lowest RMSE while preserving local responsiveness, whereas larger n values led to excessive smoothing. Similarly, m = 2 ~ 3 yielded an optimal balance between regression refinement and computational cost, as RMSE improvement saturated beyond m = 3 . Within these ranges, BAARTR showed stable RMSE variation (<5%), indicating that its performance is not highly sensitive to parameter changes. Additionally, these parameters vary depending on factors such as the ports through which vessels navigate and vessel traffic volume.
This complementary hierarchical structure enables accurate velocity estimation even in long-term missing segments by incorporating feedback among predicted values while reflecting vessel dynamics. To preserve the physical realism of velocity, BAAR constrains acceleration to a quadratic polynomial and integrates it to construct a cubic polynomial regression model for velocity, thereby accounting for the smooth variation in velocity inherent in vessel motion. The clamped approach is distinctive in that it imposes kinematic constraints directly, unlike purely geometric interpolations. By fixing the slopes (velocities) at the start and end of interpolation segments, overshoot and oscillations that typically arise at turning initiation and termination are structurally suppressed. Meanwhile, BAAR limits acceleration to a quadratic form and models velocity as a cubic polynomial, minimizing degrees of freedom while still incorporating trajectory continuity constraints as a kinematic approximation. Iterative regression further updates boundary estimates in a complementary manner, suppressing the propagation of uncertainty even in long-duration gaps. This combined framework ultimately enhances the overall accuracy of trajectory reconstruction (Figure 2).

4. Results and Analysis

4.1. Experimental Setup Overview

In this study, AIS data from vessels navigating the Mokpo Port channel were collected between June 2023 and June 2024, and three representative vessels were selected for performance evaluation of BAARTR, as illustrated in Figure 3, Figure 4 and Figure 5 (the background colors represent the electronic navigational chart features: brown indicates land, blue represents navigable waters, and light green indicates shallow waters or tidal flats.). The selected vessels consisted of one fishing vessel, one cargo ship, and one tugboat, which are the dominant ship types operating in the Mokpo Port channel. These vessels exhibited different speeds, reporting intervals, and trajectory patterns, and the dataset was obtained under real navigation conditions. To validate performance, artificial data removal was conducted to generate missing segments, which were then reconstructed to assess interpolation accuracy.
To objectively evaluate the performance of the proposed BAARTR, we conducted comparative experiments against six representative deterministic interpolation methods: linear, spline, Hermite, and Bezier. Furthermore, two advanced shape-preserving techniques—PCHIP and Makima—were specifically included as state-of-the-art (SOTA) lightweight baselines. While standard cubic splines often suffer from overshoot oscillations, PCHIP and Makima are designed to preserve the shape of the data, offering a rigorous benchmark for assessing the stability of the proposed algorithm. We specifically selected fundamental geometric and kinematic interpolation methods (Linear, Spline, Hermite, Bezier, PCHIP, Makima) as baselines to isolate the contribution of the proposed boundary-aware mechanism. Smoothing filters such as Savitzky–Golay were excluded because they are designed for denoising continuous data rather than reconstructing large gaps in sparse datasets. Additionally, complex state-recognition-based frameworks like BDKI were not included in the direct comparison to focus on evaluating the performance of the pure interpolation kernel itself under identical input conditions. The evaluation metric was the RMSE, and the mean RMSE for each segment was calculated through repeated experiments.

4.2. Interpolation Results

The influence of the number of training samples and regression iterations on interpolation accuracy, according to the trajectories of the target vessels, can be examined through the thresholds presented in Figure 6, Figure 7, Figure 8, Figure 9, Figure 10 and Figure 11 for the three vessel scenarios.
(1) Selection of Training Sample Size.
Across all three scenarios—(a) id440XXX000, (b) id440XXX860, and (c) id440XXX320—a sharp decline in RMSE was observed with increasing numbers of training samples, followed by a gradual transition into a saturation region. In this study, n = 16 was selected by considering (i) the residual reduction margin within the saturation region, (ii) fairness of comparison through a common setting across scenarios, and (iii) stability of boundary conditions (velocity) in long-term missing segments. This selection (n = 16) incurs minimal computational overhead while maintaining robust performance across all scenarios.
(2) Selection of Regression Iterations.
The distribution of RMSE with respect to the number of regression iterations is shown in Figure 9, Figure 10 and Figure 11. The optimal number of iterations was identified as (a) m = 2 , (b) m = 3 , (c) m = 3 for each scenario, respectively. The required number of regression iterations varies depending on the trajectory patterns generated by the vessels during navigation.
The comparative results between BAARTR and other interpolation methods, based on the selected number of training samples and regression iterations for each scenario, are presented in Figure 12, Figure 13 and Figure 14. The visualization of reconstructed trajectories demonstrates that BAARTR generates smoother and more kinematically consistent paths compared with conventional interpolation methods (linear, spline, Hermite, etc.). In particular, in scenario id440XXX860, which involves sharp turning maneuvers, traditional methods exhibited oscillations and overfitting effects, whereas BAARTR produced reconstruction results that reflect continuous maneuvering characteristics during navigation.
Across all scenarios, conventional interpolation methods failed to produce trajectories that aligned with the actual vessel paths. In cases involving gentle curves, sharp turns, and high-curvature segments, BAARTR’s advantages in preserving kinematic consistency and suppressing oscillations were clearly evident compared to other methods.

4.3. Comparison of RMSE Distributions Across Interpolation Methods

Figure 15, Figure 16 and Figure 17 present boxplots and violinplots of RMSE distributions for Linear, Spline, Hermite, Bezier, PCHIP, Makima, and the proposed BAARTR methods across the various scenarios. RMSE was calculated by artificially removing one to three segments from each trajectory, creating intervals of 15–30 s, 30–45 s, and 45–60 s. Statistical outliers, depicted as red crosses, are defined as data points falling beyond 1.5   ×   I Q R of the RMSE distribution. These points correspond to trajectories with irregular gap patterns or abrupt maneuvers, causing unusually high interpolation errors. In all cases, BAARTR exhibited the lowest median RMSE and the smallest variance. By contrast, geometric interpolations (Spline/Bezier) repeatedly showed increased variance and a higher frequency of outliers, particularly in curvature transitions and long-term missing segments.
Analyzing the performance drivers, BAARTR achieved the lowest median and the narrowest IQR, while also producing the fewest outliers. Linear interpolation, conversely, showed elevated median values and wide variance due to structural discontinuities (lack of smoothness at segment boundaries). Spline and Bezier interpolations suffered from overshoots and oscillatory artifacts at curvature transitions and endpoint connections, which contributed to greater variance and error magnitude. Hermite interpolation offered greater stability than the purely geometric methods but still exhibited minor deviations contingent on the accuracy of endpoint slope estimation. The Average Speed method demonstrated moderate performance, serving as a baseline reference. A quantitative summary of the detailed RMSE values for each scenario is provided in Table 3.

4.4. Computational Complexity and Efficiency Analysis

To objectively verify the “lightweight” claim of the proposed framework, we analyzed the theoretical time complexity of BAARTR in comparison with six deterministic interpolation methods (Linear, Spline, Hermite, Bezier, PCHIP, Makima) and a deep learning-based reference.
As summarized in Table 4, all deterministic methods, including the state-of-the-art shape-preserving algorithms (PCHIP and Makima), operate within O ( N ) linear time complexity. This indicates that the computational cost increases linearly with the number of trajectory points, ensuring high scalability.
However, a qualitative distinction exists in their operational logic. While standard methods (Linear, Spline) and shape-preserving methods (PCHIP, Makima) rely solely on geometric constraints to determine the curve, BAARTR incorporates kinematic constraints—specifically, velocity estimation via multi-round regression—within the same O ( N ) complexity class. The core processes of BAARTR are sequential and deterministic, requiring no iterative optimization or heavy matrix multiplications. Consequently, BAARTR maintains the extreme computational lightness of mathematical interpolation methods while offering the kinematic consistency typically sought in complex models, making it far more suitable for real-time VTS systems than deep learning approaches (e.g., LSTM), which scale with O ( N × d 2 ) .

5. Discussion

This study proposed the BAARTR framework designed for environments where AIS data are compromised by data loss and irregular reporting intervals. We evaluated its performance against conventional interpolation methods using AIS data from three vessels operating in actual port waters. BAARTR adopts clamped polynomial interpolation, in which the boundary slopes (velocities) of missing segments are explicitly enforced as constraints. Unlike simple geometric interpolations (linear, spline, Bezier), this approach guarantees both velocity continuity and kinematic plausibility, thereby suppressing overshoots and oscillatory artifacts at trajectory endpoints (Figure 5 and Figure 6). Consequently, oscillations in high-curvature regions were mitigated, and unrealistic turning radius distortions were alleviated.
For short reporting intervals, central differencing was applied, while for long or irregular intervals, velocity prediction was performed using a hierarchical iterative regression strategy. This dual approach enabled both numerical stability in short gaps and prediction reliability in long gaps. Specifically, constraining acceleration to a quadratic polynomial and integrating it to construct a cubic velocity model effectively limited excessive degrees of freedom while still reflecting vessel inertia and hydrodynamic resistance as a first-order approximation (Equations (2) and (3)).
Across all three scenarios, the RMSE curves for training sample size showed a sharp decline followed by convergence, with n = 16 providing stable low-variance performance at minimal computational cost (Figure 6, Figure 7 and Figure 8). The optimal number of regression iterations, m = 2 or m = 3 , varied depending on route geometry (specifically curvature and turning segments), suggesting that boundary velocity estimates can be sufficiently stabilized with only a few feedback iterations (Figure 9, Figure 10 and Figure 11).
Boxplot comparisons further revealed that BAARTR consistently achieved the lowest median values and narrowest IQR across all scenarios (Figure 15, Figure 16 and Figure 17). As shown in Table 3, BAARTR yielded the minimum RMSE in each case, outperforming Linear, Spline, Hermite, Bezier, PCHIP and Makima. Notably, in scenario (b), which involved sharp turns, geometric interpolations produced greater variance and outliers due to endpoint oscillations and overshoots; in contrast, BAARTR maintained stability through boundary-condition-based regularization. The consistently lower median and IQR values of BAARTR can be attributed to: (i) the accuracy of the regression module, which filters out noise inherent in spline/Bezier methods, and (ii) the use of clamped interpolation, which enforces velocity continuity (kinematic consistency) in curved trajectories, preventing error propagation even in complex segments with significant data loss. These findings are substantiated by the RMSE results in Figure 5 and Figure 6 and Table 3. Furthermore, theoretical complexity analysis confirms that BAARTR operates with strictly linear time complexity ( O ( N ) ), maintaining the same level of computational efficiency as lightweight mathematical models (e.g., Makima). This contrasts with deep learning approaches ( O ( N × d 2 ) ), making BAARTR highly suitable for embedded systems.
Considering the realities of VHF communication, where data losses, delays, and packet collisions occur frequently, BAARTR ensures the continuity and kinematic consistency of trajectories even under conditions of sparse sensor availability. The method can be seamlessly integrated as a post-processing module to compensate for missing AIS data in MASS navigation systems, particularly within voyage planning and collision-avoidance modules, as well as in VTS accident reconstruction, investigative support, and port traffic flow analysis. Moreover, the high-resolution (one-second) reconstruction outputs offer high potential for fusion with radar and camera tracking systems.
Nevertheless, this study acknowledges certain limitations. Only three variables—time, latitude, and longitude—were used, while velocity was reconstructed via central differencing or regression estimation. Although this strategy maximizes model simplicity and general applicability, it may not fully capture dynamics in extreme environmental conditions, such as strong cross-currents, tidal effects, or abrupt steering inputs. Additionally, the dataset was limited to AIS data collected from a specific waterway and temporal period. Future extension to open seas, high-speed navigation, or heavily congested communication environments may yield different performance characteristics.

6. Conclusions

This study introduced BAARTR, a novel interpolation framework that integrates boundary awareness (velocity and slope) with adaptive regression to mitigate the challenges of data loss and irregular sampling in AIS data. The efficacy of the method was rigorously evaluated against conventional interpolation methods (Linear, Spline, Hermite, Bezier, PCHIP, Makima) across three real-world vessel trajectories collected in a port channel. In the proposed approach, velocities at the boundaries of missing segments are estimated through hierarchical regression and explicitly enforced as constraints in clamped interpolation, thereby ensuring kinematic continuity and smooth curvature transitions in the reconstructed trajectories. As a result, BAARTR consistently achieved lower median RMSE, narrower IQR, and fewer outliers across all scenarios, demonstrating superiority in both accuracy and stability. Visual comparisons further confirmed that BAARTR effectively suppressed overshoots and oscillatory artifacts (observed in Spline/Bezier) as well as structural discontinuities (observed in Linear), thereby reproducing trajectories that adhere to realistic navigational dynamics.
Sensitivity analysis confirmed that using n = 16 training samples provides an optimal balance among performance, stability, and computational cost. Furthermore, setting the number of regression iterations conservatively to m = 2 or m = 3 , depending on segment dynamics, effectively prevents overfitting and error propagation while delivering robust performance gains. Moreover, the theoretical complexity analysis proved that BAARTR operates with O ( N ) linear complexity, ensuring real-time feasibility unlike deep learning-based approaches. Therefore, BAARTR serves as an optimal lightweight solution for restoring missing trajectory data in resource-constrained maritime environments. Beyond operational monitoring, BAARTR provides realistic trajectories without overfitting in accident reconstruction and route analysis. Moreover, when applied as a data normalization pre-processor for downstream tasks such as prediction, classification, and anomaly detection, it effectively mitigates performance variability caused by irregular AIS reporting intervals.
Despite the promising results, several limitations of this study warrant mention. First, the BAARTR framework remains contingent upon the quality and density of AIS observations. When long-term data gaps coincide with large curvature transitions, uncertainty in boundary velocity estimation increases, potentially leading to elevated local errors. Second, the method employs a simplified quadratic acceleration model as a computational heuristic. Since this purely kinematic approach does not explicitly account for vessel mass, added mass, or hydrodynamic damping, it inherently limits the ability to fully capture nonlinear dynamics or abrupt changes in motion under strong external forces (e.g., gusts, complex currents) or complex maneuvers. Third, this study relied exclusively on univariate interpolation based on time, latitude, and longitude, thereby omitting exogenous variables such as currents, wind, vessel type/load, and engine response. Fourth, complex interaction scenarios caused by multi-vessel encounters (e.g., collision avoidance or formation keeping) may complicate boundary estimation. Finally, the dataset was confined to a specific region and time period, necessitating further validation to ensure generalization across diverse waterways and environmental conditions.
While the framework was intentionally designed to operate with minimal inputs (time, latitude, longitude) to ensure low latency, we acknowledge that excluding SOG/COG, vessel type/handling characteristics, and environmental forcing may limit absolute accuracy. However, when available, these variables can be readily incorporated as training data for endpoint-velocity estimation.
Future research will focus on the following key areas: First, generalization capability will be enhanced by integrating exogenous variables—such as Speed and Course Over Ground (SOG/COG), vessel type/tonnage, and environmental factors including weather and currents—into a multivariate boundary estimation framework. Second, incorporating navigation state recognition (e.g., distinguishing between straight sailing and turning) alongside waypoint-aware strategies would enable the automatic selection of optimal interpolation methods for each segment. Third, hybridization with path-planning-based restoration algorithms is recommended to suppress kinematically implausible trajectories, particularly under conditions of severe data loss or spatial constraints. Fourth, multimodal fusion with radar, vision-based tracking, and IMU data, as well as validation across multiple waterways, should be conducted to ensure greater robustness and general applicability. Finally, we aim to refine BAARTR by leveraging data-driven enhancements through machine learning with large-scale training datasets and multiple parameters.
In conclusion, BAARTR has demonstrated the ability to achieve both accuracy and stability under conditions of missing and irregular AIS data, while incorporating critical features such as noise suppression and continuous trajectory connectivity. By balancing a lightweight model with high extensibility, including multivariate integration, state awareness, and path-planning linkage, the proposed boundary-aware adaptive regression framework is positioned as a promising foundational technology with significant potential for applications in autonomous navigation, maritime traffic control, and accident reconstruction.

Author Contributions

Conceptualization, H.-j.C. and J.-s.K.; methodology, H.-j.C.; software, H.-j.C. and J.-s.K.; validation, H.-j.C. and D.-h.L.; formal analysis, H.-j.C.; investigation, H.-j.C.; resources, H.-j.C.; data curation, H.-j.C.; writing—original draft preparation, H.-j.C.; writing—review and editing, D.-h.L.; visualization, H.-j.C.; supervision, J.-s.K. and D.-h.L.; project administration, J.-s.K.; funding acquisition, J.-s.K. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Restrictions apply to the datasets. The datasets presented in this article are not readily available because they are part of an ongoing study. Requests to access the datasets can be made to the corresponding author.

Acknowledgments

The authors extend their sincere gratitude to Joo Sung Kim of Mokpo National Maritime University for his invaluable assistance throughout this study.

Conflicts of Interest

The authors declare no conflicts of interest. The sponsors had no role in the design, execution, interpretation, or writing of the study.

References

  1. Lee, E.; Mokashi, A.J.; Moon, S.Y.; Kim, G.S. The Maturity of Automatic Identification Systems (AIS) and Its Implications for Innovation. J. Mar. Sci. Eng. 2019, 7, 287. [Google Scholar] [CrossRef]
  2. Kim, B.K.; Hong, S.H.; Lee, J. The Efficient Extraction Strategy for ship displays in AIS Monitoring System. In Proceedings of the Korean Institute of Information and Commucation Sciences Conference; The Korea Institute of Information and Commucation Engineering: Busan, Republic of Korea, 2022; pp. 588–590. [Google Scholar]
  3. Maeng, S.Y.; Park, S.H.; Kim, G.H.; Lee, S.B.; Choi, M.S.; Choi, G.H.; Lee, Y.W. A Study on AIS Data Collision Attenuation Technique using Terrestrial Mobile Communication Channel. In Proceedings of the Korean Institute of Information and Commucation Sciences Conference; The Korea Institute of Information and Commucation Engineering: Busan, Republic of Korea, 2013; pp. 426–428. [Google Scholar]
  4. Liang, M.; Liu, R.W.; Zhong, Q.; Liu, J.; Zhang, J. Neural Network-Based Automatic Reconstruction of Missing Vessel Trajectory Data. In Proceedings of the 2019 IEEE 4th International Conference on Big Data Analytics (ICBDA), Suzhou, China, 15–18 March 2019; pp. 426–430. [Google Scholar] [CrossRef]
  5. Androjna, A.; Perkovič, M.; Pavic, I.; Mišković, J. AIS Data Vulnerability Indicated by a Spoofing Case-Study. Appl. Sci. 2021, 11, 5015. [Google Scholar] [CrossRef]
  6. Wolsing, K.; Roepert, L.; Bauer, J.; Wehrle, K. Anomaly Detection in Maritime AIS Tracks: A Review of Recent Approaches. J. Mar. Sci. Eng. 2022, 10, 112. [Google Scholar] [CrossRef]
  7. Wang, X.; Wang, Y.; Fu, L.; Hu, Q. An AIS Base Station Credibility Monitoring Method Based on Service Radius Detection Patterns in Complex Sea Surface Environments. J. Mar. Sci. Eng. 2024, 12, 1352. [Google Scholar] [CrossRef]
  8. Zhang, X.; He, Y.; Tang, R.; Mou, J.; Gong, S. A Novel Method for Reconstruct Ship Trajectory Using Raw AIS Data. In Proceedings of the 2018 3rd IEEE International Conference on Intelligent Transportation Engineering (ICITE), Singapore, 3–5 September 2018; pp. 192–198. [Google Scholar] [CrossRef]
  9. Xue, J.; Wu, C.; Chen, Z.; Chen, X. A novel estimation algorithm for interpolating ship motion. In Proceedings of the 2017 4th International Conference on Transportation Information and Safety (ICTIS), Banff, AB, Canada, 8–10 August 2017; pp. 557–562. [Google Scholar] [CrossRef]
  10. Guo, S.; Mou, J.; Chen, L.; Chen, P. Improved kinematic interpolation for AIS trajectory reconstruction. Ocean Eng. 2021, 234, 109256. [Google Scholar] [CrossRef]
  11. Nguyen, V.S.; Im, N.K.; Lee, S.M. The interpolation method for the missing AIS data of ship. J. Navig. Port Res. 2015, 39, 377–384. [Google Scholar] [CrossRef]
  12. Sang, L.-Z.; Yan, X.-P.; Mao, Z.; Ma, F. Restoring Method of Vessel Track Based on AIS Information. In Proceedings of the 2012 11th International Symposium on Distributed Computing and Applications to Business, Engineering & Science, Guilin, China, 19–22 October 2012; pp. 336–340. [Google Scholar] [CrossRef]
  13. Sang, L.Z.; Wall, A.; Mao, Z.; Yan, X.P.; Wang, J. A novel method for restoring the trajectory of the inland waterway ship by using AIS data. Ocean Eng. 2015, 110, 183–194. [Google Scholar] [CrossRef]
  14. Zhang, L.; Meng, Q.; Xiao, Z.; Fu, X. A novel ship trajectory reconstruction approach using AIS data. Ocean Eng. 2018, 159, 165–174. [Google Scholar] [CrossRef]
  15. Shi, B.; Su, Y.; Zhang, D.; Wang, C.; AbouOmar, M.S. Research on Trajectory Reconstruction Method Using Automatic Identification System Data for Unmanned Surface Vessel. IEEE Access 2019, 7, 170374–170384. [Google Scholar] [CrossRef]
  16. Zhang, X.; Wang, J.; Chen, Y.; Li, M.; Cheng, L. Density-weighted ant colony algorithm for ship trajectory reconstruction. Int. J. Shipp. Transp. Logist. 2020, 12, 19–38. [Google Scholar] [CrossRef]
  17. Deng, C.; Wang, S.; Liu, J.; Li, H.; Chu, B.; Zhu, J. Graph Signal Variation Detection: A novel approach for identifying and reconstructing ship AIS tangled trajectories. Ocean Eng. 2023, 286, 115452. [Google Scholar] [CrossRef]
  18. Chen, C.W.; Huang, H.H. Unsupervised vessel trajectory reconstruction. Front. Appl. Math. Stat. 2023, 9, 1124091. [Google Scholar] [CrossRef]
  19. Liang, M.; Su, J.; Liu, R.W.; Lam, J.S.L. AISClean: AIS data-driven vessel trajectory reconstruction under uncertain conditions. Ocean Eng. 2024, 306, 117987. [Google Scholar] [CrossRef]
  20. Hedger, R.D.; Martin, F.; Dodson, J.J.; Hatin, D.; Caron, F.; Whoriskey, F.G. The optimized interpolation of fish positions and speeds in an array of fixed acoustic receivers. ICES J. Mar. Sci. 2008, 65, 1248–1259. [Google Scholar] [CrossRef]
  21. Hintzen, N.T.; Piet, G.J.; Brunel, T. Improved estimation of trawling tracks using cubic Hermite spline interpolation of position registration data. Fish. Res. 2010, 101, 108–115. [Google Scholar] [CrossRef]
  22. Long, J.A. Kinematic interpolation of movement data. Int. J. Geogr. Inf. Sci. 2016, 30, 854–868. [Google Scholar] [CrossRef]
  23. Du, H.; Xiao, Y.; Duan, L.; Gao, S. An algorithm for vessel’s missing trajectory restoration based on polynomial interpolation. In Proceedings of the 2017 4th International Conference on Transportation Information and Safety (ICTIS), Banff, AB, Canada, 8–10 August 2017; pp. 825–830. [Google Scholar] [CrossRef]
  24. Zaman, B.; Marijan, D.; Kholodna, T. Interpolation-Based Inference of Vessel Trajectory Waypoints from Sparse AIS Data in Maritime. J. Mar. Sci. Eng. 2023, 11, 615. [Google Scholar] [CrossRef]
  25. Liu, Y.; Shi, Z.; Fu, B.; Xu, H.; Wu, H. Marine Trajectory Reconstruction Method Based on Navigation State Recognition and Bi-Directional Kinematic Interpolation. J. Mar. Sci. Eng. 2024, 12, 2164. [Google Scholar] [CrossRef]
  26. Wang, Z.; He, W.; Lan, J.; Zhu, C.; Lei, J.; Liu, X. Ship Trajectory Classification Prediction at Waterway Confluences: An Improved KNN Approach. J. Mar. Sci. Eng. 2024, 12, 1070. [Google Scholar] [CrossRef]
  27. Zhong, C.; Jiang, Z.; Chu, X.; Liu, L. Inland ship trajectory restoration by recurrent neural network. J. Navig. 2019, 72, 1359–1377. [Google Scholar] [CrossRef]
  28. Li, S.; Liang, M.; Wu, X.; Liu, Z.; Liu, R.W. AIS-Based Vessel Trajectory Reconstruction with U-Net Convolutional Networks. In Proceedings of the 2020 IEEE 5th International Conference on Cloud Computing and Big Data Analytics (ICCCBDA), Chengdu, China, 10–13 April 2020; pp. 157–161. [Google Scholar] [CrossRef]
  29. Murray, B.; Perera, L.P. A dual linear autoencoder approach for vessel trajectory prediction using historical AIS data. Ocean Eng. 2020, 209, 107478. [Google Scholar] [CrossRef]
  30. Chen, X.; Ling, J.; Yang, Y.; Zheng, H.; Xiong, P.; Postolache, O.; Xiong, Y. Ship trajectory reconstruction from AIS sensory data via data quality control and prediction. Math. Probl. Eng. 2020, 2020, 7191296. [Google Scholar] [CrossRef]
  31. Wu, H.; Hu, L.; Li, X.; Wang, C.; Ye, Z. A novel ship trajectory reconstruction approach based on low-rank tensor completion. Ocean Eng. 2024, 310, 118724. [Google Scholar] [CrossRef]
  32. Ye, L.; Chen, X.; Liu, H.; Zhang, R.; Li, J.; Lu, C.; Zhao, Y. A Study of Multi-Step Sparse Vessel Trajectory Restoration Based on Feature Correlation. Appl. Sci. 2024, 14, 4057. [Google Scholar] [CrossRef]
  33. Li, Y.; Yu, Q.; Yang, Z. Vessel Trajectory Prediction for Enhanced Maritime Navigation Safety: A Novel Hybrid Methodology. J. Mar. Sci. Eng. 2024, 12, 1351. [Google Scholar] [CrossRef]
  34. Zhao, Z.; Chen, J.; Shi, Y.; Hong, F.; Jiang, G.; Huang, H.; Zhao, J. HiTrip: Historical trajectory interpolation for trawlers via deep learning on multi-source data. Ocean Eng. 2024, 292, 116588. [Google Scholar] [CrossRef]
  35. Chen, Y.; Chen, Y.; Cui, Y.; Cai, X.; Yin, C.; Cheng, Y. Optimizing vessel trajectories: Advanced denoising and interpolation techniques for AIS data. Ocean Eng. 2025, 327, 120988. [Google Scholar] [CrossRef]
Figure 1. BAARTR pipeline.
Figure 1. BAARTR pipeline.
Jmse 14 00116 g001
Figure 2. Conceptual framework of the BAARTR process based on kinematic modeling, regression, and adaptive interpolation.
Figure 2. Conceptual framework of the BAARTR process based on kinematic modeling, regression, and adaptive interpolation.
Jmse 14 00116 g002
Figure 3. Comparison between raw data and interpolation result of Vessel id440XXX000. Graph comparing (a) AIS raw data, (b) interpolation result. In (a), background colors denote ENC features: brown (land), blue (sea), and light green (shallow water).
Figure 3. Comparison between raw data and interpolation result of Vessel id440XXX000. Graph comparing (a) AIS raw data, (b) interpolation result. In (a), background colors denote ENC features: brown (land), blue (sea), and light green (shallow water).
Jmse 14 00116 g003
Figure 4. Comparison between raw data and interpolation result of Vessel id440XXX860. Graph comparing (a) AIS raw data, (b) interpolation result. In (a), background colors denote ENC features: brown (land), blue (sea), and light green (shallow water).
Figure 4. Comparison between raw data and interpolation result of Vessel id440XXX860. Graph comparing (a) AIS raw data, (b) interpolation result. In (a), background colors denote ENC features: brown (land), blue (sea), and light green (shallow water).
Jmse 14 00116 g004
Figure 5. Comparison between raw data and interpolation result of Vessel id440XXX320. Graph comparing (a) AIS raw data, (b) interpolation result. In (a), background colors denote ENC features: brown (land), blue (sea), and light green (shallow water).
Figure 5. Comparison between raw data and interpolation result of Vessel id440XXX320. Graph comparing (a) AIS raw data, (b) interpolation result. In (a), background colors denote ENC features: brown (land), blue (sea), and light green (shallow water).
Jmse 14 00116 g005
Figure 6. Interpolation accuracy according to the number of training samples at Vessel id440XXX000.
Figure 6. Interpolation accuracy according to the number of training samples at Vessel id440XXX000.
Jmse 14 00116 g006
Figure 7. Interpolation accuracy according to the number of training samples at Vessel id440XXX860.
Figure 7. Interpolation accuracy according to the number of training samples at Vessel id440XXX860.
Jmse 14 00116 g007
Figure 8. Interpolation accuracy according to the number of training samples at Vessel id440XXX320.
Figure 8. Interpolation accuracy according to the number of training samples at Vessel id440XXX320.
Jmse 14 00116 g008
Figure 9. RMSE comparison with regression times at Vessel id440XXX000.
Figure 9. RMSE comparison with regression times at Vessel id440XXX000.
Jmse 14 00116 g009
Figure 10. RMSE comparison with regression times at Vessel id440XXX860.
Figure 10. RMSE comparison with regression times at Vessel id440XXX860.
Jmse 14 00116 g010
Figure 11. RMSE comparison with regression times at Vessel id440XXX320.
Figure 11. RMSE comparison with regression times at Vessel id440XXX320.
Jmse 14 00116 g011
Figure 12. Interpolation results comparison of vessel id440XXX000. Graph comparing (a) Interpolation results of all methods, (b-1) Part of the enlarged vessel trajectory-1, (b-2) Part of the enlarged vessel trajectory-2. In (a), background colors denote ENC features: brown (land), blue (sea), and light green (shallow water).
Figure 12. Interpolation results comparison of vessel id440XXX000. Graph comparing (a) Interpolation results of all methods, (b-1) Part of the enlarged vessel trajectory-1, (b-2) Part of the enlarged vessel trajectory-2. In (a), background colors denote ENC features: brown (land), blue (sea), and light green (shallow water).
Jmse 14 00116 g012
Figure 13. Interpolation results comparison of vessel id440XXX860. Graph comparing (a) Interpolation results of all methods, (b-1) Part of the enlarged vessel trajectory-1, (b-2) Part of the enlarged vessel trajectory-2. In (a), background colors denote ENC features: brown (land), blue (sea), and light green (shallow water).
Figure 13. Interpolation results comparison of vessel id440XXX860. Graph comparing (a) Interpolation results of all methods, (b-1) Part of the enlarged vessel trajectory-1, (b-2) Part of the enlarged vessel trajectory-2. In (a), background colors denote ENC features: brown (land), blue (sea), and light green (shallow water).
Jmse 14 00116 g013
Figure 14. Interpolation results comparison of vessel id440XXX320. Graph comparing (a) Interpolation results of all methods, (b-1) Part of the enlarged vessel trajectory-1, (b-2) Part of the enlarged vessel trajectory-2. In (a), background colors denote ENC features: brown (land), blue (sea), and light green (shallow water).
Figure 14. Interpolation results comparison of vessel id440XXX320. Graph comparing (a) Interpolation results of all methods, (b-1) Part of the enlarged vessel trajectory-1, (b-2) Part of the enlarged vessel trajectory-2. In (a), background colors denote ENC features: brown (land), blue (sea), and light green (shallow water).
Jmse 14 00116 g014
Figure 15. Performance comparison of each interpolation methods of vessel id440XXX000. The colored box within each violin plot indicates the interquartile range (IQR), and the vertical black line represents the whisker range.
Figure 15. Performance comparison of each interpolation methods of vessel id440XXX000. The colored box within each violin plot indicates the interquartile range (IQR), and the vertical black line represents the whisker range.
Jmse 14 00116 g015
Figure 16. Performance comparison of each interpolation of vessel id440XXX860. The colored box within each violin plot indicates the interquartile range (IQR), and the vertical black line represents the whisker range.
Figure 16. Performance comparison of each interpolation of vessel id440XXX860. The colored box within each violin plot indicates the interquartile range (IQR), and the vertical black line represents the whisker range.
Jmse 14 00116 g016
Figure 17. Performance comparison of each interpolation methods of vessel id440XXX320. The colored box within each violin plot indicates the interquartile range (IQR), and the vertical black line represents the whisker range.
Figure 17. Performance comparison of each interpolation methods of vessel id440XXX320. The colored box within each violin plot indicates the interquartile range (IQR), and the vertical black line represents the whisker range.
Jmse 14 00116 g017
Table 1. Summary of the AIS Dataset (June 2023–June 2024).
Table 1. Summary of the AIS Dataset (June 2023–June 2024).
Ship Type CodeNumber of VesselsLabelPercent (%)Ship Type CodeNumber of VesselsLabelPercent (%)
2012WIG0.160600Passenger, all ships of this type4.3
304536Fishing32.26512Passenger, Reserved0.1
31180Towing1.36912Passenger, No additional information0.1
32264Towing: length > 200 m or breadth > 25 m1.9701392Cargo, all ships of this type9.9
3360Dredging or underwater ops0.47712Cargo, Reserved0.1
3548Military ops0.37812Cargo, Reserved0.1
3684Sailing0.67912Cargo, No additional information0.1
37588Pleasure Craft4.280504Tanker, all ships of this type3.6
3872Special, Reserved0.58124Tanker, Hazardous category A0.2
4024HSC, all ships of this type0.28224Tanker, Hazardous category B0.2
4412HSC, Hazardous category D0.18312Tanker, Hazardous category C0.1
5036Pilot Vessel0.38412Tanker, Hazardous category D0.1
51108SAR (Search and Rescue)0.88512Tanker, Reserved0.1
521080Tug7.78948Tanker, No additional information0.3
5384Port Tender0.690360Other Type, all ships of this type2.6
5484Anti-pollution equipment0.69948Other Type, No additional information0.3
55168Law Enforcement1.203564Unknown25.3
Table 2. AIS transponder (Class A) reporting intervals as per vessel status.
Table 2. AIS transponder (Class A) reporting intervals as per vessel status.
Ship’s Dynamic ConditionsReporting Interval
Ship at anchor or moored and not moving faster than 3 knots3 min
Ship at anchor or moored and moving faster than 3 knots10 s
Ship 0–14 knots10 s
Ship 0–14 knots and changing course 3 1 3  s
Ship 13–23 knots6 s
Ship 14–23 knots and changing course2 s
Ship > 23 knots2 s
Ship > 23 knots and changing course2 s
Table 3. RMSE median comparison at each scenario (Table in Figure 15, Figure 16 and Figure 17).
Table 3. RMSE median comparison at each scenario (Table in Figure 15, Figure 16 and Figure 17).
ScenarioLinearSplineHermiteBezierBAARTRPCHIPMakima
(a) id440XXX0000.04824530.04813930.04813700.04898570.04813700.04813700.0481442
(b) id440XXX8600.37248410.37174990.37205460.37219310.37121460.37174990.3719443
(c) id440XXX3200.13376060.133344440.13294330.13296470.13260390.13294330.1330542
Table 4. Computational Complexity and Characteristics of Reconstruction Algorithms ( O : Order of time complexity, N: Number of trajectory points, d: Hidden state dimension size.).
Table 4. Computational Complexity and Characteristics of Reconstruction Algorithms ( O : Order of time complexity, N: Number of trajectory points, d: Hidden state dimension size.).
MethodTime ComplexityMain OperationKey Characteristics
Linear O ( N ) Linear InterpolationSimple, but physically unrealistic (C0 continuity)
Spline O ( N ) Tridiagonal Matrix SolverSmooth (C2), but prone to overshoot oscillations
Hermite O ( N ) Polynomial CalculationRequires tangent estimation, C1 continuity
Bezier O ( N ) Bernstein PolynomialsGeometric design focus, lacks local control for AIS
BAARTR O ( N ) Regression + Adaptive SplineReflects Ship Kinematics (Speed/Course), non-iterative
PCHIP O ( N ) Monotonicity CheckShape-preserving, no overshoot, less smooth
Makima O ( N ) Weighted Slope CalculationCompromise between Spline smoothness and PCHIP stability
Deep Learning O ( N × d 2 ) Matrix MultiplicationComputationally heavy, requires GPU & Training
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Choi, H.-j.; Kim, J.-s.; Lee, D.-h. BAARTR: Boundary-Aware Adaptive Regression for Kinematically Consistent Vessel Trajectory Reconstruction from Sparse AIS. J. Mar. Sci. Eng. 2026, 14, 116. https://doi.org/10.3390/jmse14020116

AMA Style

Choi H-j, Kim J-s, Lee D-h. BAARTR: Boundary-Aware Adaptive Regression for Kinematically Consistent Vessel Trajectory Reconstruction from Sparse AIS. Journal of Marine Science and Engineering. 2026; 14(2):116. https://doi.org/10.3390/jmse14020116

Chicago/Turabian Style

Choi, Hee-jong, Joo-sung Kim, and Dae-han Lee. 2026. "BAARTR: Boundary-Aware Adaptive Regression for Kinematically Consistent Vessel Trajectory Reconstruction from Sparse AIS" Journal of Marine Science and Engineering 14, no. 2: 116. https://doi.org/10.3390/jmse14020116

APA Style

Choi, H.-j., Kim, J.-s., & Lee, D.-h. (2026). BAARTR: Boundary-Aware Adaptive Regression for Kinematically Consistent Vessel Trajectory Reconstruction from Sparse AIS. Journal of Marine Science and Engineering, 14(2), 116. https://doi.org/10.3390/jmse14020116

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop