A Spatiotemporal Feature-Driven Deep Learning Framework for Fine-Grained Tugboat Operation Recognition

Jia, Xiang; Feng, Hongxiang; Grifoll, Manel; Lin, Qin

doi:10.3390/systems14020225

¹ Faculty of Maritime and Transportation, Ningbo University, Ningbo 315211, China

² Barcelona Innovation in Transport (BIT), Department of Civil and Environmental Engineering, Universitat Politècnica de Catalunya-BarcelonaTech, 08003 Barcelona, Spain

³ College of International Economics & Trade, Ningbo University of Finance & Economics, Ningbo 315000, China

* Author to whom correspondence should be addressed.

Systems 2026, 14(2), 225; https://doi.org/10.3390/systems14020225

Submission received: 8 January 2026 / Revised: 15 February 2026 / Accepted: 19 February 2026 / Published: 23 February 2026

(This article belongs to the Special Issue Modeling and Optimization for Resilient and Sustainable Global Supply Chains)

Abstract

Accurate perception of tugboat operational status is essential for optimising port scheduling efficiency and ensuring operational safety. However, existing AIS-based methods often struggle to capture the fine-grained and asymmetric manoeuvring characteristics of tugboats, particularly in distinguishing assisted berthing from unberthing operations. To address these limitations, this study proposes a hybrid recognition framework integrating multidimensional feature engineering with spatiotemporal dynamics. First, a speed-threshold-based sliding window algorithm segments trajectories into sailing and berthing states. Second, a 15-dimensional feature vector, comprising statistical and descriptive features from speed, heading, and trajectory morphology, is constructed to characterise tugboat behaviour. Notably, morphological descriptors such as the ‘Overlap Ratio’ serve as implicit spatial proxies, capturing geographical constraints without reliance on Electronic Navigational Charts. A three-layer fully connected neural network (FCNN) is then developed to classify segments into “Cruising” and “Assisting in Berthing/Unberthing.” Finally, a speed-dynamics rule further distinguishes berthing from unberthing based on opposing temporal evolution patterns. Experiments on real AIS data from Ningbo–Zhoushan Port demonstrate that the model achieves an F1-score of 0.90 and a recall of 0.93 for assistance-related operations. Permutation importance analysis confirms that integrating kinematic and morphological features enables interpretable and precise intent inference. This study offers a high-precision, low-dependency solution for tugboat operation identification, supporting intelligent port surveillance and sustainable maritime management.

Keywords:

AIS data; tugboat operation identification; multidimensional feature engineering; fully connected neural network (FCNN); intelligent port surveillance

1. Introduction

The continued expansion of global trade has led to a substantial rise in maritime cargo volumes, accompanied by a pronounced shift toward larger vessels. According to the International Transport Forum [1], the average size of container ships has increased by approximately 90% since 1996. This rapid growth in vessel dimensions has intensified the complexity of berthing and unberthing, operations already recognised as among the most technically demanding tasks in port environments [2]. Within confined harbour waters, the manoeuvrability of large vessels becomes significantly restricted, resulting in higher operational difficulty and elevated safety risks [3]. In this context, the importance of tugboats as essential support units (providing auxiliary propulsion, navigational guidance, and safety assurance) has grown considerably [4,5]. Through pushing, pulling, and precision manoeuvring, tugboats play a crucial role in enabling large vessels to complete berthing and unberthing operations safely and efficiently [6]. However, the highly dynamic, short-duration, and task-dependent nature of tugboat maneuvers makes their operational states inherently difficult to identify automatically using conventional monitoring approaches.

Despite this, the management of tugboat operations still relies heavily on manual coordination and radio communication, which is inefficient and highly dependent on operator experience [7]. Such approaches are poorly suited to the short-distance, highly dynamic nature of tugboat activities and are unable to support reliable, large-scale, and fine-grained identification of tugboat operational states [8,9]. Consequently, there is an urgent need for data-driven, automated methods capable of accurately identifying tugboat operational states, thereby improving the transparency and intelligence of port management systems [10].

Accurate identification of tugboat operational states not only facilitates the optimisation of port resource allocation but also provides substantial additional value. The recognised states can be directly used to calculate ship carbon emissions under different operational modes with greater precision [11,12], thereby offering essential data support for the development of green ports [13,14]. Furthermore, efficiency assessments based on operation-state information can assist decision-makers in improving port management and operational scheduling [15]. Recent studies have also investigated the coupling between forward speed and tugboat hydrodynamics, suggesting that control strategies must account for these speed-dependent variations [16]. Accurate recognition of berthing and unberthing events also supplies critical information for compiling port statistics on ship arrivals and departures and for detecting loading and unloading activities [17,18]. Despite the richness of Automatic Identification System (AIS) data for analysing vessel behaviour, several challenges hinder its practical application: raw AIS records often contain substantial noise and missing data [19,20], and manual updates of AIS navigational status by crew members are prone to delays and inaccuracies [21]. These characteristics significantly limit the effectiveness of generic AIS-based learning models and highlight the need for robust state-identification methods tailored to tugboat operations.

Current research has largely concentrated on topics such as tugboat scheduling optimisation [8,22], manoeuvrability analysis [23], and assessments of emission characteristics [11,12], leaving a clear gap in the automatic identification of tugboat operational states. Existing studies suffer from several notable limitations. First, methods such as those proposed by [17] rely heavily on the accuracy of AIS-reported navigational status, which substantially limits their practical applicability. Second, although [24] used logistic regression to identify cooperative interactions between tugboats and large vessels, their method does not sufficiently distinguish the specific operational nature of these interactions, namely whether they correspond to berthing or unberthing activities. More critically, existing studies generally treat tug assistance activities as a single operational category, without explicitly analysing or quantifying the directional and temporal asymmetry between assisted berthing and assisted unberthing operations. As a result, the fundamentally different speed-evolution patterns embedded in these two processes remain largely underexplored, despite being essential for achieving high-precision tugboat operational state recognition. Additionally, most current studies have not fully exploited the multidimensional information inherent in AIS data (such as speed, heading, and trajectory morphology) to develop a more discriminative and comprehensive feature system [17].

Substantial progress has been made in applying artificial intelligence techniques to AIS data mining. For instance, anomaly detection frameworks utilizing clustering and random forests have effectively classified abnormal behaviors in ship trajectories [25]. In terms of predictive modeling, LSTM-based trajectory prediction methods have yielded F1 scores approaching 95% on benchmark datasets [26]. For visual perception, multi-scale CNNs have achieved remarkable precision in ship detection, reporting classification accuracies of 99%, with precision, recall, and F1-scores all reaching 0.99 [27]. Furthermore, the integration of knowledge graphs with multi-model stacking ensemble learning has offered novel technical guidance for predicting fines related to illegal fishing, thereby enhancing law enforcement efficiency [28]. Additionally, multimodal trajectory prediction frameworks have demonstrated the capability to forecast vessel attributes over 10 h in advance, significantly outperforming competitive baselines [29]. However, these approaches are predominantly designed for long-range navigation, large-scale trajectory continuity, or general vessel behavior analysis. As a result, they are not well suited to capturing the short, fragmented, and highly task-oriented maneuvers that characterize tugboat operations in port environments, and therefore fail to achieve fine-grained differentiation between assisted berthing and assisted unberthing operations under noisy AIS data conditions.

To address this research gap, this study presents an integrated recognition framework that combines trajectory segmentation, feature engineering, and deep learning. A speed-threshold-based sliding-window method is employed to segment trajectories, effectively distinguishing berthing from sailing states. A comprehensive 15-dimensional feature set—including 11 statistical and 4 descriptive features—is constructed to characterize tugboat operational behaviors. A fully connected neural network classifier is developed to capture non-linear interactions among multidimensional features, while explicit speed-dynamic rules are employed to further discriminate between assisted berthing and unberthing operations. The proposed approach is validated on real-world AIS data from Ningbo–Zhoushan Port, offering a practical and reliable solution for intelligent port surveillance.

Beyond proposing a classification framework, this study provides new methodological and empirical insights into tugboat operational behavior that have not been systematically explored in existing AIS-based research. Specifically, this work makes three key contributions:

(1): It explicitly reveals and quantifies the spatiotemporal asymmetry between assisted berthing and assisted unberthing operations, demonstrating that opposite speed-evolution patterns constitute a robust and interpretable discriminative cue. To the authors’ knowledge, few existing studies have analytically distinguished these two assistance modes using AIS trajectory dynamics.
(2): It introduces a set of trajectory morphological descriptors, such as the Overlap Ratio and start–end distance ratio, which function as implicit spatial proxies. These features enable accurate identification of berth-adjacent assistance behaviours without reliance on Electronic Navigational Chart (ENC) data, reducing data dependency and deployment cost.
(3): It develops a hybrid learning–rule recognition strategy that combines feature-based deep learning with domain-informed temporal rules, achieving a balance between classification accuracy, interpretability, and operational practicality for intelligent port surveillance systems. This design avoids excessive model complexity while maintaining robustness under limited and noisy training data.

The structure of the paper is organized as follows: Section 2 analyses the typical patterns and operational characteristics of tugboats. Section 3 presents the methodology, including data preprocessing, trajectory segmentation, feature extraction, and model development. Section 4 validates the proposed framework through a case study conducted at Ningbo–Zhoushan Port. Finally, Section 5 concludes the study and outlines potential directions for future research.

2. Problem Description

As illustrated in Figure 1, tugboat activities within the port environment can be categorised into four fundamental operational states based on motion characteristics and functional objectives: Berthing, Cruising, Assisting in Berthing, and Assisting in Unberthing. Each state displays distinct behavioural patterns in AIS trajectory data, as described below.

Berthing: The tugboat remains stationary or moves at very low speed, typically positioned near a dock, anchorage, or standby area. The AIS trajectory is characterised by densely clustered points, sustained speeds below 0.3 knots [30], minimal heading variation, and negligible spatial displacement.

Cruising: The tugboat travels at a relatively high and stable speed (typically 6–8 knots) with only minor fluctuations in heading. The trajectory is predominantly linear, covering considerable distances without notable turning or loitering behaviour.

Assisting in Berthing: When supporting the docking of large vessels, tugboats exhibit characteristic “wandering” behaviour [31], which is marked by frequent fluctuations in both speed and heading. Trajectories commonly contain reciprocal or back-tracking movements—for example, accelerating toward a rendezvous point and subsequently returning at reduced speed while escorting the vessel. A notable temporal feature is that speed is generally higher at the beginning of the operation and decreases significantly toward the end of the segment.

Assisting in Unberthing: Although sharing similar spatial features with berthing assistance—such as short-distance manoeuvres and looping trajectories—this state exhibits opposite temporal dynamics. The tugboat typically begins at a low speed and gradually accelerates, with speed increasing markedly toward the end of the trajectory segment (see Section 3.5).

These differences among operational states primarily arise from the geographical constraints of port environments and the dynamic interactions between tugboats and the larger vessels that they assist. Figure 2 presents the core behavioural characteristics and schematic representations of typical trajectories corresponding to each of the four operational states.

Building on these observations, this study constructs a feature system encompassing three dimensions: speed, heading, and trajectory morphology. The system comprises 11 statistical features and 4 descriptive features (see Section 3.3 for details), designed to quantitatively capture the distinguishing characteristics of each operational state. These features form the input to the subsequent classification model.

3. Methodology

3.1. Methodology Overview

Directly applying learning models to raw AIS trajectories is problematic for tugboats, as their trajectories typically consist of heterogeneous operational phases, including sailing, idling, and short-duration assistance maneuvers. These mixed patterns obscure the behavioral signatures of berthing-related assistance and significantly degrade classification performance. Therefore, trajectory segmentation is a necessary prerequisite to isolate behaviorally homogeneous segments for reliable state identification.

This study proposes an automated method for identifying tugboat operational states using AIS data, with the aim of classifying four typical states: Berthing, Cruising, Assisting in Berthing, and Assisting in Unberthing. The overall workflow, illustrated in Figure 3, comprises the following key steps:

Step 1: Trajectory Segmentation: AIS trajectories are segmented using a sliding-window approach based on a speed threshold, applied after data preprocessing (including outlier removal, filtering, and interpolation). This process removes noise, smooths the speed time series, and bridges minor data gaps, thereby ensuring high-quality inputs for segmentation. It provides a preliminary distinction between berthing and sailing phases.

Step 2: Feature Extraction: From each sailing segment, 11 statistical and 4 descriptive features are extracted to form a multidimensional feature vector that characterises the segment’s spatiotemporal behaviour.

Step 3: Initial Classification: A three-layer fully connected neural network (FCNN) is employed to classify each sailing segment as either “Cruising” or “Assisting in Berthing/Unberthing.”

Step 4: Fine-Grained Classification: Segments classified as “Assisting in Berthing/Unberthing” are further differentiated into “Assisting in Berthing” and “Assisting in Unberthing” based on their dynamic speed profiles.

As illustrated in Figure 3, the proposed method can be viewed, in a conceptual deployment scenario, as a data-driven decision-support system. The end-to-end workflow, from AIS data ingestion to state inference and operational feedback, can be embedded within an intelligent port-supervision framework. Model outputs may serve as direct inputs to human operators or automated decision-making modules, while the corrected states or operational responses generated in return can further refine subsequent inferences. This closed feedback mechanism enables adaptive optimisation over time and enhances the system’s applicability within smart-port infrastructures.

By integrating feature engineering with deep learning, the proposed method achieves the automated and high-accuracy identification of tugboat operational states, thereby providing robust technical support for intelligent port surveillance.

For clarity in subsequent sections, the primary variables and their definitions used in this study are summarised in Table 1.

It should be emphasized that the proposed workflow is not a purely engineering-driven pipeline. Each component is designed to address a specific limitation identified in existing studies: trajectory segmentation isolates operational phases, morphological features compensate for the absence of explicit spatial constraints, and the hybrid classification strategy explicitly captures the asymmetric temporal dynamics between assisted berthing and unberthing operations.

3.2. Trajectory Segmentation

The preprocessed AIS data (see Algorithm A1 in Appendix A) form the tugboat trajectory dataset $T_{k}$, representing the latitude $\phi_{k, m}$, longitude $\lambda_{k, m}$, smoothed speed over ground $v_{k, m}$, and course over ground $c_{k, m}$ of the k-th tugboat at time $t_{m}$, expressed as

T_{k} = (\phi_{k, m}, \lambda_{k, m}, v_{k, m}, c_{k, m}, t_{m})_{m}^{k}

(1)

To distinguish between sailing and berthing states, a sliding window method based on a speed threshold [32] is applied for trajectory segmentation. The specific steps are as follows:

Sort the AIS data chronologically and set the speed threshold $v_{\text{threshold}} = 0.3$ knots [33].
Slide a window with a step size of 1. When $v_{k, m} > v_{\text{threshold}}$ is detected consecutively, start recording a segment; the segment ends when $v_{k, m} \leq v_{\text{threshold}}$ occurs. This segment is labelled as a sailing segment.
Data points not included in any sailing segment are considered part of the berthing state and are excluded from subsequent feature extraction and classification.

Each sailing segment serves as a sample for subsequent feature extraction and model training (see Algorithm A2 in Appendix A).
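The segmentation steps above can be illustrated with a minimal sketch. It is not Algorithm A2 itself: the function name `segment_sailing` and the `min_points` parameter (one possible reading of "detected consecutively") are assumptions introduced here for illustration, and the input is assumed to be an already smoothed, chronologically sorted speed series in knots.

```python
from typing import List, Tuple


def segment_sailing(speeds: List[float],
                    v_threshold: float = 0.3,
                    min_points: int = 2) -> List[Tuple[int, int]]:
    """Return (start, end) index pairs, end exclusive, of sailing segments.

    A segment opens at the first sample whose speed exceeds v_threshold and
    closes at the first later sample at or below it. Runs shorter than
    min_points are discarded (min_points is an assumed, hypothetical setting).
    """
    segments = []
    start = None
    for m, v in enumerate(speeds):
        if v > v_threshold:
            if start is None:
                start = m  # open a new candidate segment
        elif start is not None:
            if m - start >= min_points:
                segments.append((start, m))
            start = None  # back in the berthing state
    # Close a segment that runs to the end of the series.
    if start is not None and len(speeds) - start >= min_points:
        segments.append((start, len(speeds)))
    return segments


speeds = [0.1, 0.2, 5.0, 6.0, 6.5, 0.2, 0.1, 0.4, 0.1, 3.0, 4.0]
print(segment_sailing(speeds))  # [(2, 5), (9, 11)]
```

Samples outside the returned index ranges correspond to the berthing state and would be excluded from feature extraction.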

3.3. Feature Engineering

To accurately characterize the heterogeneous operational patterns of tugboats, this study constructs a 15-dimensional feature vector comprising 11 statistical features and 4 descriptive features. The selection of these features is grounded in the kinematic physics of tugboats and the specific operational constraints of port environments (see Algorithm A3 in Appendix A).

First, regarding kinematic dynamics, although environmental factors (e.g., wind and current) are not explicitly input into the model, their physical effects are implicitly encoded within the statistical features of Speed Over Ground (SOG) and Course Over Ground (COG). For instance, a tugboat maintaining a position against strong currents will exhibit increased speed variance (SOG_diff_sum) and heading fluctuations (COG_change_mean) due to compensatory maneuvering. Therefore, these statistical features serve as effective proxies for external environmental disturbances, capturing the resultant vessel behavior without requiring separate meteorological data streams.

Second, regarding spatial morphology, the feature design explicitly considers the operational nature of tugboats as harbor-working vessels. Unlike long-haul merchant ships, tugboats operate within confined port waters characterized by short sailing distances and fixed dispatch routes.

3.3.1. Statistical Features

The statistical features comprise 11 indicators across three categories—speed, heading, and spatial characteristics—with their calculation formulas provided in Table 2 (where h denotes the Haversine distance function):

Speed Features: average speed, maximum speed, maximum speed change, median speed change, sum of speed changes, and mean speed change;

Heading Features: mean course change, maximum course change, median course change, and range of course changes;

Spatial Feature: straight-line distance between the start and end points of the trajectory segment.

Together, these features capture differences in motion stability and manoeuvrability across different tugboat operation states.
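As a rough illustration of how the 11 statistical features might be computed for one sailing segment, consider the sketch below. The exact formulas are those of Table 2; here it is assumed that a "speed change" is the absolute difference between consecutive SOG samples and that a "course change" is the absolute angular difference wrapped to [0°, 180°]. Feature names other than `SOG_diff_sum`, `SOG_change_median`, `COG_change_mean`, `COG_change_range`, and `d_start_end` are hypothetical, as is the Earth radius constant.

```python
import math
from statistics import mean, median

EARTH_RADIUS_M = 6371000.0  # mean Earth radius in metres (assumed constant)


def haversine(lat1, lon1, lat2, lon2):
    """Great-circle distance in metres between two points given in degrees."""
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dphi = p2 - p1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))


def course_change(c1, c2):
    """Smallest absolute angle between two courses, in degrees (0 to 180)."""
    d = abs(c2 - c1) % 360.0
    return min(d, 360.0 - d)


def statistical_features(lats, lons, sog, cog):
    """Compute the 11 statistical features for a segment with >= 2 samples."""
    dv = [abs(b - a) for a, b in zip(sog, sog[1:])]            # speed changes
    dc = [course_change(a, b) for a, b in zip(cog, cog[1:])]   # course changes
    return {
        "SOG_mean": mean(sog),
        "SOG_max": max(sog),
        "SOG_change_max": max(dv),
        "SOG_change_median": median(dv),
        "SOG_diff_sum": sum(dv),
        "SOG_change_mean": mean(dv),
        "COG_change_mean": mean(dc),
        "COG_change_max": max(dc),
        "COG_change_median": median(dc),
        "COG_change_range": max(dc) - min(dc),
        "d_start_end": haversine(lats[0], lons[0], lats[-1], lons[-1]),
    }
```

Wrapping the course difference avoids treating a turn from 350° to 10° as a 340° change, which would otherwise inflate the heading features of otherwise steady segments.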

3.3.2. Descriptive Features

To further capture the characteristics of trajectory morphology, four descriptive features are introduced:

1. Speed Change Ratio (SOG_diff_ratio):

\text{SOG\_diff\_ratio} = \frac{v_{\max}}{S_{\text{diff}}}

(13)

The SOG_diff_ratio quantifies the degree of speed fluctuation. This value tends to be lower during Assisting in Berthing/Unberthing operations due to frequent low-speed manoeuvring.

2. Start–End Point Distance Ratio (start_end_distance_ratio):

\text{start\_end\_distance\_ratio} = \frac{d_{\text{start-end}}}{d_{\max}}

(14)

start_end_distance_ratio reflects the extent of backtracking within a trajectory. Smaller values are typically observed during Assisting in Berthing/Unberthing, consistent with short-range zigzag movements.

3. Maximum Distance Ratio (max_distance_ratio):

\text{max\_distance\_ratio} = \frac{d_{\max}}{d_{\text{total}}}

(15)

max_distance_ratio describes the degree of trajectory tortuosity. Higher values are generally associated with Cruising, where movement is more linear and spatially extended.

4. Overlap Ratio (overlap_ratio):

\text{overlap\_ratio} = \frac{\text{overlap}_{\text{num}}}{\text{new}_{\text{num}}} \times \text{const}

(16)

overlap_ratio measures the extent of repeated trajectory coverage based on a 100 m × 100 m grid. A constant factor of 100 is applied to amplify the differences between operational states.
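The four descriptive features can be sketched as follows, under several stated assumptions that go beyond the text: positions are taken as already projected to local planar coordinates in metres (rather than evaluated with the Haversine function), the maximum distance is measured from the segment's start point, the total distance is the summed path length, and the overlap counts are interpreted as the number of re-entries into previously visited 100 m grid cells versus the number of first-time cell visits. The function and helper names are hypothetical.

```python
import math


def _safe_div(num, den):
    """Return num / den, or 0.0 when the denominator is zero."""
    return num / den if den else 0.0


def descriptive_features(xy, sog, cell=100.0, const=100.0):
    """Sketch of Eqs. (13)-(16) for one segment.

    xy  : list of (x, y) positions in metres (assumed local projection)
    sog : list of speeds over ground in knots
    """
    s_diff = sum(abs(b - a) for a, b in zip(sog, sog[1:]))      # sum of speed changes
    d_start_end = math.dist(xy[0], xy[-1])
    d_max = max(math.dist(xy[0], p) for p in xy)                # farthest point from start (assumed)
    d_total = sum(math.dist(a, b) for a, b in zip(xy, xy[1:]))  # path length (assumed)

    visited = set()
    overlap_num = new_num = 0
    prev_cell = None
    for x, y in xy:
        c = (math.floor(x / cell), math.floor(y / cell))
        if c == prev_cell:
            continue  # lingering in the same cell is not counted as a revisit
        if c in visited:
            overlap_num += 1
        else:
            visited.add(c)
            new_num += 1
        prev_cell = c

    return {
        "SOG_diff_ratio": _safe_div(max(sog), s_diff),
        "start_end_distance_ratio": _safe_div(d_start_end, d_max),
        "max_distance_ratio": _safe_div(d_max, d_total),
        "overlap_ratio": _safe_div(overlap_num, new_num) * const,
    }
```

For a short out-and-back track such as `[(0, 0), (150, 0), (250, 0), (150, 0), (50, 0)]`, this sketch yields a small start–end distance ratio (0.2) and a large overlap ratio (about 66.7), which matches the qualitative behaviour described for assistance operations.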

To ensure the robustness of the feature set and identify potential multicollinearity, a Pearson correlation analysis is incorporated into the evaluation framework. This step is critical for understanding the structural relationships between the proposed morphological descriptors and traditional kinematic features, the results of which are detailed in Section 4.3.

3.4. FCNN-Based Tugboat Trajectory Classification Model

Although extensive hyperparameter tuning was not performed, the selected FCNN architecture—with two hidden layers comprising 64 and 32 neurons, respectively—was chosen based on well-established empirical heuristics and prior experience in classification tasks of similar complexity. The use of ReLU activation and dropout regularization was intended to ensure efficient training and robust generalization. This configuration strikes a practical balance between model capacity and overfitting risk, making it suitable for moderate-sized AIS datasets with non-linear feature interactions (see Algorithm A4 in Appendix A).

3.4.1. Model Architecture

This study constructs a three-layer fully connected neural network (FCNN) to classify sailing segments into “Cruising” or “Assisting in Berthing/Unberthing”. The model structure is as follows:

Input Layer: Takes a 15-dimensional feature vector as input, with Z-score normalisation applied;

Hidden Layer 1: 64 neurons, ReLU activation, Dropout rate of 0.4, He initialisation;

Hidden Layer 2: 32 neurons, ReLU activation;

Output Layer: 2 neurons, Softmax activation, outputting class probabilities.

This relatively shallow architecture was chosen to balance model complexity with the available data, capturing non-linear feature interactions without excessive overfitting risk. The model uses the Adam optimiser with a learning rate of 0.0001. L2 regularisation (weight decay = 0.005) and early stopping (patience = 30) are incorporated to enhance generalisation capability. These hyperparameters were determined through preliminary experiments to optimize validation performance and prevent overfitting.
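To make the layer dimensions concrete, the following is a minimal forward-pass sketch of the 15–64–32–2 architecture in plain Python. It is illustrative only and is not the Keras implementation used in the experiments: training, the Adam optimiser, L2 regularisation, and early stopping are omitted, and applying He initialisation to every layer with zero biases is an assumption (the text specifies it only for the first hidden layer). The class name `TinyFCNN` is hypothetical.

```python
import math
import random


def he_init(n_in, n_out, rng):
    """He-normal weights: zero mean, standard deviation sqrt(2 / n_in)."""
    std = math.sqrt(2.0 / n_in)
    return [[rng.gauss(0.0, std) for _ in range(n_in)] for _ in range(n_out)]


def dense(x, w, b):
    """Affine layer: one output per weight row."""
    return [sum(wi * xi for wi, xi in zip(row, x)) + bi for row, bi in zip(w, b)]


def relu(z):
    return [max(0.0, v) for v in z]


def softmax(z):
    m = max(z)  # subtract the maximum for numerical stability
    e = [math.exp(v - m) for v in z]
    s = sum(e)
    return [v / s for v in e]


def dropout(h, rate, rng):
    """Inverted dropout: zero each unit with probability `rate`, rescale the rest."""
    keep = 1.0 - rate
    return [v / keep if rng.random() < keep else 0.0 for v in h]


class TinyFCNN:
    def __init__(self, seed=42):
        self.rng = random.Random(seed)
        self.w1, self.b1 = he_init(15, 64, self.rng), [0.0] * 64
        self.w2, self.b2 = he_init(64, 32, self.rng), [0.0] * 32
        self.w3, self.b3 = he_init(32, 2, self.rng), [0.0] * 2

    def forward(self, x, training=False):
        """x: 15 Z-score-normalised features -> [P(class 0), P(class 1)]."""
        h1 = relu(dense(x, self.w1, self.b1))
        if training:
            h1 = dropout(h1, 0.4, self.rng)  # dropout only after hidden layer 1
        h2 = relu(dense(h1, self.w2, self.b2))
        return softmax(dense(h2, self.w3, self.b3))
```

With these dimensions the network has (15·64 + 64) + (64·32 + 32) + (32·2 + 2) = 3170 trainable parameters, which gives a sense of how small the model is relative to the few hundred available segments.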

3.4.2. Training and Loss Function

The sparse categorical cross-entropy loss function is employed. Class weights are introduced to address sample imbalance:

L_{i} = - \sum_{c = 1}^{C} y_{i, c} \log ({\hat{y}}_{i, c})

(17)

L_{i}^{\text{weighted}} = w_{y_{i}} \cdot L_{i}

(18)

where C = 2, and $w_{y_{i}}$ represents the class weight of the true class $y_{i}$ of sample $i$. During training, SMOTE is applied to oversample the minority class, with a batch size of 32 and a validation set ratio of 10%.
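A small numerical sketch of Eqs. (17) and (18) may help. With integer (sparse) labels, the one-hot sum in Eq. (17) reduces to the negative log-probability of the true class. The class-weight values are not reported in this section, so the weights in the example are placeholders, and averaging over the batch is an assumption about the reduction.

```python
import math


def weighted_cross_entropy(probs, labels, class_weights, eps=1e-12):
    """Mean class-weighted cross-entropy over a batch.

    probs         : list of per-class probability lists (softmax outputs)
    labels        : list of integer class indices y_i
    class_weights : mapping from class index to weight w_c (placeholder values)
    """
    total = 0.0
    for p, y in zip(probs, labels):
        loss_i = -math.log(max(p[y], eps))   # Eq. (17) with a one-hot target
        total += class_weights[y] * loss_i   # Eq. (18)
    return total / len(labels)


probs = [[0.9, 0.1], [0.2, 0.8]]
labels = [0, 1]
# Hypothetical weights: errors on class 1 cost three times as much as on class 0.
print(weighted_cross_entropy(probs, labels, {0: 1.0, 1: 3.0}))
```

Raising the weight of the minority class increases the penalty for missing assistance segments, which pushes the classifier toward higher recall on that class at some cost in precision.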

3.4.3. Model Interpretability Strategy

Although deep learning models are often considered ‘black boxes,’ interpreting their decision-making logic is essential for safety-critical port operations. Instead of relying on static weight analysis, which can be unstable due to random initialization, this study employs the Permutation Importance method to quantify feature contribution.

The process involves randomly shuffling the values of a single feature $j$ in the test set while keeping others fixed, thereby breaking the association between the feature and the target. The importance score $I_{j}$ is defined as the degradation in model performance (accuracy):

I_{j} = \text{Accuracy}_{\text{orig}} - \text{Accuracy}_{\text{perm}, j}

(19)

A significant drop in accuracy indicates that the model heavily relies on feature j for prediction. This method provides an unbiased metric of feature relevance, robust to model structural variances.
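The permutation procedure of Eq. (19) can be sketched in a model-agnostic way, as below. The `predict` argument stands for any function mapping one feature row to a class label. Averaging the accuracy drop over `n_repeats` shuffles is an assumption added here to reduce the randomness of a single shuffle; the text describes only the single-shuffle definition.

```python
import random


def accuracy(predict, X, y):
    """Fraction of rows whose predicted label equals the true label."""
    return sum(predict(row) == t for row, t in zip(X, y)) / len(y)


def permutation_importance(predict, X, y, n_repeats=10, seed=42):
    """Return one importance score per feature column of X (a list of lists)."""
    rng = random.Random(seed)
    base = accuracy(predict, X, y)  # Accuracy_orig
    scores = []
    for j in range(len(X[0])):
        drops = []
        for _ in range(n_repeats):
            col = [row[j] for row in X]
            rng.shuffle(col)  # break the link between feature j and the target
            X_perm = [row[:j] + [v] + row[j + 1:] for row, v in zip(X, col)]
            drops.append(base - accuracy(predict, X_perm, y))  # Eq. (19)
        scores.append(sum(drops) / n_repeats)
    return scores


# Toy check: a rule that looks only at feature 0 should give feature 1 zero importance.
def rule(row):
    return int(row[0] > 0.5)


X = [[i / 9, float((i * 7) % 10)] for i in range(10)]
y = [rule(row) for row in X]
print(permutation_importance(rule, X, y))
```

A score near zero means the model's accuracy does not depend on that feature, whereas a large positive score indicates heavy reliance on it.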

3.5. Fine-Grained Classification: Distinguishing Assistance During Berthing and Unberthing

Based on the “assisting in berthing/unberthing” category output by the FCNN, a further distinction between assisting in berthing and assisting in unberthing is made using dynamic speed characteristics:

Divide the trajectory into first and second halves based on the temporal midpoint.

Calculate the average speeds of the first and second halves, denoted as $V_{1}$ and $V_{2}$:

If $V_{1} > V_{2}$, classify the segment as assisting-in-berthing;

If $V_{1} \leq V_{2}$, classify it as assisting-in-unberthing.

This rule is based on typical speed variation patterns of tugboats during berthing and departure operations, offering high interpretability and practical utility (see Algorithm A5 in Appendix A).
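The rule above is simple enough to state directly in code. The sketch below assumes timestamps are numeric (for example, seconds), sorted in ascending order, and that a sample exactly at the midpoint belongs to the first half; the function name is hypothetical and the sketch is not Algorithm A5 itself.

```python
from statistics import mean


def classify_assistance(times, speeds):
    """Split a segment at its temporal midpoint and compare mean speeds."""
    if len(times) < 2 or times[0] == times[-1]:
        raise ValueError("need at least two samples with distinct timestamps")
    t_mid = (times[0] + times[-1]) / 2.0
    first = [v for t, v in zip(times, speeds) if t <= t_mid]
    second = [v for t, v in zip(times, speeds) if t > t_mid]
    v1, v2 = mean(first), mean(second)
    # V1 > V2: fast approach, then slow escort -> berthing; otherwise unberthing.
    return "Assisting in Berthing" if v1 > v2 else "Assisting in Unberthing"


print(classify_assistance([0, 60, 120, 180], [6.0, 5.0, 2.0, 1.0]))  # Assisting in Berthing
print(classify_assistance([0, 60, 120, 180], [1.0, 2.0, 5.0, 6.0]))  # Assisting in Unberthing
```

Because the comparison uses only two averages, segments with nearly equal half-speeds fall to the unberthing side by construction; this boundary case may deserve separate inspection in practice.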

4. Case Study

4.1. Data and Experimental Setup

This study employed tugboat AIS data from Ningbo–Zhoushan Port for the year 2020 as the empirical dataset. A total of 572 tugboat operation trajectories were collected over six months. After applying the trajectory segmentation method described in Section 3.2, 483 valid voyage segments were obtained. These segments exhibited a notable class imbalance, with cruising segments approximately three times more numerous than assistance-related segments.

To address this issue, the Synthetic Minority Over-sampling Technique (SMOTE) was applied to the 15-dimensional feature space. This approach enhances the model’s ability to learn the decision boundaries of minority classes without generating physically impossible geographical trajectories. The balanced dataset was then randomly divided into a training set (338 samples) and a test set (146 samples) using a 70:30 ratio. Basic statistical characteristics of the dataset are summarised in Table 3.
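The reason SMOTE avoids fabricating trajectories is that it interpolates between existing feature vectors rather than between raw positions. A minimal sketch of this core idea follows; it is a simplified illustration and not the library implementation used in the experiments, and the neighbour count `k = 5`, the function name, and the seed handling are assumptions.

```python
import math
import random


def smote(minority, n_new, k=5, seed=42):
    """Generate n_new synthetic feature vectors from minority-class samples.

    Each synthetic point lies on the line segment between a randomly chosen
    minority sample and one of its k nearest minority neighbours.
    Requires at least two minority samples.
    """
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        x = rng.choice(minority)
        neighbours = sorted(
            (p for p in minority if p is not x),
            key=lambda p: math.dist(x, p),
        )[:k]
        nb = rng.choice(neighbours)
        gap = rng.random()  # interpolation factor in [0, 1)
        synthetic.append([a + gap * (b - a) for a, b in zip(x, nb)])
    return synthetic


minority = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
new_points = smote(minority, n_new=3, k=2)
print(len(new_points))  # 3
```

Since every synthetic vector is a convex combination of two observed feature vectors, each of its components stays within the range spanned by the observed minority samples.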

The proposed framework was implemented in a Python 3.12 environment utilizing the Keras deep learning library with a TensorFlow backend. All experiments were conducted on a workstation equipped with an AMD Ryzen 7 5800X 8-Core Processor and 32 GB of RAM. To ensure the reproducibility of the experimental results, a global random seed was fixed at 42 for all stochastic processes, including weight initialization and dataset partitioning. The classification model consists of a fully connected neural network (FCNN) with two hidden layers containing 64 and 32 neurons, respectively. To mitigate overfitting, L2 regularization (penalty coefficient $\lambda = 0.005$) was applied to the kernels of both hidden layers, coupled with a Dropout layer (rate = 0.4) after the first hidden layer. The model parameters were optimized using the Adam optimizer with a learning rate of 0.0001. During training, an Early Stopping mechanism was employed to monitor the validation loss; training was automatically terminated if no improvement was observed for 30 consecutive epochs (patience = 30), and the weights corresponding to the minimum validation loss were restored.

4.2. Model Training Results and Performance Analysis

As shown in Figure 4, the training process exhibited favourable convergence behaviour: the accuracy increased rapidly from an initial value of 0.54 and eventually stabilised above 0.95, while the loss decreased markedly from 1.41 to approximately 0.32. The validation curves closely matched the training curves, achieving a final accuracy of 0.9412 and a loss of about 0.325, indicating strong generalisation capability.

The application of SMOTE oversampling effectively mitigated the class-imbalance issue. During the early training phase, the model displayed a tendency to favour the majority class (Cruising). However, after oversampling and introducing class-weighted loss, the model’s ability to learn the minority class (Assisting in Berthing/Unberthing) improved substantially. This improvement is reflected in the high recall (0.93) for the assistance-related class during testing.

Detailed performance metrics on the test set are summarised in Table 4. Overall, the model achieved an accuracy of 89.73% and an F1-score of 0.90, demonstrating robust and balanced classification performance. Further analysis reveals the following:

For the Cruising class, the model achieved high precision (0.93) but relatively lower recall (0.86), suggesting a conservative classification tendency in which some borderline cases were misclassified as assistance operations.

For the Assisting in Berthing/Unberthing class, the high recall (0.93) indicates a strong recognition capability for this minority class. However, the slightly lower precision (0.87) suggests that certain atypical cruising behaviours were incorrectly identified as assistance-related activities.

These findings are consistent with the operational complexities of tugboat behaviour. Within port environments, cruising patterns may vary considerably due to factors such as traffic control, dynamic task assignments, and local navigation constraints, resulting in ambiguous boundaries between cruising and assistance operations.
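As a quick consistency check on the per-class figures quoted above, the F1-score is the harmonic mean of precision and recall. The short sketch below applies that standard definition to the precision and recall values reported in the text; it uses no data beyond those numbers.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)


# Assisting in Berthing/Unberthing: precision 0.87, recall 0.93
print(round(f1_score(0.87, 0.93), 3))  # 0.899
# Cruising: precision 0.93, recall 0.86
print(round(f1_score(0.93, 0.86), 3))  # 0.894
```

Both values round to roughly 0.9, which agrees with the reported F1-score of 0.90 and shows that the two classes are recognised with similar overall quality despite their opposite precision and recall profiles.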

4.3. Feature Distribution and Discriminative Analysis

To gain a deeper understanding of the model’s decision-making basis and validate the effectiveness of the proposed multidimensional feature set, a comprehensive analysis was conducted. This includes examining statistical distributions, evaluating feature correlations, and quantifying feature importance based on a permutation strategy.

4.3.1. Distributional Characteristics of Features

Figure 5 presents the distributions of the 11 statistical features across different operational states, revealing clear and interpretable patterns:

Speed-related features: During the Cruising state, the average speed is predominantly concentrated within the 6–8-knot range, and speed-variation indicators (such as SOG_diff_sum and SOG_change_median) remain low, reflecting stable and continuous navigation. By contrast, the average speed during Assisting in Berthing/Unberthing exhibits a more dispersed distribution (typically between 2 and 6 knots), and the total amount of speed change (SOG_diff_sum) is substantially higher. This is consistent with the frequent acceleration–deceleration behaviour characteristic of tugboats during close-range manoeuvring.

Heading-related features: A similar pattern is observed for heading features. In the Cruising state, the mean course change (COG_change_mean) is generally below 5°, and the variation range (COG_change_range) is typically under 30°, indicating stable course-keeping behaviour. In contrast, heading-variation metrics during Assisting in Berthing/Unberthing are considerably greater: COG_change_mean often exceeds 10°, and COG_change_range commonly surpasses 90°. These results align with the operational requirements of tugboats, which frequently adjust propulsion direction when assisting larger vessels.

Spatial features: Among the spatial indicators, the start–end point distance (d_start_end) demonstrates particularly pronounced differences. For Cruising, this distance typically exceeds 1000 m, reflecting sustained linear movement over longer distances. In comparison, during Assisting in Berthing/Unberthing, the distance generally falls below 500 m, consistent with operations conducted within confined harbour areas that involve repeated back-and-forth manoeuvres.

Figure 6 further illustrates the discriminative power of the four descriptive features:

SOG_diff_ratio: Under Cruising conditions, this ratio exhibits a clear unimodal distribution with a peak around 0.5, corresponding to a typical accelerate–steady–decelerate speed pattern. In contrast, during Assisting in Berthing/Unberthing, the values are more dispersed and generally lower, reflecting the more complex and variable speed dynamics characteristic of close-range manoeuvring.

Overlap_ratio: This feature demonstrates near-complete separation between the two classes. Most Cruising segments have overlap_ratio values below 10, whereas the majority of Assisting in Berthing/Unberthing segments exceed 20, strongly indicating the repeated zigzag movement prevalent in assistance operations.

Start–End Point Distance Ratio: For Cruising, this ratio is concentrated around 1.0, indicating nearly linear movement. By comparison, during Assisting in Berthing/Unberthing, the ratio predominantly falls within the 0.2–0.6 range, clearly capturing the characteristic zigzag behaviour.

Maximum Distance Ratio: The distribution of max_distance_ratio further reinforces these observations: values for Cruising are mostly above 0.7, while those for Assisting in Berthing/Unberthing are largely below 0.5.

These distinctive distribution patterns not only validate the efficacy of the proposed feature engineering but also provide interpretable evidence underpinning the model’s robust classification performance. Crucially, although the framework does not explicitly ingest Electronic Navigational Chart (ENC) data, it effectively accounts for geographical context through descriptive morphological features. In particular, the Overlap Ratio and Start–End Distance Ratio function as implicit spatial proxies. The combination of a high Overlap Ratio and a low Start–End Distance Ratio quantitatively captures the ‘wandering’ behaviour and spatial confinement characteristic of berth-adjacent assistance operations. By leveraging these indicators of tortuosity and repeated coverage, the model successfully discriminates between task-oriented manoeuvring and transit-oriented cruising, without relying on external spatial constraints.
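The morphological descriptors can be illustrated with a small sketch. It is an assumption-laden example rather than the authors' implementation: the grid-based overlap count (`grid_overlap`), the cell size `cell_deg`, and the rule that a revisit is counted only when the vessel re-enters a previously visited cell are all hypothetical choices, so the resulting overlap values are not on the same scale as those in Figure 6.

```python
import math

EARTH_RADIUS_M = 6_371_000.0


def haversine_m(lon1, lat1, lon2, lat2):
    """Great-circle distance in metres between two (lon, lat) points in degrees."""
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dphi = p2 - p1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))


def grid_overlap(points, cell_deg=0.001):
    """Count re-entries into visited cells and first visits (hypothetical rule)."""
    visited, previous = set(), None
    overlap_num = new_num = 0
    for lon, lat in points:
        cell = (math.floor(lon / cell_deg), math.floor(lat / cell_deg))
        if cell == previous:  # still inside the same cell: not a revisit
            continue
        if cell in visited:
            overlap_num += 1
        else:
            visited.add(cell)
            new_num += 1
        previous = cell
    return overlap_num, new_num


def morphology(points, cell_deg=0.001):
    """Ratios describing trajectory shape for a list of (lon, lat) points."""
    d_total = sum(haversine_m(*p, *q) for p, q in zip(points, points[1:]))
    d_start_end = haversine_m(*points[0], *points[-1])
    d_max = max(
        (haversine_m(*p, *q) for i, p in enumerate(points) for q in points[i + 1:]),
        default=0.0,
    )
    overlap_num, new_num = grid_overlap(points, cell_deg)
    return {
        "start_end_distance_ratio": d_start_end / d_max if d_max else 0.0,
        "max_distance_ratio": d_max / d_total if d_total else 0.0,
        "overlap_ratio": overlap_num / new_num if new_num else 0.0,
    }


# Synthetic tracks: a straight transit and an out-and-back shuttle.
forward = [(121.800 + 0.002 * i, 29.9) for i in range(5)]
transit = morphology([(121.800 + 0.002 * i, 29.9) for i in range(10)])
shuttle = morphology(forward + forward[::-1])
print(transit)
print(shuttle)
```

On these synthetic tracks the straight transit yields no overlap and a start–end ratio of 1, while the shuttle returns to its origin, so its start–end ratio collapses to 0 and its overlap ratio rises.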

4.3.2. Correlation and Redundancy Analysis

To assess the internal structure of the feature set and identify potential multicollinearity, a Pearson correlation analysis was performed (Figure 7).

The heatmap reveals two key structural insights:

Kinematic Redundancy: High positive correlations are observed among certain SOG-related indicators. For instance, SOG_diff_ratio and max_distance_ratio show a correlation coefficient exceeding 0.8. While this indicates information redundancy regarding speed dynamics, the FCNN model effectively mitigates the risk of overfitting through the application of L2 regularisation.

Independence of Descriptive Features: Crucially, the descriptive features exhibit a high degree of independence. The overlap_ratio shows low correlation with primary kinematic metrics (e.g., |r| < 0.45 with mean_SOG). This indicates that these features capture spatial structural information that is complementary to, rather than redundant with, the kinematic data.
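A redundancy screen of this kind can be sketched in a few lines. The example below is illustrative only: the feature names (`feature_a`, `feature_b`, `feature_c`) and their values are synthetic, and the 0.8 cut-off simply mirrors the correlation level mentioned above.

```python
import math
from itertools import combinations


def pearson_r(x, y):
    """Pearson correlation coefficient; 0.0 if either series is constant."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / math.sqrt(sxx * syy)


def redundant_pairs(features, threshold=0.8):
    """Return (name_a, name_b, r) for every pair with |r| above the threshold."""
    flagged = []
    for (name_a, xa), (name_b, xb) in combinations(features.items(), 2):
        r = pearson_r(xa, xb)
        if abs(r) > threshold:
            flagged.append((name_a, name_b, r))
    return flagged


# Synthetic columns: a and b are nearly collinear, c is only weakly related.
toy = {
    "feature_a": [1, 2, 3, 4, 5, 6],
    "feature_b": [2, 4, 6, 8, 10, 13],
    "feature_c": [1, -1, 1, -1, 1, -1],
}
flagged = redundant_pairs(toy)
print(flagged)
```

Only the nearly collinear pair is flagged; the weakly related column stays below the threshold against both of the others.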

4.3.3. Feature Importance and Model Interpretability

To further quantify the contribution of each feature to the model’s final decision, a Permutation Importance analysis was conducted. Unlike static weight analysis, this method measures the drop in model accuracy when a feature’s values are randomly shuffled, providing a robust metric of feature dependence. To ensure the statistical reliability of these estimates, the permutation process was repeated 10 times for each feature using different random seeds. The results are visualized in Figure 8, where the bar length represents the mean decrease in accuracy, and the error bars indicate the standard deviation across these 10 independent runs. This visualization explicitly captures the stability of each feature’s influence on the classification outcome.
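The procedure can be sketched as follows. This is a minimal, model-agnostic illustration rather than the code used in the study: `predict` stands in for the trained classifier, the toy data are synthetic, and the use of the population standard deviation for the error bars is an assumption.

```python
import random
import statistics


def accuracy(predict, X, y):
    """Fraction of rows whose predicted label matches the true label."""
    return sum(predict(row) == label for row, label in zip(X, y)) / len(y)


def permutation_importance(predict, X, y, n_repeats=10, base_seed=0):
    """Mean and standard deviation of the accuracy drop per shuffled feature."""
    baseline = accuracy(predict, X, y)
    results = []
    for j in range(len(X[0])):
        drops = []
        for rep in range(n_repeats):
            rng = random.Random(base_seed + rep)  # one seed per repeat
            column = [row[j] for row in X]
            rng.shuffle(column)
            X_perm = [row[:j] + [v] + row[j + 1:] for row, v in zip(X, column)]
            drops.append(baseline - accuracy(predict, X_perm, y))
        results.append((statistics.mean(drops), statistics.pstdev(drops)))
    return results


def predict(row):
    """Stand-in classifier: label 1 when feature 0 (a speed) is below 5."""
    return 1 if row[0] < 5.0 else 0


# Synthetic rows: [speed-like feature, ignored noise feature].
X = [[7.0, 0.3], [6.5, 0.9], [2.5, 0.1], [3.0, 0.7],
     [7.5, 0.5], [2.0, 0.2], [6.0, 0.8], [3.5, 0.4]]
y = [predict(row) for row in X]

importance = permutation_importance(predict, X, y)
for j, (mean_drop, std_drop) in enumerate(importance):
    print(f"feature {j}: mean drop = {mean_drop:.3f}, std = {std_drop:.3f}")
```

Because the stand-in classifier ignores the second feature, shuffling it leaves accuracy unchanged, whereas shuffling the first feature degrades it.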

The analysis reveals the following mechanisms underlying the model’s high performance:

Dominance of Temporal SOG Dynamics: The results indicate that SOG_change_mean and mean_SOG are the most critical predictors. Permuting these features leads to the most significant degradation in accuracy (approximately 4–6%). This suggests that the deep learning model primarily distinguishes “Cruising” from “Assistance” by learning the temporal dynamics of SOG—specifically, the contrast between stable high-speed transit and fluctuating low-speed maneuvering.

Descriptive Features as Spatial Proxies: Although the descriptive features (e.g., overlap_ratio) exhibited high discriminative power in the distributional analysis, their individual permutation importance scores were lower than those of the kinematic features. The likely cause is feature redundancy: when a single feature is permuted, related features can still supply much of the same information, so the measured accuracy drop understates its contribution. This does not diminish their value.

As noted in Section 4.3.1, the Overlap Ratio and Start–End Distance Ratio characterise trajectory morphology and thereby act as implicit spatial proxies for geographical context, even though the framework does not ingest Electronic Navigational Chart (ENC) data. A high Overlap Ratio combined with a low Start–End Distance Ratio captures the ‘wandering’ behaviour and spatial confinement of berth-adjacent assistance operations, which allows the model to separate task-oriented manoeuvring from transit-oriented cruising without reliance on external spatial constraints.

4.4. Visual Verification

To further validate the consistency between the model outputs and actual tugboat operations, a multilevel visual analysis was conducted. Figure 9 presents the state-identification results for tugboat trajectories (Tugboat: Yonggang 18, MMSI: 412036030; Dates: 1–2 January 2020) within the test set, with different colours indicating distinct operational states. The visualisations reveal several clear patterns:

Berthing states (blue) are predominantly located within designated port areas, closely matching the spatial distribution of known berthing points.

Cruising trajectories (orange) generally exhibit straight or gently curved paths connecting major port zones, consistent with the routing characteristics of routine tugboat deployment.

Assisting in Berthing (yellow) and Assisting in Unberthing (purple) trajectories cluster in waters adjacent to wharves and display dense zigzag patterns characteristic of tugboats performing close-range manoeuvring.

Figure 10 further validates the model’s reliability through a comparative analysis of the trajectories of a tugboat and the large vessels it assisted. AIS records from the same spatiotemporal scope were extracted—covering the tugboat Yonggang 18 (MMSI: 412036030, 1 January 2020) and the large vessels LUCKY EFFIE (MMSI: 229859000, entering port on 1 January 2020) and Huajiang 7 (MMSI: 413352710, departing on the same date). Their trajectories were rendered to scale based on actual vessel dimensions obtained from AIS static data. The resulting visualisation clearly illustrates the spatial interaction patterns between the tugboat and the assisted vessels:

During assisted berthing operations, the tugboat first travels at relatively high speed (often reaching 8–10 knots) toward the rendezvous point with the large vessel. After meeting the vessel, its speed decreases sharply to 2–3 knots as it escorts the ship at low speed toward the berth. The embedded temporal speed profile clearly illustrates this characteristic pattern of a high-speed approach followed by low-speed assistance.

In assisted unberthing operations, the speed pattern exhibits the opposite trend. The initial phase is characterised by low speeds (1–2 knots), reflecting the fine manoeuvring required to assist the large vessel in undocking, followed by a gradual increase in speed to 3–5 knots as the tugboat departs the area. These temporal dynamics are highly consistent with the subdivision criteria defined in Section 3.5.
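The opposing speed trends described above suggest a simple half-window comparison, sketched below. The function name and the example speed profiles are illustrative assumptions (the values are chosen within the ranges quoted in the text); the decision rule follows the logic of Algorithm A5, where a segment is relabelled as unberthing only when the second-half mean speed exceeds the first-half mean.

```python
from statistics import mean


def subdivide_assistance(speeds):
    """Label an assistance segment as berthing or unberthing from its speed trend."""
    if len(speeds) < 2:
        raise ValueError("need at least two speed samples")
    mid = len(speeds) // 2
    v_first, v_second = mean(speeds[:mid]), mean(speeds[mid:])
    # Rising speed (slow start, faster departure) indicates unberthing.
    if v_second > v_first:
        return "Assisting in Unberthing"
    return "Assisting in Berthing"


# Illustrative speed profiles in knots.
berthing_profile = [9.0, 8.5, 8.0, 3.0, 2.5, 2.0]
unberthing_profile = [1.0, 1.5, 2.0, 3.0, 4.0, 5.0]

print(subdivide_assistance(berthing_profile))    # Assisting in Berthing
print(subdivide_assistance(unberthing_profile))  # Assisting in Unberthing
```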

Visualisation results also reveal instances of multi-tug collaborative operations, where multiple tugboats jointly assist a single vessel. In such cases, the trajectories of the participating tugboats display both spatial clustering and behavioural consistency, further supporting the reliability of the model’s identifications. Conversely, occasional misclassifications—such as cruising segments within complex waterways being interpreted as assistance-related operations—highlight the challenges posed by intricate port environments and suggest potential avenues for future optimisation.

Overall, these visualisation outcomes not only confirm the consistency between model outputs and real-world tugboat behaviour but also demonstrate the practical applicability of the proposed method for intelligent port supervision. By accurately identifying tugboat operational states, port authorities can better evaluate operational efficiency, optimise resource allocation, and establish a reliable data foundation for carbon-emission assessment.

5. Conclusions

Focusing on the practical need for the automatic identification of tugboat operational states in port environments, this study proposes an integrated framework that combines trajectory segmentation, feature engineering, and deep learning. Based on empirical analysis using real AIS data from Ningbo–Zhoushan Port, the main conclusions are as follows:

First, the sliding-window trajectory-segmentation method based on speed thresholds effectively separates berthing and sailing states. By isolating behaviourally homogeneous trajectory segments, this step reduces the interference of mixed operational phases and, combined with the subsequent feature-extraction process, provides high-quality input samples for model construction. Second, the 15-dimensional feature system, covering speed, heading, and spatial-morphology dimensions, demonstrates strong discriminative capability. The Cruising state is characterised by higher speeds, minimal fluctuations, and long-distance linear movement, whereas Assisting in Berthing/Unberthing exhibits low speeds, greater variability, and short-distance back-tracking behaviours. These differences reflect the fundamentally distinct manoeuvring objectives of transit and assistance operations, forming a robust basis for operational state classification. Third, the proposed three-layer fully connected neural network classifier achieved an accuracy of 89.73% and an F1-score of 0.90 on the test set, with a recall of 0.93 for the assistance-related class, indicating strong recognition capability for minority-class samples. This result indicates that a feature-driven learning strategy can achieve high accuracy and robustness even under noisy and imbalanced AIS data conditions. Finally, the subdivision method for distinguishing between assisted berthing and unberthing based on speed dynamics, in which a higher initial speed corresponds to berthing and a lower initial speed corresponds to unberthing, was validated through trajectory visualisation. This finding supports the view that the spatiotemporal asymmetry between these two assistance modes constitutes an interpretable and physically meaningful discriminative cue.

The contributions of this study can be summarised in three main aspects. First, at the theoretical and methodological level, a complete technical framework for tugboat operational state identification was established, integrating trajectory segmentation, feature engineering, and a classification model. Rather than relying on purely end-to-end learning, the framework explicitly embeds domain knowledge into both feature design and decision logic, offering a balanced trade-off between accuracy and interpretability. This framework provides a transferable methodological reference for future research on vessel-behaviour identification. Second, at the application level, an identification solution based solely on conventional AIS data was developed, requiring no additional hardware and offering a low-cost approach suitable for intelligent port-surveillance systems. The avoidance of dependency on electronic navigational charts or dedicated sensors further enhances the deployability of the proposed method in real-world port environments. Third, at the feature-engineering level, the trajectory characteristics of tugboat operations were thoroughly analysed, leading to the design of a multidimensional feature system. In particular, the four descriptive features effectively captured subtle operational differences, demonstrating that trajectory morphology can function as an implicit spatial proxy for berth-adjacent behaviors.

Despite the promising results, several limitations warrant further investigation. First, the feature extraction approach in this study relies primarily on trajectory geometry and motion parameters, without incorporating external environmental factors such as port geography, tides, or weather conditions. Future work could integrate multi-source information, such as electronic navigational charts and meteorological data, to enhance the model’s environmental adaptability. Second, the current model focuses on the operational patterns of individual tugboats and does not explicitly consider multi-tug collaborative scenarios. Future research may employ graph neural networks or spatiotemporal interaction models to capture the relational dynamics between multiple tugboats and assisted vessels, thereby improving performance in complex operational settings. Third, the robustness of the method under extreme weather conditions has not yet been fully verified. Systematic evaluation using AIS data collected under diverse and adverse meteorological conditions would help assess generalisation capability. Fourth, as this study primarily analysed historical AIS datasets, further development toward real-time identification is needed. Building a streaming-data-based system could enable real-time monitoring and decision support for port scheduling and dispatching. Finally, the current framework relies primarily on kinematic trajectory data and overlooks the mechanical heterogeneity of tugboats. Factors such as rated engine power and propulsion system type significantly influence acceleration profiles and manoeuvring agility. Future research could incorporate these static vessel attributes to refine the classification logic, potentially developing power-dependent speed thresholds for more precise state recognition.

In conclusion, the tugboat operational state identification method proposed in this study demonstrates strong effectiveness and practicality, offering both methodological insight and operational value for fine-grained port surveillance. Future efforts will focus on enhancing model performance and expanding applicability through multi-source data integration, model optimisation, and real-time system development.

Author Contributions

Conceptualization, X.J. and H.F.; methodology, X.J. and H.F.; validation, Q.L.; formal analysis, M.G. and Q.L.; investigation, M.G. and Q.L.; resources and Q.L.; data curation, X.J.; writing—original draft preparation, X.J. and H.F.; writing—review and editing, H.F.; visualization, X.J.; supervision, H.F., M.G. and Q.L.; project administration, H.F., M.G. and Q.L.; funding acquisition, H.F., M.G. and Q.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the Ningbo Natural Science Foundation [2025J175], the “Five Batches” Project of Education Integration in Ningbo in 2021–2022 [Practice and Exploration of Maritime Construction and Navigation Safety Management for Teachers in Maritime Universities], the Zhejiang Provincial Soft Science Research Program Project [grant number 2026C25007] and the 111 Project [grant number D21013].

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors on request.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

AIS	Automatic Identification System
FCNN	Fully Connected Neural Network
COG	Course Over Ground
SOG	Speed Over Ground

Appendix A

Algorithm A1 Data Preprocessing for Tugboat AIS Data

Input:
Ddyn: Dynamic AIS data (MMSI → [Name, Timestamp, Lon, Lat, COG, SOG, HDG, ROT, Status])
Dstat: Static AIS data (MMSI → [Name, Timestamp, L, W, Lstern, Lport, Lstarboard, Draft])
Output:
Dproc: Preprocessed dataset for each tugboat MMSI (t, SOG, COG, Lon, Lat)

1: Dproc ← ∅ // Initialize result dataset
2: for each MMSI in Ddyn do
3:    if Dstat[MMSI].ship_type ∈ {31, 32, 52} then
4:       Dship ← MATCH(Ddyn[MMSI], Dstat[MMSI])
5:       Dship ← FILTER_REGION(Dship, StudyArea) // Remove records outside study region
6:       Dship ← REMOVE_DUPLICATES(Dship, key=Timestamp)
7:       Dship ← REMOVE_OUTLIERS(Dship, condition: Lat > 90° or Lon > 180° or COG > 360°)
8:       for each record pair (p_i, p_j) in Dship do
9:          d ← HAVERSINE(p_i.Lon, p_i.Lat, p_j.Lon, p_j.Lat)
10:         if d > mean(d) + 2σ(d) then
11:            remove p_j
12:         end if
13:      end for
14:      Dship.SOG ← KALMAN_FILTER(Dship.SOG) // Smooth abnormal speed values
15:      Dship ← LINEAR_INTERPOLATION(Dship, interval=6s)
16:      Dship.SOG ← KALMAN_FILTER(Dship.SOG) // Smooth interpolated speed
17:      Dproc ← Dproc ∪ Dship
18:   end if
19: end for
20: return Dproc
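The linear-interpolation step of Algorithm A1 (resampling to a 6 s interval) can be sketched as below. This is an illustrative helper, not the authors' code: the function name `resample_linear` and the sample values are hypothetical, timestamps are assumed to be strictly increasing (duplicates already removed), and the sketch suits scalar series such as SOG; angular quantities such as COG would need wrap-around handling.

```python
def resample_linear(times, values, step=6.0):
    """Linearly interpolate (time, value) samples onto a regular time grid."""
    if len(times) != len(values) or len(times) < 2:
        raise ValueError("need at least two aligned samples")
    out_t, out_v = [], []
    t, k = times[0], 0
    while t <= times[-1]:
        # Advance to the source interval [times[k], times[k + 1]] containing t.
        while times[k + 1] < t:
            k += 1
        t0, t1 = times[k], times[k + 1]
        w = (t - t0) / (t1 - t0)
        out_t.append(t)
        out_v.append(values[k] + w * (values[k + 1] - values[k]))
        t += step
    return out_t, out_v


# Hypothetical irregular samples: seconds since start, SOG in knots.
grid_t, grid_sog = resample_linear([0.0, 10.0, 18.0], [4.0, 6.0, 2.0])
print(grid_t)
print(grid_sog)
```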

Algorithm A2 Sliding Window Segmentation

Require: Trajectory dataset T = {(v_(k,m),c_(k,m),(λ_(k,m),ϕ_(k,m)),t_m)}, speed threshold v_threshold
Ensure: Set of trajectory segments S

1: Initialize empty set S ← ∅
2: for each tugboat trajectory T_k in T do
3:    Initialize window W ← ∅
4:    for each point p = (v_(k,m), c_(k,m), (λ_(k,m), ϕ_(k,m)), t_m) in T_k do
5:       if v_(k,m) > v_threshold then
6:          Append p to window: W ← W ∪ {p}
7:       else
8:          if W ≠ ∅ then
9:             Append W to S: S ← S ∪ {W}
10:         end if
11:         Reset window: W ← {p}
12:      end if
13:   end for
14:   if W ≠ ∅ then
15:      Append W to S: S ← S ∪ {W}
16:   end if
17: end for
18: return S
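One plausible reading of the speed-threshold segmentation is a run-length grouping of consecutive points that fall on the same side of the threshold. The sketch below follows that interpretation and is not a line-by-line transcription of Algorithm A2; the function name, the state labels, and the example threshold of 0.5 knots are assumptions made for illustration.

```python
from itertools import groupby


def segment_by_speed(speeds, v_threshold):
    """Split a speed series into runs of (state, start_index, end_index_exclusive)."""
    segments, index = [], 0
    for is_sailing, run in groupby(speeds, key=lambda v: v > v_threshold):
        length = len(list(run))
        state = "sailing" if is_sailing else "berthing"
        segments.append((state, index, index + length))
        index += length
    return segments


# Hypothetical SOG series in knots with an illustrative threshold.
print(segment_by_speed([0.1, 0.2, 5.0, 6.2, 6.0, 0.3, 0.1], v_threshold=0.5))
# [('berthing', 0, 2), ('sailing', 2, 5), ('berthing', 5, 7)]
```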

Algorithm A3 Feature Engineering

Require: Set of trajectory segments S
Ensure: Feature matrix F = {x_i}, label set Y = {y_i}

1: Initialize empty feature matrix F ← ∅
2: Initialize empty label set Y ← ∅
3: for each segment s in S do
4:    Compute number of points: n ← |s|
5:    Compute mean speed: \bar{v} ← (1/n) ∑ v_(k,m)
6:    Compute maximum speed: v_max ← max(v_(k,m))
7:    Compute speed changes: ∆v_i ← |v_(k,m+i+1) − v_(k,m+i)| for i = 1 to n − 1
8:    Compute sum of speed changes: s_diff ← ∑ ∆v_i
9:    Compute mean speed change: \bar{s}_change ← (1/(n − 1)) ∑ ∆v_i if n > 1, else 0
10:   Compute maximum speed change: s_change,max ← max(∆v_i) if n > 1, else 0
11:   Compute median speed change: s_change,median ← median(∆v_i) if n > 1, else 0
12:   Compute course changes: ∆c_i ← min(|c_(k,m+i+1) − c_(k,m+i)|, 360 − |c_(k,m+i+1) − c_(k,m+i)|) for i = 1 to n − 1
13:   Compute mean course change: \bar{c}_change ← (1/(n − 1)) ∑ |∆c_i| if n > 1, else 0
14:   Compute maximum course change: c_change,max ← max(∆c_i) if n > 1, else 0
15:   Compute median course change: c_change,median ← median(|∆c_i|) if n > 1, else 0
16:   Compute course range: c_range ← min(max(c_(k,m)) − min(c_(k,m)), 360 − (max(c_(k,m)) − min(c_(k,m))))
17:   Compute total distance: d_total ← ∑ haversine((λ_(k,m+i), ϕ_(k,m+i)), (λ_(k,m+i+1), ϕ_(k,m+i+1))) for i = 1 to n − 1
18:   Compute start-to-end distance: d_start_end ← haversine((λ_(k,m+1), ϕ_(k,m+1)), (λ_(k,m+n), ϕ_(k,m+n)))
19:   Compute maximum distance: d_max ← max(haversine((λ_(k,m+i), ϕ_(k,m+i)), (λ_(k,m+j), ϕ_(k,m+j)))) for all i, j ∈ s
20:   Compute start-to-end distance ratio: start_end_distance_ratio ← d_start_end/d_max if d_max ≠ 0, else 0
21:   Compute maximum distance ratio: max_distance_ratio ← d_max/d_total if d_total ≠ 0, else 0
22:   Compute speed difference ratio: SOG_diff_ratio ← v_max/s_diff if s_diff ≠ 0, else 0
23:   Compute overlap metrics: (overlap_num, new_num) ← Compute_Grid_Overlap(s)
24:   Compute overlap ratio: overlap_ratio ← overlap_num/new_num if new_num ≠ 0, else 0
25:   Construct feature vector: x_i ← (\bar{v}, v_max, s_change,max, s_change,median, s_diff, \bar{s}_change, \bar{c}_change, c_change,max)
26:   Append x_i to F: F ← F ∪ {x_i}
27:   Extract label y_i from s (assume consistent navtype in segment)
28:   Append y_i to Y: Y ← Y ∪ {y_i}
29: end for
30: return F, Y
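The kinematic part of the feature computation can be sketched as follows, with the course change wrapped so that a turn from 350° to 10° counts as 20° rather than 340°. The helper names are illustrative. Keys such as `mean_SOG`, `SOG_diff_sum`, `SOG_change_mean`, `SOG_change_median`, and `COG_change_mean` follow names used in the text, while `max_SOG`, `SOG_change_max`, `COG_change_max`, and `COG_change_median` are naming assumptions.

```python
from statistics import mean, median


def course_change(c1, c2):
    """Smallest absolute angular difference between two courses, in degrees."""
    diff = abs(c2 - c1) % 360.0
    return min(diff, 360.0 - diff)


def kinematic_features(sog, cog):
    """Speed and course statistics for one trajectory segment."""
    dv = [abs(b - a) for a, b in zip(sog, sog[1:])]
    dc = [course_change(a, b) for a, b in zip(cog, cog[1:])]
    return {
        "mean_SOG": mean(sog),
        "max_SOG": max(sog),
        "SOG_diff_sum": sum(dv),
        "SOG_change_mean": mean(dv) if dv else 0.0,
        "SOG_change_max": max(dv, default=0.0),
        "SOG_change_median": median(dv) if dv else 0.0,
        "COG_change_mean": mean(dc) if dc else 0.0,
        "COG_change_max": max(dc, default=0.0),
        "COG_change_median": median(dc) if dc else 0.0,
    }


# Hypothetical three-point segment: SOG in knots, COG in degrees.
features = kinematic_features([6.0, 6.5, 6.0], [350.0, 10.0, 5.0])
print(features)
```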

Algorithm A4 Neural Network Training

Require: Feature matrix F = {x_i}, label set Y = {y_i}
Ensure: Trained model M

1: Standardize feature matrix F → F_scaled
2: Map labels Y to {0, 1} (e.g., navtype 1 → 0, 2 → 1)
3: Apply SMOTE to balance dataset: (F_scaled, Y_mapped) ← SMOTE(F_scaled, Y_mapped)
4: Split dataset into training and test sets: (F_train, Y_train, F_test, Y_test) ← Split(F_scaled, Y_mapped, test_size = 0.3)
5: Initialize neural network M:
6:    Input layer (dimension = size of x_i)
7:    Dense layer (64 neurons, ReLU activation, L2 regularization)
8:    Dropout layer (rate = 0.4)
9:    Dense layer (32 neurons, ReLU activation, L2 regularization)
10:   Output layer (2 neurons, Softmax activation)
11: Compile model M:
12:    Loss function = categorical cross-entropy
13:    Optimizer = Adam (learning rate = 0.0001)
14:    Metric = accuracy
15: Configure early stopping: EarlyStopping (patience = 30, monitor = val_loss, restore_best_weights = True)
16: Train model M:
17:    Use training set (F_train, Y_train)
18:    Max epochs = 10,000
19:    Batch size = 32
20:    Validation split = 0.1
21:    Apply early stopping
22: Save model M
23: return M
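The early-stopping criterion in Algorithm A4 can be illustrated without any deep learning library. The sketch below is a simplified stand-in for the callback used in training: the function name and the validation-loss values are hypothetical, and it only reports at which epoch training would halt and which epoch's weights would be restored.

```python
def early_stopping_epoch(val_losses, patience=30):
    """Return (stop_epoch, best_epoch) for a sequence of validation losses."""
    best_loss, best_epoch, waited = float("inf"), -1, 0
    for epoch, loss in enumerate(val_losses):
        if loss < best_loss:
            best_loss, best_epoch, waited = loss, epoch, 0
        else:
            waited += 1
            if waited >= patience:
                return epoch, best_epoch  # halt; best_epoch weights are restored
    return len(val_losses) - 1, best_epoch  # ran out of epochs without stopping


# Hypothetical validation losses with a short patience for readability.
print(early_stopping_epoch([0.9, 0.7, 0.6, 0.65, 0.64, 0.66], patience=3))  # (5, 2)
```

With a patience of 30 and a cap of 10,000 epochs, as configured above, training would in practice end shortly after the validation loss plateaus rather than running to the cap.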

Algorithm A5 Tug refined classification

Require: Trajectory dataset T_k = {(v_(k,m), c_(k,m), (λ_(k,m), ϕ_(k,m)), t_m, Status_(k,m), MMSI_k)}, window size n_window, input file path, output file path
Ensure: Updated dataset T_k with modified Status_(k,m), segment information table S

1: Load dataset T_k from the input file
2: Sort T_k by t_m in ascending order
3: Initialize empty segment table S ← ∅
4: Initialize segment index segment_idx ← 1
5: Compute number of points n ← |T_k|
6: Initialize index m ← 1
7: while m ≤ n − n_window + 1 do
8:    Initialize window W ← ∅
9:    for i = m to m + n_window − 1 do
10:      Extract point p_i = (v_(k,i), c_(k,i), (λ_(k,i), ϕ_(k,i)), t_i, Status_(k,i), MMSI_k) from T_k
11:      Append p_i to W: W ← W ∪ {p_i}
12:   end for
13:   Compute number of points in window: n_w ← |W|
14:   if n_w = 0 then
15:      Increment m ← m + 1
16:      Continue
17:   end if
18:   Compute midpoint index: mid_idx ← ⌊n_w/2⌋
19:   Extract first half: first_half ← W[1 : mid_idx]
20:   Extract second half: second_half ← W[mid_idx + 1 : n_w]
21:   Compute mean speed of first half: V₁ ← (1/|first_half|) ∑ v_(k,i) for i ∈ first_half
22:   Compute mean speed of second half: V₂ ← (1/|second_half|) ∑ v_(k,i) for i ∈ second_half
23:   Compute middle timestamp: t_mid ← t_(m+mid_idx−1)
24:   Determine segment status: Status_seg ← most frequent Status_(k,i) in W
25:   Compute speed difference: Speed_diff ← (V₂ − V₁) · 100
26:   Store segment information in S:
27:      S.MMSI[segment_idx] ← MMSI_k[m]
28:      S.Start_Idx[segment_idx] ← m
29:      S.End_Idx[segment_idx] ← m + n_window − 1
30:      S.Status[segment_idx] ← Status_seg
31:      S.V₁[segment_idx] ← V₁
32:      S.V₂[segment_idx] ← V₂
33:      S.Speed_diff[segment_idx] ← Speed_diff
34:      S.Start_Time[segment_idx] ← t_m
35:      S.End_Time[segment_idx] ← t_(m+n_window−1)
36:      S.Mid_Time[segment_idx] ← t_mid
37:   Increment segment_idx ← segment_idx + 1
38:   Increment m ← m + 1
39: end while
40: for each segment i in S do
41:    if S.Status[i] = 2 and S.Speed_diff[i] > 0 then
42:       Set S.Status[i] ← 3
43:    end if
44: end for
45: for each segment i in S do
46:    Extract row indices: row_indices ← [S.Start_Idx[i] : S.End_Idx[i]]
47:    Update Status_(k,m) ← S.Status[i] for all m ∈ row_indices
48: end for

Figure 1. Classification of basic behaviour patterns and typical trajectories of tugs.

Figure 2. The trajectory shape, speed change and heading distribution of tugs in different states.

Figure 3. Flow chart (Non-English terms refer to Chinese geographical names).

Figure 4. Training of the model.

Figure 5. Statistical characteristic distribution (Blue indicates the Cruise status, red indicates the Assist Berthing/Unberthing status, and the overlapping area appears dark red).

Figure 6. Descriptive feature distribution.

Figure 7. Pearson Correlation Heatmap.

Figure 8. Permutation Importance with Error Bars (the error bars indicate the standard deviation calculated).

Figure 9. Trajectory Identification Figure. (The base map shows part of the waters of Ningbo Port, China; non-English terms refer to Chinese geographical names.)

Figure 10. Visual Validation of the Tugboat Test Set. (The base map shows part of the waters of Ningbo Port, China; non-English terms refer to Chinese geographical names.)

Table 1. Variables and definitions.

Variable Symbol	Explanation	Unit/Format
$v_{threshold}$	Berthing state speed threshold	Knot
$T_{k}$	Trajectory dataset of the k-th tugboat	Integer
$t_{m}$	Timestamp	Time
$ϕ_{k, m}$, $λ_{k, m}$	Latitude and longitude of the k-th tugboat at time $t_{m}$	WGS-84 Degree
$v_{k, m}$	Filtered speed over ground (SOG) of the k-th tugboat at time $t_{m}$	Knot
$c_{k, m}$	Course over ground (COG) of the k-th tugboat at time $t_{m}$	Degree
n	Number of AIS data points in the windowed trajectory segment	Integer
i, j	Indices of AIS data points within the windowed trajectory segment	Integer
$\bar{v}$	Mean speed within the windowed trajectory segment	Knot
$v_{max}$	Maximum speed within the windowed trajectory segment	Knot
$s_{change, max}$	Maximum speed change within the windowed trajectory segment	Knot
$s_{change, median}$	Median speed change within the windowed trajectory segment	Knot
$s_{diff}$	Sum of absolute speed changes within the windowed trajectory segment	Knot
${\bar{s}}_{change}$	Mean speed change within the windowed trajectory segment	Knot
${\bar{c}}_{change}$	Mean course change within the windowed trajectory segment	Degree
$c_{change, max}$	Maximum course change within the windowed trajectory segment	Degree
$c_{change, median}$	Median course change within the windowed trajectory segment	Degree
$c_{range}$	Range of course changes within the windowed trajectory segment: the span between the maximum and minimum course over ground (COG) values	Degree
$d_{start-end}$	Straight-line distance between start and end points of the windowed trajectory segment	Meter
$SOG\_diff\_ratio$	Speed change ratio of the windowed trajectory segment	Ratio
$d_{max}$	Maximum straight-line distance within the windowed trajectory segment	Meter
$start\_end\_distance\_ratio$	Start–end point distance ratio of the windowed trajectory segment	Ratio
$d_{total}$	Total traveled distance within the windowed trajectory segment	Meter
$max\_distance\_ratio$	Maximum distance ratio of the windowed trajectory segment	Ratio
$overlap_{num}$	Count of entries into overlapping grid cells within the windowed trajectory segment	Integer
$new_{num}$	Count of first entries into unique grid cells within the windowed trajectory segment	Integer
$overlap\_ratio$	Overlap ratio of the windowed trajectory segment	Ratio
$const$	Amplification factor	Integer
C	Total number of classes in the classification task	Integer
$y_{i, c}$	Binary indicator (0 or 1) of whether class c is the correct classification for observation i	Binary (0/1)
${\hat{y}}_{i, c}$	Predicted probability that observation i belongs to class c	Probability [0, 1]
$W_{y_{i}}$	Class weight assigned to the class of sample i to address sample imbalance	Decimal
${Accuracy}_{orig}$	Original model accuracy on the test set before feature permutation	Ratio
${Accuracy}_{perm, j}$	Model accuracy on the test set after permuting the values of feature j	Ratio
$L_{i}$	Cross-entropy loss value for the i-th sample	Dimensionless
$L_{i}^{weighted}$	Weighted cross-entropy loss for the i-th sample, adjusted by class weights to mitigate class imbalance	Dimensionless
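
The loss symbols in Table 1 can be illustrated with a minimal Python sketch. It assumes the standard forms $L_{i} = -\sum_{c} y_{i, c} \log {\hat{y}}_{i, c}$ and $L_{i}^{weighted} = W_{y_{i}} L_{i}$, which match the table's descriptions but are not spelled out in this section. The function names, the example probabilities, and the class weights are hypothetical and chosen only for illustration.

```python
import math


def cross_entropy(y_onehot, y_prob, eps=1e-12):
    """Per-sample cross-entropy: L_i = -sum_c y_{i,c} * log(y_hat_{i,c})."""
    # eps guards against log(0); it is an implementation detail, not from the paper.
    return -sum(y * math.log(max(p, eps)) for y, p in zip(y_onehot, y_prob))


def weighted_cross_entropy(y_onehot, y_prob, class_weights):
    """Per-sample weighted loss: L_i^weighted = W_{y_i} * L_i (assumed form)."""
    true_class = y_onehot.index(1)  # index of the correct class for this sample
    return class_weights[true_class] * cross_entropy(y_onehot, y_prob)


# Hypothetical two-class example (C = 2): class 1 is correct and weighted 2x.
y_onehot = [0, 1]
y_prob = [0.2, 0.8]
class_weights = [1.0, 2.0]  # illustrative values, not the weights used in the study

loss = cross_entropy(y_onehot, y_prob)
weighted_loss = weighted_cross_entropy(y_onehot, y_prob, class_weights)
print(f"L_i = {loss:.4f}, weighted L_i = {weighted_loss:.4f}")
```

Under this form, a misclassified sample from a more heavily weighted class contributes proportionally more to the total loss, which is how class weighting counteracts sample imbalance.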

Table 2. Statistical characteristics and calculation formula.

Symbol	Feature	Formula	Eq.
$\bar{v}$	Average Speed	$\bar{v} = \frac{1}{n} \sum_{i = 1}^{n} v_{k, m + i}$	(2)
$v_{max}$	Maximum Speed	$v_{max} = \max_{i} v_{k, m + i}$	(3)
$s_{change, max}$	Maximum Speed Change	$s_{change, max} = \max_{i} \left| v_{k, m + i + 1} - v_{k, m + i} \right|$	(4)
$s_{change, median}$	Median Speed Change	$s_{change, median} = \mathrm{median} \left( \left| v_{k, m + i + 1} - v_{k, m + i} \right| \right)$	(5)
$s_{diff}$	Sum of Absolute Speed Changes	$s_{diff} = \sum_{i = 1}^{n - 1} \left| v_{k, m + i + 1} - v_{k, m + i} \right|$	(6)
${\bar{s}}_{change}$	Mean Speed Change	${\bar{s}}_{change} = \frac{1}{n - 1} \sum_{i = 1}^{n - 1} \left| v_{k, m + i + 1} - v_{k, m + i} \right|$	(7)
${\bar{c}}_{change}$	Mean Course Change	${\bar{c}}_{change} = \frac{1}{n - 1} \sum_{i = 1}^{n - 1} \left| c_{k, m + i + 1} - c_{k, m + i} \right|$	(8)
$c_{change, max}$	Maximum Course Change	$c_{change, max} = \max_{i} \left| c_{k, m + i + 1} - c_{k, m + i} \right|$	(9)
$c_{change, median}$	Median Course Change	$c_{change, median} = \mathrm{median} \left( \left| c_{k, m + i + 1} - c_{k, m + i} \right| \right)$	(10)
$c_{range}$	Range of Course Changes	$c_{range} = \min \left( \max_{i} c_{k, m + i} - \min_{j} c_{k, m + j},\ 360 - \left( \max_{i} c_{k, m + i} - \min_{j} c_{k, m + j} \right) \right)$	(11)
$d_{start-end}$	Start–End Point Distance of Trajectory	$d_{start-end} = h (λ_{k, m}, ϕ_{k, m}, λ_{k, m + n}, ϕ_{k, m + n})$	(12)
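
As a rough illustration of Equations (2) to (12), the sketch below computes the statistical features for one windowed segment in plain Python. It is a sketch under stated assumptions, not the authors' implementation: the distance function h is assumed here to be the haversine great-circle distance with a mean Earth radius of 6,371 km, the function and dictionary key names are hypothetical, and the sample segment values are invented for the example.

```python
import math
from statistics import mean, median

EARTH_RADIUS_M = 6_371_000.0  # assumed mean Earth radius in metres


def haversine_m(lon1, lat1, lon2, lat2):
    """Great-circle distance in metres (assumed form of h in Eq. 12)."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))


def statistical_features(speeds, courses, lons, lats):
    """Statistical features of one windowed segment, following Eqs. (2)-(12)."""
    # Absolute point-to-point changes, taken literally as printed in Eqs. (4)-(10);
    # course differences are NOT wrapped at 360 degrees here.
    speed_steps = [abs(b - a) for a, b in zip(speeds, speeds[1:])]
    course_steps = [abs(b - a) for a, b in zip(courses, courses[1:])]
    span = max(courses) - min(courses)
    return {
        "v_mean": mean(speeds),                      # Eq. (2)
        "v_max": max(speeds),                        # Eq. (3)
        "s_change_max": max(speed_steps),            # Eq. (4)
        "s_change_median": median(speed_steps),      # Eq. (5)
        "s_diff": sum(speed_steps),                  # Eq. (6)
        "s_change_mean": mean(speed_steps),          # Eq. (7)
        "c_change_mean": mean(course_steps),         # Eq. (8)
        "c_change_max": max(course_steps),           # Eq. (9)
        "c_change_median": median(course_steps),     # Eq. (10)
        "c_range": min(span, 360 - span),            # Eq. (11)
        "d_start_end": haversine_m(lons[0], lats[0], lons[-1], lats[-1]),  # Eq. (12)
    }


# Invented four-point segment: SOG in knots, COG in degrees, WGS-84 positions.
features = statistical_features(
    speeds=[2.0, 4.0, 3.0, 6.0],
    courses=[350.0, 10.0, 20.0, 5.0],
    lons=[121.800, 121.801, 121.802, 121.803],
    lats=[29.900, 29.901, 29.902, 29.903],
)
for name, value in features.items():
    print(f"{name}: {value:.3f}")
```

Note that with the formulas taken as printed, a course step from 350° to 10° counts as 340° rather than 20°; only $c_{range}$ in Equation (11) applies a wrap-around correction. Whether the step-wise course changes are wrapped in practice is not stated in this section.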

Table 3. Basic Statistical Information of the Case Data.

Item	Value
Total Tugboat Data Points	623,136
Number of Trajectories	483
Average Time Interval	6 s

Table 4. Evaluation Metrics on the Test Set.

Class	Precision	Recall	F1-Score	Support
Cruising or Transfer	0.93	0.86	0.89	73
Assist in berthing/unberthing	0.87	0.93	0.90	73
Average/Total	0.90	0.90	0.90	146
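
As a quick consistency check on Table 4, the per-class F1-scores can be recomputed from the reported precision and recall using the standard definition F1 = 2PR / (P + R). The short Python sketch below uses only the rounded values printed in the table, so it can confirm agreement to two decimals but nothing finer; the variable names are illustrative.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)


# (precision, recall, reported F1) as printed in Table 4.
reported = {
    "Cruising or Transfer": (0.93, 0.86, 0.89),
    "Assist in berthing/unberthing": (0.87, 0.93, 0.90),
}

recomputed = {
    name: round(f1_score(p, r), 2) for name, (p, r, _) in reported.items()
}

for name, (_, _, f1_reported) in reported.items():
    print(f"{name}: reported {f1_reported:.2f}, recomputed {recomputed[name]:.2f}")
```

Both recomputed values agree with the reported F1-scores at two-decimal precision.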

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

