1. Introduction
The Internet of Underwater Things (IoUT) aims to create an interconnected network of underwater systems that facilitates real-time monitoring and data transfer within marine ecosystems. Developing a comprehensive oceanic wireless 3D observation grid enables extensive surveillance of the marine environment but also plays a pivotal role in disaster preparedness, resource exploration, and national defense. Challenges arise due to the distinct nature of aquatic and aerial communication mediums and the adverse conditions prevalent in marine settings [
1]. These challenges manifest as increased complexity in the communication channels spanning air and sea, as well as diminished efficiency in the data transmission capabilities of underwater networks.
Underwater Acoustic Communication (UAC) emerges as a pivotal technology within underwater wireless communication, offering a means to transmit data across substantial distances. It is capable of facilitating transmission over several kilometers at rates of hundreds of bits per second (bps) and across hundreds of meters at tens of kilobits per second (kbps) [
2]. This capability establishes it as the preeminent communication technology utilized in underwater wireless sensor networks. Despite its efficacy, the marine environment introduces considerable temporal and spatial variability, marked by intricate and mutable periodic variations. Such dynamics engender pronounced spatio-temporal uncertainty, which presents substantial challenges to effective communication in these aquatic realms.
In underwater settings, various factors, including currents, temperature shifts, and salinity changes, can adversely affect signal transmission, potentially leading to significant attenuation and distortion. The oceanographic characteristics’ distribution is pivotal in dictating the attenuation, reflection, refraction, and scattering of acoustic waves. This interaction results in the creation of complex convergence and shadow zones within the IoUT [
3,
4]. The elaborate distribution of these zones is critical in determining the connectivity and communication performance among IoUT nodes. Moreover, it significantly influences the network topology, thereby impacting the reliability and stability of communications. This multifaceted interplay underscores the challenges and necessitates a nuanced understanding of the marine environment to enhance and innovate underwater wireless communications.
The oceanic thermocline, located between the surface and deeper layers of the ocean, is a layer of water whose temperature decreases significantly with depth. The depth and thickness of the thermocline varies according to geographic location, season, and climatic conditions. The thermocline is particularly pronounced in temperate and tropical waters [
5]. Within the thermocline, the temperature drop is very rapid and can drop dramatically by tens of degrees over a vertical distance of a few tens of meters. This rapid temperature change significantly affects the speed of sound, causing the speed of sound waves to change as they pass through the thermocline. If the angle of incidence and temperature difference for sound wave propagation reaches certain conditions, the sound wave will also undergo a total reflection phenomenon [
6]. The total reflection phenomenon may lead to limited connectivity of the hydroacoustic network. It also affects the communication between different nodes in the IoUT. Hence, the precise forecasting of crucial oceanic characteristics is imperative for formulating routing strategies in underwater acoustic networks.
In the oceanic thermocline, key parameters such as temperature, salinity, and density vary significantly depending on ocean depth, geographic coordinates, and time factors. In a dynamic ocean environment, these variations pose challenges for sensor deployment and network optimization and have significant implications for achieving effective data transmission and communication maintenance. While providing routing and media access control strategies for designing underwater network topology nodes, current prediction techniques still fail to meet the demand for longer-term and more accurate predictions for underwater communication systems. Specific challenges in the current prediction task include the following: (1) Oceanographic features display seasonal and cyclical variations. Long-term changes and trends in the ocean, such as global climate-induced shifts in sea temperature and ocean acidification, can impact the nature of the thermocline. Only multi-step temporal forecasting can trace the evolution of thermocline characteristics over extended time scales. (2) The process of collecting ocean data involves significant data redundancy. Temporal forecasting tasks in such intricate marine environments require extensive data collection and analysis. Insufficient or excessive data can both negatively impact the accuracy of thermocline predictions, thereby compromising the reliability of the final forecasting outcomes.
Our research objective is to leverage the historical states and patterns of the ocean to conduct multi-step, high-accuracy investigations into the temporal dynamics of the thermocline. Current methodologies for temporal forecasting predominantly encompass numerical techniques grounded in mathematical or statistical principles, as well as models derived from deep learning paradigms [
7]. Within this spectrum, the numerical approaches relying on mathematical or statistical foundations typically employ polynomials, differential equations, and other mathematical constructs to approximate the patterns of ocean temperature variations [
8]. Although these methods are straightforward to implement and facilitate rapid prediction generation, they often fail to accurately represent the ocean’s inherently nonlinear characteristics, leading to suboptimal model fittings.
In oceanographic research, the application of deep learning methodologies has gained significant traction in recent years, particularly in analyzing temporal characteristics of marine environments. This transition from rudimentary neural network architectures to more sophisticated deep learning frameworks has proven efficacious in accurately modeling ocean temperature variations’ intricate, nonlinear dynamics. Notably, convolutional neural networks (CNNs) exhibited remarkable proficiency in abstracting spatial features, a pivotal capability for predicting future states, as elucidated in [
9]. Similarly, recurrent neural networks (RNNs) were adeptly employed for deciphering temporal dependencies in traffic speed prediction at discrete locations, drawing upon historical traffic data series, as expounded in [
10]. Despite these advancements, it remains a challenge for a singular deep learning model to comprehensively assimilate the nuanced characteristics inherent in a diverse dataset, often resulting in an inability to concurrently address both spatial and temporal correlations effectively.
To better understand temporal and channel characteristics, the researchers developed an approach integrating feature vectors, encapsulating spatial dependencies, into recurrent neural network (RNN) sequences [
11]. This integration aims to learn all spatio-temporal dependencies synergistically. Furthermore, Qin et al. [
12] introduced a predictive model for red tide disasters, amalgamating the autoregressive integrated moving average (ARIMA) model with Deep Belief Networks (DBNs). This innovative model is employed to meticulously analyze the temporal correlations and spatial heterogeneity inherent among factors sensitive to red tides.
While existing methodologies demonstrate remarkable efficacy in concurrent spatio-temporal forecasting, their primary utility is confined to two-dimensional terrestrial landscapes, rendering them inadequate for the intricacies of three-dimensional marine environments characterized by latitude, longitude, and depth dimensions. As elaborated in [
13], the proposed multilayer ConvLSTM (M-convLSTM) framework aims to prognostify internal oceanic temperatures. Nonetheless, the challenge of effectively parsing interdependencies within expansive and intricate datasets persists. Song et al. [
14] innovatively employed SKYLINE’s three-dimensional geospatial information technology to forge an interactive, tri-dimensional marine disaster forecasting system. This system, pioneering in its field, capitalizes on synchronized spatio-temporal monitoring principles and leverages a holistic approach to data management underpinned by data synchronization techniques. However, a critical examination of contemporary research shows a significant reliance on single-step forecasting methods in marine data forecasting. In contrast to past approaches, multi-step forecasts provide a broader window of time to understand the future state of the marine environment, thus providing an essential buffer for proactive marine environmental management strategies.
In this paper, we introduce Ocean-Mixer, an advanced deep learning framework engineered to predict oceanic feature distributions precisely. This innovative model comprises three integral components: an embedding module, a mixer module, and a prediction module. This paper delineates the intricate architecture and the operational efficacy of the Ocean-Mixer model, setting a new benchmark in oceanographic data analysis and prediction.
A rigorous and representative task demonstrated the efficacy of Ocean-Mixer: the model achieved time-series predictions of ocean features using raw observational data. We initiated our study by undertaking an analytical assessment of a genuine oceanic thermocline dataset, providing robust evidence to substantiate the redundant nature of temporal data. Additionally, we engaged in the comprehensive, multi-step forecasting of the thermocline’s temporal progression in select maritime regions. These regions exhibit variations in the thermocline’s position and configuration, influenced by geographic coordinates (longitude and latitude), depth, and temporal factors. Our investigation encompassed an evaluation of oceanic temperature and salinity fluctuations across different stages and depths. Through this, we deduced the positioning of the oceanic thermocline by projecting the associated temperature profiles. This paper delineates its principal contributions as follows:
- (1)
We observe a notable redundancy in the existing oceanic time series data, where both the original and down-sampled series demonstrated parallel temporal characteristics. This finding led us to mitigate the influence of redundant material information by decomposing the time sub-series, thereby enhancing the accuracy of oceanic time series forecasts.
- (2)
We employ the Mixer module, as proposed in this study, to achieve high-accuracy temporal predictions. Addressing the redundant attributes of oceanic data, we enhanced prediction precision and efficiency through strategic temporal series sub-sampling and sub-sequence integration.
- (3)
We utilize the Mixer module proposed in this paper to implement multi-step time series prediction, which captures long-range temporal information and predicts the future state and change trend by fixedly superimposing the decomposed temporal subsequence to the self-attention module, and the Mixer module retains the valid material information throughout the prediction process.
- (4)
We innovate a predictive methodology for oceanic time series, dubbed ‘Ocean-Mixer,’ which proficiently captures temporal and channel dependencies. This method distinguishes itself by its capacity to effectively utilize original, multi-step historical ocean data observations, enabling a broad spectrum of applications.
- (5)
We demonstrate the effectiveness and superiority of the proposed method Ocean-Mixer in ocean time-series prediction by extensive comparison with classical and state-of-the-art baseline methods.
The remainder of this paper is organized as follows: In
Section 2, we summarize related works. In
Section 3, we analyze data to motivate our work and give detailed problem definitions. In
Section 4, we present the details of our proposed Ocean-Mixer approach. In
Section 5, we conduct a series of experiments to demonstrate the effectiveness and superiority of our proposed method. Finally, we conclude this paper.
2. Related Works
Traditional methods for predicting ocean remote sensing data include linear, logistic, and support vector regression (SVR) [
15]. Cornejo [
16] implemented combined genetic algorithms and extreme learning techniques to forecast critical information in marine energy contexts. Jiang [
17] utilized SVR for the regression prediction analysis of ocean temporal characteristics. Gou et al. [
18] tackled the KNN algorithm for the task of predicting critical attributes of the ocean. Such algorithms tend to consider only one of the temporal and channel dependencies of the sea, ignoring the potential relationship between the two features.
In recent developments, deep learning has become the dominant method for predicting the state of the marine environment. This dominance is attributed to its powerful nonlinear computational ability to extract temporal and spatial features from unprocessed data. Schuckmann et al. [
19] proposed a methodology for a simple modeling scheme, which used historical ocean data to build a global ocean routine monitoring system. Cornejo et al. [
16] used population genetic algorithms and extreme learning machine methods to predict effective wave heights and wave energy fluxes for ocean energy applications. Although these methods can solve the limitations of traditional prediction methods and obtain higher prediction accuracy, there are still many shortcomings in feature capture for sequence data.
Sequential datasets are processed effectively using recurrent neural networks (RNNs) and long short-term memory (LSTM) structures. These techniques found significant applications in forecasting ocean water temperatures in deep learning. In their study, Zhang et al. [
20] approached the prediction of sea surface temperatures as a sequential data problem, employing the LSTM model. Yang integrated a fully connected LSTM layer with a convolutional layer, effectively merging spatial and temporal data for enhanced forecasting accuracy [
21]. This research presents an advanced methodology tailored for marine time series forecasting. It integrates the intricacies of spatio-temporal dynamics with the interplay of diverse oceanographic features, thereby markedly elevating the accuracy of the forecasts.
Recent research activities have increasingly concentrated on the thermocline. They [
22] discuss wind stress’s role in affecting the thermocline’s depth through processes like Ekman pumping. It also explores the impact of wind-driven circulation in the North Pacific concerning thermocline variability. Meanwhile, utilizing the sea temperature dataset, Jiang and colleagues successfully identified the depth of the upper thermocline in the South Sea. Additionally, their research provides new insights into the seasonal variations of the thermocline [
23].
To the best of our understanding, Mixer-Ocean represents a pioneering approach in multi-step time series forecasting models that tackle the redundancy in oceanic data. This innovative model captures the inherent connections among various oceanic features and simultaneously addresses temporal and spatial dependencies. In addition, it is compelling research to realize a multi-step thermocline position prediction.
5. Experiments
To fully validate the effectiveness of our proposed method on the problem of predicting ocean thermocline, in this section, comprehensive experiments are conducted on real oceanic datasets to answer the following research question (RQ):
RQ1: Does our proposed Ocean-Mixer method outperform classical and state-of-the-art baseline methods?
RQ2: Can Ocean-Mixer proficiently execute multi-step predictions?
RQ3: How does the choice of different parameters affect the prediction accuracy of Ocean-Mixer?
RQ4: Can the proposed Ocean-Mixer method adapt to time series prediction scenarios beneath the thermocline?
5.1. Preparation
(1)
Datasets. The data used in this experiment are derived from the monthly average temperature and salinity dataset of the bottom layer of the Bohai Sea, Yellow Sea, and East China Sea [
26]. The dataset covers the ocean from 117.01° E to 131.66° E and from 29.04° N to 42.09° N for the years 1997 to 2016, spanning a total of 20 years. The spatial resolution is (1/24)° × (1/24)°, and the vertical stratification ranges from 0 to 75 m with 30 standard layers. Different oceans are marked and distinguished using different data, such as longitude and latitude, and the overall data are stored in a unified format.
To enhance the efficiency of model training, we delineated an experimental area spanning from 110.5° E to 130.5° E and from 30° N to 40° N, with a depth range of 0–80 m, based on the dataset. This region is in the temperate zone, characterized by a notable temperature disparity between the surface and deep sea, resulting in a distinct thermocline. The collected sample data encompass multiple dimensions, such as longitude, latitude, water depth, collection time, temperature, and salinity. The selected samples have a wide spatial distribution and time span, which is advantageous for studying and analyzing the characteristics of the oceanic thermocline.
(2)
Metrics. In the context of time prediction tasks for ocean temperature and salinity, we employed two widely used evaluation metrics, Mean Absolute Error (MAE) and Root Mean Square Error (RMSE), to assess the predictive accuracy of all methods. The definitions of MAE and RMSE are as follows:
where
and
present the actual value and the predicted value of the
sample, MAE and RMSE convey the average prediction error.
(3) Settings. All the experiments are carried out on a 64-bit Windows server equipped with an Intel (R) Core (TM) i9-13900HX processor running at 2.20 GHz and featuring 64G of RAM.
5.2. Performance Comparison with Other Baseline Models (RQ1)
To assess the efficacy of our proposed Ocean-Mixer, we compared it with four different methods, including three baseline models, RR [
27], LSTM [
28], and KNN, as well as one advanced model, SCINet (Liu et al., 2022a) [
29].
To maintain the neutrality and integrity of our experimental results, we employ the cross-validation method to optimize the parameters for each algorithm in the comparative algorithm experiments. We randomly divide the historical data into 60% training and 40% testing sets. We further divide the training set into 10 mutually exclusive subsets of similar size to be used as validation sets. For each experiment, we randomly select two of these subsets as validation sets, with the remaining eight subsets serving as the training set, ensuring that there are five different combinations of training and validation sets. Cross-validation enhances the model’s ability to generalize to unknown data and better reflects the optimal performance of each algorithm.
Ridge Regression (RR): This method is employed in multiple regression analysis to address multicollinearity issues.
Long Short-Term Memory Network (LSTM): LSTM can preserve long-term dependencies while avoiding the problem of vanishing gradients caused by these long-term dependencies.
K-Nearest Neighbors (KNN): The KNN approach, based on regression, estimates the target value using the k closest neighbors. Through cross-validation, the optimal values selected for parameter k during the temperature and salinity prediction analysis were 16 and 6, respectively.
SCINet is a neural network model for time series forecasting based on dilated causal convolution.
We conducted a detailed comparison with four other methods to assess the effectiveness of our proposed Ocean-Mixer method in predicting ocean temperature and salinity time series. Specifically, our experiments were carried out on ocean salinity and temperature datasets, with a fixed prediction step length of 5, varying the ocean depth from 20 to 80 during the experiment.
Figure 5 shows that our proposed Ocean-Mixer achieves lower MAE and RMSE in ocean temperature and salinity prediction tasks than KNN. Although the KNN model can capture temporal features, it fails to capture channel features simultaneously. This further substantiates the superiority of our proposed Ocean-Mixer method. Moreover, the RR model struggles to make accurate predictions when faced with complex and redundant information. Additionally, while SCINet demonstrates good predictive performance in predicting thermocline salinity, it is less effective than Ocean-Mixer in predicting thermocline temperature, probably because ocean salinity does not change as drastically as temperature [
30].
As observed from
Figure 5, the overall distribution of predicted values by our proposed method, Ocean-Mixer, is closer to the actual values. In addition, we found that when predicting the temperature of the ocean thermocline, the deeper the seawater, the lower the accuracy of the prediction. One of the reasons may be that it is still challenging to make long-term, continuous, and high-resolution observations of the temperature change at the bottom of the thermocline due to the limitation of the ocean observation equipment, which implies that the model based on the observational data may have some uncertainty in predicting the temperature change at the bottom of the thermocline.
5.3. Multi-Step Ocean Time Series Prediction (RQ2)
Multi-step prediction helps to understand the potential future state of the marine environment, aiding in assessing long-term trends and seasonal variations. To evaluate the effectiveness of our proposed Ocean-Mixer method for multi-step time series forecasting, we conducted a detailed comparison with four other methods. Specifically, we conducted multiple experiments on ocean temperature and salinity datasets, with the fixed depth of the ocean set at 15 m. The prediction step lengths were set to 2, 4, 6, and 12 for comparison purposes.
Table 1 shows the forecasting accuracy of different models at various step lengths.
Table 1 presents the MAE and RMSE comparison results on the ocean thermocline temperature and salinity dataset at different step lengths. From
Table 1, we observe the following:
- (1)
With the help of sub-sampling and self-attention mechanisms, the Ocean-Mixer‘s MAE on the temperature dataset is 0.02 higher than the best baseline LSTM. For the salinity dataset, the forecasting accuracy of Ocean-Mixer decreases as the step length increases, but it still performs better than other models.
- (2)
As the forecasting step length increases, our proposed Ocean-Mixer method consistently maintains the highest accuracy in multistep-length forecasting tasks. This result demonstrates the superior performance of the Ocean-Mixer method in adapting to a variety of multistep-length forecasting tasks. Even minor improvements are significant in critically studying ocean thermocline time series prediction. Such advances have important implications for the routing design of underwater acoustic networks and the application of media access control in the Industrial Internet.
5.4. Results of Different Ocean-Mixer Variants (RQ3)
The experimental results in
Section 5.2 and
Section 5.3 show that the Ocean-Mixer method proposed in this paper outperforms the existing techniques in time series forecasting and especially demonstrates excellent performance when dealing with multi-step forecasting. In order to explore, in-depth, the specific effects of different variable values on the forecasting results, this section demonstrates the process of model parameter selection through two variant experiments. These include the embedding length set in the generation module and the length of the sampling sequence employed in the hybrid module. We maintained a parameter setting of ocean depth = 15 and step = 12 during the experiment.
Different sizes of embeddings. Previous studies confirmed that the size of the input layer plays a decisive role in the overall performance of the model. While increasing the dimensionality of the input layer improves the model’s ability to capture complex features, this expansion significantly increases the time and cost required to run the model. In addition, excessive growth may also weaken the model’s ability to generalize to new datasets. Therefore,
Figure 6 explores, in detail, how different embedding lengths affect model performance.
Figure 6 shows that embeddings that are too large and too small affect the ocean time series prediction performance in the ocean temperature and salinity datasets. At the same time, we observe that when embedding = 4, the model exhibits the most petite MAE and RMSE. As the size of the embedding increases, the evaluation metrics show an upward trend. These results intuitively demonstrate the optimal parameter selection of Ocean-Mixer, which can better help achieve ocean time series prediction.
Different number of interleaved subsequences.
Figure 7 illustrates the effect of the hyperparameter
, highlighting the influence of the number of interlaced subsequences in data after downsampling on the predictive capabilities. From the experimental results in
Figure 7a, it can be seen that the prediction accuracy improves as the number of interleaved subsequences increases. When the number of interleaved subsequences equals 256, the prediction accuracy is the best, indicating that a certain number of interleaved subsequences can effectively remove temporal redundancy in the dataset. When the number of interleaved subsequences exceeds a certain threshold, we should adjust other parameters to improve performance. Similar results were also observed on the salinity dataset in
Figure 7c,d. Therefore, selecting an appropriate number of interlaced subsequences is crucial during the application of the model.
5.5. Prediction of the Thermocline (RQ4)
In the Yellow and East China Seas, the thermocline emerges as a critical hydrographic feature within the continental shelf sea regions, exhibiting marked seasonal variations. This distinct layer demarcates the relatively warmer superficial waters from the cooler, deeper waters. The attributes of the thermocline are subject to interpretation, influenced by factors such as geographic positioning, seasonal dynamics, and additional environmental variables. Notably, we define a thermocline as a column of thermoclines when it is above the critical value
. Equation (8) shows how to calculate the thickness of the thermocline
, where
denotes the upper term of the thermocline and
denotes the lower term of the thermocline [
31].
In the computation of the thermocline, the primary task is to ascertain the curvature points
on the curve that delineates the vertical temperature distribution, which outlines the top and bottom boundaries of the thermocline. Subsequently, it becomes essential to determine the vertical span of the thermocline
. Equation (8) gives a specific calculation of the thickness of the thermocline
, where
denotes the upper term of the thermocline and
denotes the lower term of the thermocline. The final step involves calculating the ratio of the temperature disparity between the upper and lower boundaries of the thermocline to this vertical span, which indicates the thermocline’s strength
. When the temperature gradient is higher than the critical value
δ, we define a thermocline, and Equation (9) represents the gradient value
that meets the definition of a thermocline.
The marine environment, a complex system of ocean currents, tides, and atmospheric circulation, presents a high challenge for prediction, especially in predicting thermocline temperature changes. Seasonal thermocline variations are not only affected by climate variability but also constrained by regional environmental conditions, which makes the prediction of its temperature change more complex and challenging. Given these intertwined and complex processes, an accurate forecast of thermocline temperature change becomes challenging. In this section, the effectiveness of the Ocean-Mixer model in predicting thermocline temperatures in different months is explored in detail to provide further insights into our understanding of this complex ocean system.
The Ocean-Mixer utilizes historical observations to achieve accurate predictions of ocean temperature in a 3D scenario by capturing the correlation between the time dimension and channels.
Figure 8 illustrates the temperature prediction for the next 12 months (from 14 January to 14 December) using data from the previous 36 months, with a prediction step 12.
Figure 8 covers data from 0 to 80 m from ocean depths, where the red curve represents the investigated period’s observed temperature values, and the blue curve shows the predicted temperature values.
In
Figure 9, the red and blue lines indicate the investigated period and predicted temperature gradients
, respectively. Combining
Figure 8 and
Figure 9, the ocean temperature does not vary much with water depth during January, February, March, and December. However, from April onwards, the temperature fluctuations gradually increase with deeper water depth, especially in July, August, and September, when the temperature fluctuations peak, consistent with the real-world behavior of seasonal variations in the temperate ocean thermocline. This phenomenon also confirms the efficiency of our Ocean-Mixer model in capturing long-term time-series information and temperature trends, as well as its ability to fit nicely into real-world data.