Ocean Currents Velocity Hindcast and Forecast Bias Correction Using a Deep-Learning Approach

Muhamed Ali, Ali; Zhuang, Hanqi; Huang, Yu; Ibrahim, Ali K.; Altaher, Ali Salem; Chérubin, Laurent M.

doi:10.3390/jmse12091680

Open AccessArticle

Ocean Currents Velocity Hindcast and Forecast Bias Correction Using a Deep-Learning Approach

by

Ali Muhamed Ali

^1,2,†

,

Hanqi Zhuang

^1,†

,

Yu Huang

¹

,

Ali K. Ibrahim

^1,3,

Ali Salem Altaher

¹

and

Laurent M. Chérubin

^3,*,†

¹

Electrical Engineering and Computer Science Department, Florida Atlantic University, Boca Raton, FL 33431, USA

²

Ministry of Higher Education and Scientific Research, Baghdad 10065, Iraq

³

Harbor Branch Oceanographic Institute, Florida Atlantic University, Fort-Pierce, FL 34946, USA

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

J. Mar. Sci. Eng. 2024, 12(9), 1680; https://doi.org/10.3390/jmse12091680

Submission received: 16 August 2024 / Revised: 4 September 2024 / Accepted: 18 September 2024 / Published: 20 September 2024

(This article belongs to the Section Ocean Engineering)

Download

Browse Figures

Versions Notes

Abstract

Today’s prediction of ocean dynamics relies on numerical models. However, numerical models are often unable to accurately model and predict real ocean dynamics, leading to a lack of fulfillment of a range of services that require reliable predictions at various temporal and spatial scales. Indeed, a numerical model cannot fully resolve all the physical processes in the ocean due to various reasons, including biases in the initial field and calculation errors in the numerical solution of the model. Thus, bias-correcting methods have become crucial to improve the dynamical accuracy of numerical model predictions. In this study, we present a machine learning-based three-dimensional velocity bias correction method derived from historical observations that applies to both hindcast and forecast. Our approach is based on the modification of an existing deep learning model, called U-Net, designed specifically for image segmentation analysis in the biomedical field. U-Net was modified to create a Transform Model that retains the temporal and spatial evolution of the differences between the model and observations to produce a correction in the form of regression weights that evolves spatially and temporally with the model both forward and backward in time, beyond the observation period. Using daily ocean current observations from a 2.5-year current meter array deployment, we show that significant bias corrections can be conducted up to 50 days pre- or post-observations. Using a 3-year-long virtual array, valid bias corrections can be conducted for up to one year.

Keywords:

bias correction; ocean current; loop current; U-Net; transformer

1. Introduction

Today’s prediction of ocean dynamics relies on numerical models. The improvement and development of parameterization methods and the increase in spatial resolution along with computing power have improved the accuracy of numerical predictions. However, numerical models are often unable to accurately model and predict real ocean dynamics, leading to a lack of fulfillment of a range of services that require reliable predictions at various temporal and spatial scales. Indeed, numerical models cannot fully resolve all the physical processes in the ocean for various reasons, including biases in the initial field [1] and calculation errors in the numerical solution of the model. Thus, bias-correcting methods have become crucial to improving the dynamic accuracy of numerical model predictions. Currently, there are mainly two kinds of methods for numerical forecast product correction: traditional statistical methods and data-driven methods. Post-processing of numerical model outputs with statistical methods is the most common approach. Such methods consist of model output statistics (MOS) [2,3], Kalman filtering [4,5], and Bayesian probability decision [6], among others. Other methods involve correcting the bias within the numerical model if the source is known by creating a bias-forecasting model, as shown in the study by Chepurin et al. [7]. Observations can also be biased, and a bias removal algorithm can be included in the data assimilation scheme used in the forecast [8].

Data-driven approaches to numerical forecast product correction rely on sustained and reliable observations of the parts of the ocean where the predictions are made. Because oceans play a key role in global issues such as climate change, food security, and human health, efforts to create long-term repositories with global quality assurance and quality control standards have received significant support over the last decade. These efforts have come to fruition in the Global Ocean Observing System (GOOS) [9]. GOOS has opened a field of opportunity for new collaborations across regions, communities, and technologies, facilitating enhanced engagement in the global ocean-observing enterprise to benefit all nations. In the United States, these efforts led to the creation of the U.S. Integrated Ocean Observing System (IOOS), which through its eleven Regional Associations, implements regional observation systems covering all U.S. coasts and Great Lakes with activities spanning from the head of the tide to the U.S. exclusive economic zone [10]. Through these observatories, comprehensive ocean datasets, spanning several decades, have become available to the research community and have promoted the emergence of machine learning methods [11].

Machine learning algorithms are thus poised to determine patterns and structures in the increasing volume of observational and modeled data [12,13,14,15,16,17]. Deep learning-based bias-correction methods have also been developed to address systematic forecast errors of numerical ocean models. One of the applications has been minimizing biases in sea surface temperature (SST) prediction, for example, in coupled models, where SST biases can have significant impacts on climate predictions and heat flux calculations [18]. Another application has been on the bias correction in sea surface height (SSH) prediction, which is inherently biased due to sampling errors [19]. Yang et al. [20] used an ANN to correct non-stationary bias in SST forecast products of the U.S. National Center for Environmental Prediction Climate Forecast System (CFSv2), which are widely used in climate research and hurricane prediction for example. Choi et al. [21] used a deep learning model to restore real-time SST before assimilation by filling the data gaps with the deep learning model. Liu et al. [19] proposed a bias correction of a sea surface height model using an ML sequence-to-sequence model, which is a special class of recurrent neural network. Their model was applied to the Loop Current (LC) system in the Hybrid Coordinate Ocean Model–Navy-Coupled Ocean Data Assimilation (HYCOM-NCODA) Gulf of Mexico (GoM) simulation, which exhibits bias errors against the Gulf of Mexico Coastal Ocean Observing System’s (GCOOS) altimetry data, chosen here as the truth. Altimetry-derived products (SSH anomaly and geostrophic currents) have been shown to be as, and at times, more accurate than the HYCOM analysis, as well as data-assimilative models in general—also in relation to the LC patterns despite inherent sampling errors [22]. For the chosen prediction period of up to 15 days, Liu et al.’s [19] approach improves HYCOM’s skills by learning to forecast the systematic bias in the HYCOM-NCODA, outperforming persistence and improving the HYCOM forecast. However, according to Fei et al. [23], one of the weaknesses of the current methods for numerical forecast product correction is the lack of spatio-temporal relationships between the model and the observations.

In the study herein, we propose a new ML-based approach to conduct a bias correction on the 3D velocity field of ocean current models. This approach takes into account the joint temporal and spatial evolution of both the model and the observational fields, which is encapsulated in the Transform Model concept. To demonstrate our method, we take advantage of a recent and comprehensive set of observations in the GoM (Figure 1a) that provided current measurements of the full water column [24] over a large area of the GoM. In recent decades, the GoM has received and continues to receive a lot of attention from oceanographers. The GoM ocean dynamics is controlled by the pulsating LC, which is the most energetic circulation feature of the basin. Predicting the LC evolution is fundamental to almost all aspects of the GoM, including (1) anthropogenic and natural disaster response; (2) the prediction of short-term weather anomalies and hurricane intensity and trajectories; (3) national security and safety; and (4) ecosystem services [25]. The Deepwater Horizon oil spill in 2010 has bolstered a large amount of research toward improving our understanding and forecasting of the LC System in hope of mitigating further environmental and ecological damage. Furthermore, because of the reinforcing interaction between hurricanes, the LC and its eddies, long-term prediction of the LC states is becoming more and more relevant [26,27]. Thus, developing accurate and robust medium-term forecasting models of the LC System and its eddy formation is imperative and relies on the availability of ocean observations. Improved ocean observing systems are expected to reduce the uncertainty of ocean/weather forecasting and to enhance the value of ocean/weather information throughout the Gulf region [28].

The remainder of the paper is organized as follows. In Section 2 we provide an overview of the Transform Model concept. Section 3 describes the datasets, including the observations and the numerical model fields, the deep learning model and its implementation. Both free and data-assimilated simulations were used in this study. The metrics of the bias correction evaluation are also presented in this section. Section 4 presents the results of the bias correction and their evaluation against the testing datatset. Discussion and conclusion follow in Section 5.

2. Transform Model Concept

The rationale behind this concept lies in the fact that the dynamic history contained in the observations can be learned with a deep learning model, as shown by prediction exercises of the LC dynamics [29,30,31,32]. Deep learning allows computational models that are composed of multiple processing layers to learn representations of data with multiple levels of abstraction. These methods have dramatically improved the state-of-the-art in speech recognition, visual object recognition, object detection and many other domains such as drug discovery and genomics. Deep learning discovers intricate structures in large data sets by using the back-propagation algorithm to indicate how a machine should change its internal parameters that are used to compute the representation in each layer from the representation in the previous layer [33].

Numerical ocean models are mathematical approximations of the laws of physics that are used to estimate ocean states given a set of boundary conditions and constraints, such as the atmospheric and tidal forcing. When not constrained by data assimilation, the simulated states of the “free-run” are likely to be very different from the real ocean state, although the main features of the ocean dynamics will be captured and resolved. These features include the mesoscale dynamics, the water masses, and the eddy kinetic energy, among others, whose variability and magnitude is likely to be dissimilar to the observed ones, although present in both natural and virtual systems. Therefore, one can assume that both the simulated and real systems are on a parallel track although different in many ways.

Data assimilation techniques have been used in numerical predictions as a means of reducing the bias between the model’s field and observations at the time of the forecast when the background field is created. The primary objective is to optimally combine the observations and the model output. Observations assimilated in ocean forecasting systems now include altimetry, ocean color, surface velocities, sea ice and data from emerging platforms such as ocean gliders. However, direct measurements of surface current velocities are rare and subsurface current velocities even rarer. Many systems now employ multi-model approaches or ensemble modeling techniques. A variety of data assimilation are used in operational oceanography, and a detailed review of the data assimilation methods can be found in the study by Moore et al. [34]. Despite the large amount of research and the widespread use of data assimilation techniques in numerical model prediction, errors between assimilated models and real ocean dynamics remain significant. The useful forecast window of numerical ocean models is no longer than two days over a given area with the best initialization possible, as shown by Cooper et al. [35] in a dynamically active current system, such as the LC in the GoM. In this study, we show that remaining biases in data assimilated numerical models such as HYCOM can be reduced by our Transform Model when water column current velocities are used and the correction extended up to one year.

While deep learning has mostly been applied to a single data set as input for a variety of applications, we show in this study that it can be re-purposed to learn the differences between the simulated and real systems and their evolution over time and space. The acquisition of that knowledge between a given ocean numerical model simulation and a set of observations in the model domain over a time period concurrent to both enables the prediction of the differences and thus bias with a deep learning model. Therefore, the knowledge of the differences can be converted into a model field observations-based correction tool, called a Transform Model, with the advantage of predicting the correction of the model field beyond the period of observation. This method also accounts for the common history of the model fields and measurements in the correction process. We will show that the improvements to the model fields, namely the reduction in errors, are significant, despite the fact that no physical constraints were applied to the correction.

The concept of the Transform Model is depicted in Figure 1b. It is based on the availability of a long enough concurrent time series of model fields and observations. Both are jointly used in a deep learning model that creates a transformation tool of the numerical model field based on the observations. The Transform Model can then be used to correct the numerical model field at any time in the future and the past outside of the period of measurements.

3. Methods

3.1. Data Sets

For the demonstration of our Transform Model, we used three data sets that overlap in time, composed of two numerical model’s current velocity vector fields and one set of in-situ current measurements from an observing array in the GoM (Figure 1a). The model’s inherent dynamics may influence the effectiveness of the Transform Model, which will be demonstrated later on in the paper.

The first numerical model data set was obtained from the HYbrid Coordinate Ocean Model (HYCOM) Consortium. The data were generated by the HYCOM + NCODA Gulf of Mexico

1 / 25^{\circ}

Reanalysis (GoMu

0.04

/expt_

50.1

), which spans from 1993 to 2012 and has a horizontal resolution of

4.4

km. HYCOM + NCODA is a data-assimilative hybrid isopycnal-sigma-pressure (generalized) coordinate ocean model. The system uses the NCODA system [36,37] for data assimilation. NCODA uses the model forecast as a first guess in a three-dimensional variational scheme [37] and assimilates available satellite altimeter observations (along-track obtained via the NAVOCEANO Altimeter Data Fusion Center), satellite and in-situ sea surface temperature (SST) as well as available in-situ vertical temperature and salinity profiles from XBTs, ARGO floats and moored buoys. The Modular Data Assimilation System (MODAS) is used for downward projection of surface information [38]. The experiment includes tidal forcing. As noted here, no current vector flow information is used in the data assimilation process. Surface forcing values were obtained from the National Centers for Environmental Prediction (NCEP) Climate Forecast System Reanalysis (CFSR [39]).

The second model’s current vector fields were obtained from a free-running Massachusetts Institute of Technology (MIT) generalized circulation model simulation of the GoM circulation (MITgcm-GoM)—a z-level model. Surface forcing values were obtained from the National Centers for Environmental Prediction (NCEP)–National Center for Atmospheric Research reanalyses ([40]). The MITgcm–GoM model was originally developed for state estimation and prediction of the upper ocean circulation in the GoM, including LC evolution and eddy shedding. For these purposes, in its original version, satellite-derived ocean surface observations and subsurface in situ observations were assimilated using a four-dimensional variational (4DVAR) method [41]. For the data set used in this study, however, no data assimilation was performed. The model uses a telescopic grid with a horizontal resolution of 1/20° × 1/20° in the central GoM, which decreases to 1/10° × 1/10° toward the boundaries and western part of the domain. For this study, daily outputs from the simulation between 2009 to 2012 are used. No tidal or atmospheric pressure forcing was applied. More details on the simulation can be found in Morey et al. [42].

These two models are commonly used for forecasting and scientific studies. They were evaluated based on their basic characteristics of the deep GoM circulation [42]. Aspects of the models or their configurations were shown to impact the representation of major features of the deep circulation. Because of these differences and the fact that HYCOM simulation is data-assimilated and the MITgcm-GoM is not, we can estimate the relative difference in the bias correction of the two models’ simulations. The assimilated model field should require fewer corrections than the “free run” simulation.

The in-situ data were obtained from the Dynamics of the Loop Current in the U.S. Waters experiment (hereafter Dynloop)—a comprehensive observational study of the LC in the eastern GoM [24,43,44]. The observational array consisted of the instrumentation of nine tall moorings and seven short moorings, and an array of 25 pressure-equipped inverted echo sounders (PIES) (Figure 1a). The array system measured the water column velocity for 2.5 years (905 days), beginning in March 2009. Each tall mooring included water velocity measurements made from an upward-facing 75 kHz ADCP deployed near a depth of 450 m, as well as point current meters at five depths from around 600 to 3000 m, providing high-resolution water velocity data at depths between about 60 and 440 m and much lower resolution water velocity data below this. Additionally, each short mooring included a single point current sensor located 100 m above the sea floor, providing near-bottom water velocity data. The PIES array provided direct measurement of pressure and acoustic round-trip travel times from the sea floor to the sea surface, which were used to create vertical profiles of density, salinity, and temperature. These pressure records combined with estimated horizontal density gradients were used to calculate geostrophic water velocities. This array was located to cover both east and west sides of the LC between the West Florida Slope and the Mississippi Fan and was also centered over the zone where LC eddies typically separate from the LC. The horizontal separation between moorings was around 50–80 km and between the PIES sensors was around 40–50 km. These recorded data were used to construct the measurement-based water velocity matrix used in this study. More details on the creation of the velocity matrix can be found in [31]. The horizontal resolution varies between 30 and 50 km, and only the first 500 m, corresponding to 26 vertical layers, was used in this study. The time resolution for the velocity data was 12 h, which corresponds to 1810 data frames for each u and v velocity component. The final matrix dimensions were of 1810 × 26 × 29 × 36 for each component.

3.2. Transform Model

The deep learning network of choice for our Transform Model is the U-shaped Convolution Neural Network (CNN), also known as U-Net [45]. The typical use of convolutional networks is in classification tasks, where the output to an image is a single class label. However, in many visual tasks, especially in biomedical image processing, the desired output should include localization, i.e., a class label is supposed to be assigned to each pixel. The architecture of this network consists of two parts. The first part is the analysis path called the encoder, which is similar to the CNN architecture that is widely used for image feature extraction. The second part is the synthesis path called the decoder, where the compact features are expanded to the input dimension. In the image case, the feature set, or code, is decoded back to a (segmented) image of the same dimension as the input image. The name of U-Net is based on the U structure of the network. This type of network has been widely used in medical image segmentation applications [46,47], for land segmentation studies in remote sensing images [48], for cloud and shadow segmentation [49], for modeling and prediction of coastal weather events in the Netherlands [50], and for coastal wetland classification [51], among many other applications.

For our application, the U-Net was modified at the output layers to make it suitable for a regression problem instead of the original structure that was designed for image segmentation. The problem can be seen as a time series transformation through a regression analysis. Hence, the last layers after the convolution layers have been replaced by regression layers, instead of a Softmax and a segmentation layer in the original structure. The regression layer is a type of neural network layer used for predicting continuous values, as opposed to classification or segmentation. It is typically used in machine learning tasks where the goal is to predict a numerical value or a set of numerical values. The output value can be a scalar, a vector, or a matrix, depending on the specific task and architecture of the neural network. Generally, the choice of layer architecture is determined by the particular activity and data that are being utilized, and there is no solution that is universally applicable to all situations. Because the U-Net will learn both the spatial and temporal features of the training data, it is called hereafter the Spatio-Temporal U-Net (STU-Net). The final model structure consists of four stages of encoders, as shown in Figure 2. The input is made of a four-dimensional input layer, which is organized as

x, y, u / v, t

. Here,

x, y

are the coordinates of the sample location,

u / v

are the zonal or meridional velocities, respectively, and t denotes the time of the measurements. The output of the model is the transformed velocity field time series, thus the same variables as the input. In the vertical direction, a Transform Model is created for each depth layer.

The Transform Model is the outcome of the training phase, where both the numerical model and the observations are fed to the STU-Net. The relationship between the measurements and the numerical model is represented by the neural network structure, including its hyper parameters and weights. However, in order to apply bias corrections both forward and backward in time, a Transform Model is created for each temporal direction. During the training phase, the errors between the two fields are used to adjust the weights of the Transform Model until it converges.

For the experiments conducted in this study, the hyperparameters were set by trial and error to be as follows: InitialLearnRate =

5 \times 10^{- 4}

, MiniBatchSize = 4, MaxEpochs = 300, LearnRateDropFactor = 0.1. The loss function utilized for the regression output is the Root Mean Squared Error (RMSE), and an Adam optimizer was used in the training phase. We arbitrarily set the limit of the overall loss to 0.1 and Mini-batch RMSE to 0.5, for which only 120 epochs were necessary. Traditionally, the model is left to reach a plateau that determines the loss value. However, in our case, the congruence between the transformed field of the numerical model and the observations (Dynloop) was considered adequate with a loss value set at 0.1. At the end of the training stage, the obtained regression layers constitute the Transform Model that will be used for bias corrections. This model was implemented with the Matlab Deep Learning Toolbox (R2021b). The same loss value and hyperparameters were used to create the transformer of each numerical model.

3.3. Experimental Strategy and Input Data Formatting

In this section, the strategy to evaluate our Transform Model concept is presented. First, the short-term transformation of the HYCOM and MITgcm-GoM velocity fields by the in-situ observations is assessed. The training is conducted with the first 855 days and the transformation results are evaluated with the remaining 50 days of measurements. The purpose of this experiment is to assess the transformative potential of limited observation periods and the efficacy on assimilated and non-assimilated numerical model fields, respectively. Second, for the long-term transformation capability of the concept, longer-term time series are required. In this case, HYCOM was used as virtual observations to transform the MITgcm-GoM velocity fields. The training took place over a period of three years and the testing period was either the preceding or following year of the training period. The relatively long training period over the testing period warrants that a large diversity of relationships are learned by the model, which prevents overfitting [52]. For the short-term Transform Model, the period March 2009–June 2011 was used for training and validation, and July–August 2011 was used for testing. For the long-term Transform Model, the years 2009–2011 (2012–2010) were used for training and validation, and 2012 (2009) was used for testing in the forward (backward) experiment. Although a longer training period could have been selected, the three years was found to be sufficient to enable acceptable bias corrections up to one year before or after the training period, as shown in the results section. In the remainder of this study, we refer to a bias-corrected field as a transformed field—the product of the Transform Model.

Both model fields and observations were interpolated on the same spatial and temporal grids. Therefore, the time interval was set to daily, i.e., the coarsest temporal resolution of the three time series. Because the spatial resolution of the three datasets differed over the geographical area of the measurements,

29 \times 36

for the Dynloop data,

64 \times 88

for HYCOM and

52 \times 70

for the MITgcm-GoM simulations, all three data sets were interpolated to a

64 \times 64

grid using a bicubic interpolation [53]. Vertically, the same depth layers as in the Dynloop dataset were used to reinterpolate the other two model fields, which resulted in 26 layers of 20 m intervals from the surface to 500 m (maximum depth used in this study). The interpolated fields were checked for spatial consistency with the original fields, and increasing the temporal resolution to twice daily (original temporal resolution of the Dynloop dataset) did not change the performance of the bias correction.

3.4. Bias Correction Evaluation Metrics

To evaluate the bias correction of the numerical model field by the Transform Model, we applied metrics that are commonly used in the field of physical oceanography to compare the output of different numerical models, as done, for example, in [29,54]. The time series of the spatial Correlation Coefficients (CCs) and Root Mean Squared Errors (RMSEs) between the “observed” and predicted fields were calculated. The observed field is either the in-situ observations for the short-term transformations or HYCOM—virtual observations—for the long-term transformations. The degree of transformation, also called correction gain, can be estimated and was calculated as follows:

G a i n = \frac{| M_{t r a n s f o r m e d} - M_{m o d e l} |}{M_{m o d e l}} \times 100

(1)

where

M_{m o d e l} = \frac{\sum_{n = 1}^{T} R M S E_{1} (n)}{T}

,

M_{t r a n s f o r m e d} = \frac{\sum_{n = 1}^{T} R M S E_{2} (n)}{T}

, T is the length of the time series,

R M S E_{1}

is the RMSE between the model and the observed data, and the

R M S E_{2}

is the RMSE between the transformed field and observed data. A high (low) gain signifies that the transformed model field errors are low (high).

Taylor diagrams [55] were used as an assessment method that both quantifies various aspects of model performance and visually summarizes these aspects within compact diagrams. The Taylor diagrams used herein are applied to the comparison of the two-dimensional fields. They capture the CC, the Root Mean Square Difference (RMSD), the so-called RMSE, and the standard deviation between the models and the in-situ observations, which are the reference fields.

In order to capture whether the dynamical field is properly resolved in the corrected model fields, we applied an EOF decomposition, which is a form of principal component analysis, for spatio-temporal data [56]. The data are decomposed on orthogonal spatial modes, whose net response as a function of time accounts for the combined variance in all of the modes. The first modes usually account for most of the variance in the LC system [29] and are used to demonstrate that the corrected fields contain the same modes as the observations. The EOF decomposition is applied to the 50-day testing period of the short-term bias correction and the first 120 days of the testing period in the long-term bias correction experiment.

4. Results

4.1. Comparison between Observed and Modeled Fields

To evaluate how different the three concurrent field times series are, we calculated the Taylor diagrams of the spatially averaged velocity magnitudes for each of them at four depths, i.e., 0, 20, 100, and 500 m, respectively (Figure 3). Because the MITgcm-GoM is a free-run, the model velocity exhibits the lowest correlation coefficient (CC) and largest RMSE and standard deviation at all depths. While the assimilated HYCOM model exhibits a lower standard deviation than the MITgcm-GoM, the RMSE values and difference between the two models are similar at 0, 20 and 100 m but not at 500 m (Figure 3d). A large RMSE signifies that the amplitude of the variation in the velocity components is overestimated. At 500 m, both models’ RMSE is an order of magnitude lower than at shallower depths, which is explained by the weaker velocities at that depth. HYCOM’s CC is lower at 500 m than at shallower depths, suggesting that the data assimilation at 500 m might be less efficient than in shallower waters.

4.2. Evaluation of the Bias Correction by the Short-Term Transform Model

Surface observations are, in general, more commonly available than subsurface observations, such as three-dimensional arrays. Therefore, predicting subsurface dynamics has remained more challenging than at the ocean surface because of the lack of continuous sampling at the same location. Vertical projections of the surface corrections in data assimilation models are still reliant on the methods described in [38,57]. The transformation of three-dimensional fields is, therefore, critical to full water column prediction. For this evaluation, two types of experiments were conducted. In the first one, the transformation was only applied to the two-dimensional surface velocity, and in the second experiment, the transformation was applied to a three-dimensional tensor of the velocity field. The efficacy of the transformation was evaluated by calculating the RMSE between the transform model solution and the reference, which are the observations in this case, over the last 50 days of the observational period not used in the training phase. The same evaluation was performed for the original numerical model velocity field.

4.2.1. Two-Dimensional Field Bias Correction

The results of the RMSE for both the original numerical model and the transformed model fields are shown in Figure 4 for both HYCOM and MITgcm-GoM. Figure 4a shows a significant reduction in the HYCOM model velocity RMSE, both in the mean and in the variability. The RMSE of the transformed HYCOM field remained almost constant throughout the 50-day period, unlike the original HYCOM field, with a slight increase toward the end of the period. Figure 4b shows that the transformation of the MITgcm-GoM fields was as efficient as for the HYCOM fields in terms of reduction in the RMSE but the RMSE of the transformed fields increased over time. HYCOM fields’ RMSE, although quite variable, did not exhibit an increasing trend over time as the MITgcm-GoM fields did. This difference is most likely due to the fact that data were assimilated in the HYCOM simulation and not in the MITgcm-GoM model. The alignment of the dynamical features between the observations and the numerical model is shown by the increased CC of the transformed model for both HYCOM and MITgcm-GOM (Figure 4c,d). However, for the latter, the CC of the transformed model exhibits larger variations than the CC of the transformed HYCOM due to the larger deviation of the MITgcm-GOM dynamics from the observations.

The transformation gain is shown in Table 1. It was higher for the HYCOM model than for the MITgcm-GoM field. However, the non-assimilated model transformation exhibited an RMSE < 0.4 m.s⁻¹ for up to 30 days, which is less than the RMSE of the assimilated model fields (Figure 4b). This result suggests that the correction of the flow field with in-situ observations by the Transform Model can be as effective as, if not better than, the data assimilation method currently used in the HYCOM operational forecast. This result is significant because the transformation was made outside of the observation period. In terms of the spatial structure of the flow field variability, the transformation is also able to carry through the spatial modes contained in the observed field, as shown in Figure 5. The efficacy of the transformation is stronger for the HYCOM than for the MITgcm-GoM field, which also shows that there is room for improvement in the data assimilation process, whether in the NCODA system or by applying our Transform Model.

4.2.2. Three-Dimensional Field Bias Correction

For the three-dimensional velocity field transformation, a Transform Model was created for each depth level between the numerical model and the observations, and the transformation was applied to the last 50 days not included in the training.

The results in terms of the RMSE and correlation coefficient (CC) for each depth level for the transformed HYCOM model over the 50-day period are shown in Figure 6. As seen in the two-dimensional case, the transformation efficacy is significant. The RMSE is reduced by a factor of 2.5 from the surface down to 200 m and by half at the 500 m level (Figure 6a,b). The HYCOM field’s RMSE decreases with depth whereas the gain of the transformation only changes by a factor of 9% (4%) for the zonal (meridional) velocity between 0 and 500 m (Table 1). While the RMSE of the HYCOM zonal velocity field is higher than the one of the meridional velocity field, when transformed, the resulting RMSEs of each component are nearly identical. The CC of the zonal field is much less than the CC of the meridional field across all levels (Figure 6c,d). The difference persists in the transformed fields but is strongly reduced. In addition, the CC of the transformed fields is larger than the CC of the original fields and varies less vertically as well. The difference between zonal and meridional velocity CC is not surprising since the current meridional velocities dominate the flow pattern of the LC.

For the MITgcm-GoM three-dimensional transformed velocity fields, the RMSE is reduced by a factor of 1.5 at the surface to about 1 at 200 m and by 1.5 at the 500 m level (Figure 7a,b). For the CC, the MITgcm-GoM original model shows a very small correlation with the observations. The transformed fields, however, show an increase in the meridional velocity by a factor of sixteen (Figure 7c,d), which is relatively constant over depth, in contrast with the CC of the transformed zonal velocity, which is not as uniformly improved; although, the correction can reach a factor of fifteen near 100 and 350 m. The 50-day gain is, therefore, less for the MITgcm-GoM overall, as shown by Table 1, but the correction factor is much higher for the latter than for HYCOM. The RMSE of the transformed MITgcm-GoM fields (red lines in Figure 7a,b) is similar to the HYCOM’s original RMSE (blue lines in Figure 6a,b). From a correlation standpoint, there is a similar improvement, in particular for the meridional velocity (red lines in Figure 7d vs. blue lines in Figure 6d) and to a lesser degree for the zonal velocity. This shows that our method is at least as efficient at correcting the free-run model with just the observed velocity field as the NCODA is with all the different data sources used for HYCOM. The latter, on the other hand, requires less correction by the Transform Model, but the improvements by the Transform Model can still be significant, especially near the surface.

While the statistical metrics such as RMSE and CC may indicate a good agreement between the quantities compared, the spatial features that comprise the system may look dissimilar. Therefore, we show here the efficacy of the transformation in the three-dimensional structures of the transformed field on day 50 after the end of the observation period (Figure 8 and Figure 9). For the HYCOM model, the vertical and horizontal flow features in the observations are present in the transformed model field, showing similarities in magnitude and location in space. The presence of these features and their timing is critical to the development of the instabilities that lead to the growth of the frontal cyclones near the surface and then in the deep waters and ultimately to the separation of a LC eddy [43,44,58,59]. The three-dimensional transformation confirms the results seen in the two-dimensional transformation. The Transform Model is capable of extending the correction of observations in HYCOM at least 50 days past the last measurement. For the MITgcm-GoM model, the flow field difference is quite significant, as shown by Figure 9. The transformed flow field is shifted toward the path of the observed flow, both horizontally and vertically, although the magnitude of the transformed flow is reduced from the observations.

4.3. Evaluation of the Bias Correction by the Long-Term Transform Model

For this evaluation, HYCOM is used for virtual observations to transform the MITgcm-GoM surface velocity field. In the forward transformation experiment, the transformation was applied for up to 365 days after the virtual observations ended. As seen in the short-term transformation, the RMSE of the transformed field is reduced by up to a factor of four and exhibits less temporal variability than the MITgcm-GoM RMSE (Figure 10a,b). The RMSE of the transformed field varies over time along with the evolution of the LC system, whose dynamics can affect the efficacy of the transformation. Indeed between days 120 and 160, the LC evolution between the HYCOM and the MITgcm-GoM significantly diverged. While the eddy shed and reattached in the HYCOM flow, the LC remained in its extended position in the MITgcm-GoM model. This event affected the capacity of the Transform Model to correct the divergent dynamics, which increased the RMSE of the transformed model. Nonetheless, the transformation is able to strongly reduce the MITcgm-GoM velocity field RMSE even 300 days after the virtual observation ended, which reveals the long-lasting potential of the transformation process. In terms of velocity field variability, the CC shown in Figure 10c confirms the skill of the transformer at improving the flow field features, especially when the HYCOM and MITgcm-GoM diverge the most. After 300 days, the CC of the transformed field is less than the one of the MITgcm-GoM although the RMSE of the transformed field is much smaller.

The backward transformation also shows a decrease in the RMSE of the transformed MITgcm-GoM velocity fields for most parts except during periods when the original numerical model RMSE is also low. In this case, both MITgcm-GoM and HYCOM circulation features exhibit dynamical similarities. In addition, the transformed RMSE is less variable than the original one. The CC of the transformed field is also improved over the CC of the MITgcm-GoM field and is similar to the one of the MITgcm-GoM field when the latter improves (days 100–150). As in the forward transformation, the transformed field CC becomes less than the MITgcm-GoM field CC toward the end of the time window. The error field variance is, however, strongly reduced by the transformation.

Although the transformation could lead to unrealistic features after several months, the EOF analysis of the velocity field reveals that the modal structure of the virtual observations is present in the transformed field, both in the forward and backward transformations (Figure 11). The transformation is also able to convey the timeliness of the separation of the LC eddy (Figure 12) in the forward transformation, which is paramount to the generation of reliable and useful predictions of the LC. In the backward direction, the Transform Model is capable of readjusting the LC state in the MITgcm-GoM to closely follow the state of the LC in the virtual observations (Figure 13).

5. Conclusions

In this study, we have presented a new method based on deep learning that enables the extension of the bias correction of a numerical ocean model velocity field up to one year after the observations ended. We have focused on ocean observations, but we believe that our approach could be applied to any biogeochemical or physical bias correction exercise, as long as the observations and model output are concurrently available.

Our approach is based on the re-purpose of an existing deep learning model, called U-Net, designed specifically for image segmentation analysis in the biomedical field. We have modified the network in such a way that its original segmentation layer is replaced by a regression layer. That layer is used to create a Transform Model that retains the temporal and spatial evolution of the differences between the model and observations to produce a correction in the form of regression weights that evolves spatially and temporally with the model both forward and backward in time, beyond the observation period. The results shown in this study also reveal the efficacy of the Transform Model at correcting the three-dimensional flow field, even on existing reanalyses, such as the HYCOM data used herein. Finally, with the increased availability of surface current data from high-frequency (HF) radars, a Transform Model could be easily created for any numerical model with just a few years of concurrent model data and observations. Such a model, which can be updated daily with new data, would provide useful corrections to the model’s surface flow field that would be transferred vertically to other variables, either by the numerical model itself or through a similar type of AI technique as the one described here.

Overall, our U-Net application has shown promising results in the numerical ocean model’s bias correction of the velocity field. Most bias corrections are designed for SST or SSH, and even sea surface salinity, but none for the velocity field. The reason for this is the lack of systematic in-situ current measurements. Despite the increase in collection efforts and duration because of the climate study needs, ocean full water column measurement arrays remain scarce and very expensive to operate. However, another measurement array in the GoM, funded by the National Academies of Science, Engineering and Medicine under the Understanding Gulf Ocean Systems initiative (UGOS), was deployed from June 2019 to May 2021 and expanded to the north, west, and south beyond the DynLoop array used in this study. Although not used in this study, this new GoM array could be used to train another STU-Net transformer to conduct bias correction of the velocity in numerical simulations of the LC system before, during and after that period. Other longer-term arrays have been deployed or are currently active as part of the international program such as the Global Tropical Moored Array [60]. Such arrays, because of their duration, offer opportunities for the application of transformers like ours that can virtually expand their longevity. There is, nonetheless, an effort to provide real-time surface current information in various areas along the coastline, which is happening in various regions around the world. However, the footprint is limited to about 200 km from shore [61]. As long-term deployments become more available and measurements more reliable, our Transformer method could be used to correct regional numerical current predictions that exhibit significant challenges in coastal areas [62].

Author Contributions

Conceptualization, A.M.A. and H.Z.; methodology, A.M.A. and H.Z.; software, A.M.A.; validation, A.M.A., Y.H., A.K.I. and A.S.A.; formal analysis, A.M.A.; investigation, A.M.A.; resources, H.Z. and L.M.C.; data curation, L.M.C.; writing—original draft preparation, A.M.A., H.Z. and L.M.C.; writing—review and editing, L.M.C.; visualization, A.M.A.; supervision, H.Z. and L.M.C.; project administration, L.M.C.; funding acquisition, L.M.C. and H.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This study was supported by a grant from the National Academy of Science/United States (NAS/GRP 2000011052)/Understanding Gulf Ocean Systems II to L.M.C. and H.Z.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The datasets analyzed during the current study are available publicly, and the numerical model data set was obtained from the HYbrid Coordinate Ocean Model (HYCOM) Consortium / the HYCOM + NCODA Gulf of Mexico 1/25° Reanalysis. The data are publicly available from the website https://www.hycom.org/data/gomu0pt04/expt-50pt1, accessed on 18 November 2022. The free-running data generated based on the MITgcm model are publicly available online from the Scripps Institute of Oceanography and were downloaded from http://www.ecco.ucsd.edu/gom_results2.html, accessed on 18 November 2022.

Acknowledgments

The authors are thankful to Kathleen Donohue and her team at the University of Rhode Island for providing the velocity field from the Dynloop dataset, which made this study possible.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Pozo Buil, M.; Fiechter, J.; Jacox, M.G.; Bograd, S.J.; Alexander, M.A. Evaluation of Different Bias Correction Methods for Dynamical Downscaled Future Projections of the California Current Upwelling System. Earth Space Sci. 2023, 10, e2023EA003121. [Google Scholar] [CrossRef]
Vannitsem, S. Dynamical Properties of MOS Forecasts: Analysis of the ECMWF Operational Forecasting System. Weather Forecast. 2008, 23, 1032–1043. [Google Scholar] [CrossRef]
Tian, D.; Martinez, C.J.; Graham, W.D.; Hwang, S. Statistical Downscaling Multimodel Forecasts for Seasonal Precipitation and Surface Temperature over the Southeastern United States. J. Clim. 2014, 27, 8384–8411. [Google Scholar] [CrossRef]
Libonati, R.; Trigo, I.; DaCamara, C.C. Correction of 2 m-temperature forecasts using Kalman Filtering technique. Atmos. Res. 2008, 87, 183–197. [Google Scholar] [CrossRef]
Pelosi, A.; Medina, H.; den Bergh, J.V.; Vannitsem, S.; Chirico, G.B. Adaptive Kalman Filtering for Postprocessing Ensemble Numerical Weather Predictions. Mon. Weather Rev. 2017, 145, 4837–4854. [Google Scholar] [CrossRef]
Wang, J.; Chen, C.; Long, K.; Feng, L.; Research, M. Temporal and spatial distribution of short-time heavy rain of Sichuan Basin in summer. Plateau Mt. Meteorol. Res. 2015, 35, 16–20. [Google Scholar]
Chepurin, G.A.; Carton, J.A.; Dee, D. Forecast Model Bias Correction in Ocean Data Assimilation. Mon. Weather Rev. 2005, 133, 1328–1342. [Google Scholar] [CrossRef]
Mirouze, I.; Rémy, E.; Lellouche, J.M.; Martin, M.J.; Donlon, C.J. Impact of assimilating satellite surface velocity observations in the Mercator Ocean International analysis and forecasting global 1/4^∘ system. Front. Mar. Sci. 2024, 11, 1376999. [Google Scholar] [CrossRef]
Jäger, J.; Ferguson, H.L. Climate Change: Science, Impacts and Policy; Cambridge University Press: Cambridge, UK, 1991. [Google Scholar]
Snowden, J.; Hernandez, D.; Quintrell, J.; Harper, A.; Morrison, R.; Morell, J.; Leonard, L. The U.S. Integrated Ocean Observing System: Governance Milestones and Lessons From Two Decades of Growth. Front. Mar. Sci. 2019, 6, 242. [Google Scholar] [CrossRef]
Trice, A.; Robbins, C.; Philip, N.; Rumsey, M. Challenges and Opportunities for Ocean Data to Advance Conservation and Management; Ocean Conservancy: Washington, DC, USA, 2021. [Google Scholar]
Boehme, L.; Rosso, I. Classifying Oceanographic Structures in the Amundsen Sea, Antarctica. Geophys. Res. Lett. 2021, 48, e2020GL089412. [Google Scholar] [CrossRef]
Maze, G.; Mercier, H.; Fablet, R.; Tandeo, P.; Lopez Radcenco, M.; Lenca, P.; Feucher, C.; Le Goff, C. Coherent heat patterns revealed by unsupervised classification of Argo temperature profiles in the North Atlantic Ocean. Prog. Oceanogr. 2017, 151, 275–292. [Google Scholar] [CrossRef]
Houghton, I.A.; Wilson, J.D. El Niño Detection Via Unsupervised Clustering of Argo Temperature Profiles. J. Geophys. Res. Ocean. 2020, 125, e2019JC015947. [Google Scholar] [CrossRef]
Callaham, J.L.; Koch, J.V.; Brunton, B.W.; Kutz, J.N.; Brunton, S.L. Learning dominant physical processes with data-driven balance models. Nat. Commun. 2021, 12, 1016. [Google Scholar] [CrossRef] [PubMed]
Pauthenet, E.; Roquet, F.; Madec, G.; Sallée, J.B.; Nerini, D. The Thermohaline Modes of the Global Ocean. J. Phys. Oceanogr. 2019, 49, 2535–2552. [Google Scholar] [CrossRef]
Jones, D.C.; Holt, H.J.; Meijers, A.J.S.; Shuckburgh, E. Unsupervised Clustering of Southern Ocean Argo Float Temperature Profiles. J. Geophys. Res. Ocean. 2019, 124, 390–402. [Google Scholar] [CrossRef]
Han, G.; Zhou, J.; Shao, Q.; Li, W.; Li, C.; Wu, X.; Cao, L.; Wu, H.; Li, Y.; Zhou, G. Bias correction of sea surface temperature retrospective forecasts in the South China Sea. Acta Oceanol. Sin. 2022, 41, 41–50. [Google Scholar] [CrossRef]
Liu, G.; Bracco, A.; Brajard, J. Systematic Bias Correction in Ocean Mesoscale Forecasting Using Machine Learning. J. Adv. Model. Earth Syst. 2023, 15, e2022MS003426. [Google Scholar] [CrossRef]
Yang, Z.; Liu, J.; Yang, C.Y.; Hu, Y. Correcting Nonstationary Sea Surface Temperature Bias in NCEP CFSv2 Using Ensemble-Based Neural Networks. J. Atmos. Ocean. Technol. 2023, 40, 885–896. [Google Scholar] [CrossRef]
Choi, Y.; Park, Y.; Hwang, J.; Jeong, K.; Kim, E. Improving Ocean Forecasting Using Deep Learning and Numerical Model Integration. J. Mar. Sci. Eng. 2022, 10, 450. [Google Scholar] [CrossRef]
Alvera-Azcárate, A.; Barth, A.; Weisberg, R.H. The Surface Circulation of the Caribbean Sea and the Gulf of Mexico as Inferred from Satellite Altimetry. J. Phys. Oceanogr. 2009, 39, 640–657. [Google Scholar] [CrossRef]
Fei, T.; Huang, B.; Wang, X.; Zhu, J.; Chen, Y.; Wang, H.; Zhang, W. A Hybrid Deep Learning Model for the Bias Correction of SST Numerical Forecast Products Using Satellite Data. Remote Sens. 2022, 14, 1339. [Google Scholar] [CrossRef]
Hamilton, P.; Lugo-Fernández, A.; Sheinbaum, J. A Loop Current experiment: Field and remote measurements. Dyn. Atmos. Ocean. 2016, 76, 156–173. [Google Scholar] [CrossRef]
Walker, N. Coauthors, 2011: Impacts of Loop Current frontal cyclonic eddies and wind forcing on the 2010 Gulf of Mexico oil spill. Monitoring and Modeling the Deepwater Horizon Oil Spill: A Record-Breaking Enterprise. Geophys. Monogr. 2011, 195, 103–116. [Google Scholar]
Jaimes, B.; Shay, L.K.; Brewster, J.K. Observed air-sea interactions in tropical cyclone Isaac over Loop Current mesoscale eddy features. Dyn. Atmos. Ocean. 2016, 76, 306–324. [Google Scholar] [CrossRef]
Oey, L.Y.; Ezer, T.; Wang, D.P.; Fan, S.J.; Yin, X.Q. Loop current warming by Hurricane Wilma. Geophys. Res. Lett. 2006, 33, 1–4. [Google Scholar] [CrossRef]
Kaiser, M.J.; Pulsipher, A.G. The potential value of improved ocean observation systems in the Gulf of Mexico. Mar. Policy 2004, 28, 469–489. [Google Scholar] [CrossRef]
Wang, J.L.; Zhuang, H.; Chérubin, L.M.; Ibrahim, A.K.; Muhamed Ali, A. Medium-Term Forecasting of Loop Current eddy Cameron and eddy Darwin formation in the Gulf of Mexico with a Divide-and-Conquer Machine Learning Approach. J. Geophys. Res. Ocean. 2019, 124, 5586–5606. [Google Scholar] [CrossRef]
Wang, J.L.; Zhuang, H.; Chérubin, L.; Muhamed Ali, A.; Ibrahim, A. Loop Current SSH Forecasting: A New Domain Partitioning Approach for a Machine Learning Model. Forecasting 2021, 3, 570–579. [Google Scholar] [CrossRef]
Muhamed Ali, A.; Zhuang, H.; VanZwieten, J.; Ibrahim, A.K.; Chérubin, L. A Deep Learning Model for Forecasting Velocity Structures of the Loop Current System in the Gulf of Mexico. Forecasting 2021, 3, 934–953. [Google Scholar] [CrossRef]
Huang, Y.; Tang, Y.; Zhuang, H.; VanZwieten, J.; Cherubin, L. Physics-Informed Tensor-Train ConvLSTM for Volumetric Velocity Forecasting of the Loop Current. Front. Artif. Intell. 2021, 4, 780271. [Google Scholar] [CrossRef]
LeCun, Y.; Bengio, Y.; Hinton, G. Deep learning. Nature 2015, 521, 436–444. [Google Scholar] [CrossRef] [PubMed]
Moore, A.M.; Martin, M.J.; Akella, S.; Arango, H.G.; Balmaseda, M.; Bertino, L.; Ciavatta, S.; Cornuelle, B.; Cummings, J.; Frolov, S.; et al. Synthesis of Ocean Observations Using Data Assimilation for Operational, Real-Time and Reanalysis Systems: A More Complete Picture of the State of the Ocean. Front. Mar. Sci. 2019, 6, 90. [Google Scholar] [CrossRef]
Cooper, C.; Danmeier, D.; Frolov, S.; Stuart, G.; Zuckerman, S.; Anderson, S.; Sharma, N. Real Time Observing and Forecasting of Loop Currents in 2015. In Proceedings of the Offshore Technology Conference, Houston, TX, USA, 2–5 May 2016. [Google Scholar] [CrossRef]
Cummings, J.A. Operational multivariate ocean data assimilation. Q. J. R. Meteorol. Soc. 2005, 131, 3583–3604. [Google Scholar] [CrossRef]
Cummings, J.A.; Smedstad, O.M. Variational Data Assimilation for the Global Ocean; Springer: Berlin/Heidelberg, Germany, 2013; pp. 303–343. [Google Scholar]
Fox, D.N.; Teague, W.J.; Barron, C.N.; Carnes, M.R.; Lee, C.M. The Modular Ocean Data Assimilation System (MODAS). J. Atmos. Ocean. Technol. 2002, 19, 240–252. [Google Scholar] [CrossRef]
Saha, S.; Moorthi, S.; Pan, H.L.; Wu, X.; Wang, J.; Nadiga, S.; Tripp, P.; Kistler, R.; Woollen, J.; Behringer, D.; et al. The NCEP Climate Forecast System Reanalysis. Bull. Am. Meteorol. Soc. 2010, 91, 1015–1058. [Google Scholar] [CrossRef]
Kalnay, E.; Kanamitsu, M.; Kistler, R.; Collins, W.; Deaven, D.; Gandin, L.; Iredell, M.; Saha, S.; White, G.; Woollen, J.; et al. The NCEP/NCAR 40-Year Reanalysis Project. Bull. Am. Meteorol. Soc. 1996, 77, 437–472. [Google Scholar] [CrossRef]
Gopalakrishnan, G.; Cornuelle, B.D.; Hoteit, I. Adjoint sensitivity studies of loop current and eddy shedding in the Gulf of Mexico. J. Geophys. Res. Ocean. 2013, 118, 3315–3335. [Google Scholar] [CrossRef]
Morey, S.L.; Gopalakrishnan, G.; Sanz, E.P.; de Souza, J.M.A.C.; Donohue, K.A.; Pérez-Brunius, P.; Dukhovskoy, D.S.; Chassignet, E.P.; Cornuelle, B.D.; Bower, A.S.; et al. Assessment of Numerical Simulations of Deep Circulation and Variability in the Gulf of Mexico Using Recent Observations. J. Phys. Oceanogr. 2020, 50, 1045–1064. [Google Scholar] [CrossRef]
Donohue, K.; Watts, D.; Hamilton, P.; Leben, R.; Kennelly, M.; Lugo-Fernández, A. Gulf of Mexico loop current path variability. Dyn. Atmos. Ocean. 2016, 76, 174–194. [Google Scholar] [CrossRef]
Donohue, K.; Watts, D.; Hamilton, P.; Leben, R.; Kennelly, M. Loop Current Eddy formation and baroclinic instability. Dyn. Atmos. Ocean. 2016, 76, 195–216. [Google Scholar] [CrossRef]
Ronneberger, O.; Fischer, P.; Brox, T. U-net: Convolutional networks for biomedical image segmentation. In Proceedings of the International Conference on Medical Image Computing and Computer-Assisted Intervention, Munich, Germany, 5–9 October 2015; pp. 234–241. [Google Scholar]
Siddique, N.; Sidike, P.; Elkin, C.; Devabhaktuni, V. U-Net and its variants for medical image segmentation: Theory and applications. arXiv 2020, arXiv:2011.01118. [Google Scholar] [CrossRef]
Chacón, R.; Neila, P.; Salzmann, M.; Fua, P. A domain-adaptive two-stream U-net for electron microscopy image segmentation. In Proceedings of the 15th IEEE International Symposium Biomedical Imaging, number CONF, Washington, DC, USA, 4–7 April 2018. [Google Scholar]
Li, R.; Liu, W.; Yang, L.; Sun, S.; Hu, W.; Zhang, F.; Li, W. Deepunet: A deep fully convolutional network for pixel-level sea-land segmentation. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2018, 11, 3954–3962. [Google Scholar] [CrossRef]
Jiao, L.; Huo, L.; Hu, C.; Tang, P. Refined unet: Unet-based refinement network for cloud and shadow precise segmentation. Remote Sens. 2020, 12, 2001. [Google Scholar] [CrossRef]
Fernández, J.G.; Abdellaoui, I.A.; Mehrkanoon, S. Deep coastal sea elements forecasting using U-Net based models. arXiv 2020, arXiv:2011.03303. [Google Scholar]
Dang, K.B.; Nguyen, M.H.; Nguyen, D.A.; Phan, T.T.H.; Giang, T.L.; Pham, H.H.; Nguyen, T.N.; Tran, T.T.V.; Bui, D.T. Coastal Wetland Classification with Deep U-Net Convolutional Networks and Sentinel-2 Imagery: A Case Study at the Tien Yen Estuary of Vietnam. Remote Sens. 2020, 12, 3270. [Google Scholar] [CrossRef]
Shaikhina, T.; Khovanova, N.A. Handling limited datasets with neural networks in medical applications: A small-data approach. Artif. Intell. Med. 2017, 75, 51–63. [Google Scholar] [CrossRef]
Keys, R. Cubic convolution interpolation for digital image processing. IEEE Trans. Acoust. Speech Signal Process. 1981, 29, 1153–1160. [Google Scholar] [CrossRef]
Muhamed Ali, A.; Zhuang, H.; Ibrahim, A.K.; Wang, J.L.; Chérubin, L.M. Deep learning prediction of two-dimensional ocean dynamics with wavelet-compressed data. Front. Artif. Intell. 2022, 5, 923932. [Google Scholar] [CrossRef]
Jolliff, J.K.; Kindle, J.C.; Shulman, I.; Penta, B.; Friedrichs, M.A.; Helber, R.; Arnone, R.A. Summary diagrams for coupled hydrodynamic-ecosystem model skill assessment. J. Mar. Syst. 2009, 76, 64–82. [Google Scholar] [CrossRef]
Thomson, R.E.; Emery, W.J. Data Analysis Methods in Physical Oceanography, 3rd ed.; Elsevier Science: Amsterdam, The Netherlands, 2014. [Google Scholar] [CrossRef]
Cooper, M.; Haines, K. Altimetric assimilation with water property conservation. J. Geophys. Res. 1996, 101, 1059–1077. [Google Scholar] [CrossRef]
Chérubin, L.M.; Morel, Y.; Chassignet, E.P. Loop Current Ring Shedding: The Formation of Cyclones and the Effect of Topography. J. Phys. Oceanogr. 2006, 36, 569–591. [Google Scholar] [CrossRef]
Chérubin, L.M.; Sturges, W.; Chassignet, E.P. Deep flow variability in the vicinity of the Yucatan Straits from a high-resolution numerical simulation. J. Geophys. Res. Ocean. 2005, 110, C04009. [Google Scholar] [CrossRef]
Speich, S.; Rodrigues, R.R.; Santos, J.; Li, J. Special Issue “Tropical Atlantic Ocean Observing System”. Clivar Exch. 2022, 82, 156–173. [Google Scholar] [CrossRef]
Mantovani, C.; Corgnati, L.; Horstmann, J.; Rubio, A.; Reyes, E.; Quentin, C.; Cosoli, S.; Asensio, J.L.; Mader, J.; Griffa, A. Best Practices on High Frequency Radar Deployment and Operation for Ocean Current Measurement. Front. Mar. Sci. 2020, 7, 210. [Google Scholar] [CrossRef]
Fringer, O.B.; Dawson, C.N.; He, R.; Ralston, D.K.; Zhang, Y.J. The future of coastal and estuarine modeling: Findings from a workshop. Ocean Model 2019, 143, 101458. [Google Scholar] [CrossRef]

Figure 1. (a) Observational array of individual mooring locations and types used in this study. The dates indicate the beginning and end of the measurements. The green contours show the depth in meters. The thick pink line shows the average path of the loop current. (b) Transform model concept with a deep learning approach. Despite the shorter duration of the observations, the overlapping model field can be corrected beyond the observation period.

Figure 2. The STU-Net structure used in this study in which the original SoftMax and segmentation layers were replaced with a regression layer.

Figure 3. Taylor diagrams of the spatially averaged current velocity magnitude for HYCOM and MITgcm-GoM and Dynloop (observations) at depths of (a) 0 m, (b) 20 m, (c) 100 m, and (d) 500 m. The red dot shows the RMSD/RMSE for each model at the same time by the relative position to the dashed green curves, the correlation coefficient by the straight blue dashed lines, the standard deviation on the x or y axis, and the red curve showing the correspondence between the two axes. The red dot on the x axis shows the standard deviation of the in-situ observations.

Figure 4. Surface velocity magnitude Root Mean Squared Error (RMSE) in m.s⁻¹ of the original and the transformed model for (a) the HYCOM model and (b) of the MITgcm-GoM model (note the different vertical axis range), and the correlation coefficient (CC) of the original and the transformed model for (c) the HYCOM model and (d) the MITgcm-GoM model. The shaded regions show the variance of the error field between the original and transformed.

Figure 5. First Empirical Orthogonal Function (EOF) mode of the surface velocity fields over the 50-day testing period of the Transform Model. From left to right is shown the observed, model original, and transformed field. (a) HYCOM and (b) MITgcm-GoM.

Figure 6. Average subsurface velocity metrics between the original and the transformed HYCOM velocity tensor time series. Root Mean Squared Error (RMSE) in m.s⁻¹ for (a) zonal flow and (b) meridional flow, and correlation coefficient (CC) for (c) zonal flow and (d) meridional flow. The shaded regions show the variance of the error field between the original and transformed.

Figure 7. Same as Figure 6 but for the MITgcm-GoM model. Root Mean Squared Error (RMSE) in m.s⁻¹ for (a) zonal flow and (b) meridional flow, and correlation coefficient (CC) for (c) zonal flow and (d) meridional flow. The shaded regions show the variance of the error field between the original and transformed.

Figure 8. Transformed HYCOM three-dimensional velocity structures on day 50 after the end of the observation period. Both the structures inside (a) and on the boundary (b) of the transformed volume are shown. The left, middle and right plots show the original HYCOM, the Dynloop, and the transformed flow fields.

Figure 9. Same as Figure 8 but for the MITgcm-GoM model. Both the structures inside (a) and on the boundary (b) of the transformed volume are shown. The left, middle and right plots show the original MITgcm-GoM, the Dynloop, and the transformed flow fields.

Figure 10. Surface velocity time series evaluation metrics for the original (blue line) and transformed (red line) MITgcm-GoM in the (a) RMSE in the forward direction, (b) RMSE in the backward direction, (c) CC in the forward direction, and (d) CC in the backward direction. The shaded regions show the variance of the error field between the original and transformed MITgcm-GoM.

Figure 11. First Empirical Orthogonal Function (EOF) mode of the surface velocity fields over the first 120 days of the transformed field in the forward (a) and backward (b) direction. From left to right is the observed, the original model, and the transformed field.

Figure 12. Velocity magnitude snapshots of the long-term transformation in the forward direction. The first, second and third columns show the virtual observation field (HYCOM), the MITgcm-GoM original field, and the transformed MITgcm-GoM, respectively. The last two columns show the difference between HYCOM and the transformed fields and between HYCOM and the original MITgcm-GoM fields, respectively. The rows indicate time. Note that no observations have been provided since January 2012.

Figure 13. Same as Figure 12 but in the backward direction.

Table 1. The 50-day average transformation gain (percentage) of the velocity corrections at different depths by applying the Transform Model to HYCOM and MITgcm velocity fields.

Dataset	Zonal	Zonal	Zonal	Meridional	Meridional	Meridional
Depth	0 m	100 m	500 m	0 m	100 m	500 m
HYCOM	$78.85 %$	$76.23 %$	$71.78 %$	$74.80 %$	$69.16 %$	$71.99 %$
MITgcm	$54.88 %$	$43.43 %$	$35.53 %$	$48.35 %$	$21.29 %$	$42.91 %$

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Muhamed Ali, A.; Zhuang, H.; Huang, Y.; Ibrahim, A.K.; Altaher, A.S.; Chérubin, L.M. Ocean Currents Velocity Hindcast and Forecast Bias Correction Using a Deep-Learning Approach. J. Mar. Sci. Eng. 2024, 12, 1680. https://doi.org/10.3390/jmse12091680

AMA Style

Muhamed Ali A, Zhuang H, Huang Y, Ibrahim AK, Altaher AS, Chérubin LM. Ocean Currents Velocity Hindcast and Forecast Bias Correction Using a Deep-Learning Approach. Journal of Marine Science and Engineering. 2024; 12(9):1680. https://doi.org/10.3390/jmse12091680

Chicago/Turabian Style

Muhamed Ali, Ali, Hanqi Zhuang, Yu Huang, Ali K. Ibrahim, Ali Salem Altaher, and Laurent M. Chérubin. 2024. "Ocean Currents Velocity Hindcast and Forecast Bias Correction Using a Deep-Learning Approach" Journal of Marine Science and Engineering 12, no. 9: 1680. https://doi.org/10.3390/jmse12091680

APA Style

Muhamed Ali, A., Zhuang, H., Huang, Y., Ibrahim, A. K., Altaher, A. S., & Chérubin, L. M. (2024). Ocean Currents Velocity Hindcast and Forecast Bias Correction Using a Deep-Learning Approach. Journal of Marine Science and Engineering, 12(9), 1680. https://doi.org/10.3390/jmse12091680

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Ocean Currents Velocity Hindcast and Forecast Bias Correction Using a Deep-Learning Approach

Abstract

1. Introduction

2. Transform Model Concept

3. Methods

3.1. Data Sets

3.2. Transform Model

3.3. Experimental Strategy and Input Data Formatting

3.4. Bias Correction Evaluation Metrics

4. Results

4.1. Comparison between Observed and Modeled Fields

4.2. Evaluation of the Bias Correction by the Short-Term Transform Model

4.2.1. Two-Dimensional Field Bias Correction

4.2.2. Three-Dimensional Field Bias Correction

4.3. Evaluation of the Bias Correction by the Long-Term Transform Model

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI