Prediction of Unstable Hydrodynamic Forces on Submerged Structures under the Water Surface Using a Data-Driven Modeling Approach

: Catastrophic failures of partially or fully submerged structures, e.g., offshore platforms, hydrokinetic turbine blades, bridge decks, etc., due to the dynamic impact of free surface ﬂows such as waves or ﬂoods have revealed the need to evaluate their reliability. In this respect, an accurate estimation of hydrodynamic forces and their relationship to instability in structures is required. The computational ﬂuid dynamics (CFD) solver is known as a powerful tool to identify dynamic characteristics of ﬂow; however, it commonly consumes a huge computational cost, especially in cases of re-simulations needed. In this paper, an efﬁcient surrogate model based on the Gaussian process is developed to rapidly predict the nonlinear hydrodynamic pressure coefﬁcients on submerged bodies near the water surface. For this purpose, a CFD model is ﬁrst developed, which is based on a two-dimensional incompressible Navier–Stokes solver incorporating free surface treatment and turbulent ﬂow models. Then, an experimental design is adopted to generate initial training samples considering the effect of the submerged body shape ratio and ﬂow Re number. Surrogate models of hydrodynamic pressure coefﬁcients and their instability based on Gaussian process modeling are established using the outcome from the CFD simulations, where optimal trend and correlation functions are also investigated. Once surrogate models are obtained, the mean and oscillation amplitudes of hydrodynamic pressure coefﬁcients on a submerged rectangular body, which represents the shape of most civil structures, can be rapidly predicted without the attempt at re-simulation. The ﬁndings can be practically applied in rapidly assessing hydrodynamic forces and their instability of existing submerged civil structures or in designing new structures, where a suitable shape ratio should be adopted to avoid ﬂow-induced instability of hydrodynamic forces


Introduction
Past failures of partially or fully submerged civil structures, e.g., offshore platforms, hydrokinetic turbine blades, bridge decks, etc., due to the dynamic impact caused by floods, waves or tsunamis have revealed the significant need for evaluating the reliability of these structures under hydrodynamic forces [1][2][3].Regarding this issue, one of the challenges is the accurate evaluation of multiphase flows and their impact on submerged structures.This is essential not only for optimum designs of the newly designed structures but also for the estimation of the degree of risk for existing ones.
Large-scale structures under free surface flows commonly require time-consuming experimental analyses.In addition, flow characteristics, e.g., flow patterns, velocities and pressure fields, are uncertain and nonlinear; hence, resulting in a very high cost and time required for many experimental setups in order to observe accurate flow characteristics [4,5].
While analytical models for hydrodynamic forces on submerged bodies have been developed and incorporated into design codes, they are often overestimated and with a degree of error [6][7][8].An alternative is the development of numerical models for free surface flows and their impact on structures; this can reduce much effort in terms of cost and time as compared with experimental models [9].The numerical models are mainly based on computational fluid dynamic (CFD) approaches that solve Navier-Stokes equations along with the treatment of the free surface.However, due to the uncertain flow characteristics, varying with each particular case, the design of these numerical simulations becomes complicated and requires numerous computing resources, especially in cases of the large number of samples considered in a reliability analysis [10,11].
With the development of computer science, besides various computer models which have been adopted in many fields of CFD [12], machine learning techniques have been developed and widely used to predict multiphase flow characteristics [13][14][15], in which, data-driven techniques, e.g., neural network [13], support vector machine [14] and Gaussian process [15], have been implemented to build surrogate models for the prediction of the multiphase flow pattern, as well as the hydrodynamic pressure distribution.These surrogate models, which are used when an outcome of interest cannot be easily measured or computed, can efficiently reduce the computational effort and rapidly estimate flow characteristics in the context of uncertainty treatments and reliability analyses.The first two methods (i.e., neural network and support vector machine) commonly require an adequate dataset for a reliable prediction, which depends on the number of input and output parameters, while the Gaussian process regression makes it possible to predict the model response with little observed data.In addition, the Gaussian process offers a flexible kernel method for regression due to various available trends and correlation functions.Therefore, in many complex problems, this technique is more suitable and efficient in terms of reducing computational costs [11,16].
The objective of this paper is to develop surrogate models, which are based on Gaussian process modeling, to rapidly predict nonlinear hydrodynamic pressure coefficients and their instability effect on submerged bodies.As a case study, rectangular submerged bodies near the water surface are considered.This type of shape is standard and represents the shape of many engineering applications such as bridge decks, offshore platforms and hydrokinetic turbine blades.Firstly, a modeling approach of the flow passing a submerged body is presented based on a two-dimensional incompressible Navier-Stokes solver.The free surface is treated using the volume of fluid method and the effect of the turbulent flow is also considered by using the shear stress transport turbulence model.Then, an appropriate experimental design is used to generate initial training samples considering the effect of the aspect ratio of the submerged body and the Re number of the flow.Surrogate models of hydrodynamic pressure coefficients based on the Gaussian process modeling are established using the outcome from the CFD simulations, where optimal trend and correlation functions are also investigated.Once surrogate models are obtained, the mean and oscillation amplitudes of hydrodynamic pressure coefficients of the free surface flow on a submerged cylinder with an arbitrary aspect ratio can be accurately and rapidly predicted without the need for attempt at re-simulation.The findings from the work can be practically applied in rapidly assessing hydrodynamic force and its instability effect on existing submerged civil structures, or in designing new structures, where a suitable shape ratio range is recommended to avoid the detrimental effects of flow-induced instability from hydrodynamic forces.

Numerical Model of Free Surface Flow
In this study, a two-dimensional (2D) incompressible Reynolds-averaged Navier-Stokes (RANS) homogeneous two-phase mixture model is adopted to simulate the nonlinear interactions of a submerged cylinder beneath the free surface [9,17].The model solves the mixture continuity and momentum equations to obtain the mean flow velocity and pressure fields.The RANS model is closed by including a turbulence model to predict fluctuating velocity components; thus, the shear stress transport k − ω model is adopted.In addition, to capture complex free surface behaviors and non-linear hydrodynamic pressure coefficients, the interface between the water and air phases is numerically treated using the volume-of-fluid (VOF) method [18,19].These equations can be described in the Cartesian, Here, the subscripts i, j = 1, 2 represent two directions of x and y in the computational domain, respectively; t is the computation time; p denotes the pressure, u and u are the mean and the fluctuating velocities; g is the gravitational term; and ρ m and µ m are the mixture density and viscosity, respectively.
The interface position is numerically treated via the phasic volume fraction of the gas phase, α g .Here, the pure gas phase is obtained in the case of α g equals 1.0, while the pure water phase is obtained in the case of α g equals 0.0.The interfaces with a limited thickness between two phases are identified by the values in the range from 0.0 to 1.0.The properties of the mixture phase at the interface are predicted by a function of the volume of fraction of the individual phase, an example, for the calculation of the mixture values for density and mixture valuables where ρ w , µ w and ρ g , µ g are the density and viscosity of the individual water and gas phases, respectively.The numerical discretization of the equation system in the generally structured grid is based on the finite volume method with a pressure-based solver.For time discretization, the first-order implicit method is applied.The first-order upwind scheme is adopted for both the convective and viscous terms and the advection equation is approximated using the implicit compressive scheme.The main reason behind using first-order and implicit schemes is to obtain better convergence than high-order and explicit schemes, especially for strongly deformable free surfaces with breaking wave phenomena.All simulation results are performed using the Ansys Fluent software [20].

Basic Formulations
Gaussian process regression uses a set of observed training data to predict spatially correlated data, which postulates a combination of a functional basis and departure in the following form [21], where the first term is the unknown multivariate polynomial function, f = f j (x (i) ) with j = 1, . . ., p, called the trend, and Z(x) is the realization of the Gaussian process having zero mean and variance σ 2 ; Z(x) is expressed as where σ 2 is the variance of the Gaussian process, whereas R is the correlation function which is the function of the difference x − x and scale parameters ( i > 0, i = 1, . . ., n).
Several correlation functions are proposed, e.g., the exponential (Equation ( 8)), Gaussian (Equation ( 9)), and Matérn-3/2 (Equation (10)), The vector of the prediction Ŷ0 and the true response where f 0 is the vector of regression models evaluated at x (0) , F is the regression matrix, r 0 is the vector of cross-correlations between the point The best linear unbiased predictor of the unknown quantity of interest y 0 is the Gaussian random variate Ŷ0 with mean and variance, where β The maximum likelihood estimation technique is better suited for deriving estimators.Here, the likelihood of the observations y is defined concerning its multivariate normal distribution, which depends on β, σ 2 , and , By maximizing the quantity described in Equation ( 14), the following analytical estimates of β and σ 2 that are functions of are obtained as By substituting these two solutions into Equation ( 14), its corresponding opposite log-likelihood reads ) and thus, the maximum likelihood estimate of is given as ˆ = argmin ψ( ). (18)

Procedure of Surrogate Model-Based Hydrodynamic Pressure Coefficient Prediction
The overall procedure for the development of a Gaussian process based-surrogate model is as follows: (i) In the first step, the most important variables and their distribution functions should be identified.An appropriate DOE is then conducted within the range of interest variables.As a result, several initial training samples are generated and corresponding CFD models are then built.(ii) CFD simulations of the flow field are conducted for each combination of training conditions.The flow field characteristics, as well as hydrodynamic pressure coefficients, are obtained at each simulation.In this study, the mean and oscillation values of hydrodynamic pressure coefficients, i.e., drag, lift and moment, are considered as the model responses.(iii) Once the training dataset has been established based on the DOE and the corresponding model responses, a surrogate model of the model response is built using the Gaussian process modeling incorporated in a Matlab-based software, Uqlab [22].In this step, different trend and correlation functions that compose the Gaussian process are tested.An optimal surrogate model is finally obtained based on the error estimation from the cross-validation.

CFD Simulation and Design of Experiments
In this study, a rectangular shape body fully submerged beneath a free surface is selected, which represents the shape of most bridge deck or other civil structure components under the free surface flow.The computational domain and simulation conditions are adopted following the experimental work by Chu et al. [23], in which the problem of an open channel with a particular size is shown schematically in Figure 1.The rectangular body submerges in the water at a depth of h and a distance between the channel bed and the cylinder S. To reduce the computational cost, a planar symmetric numerical model is used under the main assumption that there are no effects in the spanwise direction.This assumption was also adopted for numerical computations of free surface flows over a submerged body in many studies [9].The boundary conditions are applied as follows: (i) the fixed uniform velocity is specified at the inlet condition, (ii) at the outlet condition, the extrapolation values are applied for the pressure and velocity fields, (iii) at the top boundary condition, open conditions are applied and (iv) at the bottom line and cylinder, non-slip wall conditions are used.
variables.As a result, several initial training samples are generated and corresponding CFD models are then built.(ii) CFD simulations of the flow field are conducted for each combination of training conditions.The flow field characteristics, as well as hydrodynamic pressure coefficients, are obtained at each simulation.In this study, the mean and oscillation values of hydrodynamic pressure coefficients, i.e., drag, lift and moment, are considered as the model responses.(iii) Once the training dataset has been established based on the DOE and the corresponding model responses, a surrogate model of the model response is built using the Gaussian process modeling incorporated in a Matlab-based software, Uqlab [22].In this step, different trend and correlation functions that compose the Gaussian process are tested.An optimal surrogate model is finally obtained based on the error estimation from the cross-validation.

CFD Simulation and Design of Experiments
In this study, a rectangular shape body fully submerged beneath a free surface is selected, which represents the shape of most bridge deck or other civil structure components under the free surface flow.The computational domain and simulation conditions are adopted following the experimental work by Chu et al. [23], in which the problem of an open channel with a particular size is shown schematically in Figure 1.The rectangular body submerges in the water at a depth of ℎ and a distance between the channel bed and the cylinder .To reduce the computational cost, a planar symmetric numerical model is used under the main assumption that there are no effects in the spanwise direction.This assumption was also adopted for numerical computations of free surface flows over a submerged body in many studies [9].The boundary conditions are applied as follows: (i) the fixed uniform velocity is specified at the inlet condition, (ii) at the outlet condition, the extrapolation values are applied for the pressure and velocity fields, (iii) at the top boundary condition, open conditions are applied and (iv) at the bottom line and cylinder, nonslip wall conditions are used.The mesh distribution for the whole computational domain and zoomed regions near the submerged body are presented in Figure 2. Here, the meshing strategy with a highresolution value close to the body and free surface is used to obtain high accuracy predictions for the pressure and velocity fields, particularly for the free surface shape motion.The grid and time step sensitivity tests were performed through convergency analyses in the previous work [9]; thus the fine grid with a total number of nodes of 112,649, the  + value at the body surface of 1.1, and the time step of 0.002 (s) are used in the present simulations.The mesh distribution for the whole computational domain and zoomed regions near the submerged body are presented in Figure 2. Here, the meshing strategy with a high-resolution value close to the body and free surface is used to obtain high accuracy predictions for the pressure and velocity fields, particularly for the free surface shape motion.The grid and time step sensitivity tests were performed through convergency analyses in the previous work [9]; thus the fine grid with a total number of nodes of 112,649, the y+ value at the body surface of 1.1, and the time step of 0.002 (s) are used in the present simulations.The complex simulation of the above-mentioned CFD model reveals the need for developing a more efficient surrogate model to possibly and rapidly identify nonlinear hydrodynamic pressure coefficients on a submerged body under free surface flows.The development of a surrogate model or metamodel first requires the generated samples that involve modeling parameters.In this study, Latin Hypercube Sampling (LHS) [24] is utilized to generate training samples; this technique is widely used and has demonstrated its efficiency in the construction of surrogate models, especially ones based on the Gaussian process [25,26].
Many studies have demonstrated that the shape ratio () (i.e., the length and depth ratios of the rectangular body,  = /) and the  number are the most significant parameters that affect the flow characteristic and impact hydrodynamic pressure [8,10].Therefore, these two parameters are chosen as input random variables in the study with the ranges selected and presented in Table 1, which are assumed to be a uniform distribution.The other geometry parameters, such as the depth ℎ and the clearance distance  are deterministic.By using the LHS on two modeling parameters, a total of 40 samples are generated, as distributed in Figure 3.It should be noticed that there is no specific standard for the number of initial training samples, depending on the number of input variables, particular problem and training method.Since the Gaussian process has not needed the pre-assuming of a specified model and just requires a small number of initial training samples, an optimized design of 40 samples is chosen, as proposed by [26].The complex simulation of the above-mentioned CFD model reveals the need for developing a more efficient surrogate model to possibly and rapidly identify nonlinear hydrodynamic pressure coefficients on a submerged body under free surface flows.The development of a surrogate model or metamodel first requires the generated samples that involve modeling parameters.In this study, Latin Hypercube Sampling (LHS) [24] is utilized to generate training samples; this technique is widely used and has demonstrated its efficiency in the construction of surrogate models, especially ones based on the Gaussian process [25,26].
Many studies have demonstrated that the shape ratio (AR) (i.e., the length and depth ratios of the rectangular body, AR = L/W) and the Re number are the most significant parameters that affect the flow characteristic and impact hydrodynamic pressure [8,10].Therefore, these two parameters are chosen as input random variables in the study with the ranges selected and presented in Table 1, which are assumed to be a uniform distribution.The other geometry parameters, such as the depth h and the clearance distance S, are deterministic.By using the LHS on two modeling parameters, a total of 40 samples are generated, as distributed in Figure 3.It should be noticed that there is no specific standard for the number of initial training samples, depending on the number of input variables, particular problem and training method.Since the Gaussian process has not needed the pre-assuming of a specified model and just requires a small number of initial training samples, an optimized design of 40 samples is chosen, as proposed by [26].As examples of the flow characteristic, Figure 4 shows the velocity field and pressure contour for three cases of  and  values, marked by large circles in Figure 3.The free surface shape is plotted by the red solid line using the air volume fraction value of 0.5.Under the presence of the submerged body, significantly increasing free surface flow and reduction in the depth at the upward and the downward regions is observed.Therefore, a high-velocity water flow in the downstream region is formed.In addition, submerged wake vortices behind the body are generated under the effects of the inclination of the free surface, as shown by streamline fields on the left side of Figure 4.This nonlinear evolution behavior is found to be a major mechanism that increases the hydrodynamic force coefficients in comparison with the unbounded free surface flow.The pressure distributions around the submerged body are also shown on the right side of Figure 4.In the presence of the free surface, asymmetric low-pressure regions at the top and bottom of the submerged body are observed.As examples of the flow characteristic, Figure 4 shows the velocity field and pressure contour for three cases of AR and Re values, marked by large circles in Figure 3.The free surface shape is plotted by the red solid line using the air volume fraction value of 0.5.Under the presence of the submerged body, significantly increasing free surface flow and reduction in the depth at the upward and the downward regions is observed.Therefore, a high-velocity water flow in the downstream region is formed.In addition, submerged wake vortices behind the body are generated under the effects of the inclination of the free surface, as shown by streamline fields on the left side of Figure 4.This nonlinear evolution behavior is found to be a major mechanism that increases the hydrodynamic force coefficients in comparison with the unbounded free surface flow.The pressure distributions around the submerged body are also shown on the right side of Figure 4.In the presence of the free surface, asymmetric low-pressure regions at the top and bottom of the submerged body are observed.As examples of the flow characteristic, Figure 4 shows the velocity field and pressure contour for three cases of  and  values, marked by large circles in Figure 3.The free surface shape is plotted by the red solid line using the air volume fraction value of 0.5.Under the presence of the submerged body, significantly increasing free surface flow and reduction in the depth at the upward and the downward regions is observed.Therefore, a high-velocity water flow in the downstream region is formed.In addition, submerged wake vortices behind the body are generated under the effects of the inclination of the free surface, as shown by streamline fields on the left side of Figure 4.This nonlinear evolution behavior is found to be a major mechanism that increases the hydrodynamic force coefficients in comparison with the unbounded free surface flow.The pressure distributions around the submerged body are also shown on the right side of    For each sample, hydrodynamic pressure coefficients (i.e., drag, lift and moment coefficients) acting on submerged bodies are obtained from CFD analyses.Figure 5 shows an example of the time evolution of hydrodynamic pressure coefficients for  = 0.208,  = 8,689.223(Figure 5a),  = 2.036,  = 12,958.475(Figure 5b), and  = 3.989,  = 9,339.984(Figure 5c).The numerical results show that the hydrodynamic force coefficients are significantly varied under the nonlinear interaction with the free surface.In the cases of lower  values ( = 0.208), the force coefficients are in periodic evolutions with time and are characterized by a mean value and oscillated magnitude.These predicted values are determined by an averaged method over five oscillation cycles.In the case of higher values ( = 2.036 and 3.989), stable behaviors of the force coefficients without oscillation features are observed.For each sample, hydrodynamic pressure coefficients (i.e., drag, lift and moment coefficients) acting on submerged bodies are obtained from CFD analyses.Figure 5 shows an example of the time evolution of hydrodynamic pressure coefficients for AR = 0.208, Re = 8689.223(Figure 5a), AR = 2.036, Re = 12,958.475(Figure 5b), and AR = 3.989, Re = 9339.984(Figure 5c).The numerical results show that the hydrodynamic force coefficients are significantly varied under the nonlinear interaction with the free surface.In the cases of lower AR values (AR = 0.208), the force coefficients are in periodic evolutions with time and are characterized by a mean value and oscillated magnitude.These predicted values are determined by an averaged method over five oscillation cycles.In the case of higher values (AR = 2.036 and 3.989), stable behaviors of the force coefficients without oscillation features are observed.For each sample, hydrodynamic pressure coefficients (i.e., drag, lift and moment coefficients) acting on submerged bodies are obtained from CFD analyses.Figure 5 shows an example of the time evolution of hydrodynamic pressure coefficients for  = 0.208,  = 8,689.223(Figure 5a),  = 2.036,  = 12,958.475(Figure 5b), and  = 3.989,  = 9,339.984(Figure 5c).The numerical results show that the hydrodynamic force coefficients are significantly varied under the nonlinear interaction with the free surface.In the cases of lower  values ( = 0.208), the force coefficients are in periodic evolutions with time and are characterized by a mean value and oscillated magnitude.These predicted values are determined by an averaged method over five oscillation cycles.In the case of higher values ( = 2.036 and 3.989), stable behaviors of the force coefficients without oscillation features are observed.Similarly, the outcomes of interest including six above-mentioned quantities (the mean values of C L , C D , C M and their oscillations) are obtained and summarized in Table A1 (Appendix A), resulting in a total of 40 examples of training data in the dataset.The observed responses from the dataset are later used to train surrogate models of the outcomes.

Surrogate Model Development
Based on the above model output, the surrogate model is then developed using the above-mentioned Gaussian process modeling.The major advantage of this modeling approach is that it requires less observed data for the regression as compared to other datadriven techniques, such as support vector machine or artificial neural network.To optimize the surrogate model for the prediction, several trend and correlation functions are tested in this study.Due to the nonlinearity of the hydrodynamic pressure coefficients, nonlinear regression and correlation models are selected.In particular, for the trend, polynomial (one, two, and three degrees) functions are employed.On the other term, Matérn 3/2, exponential and Gaussian (in Equations ( 8)-( 10)) correlation functions are examined.
To estimate the accuracy of each tested surrogate model, the leave-one-out (LOO) cross-validation is adopted, where one point is randomly ignored for the cross-validation and the other points are for training the surrogate model.This procedure is repeated until all the points are used.Therefore, to perform the LOO, one point x (i) from the initial DOE is subsequently removed and the surrogate model Ŷ0,(−i) x (i) is built from the remaining points of the design.The LOO cross-validation error is calculated on the true response design and its corresponding predicted responses as where Var(Y) defines the estimated variance of the output variable.
Table 2 shows the cross-validation error of the tested surrogate models as combinations of different trend and observation functions.It can be observed that the estimated LOO errors vary with different combinations and outcomes of interest.In most of the cases, the combination of 3rd degree polynomial function and Matérn 3/2 correlation function results in the best performance with a minimum mean error of the prediction for the outcomes (highlighted in bold in Table 2).Also of note is that combinations of 2nd degree polynomial-Matérn 3/2 and 3rd degree polynomial-Gaussian also exhibit a good prediction.Examples of optimal surrogate models for the drag coefficient outcomes, i.e., mean and oscillation amplitude quantities, are shown in Figure 6, where the red dots represent the DOE.The estimated parameters of all six models for six quantities of interest are summarized in Table A2 (Appendix B).Once a surrogate model is built with its estimated parameters, the hydrodynamic pressure coefficients of an arbitrary design of AR and Re can be rapidly predicted without the need for an attempt at re-simulation.
parameters, the hydrodynamic pressure coefficients of an arbitrary design of  and  can be rapidly predicted without the need for an attempt at re-simulation.

Validation of Surrogate Models with a Test Set
A test set of the hydrodynamic pressure coefficients is obtained from the previo work to validate the developed surrogate models.In particular, five different designs the submerged body under free surface flows were numerically performed, which we uniformly composed by  = 11,850 and five body shape ratios,  = 0.25, 0.5, 1.0, 2 and 4.0.As a comparison, the plots of observed mean and oscillation magnitude togeth with those predicted for three hydrodynamic pressure coefficients are shown in Figure Careful readers can see a good fit between the observed and predicted values both for t mean and oscillation quantities.To quantify the goodness of fit between the observed an predicted data, the mean square error (MSE) and coefficient of determination ( ) are c culated and presented in Table 3.It can be re-confirmed that the surrogate models pred the mean and oscillation amplitude of the hydrodynamic pressure coefficients with a hi degree of accuracy.Hence, the developed models are reliable in prediction and efficie in terms of computational effort.

Validation of Surrogate Models with a Test Set
A test set of the hydrodynamic pressure coefficients is obtained from the previous work to validate the developed surrogate models.In particular, five different designs of the submerged body under free surface flows were numerically performed, which were uniformly composed by Re = 11,850 and five body shape ratios, AR = 0.25, 0.5, 1.0, 2.0, and 4.0.As a comparison, the plots of observed mean and oscillation magnitude together with those predicted for three hydrodynamic pressure coefficients are shown in Figure 7. Careful readers can see a good fit between the observed and predicted values both for the mean and oscillation quantities.To quantify the goodness of fit between the observed and predicted data, the mean square error (MSE) and coefficient of determination (R 2 ) are calculated and presented in Table 3.It can be re-confirmed that the surrogate models predict the mean and oscillation amplitude of the hydrodynamic pressure coefficients with a high degree of accuracy.Hence, the developed models are reliable in prediction and efficient in terms of computational effort.In the practice design of potentially submerged civil structures, such as bridge decks, offshore platforms and hydrokinetic turbine blades, it is important to avoid unstable regions caused by the oscillation of the hydrodynamic forces.By considering a wide range of  number and shape aspect ratio ( = 6000-20,000,  = 0.1-10), the unstable or oscillation regions are plotted based on the developed surrogate models for the three coefficients, as shown in Figure 8, in which the bar color represents the oscillation amplitude of the examined coefficients.It can be observed that the most unstable region appears with small aspect ratios of the submerged body.amplitude of the oscillation mostly decreases with the increase of both  and .In the cases of  > 2, the oscillation amplitude significantly drops and almost equals zero.These observations are criteria for the dynamic instability assessment of existing submerged civil structures or for practice design of new ones under the free surface flow to avoid adverse effects of the dynamic impact.In the practice design of potentially submerged civil structures, such as bridge decks, offshore platforms and hydrokinetic turbine blades, it is important to avoid unstable regions caused by the oscillation of the hydrodynamic forces.By considering a wide range of Re number and shape aspect ratio (Re = 6000-20,000, AR = 0.1-10), the unstable or oscillation regions are plotted based on the developed surrogate models for the three coefficients, as shown in Figure 8, in which the bar color represents the oscillation amplitude of the examined coefficients.It can be observed that the most unstable region appears with small aspect ratios of the submerged body.The amplitude of the oscillation mostly decreases with the increase of both Re and AR.In the cases of AR > 2, the oscillation amplitude significantly drops and almost equals zero.These observations are criteria for the dynamic instability assessment of existing submerged civil structures or for practice design of new ones under the free surface flow to avoid adverse effects of the dynamic impact.

Conclusions
This study aimed to develop a computationally efficient and accurate surrogate model to estimate hydrodynamic pressure coefficients on submerged bodies beneath the water surface.Using the LHS sampling method, several computational fluid dynamics analyses, based on a Navier-Stokes solver implemented with the shear stress transport turbulence model and the volume of fluid method, were performed to extract hydrodynamic pressure coefficients and their instability.
From the outcomes of the CFD analyses, a Gaussian process modeling-based surrogate model was trained to predict the hydrodynamic pressure coefficients around submerged bodies with a rectangular shape considering a range of shape ratio and Re number values.
As cross-validation for several testing surrogate models, the optimized model was found to be a combination of the 3rd degree polynomial and Matérn 3/2 correlation functions.
Since the surrogate models were developed, the hydrodynamic pressure coefficients were then predicted for a wide range of input parameters.The finding from the study highlighted the efficiency of the surrogate model in rapidly estimating the hydrodynamic pressure coefficients in place of complex and expensive CFD analyses.
By plotting unstable regions of the hydrodynamic pressure coefficients within the ranges of the shape ratio and Re number, it is concluded that the most unstable region appeared at small aspect ratios of the submerged body.In most of the cases, the oscillation amplitude significantly dropped with the increase of both AR and Re and reached almost zero with AR > 2.
The surrogate model in this study can be practically applied in rapidly assessing the hydrodynamic force and its instability effect on existing submerged civil structures, or in designing new structures, where a suitable shape ratio should be adopted to avoid flow-induced instability of hydrodynamic forces.
The present study can also be enabled and facilitate future sensitivity, fragility and reliability studies across a broad range of submerged bodies and flow conditions that are involved in civil structures under flood and wave flows.

Figure 1 .
Figure 1.Computational model of the free surface flow around a submerged body.

Figure 1 .
Figure 1.Computational model of the free surface flow around a submerged body.

Figure 2 .
Figure 2. The mesh domain and mesh distribution around the submerged body.

Figure 2 .
Figure 2. The mesh domain and mesh distribution around the submerged body.

Figure 3 .
Figure 3. Design of experiments using LHS.

Figure 3 .
Figure 3. Design of experiments using LHS.

Figure 4 .
In the presence of the free surface, asymmetric low-pressure regions at the top and bottom of the submerged body are observed.

Figure 6 .
Figure 6.Examples of surrogate models of the drag coefficient in terms of   : (a) Mean and ( Oscillation amplitude.

Figure 6 .
Figure 6.Examples of surrogate models of the drag coefficient in terms of µ Ŷ: (a) Mean and (b) Oscillation amplitude.

Table 1 .
Modeling parameters for the design of experiments.

Table 1 .
Modeling parameters for the design of experiments.

Table 2 .
Cross-validation error estimation of the tested surrogate models.

Table 3 .
Error estimations of the observed and predicted hydrodynamic pressure coefficients in t case of  = 11,850.

Table 3 .
Error estimations of the observed and predicted hydrodynamic pressure coefficients in the case of Re = 11,850.