Cross-Scale Modeling of Shallow Water Flows in Coastal Areas with an Improved Local Time-Stepping Method

: A shallow water equations-based model with an improved local time-stepping (LTS) scheme is developed for modeling coastal hydrodynamics across multiple scales, from large areas to detailed local regions. To enhance the stability of the shallow water model for long-duration simulations and at larger LTS gradings, a prediction-correction method using a single-layer interface that couples coarse and fine time discretizations is adopted. The proposed scheme improves computational efficiency with an acceptable additional computational burden and ensures accurate conservation of time truncation errors in a discrete sense. The model performance is verified with respect to conservation and computational efficiency through two idealized tests: the spreading of a drop of shallow water and a tidal flat/channel system. The results of both tests demonstrate that the improved LTS scheme maintains precision as the LTS grading increases, preserves conservation properties, and significantly improves computational efficiency with a speedup ratio of up to 2.615. Furthermore, we applied the LTS scheme to simulate tides at grid scales of 40,000 m to 200 m for a portion of the Northwest Pacific. The proposed model shows promise for modeling cross-scale hydrodynamics in complex coastal and ocean engineering problems.


Introduction
With the continuous advancements in numerical methods and parallel technologies, cross-scale ocean models [1][2][3][4] have undergone significant developments.These models offer the ability to simulate multiple resolutions by manipulating grid sizes on a large scale.In many applications of ocean modeling, the practice is to employ smaller cells in areas where higher precision is required, while larger cells are used in other regions to optimize computational efficiency and cost-effectiveness.Time-stepping methods typically favor explicit global time-stepping (GTS) schemes due to their inherent parallelism and lower programming complexities.However, GTS schemes have a drawback-they require determining time steps that must satisfy the Courant-Friedrichs-Lewy (CFL) condition, resulting in substantial discrepancies between the local time steps of large cells and the global minimum time step.These discrepancies result in wasted computational resources, thus affecting the computational efficiency of cross-scale ocean models.Local time-stepping (LTS) schemes overcome these constraints by allowing cells in different regions to use their own local time steps rather than the global minimum time step.
In pursuit of enhancing computational efficiency in solving one-dimensional (1D) scalar conservation equations, the LTS algorithm was originally proposed by Osher and Sanders [5].The proposed scheme achieves first-order accuracy in both space and time.Kleb et al. [6] presented a temporal adaptive algorithm for the time integration of the 2D Euler or Navier-Stokes equations.Dawson [7] proposed an extension of the high-resolution scheme of the finite element method for the advection equations and conducted a detailed analysis of the second-order temporal format of the 1D scalar conservation laws in Dawson and Kirby [8].The results demonstrate that the LTS scheme is highly competitive regarding temporal accuracy.In addition, Fumeaux [9], Muller and Stiriba [10], and Tan et al. [11] investigated LTS schemes for 2D and 3D problems and corrected numerical fluxes by interpolating variable predictions during flux solving.Trahan and Dawson [12] also studied LTS schemes for shallow water equations, using a Runge-Kutta discontinuous Galerkin finite element method for spatial discretization.The LTS scheme was secondorder accurate in time away from local time-stepping interfaces and first-order accurate at the interfaces.Furthermore, higher-order LTS schemes have also been studied, and Krivodonova [13] introduced an explicit LTS scheme based on higher-order Runge-Kutta (RK) for solving conservation laws and wave propagation.Hoang et al. [14] developed second and third-order accurate explicit LTS schemes for shallow water equations.The proposed LTS schemes are of predictor-corrector type, constructed using Taylor series expansion and the stability-preserving Runge-Kutta (SSP-RK) stepping algorithm for constructing predictors, enabling the algorithm to achieve second and third-order temporal accuracy across the entire domain.Dazzi et al. [15]; Hu et al. [16] also combined GPU (Graphics Processing Unit)-accelerated algorithms with LTS algorithms to make the model computation more efficient.
Kramer and Jozsa [17] have introduced a simple but robust LTS algorithm based on the finite volume method for the 2D shallow water equations (SWEs).And the interface fluxes are corrected according to the correction algorithm of Tan et al. [11].Sanders [18] applied the LTS method to an explicit Godunov-type shallow water model to improve operational efficiency.Hu et al. [19,20] improved Sanders' algorithm by modifying the LTS levels, which are the localized gradations of time steps, at the wet/dry front and dynamic/static front and applied the improved model to calculating tides and hydro-sediment-morphodynamic processes.This represents an enhancement for numerical simulations of 2D shallow water flow.Yang et al. [21] studied the implementation of the LTS algorithm on a 2D shallow hydraulic model and discussed the influencing factors and adaptive conditions of the algorithm.Dawson et al. [22,23], employing the Runge-Kutta discontinuous Galerkin finite element method, investigated the LTS algorithm of the 2D SWEs.These findings were subsequently applied to simulate tidal flows and storm surges.Lilly et al. [4] developed a fourth-order algorithm based on Hoang et al. and achieved good results in storm surge simulation using the Model for Prediction Across Scales-Ocean (MPAS-O).In ocean modeling, a simple but stable and efficient model with guaranteed accuracy is one of our main concerns.Kramer and Jozsa [17] recommended that the maximum LTS level should be less than three and two neighboring cells' LTS level should not differ by more than one.Sanders [18] only recommended setting the LTS level to a maximum of four to ensure the stability of the calculation.Hu et al. [20] proposed a slight but useful modification to use larger LTS levels but uncorrected for interface layer fluxes with significant differences in LTS levels.Yang et al. [21] did not mention the method of correction in their article.
In this study, we develop a 2D shallow water model using a conservative, explicit LTS algorithm intended for cross-scale ocean hydrodynamic modeling applications.The time step is determined by the local CFL condition, allowing for a combination of local time and spatial refinement instead of relying on the global CFL condition.The model combined LTS scheme for the shallow water equations proposed by Hu et al. [20]; Kramer et al. [17]; Krivodonova [13]; and Sanders [18] with flux correction theory based on the Taylor series expansion proposed by Hoang et al. [14], and applies the newly designed LTS scheme to the calculation of cross-scale 2D shallow water equations model.It is important to note that our prediction scheme yields the same results as the first-order prediction scheme employed by Kramer et al. [17].However, based on the specific characteristics of the spatial discretization scheme and the solver scheme used in this paper, we chose the corrector defined by Hoang et al. [14] to ensure the accurate conservation of the fluxes.Moreover, with respect to the equation-solving and time-stepping methods used in this paper, we introduce a single interface to couple the intermediate time increments of two distinct LTS levels.This streamlines the algorithm's complexity but still maintains accuracy and conservation.Additionally, the governing equations are solved on unstructured meshes by the finite volume method.Numerical fluxes between cells are evaluated using Roe's approximate Riemann solver [24,25].And the bottom slope term and wet/dry boundaries treatment by using the method proposed by Yu et al. [26].Numerical tests demonstrate that the model significantly improves computational efficiency and effectively balances fluxes between regions with varying LTS grading.We also attempted to establish an offshore tidal flow model to verify its practical performance in ocean engineering applications.
This paper is structured as follows: Section 2 introduces the governing equations, finite volume discretization method, and details model processing.We also present the mathematical formulas for the improved LTS algorithm.Section 3 offers two ideal tests to verify the model's performance by simulating the spreading of a drop of shallow water and tidal flats' overflow and drainage processes.Section 4 demonstrates the application of the established cross-scale 2D shallow water model to simulate the currents in the Bohai Sea, Yellow Sea, and East China Sea from the Zhoushan Archipelago to the external sea areas and compares the results with actual measurement data.Finally, Sections 5 and 6 conclude the paper and provide further discussions.

Governing Equations
Shallow water models based on unstructured meshes and finite volume method [27,28] have been well developed and widely used to simulate coastal processes, including tides, storm surges, and sediment transport in estuaries and nearshore regions [26,[29][30][31][32].The 2D shallow water governing equations are written in matrix form as follows: U represents the conserved physical vector; F, G denote the convective fluxes in the x and y directions, respectively; S stands for the source term; h is the water depth as depicted in Figure 1a; The variables u, v correspond to the depth-averaged velocities in the x and y directions, respectively; g is the gravitational acceleration; where ( ( ), ( )) = E F U G U is the numerical flux.L is the length of the grid edge and m is the LTS level of neighboring cells (see Section 2.3) in counterclockwise order.

Finite Volume Discretization and Flux Solving
To improve the adaptability to complex areas, this paper utilizes unstructured triangular meshes to partition the computational domain.A non-boundary triangular mesh has three nodes, three edges, and three adjacent cells (Figure 1b).An edge has two nodes and two adjacent cells.The elevation of the bottom bed is defined at the nodes and the other physical variables (h, u, v) are defined at the center of the cells.
Using the finite volume method, Equation (3) is integrated over the control volume: Utilizing the rotational invariance of the Euler equations, the governing equations are locally rotated within each computational cell to coordinate systems that are orthogonal to the cell boundaries for solving.According to the rotational invariance of Euler equations, the discretization of cell i can be derived as follows: where U represent two consecutive time steps; i is the cell number, 1, 2, 3 , = iN , N is the total number of cells; j is the cell edge number, 1, 2,3, , = jM , and M is the total number of cell edges; i A is the area of the i-th control volume; T is the rotation matrix; is the numerical flux at the j-th edge of the i-th cell; j L is the edge length at the j-th edge of the i-th cell.
The solution of the numerical flux nj E is the core of solving Equation (5).This paper uses Roe's approximate Riemann solver [24] to compute the numerical fluxes at the cell interfaces: )) (6) where () are the fluxes at the left and right sides of the edge interface, and j λ , j γ are the eigenvalues and eigenvectors of the modified Jacobian matrix [24], respectively.When encountering extreme situations such as the dry bed dam-break problem, entropy correction is necessary to ensure that the Roe scheme satisfies the entropy condition and produces accurate results.In addition, this paper adopts the bottom slope and wet/dry boundary treatments proposed by Yu et al. [26], ensuring the accuracy of the spatial scheme and conserving the interface flux.

Local Time-Stepping (LTS) Scheme
The explicit time discretization scheme is usually used for nonlinear discrete systems because of its simplicity and natural parallelism.However, the explicit method is inefficient for cross-scale models due to the global time step.Therefore, we propose an improved LTS scheme that enables the cells to advance based on their local time steps.
The first-order explicit time discretization format is given by Equation ( 5), which can be simplified to ,where For simplicity, we display the "coarse" and "fine" time increments of part of the computational domain as follows (Figure 2a).
where ,0 = n n tt , and To define the LTS scheme for the discrete system obtained from the shallow water model, we refer to Hoang et al. [14] and further partition the cells in the local computational domain, which contain different time increments, into three classes (Figure 2b).
The cells in this set belong to the following: P F C -"fine" cells, .In this paper, we initially apply the algorithm to the first-order forward Euler method.Only one interface is introduced to couple the "fine" and "coarse" regions.Since the time-stepping method consists of only one prediction and one correction process, we expect the predicted value of the interface cells to affect only the "fine" cells.
The algorithm begins by hierarchizing all cells and identifying the interface layers, aiming to maximize the computational efficiency of each cell while ensuring compliance with the local CFL conditions.Then evaluate numerical flux and advance for different time increments until a complete increment ends.To obtain the advancement of intermediate time levels in the interface layers, we use the predictor proposed by Hoang et al. [14].Once the predicted values are obtained, we can advance the values at the intermediate time levels for the coarse grids in the interface.Finally, a correction ensures the conservation of the flux.The following section presents the specific LTS algorithm based on this method.
Based on the local CFL conditions, calculate the maximum local time step  i t that each cell can be satisfied [20].
( ) ( ) Cr is the Courant number, typically set to 0.9; ij r is the distance from the center of the i-th cell to its j-th edge; , ij ij uv are the flow velocities in the normal local coordinate system to the j-th edge of the i-th cell; i h is the water depth of the i-th cell; is the critical water depth, which is determined according to the actual situation.It is worth noting that when the h of a cell is less than the cwd h , the cell is considered a dry cell, and its maximum local time step can be infinitely large.However, considering the actual situation, it is set to the maximum value of the local time step in wet cells.
The minimum global time step can be obtained from min 1, min( ) , the model is equivalent to the GTS model.Then, we need to modify the LTS level of each cell and calculate the LTS level at the cell edges.Sanders [18] proposed that when the LTS levels change significantly in the interface region, the LTS level of adjacent cells in the coarse region should be modified to a smaller value.Based on numerical experiments, it is optimal to modify the interface LTS level according to the maximum LTS level of int P C .At the same time, when the interface is a dry/wet front or a dynamic/static front, i.e., when the flow velocity of one side of the interface is 0 or the water depth  i cwd hh, the interface LTS level must also be modified.For conservatism, this paper adopts the method of Hu et al. [20] to correct the particular fronts.After the modification is completed, the Depending on the obtained LTS level of each cell, the local time step Assuming that the solution at the moment t is known, we aim to find the secondorder approximation of which gives a truncation error of second order in time.By we obtain a second-order approximation of where t kM =− .After obtaining the values of the intermediate time levels, the numerical fluxes and physical variables of the "fine" cells on the subcycle can be further calculated.We note, however, that when using first-order Taylor expansion, the result is the same as the linear interpolation of the forward Eulerian solution.The need for Taylor expansion becomes apparent when extended to higher orders.When ( )   0 0, 1, mod( , 2 ) 0 ( 1) temp , where temp 0, mod( , 2 ) 0 correct the physical variables of int P C using the interface fluxes, where 0 l is the threshold used for updating.Referring to the treatment described by Osher and Sanders [5] and Hoang et al. [14], the specific correction algorithm is: In order to minimize computation time, , ( )

FU
does not need to be recalculated but is stored as shared information that is called upon when corrections are made.Consequently, the correction process incurs additional overhead compared to the original LTS method.We will delve into this in Section 3.2 for a detailed analysis.Figure 3 provides a flow chart of the numerical structure.Taking a complete cycle containing two LTS levels (  = 2) as an example, the LTS algorithm can be summarized as the following steps (Table 1).  1.
Table 1.First-order LTS algorithm for a complete cycle containing two LTS levels.
Step 1 Calculate the LTS parameters for all cells.
Step 2 Start the first subcycle.Calculate the () n FU for all cells.
Step 3 Update solutions of the first subcycle using () n FU for all cells.End the first subcycle.
Step 4 Start the next subcycle.Calculate Update solutions for fine cells.
Step 6 Based on the fluxes calculated in the two subcycles ( using Equation (24).
End the complete cycle.

Idealized Test
To assess the stability and efficiency of the improved LTS scheme, two ideal numerical tests are conducted to demonstrate the model's performance.In Section 3.1, we initially consider a simplified 2D model to simulate the 1D Spreading of a Drop of Shallow Water [33].Contrasting the numerical results with the exact solution, we establish that the improved LTS scheme offers enhanced accuracy and conservation properties compared to the original scheme.In Section 3.2, a 2D test is presented, utilizing a complex scenario of a channel-flat system [34].And we primarily delve into the advantages and limitations of the improved LTS scheme in terms of efficiency.To facilitate a more comprehensive efficiency comparison, all models are run on a PC with an Intel ® Xeon ® CPU E5-2640 v3 (2.60 GHz) and 64.0 GB RAM, without parallel processing.

Test 1: Spreading of a Drop of Shallow Water
Test 1 involved simulating the spreading of a parabolic-shaped water droplet to assess the capability of the LTS schemes in modeling dry bed dam-break scenarios, as well as the precision and conservation properties of the LTS models.In this test, we employed a simplified 2D model to simulate a 1D problem.The computational domain consisted of a rectangular, horizontal riverbed measuring 12 m in length and 1 m in width, with a parabolic water drop at x = 0 m.When the droplet suddenly breached, it led to the generation of dam-breaking water flows in two directions.This particular scenario has exact solutions.The gravitational acceleration g = 1.0 is taken and frictional resistance is neglected.The initial conditions are: The computational domain is discretized using 12,952 unstructured grids, with a minimum cell size of 0.02 m located at x = 0 m and a maximum cell size of 0.1 m at both ends of the river.A total of 6 levels of user m from 0 to 5 were selected, and a total of 12 working conditions including the original LTS models and the improved LTS models were calculated.To facilitate error calculations, the minimum global time step for all working conditions is set to 0.002 s.This section primarily focuses on validating the advantages of the improved LTS scheme, while a discussion on computational efficiency is provided in Section 3.2.
Figure 4 provides comparisons between simulation results and exact solutions at t = 1 s, 2 s, and 3 s.In order to achieve 1D plotting, the 2D simulation results are projected onto various coordinates along the channel's centerline.Combined with the local zoomedin images, it becomes evident that the improved LTS scheme performs exceptionally well in terms of water depth (h).The results align closely with the exact solutions and are notably consistent with the outcomes obtained using the GTS scheme.However, the results of the original LTS scheme have oscillated at t = 1 s and gradually deteriorated according to the advancement of time; in terms of the flow velocity (u), similarly, the improved scheme is also better than the original scheme.Figure 5 illustrates the LTS level and water depth distribution in each cell at t = 0 s.According to the comparison of the step-by-step output results, we observe that the regions where the oscillations begin all appear near the interface.This is due to the fact that in the initial setup of our model, each interface within the water droplet acts as a dynamic "front".Without appropriate corrective measures, this leads to losses in mass and momentum conservation at the interfaces in coarse grid cells, thereby causing oscillations in the computations.In the improved LTS scheme, corrections made at the interface layers reduce these losses, thereby ensuring energy conservation and mitigating oscillations.To further quantify the performance of the improved LTS scheme in terms of accuracy and conservation, the root mean square error (RMSE) between the results of each working condition and the exact solutions, as well as the mean global relative error (  ) in water mass, are calculated using the following formulas: where ij f is the simulated value, ' j f is the exact solution, and N is the total number of comparison grids.

( )
i Vol t and ( ) Vol t − represent the total volume of water within the computational domain at time instances i t and 1 − i t , respectively.Vol denotes the net volume passing through the boundaries between the time instances, and K represents the total number of time intervals considered.Notably, ( ) 0 Vol t corresponds to the total vol- ume of water within the computational domain at the initial time instance.
The RMSEs in water depth and velocity are depicted in Figure 6.As previously analyzed, it is evident that the improved LTS scheme ( user m = 1 − 5) does not exhibit decreased computational accuracy compared to the GTS scheme ( user m = 0).The RMSEs in water depth (h) remain in the order of

Test 2: Tidal Flat/Channel System
In this test, we focus on the performance and limitations of the improved LTS scheme in terms of computational efficiency.The ideal test is a tidal process simulation on a conical tidal flat (Figure 8a).The model consists of 19,257 nodes and 38,352 unstructured meshes (Figure 8b), with the minimum cell edge length being approximately 5 m and the maximum being approximately 100 m.Through a series of numerical tests, a total of six working conditions were adopted for user m values from 0 to 5. Table 2 presents a comparison of the total simulation times and efficiency for various conditions in two tests.In Test 1, the minimum time step is restricted to 0.002 s, while Test 2 does not impose this limitation.Firstly, it can be seen that the total computational time decreases with the increase in the given user m .Notably, the computational efficiency in Test 1 improves by a factor of 1.860 at .Although both LTS schemes exhibit similar acceleration efficiency across LTS levels, the original scheme requires less total computation time than the improved scheme.This difference arises due to the predictive-corrective algorithm.While the two schemes are consistent in calculating LTS parameters, and both involve almost identical memory and time overheads, the improved LTS scheme introduces additional computational costs when updating fluxes and advancing time steps. .Since memory information from the prediction module can be shared and does not involve additional loops, we consider the cost associated with the prediction phase negligible.The 3.8% cost at 0 = user m is a result of conditionals in the loop, and this cost increases gradually with user m .Compared to the original LTS scheme, we find these costs acceptable.Additionally, it is noteworthy that computational efficiency does not linearly increase with user m .Take Test 2 as an example, when user m increases from 0 to 1, the computational acceleration is only 1.059, and when user m increases from 4 to 5, it only improves by about 0.1.However, a significant improvement of about 0.7 occurs when user m increases from 1 to 2. This discrepancy is determined by the total cell counts at various LTS levels, which is why, despite the possibility of increasing user m infinitely, it is not advisable.Examining the proportion of cell counts for each LTS level in the entire computation when , we see that cells within LTS levels 4 and below account for approximately 96%.Therefore, increasing user m further will not result in a significantly accelerated computation but would introduce unnecessary computational costs that reduce efficiency.Figure 9 shows the distribution of cells at each LTS level when T = 3 h, T = 6 h, and T = 9 h.  Figure 10 compares water level and v-velocity at six sites (denoted as a-f) in the model under six different LTS conditions.The selected sites are all located along the line at x = 2000.Site "a" is positioned at the entrance of the channel, site "b" is located midway between the entrance and the center of the channel, and site "c" is at the center of the channel.Sites "d", "e", and "f" are situated on the cone-shaped tidal flat at elevations of 0.2 m, 0.4 m, and 0.6 m, respectively.The results indicate that water level and velocity values for all LTS conditions match well with those of the GTS model.Tables 4 and 5 display the RMSEs of the simulated variables for each condition.The RMSEs for water level and velocity are relatively small, indicating a good consistency between the LTS and GTS models.Although the RMSE gradually increases as user m increases, the intermediate error results are acceptable.We also validate the conservation of the model.Figure 11 shows the mean global mass conservation relative error  for each condition and compares it with the compu- tational results of the original model.It can be observed that both models are similar in their calculated errors, but the improved model demonstrates noticeably better conservation when user m is increased.

Application to Simulating Tides in a Complex Coastal Area
In this section, to verify the capability of the cross-scale 2D shallow water model based on the improved LTS scheme for practical engineering applications, we established a selected NW Pacific tidal model containing Zhoushan Islands.Zhoushan Islands are a group of islands located in the northeastern part of Zhejiang Province, on the south side of the mouth of the Yangtze River and the outer edge of Hangzhou Bay in the East China Sea.Although there have been many studies on tides and currents in the Hangzhou Bay and Zhoushan Islands area in the past, most of them focused on the essential characteristics of tides and currents.Many numerical simulations had relatively low spatial resolution (1 km), which needs to be improved for such a complex terrain as the Zhoushan Islands area.
In this paper, the computational domain contains the entire Bohai Yellow Sea, and East China Sea, as well as the main streams of the Yangtze and Qiantang Rivers (Figure 12a).The computational domain has three open boundaries, including the Sanjiangying boundary of the Yangtze River, the Hangzhou boundary of the Qiantang River, and the outer sea boundary located near the continental shelf.
Considering that the tidal variation in the computational domain during the simulation time is little affected by weather, the annual average runoff is taken for both open river boundaries.The outer open boundary is the hydrostatic level superimposed on the astronomical tide level, derived from the Chinese sea tidal wave model TPXO [35], which contains nine main sub-tides, Q1, P1, 01, K1, N2, M2, S2, K2, Sa.The computational domain is divided by triangular meshes, with denser meshes in the Zhejiang coastal area, especially in the Hangzhou Bay and Zhoushan Islands.A total of 75,492 cells and 39,936 nodes are arranged, the minimum cell edge length is approximately 150 m, and the maximum cell edge length is approximately 30,000 m.The coordinate projection uniformly adopts the Beijing 54 coordinate 3-degree zone (central meridian 120° E).The topography of the outer sea is based on the latest nautical charts released in China and data provided by the National Oceanic and Atmospheric Administration (NOAA) of the United States.Moreover, the recently measured terrain data are used for the Yangtze River estuary, Zhejiang coast, Hangzhou Bay, and Qiantang River estuary. Figure 12b,c    , the simulation time is only 9.23 h, which is about 3.9 times higher than the computational efficiency of the GTS model.However, it is worth noting that the simulation elapsed time of the model is 10.95 h when .This is because the cell correction algorithm becomes more complicated with the local time step level increase.Combined with the ideal test in the previous section, this indicates that although the local time step level can be infinite, the number of cells at each time level still needs to be considered to obtain optimal computational efficiency.Figure 13 illustrates the distribution of LTS levels for cells at different time instances.At a global scale, the changes in LTS levels during the computation process remain relatively small.However, noticeable variations in LTS levels do occur in regions with grid refinement.This shows the tidal model, indicating that tidal fluctuations have minimal impact on LTS levels in deep water areas with larger grid cells.Conversely, in shallower water regions with denser grids, tidal effects trigger significant changes in LTS levels.Importantly, these changes tend to originate at the interfaces.Thus, for extended-duration simulations, the incorporation of the new LTS scheme proves essential to accurately capture these variations.7 and 8 present the skill scores for the GTS and LTS models with m user values ranging from 0 to 7. The skill scores for all tidal levels consistently reach above 0.97, with exceptional performances at the Xiaoqushan and Changbai stations achieving a score of 0.98.Although there is a marginal decline in scores with the increase in m user , the overall results remain highly satisfactory.In terms of flow velocity, except for the flow velocity during the spring tides at station a, all other stations consistently score 0.90 or above.Remarkably, the results at m7 = user even outperform those at m0 = user .Model performance is also evaluated using RMSE (Tables 9 and 10).LTS model results are in close agreement with the observed values.

Conclusions
A 2D cross-scale shallow water equations model based on an improved LTS scheme is developed in this paper, which is suitable for modeling cross-scale ocean hydrodynamic problems.The governing equations are numerically solved on unstructured triangular meshes using the finite volume method and Roe's approximate Riemann solver.A more stable and conservation-focused LTS scheme has been developed, employing a singlelayer interface strategy and a prediction-correction method.
Model performance is validated in the spreading of a drop of shallow water and the tidal flat/channel system.The results indicate that the improved LTS scheme exhibits superior conservation properties, ensuring precision throughout the computational process without compromising water quality.Moreover, the novel LTS scheme significantly improves computational efficiency without incurring additional burdens compared to the original scheme.
Due to the efficiency and conservativeness of the improved LTS scheme, we successfully simulated the tidal processes in the Bohai Sea, Yellow Sea, East China Sea, and Zhoushan Islands using the established high-efficiency cross-scale 2D shallow water model.The simulation results are well matched with the observed data at the Zhoushan Islands.Furthermore, the computational efficiency is improved by a factor of 3.92.The new LTS scheme guarantees accuracy even at LTS level 7.This improvement paves the way for a promising future in a wide range of cross-scale ocean modeling applications.

Discussions
Note that, the speedup factor of the LTS scheme proposed in this paper to improve the efficiency of the model varies with the non-uniformity of the meshes and flow velocities but does not increase indefinitely.Its variations are contingent on the number of cells at different levels and are dictated by the LTS level encompassing the most cells.And the overhead of the algorithm increases as the LTS rank increases, which is also a tradeoff.Additionally, the time scheme used in this article is first-order, which has relatively low computational accuracy.In future research, we plan to explore the development of highorder accuracy LTS schemes.In conjunction with the study conducted by Hoang et al. [14], the explicit LTS scheme employed in this paper, based on prediction-correction, exhibits inherent parallelism.Therefore, it is also possible to combine the improved LTS algorithm with program parallel algorithms [15,16] to further improve the computational efficiency of the model.Moreover, the cross-scale coastal hydrodynamic model developed based on the improved LTS scheme in this study holds promise for simulating nearshore storm surges and tidal flats, offering broad applications.

Figure 1 .
Figure 1.(a) Non-constant flow over complex topography. is the water surface elevation, b z is

Figure 2 .
Figure 2. (a) Coarse and fine time increments,  n t is the local time step of the coarse time incre- ments and coarse" cells.The cell edges in this set belong to the following:E F C -"fine" cell edges; int E C -interface cell edges; E C C-"coarse" cell edges.Notably, for the cells sharing a boundary with the interface cells, the shared edges belong to the finer cells, i.e., the shared edges of P

C
are identified, and the LTS level ij m of the cell edges is calculated. min and the time interval of a complete cycle are calculated.Once all the LTS parameters are calculated, the numerical fluxes and physical variables can be updated through the intermediate time steps n s , fluxes.After the physical variables of int P C are updated to the next time cycle with a "coarse" time step, physical variables of P F C have been synchronized to the first cycle.We use Taylor series expansion to predict the values of intermediate time levels in int P C .

Figure 3 .
Figure 3. Numerical flowchart of the improved LTS, where "step" represents the algorithmic step in Table1.

Figure 4 .
Figure 4. Comparison between the exact solutions and simulation results at t = 1 s, 2 s, 3 s.Each figure includes a detailed zoomed-in image, indicated by a red box in the lower right corner.Note that in the comparison of h, the improved LTS results override the GTS (  = 0) results.

Figure 5 .
Figure 5. Distribution of LTS levels and water depths in each cell at t = 0 s.Note that the left scale is the river width, and the right is the initial water depth.

5 10 − to 4 10−
. In terms of velocity (u), while accuracy experiences a slight decline with the progression of time, the improved LTS scheme continues to demonstrate its advantages.Conversely, the original LTS scheme shows a sharp decline in computational accuracy at user m = 2, deteriorating further with higher values of user m .

Figure 6 .
Figure 6.RMSE statistics for improved LTS and original LTS: (a-c) are the water depth RMSEs and (d-f) are the flow velocity RMSEs.

Figure 7
Figure 7 illustrates the mean global relative error in water mass for each working condition.The global water mass error is computed as an average across the total number of time steps, as opposed to assessing the error at individual time intervals.Notably, the original LTS scheme results in a substantial loss of mass, scaling up to four orders of magnitude with increasing user m .In contrast, the improved scheme maintains a stable conservation of mass, with errors in the range of approximately 6 10 − to

Figure 7 .
Figure 7.The user m -series of the mean global relative error in water mass errors for original LTS and improved LTS.

Figure 9 .
Figure 9. Distribution of cells LTS levels at T = 0 h, T = 3 h, and T = 9 h.

Figure 10 .
Figure 10.Time series of water level and v velocity of fringe at stations (a-f).

Figure 11 .
Figure 11.RMSE comparison of mean global relative error in water mass errors for original LTS and improved LTS.
show the topography and meshes of the model in areas of validation sites.The selection of the Manning coefficient affects the calculation results considerably.After several numerical experiments and debugging, the Manning coefficients are taken in the range of 0.016~0.018for the outer sea, 0.010~0.016for the Zhoushan Islands to Hangzhou Bay boundary, and 0.014~0.016for the Yangtze River estuary to the open boundary.We selected eight working conditions with user m ranging from 0 to 7 and simulated the tidal process from 20 May to 21 June 2019, for a total of 32 days.The minimum global time step for all conditions is set to 2 s to facilitate the calculation error.

Figure 12 .
Figure 12.Calculation domain location, topography, and meshes.(a) zthe location of the calculation domain and the water depth topography; (b,c) the location of the validated sites and partial refined meshes.

Figure 13 .
Figure13.LTS levels distribution for the entire computational domain and the Zhoushan regional grids at these moments.Note that the location of the zoom-in localized map is marked by the red box.

Figures 14 and 15
Figures 14 and 15 show the time series of observed and simulated values of tide level and tidal current (velocity and direction) measurement stations in the Zhoushan Islands.Stations Daishan, Qushan, Ximatou, Xiaoqushan, and Changbai are tide stations, while a and b are tidal current measurement stations, each measuring four sets of tidal data during spring and neap tides.To better demonstrate the agreement between the simulated and observed values while comparing the results of different time-step models, we compare the simulated results with m0 =user

Figure 14 .
Figure 14.Comparison of simulated and observed water level time series.Stations, Xiaoqushan and Changbai, retained only 7 days of comparisons to show detail.Model performance is assessed using the skill score given by

Figure 15 .
Figure 15.Comparison of simulated and measured flow velocity and flow direction time series.The left side of the figures shows the tidal flow velocity time series for sites a and b during spring and neap tides, while the right side displays the corresponding flow directions.
nk at the intermediate time step , ,U, U nk at by truncating Equation (17) to the second term (the first-order Taylor expansion):

Table 2 .
Ideal tests simulation time and efficiency for different working conditions.When   = 0, the scheme is equivalent to the GTS.
user m

Table 3 .
Overhead proportion (OP) of the correction module and total number of cells proportion (CP) of each LTS level in the whole computation when

Table 4 .
RMSEs of water level for different LTS conditions at sites a-f with GTS model.userm

Table 5 .
RMSEs of v-velocity for different LTS conditions at sites a-f with the GTS model.

Table 6
shows the comparison of the computational efficiency for different conditions.The LTS model can significantly improve the calculation efficiency, and the overall computation decreases as m increases.When 0 = user m

Table 6 .
Comparison of simulation time for different working conditions of tidal simulation in Zhoushan Islands.When   = 0, the scheme is equivalent to the GTS.

Table 7 .
Skill scores of water level (h) for tidal simulation in Zhoushan Islands.

Table 8 .
Skill scores of flow velocity (m/s) and direction (°) for tidal simulation in Zhoushan Islands.

Table 9 .
RMSEs of water level (h) for tidal simulation in Zhoushan Islands.

Table 10 .
RMSEs of flow velocity (m/s) and direction (°) of tidal simulation in Zhoushan Islands.