1. Introduction
Short-term wave forecasting is essential for ensuring the safety and efficiency of ship navigation, coastal engineering operations, and disaster mitigation efforts, especially under extreme weather conditions [
1,
2]. Accurate forecasts can substantially reduce the risks posed by rapidly evolving sea states, particularly during tropical cyclones, which often generate extreme wave conditions and threaten coastal infrastructure and maritime activities [
3].
Numerical models are widely used in studying and forecasting the significant wave height (Hs) [
4]. Well-established models such as the Simulating Waves Nearshore (SWAN) model [
5], MIKE 21 [
6], and Delft-3D [
7] have been extensively applied in predicting Hs in coastal and nearshore regions, and have demonstrated good performance under typhoon-induced extreme wave conditions [
8,
9,
10]. In operational practice, however, these models are more often applied in hindcast mode to reconstruct past wave conditions, as their real-time use is limited by high computational cost, the need for extensive input data, and the time required for setup and calibration [
11]. This limitation creates a clear need for alternative approaches capable of producing rapid forecasts. Machine learning (ML) models, once trained, can generate predictions almost instantaneously, making them a promising complement to numerical models for short-term wave forecasting.
In recent years, ML techniques have shown great promise in coastal and ocean engineering applications [
12,
13,
14,
15], especially for forecasting wave conditions [
16,
17,
18]. A variety of models, such as support vector machines (SVMs), random forest (RF), artificial neural network (ANN), and recurrent architectures like long short-term memory (LSTM) networks, have been successfully employed to capture the dynamic behavior of ocean waves [
19,
20,
21,
22,
23,
24]. For example, ANN has been applied to estimate wave breaking height using environmental parameters [
25]. RF, ANN, and SVMs have been compared for swell occurrence prediction, with RF achieving the highest accuracy [
26]. Jörges et al. [
27] found that incorporating bathymetric features alongside meteorological inputs can significantly enhance the accuracy of LSTM-based wave height forecasting models. Lu et al. [
28] proposed a hybrid deep learning framework named Extreme-Enhanced LSTM-NBEATS, which achieved high accuracy in 24 h Hs forecasts, particularly under extreme wave conditions in the Gulf of Mexico. More recently, Tan et al. [
29] developed a Swin Transformer-based deep learning model for regional wave height prediction. With a carefully designed architecture, the model accurately reproduces wave heights up to 24 h in advance across the target region. Overall, ML approaches can improve wave height prediction accuracy while reducing computational cost [
30,
31]. In additions, previous studies have demonstrated the strong performance of RF in short-term environmental forecasting [
32] and the ability of LSTM to model dynamic wave processes with high accuracy [
27]. Therefore, in this study, RF and LSTM are selected as two representative and widely used ML models to evaluate and compare their predictive capabilities for significant wave height forecasting. These two models represent distinct methodological paradigms (tree-based ensemble learning vs. recurrent neural networks), enabling a meaningful comparative evaluation of their predictive capabilities within the same experimental framework.
Despite recent progress, the effectiveness of data-driven ML models for wave prediction still largely depends on the availability of sufficient observational data for training [
28,
33,
34]. Most existing applications have been conducted in open-ocean or well-instrumented coastal environments where long-term buoy records are available. For example, an innovative deep-learning framework combining Variational Mode Decomposition, LSTM, and Transfer Learning has been successfully applied to Hs forecasting using buoy measurements and ECMWF wind data [
35]. Similarly, generalized machine learning approaches such as ANN, SNN, XGBoost, and LightGBM have been trained on large coastal datasets from 47 stations along the North American coast and evaluated on 6 independent stations [
36]. To mitigate the limitation of sparse in situ data, especially during extreme weather events, researchers have begun integrating physics-based numerical simulations with ML algorithms to improve prediction skill in data-scarce but high-risk scenarios [
11,
37,
38]. One approach involves using data generated from established wave models, such as SWAN, to train surrogate ML models that can approximate wave conditions with reduced computational demand. For instance, Chen et al. [
39] developed a surrogate prediction framework based on the random forest algorithm, trained on spatial wave data derived from SWAN simulations, which enabled efficient wave condition forecasting without running the full numerical model. Expanding on this idea, Chen et al. [
40] demonstrated that coupling SWAN with machine learning techniques, including backpropagation neural networks and random forest regression, can significantly improve the prediction of wave heights under typhoon conditions, outperforming the original SWAN model in both accuracy and responsiveness.
In contrast, research in the offshore waters adjacent to the Pearl River Estuary (PRE) in the northern South China Sea (the northern part of the South China Sea) remains relatively limited, particularly under typhoon conditions when reliable Hs measurements are scarce due to safety constraints and instrument failures. This region is one of the most economically developed and densely populated coastal areas in China, with intensive maritime traffic, port operations, and coastal infrastructure. The scarcity of accurate and timely wave forecasts under extreme conditions poses substantial risks to navigation safety, coastal engineering, and disaster preparedness. These limitations posed a significant challenge for developing robust short-term forecasting systems in such high-risk areas.
This study aims to develop a hybrid prediction framework that integrates high-resolution SWAN simulations with RF and LSTM to improve short-term significant wave height forecasting in data-scarce estuarine environments. The framework is trained and validated using SWAN-simulated wave data from multiple historical typhoon events in the PRE, a region in the northern South China Sea where frequent tropical cyclones generate complex and highly variable wave fields. The novelty of this study lies in systematically evaluating both temporal- and spatial-generalization performance under typhoon conditions and revealing the stage-dependent predictive behavior of RF and LSTM across multiple events. This approach provides a transferable methodology for enhancing wave forecasting capability in similar coastal regions worldwide. This paper is organized as follows:
Section 2 describes the study area, SWAN model setup, machine learning model architectures, and experimental design.
Section 3 presents the results of wave height prediction performance and model generalization across typhoon events.
Section 4 discusses the findings, and
Section 5 concludes the study with key implications and future perspectives.
4. Discussion
This study developed and evaluated RF and LSTM for predicting Hs under typhoon conditions in the PRE region. SWAN-simulated data from 87 historical typhoon events were used, with 77 events for model training and 10 independent events for validation, enabling assessment of both event-level and spatial-generalization performance.
Our results indicate that RF tends to maintain higher stability during the early development and late dissipation stages of typhoon-induced waves, whereas LSTM performs better near the peak stage, when wave growth is most rapid. This stage-dependent behavior reflects the different learning mechanisms of the two models: RF excels at capturing relatively stable patterns with less temporal dependence, while LSTM is better suited for modeling highly dynamic transitions. These findings are consistent with previous literature that highlights the effectiveness of deep learning in capturing highly dynamic wave processes [
38]. Moreover, regarding forecast horizons, the reduced accuracy in the 6 h predictions compared to the 3 h predictions is likely due to the absence of future wind-field inputs, which limits the models ability to capture subsequent changes in wave growth and decay. This effect is more pronounced in LSTM, which relies heavily on recent temporal patterns that may lose predictive relevance over longer horizons.
Although model performance across all validation typhoon events was broadly similar, certain variations were observed. For instance, during Typhoon Koinu 2023, the spatial-generalization performance at QF306 was notably lower than at QF307, despite both sites being geographically close (shown in
Figure 12f and
Figure 13f). A likely contributing factor is the relative position of the typhoon center in relation to the prediction site, which can strongly influence the local wind–wave generation environment. This variable was not explicitly included in the present study, meaning that spatial differences in storm forcing may not have been fully captured by the models. A similar perspective was emphasized by [
49], who highlighted the importance of incorporating spatially varying wind fields for improving wave forecasts. Future work could therefore explore adding the relative distance and orientation between the typhoon center and prediction sites, as additional input features, to enhance model robustness in cross-event applications. Moreover, our findings highlight the concerns raised by [
50] that ML models trained on historical data may perform unsatisfactorily when applied to newer data, due to shifting atmospheric and oceanic conditions. Future studies may mitigate these limitations by incorporating additional and more recent datasets, or by adopting physics-informed machine learning frameworks that embed governing wave dynamics into the predictive process.