Seasonal Prediction of the Bohai Sea Ice Grade: A Multi-Model Intercomparison

Guo, Donglin; Zhang, Xinyou; Chen, Xue; Gao, Song; Zhao, Yiding; Li, Ge; Hou, Qiaokun

doi:10.3390/w18101242

Open AccessArticle

Seasonal Prediction of the Bohai Sea Ice Grade: A Multi-Model Intercomparison

by

Donglin Guo

,

Xinyou Zhang

,

Xue Chen

,

Song Gao

,

Yiding Zhao

^*,

Ge Li

and

Qiaokun Hou

North China Sea Marine Forecasting and Hazard Mitigation Center, Ministry of Natural Resources, Qingdao 266061, China

^*

Author to whom correspondence should be addressed.

Water 2026, 18(10), 1242; https://doi.org/10.3390/w18101242

Submission received: 14 April 2026 / Revised: 10 May 2026 / Accepted: 14 May 2026 / Published: 21 May 2026

(This article belongs to the Special Issue Advances and Challenges in the Lake, River, and Sea Ice Sciences and Engineering)

Download

Browse Figures

Versions Notes

Abstract

Even under a warming climate, winter sea ice in the Bohai Sea continues to threaten ships and offshore/coastal infrastructure. Reliable pre-season prediction of the overall wintertime sea ice condition in the Bohai Sea, as represented by the Bohai Sea Ice Grade (BSIG), is therefore important for disaster preparedness and mitigation. Based on the 1979–2024 BSIG record, this study compares seven statistical and AI-based seasonal prediction methods: analog year analysis, multiple linear regression, stepwise regression, Principal Component Regression, a cross-correlation-based regression model, support vector regression, and the Bayesian Ensemble Bohai Ice Grade Net (BE-BIGNet). As potential precursors, we considered sea ice extent in 14 Arctic regions together with 114 large-scale atmospheric and oceanic circulation indices. The results suggest substantial differences in predictive skill among the methods. Among the tested approaches, BE-BIGNet, which combines Bayesian regularization with bootstrap median ensembling, achieves strong full-period performance and stable skill during the independent test period, suggesting that it may provide a useful framework for operational BSIG forecasting in the Bohai Sea.

Keywords:

Bohai Sea; sea ice; seasonal prediction; disaster mitigation

1. Introduction

The Bohai Sea (37° N–41° N, 117° E–122° E), a shallow semi-enclosed marginal sea located in northeastern China along the northwestern Pacific coast (Figure 1a), represents the southernmost seasonally frozen coastal sea in the Northern Hemisphere. With an average depth of approximately 18 m, its unique geographical configuration—characterized by restricted water exchange with the open ocean and significant freshwater discharge from surrounding rivers—results in a relatively low heat capacity that makes it highly susceptible to rapid winter cooling. Sea ice in this region typically begins to form in late November and persists through March, characterized by a distinct spatial asymmetry where the northern part, particularly Liaodong Bay, acts as the primary and most ice-prone formation center. Climatologically, the maximum sea ice extent in the Bohai Sea averages approximately 20,000–30,000 km², typically covering 30–40% of the total sea area during normal winters. The thickness of level ice generally ranges from 10 to 30 cm, while in severe cases, the thickness of deformed ice in nearshore regions of Liaodong Bay can exceed 50 cm. As a critical hub for maritime transport and a strategic base for China’s offshore oil and gas industry, severe ice conditions can result in shipping blockages and structural damage to vessels, while also triggering intense ice-induced vibrations on offshore platforms such as jacket structures, thereby threatening structural integrity and operational safety [1]. The overall sea ice condition of the Bohai Sea in each winter is operationally assessed by national ocean forecasting institutes in China using a sea ice grade index, which ranges from 1 to 5 at 0.5 intervals and is calculated based on floating ice extent [2]. Forecasts of the sea ice grade are routinely released about one month before the ice season, serving as a key reference for sea ice disaster preparedness and mitigation. Thus, the establishment of a robust prediction system for winter ice grades is of great importance for both disaster mitigation and maritime logistical scheduling.

Multi-decadal analyses indicate that, although the Bohai Sea ice area has generally declined since the late 20th century, its long-term evolution is dominated by strong interannual and decadal variability rather than a simple monotonic decrease. Much of this interannual variability can be attributed to large-scale climate modes, particularly the Arctic Oscillation (AO) and the North Atlantic Oscillation (NAO), which regulate regional atmospheric circulation patterns, near-surface air temperature, and wintertime surface heat loss over the Bohai Sea [3]. At the planetary scale, ongoing global warming has led to a significant contraction of Arctic sea ice extent and substantial adjustments in atmospheric circulation, contributing to more frequent extreme mid-latitude weather events [4]. Consequently, damaging sea ice events can still affect the region during periods of extreme cold anomalies, despite the overall warming trend. The complex interactions between large-scale climate variability and regional atmospheric forcing introduce considerable uncertainty into Bohai Sea ice evolution, rendering accurate sea ice prediction a formidable challenge.

Early statistical predictions of sea ice conditions predominantly relied on linear regression frameworks that related ice condition to antecedent meteorological forcing, such as air temperature and surface pressure fields [5,6,7]. While these methods established the baseline for regional forecasting, they often suffer from multicollinearity among atmospheric predictors, which can obscure physical mechanisms and degrade forecast stability. To address this, more sophisticated statistical methods were introduced. Stepwise regression has been adopted to optimize predictor selection and reduce overfitting [8,9], while Principal Component Regression (PCR)—a standard tool in broader climate prediction—offers a robust way to extract dominant modes of variability while avoiding collinearity issues [10]. Parallel to these regression efforts, the analog year analysis leverages historical similarity in circulation fields to infer future states, providing an intuitive physical basis for prediction [11].

Despite their wide usage, these statistical methods rely on linear assumptions or historical stationarity, which may limit their ability to represent the nonlinear processes of a changing climate system. Recently, machine learning approaches have been increasingly applied to sea ice prediction. For example, support vector regression (SVR) and neural networks can capture nonlinear relationships between environmental drivers and sea ice variability. They have been applied to seasonal and interannual forecasting, frequently yielding good predictive skill compared to conventional baselines [12,13]. However, the application of complex machine learning methods in Bohai Sea Ice Grade (BSIG) prediction is constrained by the “small sample” problem—reliable satellite and observational records span only a few decades (since 1979). Standard machine learning models often lack the probabilistic reasoning required to quantify uncertainty in such data-limited contexts.

Although both statistical and AI-based approaches have been successfully applied in sea ice prediction, their relative performance has not been systematically assessed in the Bohai Sea. To address this gap, we compare seven prediction methods representing analog, statistical, and machine learning frameworks:

(1): Physical Analog Approaches, specifically the analog year analysis, which represents a physically motivated similarity approach.
(2): Linear statistical approaches, including multiple linear regression, stepwise regression, as well as Principal Component Regression (PCR). This category also includes a linear fitting model based on key physical precursors identified through cross-correlation analysis of Arctic sea ice, atmospheric circulation, and oceanic drivers.
(3): AI models, specifically support vector regression (SVR) for kernel-based learning and a novel Bayesian ensemble net (BE-BIGNet). The latter combines a neural network with Bayesian regularization to improve generalization under the limited sample size.

By evaluating these methods in both historical hindcasts and an independent test period, this study assesses their relative performance for BSIG prediction. The results may contribute to improving the accuracy of sea ice forecasts and supporting disaster mitigation.

2. Data and Methods

2.1. Data

The 1979–2024 Bohai Sea Ice Grade (BSIG) data adopted in this study were obtained from the historical records of the North China Sea Marine Forecasting and Hazard Mitigation Center of Ministry of Natural Resources. The BSIG is a comprehensive metric used to quantify the severity of ice conditions, classified into five levels: Grade 1 (Light), Grade 2 (Normal–Light), Grade 3 (Normal), Grade 4 (Normal–Heavy), and Grade 5 (Heavy). To ensure greater precision in operational forecasts and hindcasts, 0.5-grade intervals are adopted in practice. This classification standard is based on key parameters including the maximum sea ice extent and ice thickness, as described by Sui et al. (2022) [2].

The Arctic Sea Ice Concentration (SIC) dataset, published by the National Snow and Ice Data Center (NSIDC), is derived from remote sensing brightness temperature. The dataset was processed using the NASA Team (NT) algorithm developed by the NASA Goddard Space Flight Center’s Cryospheric Sciences Laboratory. The dataset is available online: https://nsidc.org/data/nsidc-0051/versions/2 (accessed on 5 December 2025). This dataset spans from 26 October 1978 to the present and includes daily and monthly averaged SIC values with a spatial resolution of 25 km × 25 km. The monthly averaged sea ice extent data used in this study was computed from the monthly SIC fields.

A total of 114 large-scale atmospheric and oceanic indices are adopted in this study. Monthly means time series were provided by the Beijing Climate Center (BCC). The dataset is available at https://cmdp.ncc-cma.net/Monitoring/cn_index_130.php (accessed on 12 November 2025). and a full list of indices used here is attached in Appendix A.

2.2. Prediction Models

(1) Analog Year Analysis

The analog year analysis is widely used for monitoring and predicting climate indices such as the MJO and BSISO [14]. It quantifies the similarity between historical circulation fields and a target-year pattern in the spatial domain, providing an objective basis for analog year selection. Let

X (r, t)

denote the anomaly field after removing the climatology and trend. The standardized field is

Z (r, t) = \frac{X (r, t)}{σ_{X} (r)} .

(1)

The projection index is calculated as

I (t) = \sum_{r} Z (r, t) M (r)

(2)

where

σ_{X} (r)

represents the temporal standard deviation of the anomaly field at spatial location

r

, and

M (r)

is the target projection pattern. The magnitude of

I (t)

represents the degree of spatial similarity between the historical field and the target pattern. Years satisfying

∣ I (t) ∣ \geq θ

are selected as analog years for subsequent ice severity prediction.

(2) Linear and Stepwise Regression

Multiple linear regression (MLR) is employed to quantify the linear relationship between the target variable and multiple predictor variables. This method assumes that the combined effect of the predictors on the target variable can be expressed as a linear superposition. To obtain a parsimonious and robust model, stepwise regression is also adopted to identify statistically significant predictors, eliminate redundant variables, and optimize the model structure. The MLR model is expressed as:

Y = β_{0} + \sum_{i = 1}^{p} β_{i} X_{i} + ε

(3)

where

Y

indicates the target variable,

X_{i}

(

i = 1, \dots, p

) are the predictors,

β_{0}

is the intercept,

β_{i} (i \geq 1)

are the regression coefficients associated with each predictor,

p

is the number of selected predictors, and

ε

represents the residual error.

(3) Principal Component Regression (PCR)

Principal Component Regression (PCR) first uses Principal Component Analysis (PCA) to transform the original variables into a set of orthogonal principal components. Then, the principal components with higher contribution rates are selected for regression modeling to enhance the robustness of the model [10]. The PCR model can be written as

P = X W,

(4)

Y = β_{0} + \sum_{i = 1}^{m} β_{i} P_{i} + ε,

(5)

where

X

is the

N \times K

data matrix of original predictor variables (

N

observations as rows and

K

predictors as columns),

W

is the

K \times m

PCA loading matrix,

P

is the resulting

N \times m

matrix of the retained principal components,

m

is the number of selected components,

β_{i}

are the regression coefficients, and

ε

is the residual error.

(4) Cross-Correlation-Based Linear Regression

To account for climate signals with longer seasonal memory, cross-correlation analysis is applied to identify potential precursors of BSIG. This method quantifies the linear relationship between the time series of the BSIG (Y) and potential factors (X) with a time lag

τ

[15]. The correlation coefficient R is calculated as:

R (τ) = \frac{\sum_{i = 1}^{N} (X_{i, t - τ} - \bar{X}) (Y_{i, t} - \bar{Y})}{\sqrt{\sum_{i = 1}^{N} (X_{i, t - τ} - \bar{X})^{2}} \sqrt{\sum_{i = 1}^{N} (Y_{i, t} - \bar{Y})^{2}}}

(6)

where

N

is the sample size and

τ

denotes the time lag in months. Climatic factors that exhibit statistically significant correlations with the ice grade at lead times of 1–5 seasons (significance with

p < 0.05

) are selected as candidate precursors for subsequent linear regression.

(5) Support Vector Regression (SVR)

SVR is a machine learning algorithm based on the principle of minimizing structural risk. It projects low-dimensional input data into a higher-dimensional feature space using nonlinear mapping. The algorithm then finds the best regression function within a set error tolerance, making it effective for small sample sizes and complex nonlinear relationships [16]:

f (x) = ω^{T} ϕ (x) + b

(7)

where

ϕ (x)

is the nonlinear mapping function that projects the input vector

x

into a high-dimensional feature space,

ω

is the weight vector, and

b

is the bias term.

(6) Bayesian Regularized Neural Network

Although large neural models handle complex problems, only 46 records of Bohai ice grade are available from 1979 to 2024. For this situation, simpler models often perform better; thus, this study uses a single hidden-layer fully connected model with 7 neurons. Furthermore, the bootstrap aggregating (bagging) method is adopted to create training datasets by sampling with replacement from the original dataset. Each bootstrap dataset is used to train an individual model, and the ensemble prediction is obtained by aggregating the outputs of all models, e.g., by averaging or taking the median of individual predictions. This approach reduces model variance and improves prediction stability, particularly in the presence of noise or small sample sizes [17]. The median-based ensemble prediction is defined as

\hat{y} = m e d i a n {f^{(1)} (x), f^{(2)} (x), \dots, f^{(B)} (x)}

(8)

where

f^{(b)} (x)

is the model trained on the

b

-th bootstrap dataset, and

B

is the total number of bootstrap models.

2.3. Standardization of the Predictions

To ensure a strictly fair and mathematically rigorous intercomparison among the diverse models evaluated in this study, it is essential to align their output resolutions. Because the observed Bohai Sea Ice Grade (BSIG) is inherently recorded at 0.5-pitch intervals (as introduced in Section 2.1), evaluating continuous floating-point predictions directly against these discrete observations introduces artificial error penalties for traditional regression models. Therefore, prior to plotting the comparative time series and calculating the final evaluation metrics (e.g., root mean square error, RMSE), the continuous forecast outputs from all statistical models were standardized to 0.5 intervals (i.e., rounded to the nearest whole or half point). This standardization ensures that the calculated RMSE values accurately reflect a leveled operational baseline, emphasizing the intrinsic predictive skill of each architecture under real-world forecasting conditions.

3. Results

The prediction of the upcoming winter’s BSIG in the Bohai Sea is usually made and released in October, making summer predictors a standard and operationally practical choice. In this study, analog year analysis, multiple linear regression (MLR), stepwise regression, and PCR serve as baseline statistical models, all utilizing the same summer predictor window. This ensures methodological consistency and allows for a fair comparison that highlights differences in model structure rather than discrepancies arising from predictor selection.

3.1. Analog Year Analysis

The analog year analysis, originally proposed by Wheeler and Hendon (2004) [14], has been widely adopted by operational centers (e.g., NOAA and the China National Climate Center) to monitor and predict climate phenomena such as the Madden–Julian Oscillation (MJO) and the Boreal Summer Intraseasonal Oscillation (BSISO). The method first removes the climatological seasonal cycle and long-term trend from multi-year datasets of the target field; the resulting anomaly fields are then standardized and projected onto a predefined spatial pattern associated with the phenomenon of interest. This projection yields a projection index (PI) that quantifies the spatial similarity between each historical field and the target pattern, thereby providing an objective basis for analog year selection.

First, the summer (June–August) 500 hPa geopotential height anomaly for 2024 is used as the target projection pattern (Figure 2a) to illustrate the analog year analysis workflow and to derive a forecast for the Bohai Sea Ice Grade (BSIG) in the winter of 2024. Specifically, multi-year atmospheric circulation fields are projected onto the 2024 target pattern to generate a time series of PI values. Years with |PI| ≥ 0.5 standard deviations are identified as analog years (blue dots in Figure 3 with similar circulation patterns), and their composite 500 hPa geopotential height anomaly field is shown in Figure 2b. The composite mean sea ice grade derived from the selected analog years serves as the prediction for the Bohai Sea winter sea ice grade in 2024 (red diamonds in Figure 3). To evaluate predictive performance, we predicted the BSIG over the recent five-year period 2020–2024 and the results are compared with the observed records. This comparison yields a root mean square error (RMSE) of 0.59 (Figure 4), indicating that this method captures the recent interannual variability in sea ice conditions with relatively small overall error.

3.2. Multiple Linear Regression

To quantify the linear modulation of atmospheric circulation and oceanic forcing on the BSIG, we constructed a multiple linear regression (MLR) model. The model utilizes indices from the preceding summer (June–August) to predict the ice grade of the following winter. The dataset is partitioned into a training period (1979–2019) and an independent validation period (2020–2024). From the 114 indices listed in Appendix A, the ten predictors exhibiting the highest correlation coefficients with the BSIG were selected for the regression analysis. These key predictors include: the Cold-Tongue ENSO index (

X_{C T - E N S O}

), NINO 4 and NINO-W sea surface temperature (SST) anomalies (

X_{N I N O 4} & X_{N I N O - W}

), the Tibet Plateau Region 1 and 2 indices (

X_{T P 1}

&

X_{T P 2}

), the Eastern Pacific Subtropical High Ridge Position index (

X_{E P - S H}

), North American Subtropical High Area and Intensity indices (

X_{N A - S H A}

&

X_{N A - S H I}

), the ENSO Modoki Index (

X_{E M I}

), and the Eurasian Meridional Circulation (EMC). Prior to calculation, all indices were standardized. The resulting regression equation is expressed as:

Y = 2.81 - 1.06 \cdot X_{C T - E N S O} + 0.47 \cdot X_{N I N O 4} + 0.30 \cdot X_{N I N O - W} + 0.06 \cdot X_{T P 1} - 0.09 \cdot X_{T P 2} + 0.21 \cdot X_{E P - S H} + 0.07 \cdot X_{N A - S H A} - 0.21 \cdot X_{N A - S H I} + 0.23 \cdot X_{E M I} - 0.27 \cdot X_{E M C}

(9)

where

Y

represents the predicted ice grade. The predicted BSIG is shown by the red line in Figure 5. The root mean square error (RMSE) during the training period is 0.61, while for the validation period it is 0.74. The results suggest that interannual variability in the Bohai Sea may primarily be related to ENSO and its spatial SST anomalies. These anomalies modulate the intensity of the East Asian Winter Monsoon and local air–sea heat fluxes via teleconnections involving the Subtropical High, the Walker–Hadley circulation, and the Eurasian wave train. However, the significant increase in RMSE from the training to the validation period indicates that this standard MLR model is subject to potential overfitting.

3.3. Stepwise Regression

To address the potential overfitting and multicollinearity issues observed in the above MLR model, we employed stepwise regression using the Akaike Information Criterion (AIC) to optimize predictor selection. Using the standardized summer (June–August) indices from the training period (1979–2019) as the candidate pool, the algorithm automatically screened the variables to retain only those significantly correlated to BSIG, thereby refining the input set to six key precursors. The optimized prediction equation for the BSIG is expressed as:

Y = 2.81 - 1.25 \cdot X_{C T - E N S O} + 0.88 \cdot X_{N I N O 4} + 0.18 \cdot X_{E P - S H} + 0.44 \cdot X_{E M I} - 0.29 \cdot X_{E M C} + 0.41 \cdot X_{W P - T W}

(10)

where

Y

represents the predicted ice grade and

X_{W P - T W}

indicates the West Pacific 850 hPa Trade Wind index. The predicted results are shown against the observations in Figure 5.

Both regression approaches confirm that ENSO (specifically its spatial morphology), the Subtropical High, and the Eurasian Meridional Circulation are the dominant controls on Bohai Sea ice evolution. While the MLR provides a broad linear projection including secondary factors, the stepwise approach effectively isolates the core drivers.

Over the full study period (1979–2024), the stepwise regression yielded a slightly higher RMSE (0.64) compared to the standard MLR (0.62). However, the stepwise model reached an RMSE of 0.39 during the independent validation period (2020–2024), significantly outperforming the MLR (RMSE = 0.74). This sharp contrast between the training and testing performances suggests that by reducing model complexity, stepwise regression mitigates the overfitting inherent in the standard MLR. Consequently, it minimizes noise amplification and offers better extrapolation stability under the rapidly shifting background state of the recent climate.

3.4. Principal Component Regression

To further mitigate multicollinearity among the numerous atmospheric and oceanic indices, we employed Principal Component Regression (PCR), a hybrid approach that integrates the dimensionality reduction advantages of Principal Component Analysis (PCA) with the adaptive variable selection of stepwise regression.

The model was constructed using standardized summer (June–August) indices in Appendix A. First, we screened the dataset to retain the 40 predictors exhibiting the highest correlation with the BSIG (listed in Appendix B). PCA was then performed on these key factors to extract orthogonal modes of variability. The variance explained by each PC is shown in Figure 6. The leading principal components (PCs) that accounted for 90% of the cumulative variance were retained as candidate variables. A stepwise regression algorithm was then applied—starting from an intercept-only baseline—to iteratively select the optimal subset of PCs that minimized the Akaike Information Criterion (AIC). The optimized prediction equation, trained on the 1979–2019 period, is expressed as:

Y = 2.82 - 0.05 \cdot P C_{1} - 0.07 \cdot P C_{2} + 0.14 \cdot P C_{3} + 0.11 \cdot P C_{4} - 0.10 \cdot P C_{5} - 0.18 \cdot P C_{7}

(11)

and the predicted BSIG is compared with the observations in Figure 7.

The PCR model demonstrated robust performance, achieving an RMSE of 0.65 for the full historical period (1979–2024) and a notably low RMSE of 0.32 during the independent validation period (2020–2024). By extracting orthogonal modes, PCR effectively filters noise and redundancy, allowing for a stable estimation of general climate forcing. However, while the model captures the mean state and baseline trends effectively, it exhibits reduced skill in reproducing extreme anomalies (e.g., severe heavy or light ice years). This limitation is characteristic of linear variance maximization techniques, which tend to prioritize dominant statistical signals at the expense of rare, high-amplitude events.

3.5. Cross-Correlation-Based Regression Model

Since summer-only predictors may not fully capture the longer memory of the climate system, we extended the analysis to earlier seasons. Specifically, we examined lead–lag relationships between BSIG and large-scale circulation indices as well as Arctic sea ice extent, with the aim of identifying precursor signals that persist across seasons and high-latitude drivers that may precondition winter ice severity in the Bohai Sea.

3.5.1. Circulation and Oceanic Factors

To investigate the physical mechanisms driving interannual variability, we analyzed cross-correlations between the detrended BSIG and 114 large-scale climate indices over the five preceding seasons. For robustness, we retained indices that were significant at the 95% confidence level in at least one lag season. The results (Figure 8) suggest that Bohai ice variability is associated with both tropical oceanic memory and atmospheric teleconnections.

One potential source of the predictability of BSIG arises from the persistence of tropical Pacific oceanic anomalies. The slow upper-ocean heat-content evolution and equatorial wave adjustment associated with ENSO provide an interseasonal memory that can precondition subsequent SST anomalies several seasons in advance [18,19,20]. The west Pacific trade winds during the previous winter may also contribute to the predictability of BSIG. Trade-wind and wind-stress-curl anomalies over the western/subtropical Pacific can induce Rossby wave propagation and subtropical cell adjustment, altering subsurface heat content in the western-to-central Pacific and favoring the persistence or later re-emergence of central-equatorial SST anomalies [21,22]. These anomalies may then influence East Asia through the Pacific–East Asian teleconnection, by modulating the western North Pacific anticyclonic/cyclonic circulation and thereby weakening or strengthening the East Asian Winter Monsoon [23,24,25]. For the Bohai Sea, such remote atmospheric forcing is likely converted into sea ice variability through changes in cold air outbreak frequency, surface heat loss, and wind-driven ice transport [26,27].

3.5.2. Arctic Sea Ice

We further investigated the teleconnection between Arctic sea ice extent and Bohai ice grades. Correlations were calculated for sea ice extent/area across 14 key Arctic sectors (Figure 9) with lead times of 1–12 months (previous December to current November), with all time series detrended to remove long-term trends. The highest correlation coefficients were identified in the following two regions: the Barents Sea ice extent (in the Atlantic Sector) exhibits a strong negative correlation (−0.52) at a 1-month lead time (November); the Chukchi and Bering Seas (in the Pacific Sector) show significant negative correlations (−0.48 and −0.39, respectively) at a 3-month lead time (September).

These correlations (Figure 10) are consistent with a possible thermodynamic forcing pathway: a reduction in sea ice in these marginal seas during the current autumn (September to November) releases anomalous heat fluxes to the atmosphere via air–sea interaction. This surface heating excites stationary Rossby wave trains [28,29], which enhance mid-to-high-latitude blocking patterns over Eurasia—specifically the Ural Blocking High—and consolidate the Siberian High [30,31]. The resulting circulation anomalies intensify the East Asian Winter Monsoon, increase the frequency of cold air intrusions affecting the Bohai Sea, and intensify Bohai ice conditions. The observed 1–3-month lag is consistent with the timescale required for the establishment and propagation of this Atmospheric Bridge mechanism [4].

3.5.3. Predictor Selection and Sensitivity Experiments

To determine the optimal predictor set, we first established a baseline forcing field comprising the autumn Western Pacific Subtropical High Ridge Point and the Current Summer Cold-Tongue ENSO index. Building on this core configuration, we conducted 18 sensitivity experiments (Table 1) combining atmospheric/oceanic indices with sea ice metrics from different Arctic marginal seas. The regression coefficients were calculated on the training period (1979–2019), and performance was evaluated using RMSE for both the full historical period (1979–2024) and the 5-year independent validation period (2020–2024).

Similar RMSE values are found in Exp-8 and Exp-14, both of which achieve the lowest RMSEs during the training and validation periods. However, Exp-8 was selected as the optimal model because it employs fewer predictors than Exp-14, indicating that the additional variable (WP850_TW) included in Exp-14 does not provide meaningful improvement in predictive skill. Exp-8 achieves an independent validation RMSE of 0.354 by integrating high-latitude Arctic sea ice signals (Barents and Chukchi Seas) with mid-to-low-latitude atmospheric–oceanic predictors (WP_SH_WRP, CT_ENSO, and NINO4).

A comparative analysis with experiments incorporating Bering Sea ice (Exp-1, Exp-7, and Exp-13) reveals a clear overfitting problem. Although these experiments show training-period RMSEs comparable to that of Exp-8, their RMSEs are substantially higher than Exp-8 during the independent 5-year validation period. This result suggests that the inclusion of Bering Sea ice information weakens the model’s generalization capability.

Notably, experiments excluding key predictors, such as Barents Sea ice or the equatorial Pacific index (CT_ENSO), exhibited marked degradation in predictive performance. This finding demonstrates that a spatially comprehensive predictor set, spanning both high-latitude Arctic and tropical climate signals, is essential for maintaining the predictive stability of the Bohai Sea Ice Grade (BSIG). Therefore, to ensure robustness across the full 46-year record while effectively minimizing overfitting risk, Exp-8 was selected as the final baseline predictor configuration. A comparison between the predictions from Exp-8 and the observed BSIG is shown in Figure 11. This optimal predictor combination provides a balanced representation of both long-term climatological variability (RMSE = 0.557) and recent independent prediction skill (RMSE = 0.354), thereby establishing a rigorous and reliable benchmark for seasonal BSIG prediction.

3.6. Support Vector Regression

While Principal Component Regression (PCR) effectively mitigates multicollinearity through dimensionality reduction, it remains fundamentally constrained by linear assumptions, limiting its ability to resolve the nonlinear feedback mechanisms characteristic of the climate system. To address these issues and manage the “small sample” constraint, we employed support vector regression (SVR)—a kernel-based machine learning algorithm designed to capture nonlinear relationships while prioritizing robust generalization over fitting to noise.

Based on the preceding correlation analysis and physical mechanisms, we constructed a feature vector comprising five key atmospheric and oceanic precursors, plus a sea ice persistence term. The selected predictors include: (1) tropical forcing via the Current Summer Cold-Tongue ENSO index and the Previous Autumn NINO4 index; (2) atmospheric coupling via the Current Autumn Western Pacific Subtropical High Western Ridge Point; and (3) Arctic forcing via the sea ice extent of the Barents Sea in November and Chukchi Sea in September. Together, these predictors represent tropical forcing, intermediate atmospheric circulation, and autumn Arctic sea ice anomalies. Additionally, to capture the interannual persistence of the cryosphere, the BSIG from the previous winter was included as an autoregressive feature. Prior to training, all input features were standardized to eliminate dimensional discrepancies [32]. Model performance was assessed using a rolling-origin evaluation (time series cross-validation) scheme: for each iteration, the model was trained on all historical data prior to the target year to predict the subsequent winter. This approach strictly prevents data leakage and provides a robust evaluation of the model’s generalization ability through the cumulative assessment of multiple prediction errors [33].

The prediction results of the SVR model are illustrated in Figure 12. During the training phase (1979–2019), the model achieved an RMSE of 0.55. In the independent validation phase (2020–2024), the SVR demonstrated comparable predictive accuracy, yielding an RMSE of 0.53. The absence of significant error amplification during the testing period indicates that the model effectively avoids the overfitting and numerical divergence issues commonly associated with small-sample nonlinear regression, thereby demonstrating satisfactory robustness.

Despite SVR showing acceptable performance in overall error control and predictive stability, deviations persist in forecasting extreme anomalies—specifically the heavy ice years of 2000 and 2009, and the light ice year of 1994—where predictions tend to gravitate toward the multi-year climatological mean. This behavior is intrinsically linked to the SVR’s use of an

ϵ

-insensitive loss function and smooth kernel settings, which are designed to suppress the impact of observational noise and random perturbations. In the climate system, extreme events are typically triggered by the superposition of multi-scale anomalous processes, featuring mechanisms and statistical properties that differ markedly from conventional interannual variability. Consequently, while SVR is effective at characterizing the predictable interannual background evolution of Bohai ice conditions, its capacity to reconstruct extreme intensities remains constrained. This limitation reflects a universal challenge for statistical methods in a non-stationary climate but does not undermine the model’s overall validity for operational forecasting.

3.7. Bayesian Ensemble Neural Network

In the prediction of variables driven by multiple factors using limited sample sizes, forecast stability and generalization performance are often highly sensitive to feature selection and model structure. To address these issues and leverage the complementary advantages of differing methodologies, this study proposes a hybrid prediction framework: the Bayesian Ensemble Bohai Ice Grade Net (BE-BIGNet). This framework combines nonlinear neural network fitting, Bayesian regularization, and ensemble averaging to improve prediction stability under a limited sample size.

Although the BSIG is operationally classified into discrete levels (1–5 at 0.5 intervals), sea ice growth and decay are physically continuous dynamic processes driven by the heat budget and factors such as cumulative Freezing Degree Days (FDDs) and ocean heat content [34]. Traditional classification algorithms, however, treat these grades as independent categories, failing to capture the continuous variation between adjacent levels. In the context of interannual prediction for the Bohai Sea, this structural discrepancy between the continuous physical process and the discrete classification system—compounded by the characteristics of a small sample size with high noise—creates significant uncertainty for direct classification modeling.

To resolve this, we employed a Bayesian Regularized Back-Propagation (BP) Neural Network and introduced a “continuous fitting–discrete quantization” modeling strategy. By establishing continuous nonlinear mapping between climate factors, Arctic sea ice, and ice grades, the model characterizes the physical continuity of the ice evolution process. For the historical training samples (1979–2019), the model incorporates a Bayesian probabilistic framework that adaptively adjusts weight penalty terms in the loss function via Maximum A Posteriori (MAP) estimation. This approach achieves more stable parameter estimation and prevents overfitting [35]. Combined with linear detrending preprocessing, this architecture constrains model complexity while focusing on capturing interannual physical fluctuations (after removing the background warming trend), effectively mitigating the risk of overfitting inherent in small-sample training.

To further mitigate prediction uncertainty arising from random weight initialization and data sampling variability, this study employs a bootstrap median ensemble strategy. Specifically, we utilized bootstrap aggregation (Bagging) to generate 100 training subsets via random resampling with replacement, with each subset training an independent Bayesian regularized sub-model [36]. For the final output, the median of the predictions from all 100 sub-models was adopted as the ensemble forecast, rather than the arithmetic mean. Statistical studies indicate that, compared to mean aggregation, a median ensemble possesses a higher breakdown point and handles non-normal prediction errors more robustly [37]. This strategy effectively serves as a nonlinear filter, removing outlier predictions caused by individual sub-models falling into local minima or overfitting to noise, thereby ensuring the physical consistency and reliability of the system output.

The BE-BIGNet model shows stable performance and relatively low prediction error (Figure 13). Adopting a three-layer feedforward neural network architecture (Input–Hidden–Output) with an optimal four neurons in the hidden layer, the model yielded a training RMSE of 0.44 over the historical period (1979–2019) and a testing RMSE of 0.38 for the independent period (2020–2024), indicating a strong fit to observations. Notably, the model successfully reproduced the heavy ice years of 1979 and 1984 and exhibited enhanced sensitivity to extreme anomalies compared to SVR, although larger deviations persisted for the extreme events of 2000 and 2009. Overall, this framework performs well for the Bohai Sea case and may be useful for operational BSIG prediction.

4. Discussions and Conclusions

This study conducted an intercomparison of seven prediction methodologies for BSIG, ranging from static analogs to dynamic machine learning approaches. By systematically evaluating the analog year analysis (AYA), multiple linear regression (MLR), stepwise regression, Principal Component Regression (PCR), cross-correlation-based regression model, support vector regression (SVR), and the novel Bayesian Ensemble Bohai Ice Grade Net (BE-BIGNet), we established a hierarchy of model capability where each successive methodological advancement addresses the specific limitations of preceding approaches. The RMSEs of those prediction methods are compared in Figure 14. The advantages and limitations of those methods are summarized in Table 2.

The substantial increase in cross-model RMSE differences between training and testing periods suggests that traditional statistical methods are limited by overfitting and may lack predictive skill for BSIG within a nonstationary climate system. The analog year analysis serves as a useful reference but relies on “stationarity” assumptions that are becoming invalid in the context of global warming. Similarly, while regression analyses establish quantitative baselines, they often struggle with multicollinearity among atmospheric predictors. Although PCR resolves the multicollinearity issue through orthogonal decomposition, it remains bound by linear assumptions, restricting its ability to resolve the complex nonlinear interactions characteristic of air–sea coupling.

These limitations were reflected by the performance evaluation. During the independent testing period (2020–2024), the RMSE of the analog year analysis was 0.59, and the standard multiple linear regression (MLR) model degraded to 0.74, reflecting a significant inability to adapt to the non-stationary shifts in the recent climate background. While optimized linear models such as stepwise regression and PCR achieved superficially low RMSEs (0.39 and 0.32, respectively) in the testing period, their historical training performances were notably poorer (RMSEs of 0.66 and 0.68); this lack of consistency across different periods renders their long-term skill unstable and less reliable for operational baselines. Furthermore, while the machine learning approach SVR avoided severe overfitting and maintained relative stability (testing RMSE of 0.46), it failed to outperform the optimized linear models during this specific period, largely because its loss function structure tends to be overly conservative regarding extreme anomalies. Among the evaluated methods, BE-BIGNet demonstrated good stability and accuracy across both the training and independent testing periods. Over the full 1979–2024 period, it achieved an RMSE of 0.44, outperforming all other baseline models evaluated in this study. By integrating the nonlinear fitting capabilities of a neural network with Bayesian regularization and ensemble averaging, this architecture is particularly well-suited to the small-sample constraints inherent in BSIG prediction. Ultimately, these results indicate that BE-BIGNet provides a highly reliable framework for operational BSIG forecasting and regional disaster mitigation.

While the BE-BIGNet framework exhibits robust predictive skill for the overall winter BSIG, it is important to acknowledge that sea ice severity exhibits significant spatial variability across different regions of the Bohai Sea. This study focuses exclusively on a macroscopic, basin-wide index because our primary objective is to support strategic, large-scale disaster preparedness. Fundamentally, the BSIG is dictated by the most severe ice event recorded anywhere within the basin. Consequently, predicting this single comprehensive grade aligns directly with the operational mandate of preparing for worst-case scenarios.

Author Contributions

Conceptualization, D.G. and X.Z.; methodology, Y.Z.; software, X.C.; validation, S.G., G.L. and Q.H.; formal analysis, D.G.; investigation, X.Z.; resources, Y.Z.; data curation, X.C.; writing—original draft preparation, D.G.; writing—review and editing, Y.Z.; visualization, S.G.; supervision, G.L.; project administration, Q.H.; funding acquisition, D.G. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by National Natural Science Foundation of China grant number 42206222.

Data Availability Statement

The Bohai Sea Ice Grade data are derived from the China Marine Disaster Bulletin and are publicly available at https://www.mnr.gov.cn/sj/sjfw/hy/gbgg/zghyzhgb/ (accessed on 30 October 2025). The Arctic sea ice data are obtained from the National Snow and Ice Data Center (NSIDC) and are available at https://noaadata.apps.nsidc.org/NOAA/G02135/north/monthly/data/ (accessed on 5 December 2025). The 114 atmospheric and oceanic circulation indices are provided by the National Climate Center of China and can be accessed at https://cmdp.ncc-cma.net/Monitoring/cn_index_130.php (accessed on 12 November 2025).

Acknowledgments

The authors have reviewed and edited the output and take full responsibility for the content of this publication.

Conflicts of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Appendix A

Table A1. List of atmospheric circulation and oceanic indices used in this study.

	Full Name	Short Name
1	Northern Hemisphere Subtropical High Area	NH_SH_A
2	North African Subtropical High Area	NAf_SH_A
3	North African–North Atlantic–North American Subtropical High Area	NAf_NAtl_NAm_SH_A
4	Indian Subtropical High Area	Ind_SH_A
5	Western Pacific Subtropical High Area	WP_SH_A
6	Eastern Pacific Subtropical High Area	EP_SH_A
7	North American Subtropical High Area	NAm_SH_A
8	Atlantic Subtropical High Area	Atl_SH_A
9	South China Sea Subtropical High Area	SCS_SH_A
10	North American–Atlantic Subtropical High Area	NAm_Atl_SH_A
11	Pacific Subtropical High Area	Pac_SH_A
12	Northern Hemisphere Subtropical High Intensity	NH_SH_I
13	North African Subtropical High Intensity	NAf_SH_I
14	North African–North Atlantic–North American Subtropical High Intensity	NAf_NAtl_NAm_SH_I
15	Indian Subtropical High Intensity	Ind_SH_I
16	Western Pacific Subtropical High Intensity	WP_SH_I
17	Eastern Pacific Subtropical High Intensity	EP_SH_I
18	North American Subtropical High Intensity	NAm_SH_I
19	North Atlantic Subtropical High Intensity	NAtl_SH_I
20	South China Sea Subtropical High Intensity	SCS_SH_I
21	North American–North Atlantic Subtropical High Intensity	NAm_NAtl_SH_I
22	Pacific Subtropical High Intensity	Pac_SH_I
23	Northern Hemisphere Subtropical High Ridge Position	NH_SH_RP
24	North African Subtropical High Ridge Position	NAf_SH_RP
25	North African–North Atlantic–North American Subtropical High Ridge Position	NAf_NAtl_NAm_SH_RP
26	Indian Subtropical High Ridge Position	Ind_SH_RP
27	Western Pacific Subtropical High Ridge Position	WP_SH_RP
28	Eastern Pacific Subtropical High Ridge Position	EP_SH_RP
29	North American Subtropical High Ridge Position	NAm_SH_RP
30	Atlantic Subtropical High Ridge Position	Atl_SH_RP
31	South China Sea Subtropical High Ridge Position	SCS_SH_RP
32	North American–North Atlantic Subtropical High Ridge Position	NAm_NAtl_SH_RP
33	Pacific Subtropical High Ridge Position	Pac_SH_RP
34	Northern Hemisphere Subtropical High Northern Boundary Position	NH_SH_NB
35	North African Subtropical High Northern Boundary Position	NAf_SH_NB
36	North African–North Atlantic–North American Subtropical High Northern Boundary Position	NAf_NAtl_NAm_SH_NB
37	Indian Subtropical High Northern Boundary Position	Ind_SH_NB
38	Western Pacific Subtropical High Northern Boundary Position	WP_SH_NB
39	Eastern Pacific Subtropical High Northern Boundary Position	EP_SH_NB
40	North American Subtropical High Northern Boundary Position	NAm_SH_NB
41	Atlantic Subtropical High Northern Boundary Position	Atl_SH_NB
42	South China Sea Subtropical High Northern Boundary Position	SCS_SH_NB
43	North American–Atlantic Subtropical High Northern Boundary Position	NAm_Atl_SH_NB
44	Pacific Subtropical High Northern Boundary Position	Pac_SH_NB
45	Western Pacific Subtropical High Western Ridge Point	WP_SH_WRP
46	Asia Polar Vortex Area	Asia_PV_A
47	Pacific Polar Vortex Area	Pac_PV_A
48	North American Polar Vortex Area	NAm_PV_A
49	Atlantic–European Polar Vortex Area	AtlEu_PV_A
50	Northern Hemisphere Polar Vortex Area	NH_PV_A
51	Asia Polar Vortex Intensity	Asia_PV_I
52	Pacific Polar Vortex Intensity	Pac_PV_I
53	North American Polar Vortex Intensity	NAm_PV_I
54	Atlantic–European Polar Vortex Intensity	AtlEu_PV_I
55	Northern Hemisphere Polar Vortex Intensity	NH_PV_I
56	Northern Hemisphere Polar Vortex Central Longitude	NH_PV_Lon
57	Northern Hemisphere Polar Vortex Central Latitude	NH_PV_Lat
58	Northern Hemisphere Polar Vortex Central Intensity	NH_PV_CI
59	Eurasian Zonal Circulation	Eur_ZC
60	Eurasian Meridional Circulation	Eur_MC
61	Asian Zonal Circulation	Asia_ZC
62	Asian Meridional Circulation	Asia_MC
63	East Asian Trough Position	EAT_RP
64	East Asian Trough Intensity	EAT_I
65	Tibet Plateau Region 1	TP_R1
66	Tibet Plateau Region 2	TP_R2
67	India–Burma Trough Intensity	IBT_I
68	Arctic Oscillation	AO
69	Antarctic Oscillation	AAO
70	North Atlantic Oscillation	NAO
71	Pacific/North American Pattern	PNA
72	East Atlantic Pattern	EA
73	West Pacific Pattern	Western Pacific
74	North Pacific Pattern	NP
75	East Atlantic–West Russia Pattern	EA_WR
76	Tropical–Northern Hemisphere Pattern	TNH
77	Polar–Eurasia Pattern	POL
78	Scandinavia Pattern	SCA
79	Pacific Transition Pattern	PT
80	30 hPa Zonal Wind	ZW30
81	50 hPa Zonal Wind	ZW50
82	Mid-Eastern Pacific 200 mb Zonal Wind	MEP200_ZW
83	West Pacific 850 mb Trade Wind	WP850_TW
84	Central Pacific 850 mb Trade Wind	CP850_TW
85	East Pacific 850 mb Trade Wind	EP850_TW
86	Atlantic–European Circulation W Pattern	AtlEu_W
87	Atlantic–European Circulation C Pattern	AtlEu_C
88	Atlantic–European Circulation E Pattern	AtlEu_E
89	NINO 1 + 2 SSTA	NINO1 + 2
90	NINO 3 SSTA	NINO3
91	NINO 4 SSTA	NINO4
92	NINO 3.4 SSTA	NINO3.4
93	NINO W SSTA	NINO_W
94	NINO C SSTA	NINO_C
95	NINO A SSTA	NINO_A
96	NINO B SSTA	NINO_B
97	NINO Z SSTA	NINO_Z
98	Tropical Northern Atlantic SST	TNA_SST
99	Tropical Southern Atlantic SST	TSA_SST
100	Western Hemisphere Warm Pool	WH_WP
101	Indian Ocean Warm Pool Area	IO_WP_A
102	Indian Ocean Warm Pool Strength	IO_WP_I
103	Western Pacific Warm Pool Area	WP_WP_A
104	Western Pacific Warm Pool Strength	WP_WP_I
105	Atlantic Multi-Decadal Oscillation	AMO
106	Oyashio Current SST	Oya_SST
107	West Wind Drift Current SST	WWD_SST
108	Kuroshio Current SST	Kur_SST
109	ENSO Modoki	ENSO_M
110	Warm-Pool ENSO	WP_ENSO
111	Cold-Tongue ENSO	CT_ENSO
112	Indian Ocean Basin-Wide	IOBW
113	Tropic Indian Ocean Dipole	TIOD
114	South Indian Ocean Dipole	SIOD

Appendix B

Table A2. Circulation and oceanic factors utilized in Principal Component Analysis (PCA). These factors were selected from 40 candidates that were retained based on their correlation with sea ice grade.

Rank	Name
1	Cold-Tongue ENSO Index
2	NINO W SSTA Index
3	North American Subtropical High Area Index
4	Tibet Plateau Region 1 Index
5	North American Subtropical High Intensity Index
6	Western Pacific Warm Pool Strength Index
7	Tibet Plateau Region 2 Index
8	NINO 4 SSTA Index
9	North American Subtropical High Northern Boundary Position Index
10	North American–Atlantic Subtropical High Northern Boundary Position Index
11	North American-Atlantic Subtropical High Area Index
12	East Atlantic–West Russia Pattern
13	North American–North Atlantic Subtropical High Intensity Index
14	South Indian Ocean Dipole Index
15	Eurasian Meridional Circulation Index
16	ENSO Modoki Index
17	Eastern Pacific Subtropical High Intensity Index
18	North African–North Atlantic–North American Subtropical High Northern Boundary Position Index
19	Eurasian Zonal Circulation Index
20	Eastern Pacific Subtropical High Area Index
21	North American Polar Vortex Area Index
22	Northern Hemisphere Subtropical High Intensity Index
23	Northern Hemisphere Subtropical High Area Index
24	North African–North Atlantic–North American Subtropical High Intensity Index
25	Atlantic Subtropical High Northern Boundary Position Index
26	Atlantic–European Polar Vortex Area Index
27	Pacific Subtropical High Area Index
28	North Atlantic Subtropical High Intensity Index
29	Pacific Subtropical High Intensity Index
30	East Atlantic Pattern
31	North African–North Atlantic–North American Subtropical High Area Index
32	West Pacific Pattern
33	North Pacific Pattern
34	Polar–Eurasia Pattern
35	Northern Hemisphere Polar Vortex Area Index
36	Atlantic Subtropical High Area Index
37	Northern Hemisphere Polar Vortex Central Intensity Index
38	Indian Ocean Warm Pool Strength Index
39	India-Burma Trough Intensity Index
40	Asian Zonal Circulation Index

Appendix C

Table A3. The sea ice extent for the Barents Sea (November) and the Chukchi and Bering Seas (September).

	Barents		Chukchi		Bering
Year	Month	Extent (km²)	Month	Extent (km²)	Month	Extent (km²)
1979	11	377,561.72	9	308,077.97	9	4215.978
1980	11	623,931.25	9	506,905.57	9	3898.599
1981	11	354,210.26	9	489,316.62	9	4060.381
1982	11	613,847.3	9	421,360.25	9	3373.778
1983	11	477,744.32	9	600,991.06	9	3216.116
1984	11	263,543.78	9	404,113.33	9	1567.284
1985	11	402,748.3	9	490,673.71	9	2130.472
1986	11	391,234.61	9	361,453.77	9	2210.436
1987	11	484,374.95	9	468,233.04	9	2291.433
1988	11	673,910.61	9	585,884.55	9	2372.224
1989	11	457,087.19	9	375,303.81	9	2712.125
1990	11	429,451.59	9	319,286.29	9	1948.287
1991	11	486,556.48	9	437,913.92	9	2733.598
1992	11	487,757.15	9	501,605.57	9	2532.877
1993	11	500,023.37	9	230,513.8	9	2853.114
1994	11	562,963.18	9	527,528.22	9	3699.229
1995	11	506,111.64	9	410,419.13	9	3477.757
1996	11	300,882.28	9	369,134.8	9	2813.529
1997	11	481,642.22	9	279,711.13	9	1710.225
1998	11	626,914.32	9	177,868.9	9	1288.197
1999	11	330,594.24	9	218,604.6	9	1447.892
2000	11	184,354.35	9	342,758.12	9	1005.904
2001	11	354,509.16	9	403,426.12	9	1086.855
2002	11	482,335.6	9	174,862.15	9	1005.474
2003	11	426,018.26	9	171,492.46	9	584.201
2004	11	353,862.65	9	140,685.65	9	403.342
2005	11	301,938.03	9	235,048.29	9	1949.663
2006	11	180,516.81	9	302,418.46	9	1488.86
2007	11	121,206.67	9	10,905.095	9	1709.774
2008	11	308,402.01	9	13,886.307	9	644.788
2009	11	932,22.729	9	68,777.373	9	563.888
2010	11	238,730.39	9	30,460.549	9	563.278
2011	11	147,293.64	9	43,348.593	9	1026.039
2012	11	56,450.46	9	3463.569	9	602.667
2013	11	844,22.264	9	211,466.61	9	382.756
2014	11	403,570.11	9	137,598.01	9	845.387
2015	11	64,105.81	9	115,851.51	9	442.74
2016	11	43,942.011	9	86,152.303	9	845.371
2017	11	113,114.08	9	44,679.091	9	482.039
2018	11	89,673.005	9	52,616.728	9	1045.789
2019	11	242,829.93	9	36,843.315	9	1707.377
2020	11	37,772.681	9	7025.352	9	944.181
2021	11	227,374.13	9	280,898.7	9	442.539
2022	11	98,802.296	9	41,333.839	9	1206.623
2023	11	202,680.71	9	31,880.308	9	703.175
2024	11	52,933.724	9	186,051.28	9	865.742

Appendix D

Table A4. Statistical relations between predicting variables and BSIG.

Variable		Correlation Coefficient	Regression Coefficient	Standard Deviation	p-Value
Arctic Region Sea Ice Extent	Barents	−0.5428	−3.52 × 10⁻⁶	1.23 × 105	0.0002
	Chukchi	−0.4672	−3.86 × 10⁻⁶	9.67 × 104	0.0021
	Bering	−0.3928	−4.62 × 10⁻⁴	679.1926	0.0111
Key Index	WP_SH_WRP (Aut0)	0.3481	0.0197	14.0780	0.0257
	CT_ENSO (Sum0)	−0.4701	−1.1342	0.3310	0.0019
	WP850_TW (Win-1)	0.4334	0.1774	1.9511	0.0046
	NINO4 (Aut-1)	−0.4291	−0.5075	0.6754	0.0051

References

Xu, N.; Liu, X.; Yuan, S.; Zhang, D.; Wang, Y.; Yue, Q.; Shi, W.; Chen, W. Sea Ice Risk Monitoring System on Offshore Structure Based on Emergent Marine Ecological Disaster Risk Prevention: A Case Study of Oil Platform in the Bohai Sea. Ocean Dev. Manag. 2018, 35, 89–92, (In Chinese with English abstract). [Google Scholar]
GB/T 42254—2022; Sea Ice Grade of Bohai Sea and Northern Yellow Sea. Standardization Administration of the People’s Republic of China: Beijing, China, 2022; (In Chinese with English abstract).
Yan, Y.; Uotila, P.; Huang, K.; Gu, W. Variability of Sea Ice Area in the Bohai Sea from 1958 to 2015. Sci. Total Environ. 2020, 709, 136164. [Google Scholar] [CrossRef]
Cohen, J.; Screen, J.A.; Furtado, J.C.; Barlow, M.; Whittleston, D.; Coumou, D.; Francis, J.; Dethloff, K.; Entekhabi, D.; Overland, J.; et al. Recent Arctic Amplification and Extreme Mid-Latitude Weather. Nat. Geosci. 2014, 7, 627–637. [Google Scholar] [CrossRef]
Zhang, N.; Wu, Y.; Zhang, Q. Forecasting the Evolution of the Sea Ice in the Liaodong Bay Using Meteorological Data. Cold Reg. Sci. Technol. 2016, 125, 21–30. [Google Scholar] [CrossRef]
Yao, L.; Su, J. Relationships Between Bohai Sea Ice and Siberian High and Possible Connections Between Bohai Sea Ice and North Atlantic Oscillation. Period. Ocean Univ. China 2018, 48, 1–12, (In Chinese with English abstract). [Google Scholar]
Li, C.; Liu, Q.; Huang, H. Statistic Relation between Sea Ice in the Bohai Sea and North Yellow Sea of China and the Subtropical High in the Pacific Ocean. Mar. Sci. Bull. 2009, 28, 43–47, (In Chinese with English abstract). [Google Scholar]
Tian, B.; Fan, K. A Novel Stepwise Regression Method for Predicting September Pan-Arctic Sea-Ice Extent: Comparison with Long Short-Term Memory Neural Networks. Atmos. Ocean. Sci. Lett. 2025, 100727. [Google Scholar] [CrossRef]
Zheng, D.; Zhang, S.; Zhou, Z.; Jia, N.; Wang, Z.; Chen, H. Application of stepwise regression analysis in Bohai sea ice grade forecasting. Mar. Forecast. 2015, 32, 57–61, (In Chinese with English abstract). [Google Scholar]
Jolliffe, I.T. Principal Component Analysis; Springer Science & Business Media: New York, NY, USA, 2002. [Google Scholar]
van den Dool, H. Empirical Methods in Short-Term Climate Prediction; Oxford University Press: Oxford, UK, 2007. [Google Scholar]
Asadi, N.; Lamontagne, P.; King, M.; Richard, M.; Scott, K.A. Probabilistic Spatiotemporal Seasonal Sea Ice Presence Forecasting Using Sequence-to-Sequence Learning and ERA5 Data in the Hudson Bay Region. Cryosphere 2022, 16, 3753–3773. [Google Scholar] [CrossRef]
Yang, Z.; Liu, J.; Song, M.; Hu, Y.; Yang, Q.; Fan, K.; Graversen, R.G.; Zhou, L. Extended Seasonal Prediction of Antarctic Sea Ice Concentration Using ANTSIC-UNet. Cryosphere 2025, 19, 6381–6402. [Google Scholar] [CrossRef]
Wheeler, M.C.; Hendon, H.H. An All-Season Real-Time Multivariate MJO Index: Development of an Index for Monitoring and Prediction. Mon. Weather. Rev. 2004, 132, 1917–1932. [Google Scholar] [CrossRef]
Wilks, D.S. Statistical Methods in the Atmospheric Sciences. In International Geophysics, 3rd ed.; Elsevier: Amsterdam, The Netherlands, 2011. [Google Scholar]
Smola, A.J.; Schölkopf, B. A Tutorial on Support Vector Regression. Stat. Comput. 2004, 14, 199–222. [Google Scholar] [CrossRef]
Breiman, L. Bagging Predictors. Mach. Learn. 1996, 24, 123–140. [Google Scholar] [CrossRef]
Ashok, K.; Behera, S.K.; Rao, S.A.; Weng, H.; Yamagata, T. El Niño Modoki and Its Possible Teleconnection. J. Geophys. Res. Ocean. 2007, 112, C11007. [Google Scholar] [CrossRef]
Yu, J.-Y.; Kao, H.-Y.; Lee, T. Subtropics-Related Interannual Sea Surface Temperature Variability in the Central Equatorial Pacific. J. Clim. 2010, 23, 2869–2884. [Google Scholar] [CrossRef]
Kug, J.-S.; Jin, F.-F.; An, S.-I. Two Types of El Niño Events: Cold Tongue El Niño and Warm Pool El Niño. J. Clim. 2009, 22, 1499–1515. [Google Scholar] [CrossRef]
Anderson, B.T.; Perez, R.C.; Karspeck, A. Triggering of El Niño Onset through Trade Wind–Induced Charging of the Equatorial Pacific. Geophys. Res. Lett. 2013, 40, 1212–1216. [Google Scholar] [CrossRef]
Xu, K.; Tam, C.-Y.; Liu, B.; Chen, S.; Yang, X.; He, Z.; Xie, Q.; Wang, W. Attenuation of Central Pacific El Niño Amplitude by North Pacific Sea Surface Temperature Anomalies. J. Clim. 2020, 33, 6673–6688. [Google Scholar] [CrossRef]
Kim, J.-W.; An, S.-I.; Jun, S.-Y.; Park, H.-J.; Yeh, S.-W. ENSO and East Asian Winter Monsoon Relationship Modulation Associated with the Anomalous Northwest Pacific Anticyclone. Clim. Dyn. 2017, 49, 1157–1179. [Google Scholar] [CrossRef]
Wang, B.; Wu, R.; Fu, X. Pacific–East Asian Teleconnection: How Does ENSO Affect East Asian Climate? J. Clim. 2000, 13, 1517–1536. [Google Scholar] [CrossRef]
Kim, J.-W.; Chang, T.-H.; Lee, C.-T.; Yu, J.-Y. On the Varying Responses of East Asian Winter Monsoon to Three Types of El Niño: Observations and Model Hindcasts. J. Clim. 2021, 34, 4089–4101. [Google Scholar] [CrossRef]
Yan, Y.; Shao, D.; Gu, W.; Liu, C.; Li, Q.; Chao, J.; Tao, J.; Xu, Y. Multidecadal Anomalies of Bohai Sea Ice Cover and Potential Climate Driving Factors during 1988–2015. Environ. Res. Lett. 2017, 12, 094014. [Google Scholar] [CrossRef]
Gui, S.; Cheng, J.; Yang, R.; He, Q.; Dong, Z.; Ma, J.; Chu, Q.; Hou, M. Interannual Variation of the Westward Ridge Point of the Western Pacific Subtropical High in Boreal Winter. Glob. Planet. Change 2024, 240, 104528. [Google Scholar] [CrossRef]
Honda, M.; Inoue, J.; Yamane, S. Influence of Low Arctic Sea-Ice Minima on Anomalously Cold Eurasian Winters. Geophys. Res. Lett. 2009, 36, L08707. [Google Scholar] [CrossRef]
Liu, J.; Curry, J.A.; Wang, H.; Song, M.; Horton, R.M. Impact of Declining Arctic Sea Ice on Winter Snowfall. Proc. Natl. Acad. Sci. USA 2012, 109, 4074–4079. [Google Scholar] [CrossRef]
Wu, B.; Su, J.; Zhang, R. Effects of Autumn-Winter Arctic Sea Ice on Winter Siberian High. Chin. Sci. Bull. 2011, 56, 3220–3228. [Google Scholar] [CrossRef]
Inoue, J.; Hori, M.E.; Takaya, K. The Role of Barents Sea Ice in the Wintertime Cyclone Track and Emergence of a Warm-Arctic Cold-Siberian Anomaly. J. Clim. 2012, 25, 2561–2568. [Google Scholar] [CrossRef]
Bansal, N.; Defo, M.; Lacasse, M.A. Application of Support Vector Regression to the Prediction of the Long-Term Impacts of Climate Change on the Moisture Performance of Wood Frame and Massive Timber Walls. Buildings 2021, 11, 188. [Google Scholar] [CrossRef]
Tashman, L.J. Out-of-Sample Tests of Forecasting Accuracy: An Analysis and Review. Int. J. Forecast. 2000, 16, 437–450. [Google Scholar] [CrossRef]
Su, H.; Wang, Y. Using MODIS Data to Estimate Sea Ice Thickness in the Bohai Sea (China) in the 2009–2010 Winter. J. Geophys. Res. 2012, 117, C10018. [Google Scholar] [CrossRef]
Burden, F.; Winkler, D. Bayesian Regularization of Neural Networks. In Artificial Neural Networks: Methods and Applications; Livingstone, D.J., Ed.; Methods in Molecular Biology; Humana Press: Totowa, NJ, USA, 2008; Volume 458, pp. 23–42. [Google Scholar]
Choi, M.; De Silva, L.W.A.; Yamaguchi, H. Artificial Neural Network for the Short-Term Prediction of Arctic Sea Ice Concentration. Remote Sens. 2019, 11, 1071. [Google Scholar] [CrossRef]
Kourentzes, N.; Barrow, D.K.; Crone, S.F. Neural Network Ensemble Operators for Time Series Forecasting. Expert Syst. Appl. 2014, 41, 4235–4244. [Google Scholar] [CrossRef]

Figure 1. (a) Topography of the Bohai Sea with key coastal cities marked by red dots; (b) spatial distribution of sea ice in the Bohai Sea observed by the Geostationary Ocean Color Imager (GOCI) on 24 January 2024.

Figure 2. (a) Summer (June–August) 500 hPa geopotential height anomaly field for 2024 used as the target projection pattern; (b) composite 500 hPa geopotential height anomaly field constructed from historical analog years identified using the projection index method (|PI| ≥ 0.5 standard deviations).

Figure 3. Composite winter Bohai Sea Ice Grade (BSIG) corresponding to historical analog years selected based on similarity of summer 500 hPa geopotential height anomaly patterns. Blue dots indicate analog years identified by the projection index, and red diamond denotes the observed BSIG in the subsequent winters.

Figure 4. Comparison between projected (red) and observed (black) Bohai Sea Ice Grade (BSIG) during the independent evaluation period (2020–2024).

Figure 5. Comparison between observed sea ice grade and fitted results obtained by the Top 10 correlation regression and Stepwise Seasonal Regression during the period 1979–2024.

Figure 6. (Blue) individual and (orange) cumulative variance explained by the principal components.

Figure 7. Comparison between observed sea ice grade and fitted results obtained by the PCR during the period 1979–2024.

Figure 8. Correlation coefficients between detrended BSIG and atmospheric/oceanic indices (1979–2024). The red and blue shading indicate positive and negative correlations, respectively. Out of 114 calculated indices, only those showing statistical significance (p < 0.05) in at least one lag season are displayed.

Figure 9. Locations of the Arctic sea regions utilized in the analysis.

Figure 10. Correlation heatmaps between the detrended Bohai Sea Ice Grade (BSIG) and Arctic Sea Ice Extent (1979–2024). The left and right panels display the correlation coefficients for sea ice extent and sea ice area, respectively, across 14 key Arctic regions. The horizontal axis indicates lead times ranging from 1 to 12 months (from the previous December to the current November).

Figure 11. Comparison of observed and Exp-8 fitted ice grades.

Figure 12. Comparison of observed and SVR model fitted ice grades.

Figure 13. Comparison of observed and BE-BIGNet predicted ice grades.

Figure 14. Comparison of prediction performance (RMSE) among six models across different periods.

Table 1. Configuration of predictors and prediction skill (RMSE) across different experimental schemes (Exp1–Exp18). The checkmarks (●) denote the variables included in each experiment. The regression coefficients, standard errors, and p-values for each parameter against the BSIG are also detailed in Appendix D for readers’ convenience.

	Arctic Region Sea Ice Extent			Key Index
	Barents (Lag 1 mon)	Chukchi (Lag 3 mon)	Bering (Lag 3 mon)	WP_SH_WRP (Aut0)	CT_ENSO (Sum0)	WP850_TW (Win-1)	NINO4 (Aut-1)	RMSE	RMSE (5 Yr)
Exp-1	●	●	●	●	●	●		0.557	0.559
Exp-2	●	●		●	●	●		0.571	0.354
Exp-3	●		●	●	●	●		0.599	0.612
Exp-4	●			●	●	●		0.576	0.500
Exp-5		●		●	●	●		0.599	0.433
Exp-6			●	●	●	●		0.651	0.612
Exp-7	●	●	●	●	●		●	0.557	0.559
Exp-8	●	●		●	●		●	0.557	0.354
Exp-9	●		●	●	●		●	0.571	0.612
Exp-10	●			●	●		●	0.594	0.433
Exp-11		●		●	●		●	0.612	0.433
Exp-12			●	●	●		●	0.626	0.612
Exp-13	●	●	●	●	●	●	●	0.557	0.559
Exp-14	●	●		●	●	●	●	0.557	0.354
Exp-15	●		●	●	●	●	●	0.603	0.612
Exp-16	●			●	●	●	●	0.612	0.612
Exp-17		●		●	●	●	●	0.580	0.433
Exp-18			●	●	●	●	●	0.612	0.612

Table 2. Summary of advantages and limitations of seven BSIG prediction methods.

Method	Type	Advantages	Limitations
Analog Year Analysis (AYA)	Empirical/analog-based	Physically intuitive; effective for small samples; operationally established	Assumes stationarity; limited robustness under climate change; cannot represent nonlinear dynamics
Multiple Linear Regression (MLR)	Linear statistical	Simple and interpretable; computationally efficient	Sensitive to multicollinearity; restricted to linearity; prone to overfitting under nonstationarity
Stepwise Regression	Linear statistical (feature selection)	Reduces predictor redundancy; improves parsimony	Selection instability; sampling dependence; weak physical interpretability
Principal Component Regression (PCR)	Linear statistical (dimension reduction)	Mitigates multicollinearity; improves numerical stability	Loss of physical meaning; linear constraint; limited extreme-event representation
Cross-Correlation-Based Regression	Physically constrained statistical	Incorporates lagged climate signals; partially physically interpretable	Still linear; unstable lag relationships under changing climate conditions
Support Vector Regression (SVR)	Machine learning	Captures nonlinear relationships; robust for small samples; good generalization	Sensitive to hyperparameters; limited interpretability; underestimates extremes
BE-BIGNet	Bayesian ensemble learning	Strong nonlinear representation; reduces overfitting; robust for small datasets	High computational cost; structural complexity; limited interpretability

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Guo, D.; Zhang, X.; Chen, X.; Gao, S.; Zhao, Y.; Li, G.; Hou, Q. Seasonal Prediction of the Bohai Sea Ice Grade: A Multi-Model Intercomparison. Water 2026, 18, 1242. https://doi.org/10.3390/w18101242

AMA Style

Guo D, Zhang X, Chen X, Gao S, Zhao Y, Li G, Hou Q. Seasonal Prediction of the Bohai Sea Ice Grade: A Multi-Model Intercomparison. Water. 2026; 18(10):1242. https://doi.org/10.3390/w18101242

Chicago/Turabian Style

Guo, Donglin, Xinyou Zhang, Xue Chen, Song Gao, Yiding Zhao, Ge Li, and Qiaokun Hou. 2026. "Seasonal Prediction of the Bohai Sea Ice Grade: A Multi-Model Intercomparison" Water 18, no. 10: 1242. https://doi.org/10.3390/w18101242

APA Style

Guo, D., Zhang, X., Chen, X., Gao, S., Zhao, Y., Li, G., & Hou, Q. (2026). Seasonal Prediction of the Bohai Sea Ice Grade: A Multi-Model Intercomparison. Water, 18(10), 1242. https://doi.org/10.3390/w18101242

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Seasonal Prediction of the Bohai Sea Ice Grade: A Multi-Model Intercomparison

Abstract

1. Introduction

2. Data and Methods

2.1. Data

2.2. Prediction Models

2.3. Standardization of the Predictions

3. Results

3.1. Analog Year Analysis

3.2. Multiple Linear Regression

3.3. Stepwise Regression

3.4. Principal Component Regression

3.5. Cross-Correlation-Based Regression Model

3.5.1. Circulation and Oceanic Factors

3.5.2. Arctic Sea Ice

3.5.3. Predictor Selection and Sensitivity Experiments

3.6. Support Vector Regression

3.7. Bayesian Ensemble Neural Network

4. Discussions and Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A

Appendix B

Appendix C

Appendix D

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI