1. Introduction
Sea ice is a mixture of ice crystals, air bubbles, and brine formed in seawater. It covers approximately 7–15% of the world’s oceans and is one of the most important components of the cryosphere. Sea ice influences the global ocean radiative flux balance [
1], thermohaline circulation [
2], biogeochemical cycles [
3], and polar navigation and operations [
4], as one of the global cold sources. The increase in global temperature in recent decades has led to significant changes in polar sea ice. Satellite records suggest that the extent of Arctic sea ice has repeatedly reached record lows [
5], while the extent of Antarctic sea ice varies greatly, experiencing periods of positive trends and maxima of sea ice extent [
6,
7,
8]. However, it has also reached record lows in recent years [
9]. This change in sea ice triggers several global impacts, such as changes in marine ecosystems, atmospheric circulation, and Arctic warming [
10]. Therefore, it is important to monitor the distribution of sea ice in the polar regions.
Microwave remote sensing is widely used due to its capabilities for weather-independence, all-day observation, long-term monitoring, and comprehensive coverage [
11]. In particular, satellite scatterometers have attracted attention as an active microwave remote sensing system that operates without imaging capabilities. These systems are crucial for providing real-time, accurate parameters of sea ice by emitting microwave pulses towards the Earth’s surface and analyzing the backscattered signals to infer surface properties.
Table 1 provides a comprehensive compilation of various microwave scatterometer systems along with relevant bibliographic references, demonstrating their application in sea ice monitoring by several international agencies. It is noteworthy that these scatterometers predominantly work in the C-band (5.3 GHz) and Ku-band (13.5 GHz) frequencies. Based on their beam system, they are further classified into fixed fan beam, rotating pencil beam, and rotating fan beam scatterometers. Based on polarization, these systems are differentiated into vertical polarization (VV) and horizontal polarization (HH). The European Space Agency (ESA) is leading the advancement of C-band single polarized scatterometers with fixed fan beam, particularly through the development of instruments such as the Active Microwave Instrument-Scatterometer (AMI-SCAT) and Advanced Scatterometer (ASCAT). On the other hand, the National Aeronautics and Space Administration (NASA) focused more on Ku-band dual-polarization scatterometers. This technology has evolved significantly, moving from early fixed fan beam scatterometers (such as SeaSat-A Scatterometer System, SASS, and the NASA Scatterometer Satellite, NSCAT) to advanced rotating pencil beam scatterometers (such as SeaWinds and RapidScat). In addition to ESA and NASA, India and China have also made commendable progress in this field through the independent development of the Oceansat Scatterometer (OSCAT) and HY-2 Scatterometer (HSCAT). These systems are comparable to NASA’s SeaWinds in terms of frequency, polarization, and beam system. A notable milestone was reached in 2021 with the launch of the Chinese–French Oceanography Satellite (CFOSAT), a collaborative project between China and France that carries the innovative Chinese-French Ocean Satellite Scatterometer (CSCAT)—a Ku-band dual polarization rotating fan beam scatterometer. To further expand this range of instruments, China’s FY-3E satellite was deployed in 2022, equipped with Wind Radar (WindRad), which uses dual-frequency and dual-polarization. This deployment is particularly notable, as it marks the first use of horizontal C-band polarization in Earth observation [
12], signifying a new chapter in the microwave remote sensing of sea ice.
Satellite scatterometers can provide daily observations of polar regions and are commonly used to map the extent of sea ice.
Table 1 lists retrieval algorithms used for sea ice detection related to satellite scatterometers, which are broadly divided into two groups. The first group is called the Remund/Long-NSCAT (RL-N) algorithm, which uses linear discriminant analysis to classify ice and water by constructing features such as polarization ratio and frequency ratio [
13]. The versatility of the RL-N algorithm is evident in its application to data from a variety of scatterometers, including OSCAT, HSCAT, and CSCAT, among others. These results show good consistency with sea ice concentration. However, the method has certain limitations; the accuracy of the RL-N algorithm can be significantly affected by wind-induced surface roughness and summer ice melt [
14]. These effects need to be mitigated in the post-classification process by employing binary image processing techniques and sea ice growth/retreat constraint methods [
15,
16]. The second group refers to the Royal Netherlands Meteorological Institute (KNMI) algorithm proposed by the Royal Netherlands Meteorological Institute. This algorithm is designed for the analysis of sea ice using AMI-SCAT data and introduces geophysical model functions (GMFs) for sea ice detection. It contains Bayesian classifiers that are used to determine sea ice, as described by de Haan and Stoffelen [
17] and Verspeek [
18]. This approach has been successfully applied to satellite data from SeaWinds, ASCAT, HSCAT, and CSCAT. These GMFs for different sensors exhibit unique characteristics. AMI-SCAT’s GMF describes sea ice probability using an ice cone line in the three-dimensional σ0 space, assuming isotropic backscattering [
17,
18]. ASCAT’s GMF is based on a linear relationship between the forward, mid, and aft beams, with sea ice backscatter characteristics changing with incidence angle [
19]. SeaWinds’ GMF represents sea ice properties through a linear relationship between vertically and horizontally polarized signals [
20]. CSCAT, on the other hand, uses a look-up table to define the GMF due to the complex observation model [
21]. This validation indicates that the method provides more accurate information for characterizing sea ice during the melting season. The GMFs have strong dependency on specific types of remote sensors, which can impose significant limitations when adapting to data from other sensor types or platforms.
It is worth noting that the introduction of new rotating fan beam systems such as CSCAT and WindRad introduces variability in incidence and azimuth angles, which has a significant impact on backscatter from OW and sea ice surfaces [
22]. To deal with the complexity that arises from multiple angles of incidence and azimuth, the RL-N and KNMI algorithms have also been adapted for use with the CSCAT. Zhai et al. [
23] addressed this problem by calculating the polarization ratio using the average of the horizontal and vertical polarization CSCAT backscatter coefficients over angles of incidence and azimuth. However, directly calculating the polarization ratio may reduce the detection accuracy because sea ice and seawater have different sensitivities to incidence and azimuth angles. At the same time, the CSCAT GMF model was first introduced in CSCAT by Liu et al. [
24], which simplified the problem of mixed geometry observations by selecting backscatter observations near a 40° incidence angle to build a geophysical model of sea ice. The method of selecting a 40° incidence angle reduces the rotating fan beam system to a rotating pencil bean system. Li et al. [
21] proposed a method of creating an incidence angle lookup table to construct a GMF to solve the problem of mixing incidence and azimuth observations. They observed a large standard deviation between the lowest and highest incidence angles when constructing a look-up table model, and therefore truncated the observations at both incidence angles. Moreover, the linear relationship between the incidence angle and the backscatter coefficient has also been used to correct this effect [
13,
25]. In essence, polarization ratio, frequency ratio, and normalization coefficients are still unable to fully mitigate the errors caused by wind-induced sea surface roughness. For CSCAT, with its multi-angle observations, the construction of GMFs is significantly more complex than traditional three-dimensional spatial distributions. It can no longer be expressed using a simple three-dimensional mathematical formula and instead requires a lookup table for implementation. Principal component analysis (PCA), traditionally viewed as a downscaling method, serves a dual role: It reduces the dimensionality of the feature space and highlights uncorrelated features [
26,
27]. Previous studies used PCA in sea ice retrieval by combining passive microwave radiometers and scatterometers [
28] and have showed that it is effective in reducing the complexity of feature space and improving sea ice classification efficiency. However, the application of PCA to scatterometer observations for classification feature extraction has not been studied. This represents an opportunity for novel research that could potentially affect the accuracy of sea ice detection using scatterometer data.
The study aims to (1) extract classification features from CSCAT observations using PCA, (2) build an ensemble machine learning model to detect sea ice in both the Northern and Southern Hemispheres, and (3) validate the results of CSCAT sea ice detection by comparing with similar types of sea ice products and assessing the validity and feasibility of the developed model. As such, this paper not only eliminates the dependency on specific functions through the use of PCA, but also presents an automated algorithmic framework that requires no empirical parameters or manual identification, making it versatile enough to be applied across various scatterometer platforms.
Table 1.
Representative microwave scatterometer system and related references for sea ice application from various agencies in different countries (modified based on Long [
29]).
Table 1.
Representative microwave scatterometer system and related references for sea ice application from various agencies in different countries (modified based on Long [
29]).
Sensor | Agency | Frequency | Polarization | Reference | Mission | Dates |
---|
SASS | NASA | Ku | 2VV*2 2HH*2 | Yueh et al. [30] | SeaSat | 1978.06–1978.10 |
AMI-SCAT | ESA | C | 3VV | Verspeek [18] 2 de Haan and Stoffelen [17] 2 | ERS-1 | 1991.07–1996.07 |
ERS-2 | 1995.04–2001.01 |
NSCAT | NASA | Ku | 3VV*2 1HH*2 | Remund and Long [13] 1 | ADEOS | 1996.09–1997.06 |
SeaWinds | NASA | Ku | HH-inner VV-outer | Belmonte Rivas and Stoffelen [20] 2 | ADEOS-2 | 2002.12–2003.10 |
QuikSCAT | 1999.06–2009.11 |
ASCAT | ESA | C | 3VV*2 | Belmonte Rivas et al. [19] 2 Breivik et al. [31] 2 Lindell and Long [32] 2 Aaboe et al. [33] 2 | Metop-A | 2006.10–2021.11 |
Metop-B | 2012.09– |
Metop-C | 2018.11– |
OSCAT | ISRO | Ku | HH-inner VV-outer | Hill and Long [34] 1 | OceanSat-2 ScatSAT-1 | 2009.09–2014.02 2016.09–2021.02 |
HSCAT | NSOAS | Ku | HH-inner VV-outer | Xu et al. [35] 1 Li et al. [16] 1 Zou et al. [36] 1 | HY-2A | 2011.08–2022.04 |
HY-2B | 2018.10– |
HY-2C | 2020.09– |
HY-2D | 2021.05– |
RapidScat | NASA | Ku | VV HH | Singh et al. [28] 1 | ISS RapidScat | 2014.09–2016.08 |
CSCAT | CNSA | Ku | VV HH | Liu et al. [24] 2 Zhai et al. [23] 1 Li et al. [21] 2 Liu et al. [37] 2 Xu et al. [38] 1 | CFOSAT | 2018.10–2023.01 |
WindRAD | CMA | C/Ku | VV*2 HH*2 | Zhai et al. [39] 1 | FY-3E | 2021.05– |
3. Results
After reviewing the L2 orbit data and daily projected data, there were a total of 116 days where CSCAT data was unusable. Specifically, 50 days of Level 2A orbital observations were missing, and 66 days were eliminated due to quality control measures (e.g., removal of invalid values, rain-affected pixels, and land pixels). Additionally, since we needed to collect samples from the past 5 consecutive days, a total of 22 days were affected due to insufficient sample sizes. The specific relevant data and reasons for deletion are listed in the
Table 3. No sea ice forecasts were made in the study for these 138 data days. The number of days used for statistical analysis was 90%.
3.1. Characteristics of CSCAT Features
The CSCAT data included eight VV polarization (
), eight HH polarization (
), and eight polarization ratios (
). There were obvious correlations between the properties of these observations.
Figure 5a,b show the Pearson’s correlation for these observations in the Northern and Southern Hemisphere, respectively. As can be seen from the figures, there was not only a high correlation between adjacent
(or
), but also a correlation between the set of
and
for each WVC. We applied PCA to the CSCAT polar backscatter observation dataset
and assigned it to fewer principal components. With the bivariate plots of the principal component analysis (
Figure 5c,d), we found that among the eight WVCs, the contributions of WVCs in views 3, 4, and 8 were more significant in
and
. In comparison, the results of view 8 showed a significant deviation from the results of views 3 and 4. Views 3 and 4 primarily occupied the outer swath and nadir swath, respectively. The WVCs of the nadir swath were characterized by azimuth angles of 0/360 or 180 degrees, corresponding to the forward and backward perspectives, accompanied by a wide range of incidence angles. Conversely, the WVCs in the outer swath were defined by azimuth angles of approximately 90 or 270 degrees, indicating lateral viewing angles, again with a wide range of incidence angles. View 8 was consistently located in the region designated the sweet swath, characterized by significant variability in the antenna’s azimuth angle. According to Li et al. [
40], the WVCs in the sweet swath were the optimal area for measuring sea surface wind. Likewise, the WVCs of the sweet swath in the principal component analysis occupied a larger amount of information. Therefore, the first two principal components of
and
effectively characterized the observed information in the outer swath, nadir swath, and sweet swath. In contrast, the bivariate plot of
shows that the first two principal components could only represent the outer swath, suggesting that the amount of information gathered using only the first two principal components accounted for almost 50% of the total observations. Spatial distribution analyses of the first four (out of eight) principal components conducted on 10 January 2019 for both the Northern and Southern Hemispheres (as shown in
Figure 6a,b), revealed that the importance of
and
was mainly concentrated within the first two components. Meanwhile, the significance of the VH polarization was evenly distributed across all four components.
Figure 7 shows the daily variation curves for the contributions of the first two principal components of
and
and the first four principal components of
. These results suggested that the choice of the first two principal components for
(
) polarization accounted for over 80% of the total variance explained, while the choice of the first four principal components for
explained more than 65% of the total variance.
3.2. Period Choice for Dynamic Sampling
Figure 8 presents the statistical averages of the first principal component of
in both the Northern and Southern Hemispheres using different sampling periods of 5, 7, 10, 15, and 20 days. The analysis shows that shorter sample periods result in significant noise in the first principal component of the VV polarization, thereby enhancing contrast during seasonal variations. Conversely, extending the sampling period results in a smoother curve for the first principal component feature of VV polarization, suggesting a diminishing emphasis on seasonal variations. Subsequent comparisons of the detection performance of the random forest model were performed for the Northern and Southern Hemispheres in January–March and June–August, respectively, over the same range of sampling periods (5, 7, 10, 15, and 20 days).
Table 4 presents the F1 scores for five different models across various sampling periods It highlights which sampling period (e.g., 5 days, 7 days, etc.) resulted in the highest F1 scores for each model. The table summarizes a total of 10 statistical scenarios, with 8 scenarios showing that the 5-day sampling period consistently achieved the highest F1 score.
After comprehensive evaluation, a sample period of 5 days was chosen for the dynamic sample statistics. Taking into account the possibility of incomplete sampling periods due to missing observations on certain dates, a forward search was performed to find available data no longer than 5 days. If the forward search exceeded 20 days, then dynamic sample data were considered missing on that date, and no modeling or sea ice detection was performed for that day.
3.3. Assessment for Single and Ensemble Models
In this study, five individual models were modelled: Gnb, Log, Knn, Dt, and Rfc. The variable importance of these models was assessed during the model building process, and cross-validation of F1 scores was performed to provide insight into the performance of the models and the relative importance of each feature. Variable importance is an important measure of how strongly features influence the predictive power of a model. We assessed feature importance using different metrics tailored to each model. For Dt and Rfc, Gini importance was used, which reflects the reduction in contamination by each feature. For Log and Gnb, feature weights from regression coefficients were used, with the magnitude of the coefficients indicating feature influence, and although Knn does not directly indicate feature importance, it can be assessed indirectly via model performance or weighted configurations.
Figure 9a shows the feature importance of these five models for the Northern and Southern Hemispheres. There are some differences in how different models rank for feature importance. Different models assigned different levels of importance to features. The most important feature was the first principal component of HH polarization. The ranking of
and
varied between the models: They took second and third place in Log-ranked third and second in others The situation in the Southern Hemisphere was more complex, as there were significant differences in the order of feature importance among models. In the Dt, Knn, and Rfc models,
was the most important feature, while in the Gnb model,
was the most important, and in the Log model,
was the most important. This reflected the geographical differences between the Northern and Southern Hemispheres and the different responses to polarization features. Consequently, the models also showed variations in the ranking of radar polarization features for different regions.
Figure 9b summarizes the distribution of F1 scores across the models during a 10-fold cross-validation period from 1 January 2019 to 31 December 2022. The F1 scores, which ranged from 0.660 to 0.750 for both hemispheres, indicated a commendable balance of precision and recall achieved by the models over the past four years. The Log model in the Southern Hemisphere showed the largest standard deviation (0.088), indicating greater variability in performance across different subsets compared to the other models. The standard deviations of the other models were between 0.028 and 0.069.
Figure 9c shows the time series of F1 values obtained by 10-fold cross-validation for different machine learning models from 1 January 2019 to 31 December 2022. The Knn, Rfc, and ensemble models consistently showed relatively stable and high F1 values throughout both the Northern and Southern Hemispheres, indicating strong and sustained forecast performance in these regions. In contrast, Log showed larger fluctuations in F1 results and generally lower values, suggesting weaker and more unstable generalization abilities, especially in the Northern Hemisphere. For all models, there was no obvious temporal trend showing significant improvements or declines in F1 score over the analyzed period, suggesting stable model performance without noticeable deterioration or improvement. However, the different models showed differences in performance over the course of the season. Notably, in the Northern Hemisphere, the F1 score for all models increased continuously from July to September, and similarly, in the Southern Hemisphere, F1 scores were higher from January to March compared to other months. This pattern, observed continuously from 2019 to 2022, indicates that Ku-band CSCAT is particularly effective in identifying the melting status of sea ice during the summer months. On the other hand, the Log model outperformed Dt and Gnb in both hemispheres from January to March, but in the Southern Hemisphere, its F1 values were significantly lower than those of Dt and Gnb from July to September. The consistent performance of these models highlights their robustness and reliability in analyzing sea ice dynamics using radar polarization features, despite the inherent variability in environmental conditions between the Northern and Southern Hemispheres.
To evaluate the models, we summarized confusion matrices for individual and integrated models (
Table 5). In both the Northern and Southern Hemispheres, the performance of the five individual models varied in different classification scenarios. For open water and close ice, Knn and Rfc achieved F1 values above 0.9, while the other three models also achieved values around 0.8. However, all five models had very low F1 values of less than 0.4 on open ice. Among them, Knn performed the best with an F1 value above 0.3, followed by Rfc in the Northern Hemisphere and Dt in the Southern Hemisphere. Among the results of five single models between 2019 and 2022, Knn and Rfc showed strong performance in the Northern Hemisphere, with F1 and OA values above 0.9 and kappa coefficients above 0.8, indicating high data consistency. In contrast, Dt, Gnb, and Log showed F1 and OA values between 0.7 and 0.9, with kappa coefficients ranging between 0.569 and 0.6887, reflecting moderate consistency. While Dt and Log had OA values around 0.7 in the Southern Hemisphere, Knn, Rfc, and Gnb reached OA values between 0.8 and 0.9. Only Knn and Rfc had kappa values above 0.7, indicating high model consistency, while Dt, Gnb, and Log had Kappa values below 0.6, indicating medium consistency. Overall, Knn and Rfc performed better than the other single models in terms of detection accuracy.
In the Northern Hemisphere, the ensemble models slightly outperformed the best two individual models, Rfc and Knn, in terms of OA and Kappa values. Additionally, while the F1 score for open ice improved compared to Rfc, it still remained lower than that of Knn. A similar trend was observed in the Southern Hemisphere. The ensemble models showed a slight advantage over the best individual models in terms of OA and Kappa values. For open ice, the F1 score was higher than that of Rfc but still lower than Knn. Based on these observations, the ensemble models can be considered the preferred choice. Their overall performance, demonstrated by the improved OA and Kappa values, indicates their effectiveness in achieving better consistency and reliability across different scenarios. While KNN still outperforms the ensemble models in terms of F1 scores for open ice, the ensemble models’ superior overall performance suggests that they offer a more balanced and reliable solution across various classification tasks.
Figure 10 illustrates the time series of assessment parameters from 1 January 2019 to 31 December 2022. The overall classification metrics (OA and Kappa) had more similar daily trends to the UA, PA, and F1 values for close ice. However, these values were significantly lower from July to October in the Northern Hemisphere and from January to April in the Southern Hemisphere. UA, PA, and F1 values were generally higher in open water, with less seasonal variation and relatively low values, mainly in the Northern Hemisphere from January to April and in the Southern Hemisphere from April to October. The UA, PA, and F1 values for close ice and open water were higher than those for open ice.
Figure 11 presents an error analysis based on the model classification confusion matrix, with the y-axis representing the misclassification rates for each class. Specifically, these rates were calculated by determining the number of pixels for each class (e.g., “Close Ice”, “Open Ice”, and “Open Water”) that were incorrectly classified as other classes and then dividing these misclassified pixel counts by the total number of pixels for that class. Error analysis showed that close ice was frequently mistakenly classified as open ice from July to October in the Northern Hemisphere and from January to April in the Southern Hemisphere, resulting in decreased classification accuracy. On the other hand, open water was frequently mistakenly identified as close ice from April to October in the Southern Hemisphere and January to April in the Northern Hemisphere, which also affected the accuracy of the classification. Furthermore, a significant portion of open ice was incorrectly identified as close ice, which helps to explain why close ice classification accuracy was typically lower.
3.4. Single and Ensemble Model-Based Sea Ice Mapping
This study used five single models and an averaged ensemble model for sea ice detection. The Northern and Southern Hemisphere sea ice detection results for 10 December 2019 and 10 June 2019 are shown in
Figure 12. Each column represents origin reference SIC, classified reference SIC, and different models, namely Dt, Gnb, Knn, Log, and Rfc, from left to right. The reference map provided the observed sea ice distribution, serving as a comparison baseline. It is evident that the amount of sea ice was similar in both the Northern and Southern Hemispheres. There were differences between the models in terms of sea ice detection. The Knn and Rfc models overall performed better than the other models in classifying sea ice in both the Northern and Southern Hemispheres. The Knn model detected sea ice by calculating the distance between samples, while the Rfc model classified it by constructing a decision tree integration. These models might be better suited to the capture the spatiotemporal correlation of sea ice and dealing with its nonlinear features. Compared to the original reference maps, the Knn and Rfc single models tended to misclassify open ice as close ice at the sea ice boundary, whereas Dt and Log were more prone to misclassifying close ice as open ice. These differences were likely due to variations in the algorithmic principles and feature extraction functions. The advantage of Rfc and Knn lay in their overall robustness and stability for close ice, while Dt and Log demonstrated certain advantages in handling complex boundaries, particularly in distinguishing open ice from close ice.
By averaging the results of those five models, a sea ice map of the Northern and Southern Hemispheres was created. However, when comparing these with the ensemble model results, it was clear that the significant misclassifications in single models were effectively mitigated in the ensemble model. The ensemble model not only retained the correct classification of close ice from Knn and Rfc, but it also gained the accurate classification of open ice from the other three single models. Ensemble models reduce errors that may occur with single models by combining predictions from multiple models. Therefore, they are expected to improve the accuracy of sea ice detection to some extent.
4. Discussion
In this section, we focus on discussing the performance of the ensemble model across different time periods and spatial ranges in order to comprehensively assess its applicability and accuracy. We achieve this by using the sea ice extent derived from other sea ice concentration to validate the CSCAT sea ice classification results and by conducting detailed evaluations using metrics such as R2 and RMSE. Additionally, we compare the CSCAT sea ice classification with similar sea ice edge and concentration products as well as with Sentinel 1 SAR images.
4.1. Comparison with Daily Sea Ice Extent
Sea ice extent (SIE) is usually defined as the sum of the area of ocean grid cells with a sea ice concentration greater than 15%. A SIC threshold of 15% is not applied regularly [
60]. This parameter represents the maximum sea ice cover and is crucial for assessing climate change. However, because our model classifies sea ice using a 30%/70% SIC threshold, we adjusted our analysis to use a 30% SIC threshold to calculate sea ice extent in order to ensure a fair comparison:
. For CSCAT, sea ice extent was calculated by summing the area of pixels classified as close ice and open ice.
To assess the temporal accuracy of our sea ice detection, we conducted a daily comparison with the daily sea ice area data released by the NSIDC and OSISAF from 2019 to 2022. Despite significant daily fluctuations, the sea ice extent derived from CSCAT showed a high level of agreement with the officially published daily data.
Figure 13b1,b2 show the time series of sea ice extent differences between CSCAT, OSISAF, and NSIDC across different hemispheres, respectively. In the Northern Hemisphere, CSCAT underestimated sea ice extent by −0.06 ± 0.36 million km
2 compared to NSIDC, while OSISAF underestimated it by −0.12 ± 0.09 million km
2. In the Southern Hemisphere, CSCAT underestimated sea ice extent by −0.03 ± 0.48 million km
2, and OSISAF underestimated it by −0.11 ± 0.09 million km
2. The comparison results showed that CSCAT generally estimated lower sea ice extent than NSIDC. Zhai et al. [
23] used a random forest approach to estimate the CSCAT distribution and compared the differences to OSISAF sea ice extent. Their results suggested a lower estimate for the Northern Hemisphere, consistent with our study, but a higher estimate for the Southern Hemisphere, contradicting our results. According to the model error analysis, open ice was misclassified as close ice in most months in the Southern Hemisphere, with many open ice areas misclassified as open water from January to April (
Figure 11b (2)). This likely led to the overall lower sea ice extent in the Southern Hemisphere observed in our study.
To better present the results, we have graphically displayed the daily sea ice extent on a monthly basis. The visualization showed that sea ice extent from CSCAT matched NSIDC in January, February, May, June, July, and August. However, the results were slightly overestimated in March and April and slightly underestimated from September to December. In the Southern Hemisphere, the sea ice extent in February, August, and September corresponded closely, yet we observed an underestimation from March to July and an overestimation from November to January of the following year. These variations indicated that while the model performed well under specific conditions, such as during months with relatively stable sea ice conditions, its applicability under other conditions, particularly during seasonal transitions, was challenged. The observed underestimations or overestimations were likely associated with dynamic environmental factors that significantly impacted sea ice formation and melting during these transitional periods. This is in stark contrast to the discrepancies found between QuikSCAT and ASCAT compared to AMSR-E, particularly during sea ice melt months, as reported by Belmonte Rivas et al. [
19]. It is noteworthy that active scatterometers and passive microwave radiometers showed significant differences during periods of rapid sea ice change. Furthermore, linear regression analysis of the CSCAT and NSIDC sea ice extent data yielded R-squared values of 0.991 and 0.993 with corresponding RMSE values of 0.340 and 0.485 million km
2. These results confirm that our method for estimating sea ice extent provided consistent results over longer time periods compared to other accepted data sources.
4.2. Comparison with Sea Ice Concentration and Sea Ice Edge Datasets
ASCAT, SSMIS, and AMSR2 are the three other sources of sea ice cover data. ASCAT is a C-band HH-polarized scatterometer that provides sea ice detection similar to CSCAT. SSMIS and AMSR2 are the two dominant microwave radiometric instruments commonly used to determine sea ice concentration and provide more reliable sea ice distributions. To compare the spatial differences between the CSCAT sea ice edge and other sea ice cover products, we selected the sea ice edge from ASCAT and sea ice concentration from NSIDC. The comparison involved contrasting the CSCAT and ASCAT sea ice edges with the SSMIS sea ice concentration. This comparison employed a statistical method similar to a confusion matrix (Equation (6)). First, the NSIDC sea ice concentration was divided into three classes based on thresholds: less than 30% was open water (OW), 30–70% was open ice (OI), and more than 70% was close ice (CI). The CSCAT and ASCAT sea ice edge results also included these three types. Monthly mode statistics were calculated for each pixel compared to the NSIDC sea ice edge, resulting in the monthly average differences for sea ice edge between CSCAT, ASCAT, and NSIDC. The consistency of
, and
can be calculated using Equation (7).
For this study, we compared sea ice detection in the Northern Hemisphere on 10 January 2019 and in the Southern Hemisphere on 10 June 2019 from threedifferent sources: CSCAT, ASCAT, and NSIDC (
Figure 14). The results of each data source were represented by a color-coded scale indicating the sea ice concentration and sea ice edge results. For NSIDC SIC, the scale ranges from dark red (0%, open water) to deep purple (100%, close ice). For sea ice edge products, the scale ranges from dark blue (open water) to light blue (open ice) to white (close ice). The comparison results indicate that the sea ice edge obtained by CSCAT was consistent with the sea ice acquired by ASCAT and NSIDC, but there were also some deviations. In the Northern Hemisphere, CSCAT detected open ice in the Greenland Sea, Baffin Bay, and Hudson Bay, while in the Southern Hemisphere, open ice was detected at the ice edge regions of the Ross Sea. Compared to ASCAT, this open ice tended to be less, which can also be observed in the sea ice distribution detected by Li et al. [
21] using GMFs.
From the consistency time series for sea ice edge (
Figure 15), it was observed that, in both the Southern and Northern Hemispheres, the consistency of close ice observed by CSCAT with that observed by NSIDC (0.903, 0.931) was slightly better than that of ASCAT (0.898, 0.922), while ASCAT (0.998, 0.997) showed slightly better performance for open water compared to CSCAT (0.988, 0.988). For open ice, CSCAT performed worse overall than ASCAT, although with slight differences in the different hemispheres. In the Northern Hemisphere, CSCAT showed lower consistency with NSIDC compared to ASCAT in most months. In the Southern Hemisphere, the differences between ASCAT and CSCAT in terms of open ice exhibited a seasonal symmetry: in the months when the consistency of ASCAT was poor (January to April), CSCAT performed well, and in the months when the consistency of CSCAT was poor (May to December), ASCAT performed well. This seasonal symmetry for the consistency of open ice in the Southern Hemisphere was also observed in Zhai et al. [
23]. This suggested that CSCAT had a better ability to identify open ice compared to ASCAT from January to April. These differences were mainly due to the different frequencies of the two sensors. CSCAT operates in the Ku band, which has a shorter wavelength than the C-band ASCAT. The shorter wavelength is more sensitive to the rough surface of multiyear ice but has a weaker response to thin ice edges.
In the Northern Hemisphere, CSCAT showed lower consistency for open ice from January to April and lower consistency for close ice from May to August (
Figure 15a (1),(2)). The spatial distribution of sea ice cover differences between CSCAT, ASCAT, and NSIDC in February and May revealed that the misclassification of sea ice by CSCAT in the Northern Hemisphere was mainly concentrated along the ice edges in the Barents Sea and the Greenland Sea, where close ice was incorrectly classified as open ice, which was less common in the spatial differences observed with ASCAT (
Figure 16a,c). In August, CSCAT misclassified open ice as close ice, especially in the northern sea areas of Canada, while ASCAT showed less misclassification in these areas. In the Southern Hemisphere, CSCAT showed lower consistency for open ice from April to December but higher consistency from January to March. Misclassification still occurred at the ice edge, with open ice being mostly classified as close ice in August and November (
Figure 16b,d). In March, mutual misclassification between close and open ice occurred at the sea ice edge of East Antarctica.
The comparison between CSCAT and ASCAT revealed that CSCAT had slightly better consistency with NSIDC for close ice in both hemispheres, while ASCAT performed better for open water. CSCAT generally underperformed compared to ASCAT for open ice, especially in the Northern Hemisphere. However, in the Southern Hemisphere, CSCAT outperformed ASCAT from January to April, whereas ASCAT performed better from May to December. Misclassifications were primarily observed along the ice edges in the Barents Sea and Greenland Sea, where CSCAT often misclassified close ice as open ice, unlike ASCAT. In August, CSCAT also misclassified open ice as close ice in central sea areas, while ASCAT’s misclassifications occurred mainly at the boundary between open and close ice. In the Southern Hemisphere, CSCAT showed lower consistency in detecting open ice from April to December but higher consistency from January to March, with misclassifications mostly occurring along the ice edges.
4.3. Comparison with High Resolution SAR Imagery
To confirm the regional detection of sea ice, a comparison with Sentinel-1 SAR data was performed. For the validation process, a section of the Northern Hemisphere on 19 June and 8 March 2019 and a section of the Southern Hemisphere on 19 June 2019 were selected and verified using the map data. During the process, we performed various data processing procedures using the SNAP 8.0.0 software, namely orbital corrections, radiometric calibrations, dB conversions, and latitude/longitude projections. We then created a geo-mosaic using ArcGIS 10.7 software, which allowed us to create SAR images covering many parts of the polar regions. In addition, we used ArcGIS software to extract the peripheral sea ice boundary from the sea ice results and overlay it on the corresponding SAR image.
Figure 17a depicts the expanse of open ice in the Greenland Sea as captured by the Sentinel-1 SAR imagery. The open ice appears as a lighter shade of gray in contrast to the deep gray of the seawater. The features observed were consistent with the sea ice boundaries extracted by CSCAT. Moreover, the Sentinel-1 SAR image dated 8 March 2019, facilitated the clear identification of sea ice and open water in Baffin Bay. This identification closely aligned with our extracted sea ice boundary positions, as illustrated in
Figure 17b. Similar outcomes were demonstrated over two regions in the Southern Hemisphere above the Indian Ocean. The sea ice boundaries extracted by CSCAT showed superior performance in delineating sea ice from open waters (
Figure 17c). Near the Antarctic Peninsula, CSCAT’s detection of sea ice boundaries differed from the SAR images and incorrectly marked some open water as sea ice. Despite this, CSCAT had accurately identified a distinct, isolated region of sea ice. Our detection results showed similarity in the shape and position of sea ice compared to Sentinel-1 SAR imagery while also capturing the variations and structural features at the sea ice edge.
5. Conclusions
The aim of this study was to automatically extract feature information from CSCAT using PCA to retrieve sea ice data in the Northern and Southern Hemispheres and to use an ensemble machine learning algorithm to obtain reliable daily sea ice distributions from 2019 to 2022. PCA effectively extracted principal component features representing outer swath, zones, and close ice. We trained ensemble models based on Knn, Log, Dt, Gnb, and Rfc. Rfc/Knn exhibited high error rates in detecting open ice, often misclassifying it as close ice, particularly at the sea ice boundary. In contrast, Dt/Log/Gnb performed more effectively in identifying open ice at the sea ice boundary. By combining these models, we improved overall classification accuracy both for open ice and close ice. The sea ice edge detected by CSCAT was also independently validated against NSIDC’s sea ice concentration and ASCAT’s sea ice edge, showing high correlation in sea ice extent and temporal–spatial consistency as well as good alignment with SAR imagery at the sea ice–water boundary. The PCA extraction method significantly enhances the feature extraction capabilities of scatterometers with fan-beam and rotating-beam configurations. It complements traditional sea ice detection approaches and allows for the precise and reliable classification of sea ice and open water. Although CSCAT performed well in distinguishing between sea ice and open water, it was prone to confusion between different types of sea ice, with especially limited capability to identify open ice. In the Arctic, such misclassifications were most notable in the Greenland Sea in February and May and in parts of the central region in August; in the Antarctic, they were primarily observed across the entire Antarctic sea ice–water boundary in August and November and along the Antarctic coastline and sea water boundary in May. Open ice is typically associated with the marginal ice zone (MIZ), which is more important than the central region (close ice) for shipping route development, fishery resources, ecosystems, and climate responses. So, good performance on open ice is more important. Future work should focus on further optimizing the algorithm to improve open ice recognition and extending its application to other radar systems, incorporating additional data from ship-based observations, optical imagery, and SAR to enhance detection performance.