1. Introduction
Leads can be generally defined as quasi-linear, narrow fractures in sea ice that form because of shear or divergence within the ice pack. Leads can be less than a kilometer or extend several hundred kilometers in length. Some glossary definitions of leads include language that defines minimal lead width as wide enough to be navigable by ships, differentiate coastal or shore leads from pack ice leads, or have open water requirements rather being covered by new ice [
1]. Sea ice leads affect the energy and mass exchanges between the ocean and atmosphere in the Arctic directly and indirectly, particularly in winter. At night, without ice cover, the relatively warm sea water releases more heat and moisture into the atmosphere than sea ice. In winter, the turbulent heat flux over sea ice leads can be two orders of magnitude larger than over ice due to the air-water temperature difference [
2]. Due to a lower albedo than the surrounding ice, leads absorb more solar energy during daylight periods. This can result in warming the water and accelerating melt. Though sea ice leads cover a small percentage of the total surface area of the Arctic Ocean [
3], the heat fluxes dominate the wintertime Arctic boundary layer heat budget [
2,
4]. In the winter, spring, and fall, leads impact the local boundary layer structure [
5], boundary layer cloud [
6], and aerosol [
7] properties, thus affecting the surface energy balance. Wind stresses and stresses within the sea ice are the primary factors in the formation of leads [
8]. For climate studies (on the order of 20 or more years), trends in lead characteristics [
9] help advance our understanding of both thermodynamic and dynamic [
10,
11,
12] processes in the Arctic.
There has been a recent focus on lead detection from moderate resolution thermal infrared (IR) satellite imagers [
9,
13,
14,
15,
16]. Sea ice leads can also be detected using satellite passive and active microwave data [
10,
17,
18,
19,
20], with the advantage being that clouds are transparent or semi-transparent at microwave wavelengths. Ongoing work is investigating the application of an AI technique towards the detection of leads in microwave imagery [
21]. Microwave platforms, however, lack either wide spatial coverage or are at such coarse resolutions that many sea ice fractures are too narrow to resolve.
In recent years, projects have started to investigate the application of artificial intelligence (AI) in sea ice detection and classification [
20,
21,
22,
23,
24]. An approach has been developed and implemented for lead detection where a convolution neural network is used to detect sea ice leads based on the thermal contrast and unique spatial features in infrared satellite imagery from the Moderate Resolution Imaging Spectroradiometer (MODIS) or Visible Infrared Imaging Radiometer Suite (VIIRS) [
16]. In this approach, spatial information and trainable AI replace the empirically derived detection algorithm involving traditional image-processing techniques referred to here as the “legacy” sea ice leads detection algorithm [
15].
The leads detected with the AI approach applied to MODIS [
25] and VIIRS [
26] data have resulted in a long-term product covering the period from 2002 to the present. This product provides an opportunity to study the climatology of Arctic Sea ice leads and their changes over the last two decades.
2. Materials and Methods
For this study, the MODIS dataset covers a twenty-year period beginning in the winter of 2002–2003 [
25], the first winter after the Aqua satellite joined Terra in providing two-satellite, moderate-resolution MODIS coverage of the Arctic Ocean. The VIIRS dataset begins in the winter of 2017–2018 [
26], the first winter after VIIRS on NOAA-20 joined VIIRS on Suomi National Polar-orbiting Partnership (SNPP) satellite. VIIRS results are compared against MODIS results over the time series as both are operating as two satellite systems. The VIIRS results are not included in the twenty-year climatology presented here because there would be a detection bias with more leads being detected in the seasons since the 2011 launch of VIIRS on SNPP and the 2017 launch of VIIRS on NOAA-20. Due to observations at new times and observations with different viewing geometry, the addition of observations from VIIRS would result in more lead detections. The intention is to identify a trend in lead occurrence and the addition of lead detections from VIIRS would only increase the rate of lead detection without any change in the likelihood for leads to occur.
In this application, leads do not have constraints on minimum width or length requirements. Similarly, the shape of the leads can generally be described as quasi-linear, but deformation of the ice can introduce curvature, and intersecting leads can result in more complex shapes that are not necessarily excluded from detection as leads. The detection criteria also does not have any predefined temperature detection thresholds, which allows for the detection of leads that is either open water, a mixture of ice and water (slush), leads covered by a thin layer of new ice, or leads covered by optically thin clouds. The native lead detection product [
25,
26] has a domain of ocean water above 60°N, any lead detection outside of the ice pack would be assumed to be a false positive. To screen out false positives, the daily NOAA/NSIDC Climate Data Record of Passive Microwave Sea Ice Concentration product [
27] is used. An ice pack mask is derived by finding the boundary of the largest continuous region of non-zero ice concentration values. Outside of the ice pack mask, lead detection products are ignored. This reduces possible erroneous results in warmer regions and limits ambiguity between leads and open water after ice floes break away from the ice pack.
The study utilizes the sea ice leads technique first described by Hoffman et al. [
16]. The method uses a particular kind of convolutional neural network, a U-Net to perform image segmentation on an atmospheric window channel (11 µm) brightness temperature imagery. The results of the image segmentation are aggregated from all daily satellite overpasses, and positive lead detections are defined as locations were the image segmentation identified as a possible lead in multiple overpasses over the course of the day. A detection bias was discovered when processing a long time series of imagery using a fixed grid space (described in Figure 3 of Hoffman et al. [
16]). Detection artifacts around the processing grid boundaries would appear when processing a long time series of data. Leads can be detected near processing region boundaries, and leads that span across a processing region boundary can be detected in both regions when the lead is sufficiently long in each processing region (as long as the lead segment in each sub-region maintains the spatial characteristics of a lead). However, when a lead spans a processing region boundary, often a lead segment on one side of the processing region boundary becomes too short for lead detection. As a result of discontinuities along the processing grid, boundaries appear in a long time series. To mitigate this issue, rather than using a fixed processing grid, the location of the processing grids are made dynamic with the origin defined as the first line and element of the satellite imagery after the being is projected in the standard 1 km resolution EASE-Grid 2.0 projection [
28]. This way, over a long time series of satellite imagery, the processing sub-regions rarely overlap and detection artifacts caused by the processing method do not appear in the data sets [
25,
26].
For the trend analysis, least squared linear regression is used to calculate best-fit trend lines along with 95% confidence intervals. Results are computed as monthly averages. Each monthly average is assigned a date of the first of each corresponding month, and in this way, slopes are calculated at a rate of change per day. Due to the change rate per day being so small, the slopes were multiplied by 365.25 to represent the rate of change per year.
3. Results
The first result to highlight is that the AI-based product [
25] is the first satellite lead detection product to span 20 seasons in the Arctic. No other published dataset can offer the combination of a long time series, wide spatial coverage, and moderate resolution (1 km) of surface features, all from a consistent satellite platform. Spatial patterns of seasonal lead detections are shown in
Figure 1, where lead detections are given as the percentage of days with a lead detection occurring at each location within the winter season, defined as 1 November to 30 April. The majority of the area in
Figure 1 has at least one lead detection per season (shades of blue to red). High lead densities appear over the Beaufort Sea, Chukchi Sea, areas north of Greenland, and the Canadian Archipelago. Low lead densities are seen over the central Arctic Ocean and around the North Pole, though the locations with high lead detection density change from season to season. The spatial patterns of leads are similar year to year but there is some regional variability. Similar spatial distributions of leads are seen in monthly means from November to April (
Figure 2). For the seasonal evolution, the overall lead activity tends to be low in November and April, and highest between December and March (
Figure 2).
As cloud coverage can obscure leads at thermal infrared wavelengths, analysis of daily lead detections would be incomplete without analysis of changes in cloud coverage. Recent studies [
29,
30,
31] have shown cloud coverage has been increasing over the Arctic. Our own analysis of the MODIS clear sky mask [
32,
33] over the winter seasons since 2002 in the Arctic is shown by year in
Figure 3, by month in
Figure 4, and quantitatively in
Figure 5. Seasonally, comparing
Figure 1 and
Figure 3 regions of high lead detection frequency tend to correspond with regions with higher clear sky frequencies, and locations with relatively low lead detection frequency tend to have lower rates of clear sky coverage. Comparing monthly results in
Figure 2 and
Figure 4, clear sky frequency is generally lower in November and April, which also corresponds with months with low lead detection frequencies. Overall, we observe a decrease in clear sky observations over sea ice on the order of 3.5% per decade (
Figure 5). December has a relatively small decrease in clear sky observations on the order of 2.9% per decade. Larger decreases of 5.1% and 5.6% per decade are observed in February and March. Similarly, it is important to consider how a decrease in the extent of sea ice could cause a reduction in the area in which lead formation is possible. The majority of the sea ice area is lost in the warm season while ice coverage in the Arctic remains high through the majority of the winter season as shown in
Figure 6; quantitative results are shown in
Figure 7. While there is some ice area loss, the amount of area lost is relatively small and limited to the marginal ice zone. Further, relatively few leads are detected in the marginal ice zone; the majority of the lead detections are in areas where there is sea ice all winter long. Some of this may be an artifact of a detection bias; leads are harder to detect in the marginal ice zones because of lower thermal contrast between leads and open water. Additionally, ice floes have a higher velocity than pack ice, and the lead detection technique is dependent on the assumption that leads move slow enough that they will persist at the same location over a span of multiple satellite overpasses. Due to these factors, our work focuses on leads within the ice pack, and within the ice pack we do not find a correlation between the rate of change of ice cover and the proportion of lead area to ice area.
To interpret trends in lead detection, results are presented in a few different ways in
Figure 8. Leads are shown as a percentage of the ocean area (Panel A), as a percentage of the sea ice area (Panel B), and as total area (Panel C). There is a slight decrease in lead area as a percentage of the ocean area (A) and cumulative lead area (C), but no statistically significant trend as a proportion of sea ice area (B). An interpretation of these results is that lead detection area is decreasing at about the same rate that sea ice is decreasing. This might be expected as lead detection frequency normalized by sea ice area is relatively constant while absolute lead detection area is decreasing. However, this may be a coincidence rather than causation. As previously discussed, there are relatively few lead detections in the marginal ice zone (where there are changes in ice coverage), so the change in lead area divided by ice area is largely driven by a change in the denominator rather than the numerator. On the other hand, a decrease in clear sky (increase in cloudiness) makes it difficult to attribute a change in lead detection frequency to a change in the formation of leads as opposed to a detection bias where there is a decrease in clear sky coverage that is making leads increasingly difficult to detect. Sky coverage appears to be the more important factor and may suggest that lead activity is increasing despite a decrease in detections. It is interesting to note that the local minima in lead activity occurs in November and April while there is variability year to year with the local maxima in lead activity occurring anytime between December and March. These trends do not follow the pattern in ice area (see
Figure 7) but more closely follow the pattern in clear sky frequency (see
Figure 5).
Studies such as Qu et al. [
9] have examined the relationship between cloud coverage and lead area. We have expanded that type of analysis to look at the relationship between clear sky frequency and lead detection frequency as illustrated in
Figure 9. In Panel A, monthly lead detection frequency can be approximated over the range of clear sky frequency between 25–55% by Equation (1):
With the equation for lead frequency, it is possible to extrapolate or calculate an expected lead detection frequency (in Panel B) as a function of clear sky frequency (
Figure 3). This is not to suggest that there is a causal relationship between clear sky and lead, but rather a way to approximate the detection bias caused by the presence of clouds. This linear relationship is only valid on a monthly time scale and will fail when clear sky frequency is above 55% or below 25%. Lead detection may still be possible in the presence of clouds either because the cloud coverage may not persist all day long or the cloud may be optically thin enough that the warm lead may radiate through some clouds. Equation (1) will not be valid to estimate lead frequency on a daily basis where clear sky frequency can range from 0% or 100%. While it is possible to conduct an analysis of trends based on means from subsets of days and locations under ideal conditions (where clear sky conditions persist all day long), the subset is too small to be useful in a time series analysis. Furthermore, the relationship between the clear sky frequency and lead frequency is derived on monthly data, and is not intended to be used to derive the daily sea ice lead frequency and area. The linear model is only valid on a monthly basis, where clear sky frequencies tend to span the range of 25–55%. There are feedback relationships that would suggest that the heat and moisture fluxes associated with leads can contribute to cloud formation; the presence of clouds contributes to atmospheric warming [
34] and this warming may also make the ice more prone to fracturing. Overall, our assumption is that lead formation is still largely caused by ice deformation and stress, and is largely independent of sky cover. However, there is a distinction between the formation of leads and lead detection. Our finding is that lead detection varies as a function of clear sky fraction. The extrapolated results show that there was a larger decrease in lead area: −8800 km
2/year (
Figure 9B), and that the loss was larger than was observed: −5000 km
2/year (
Figure 8B).
Since the decrease in lead detections extrapolated from clear sky frequency (from
Figure 9, Panel B) exceeds the rate of decrease in observed lead area (
Figure 8, Panel C), the conclusion is that leads may actually be increasing, but becoming increasingly difficult to detect because of the increasing cloud cover. In
Figure 10, leads are shown to be increasing seasonally at a rate of 3700 +/− 3500 km
2 per year. April has the largest and is the only statistically significant month, with an increase of 12,100 +/−11,500 km
2 per year. The caveat is that the trends are small relative to the size of the domain and the uncertainty tends to be on the same order of magnitude as the slope. Other than the seasonal trend and April, no other months have statistically significant trends. This appears to be a result of intra-season variability, and there is an overall seasonal trend, but the month in which lead activity peaks can be any month between December to March and there is little correlation between years for what month the peak will occur. There are also some multi-year patterns to consider. From about 2004 through 2007, there was little trend, followed by an increasing trend from 2007 through 2010, then another decrease starting in 2010 through around 2016, and finally an increasing trend from 2016 through 2022. While the results are fit to a linear regression, given the cyclical nature of the results, it might not be appropriate to try to extrapolate lead trends into the future. The current upward trend may continue, or it remains possible that there will be more periods of flat or decreasing leads. Future work will have the benefit of more data points as the archive of years grows in future years. Additionally, analysis of trends on a regional or sub-seasonal basis similar to other studies [
9,
18] could be beneficial.
One of the primary advantages of our new lead detection technique [
14] is its portability between satellites, resulting in a similar detection performance between MODIS [
25] and VIIRS [
26]. Aggregate detection results from the winter of 2017–2018 (the first season after NOAA-20 became operational) through 2021–2022 are shown in
Figure 11. Of the MODIS lead detections, 28% are unique to MODIS (72% are also detecting by VIIRS). Additionally, of the VIIRS lead detections, 27% are unique to VIIRS (73% are also detected by MODIS). Approximately 57% of leads are detected by both MODIS and VIIRS, 22% are unique to MODIS, and 21% are unique to VIIRS. Further, the majority of the leads that are detected by both satellites tend to be within the continuous sea ice. In contrast, leads that are detected by one satellite tend to be along the ice edge. MODIS tends to detect more unique leads along the Greenland–Iceland–Norwegian (GIN) Seas, Barents Sea, Kara Sea, and Laptev Sea. VIIRS tends to detect more unique leads in the Chukchi Sea, Beaufort Sea, Canada Basin, and Baffin Bay areas. Future work will examine the characteristics of lead detections by MODIS and VIIRS, with the assumption that VIIRS, with finer spatial resolution (375 m), would be able to detect narrower leads than the 1 km resolution MODIS. Outside of the marginal ice zone, the initial analysis suggests there is minimal detection bias between MODIS and VIIRS. Particularly in the marginal ice zone, differences in coverage patterns (fewer overpasses at lower latitudes) and faster moving leads along the ice edge may be the primary reasons for the product differences. The MODIS mission is nearing its end-of-life and the VIIRS mission will soon be extending to JPSS-2 (the future NOAA-21), so it will become increasingly important to examine cross-platform continuity in order to develop a leads climatology that extends past the duration of the MODIS mission.
4. Discussion
The AI-based product [
25,
26] is a replacement for the earlier or legacy product [
15], developed by the same principal group of researchers. The primary difference is that AI replaced more traditional image processing techniques. The differences in the product are illustrated in
Figure 12. The AI technique [
25] has higher detection rates nearly everywhere with the exception of the marginal ice zones. Moreover, note that the product comparison is calculated where both products provide coverage; the legacy product was limited to clear sky conditions and had a scan angle limitation that prevented coverage near the North Pole. Lead detections outside of the ice pack are excluded by an ice mask [
27]. The AI technique shows fewer detections in the marginal ice zones, though not necessarily less skill because this region is prone to false detections in the legacy product. Outside of the marginal ice zone, the AI technique consistently detects more leads. The current detection model was trained primarily to detect leads within the ice pack, but a future detection model could be trained specifically for application in the marginal ice zone. Similarly, this AI model was designed for detection across the whole Arctic. It would be possible to train future models to optimize detections in specific geographic regions.
Willmes and Heinemann have a published dataset [
24] of leads from January to April of 2003 through 2015. A comparison with the AI product [
25] over the same time period is shown in
Figure 13. Our new technique, again, has a slightly higher detection rate almost everywhere inside the pack ice, whereas the Willmes and Heinemann technique has a higher detection rate in the marginal ice zones. This is further evidence that the AI product [
25] does well at detecting leads within the continuous sea ice pack, but detection performance decreases in the marginal ice zone where it becomes more difficult to differentiate between leads and ice floes, as the leads become wider and faster moving.
Reiser et al. [
14] is a follow up to the earlier Willmes and Heinemann technique, and lead detection results have been published in a dataset [
35] from 2002 through 2019. The Reiser et al. results are reported as lead frequency generalized over the entire time series; daily products from individual days have not been published. This makes direct comparisons more difficult. Two comparisons are presented in
Figure 14. If the aggregate of daily Hoffman et al.’s AI results [
25] are compared directly against that of Reiser et al.’s results, the AI technique has a lower detection rate almost everywhere, shown in
Figure 14, Panel A. However, Reiser et al.’s dataset is described as an estimate of daily lead frequency that accounts for cloud coverage. So, if Hoffman et al.’s results [
25] are limited to the subset of locations that have probably clear or confidently clear sky in the MODIS sky cover product [
32,
33], then the Hoffman technique has a higher detection rate inside the continuous sea ice and lower detection rates around the marginal ice zones. Panel B in
Figure 14 is more applicable in terms of a direct comparison. The primary disadvantage of Reiser et al.’s dataset is that by only reporting an average of leads, it makes an implicit assumption that leads are not changing with time.
5. Conclusions
The AI lead detection results are an improvement over earlier lead detection products. The advantages of the AI-based lead detection product is that it is trainable and adaptable to new instruments, which allows for climate-scale studies of leads to continue after the end of the MODIS mission using current and future VIIRS instruments. With an extensive time series of the sea ice lead products now available for the Arctic Ocean, the initial analysis shows a small but statistically significant seasonal decline in detected lead areas. Meanwhile, there is also a decrease in the amount of clear sky coverage. The key finding is that the decline in lead detection area is smaller than would be expected based on extrapolation of the relationship between lead detection frequency and clear sky frequency. Due to inter-season and intra-season variability, a non-linear representation of the slope may be more appropriate to approximate the long-term trend. Due to the non-linearity of the seasonality of the trends, the magnitude of linear regression results in a small trend that is on the same order of magnitude as the range of uncertainty. However, the conclusion is that leads are not becoming less likely to occur, but less likely to detect. After the effects of cloud cover is removed, leads are likely increasing at a rate of 3700 +/− 3500 km2 per year. Although the trends are small, they are nonetheless important in the context of a thinning Arctic Ocean sea ice cover.
Future work will examine lead characteristics (width, length, orientation) and causes of leads (wind stress, ice deformation). For example, if leads are becoming smaller or thinner, that could help explain why leads becoming harder to detect, and that could be independent of changes in cloud coverage. More rigorous work can be conducted to better approximate the relationship between clouds and leads; comparing results from microwave-based product could help quantify how often clouds obscure leads from detection as well as how often the presence of leads may contribute to the formation of clouds. Additionally, detection performance in the marginal ice zone was not prioritized when training the model; a new detection model could be trained specifically to focus on the areas that are known weaknesses of the current detection model. An alternate approach would be to extrapolate lead occurrences under cloudy or partly cloudy conditions. Ultimately, the intent here is to present a comparison of lead detection results over a twenty-year time period. The users’ application may dictate if the dataset should be used natively (as daily lead detections), combined to produce aggregate results over a time series, and/or extrapolated with the aim to minimize the detection bias caused by cloud coverage. The best representation of daily lead activity would likely be achieved from a composite of multiple sensor types. Moderate resolution infrared imagers such as MODIS and VIIRS offer wide spatial coverage, but these products could be supplemented with a cloud penetrating microwave-based product and/or an altimeter-based product.