Next Article in Journal
Effects of Prescribed Fire on Spatial Patterns of Plant Functional Traits and Spectral Diversity Using Hyperspectral Imagery from Savannah Landscapes on the Edwards Plateau of Texas, USA
Next Article in Special Issue
Urban Green Space Mapping from Sentinel-2 and OpenStreetMap via Weighted-Sample SVM Classification
Previous Article in Journal
Mass Change in Antarctica from 2002 to 2025 Using GRACE and GRACE-FO
Previous Article in Special Issue
Lane Graph Extraction from Aerial Imagery via Lane Segmentation Refinement with Diffusion Models
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Using K-Means-Derived Pseudo-Labels and Machine Learning Classification on Sentinel-2 Imagery to Delineate Snow Cover Ratio and Snowline Altitude: A Case Study on White Glacier from 2019 to 2024

by
Wai Yin (Wilson) Cheung
* and
Laura Thomson
Ice Climate and Environment Lab (ICElab), Department of Geography and Planning, Queen’s University, 68 University Avenue, Kingston, ON K7L 3N6, Canada
*
Author to whom correspondence should be addressed.
Remote Sens. 2025, 17(23), 3872; https://doi.org/10.3390/rs17233872
Submission received: 30 September 2025 / Revised: 22 November 2025 / Accepted: 27 November 2025 / Published: 29 November 2025
(This article belongs to the Special Issue AI-Driven Mapping Using Remote Sensing Data)

Highlights

What are the main findings?
  • A Random Forest classifier trained on K-means pseudo-labels reproduces spectral clustering with >99.9% fidelity, enabling automated mapping without manual annotation.
  • Automated Snowline Altitude (SLA) extraction using a high-confidence threshold ( SCR 0.8 ) achieves strong agreement with field-derived ELA ( r = 0.994 , RMSE = 30 m).
What are the implications of the main findings?
  • The hybrid workflow provides a validated, operational proxy for glacier mass balance in data-sparse Arctic regions where in situ records are absent.
  • High-threshold elevation binning mitigates mixed-pixel errors in 10 m satellite imagery, offering a validated alternative to standard thresholding methods.

Abstract

Accurate equilibrium-line altitude (ELA) estimates are a valuable proxy for evaluating glacier mass balance conditions and interpreting climate-driven change in the Canadian high Arctic, where sustained in situ observations are limited. A scalable remote-sensing framework is evaluated to extract the snow cover ratio (SCR) and snowline altitude (SLA) on White Glacier (Axel Heiberg Island, Nunavut) and to assess the agreement with in situ ELA measurements. Ten-metre Sentinel-2 imagery (2019–2024) is processed with a hybrid pipeline comprising the principal component analysis (PCA) of four bands (B2, B3, B4, and B8), unsupervised K-means for pseudo-label generation, and a Random Forest (RF) classifier for snow/ice/ground mapping. SLA is defined based on the date of seasonal minimum SCR using (i) a snowline pixel elevation histogram (SPEH; mode) and (ii) elevation binning with SCR thresholds (0.5 and 0.8). Validation against field-derived ELAs (2019–2023) is performed; formal SLA precision from DEM and binning is quantified ( ± 4.7 m), and associations with positive degree days (PDDs) at Eureka are examined. The RF classifier reproduces the spectral clustering structure with >99.9% fidelity. Elevation binning at SCR 0.8 yields SLAs closely matching field ELAs (Pearson r = 0.994 , p = 0.0006 ; RMSE = 30 m), whereas SPEH and lower-threshold binning are less accurate. Interannual variability is pronounced as follows: minimum SCR spans 0.46–0.76 and co-varies with SLA; correlations with PDDs are positive but modest. Results indicate that high-threshold elevation-bin filtering with machine learning provides a reliable proxy for ELA in clean-ice settings, with potential transferability to other data-sparse Arctic sites, while underscoring the importance of image timing and mixed-pixel effects in residual SLA–ELA differences.

1. Introduction

Glaciers are sensitive indicators of regional climate forcing, exhibiting dynamic responses to variability in air temperature and precipitation. Monitoring these responses is essential for quantifying cryospheric change and its implications for sea-level rise, regional water resources, and climate feedbacks [1,2,3,4]. In the Canadian high Arctic, logistical constraints and severe environmental conditions limit field observations. Although the Canadian Arctic Archipelago (CAA) hosts the largest glacierized area outside the Greenland and Antarctic ice sheets, sustained in situ mass balance records remain sparse, hindering robust regional assessments and the evaluation of climate models that require reliable observational constraints [5,6].
Satellite remote sensing provides synoptic observations of glacier surfaces via the following two key metrics: the snow cover ratio (SCR) and snowline altitude (SLA) [7,8,9]. These serve as proxies for the field-derived accumulation-area ratio (AAR) and equilibrium-line altitude (ELA), respectively. While SCR and SLA represent transient states, they approximate the annual AAR and ELA when derived from imagery captured at the end of the ablation season. Multi-temporal analyses exploit the established monotonic relationships between these metrics to infer interannual mass balance variability and climatic controls across diverse settings [7,9,10].
Despite their utility, the delineation of the SCR and SLA from satellite imagery remains challenging. Manual mapping is labour intensive (often requiring hours of expert analysis per scene) and is subject to operator variability that can introduce vertical uncertainties exceeding the sensor resolution [7,11]. Automated approaches, such as Otsu thresholding and the Automatic Snowline Mapping and Glacier delineation (ASMAG) workflow, reduce manual effort but rely on empirical thresholds (e.g., SCR = 0.5) that can underestimate ELA relative to in situ observations (Figure 1) [7,11,12]. Elevation binning (Bin) methods further introduce uncertainty through sensitivity to bin size, threshold selection, and post processing choices [7,12]. Alternatively, the snowline pixel elevation histogram (SPEH) method estimates SLA directly from the distribution of snow–ice boundary pixel elevations, but its performance across glacier geometries and melt conditions has not been systematically evaluated.
Machine learning (ML) approaches provide adaptive classification capabilities essential for analyzing large, high-dimensional datasets such as Sentinel-2 satellite image time series (SITS) [13]. However, the application of ML in glaciology is frequently constrained by the scarcity of labelled training data. Collecting ground truth for snow and ice in remote Arctic environments is logistically prohibitive, making fully supervised models difficult to implement without extensive manual annotation [11,14].
To address this limitation, a practical strategy is to couple unsupervised clustering with supervised classification. In this framework, K-means clustering partitions the spectral data into coherent groups (pseudo labels), which then serve as training targets for ensemble classifiers like Random Forest (RF) or Support Vector Machines (SVMs). By treating statistically coherent spectral clusters as proxy ground truth, this method automates the generation of training samples and eliminates the bottleneck of human digitization. Recent studies have demonstrated the efficacy of this approach as follows: Nambiar et al. [15] achieved high accuracy in semantic segmentation using self training, while Mirpulatov et al. [16] showed that K-means-derived pseudo-labels substantially improve land cover classification in data-sparse regions.
The objectives of this study are therefore threefold as follows:
  • Develop an operational integrated workflow that couples unsupervised K-means clustering (for automated pseudo-label generation) with supervised classification, thereby eliminating the need for manual training data in data-sparse regions.
  • Evaluate the efficacy of statistical SLA extraction methods (SPEH vs. elevation binning) and demonstrate that a high-threshold binning strategy ( SCR 0.8 ) is required to mitigate mixed-pixel errors inherent in Sentinel-2 imagery.
  • Validate the end-to-end pipeline against the long-term in situ mass balance record of White Glacier, quantifying the physical agreement between the satellite-derived SLA and the field-measured ELA.
This methodological integration offers the following distinct advantage over existing approaches: it eliminates the reliance on manual training data while simultaneously mitigating the mixed pixel bias that limits standard automated thresholding. This approach builds upon recent semi-supervised segmentation studies [14,15] by extending the analysis beyond classification accuracy to physical glaciological validation. This study develops a scalable hybrid ML workflow for White Glacier in the Canadian high Arctic, a benchmark glacier with continuous mass balance records since 1960. By quantifying SCR and SLA over melt seasons and comparing these with in situ mass balance-derived ELA, the approach seeks to improve the reliability of remote sensing-based glacier monitoring and reduce uncertainties associated with manual classification and empirical thresholding.

2. Study Site

White Glacier, located on the southern margin of Axel Heiberg Island in the Canadian high Arctic (79.75°N, 90.67°W; Figure 2), is a benchmark reference site within the Global Terrestrial Network for Glaciers (GTN-G). Continuous mass balance observations have been maintained since 1960, providing one of the longest unbroken cryospheric records in the high Arctic. These data, including annual ELA and AAR metrics, are archived by the World Glacier Monitoring Service [17] and support regional assessments of climate-driven change.
The glacier is named for its distinctively bright, debris-free ice surface. The ablation period (typically July–August) is characterized by significant surface melt in the glacier’s lower zones, exposing bare ice and intensifying mass loss. This seasonal transition is a key driver of snowline dynamics and mass balance state observable via remote sensing [7,8].
White Glacier is a valley glacier extending from 1782 m above sea level at its maximum to 85 m at its terminus. The glacier is characterized by minimal supraglacial debris and an open valley geometry that reduces topographic shading in the ablation zone. These physical attributes minimize spectral ambiguity, rendering White Glacier a suitable testbed for validating optical classification algorithms. Over the past six decades, it has experienced a significant retreat, shrinking from an area of 41.07 km2 in 1960 to 38.54 km2, underscoring its sensitivity to regional climatic variations [18]. The glacier is situated in an environment with a mean annual temperature of approximately −20 °C. Precipitation in the region ranges from 58 mm per year at sea level, as recorded at Eureka (100 km to the east), to 370 mm per year at 2120 m above sea level [18]. These climatic and environmental conditions, frequented by cloud-free skies during key periods, provide a well-suited setting for optical and multispectral remote sensing analyses.

3. Data

3.1. Optical Imagery

The Harmonized Sentinel-2 Level-2A (L2A) Surface Reflectance (SR) dataset, available as COPERNICUS/S2_SR_HARMONIZED in the Google Earth Engine (GEE) catalogue, was selected for this study. This product is derived from Sentinel-2 Level-1C (L1C) Top-of-Atmosphere (TOA) reflectance data through the Sen2Cor algorithm, which applies atmospheric correction to produce surface reflectance values. This step is critical for accurate snow cover mapping, as it minimizes atmospheric scattering effects that can alter the spectral signature of snow and ice [19]. The harmonized collection ensures radiometric consistency for time-series analysis by adjusting the digital number (DN) range of scenes processed after 25 January 2022 (baseline ‘04.00’) to match older acquisitions, thereby mitigating processing artifacts.
The following four spectral bands with 10 m spatial resolution were used for analysis: Blue (B2), Green (B3), Red (B4), and Near-Infrared (NIR, B8). These bands are standard for glaciological applications; the visible bands (B2–B4) facilitate the discrimination of snow from other bright surfaces via albedo and spectral slope contrasts, while the NIR band (B8) is critical for separating snow, glacier ice, and liquid water.
Cloud masking was performed using the Sentinel-2 QA60 quality bitmask. During the data availability gap in QA60 (25 January 2022 to 28 February 2024), the MSK_CLASSI_OPAQUE and MSK_CLASSI_CIRRUS masks were applied to identify opaque and cirrus clouds, respectively.
The study period spans from 15 June 2019 to 30 August 2024, covering six ablation seasons. The Sentinel-2 constellation offers a nominal 5-day revisit interval, which is further enhanced in high latitudes by orbital swath overlap. This high temporal resolution is sufficient to capture rapid fluctuations in snow extent and SLA in glacierized terrain [20]. The final dataset comprises 388 cloud-free images suitable for classification; annual acquisition counts are detailed in Table 1.

3.2. Digital Elevation Model

The ArcticDEM mosaic used for White Glacier in this study is a product generated from data spanning 2011–2021. This elevation dataset, based on the ArcticDEM Version 4.1 Mosaic, was produced by the Polar Geospatial Center (PGC) at the University of Minnesota using stereoscopic imagery from the Maxar optical satellite constellation [21,22]. With a nominal spatial resolution of 2 m, the dataset provides elevation information for determining snowline altitudes. Although ArcticDEM products have a reported general vertical accuracy of approximately 4 m [21], optical saturation in high-reflectivity accumulation zones can introduce localized elevation errors. To ensure data reliability prior to application, the internal precision of the DEM was assessed using the associated Median Absolute Deviation (MAD) quality layer. A zonal analysis restricted to the glacier surface yielded a mean MAD value of 0.58 m. This sub-pixel consistency indicates that the multi-temporal median-stacking algorithm successfully mitigated saturation-induced artifacts, confirming the DEM’s suitability for this study.

3.3. Glacier Outline

The outline of White Glacier (RGI2000-v7.0-G-03-02830) was obtained from the Randolph Glacier Inventory (RGI) Version 7.0 [23]. The RGI provides vector outlines delineating glacier extents at a specific point in time [24]. In this study, the RGI outline served two primary purposes as follows: (1) defining the spatial extent of White Glacier for focused analysis of snow cover and snowline altitude, and (2) masking non-glaciated areas to ensure accurate snow cover classification and analysis.

3.4. Mass Balance Record

Direct (glaciological) mass balance observations for White Glacier have been collected annually for several decades, with derived ELA and AAR reported for each balance year. Measurements are acquired on a seasonal schedule to quantify accumulation and ablation, and annual summaries are compiled following standard WGMS protocols. The complete White Glacier dataset, including ELA and AAR, is available through the WGMS and is used in this study for validation [17].

3.5. Meteorological Record

Daily meteorological observations were obtained from the Environment and Climate Change Canada (ECCC) Historical Climate Data portal (https://climate.weather.gc.ca/, accessed on 13 August 2025) for Eureka (Ellesmere Island, Nunavut), the nearest long-term automatic weather station (AWS) to White Glacier. Positive degree-day (PDD) sums (°C·d) were computed for each mass balance year (1 October–30 September) as the sum of daily mean air temperature above 0 °C:
PDD = d = 1 N max 0 , T d
where T d is the daily mean air temperature (°C) on day d and N is the number of days in the balance year; days with T d 0 °C contribute zero.

4. Method

This study proposes a hybrid, semi-supervised machine learning framework designed to overcome the scarcity of ground-truth training data in glacial environments. The workflow integrates unsupervised clustering with supervised classification to automate the delineation of the SLA from high-resolution Sentinel-2 imagery. As illustrated in Figure 3, the methodology comprises the following four primary algorithmic stages: (1) data acquisition and pre-processing; (2) unsupervised K-means clustering to generate pseudo-labels; (3) supervised snow–ice classification guided by the pseudo-labels; and (4) annual SLA delineation using the minimum-SCR date.

4.1. Stage 1: Data Acquisition and Pre-Processing

Sentinel-2 Level-1C top-of-atmosphere reflectance data were acquired for White Glacier via the Google Earth Engine (GEE) application programming interface (API). The temporal analysis window was restricted to the ablation season (15 June–31 August) to capture the maximum retreat of the snowpack. Scenes exhibiting cloud cover > 20 % were discarded. For every valid acquisition, the native 10 m bands—blue (B2), green (B3), red (B4), and near-infrared (B8)—were extracted to characterize the glacial surface.

4.2. Stage 2: Unsupervised K-Means Clustering and Pseudo-Labelling

To prepare the spectral data for unsupervised labelling, a feature vector x was constructed for each pixel i. To mitigate spectral redundancy and computational cost prior to clustering, Principal Component Analysis (PCA) was applied, retaining the first two principal components ( P C 1 , P C 2 ). The high-dimensional spectral space R 4 was projected onto a lower-dimensional subspace R 2 containing the two principal components as follows:
Z = X W k
where X is the centred data matrix and W k is the matrix of the top k = 2 eigenvectors. While PC1 (> 99 % variance) captures the dominant albedo signal, PC2 ( 0.31 % variance) was retained because it encodes the spectral contrast between visible and near-infrared wavelengths. This chromatic information is essential for distinguishing surface types with similar brightness but different physical properties (e.g., wet snow vs. clean ice) [25,26].
To generate training samples without manual annotation, the workflow employs K-means clustering on the PCA-transformed feature space. The algorithm partitions the pixel set into K = 4 clusters. Although a statistical Elbow Method analysis suggested an inertia plateau at k = 3 , empirical validation demonstrated that this configuration failed to resolve critical internal snowpack variance, often conflating wet snow with ice. Consequently, K = 4 was selected as the optimal configuration to distinguish between Ground, Ice, and two spectral states of Snow. This choice ensures the separation of distinct snow facies (Table 2), which are later aggregated into a single ‘Snow’ class—without over-segmenting the ‘Ground’ class. The objective function J is defined as follows:
J = j = 1 K i = 1 n x i ( j ) μ j 2
where x i ( j ) represents a pixel vector assigned to cluster j, and μ j is the centroid of cluster j. The resulting cluster assignments serve as ‘pseudo-labels’, providing an initial structural estimation of surface classes. To ensure the distinctness of these clusters prior to their use as ground truth, the Silhouette Coefficient (s) was computed for each sample.

Class Balancing and Resampling

Glacial environments exhibit inherent class imbalance, where dominant classes (e.g., Ground or Ice) can vastly outnumber specific snow facies. To mitigate this bias and optimize training data for the subsequent supervised classifier, three resampling approaches were evaluated as follows:
  • Random Resampling: Samples were selected randomly from each cluster without regard for distribution.
  • Stratified Resampling: This approach ensured the proportional representation of each cluster based on its size. By maintaining a balance among clusters, stratified resampling effectively addresses the imbalance problem.
  • Cluster Resampling: A random subset of samples was extracted within each cluster, capping the maximum number of samples from larger clusters to mitigate their dominance while preserving sufficient variability.
All resampling methods were implemented with a fixed training set size of 100,000 pixels. Empirical evaluations of these resampling strategies were conducted by analyzing key performance metrics, such as accuracy and F1-score, in the final supervised classification task.

4.3. Stage 3: Supervised Snow–Ice Classification

The pseudo-labels generated in Stage 2 served as training targets for the supervised classification stage. The workflow employs a RF classifier, utilizing an ensemble of B decision trees to mitigate overfitting. For an input vector x, the RF prediction y ^ is determined by the mode of the ensemble output as follows:
y ^ = mode { h 1 ( x ) , h 2 ( x ) , , h B ( x ) }
where h b ( x ) is the prediction of the b-th tree. The model was configured with B = 50 trees and a maximum depth of 10. To ensure temporal consistency and minimize overfitting to specific atmospheric conditions, the training dataset was constructed using sampling from the entire 2019–2024 image time series. This generated a global spectral feature space, ensuring that the classifier learned decision boundaries robust to interannual variability. A SVM was also implemented as a comparative baseline.
Training the classifier on K-means outputs functions as a knowledge distillation process. Rather than predicting independent ground truth, the RF is designed to learn a robust, generalized decision boundary that reproduces the spectral partitioning of the global K-means model across the full spatial extent of the glacier.
Classification performance was evaluated through standard metrics including hlreproduction fidelity (agreement with pseudo-labels), confusion matrices, and the F1-score. The F1-score was prioritized as the harmonic mean of Precision and Recall, providing robustness against false positives and negatives in imbalanced datasets as follows:
F 1 = 2 · Precision · Recall Precision + Recall
Beyond numerical metrics, a visual comparison was conducted by overlaying the classification results on composite Sentinel-2 images to assess spatial accuracy and contextual relevance.

4.4. Stage 4: Annual End-of-Ablation SCR and SLA Delineation

The trained classifier was applied to the full Sentinel-2 time series. For each scene, the SCR was computed as follows:
S C R = N snow N snow + N ice × 100
where N represents the pixel count within the glacier mask. The date corresponding to the seasonal minimum SCR ( S C R m i n ) was identified as the end-of-ablation epoch. To derive the SLA from the S C R m i n image, two complementary hypsometric methods were implemented.
1.
Statistical Mode (SPEH)
Adapted from Aberle et al. [27], this method analyses the frequency distribution of the snow–ice boundary. A binary classification mask was generated to separate snow/ice pixels from non-glacial surfaces. The snow–ice boundary was then refined with a two-step morphological procedure (dilated ice mask intersection followed by morphological erosion) to suppress edge artifacts.
To minimize noise and ensure physiographic relevance, snow–ice boundary segments containing fewer than 100 valid boundary pixels or occurring below 900 m a.s.l. were excluded from analysis. The 100-pixel filter removes “salt-and-pepper” classification noise (≈1 ha), while the 900 m elevation floor is based on the long-term mass balance record of White Glacier, which indicates that the ELA has consistently remained above this altitude. This constraint prevents the algorithm from erroneously identifying late-lying snowbanks or river icing (aufeis) in the terminus zone as the snowline.
Elevation for the refined boundary pixels was sampled from ArcticDEM v4.1, co-registered to the Sentinel-2 imagery and resampled to the same 10 m grid. Pixels with invalid or missing elevations (NaN) were removed prior to subsequent analyses.
Boundary pixels transitioning from snow to ice were extracted, and their elevations (derived from co-registered ArcticDEM v4.1) were grouped into 5 m intervals to construct a histogram H ( z ) . The SLA was defined as the mode of this distribution as follows:
S L A S P E H = argmax z ( H ( z ) )
This metric was chosen for its robustness against outliers and localized classification noise, offering a stable estimate that reflects the dominant elevation of the snow–ice transition zone (Figure 4).
2.
Elevation Binning ( SLA b i n )
Adapted from Rastner et al. [7], this method relies on discretizing the glacier hypsometry into elevation bins b of height Δ z = 10 m. For each bin, a local SCR ( S C R b ) was calculated as the proportion of snow-covered pixels relative to the total number of ice-and-snow pixels in that bin. The SLA was determined as the lowest elevation bin where the snow cover consistently exceeds a threshold τ as follows:
S L A bin = min { z b S C R b > τ }
where τ was tested at 0.5 and 0.8. The threshold τ = 0.5 was selected as a baseline following standard automated protocols (e.g., ASMAG; [7]). The stricter threshold τ = 0.8 was introduced to enforce a higher fraction of snow within the elevation bin. This conservative threshold acts as a filter for the spatially coherent snowline, rejecting transitional zones (e.g., patchy firn or mixed pixels) that frequently trigger false positives at lower thresholds.

Validation and Visualization

Delineated snowlines were validated by overlaying the mapped snow–ice boundary on ArcticDEM topography and on Sentinel-2 false-colour composites to confirm consistency with observed surface features (Figure 5). Independent validation was performed by comparison with annual, in situ field-derived ELA. This combined approach, comprising visual inspection against topography and imagery alongside a comparison to field-derived ELA, supports the reliability of the SLA estimates.

4.5. Uncertainty Assessment

Multiple factors contribute to the overall uncertainty in the SLA estimates derived for White Glacier in this study, outlined as follows:

4.5.1. DEM Vertical Accuracy

This study utilizes the ArcticDEM Version 4.1 dataset, which has a reported vertical accuracy of approximately ± 4 m. It is important to note that this value might not fully reflect local variations due to differences in topography, slope, or satellite viewing angles, potentially leading to localized inaccuracies in SLA estimation.

4.5.2. Statistical Variability in Snow–Ice Boundary Delineation

The snow–ice boundary was aggregated into 5 m elevation bins for SLA metric calculation. While this method mitigates the impact of minor outliers, it introduces an inherent variability of up to ± 2.5 m (half the bin size). This binning strategy might obscure subtle transitions at the boundary, thus adding to the uncertainty in precise SLA determination.

4.5.3. Temporal Offset Between DEM and Imagery

A significant source of uncertainty arises from the temporal mismatch between the ArcticDEM V4.1 mosaic, covering 2011 to 2021, and the more recent Sentinel-2 imagery used for snowline detection. Elevation dynamics relating to thinning, accumulation, and mass transfer through the glacier system during this period could introduce discrepancies in elevation data. However, these unquantified shifts are excluded from uncertainty calculations, as their effect is deemed minor compared to other sources and challenging to measure.

4.5.4. Combined Uncertainty Calculation

The combined vertical uncertainty for the SLA is expressed using a root-sum-square approach that incorporates the known error sources. The equation is given by the following:
σ SLA = ( σ DEM ) 2 + ( σ bin ) 2
where the following hold:
  • σ SLA is the total SLA uncertainty from the known vertical and binning error sources.
  • σ DEM is the vertical accuracy of the ArcticDEM (approximately ± 4 m).
  • σ bin is half the width of the 5 m elevation bin used for statistical aggregation (i.e., ± 2.5 m).
Substituting the known values into the equation
σ SLA = ( 4 m ) 2 + ( 2.5 m ) 2 16 + 6.25 22.25 4.7 m
Thus, based on the known vertical accuracy of the DEM and the binning strategy for delineating the snow–ice boundary, the combined uncertainty is approximately ± 4.7 m.

5. Results

5.1. Dimensionality Reduction and Variance Retention for the Pseudo-Labels

The four-band Sentinel-2 feature space—blue (B2), green (B3), red (B4) and near-infrared (B8) reflectance, all at 10 m—was reduced to two principal components (PCs) using PCA. The first two PCs retained 99.96% of the variance (PC1: 99.65%; PC2: 0.31%), indicating negligible information loss for subsequent clustering and classification. Given the strong inter-band correlation, PCA primarily reoriented the feature space into (i) a broadband albedo axis and (ii) a minor chromatic contrast. PC1, accounting for 99.65% of the variance, represents an albedo gradient as follows: high values correspond to bright snow or clean ice, while low values indicate darker surfaces such as debris-covered ice or exposed ground. PC2 captures a weak contrast between visible and near-infrared reflectance, marginally enhancing separability among bright surfaces (e.g., wet/coarse versus dry/fine snow). The clustering structure shown in Figure 6b is predominantly resolved along PC1, suggesting a near-univariate decision boundary.
K-means clustering with k = 4 on the PCA scores yielded compact, well-separated groups (mean silhouette coefficient s ¯ = 0.83 ; Figure 6). Class labels were assigned post hoc based on spectral characteristics and image context. The emergence of two distinct snow clusters (Cluster 1 and Cluster 3) validates the choice of k = 4 . These clusters likely capture internal variance within the snowpack, such as the transition between fine-grained dry snow and coarse-grained wet snow, or differences in surface illumination. This separation provides a more refined training target for the supervised classifier than a monolithic “snow” class. A summary of cluster populations is provided in Table 2.

5.2. Pseudo-Label Resampling and Classifier Performance

The comparative analysis of three resampling methods—Random Resampling, Cluster Resampling, and Stratified Resampling—on the performance of RF and SVM classifiers is summarized in Table 3. The metrics reported here represent the reproduction fidelity of the supervised models relative to the K-means pseudo-labels, rather than the accuracy against independent ground truth. All methods achieved high fidelity (>0.999) and perfect F1-scores (1.00) for both RF and SVM classifiers. This indicates that the supervised models successfully learned the spectral partitioning defined by the unsupervised clustering. Cluster Resampling slightly outperformed the others with a RF fidelity of 0.999 and was the fastest in terms of RF classification time at 1.7 s. In contrast, SVM classification times were substantially longer, ranging from 113.6 s for Random Resampling to 159.2 s for Cluster Resampling. This suggests that while the resampling method impacts clustering quality, it has minimal effect on classification performance, with computational efficiency being a distinguishing factor, particularly highlighting the efficiency of RF over SVM.

5.3. SCR Trends and SLA Variability

The analysis of White Glacier for 2019––2024 shows pronounced interannual variability in both SCR and SLA, reflecting sensitivity to summer weather (Figure 7). The annual minimum SCR (end of ablation season) ranged from 0.46 in 2023 to 0.76 in 2021, with minimum-SCR dates varying from 18 August 2023 to 6 July 2024.
The interannual variability tracks regional climatic forcing. The anomalous conditions of 2021, characterized by the highest SCR (0.76) and lowest SLA (967 m), correspond to a regionally documented cool summer with persistent cloud cover and suppressed ablation. Conversely, the low SCR (0.46) and high SLA (1132 m) observed in 2023 reflect a high-melt season driven by elevated summer air temperatures and extended clear-sky conditions. This inverse relationship between SCR and thermal forcing underscores the sensitivity of White Glacier’s accumulation area to summer temperature anomalies.
Interpretation for 2024 is constrained by the limited image availability during the melt season. Field observations indicated an early onset of summer-like conditions, consistent with the early-season SCR decline, followed by unseasonably cool weather and intermittent snowfall that increased SCR. Overall, summer 2024 was unusually cool relative to other study years.
Table 4 summarizes end-of-season SCR, field-measured AAR, and SLA for 2019–2024. Interannual variability in both snow cover and snowline position is pronounced, reflecting changing summer ablation conditions.
The minimum SCR declined from 0.54 (2019) to 0.46 (2023) before rebounding to 0.60 in the partial 2024 record. Consistently, field AAR peaked at 0.62 in 2021—coincident with the greatest snow retention—and fell to 0.20 in 2023, indicating an extensive ablation area.
SPEH-derived SLAs ranged from 967.76 ± 4.7 m in 2021 to 1142.39 ± 4.7 m in 2020. Lower SCR/AAR years generally correspond to higher SLAs, consistent with the inverse relationship between accumulation area and snowline height. In 2023, the lowest SCR (0.46) and AAR (0.20) coincided with a high SLA SPEH ( 1132.98 ± 4.7 m ) and the highest field ELA (1380 m), indicating strong melt (see also Figure 8).
SLA bin at thresholds of 0.5 and 0.8 fall within 20–200 m of the SPEH values. For example, in 2023 the Bin 0.5 SLA ( 1170 ± 4.7 m ) is 37 m below the SPEH mode, whereas Bin 0.8 ( 1350 ± 4.7 m ) closely matches the field-derived ELA. Agreement among SLA SPEH , SLA bin , and ELA supports the robustness of the snowline delineation framework.
Overall, years with reduced snow cover and accumulation (e.g., 2020 and 2023) produced markedly elevated snowlines, while years with greater accumulation (e.g., 2021) yielded lower snowlines. The close correspondence between remotely-sensed SLA ( SLA SPEH and SLA bin ) and in situ ELA validates the efficacy of the snowline extraction methods for White Glacier.

5.4. Comparison of Fieldwork-Derived ELAs and Remote Sensing SLAs

The accuracy of remote sensing-based snowline extraction was assessed by comparing SLA from the SPEH and from elevation-bin method; thresholds SCR 0.5 and SCR 0.8 with field-derived ELAs for 2019–2023 (Figure 8; Table 5). All methods show strong positive correlations with in situ ELA, indicating that interannual variability in snowline position is captured effectively. SPEH yields r = 0.875 ( p = 0.052 ). The elevation-bin method strengthens as the threshold increases, with r = 0.983 ( p = 0.0027 ) for SCR 0.5 and r = 0.994 ( p = 0.0006 ) for SCR 0.8 . The near-perfect agreement at SCR 0.8 suggests that high snow cover thresholds most reliably isolate the true snow–ice transition.
Root-mean-square error (RMSE) and Mean Bias Error (MBE) further differentiate the methods. SPEH shows the largest deviation (RMSE = 231.9 m). The elevation-bin method at SCR 0.5 reduces RMSE to 190.1 m but it exhibits a large negative bias ( MBE 187 m). The method at SCR 0.8 yields the best performance, with the lowest RMSE (30.0 m). However, it is notable that even at this high threshold, the SLA systematically underestimates the ELA (SLA < ELA) with an MBE of 23.4 m. This consistent negative offset suggests that the satellite-derived snowline generally sits slightly lower in elevation than the mass-balance equilibrium line.

5.5. Relation to Interannual Climate Forcing

To examine climatic forcing, annual PDD sums (°C·d) at Eureka were compared with snowline metrics ( SLA speh , SLA bin and field-derived ELA) (Figure 8; Table 5 and Table 6). PDD peaks in 2019 (729.6 °C·d) and reaches its minimum in 2024 (505.6 °C·d). These thermal maxima and minima broadly mirror the snowline variability: high PDD years (2019, 2023) coincide with elevated SLAs, while low thermal forcing (2021, 2024) corresponds to extensive snow retention (Figure 8). Pearson correlations over 2019–2024 show weak-to-moderate positive associations between PDD and SLA bin , strongest for SCR 0.8 ( r = 0.524 , p = 0.286 , n = 6 ), and weaker for SCR 0.5 ( r = 0.319 , p = 0.538 , n = 6 ); SLA speh shows little linear association ( r = 0.095 , p = 0.858 , n = 6 ). For field-derived ELAs (2019–2023 only), the association is positive but not significant ( r = 0.445 , p = 0.452 , n = 5 ) (Table 6). The modest correlations likely reflect the short record length, spatial representativeness of Eureka as a proxy for White Glacier, and additional controls (accumulation/precipitation, winds, and late-season cloud/snow affecting SLA retrievals). A longer record and a closer-site temperature index would improve the signal-to-noise ratio.

6. Discussion

The integration of a hybrid machine learning approach, combining K-means-derived pseudo-labels with a RF classifier, has demonstrated efficacy in accurately delineating the annual SCR and SLA on White Glacier. This approach highlights both significant advantages and inherent limitations that should guide its refinement and broader application in glaciological monitoring.

6.1. Advantages and Limitations of Hybrid ML in Glacier Monitoring

The primary advantage of this hybrid framework is its reduced reliance on extensive field-validated datasets, addressing a persistent limitation in Arctic remote sensing [11,14]. By applying K-means clustering to the full Sentinel-2 spectral dataset, the workflow generates a large volume of pseudo-labels. This approach significantly decreases the need for labour-intensive manual annotation.
Furthermore, the unsupervised clustering effectively captures within-class variability across surface types such as snow, ice, and debris. This process yields spectrally representative training data for the RF classifier, which subsequently achieved a reproduction fidelity exceeding 99.9%. These results are consistent with observed surface conditions and highlight the computational efficiency of the RF model compared to SVM; notably, RF classification runtimes were reduced by an order of magnitude.
Despite these advantages, a limitation of this approach is the reliance on pseudo-labels for training and evaluation. The high classification metrics reported (Table 3) indicate that the RF model effectively reproduces the K-means spectral clustering, but they do not quantify pixel-level errors against independent, manually digitized ground truth. Consequently, the physical validity of the classification relies on the downstream agreement between the derived SLA and the field-measured ELA (Table 5), rather than on direct pixel-by-pixel validation.
Additionally, the generation of pseudo-labels imposes computational demands due to the large data volumes involved, often resulting in redundant labels or outliers, even after resampling optimizations. Although classification accuracy remained consistently high across resampling schemes, the quality of input imagery remains a critical factor, particularly with respect to cloud cover and shadow contamination. Moreover, the method exhibits limitations in distinguishing spectrally similar surface types, such as snow and superimposed ice or firn, where spectral overlap can lead to subtle misclassifications. While the clustering approach captures heterogeneous surface conditions more effectively than fully supervised methods, further methodological refinement is required to resolve these ambiguities.
From a glaciological perspective, it is important to note that while the derived ELA is strongly correlated with annual mass balance, it serves as a proxy rather than a complete indicator. This method accurately delineates the transition boundary but cannot determine the magnitude of accumulation or ablation occurring respectively above and below the ELA. Finally, the operational success of this framework is strictly reliant on the availability of high-quality, cloud-free optical imagery during the end-of-season window; persistent cloud cover remains a fundamental constraint that may necessitate the integration of SAR data for all-weather monitoring.

6.2. Optimizing Snowline Altitude Extraction

Glacier surfaces in the Arctic exhibit complex topography, including steep slopes, ridges, and variable aspects. These topographic factors dictate solar radiation loading, leading to spatially heterogeneous melt patterns (e.g., differential ablation on sun-exposed versus shaded slopes) that often confound static snowline mapping techniques [11]. The present results demonstrate that embedding a statistical validation step into snowline extraction markedly improves accuracy by smoothing these local radiative effects. Elevation binning of snow–ice boundary pixels at 5 m intervals, followed by summary statistics, effectively reduces the influence of outliers and anomalous pixels driven by local topography.
The SPEH method alone achieves a respectable yet borderline-significant correlation with field-derived ELA ( r = 0.875 , p = 0.052 ) but yields a high RMSE of 231.9 m. Introducing an elevation-bin threshold at a SCR of 0.5 raises correlation to r = 0.983 ( p = 0.0027 ) and lowers RMSE to 190.1 m (Table 5), indicating substantial error reduction. A more stringent threshold of SCR = 0.8 yields near-perfect agreement with in situ ELA ( r = 0.994 , p = 0.0006 ) and a substantially reduced RMSE of 30.0 m. These results indicate that elevation-bin filtering, especially at higher SCR thresholds, reliably isolates the snow–ice transition zone, even under interannual variability (2019–2023). The integration of high-fidelity classification with elevation-stratified filtering thus provides a validated framework for SLA extraction in complex Arctic terrain.
As shown in Table 5, increasing the SCR threshold from SCR 0.5 to SCR 0.8 improves agreement with field-derived ELA, reducing the RMSE from 190.1 m to 30.0 m and improving the correlation from r = 0.983 to r = 0.994 . This improvement reflects a substantial reduction in the systematic negative bias associated with lower thresholds.
To quantify this effect, Table 7 lists the year-wise residuals Δ ELA = SLA RS ELA field (from Table 4) for both SCR 0.5 and SCR 0.8 .
These residuals confirm the following:
  • At SCR 0.5 , SLARS persistently underestimates ELA by approximately 150–220 m, indicating a strong negative bias likely attributable to mixed-pixel effects near the snow–ice boundary.
  • At SCR 0.8 , residuals are smaller and more symmetrically distributed around zero (e.g., +4 m in 2019, and −14 to −53 m in other years), resulting in a mean bias of −23.4 m and RMSE of approximately 30 m.
Because SLARS increases monotonically with an increase in SCR thresholds, further raising the threshold (e.g., to 0.85–0.90) would likely result in overestimation of SLARS and increase sensitivity to classification noise due to reduced sample sizes near the snow–ice transition. Given that SCR 0.8 already yields minimal bias and high stability, it represents an effective trade-off between bias and variance. Evidence from Table 5 and Table 7 supports SCR 0.8 as a near-optimal threshold for SLA mapping at White Glacier.

6.3. Validation of SLARS Against Field-Derived ELAs

Validation of remotely sensed snowline altitude (SLARS) using the elevation-bin method at a SCR threshold of 0.8 against field-derived ELAs on White Glacier (2019–2023) yielded a near-perfect Pearson correlation ( r = 0.994 , p = 0.0006 ) and a RMSE of 30.0 m (Table 5). Such a strong relationship confirms the efficacy of high-threshold SCR binning for capturing interannual SLA variability in Arctic optical imagery and aligns with previous reports of robust SLA–ELA agreement [7,8,12]. Despite the strong correlation, SLARS consistently underestimated field-derived ELAs by an average of 30 m in this study. This systematic bias is well documented in the literature, with typical SLA underestimations ranging from 10–30 m and RMSE values of 40–60 m [28], while a pan-Arctic analysis found a mean SLA–ELA discrepancy of approximately −100 m [10].
The dominant contributor to this bias is the temporal mismatch between image acquisition and the true end of the ablation season. Satellite imagery often fails to capture the glacier surface precisely at the end of the ablation season, instead recording transient snowlines during late-season accumulation or refreezing events [8]. Even small timing differences—in the order of days to weeks—can cause SLARS to fall tens of metres below the in situ ELA [10].
In addition, subpixel mixing at the snow–ice boundary further biases the mapped snowline downward. In moderate-resolution imagery, a pixel straddling snow and bare ice can be classified as snow-covered, effectively placing the snowline too far into the ablation zone (e.g., a 30 m Landsat-class pixel). Additionally, thin snow layers, superimposed ice and wet snow exhibit high reflectance in Sentinel-2 bands, promoting premature delineation of the snow–ice transition [29,30]. Late-season snowfall or lingering firn/superimposed-ice patches can make low elevations appear snow-covered even where net balance is near zero or negative [7]. These effects encourage automated classifiers to map snow where bare ice had emerged, thereby lowering the derived SLA.
Topographic shading and cloud contamination introduce additional uncertainty. Shadows can obscure snow, and bright cloud or cloud-shadow artifacts may be misclassified as snow, either masking the true snowline or shifting it downward [7,28].
Errors in the DEM used to assign elevations to mapped snow pixels also propagate into SLA estimates. Vertical uncertainties and co-registration offsets on the order of 10–20 m can shift SLARS by several metres [7]. Use of DEM mosaics (e.g., 2011–2021 composites) may introduce temporal mismatches with recent surface change, although such effects are likely random rather than systematic [21].
Taken together, the observed mean bias of −23.4 m and overall accuracy (RMSE = 30 m) are consistent with previously reported SLA–ELA agreement Rastner et al. [7] (±19 m) and Li et al. [12] (±24 m), underscoring the importance of acquisition timing and surface-condition variability in SLA–ELA comparisons. In years of extreme melt or rapid topographic change, these offsets can increase substantially [31]. Future work should prioritize tighter temporal matching to the end of ablation, contemporaneous DEMs, and the integration of spectral–angular constraints (e.g., radiative-transfer or multi-angular methods) to mitigate misclassification. Despite the noted challenges, the very high correlation and modest, interpretable bias demonstrate that Sentinel-2 imagery combined with elevation-bin statistical filtering provides a repeatable and scalable proxy for in situ ELA in data-sparse Arctic settings.

6.4. Methodological Robustness and Parameter Selection

The architecture of the classification pipeline necessitates specific parameter selection to balance statistical stability with glaciological representativeness.
Cluster Selection (k = 4): Although statistical heuristics (specifically the Elbow Method) indicated an inertia plateau near k = 3 , empirical validation demonstrated that this configuration conflated distinct snow facies, often merging wet snow with ice. Conversely, increasing to k 5 introduced noise by over-segmenting coherent features, such as topographic shadows. Consequently, k = 4 was identified as the optimal configuration to resolve Ground, Ice, and two spectral states of Snow without inducing artificial segmentation.
PCA Dimensionality: While PC1 dominates the variance (> 99 % ) and represents broadband albedo, the retention of PC2 ( 0.31 % ) was essential. This second component encodes the spectral gradient between visible and near-infrared wavelengths—a critical feature for distinguishing wet snow from clean ice that is typically lost in single-component models.
Supervised Learning and Leakage: Random Forest hyperparameters ( B = 50 , max depth 10) were explicitly constrained to prevent the memorization of pixel-level noise inherent in the pseudo-labels. It is acknowledged that training the RF on K-means outputs constitutes information sharing (“leakage”) by design. Within this semi-supervised framework, the high fidelity metrics (Table 3) confirm that the RF successfully distilled the global clustering structure into a generalized decision boundary, rather than overfitting to local artifacts.

6.5. Uncertainties and Residual Bias

Uncertainties in the derived snowline altitudes arise from a combination of atmospheric conditions, sensor resolution, and the temporal offset between satellite acquisition and glaciological events. Although cloud screening using the Sentinel-2 quality-assurance band (QA60) reduced classification errors, thin clouds and topographic shadows occasionally produce subtle misclassification [9]. Consequently, continued development of cloud/shadow masks and the integration of multi-sensor data (e.g., synthetic aperture radar, thermal infrared) are recommended to better capture ephemeral refreezing events and resolve atmospheric artifacts.
A significant source of systematic bias lies in the temporal mismatch between the satellite image acquisition and the stratigraphic end of the ablation season. The “Minimum SCR” date derived from satellite imagery serves as a proxy for the end of the melt season but rarely coincides perfectly with the true transition. In the high Arctic, ablation often continues into late August or early September, potentially occurring after the last cloud-free optical acquisition. As a result, the satellite-derived SLA frequently captures a transient snowline that has not yet reached its maximum elevation, contributing to the observed negative bias (systematic underestimation) relative to the field-derived ELA.
Structural uncertainty is introduced by the spectral ambiguity of superimposed ice (SI) and firn layers. Koerner [32] emphasized the influence of SI on sub-polar glacier mass balance, highlighting the difficulty of delineating the late-season snowline where refrozen meltwater obscures the transition. Similarly, Woodward et al. [33] demonstrated that superimposed-ice formation can confound both optical and radar-derived snowline mapping. In the context of this study, SI is often spectrally indistinguishable from bare ice or wet snow depending on its surface roughness and water content. If the classifier groups SI with glacier ice rather than the accumulation zone, the derived SLA will be overestimated; conversely, if SI is classified as snow, the SLA will be underestimated. This spectral ambiguity represents a physical limitation that standard multispectral classification cannot fully resolve without ancillary data.
Spatial resolution limitations at 10 m introduce mixed-pixel effects, particularly at the snowline where pixels often contain a heterogeneous mixture of firn, bare ice, and shadow. The influence of this sub-pixel mixing is underscored by the sensitivity of the SLA to the binning threshold, where the RMSE reduced from 190 m at SCR 0.5 to 30 m at SCR 0.8 . A standard majority threshold ( τ = 0.5 ) systematically classifies these mixed pixels as snow, artificially lowering the SLA. The necessity of the higher threshold confirms that a strict “dominant snow” criterion is required to filter out this sub-pixel noise. Additionally, the use of a static ArcticDEM mosaic (derived largely from 2011–2017 data) to extract elevations for 2019–2024 imagery introduces a vertical error component due to surface drawdown. However, field observations suggest that thinning magnitude in the accumulation zone is secondary to the timing and mixed-pixel errors described above.
Finally, a limitation of this study is the absence of independent quantitative validation at the pixel level (e.g., Kappa coefficients derived from manual digitization), which was restricted by the scarcity of contemporaneous high-resolution data. While the RF classifier demonstrates high internal consistency with the pseudo-labels (Table 3), this does not guarantee immunity to semantic errors, such as the misclassification of firn as ice or topographic shadows as wet snow. However, the impact of these potential pixel-level errors is mitigated by the aggregation strategy used to derive the SLA. The elevation binning method ( SCR 0.8 ) effectively acts as a spatial filter, requiring a consistent snow cover fraction across the glacier width to define the snowline. The strong correlation with independent field-derived ELA ( r = 0.994 ) suggests that while pixel-level noise may exist, the aggregate snowline determination remains robust and physically representative.

6.6. Applicability to Complex Glacier Terrain

While the proposed workflow demonstrates high efficacy for White Glacier, it is important to contextualize this success within the glacier’s morphology. White Glacier is a predominantly clean-ice system with high spectral contrast between the glacier surface and the surrounding bedrock. For debris-covered glaciers or those with complex topography, the current feature space (Sentinel-2 bands) may be insufficient to distinguish supraglacial debris from periglacial terrain. To transfer this semi-supervised approach to such environments, specific adaptations are recommended:
  • Spectral Expansion: Incorporating Shortwave Infrared (SWIR) bands (e.g., Sentinel-2 B11/B12) and thermal data (e.g., Landsat TIRS) would enhance the discrimination between debris-covered ice and bedrock based on mineral composition and surface temperature.
  • Topographic Features: Integrating DEM derivatives, such as slope, aspect, and illumination corrections, into the feature vector could reduce misclassification in deeply shadowed accumulation zones.
  • Texture Analysis: Utilizing texture metrics (e.g., Grey Level Co-occurrence Matrix) may help resolve the morphological differences between hummocky debris tongues and stable ground, which spectral data alone cannot capture.

6.7. Future Research Directions in SLA and SCR Detection

To enhance the quality of the raw dataset for clustering, future research should focus on integrating multi-sensor datasets, such as Synthetic Aperture Radar (SAR) and thermal imagery, to complement optical data and improve the detection of surface meltwater, refreezing events, and superimposed ice layers. Nambiar et al. [15] showcased the effectiveness of a self-trained model for cloud, shadow, and snow detection in Sentinel-2 imagery of snow- and ice-covered regions, highlighting the transformative potential of automated frameworks in mitigating classification challenges. These methodologies can address persistent issues with cloud cover and shadow variability in complex, glaciated terrain. Furthermore, leveraging High-Spectral-Dimensional Neighborhood Search (HSDNS) could enhance pseudo-label creation by extracting intricate feature similarities in raw datasets, thus increasing classification accuracy and reliability.
Furthermore, while this study demonstrated the utility of a static high-confidence threshold ( S C R 0.8 ), future iterations of the workflow will implement an automated parameter sweep (e.g., testing SCR thresholds from 0.5 to 0.95). This would allow for dynamic threshold optimization based on scene-specific conditions, potentially minimizing the residual SLA–ELA offset.
Transitioning from static classifiers to time-resolved machine learning approaches, such as recurrent neural networks (RNNs), would support explicit tracking of melt progression and refine end-of-season snowline altitude estimates. Incorporating contemporaneous elevation data from ongoing ArcticDEM updates and/or unmanned aerial vehicle (UAV) surveys would reduce uncertainties associated with temporal DEM–surface mismatches. Automating the processing pipeline on scalable cloud platforms would enable near-real-time analysis of large image time series. Finally, coupling derived snowline altitudes with ELA gradients offers a pathway to infer glacier mass balance at individual and regional scales, improving the utility of the approach for Arctic cryospheric monitoring.

7. Conclusions

This study developed a hybrid machine learning framework to monitor glacier surface variables, specifically estimating SCR and SLA on White Glacier, Canadian high Arctic. The methodology combines unsupervised K-means clustering for pseudo-label generation with supervised classification using a RF algorithm, addressing limitations associated with traditional field-based monitoring in remote Arctic environments.
Satellite-derived SLAs were strongly correlated with field-derived ELAs (r = 0.99), demonstrating the potential of optical remote sensing for proxy mass balance estimation. A systematic offset between SLA and ELA was identified, indicating the need for further refinement in spectral discrimination and temporal alignment between remote sensing observations and in situ measurements.
Key contributions of this study include the following:
  • Automated Label Generation via Spectral Clustering: A semi-supervised classification pipeline was established wherein a Random Forest classifier reproduced unsupervised spectral clustering with >99% fidelity. This approach effectively circumvents the bottleneck of manual annotation in data-sparse regions while preserving critical spectral variance through PCA dimensionality reduction.
  • Operational Integration and Physical Validation: This study demonstrates that optical pseudo-labels can be converted into physically meaningful mass balance proxies (SLA) when constrained by high-threshold elevation binning ( SCR 0.8 ). Validation against the long-term in situ record of White Glacier confirmed a strong agreement ( r = 0.994 ), establishing the workflow as a viable proxy for equilibrium line monitoring.
  • Characterization of Interannual Variability: The derived time series (2019–2024) effectively captured the glacier’s interannual response to climatic forcing, resolving distinct ablation regimes such as the suppressed melt of 2021 versus the intense ablation of 2023. These trends provide observational constraints for regional mass balance assessments.
  • Uncertainty Benchmarking: A systematic assessment of errors identified acquisition timing and mixed-pixel effects as dominant sources of bias. The quantification of the residual SLA–ELA offset (≈−30 m) provides a calibrated benchmark for transferring this methodology to unmonitored basins.
Limitations of the current framework include spectral ambiguity in superimposed ice zones, residual cloud contamination, and vertical errors associated with the static digital elevation model. Future research should prioritize the integration of multi-sensor datasets (e.g., SWIR, SAR) to resolve complex surface facies and the application of temporally resolved classification models. Furthermore, synchronizing contemporaneous elevation data with optical acquisitions is recommended to minimize geometric mismatches in rapidly thinning ablation zones.
This study demonstrates the applicability of remote sensing and machine learning for monitoring clean-ice glaciers in data-sparse polar regions. The framework offers a scalable tool for tracking glacier change in clean-ice Arctic environments and contributes to broader efforts to understand and model cryospheric responses to climate forcing. Future research should focus on improving classification accuracy, increasing temporal resolution, and integrating multi-source datasets to support glaciological analysis and climate policy development.

Author Contributions

Conceptualization, W.Y.C.; methodology, W.Y.C.; software, W.Y.C.; validation, W.Y.C. and L.T.; formal analysis, W.Y.C.; investigation, W.Y.C.; resources, L.T.; data curation, W.Y.C.; writing—original draft preparation, W.Y.C.; writing—review and editing, W.Y.C. and L.T.; visualization, W.Y.C.; supervision, L.T.; project administration, L.T.; funding acquisition, L.T. All authors have read and agreed to the published version of the manuscript.

Funding

This research received partial financial support from Queen’s University (Queen’s Graduate Award to W.Y.C.), Parks Canada (Partnership Agreement with Nunavut Field Unit to L.T.), the Canadian Foundation for Innovation (L.T.), NSERC Discovery and Northern Supplement (L.T.), and in-kind logistical support from Natural Resources Canada’s Polar Continental Shelf Program (L.T.).

Data Availability Statement

The data and code supporting the findings of this study are openly available as follows: Code (Google Colab/Python v3.12.2). Complete, executable notebooks and scripts for K-means pseudo-label generation, Random Forest classification, snow cover ratio (SCR) calculation, and snowline-altitude (SLA) extraction: https://github.com/cwywilson/KMeans-Derived-Pseudo-Labels-for-Machine-Learning-Classification (accessed on 5 June 2025). Sentinel-2 imagery (2019–2024). Level-1C/Level-2A scenes are available from the Copernicus Data Space Ecosystem: https://dataspace.copernicus.eu/ (accessed on 6 January 2025). Elevation data. ArcticDEM mosaics used for elevation referencing can be obtained from the Polar Geospatial Center/Harvard Dataverse (Version 4.1): https://doi.org/10.7910/DVN/3VDC4W (accessed on 6 January 2025). Field ELA information (if referenced). White Glacier records via the WGMS Reference Glaciers Database: https://wgms.ch/products_ref_glaciers/ (accessed on 1 June 2025). Temperature data for PDD. Daily air temperatures for Eureka, NU, are available from Environment and Climate Change Canada (Historical Climate Data): https://climate.weather.gc.ca/ (accessed on 13 August 2025).

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Zemp, M.; Huss, M.; Thibert, E.; Eckert, N.; McNabb, R.; Huber, J.; Cogley, J.G. Global Glacier Mass Changes and Their Contributions to Sea-Level Rise from 1961 to 2016. Nature 2019, 568, 382–386. [Google Scholar] [CrossRef]
  2. Hugonnet, R.; McNabb, R.; Berthier, E.; Menounos, B.; Nuth, C.; Girod, L.; Farinotti, D.; Huss, M.; Dussaillant, I.; Brun, F.; et al. Accelerated global glacier mass loss in the early twenty-first century. Nature 2021, 592, 726–731. [Google Scholar] [CrossRef]
  3. Sharp, M.; Burgess, D.O.; Cawkwell, F.; Copland, L.; Davis, J.A.; Dowdeswell, E.K.; Dowdeswell, J.A.; Gardner, A.S.; Mair, D.; Wang, L. Remote Sensing of Recent Glacier Changes in the Canadian Arctic. In Global Land Ice Measurements from Space; Kargel, J.S., Leonard, G.J., Bishop, M.P., Kääb, A., Raup, B., Eds.; Springer: Berlin/Heidelberg, Germany, 2014; pp. 205–228. [Google Scholar]
  4. Hock, R.; Truffer, M. Glacier hazards: Will they change in the future? Arctic Antarct. Alp. Res. 2024, 56, 2335991. [Google Scholar] [CrossRef]
  5. Box, J.E.; Colgan, W.T.; Wouters, B.; Burgess, D.O.; O’Neel, S.; Thomson, L.I.; Mernild, S.H. Global Sea-Level Contribution from Arctic Land Ice: 1971–2017. Environ. Res. Lett. 2018, 13, 125012. [Google Scholar] [CrossRef]
  6. Noël, B.; van de Berg, W.J.; Lhermitte, S.; Wouters, B.; Schaffer, N.; van den Broeke, M.R. Six Decades of Glacial Mass Loss in the Canadian Arctic Archipelago. J. Geophys. Res. Earth Surf. 2018, 123, 1430–1449. [Google Scholar] [CrossRef]
  7. Rastner, P.; Prinz, R.; Notarnicola, C.; Nicholson, L.; Sailer, R.; Schwaizer, G.; Paul, F. On the Automated Mapping of Snow Cover on Glaciers and Calculation of Snow Line Altitudes from Multi-Temporal Landsat Data. Remote Sens. 2019, 11, 1410. [Google Scholar] [CrossRef]
  8. Rabatel, A.; Dedieu, J.P.; Vincent, C. Using Remote-Sensing Data to Determine Equilibrium-Line Altitude (ELA) and Mass-Balance Time Series: Validation on Three French Glaciers, 1994–2002. J. Glaciol. 2005, 51, 539–546. [Google Scholar] [CrossRef]
  9. Barandun, M.; Callegari, M.; Strasser, U.; Notarnicola, C. Towards Daily Snowline Observations on Glaciers Using Multi-Source and Multi-Resolution Satellite Data. In Proceedings of the Microwave Remote Sensing: Data Processing and Applications, Online, 13–18 September 2021; SPIE: St Bellingham, WA, USA, 2021. Proc. SPIE 11861. [Google Scholar] [CrossRef]
  10. Larocca, L.J.; Lea, J.M.; Erb, M.P.; McKay, N.P.; Phillips, M.; Lamantia, K.A.; Kaufman, D.S. Arctic glacier snowline altitudes rise 150 m over the last 4 decades. Cryosphere 2024, 18, 3591–3611. [Google Scholar] [CrossRef]
  11. Prieur, C.; Rabatel, A.; Thomas, J.B.; Farup, I.; Chanussot, J. Machine Learning Approaches to Automatically Detect Glacier Snow Lines on Multi-Spectral Satellite Images. Remote Sens. 2022, 14, 3868. [Google Scholar] [CrossRef]
  12. Li, X.; Wang, N.; Wu, Y. Automated Glacier Snow Line Altitude Calculation Method Using Landsat Series Images in the Google Earth Engine Platform. Remote Sens. 2022, 14, 2377. [Google Scholar] [CrossRef]
  13. Maxwell, A.E.; Warner, T.A.; Fang, F. Implementation of Machine-Learning Classification in Remote Sensing: An Applied Review. Int. J. Remote Sens. 2018, 39, 2784–2817. [Google Scholar] [CrossRef]
  14. Guo, W.; Zhang, W.; Zhang, Z.; Tang, P.; Gao, S. Deep Temporal Iterative Clustering for Satellite Image Time Series Land Cover Analysis. Remote Sens. 2022, 14, 3635. [Google Scholar] [CrossRef]
  15. Nambiar, K.; Morgenshtern, V.; Hochreuther, P.; Seehaus, T.; Braun, M. A Self-Trained Model for Cloud, Shadow and Snow Detection in Sentinel-2 Images of Snow- and Ice-Covered Regions. Remote Sens. 2022, 14, 1825. [Google Scholar] [CrossRef]
  16. Mirpulatov, I.; Illarionova, S.; Shadrin, D.; Burnaev, E. Pseudo-Labeling Approach for Land Cover Classification Through Remote Sensing Observations with Noisy Labels. IEEE Access 2023, 11, 82570–82583. [Google Scholar] [CrossRef]
  17. World Glacier Monitoring Service. White Glacier. WGMS Reference Glaciers Database. 2023. Available online: https://wgms.ch/products_ref_glaciers/ (accessed on 15 August 2025).
  18. Thomson, L.I.; Copland, L. Glacier Change on Axel Heiberg Island, Nunavut, Canada. J. Glaciol. 2016, 62, 133–145. [Google Scholar] [CrossRef]
  19. Masek, J.; Vermote, E.; Saleous, N.; Wolfe, R.; Hall, F.; Huemmrich, K.; Gao, F.; Kutler, J.; Lim, T.K. A Landsat Surface Reflectance Dataset for North America, 1990–2000. IEEE Geosci. Remote Sens. Lett. 2006, 3, 68–72. [Google Scholar] [CrossRef]
  20. Runge, A.; Grosse, G. Comparing Spectral Characteristics of Landsat-8 and Sentinel-2 Same-Day Data for Arctic-Boreal Regions. Remote Sens. 2019, 11, 1730. [Google Scholar] [CrossRef]
  21. Porter, C.; Howat, I.; Noh, M.J.; Husby, E.; Khuvis, S.; Danish, E.; Tomko, K.; Gardiner, J.; Negrete, A.; Yadav, B.; et al. ArcticDEM–Mosaics, Version 4.1. 2023. Available online: https://www.pgc.umn.edu/data/arcticdem/ (accessed on 6 January 2025). [CrossRef]
  22. Noh, M.J.; Howat, I.M.; Porter, C.; Merryman Bonanza, S. Automated stereo-photogrammetric DEM generation at high latitudes: ArcticDEM. Cryosphere 2020, 14, 163–183. [Google Scholar]
  23. RGI Consortium. Randolph Glacier Inventory—A Dataset of Global Glacier Outlines; [Data Set]; National Snow and Ice Data Center: Boulder, CO, USA, 2023; NSIDC-0770, Version 7. [Google Scholar] [CrossRef]
  24. Pfeffer, W.T.; Arendt, A.A.; Bliss, A.; Bolch, T.; Cogley, J.G.; Gardner, A.S.; Hagen, J.-O.; Hock, R.; Kaser, G.; Kienholz, C.; et al. The Randolph Glacier Inventory: A Globally Complete Inventory of Glaciers. J. Glaciol. 2014, 60, 537–552. [Google Scholar] [CrossRef]
  25. Cai, W. A dimension reduction algorithm preserving both global and local clustering structure. Knowl.-Based Syst. 2017, 118, 191–203. [Google Scholar] [CrossRef]
  26. Raj, A.; Minz, S. Spatial Granule based Clustering Technique for Hyperspectral Images. In Proceedings of the 2022 IEEE 2nd Mysore Sub Section International Conference (MysuruCon), Mysuru, India, 16–17 October 2022; pp. 1–6. [Google Scholar] [CrossRef]
  27. Aberle, R.; Enderlin, E.; O’Neel, S.; Florentine, C.; Sass, L.; Dickson, A.; Marshall, H.P.; Flores, A. Automated snow cover detection on mountain glaciers using space-borne imagery. EGUsphere 2024, 2024, 1–28. [Google Scholar] [CrossRef]
  28. Zeller, L.; McGrath, D.J.; Sass, L.C.; Florentine, C.; Downs, J. Equilibrium-Line Altitudes, Accumulation Areas, and the Vulnerability of Glaciers in Alaska. J. Glaciol. 2025, 71, e28. [Google Scholar] [CrossRef]
  29. Peterson, G. Accumulation Area Ratio (AAR) and Equilibrium Line Altitude (ELA) on the Southern Patagonia Icefield, 2000–2010, Retrieved Using MODIS Satellite Images. Master’s Thesis, Stockholms Universitet, Stockholm, Sweden, 2010. [Google Scholar]
  30. Wang, Y.; Huang, J.; Guo, Z.; Chen, A. Limitations in Identifying the Equilibrium-Line Altitude from the Optical Remote-Sensing Derived Snowline in the Tien Shan, China. J. Glaciol. 2014, 60, 1093–1100. [Google Scholar] [CrossRef]
  31. Yue, X.; Li, Z.; Zhao, J.; Li, H.; Wang, P.; Wang, L. Changes in the End-of-Summer Snow Line Altitude of Summer-Accumulation-Type Glaciers in the Eastern Tien Shan Mountains from 1994 to 2016. Remote Sens. 2021, 13, 1080. [Google Scholar] [CrossRef]
  32. Koerner, R. The Mass Balance of the Devon Island Ice Cap, Northwest Territories, Canada, 1961–66. J. Glaciol. 1970, 9, 325–336. [Google Scholar] [CrossRef]
  33. Woodward, J.; Sharp, M.; Arendt, A. The influence of superimposed-ice formation on the sensitivity of glacier mass balance to climate change. Ann. Glaciol. 1997, 24, 186–190. [Google Scholar] [CrossRef][Green Version]
Figure 1. Snowline elevation analysis of White Glacier in 2019 using the elevation binning method with 20 m intervals. This image corresponds to the date with the lowest Snow Cover Ratio (SCR) observed in 2019, selected for Snowline Altitude (SLA) delineation. The panel (a) illustrates the variation in SCR as a function of elevation; the red dashed line marks the SCR threshold of 0.5, used in the Automatic Snowline Mapping and Glacier delineation (ASMAG) algorithm by [7] for SLA delineation. However, this threshold tends to underestimate the ELA when compared with in situ field measurements. The panel (b) displays the corresponding classified image, with pixels categorized as snow, ice, or ground, and 20 m elevation contours overlaid to assist in SLA interpretation.
Figure 1. Snowline elevation analysis of White Glacier in 2019 using the elevation binning method with 20 m intervals. This image corresponds to the date with the lowest Snow Cover Ratio (SCR) observed in 2019, selected for Snowline Altitude (SLA) delineation. The panel (a) illustrates the variation in SCR as a function of elevation; the red dashed line marks the SCR threshold of 0.5, used in the Automatic Snowline Mapping and Glacier delineation (ASMAG) algorithm by [7] for SLA delineation. However, this threshold tends to underestimate the ELA when compared with in situ field measurements. The panel (b) displays the corresponding classified image, with pixels categorized as snow, ice, or ground, and 20 m elevation contours overlaid to assist in SLA interpretation.
Remotesensing 17 03872 g001
Figure 2. Geographical setting of White Glacier on Axel Heiberg Island, Nunavut, Canadian high Arctic. (a) Regional map of the Canadian Arctic Archipelago showing Axel Heiberg Island, with the study site location marked by a red box, and the nearby Eureka weather station on Ellesmere Island marked by a red triangle. (b) Inset globe indicating the position of the study area relative to Greenland and North America marked by a red box. (c) Sentinel-2 true-colour composite (bands 4–3–2) of White Glacier with the glacier boundary delineated by the red outline.
Figure 2. Geographical setting of White Glacier on Axel Heiberg Island, Nunavut, Canadian high Arctic. (a) Regional map of the Canadian Arctic Archipelago showing Axel Heiberg Island, with the study site location marked by a red box, and the nearby Eureka weather station on Ellesmere Island marked by a red triangle. (b) Inset globe indicating the position of the study area relative to Greenland and North America marked by a red box. (c) Sentinel-2 true-colour composite (bands 4–3–2) of White Glacier with the glacier boundary delineated by the red outline.
Remotesensing 17 03872 g002
Figure 3. Overview of the processing workflow used to estimate the snow cover ratio (SCR) and delineate snowline altitude (SLA) from Sentinel-2 imagery (10 m). Pre-processing includes scene selection during the ablation season, cloud masking, and glacier masking. Unsupervised K-means clustering of multispectral inputs produces pseudo-labels that guide the supervised classification of snow, ice, and ground. Post-processing and elevation-based filtering yield per-scene SCR, after which the image with the seasonal minimum SCR is identified and used for end-of-ablation SLA delineation.
Figure 3. Overview of the processing workflow used to estimate the snow cover ratio (SCR) and delineate snowline altitude (SLA) from Sentinel-2 imagery (10 m). Pre-processing includes scene selection during the ablation season, cloud masking, and glacier masking. Unsupervised K-means clustering of multispectral inputs produces pseudo-labels that guide the supervised classification of snow, ice, and ground. Post-processing and elevation-based filtering yield per-scene SCR, after which the image with the seasonal minimum SCR is identified and used for end-of-ablation SLA delineation.
Remotesensing 17 03872 g003
Figure 4. Snow–ice boundary analysis on White Glacier based on Sentinel-2 imagery from 14 August 2019. (a) Histogram of elevation values at the snow–ice boundary; the mode (1046.90 m) was selected as the Snowline Altitude (SLA). (b) Refined snow–ice boundary overlaid on a topographic map of the glacier, illustrating spatial distribution. (c) Classified image showing the refined snow–ice boundary; the corresponding Snow Cover Ratio (SCR) for this date is 0.54.
Figure 4. Snow–ice boundary analysis on White Glacier based on Sentinel-2 imagery from 14 August 2019. (a) Histogram of elevation values at the snow–ice boundary; the mode (1046.90 m) was selected as the Snowline Altitude (SLA). (b) Refined snow–ice boundary overlaid on a topographic map of the glacier, illustrating spatial distribution. (c) Classified image showing the refined snow–ice boundary; the corresponding Snow Cover Ratio (SCR) for this date is 0.54.
Remotesensing 17 03872 g004
Figure 5. Snow–ice boundary visualization for White Glacier in 2019. (a) ArcticDEM v4.1 elevation map with contours and the delineated snow–ice boundary (red). (b) Sentinel-2 false-colour RGB composite (bands 4–3–2) from 14 August 2019 showing the snow–ice boundary (red) within the glacier outline (yellow). The selected scene corresponds to the date with the minimum snow cover ratio (SCR = 0.54) in 2019.
Figure 5. Snow–ice boundary visualization for White Glacier in 2019. (a) ArcticDEM v4.1 elevation map with contours and the delineated snow–ice boundary (red). (b) Sentinel-2 false-colour RGB composite (bands 4–3–2) from 14 August 2019 showing the snow–ice boundary (red) within the glacier outline (yellow). The selected scene corresponds to the date with the minimum snow cover ratio (SCR = 0.54) in 2019.
Remotesensing 17 03872 g005
Figure 6. K-means clustering k = 4 applied to randomly sampled Sentinel-2 pixels after the PCA of four 10 m bands (B2—blue; B3—green; B4—red; and B8—near-infrared). (a) Silhouette plot showing the silhouette coefficient for each sample (mean = 0.83), indicating strong intra-cluster cohesion and inter-cluster separation. (b) Scatter plot of PC1 versus PC2 (together retaining 99.96% of the original variance; PC1 = 99.65%, PC2 = 0.31%), with points coloured by cluster, convex hulls outlining cluster extents and black “X” markers denoting cluster centroids.
Figure 6. K-means clustering k = 4 applied to randomly sampled Sentinel-2 pixels after the PCA of four 10 m bands (B2—blue; B3—green; B4—red; and B8—near-infrared). (a) Silhouette plot showing the silhouette coefficient for each sample (mean = 0.83), indicating strong intra-cluster cohesion and inter-cluster separation. (b) Scatter plot of PC1 versus PC2 (together retaining 99.96% of the original variance; PC1 = 99.65%, PC2 = 0.31%), with points coloured by cluster, convex hulls outlining cluster extents and black “X” markers denoting cluster centroids.
Remotesensing 17 03872 g006
Figure 7. Time series of summer snow cover ratio (SCR) on White Glacier derived from Sentinel-2 L2A imagery (2019–2024; ablation season). SCR is the fraction of glacier area classified as snow (Methods).
Figure 7. Time series of summer snow cover ratio (SCR) on White Glacier derived from Sentinel-2 L2A imagery (2019–2024; ablation season). SCR is the fraction of glacier area classified as snow (Methods).
Remotesensing 17 03872 g007
Figure 8. Comparison of remote sensing-derived snowline altitude (SLA; SPEH and SLAbin at SCR 0.5 and SCR 0.8 ) with field-measured equilibrium-line altitude (ELA) on White Glacier for 2019–2023. The 2024 season is excluded from validation due to the absence of a field-derived ELA.
Figure 8. Comparison of remote sensing-derived snowline altitude (SLA; SPEH and SLAbin at SCR 0.5 and SCR 0.8 ) with field-measured equilibrium-line altitude (ELA) on White Glacier for 2019–2023. The 2024 season is excluded from validation due to the absence of a field-derived ELA.
Remotesensing 17 03872 g008
Table 1. Annual number of Sentinel-2 image acquisitions used in this study during the ablation seasons (2019–2024). Counts refer to scenes selected for glacier classification and snowline-altitude (SLA) analysis after cloud masking.
Table 1. Annual number of Sentinel-2 image acquisitions used in this study during the ablation seasons (2019–2024). Counts refer to scenes selected for glacier classification and snowline-altitude (SLA) analysis after cloud masking.
YearNumber of Images
201979
2020103
202135
202265
202367
202439
Total388
Table 2. Pixel distribution by cluster following unsupervised K-means classification ( k = 4 ) applied to principal component scores derived from four 10 m Sentinel-2 bands (B2—blue; B3—green; B4—red; and B8—near-infrared). Labels were assigned post hoc from spectral characteristics and image context. Note that Cluster 1 and Cluster 3 both represent snow-covered surfaces with distinct reflectance states.
Table 2. Pixel distribution by cluster following unsupervised K-means classification ( k = 4 ) applied to principal component scores derived from four 10 m Sentinel-2 bands (B2—blue; B3—green; B4—red; and B8—near-infrared). Labels were assigned post hoc from spectral characteristics and image context. Note that Cluster 1 and Cluster 3 both represent snow-covered surfaces with distinct reflectance states.
Cluster LabelPixel Count
Cluster 0 (Ground)390,127,924
Cluster 1 (Snow)52,892,331
Cluster 2 (Ice)44,652,149
Cluster 3 (Snow)62,910,846
Table 3. Performance comparison of three pseudo-label resampling strategies—Random, Cluster-based, and Stratified—for training Random Forest (RF) and Support Vector Machine (SVM) classifiers (four-class problem; input features: Sentinel-2 B2, B3, B4, and B8, reduced by PCA). Metrics include the clustering silhouette score, classification accuracy, F1-score, and prediction time. Values are reported with the original precision used in the analysis.
Table 3. Performance comparison of three pseudo-label resampling strategies—Random, Cluster-based, and Stratified—for training Random Forest (RF) and Support Vector Machine (SVM) classifiers (four-class problem; input features: Sentinel-2 B2, B3, B4, and B8, reduced by PCA). Metrics include the clustering silhouette score, classification accuracy, F1-score, and prediction time. Values are reported with the original precision used in the analysis.
Resampling MethodSilhouetteRandom Forest (RF)SVM
Accuracy F1-Score Time (s) Accuracy F1-Score Time (s)
Random Resampling0.8360.9991.001.80.9991.00113.6
Cluster-based0.6090.9991.001.70.9971.00159.2
Stratified Resampling0.6080.9991.001.80.9981.00139.2
Table 4. Annual snowline and glaciological metrics for White Glacier (2019–2024). Snow cover ratio (SCR) from Sentinel-2, accumulation-area ratio (AAR) from in situ field measurements, and snowline altitude (SLA) from the snowline pixel elevation histogram (SPEH) and elevation-bin method at thresholds SCR 0.5 (Bin 0.5) and SCR 0.8 (Bin 0.8). The field equilibrium-line altitude (ELA) is listed for comparison. “±” denotes the reported SLA precision (m); see Section 4 for details. All elevations are in metres above sea level (m a.s.l.).
Table 4. Annual snowline and glaciological metrics for White Glacier (2019–2024). Snow cover ratio (SCR) from Sentinel-2, accumulation-area ratio (AAR) from in situ field measurements, and snowline altitude (SLA) from the snowline pixel elevation histogram (SPEH) and elevation-bin method at thresholds SCR 0.5 (Bin 0.5) and SCR 0.8 (Bin 0.8). The field equilibrium-line altitude (ELA) is listed for comparison. “±” denotes the reported SLA precision (m); see Section 4 for details. All elevations are in metres above sea level (m a.s.l.).
YearDate ofSnow CoverSnowline Altitude (m a.s.l.)ELA
Min. SCR SCR AAR (Field) SPEH Bin 0.5 Bin 0.8 ± (Field)
201914 August 20190.540.281043.08113013304.71326
202010 August 20200.510.241142.39117013304.71344
202113 August 20210.760.62967.7695010304.71083
20227 August 20220.560.241082.16113013304.71354
202318 August 20230.460.201132.98117013504.71380
20246 July 20240.601122.27113012304.7
Table 5. Performance of snowline estimation methods against field-derived ELAs (2019–2023). Pearson correlation coefficients (r), p-values, and RMSE quantify agreement and error.
Table 5. Performance of snowline estimation methods against field-derived ELAs (2019–2023). Pearson correlation coefficients (r), p-values, and RMSE quantify agreement and error.
Methodrp-ValueRMSE (m)
SPEH0.8750.052231.9
Elevation-bin method (SCR > 0.5)0.9830.0027190.1
Elevation-bin method (SCR > 0.8)0.9940.000630.0
Table 6. Correlations between annual PDD at Eureka and snowline/ELA metrics. n is the number of years used (2019–2024 for SLA metrics; 2019–2023 for field-derived ELAs).
Table 6. Correlations between annual PDD at Eureka and snowline/ELA metrics. n is the number of years used (2019–2024 for SLA metrics; 2019–2023 for field-derived ELAs).
Series vs. PDDrp-Valuen
SLA bin ( SCR 0.5 )0.3190.5386
SLA bin ( SCR 0.8 )0.5240.2866
SLA speh (mode) 0.095 0.8586
field-derived ELA0.4450.4525
Table 7. Year-wise residuals Δ ELA = SLA RS ELA field (m) using existing values for 2019–2023.
Table 7. Year-wise residuals Δ ELA = SLA RS ELA field (m) using existing values for 2019–2023.
20192020202120222023
SCR 0.5 196 174 133 224 210
SCR 0.8 + 4 14 53 24 30
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Cheung, W.Y.; Thomson, L. Using K-Means-Derived Pseudo-Labels and Machine Learning Classification on Sentinel-2 Imagery to Delineate Snow Cover Ratio and Snowline Altitude: A Case Study on White Glacier from 2019 to 2024. Remote Sens. 2025, 17, 3872. https://doi.org/10.3390/rs17233872

AMA Style

Cheung WY, Thomson L. Using K-Means-Derived Pseudo-Labels and Machine Learning Classification on Sentinel-2 Imagery to Delineate Snow Cover Ratio and Snowline Altitude: A Case Study on White Glacier from 2019 to 2024. Remote Sensing. 2025; 17(23):3872. https://doi.org/10.3390/rs17233872

Chicago/Turabian Style

Cheung, Wai Yin (Wilson), and Laura Thomson. 2025. "Using K-Means-Derived Pseudo-Labels and Machine Learning Classification on Sentinel-2 Imagery to Delineate Snow Cover Ratio and Snowline Altitude: A Case Study on White Glacier from 2019 to 2024" Remote Sensing 17, no. 23: 3872. https://doi.org/10.3390/rs17233872

APA Style

Cheung, W. Y., & Thomson, L. (2025). Using K-Means-Derived Pseudo-Labels and Machine Learning Classification on Sentinel-2 Imagery to Delineate Snow Cover Ratio and Snowline Altitude: A Case Study on White Glacier from 2019 to 2024. Remote Sensing, 17(23), 3872. https://doi.org/10.3390/rs17233872

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop