A random-forest-derived 35-year snow phenology record reveals climate trends in the Yukon River Basin

Pan, Caleb G.; Lasko, Kristofer; Griffin, Sean P.; Kimball, John S.; Du, Jinyang; Meehan, Tate G.; Kirchner, Peter B.

doi:https://doi.org/10.5194/tc-19-2797-2025

Articles | Volume 19, issue 8

https://doi.org/10.5194/tc-19-2797-2025

Articles | Volume 19, issue 8

Research article

05 Aug 2025

Research article |

| 05 Aug 2025

A random-forest-derived 35-year snow phenology record reveals climate trends in the Yukon River Basin

Caleb G. Pan, Kristofer Lasko, Sean P. Griffin, John S. Kimball, Jinyang Du, Tate G. Meehan, and Peter B. Kirchner

Abstract

This study presents a 35-year snow phenology record for the Yukon River Basin (YRB), developed using a random forest (RF) model at a 3.125 km resolution, capturing detailed trends in snowmelt onset and snow-off. The RF model, incorporating dynamic daily variables, improves upon traditional threshold-based methods by reducing sensitivity to transient thaw events and atmospheric noise. Model evaluation against station observations yielded a mean absolute error (MAE) of 10.5 d and a root mean square error (RMSE) of 13.7 d for snowmelt onset. For snow-off, the model achieved an MAE of 18.1 d and an RMSE of 20.7 d. This approach successfully mapped snow phenology across the diverse YRB landscape, providing valuable insight into how variations in snow cover align with regional climate patterns. Challenges such as sample bias due to limited ground-based data coverage highlight the need to expand in situ measurements to improve model performance further. Trend analysis segmented by two timeframes, 1988–2005 and 2006–2023, revealed distinct climate impacts on snow phenology. During 1988–2005, high snowfall and stable temperatures resulted in hastened snowmelt onset and lengthened snowmelt durations, reflecting early-season snow abundance. In contrast, from 2006–2023, warming spring and summer temperatures corresponded to progressively earlier snowmelt onset and snow-off. These shifts in snowmelt patterns align with a lengthened snow-free season, indicating an increasing influence of warmer temperatures on the snowpack. This RF-derived dataset provides an essential tool for tracking climate-driven snow changes, offering insights into hydrologic and ecologic dynamics in the YRB under accelerating climate change.

Download & links

Article (PDF, 1644 KB)

Download & links

How to cite.

Received: 19 Nov 2024 – Discussion started: 16 Jan 2025 – Revised: 22 May 2025 – Accepted: 27 May 2025 – Published: 05 Aug 2025

1 Introduction

Snow cover and its seasonal progression, or phenology, play a crucial role in regulating the global energy budget and shaping ecosystem structure and function (Callaghan et al., 2011). These processes directly drive ecologic and hydrologic responses to seasonal variability. In the Yukon River Basin (YRB), regional warming (Ballinger et al., 2023; Rantanen et al., 2022) has reduced snow cover (Derksen and Brown, 2012), triggering widespread environmental changes. A warmer and longer snow-free season has disrupted permafrost, boosted vegetation growth, and increased ecosystem carbon uptake (Ling and Zhang, 2003; Pulliainen et al., 2017) but also enhanced regional drought and fire disturbance (Scholten et al., 2021), led to a decline in plant diversity (Niittynen et al., 2018), and disrupted wildlife movements (Berger et al., 2018; Cosgrove et al., 2021). Seasonal snowmelt drives much of the discharge into the Yukon River and its adjoining stream networks, and the timing of this melt has significant hydrologic impacts. Earlier snowmelt has heightened flood risks, intensified the spring flood pulse, and accelerated river ice breakup (Beltaos and Prowse, 2009; Lesack et al., 2014; Semmens and Ramage, 2013). These changes are reshaping the region's geomorphology and directly affecting the communities that rely on stable snow and ice conditions in the Yukon for winter travel, recreation, and harvest (Cold et al., 2020).

Enhanced monitoring and understanding of spatiotemporal variability in snow phenology are essential for assessing risks and mitigating potential impacts on Alaskan communities reliant on the Yukon River. Ground-based observations, like snow water equivalent (SWE) and snow depth measurements from SNOTEL sites, provide valuable insights into snow phenology. However, the vast landscape heterogeneity and limited ground observation locations make it challenging to forecast snow phenology reliably across large spatial scales (Bair et al., 2023). Satellite microwave remote sensing offers a valuable alternative for mapping snow phenology, especially in remote, high-latitude regions. The moderate-frequency ( $\sim \leq 37$ GHz) retrievals from operational satellite microwave radiometers are sensitive to snow cover conditions, providing nearly continuous, year-round data. Importantly, the propagation of microwave energy through the snowpack is responsive to changes in snow structure, including liquid water content (LWC) and grain size and density, which are key indicators of snowmelt onset (Tedesco et al., 2015). However, the sampling footprint from the passive microwave (PMW) retrievals can range from ∼ 12–25 km resolution depending on frequency and can be too coarse to capture snow spatial heterogeneity, especially in mountain environments. While higher-frequency K-band and Ka-band radiometers are limited by their coarser spatial resolution, they provide twice-daily acquisitions for polar latitudes from 1988 to the present, offering a valuable long-term data record.

In contrast, synthetic aperture radar (SAR) sensors are sensitive to snow conditions and offer improved spatial resolution over microwave radiometers and scatterometers. C-band SAR data from the European Space Agency (ESA) Sentinel-1 mission have proven valuable for detecting snowmelt onset using a median minimum backscatter approach, often in combination with optical infrared remote sensing imagery (Darychuk et al., 2023; Gagliano et al., 2023; Marin et al., 2020; Nagler and Rott, 2000). The ability of SAR to detect changes in snowpack structure and LWC makes it particularly effective for identifying the onset of snowmelt, as the C-band radar backscatter at VV and VH polarizations decreases when snow transitions from dry to wet. The extraction of snowmelt onset using Sentinel-1 missions shows great promise, providing excellent detail with a spatial resolution of 10 m. However, a current limitation of these data is the relatively short temporal record. Sentinel-1A began operations in April 2014, followed by Sentinel-1B nearly 2 years later in April 2016. Unfortunately, Sentinel-1B was decommissioned in December 2021 due to power issues.

Several snow phenology algorithms utilize K- and Ka-band radiometric brightness temperature (T_b) measurements collected from the Defense Meteorological Satellite Program (DMSP) Special Sensor Microwave Imager (SSM/I) (1987–present) and Special Sensor Microwave Imager/Sounder (SSMIS) (2004–present). Various retrieval algorithms using these data to derive snow properties include (1) the T_b diurnal amplitude variation (DAV) method (Ramage and Isacks, 2002; Tedesco and Miller, 2007), (2) the T_b differencing approach (K-Ka) (Wang et al., 2013, 2016), (3) a single-frequency T_b temporal change algorithm coupled with reanalysis surface temperature (Kim et al., 2017), (4) the gradient ratio polarization (GRP) approach (Dolant et al., 2016; Du et al., 2025; Pan et al., 2018), and (5) a remote sensing and physics-based hybrid method (Dattler et al., 2024). Each algorithm leverages the interaction between the surface snowpack, its liquid water content (LWC), and the resulting effect on the T_b signal at each band or polarization. Specifically, dry snow conditions lead to volumetric scattering in both K and Ka bands, with stronger scattering at higher frequencies. In contrast, when the LWC within the snowpack increases, snow emissivity at lower frequencies likely decreases due to attenuated soil emission, while increasing at higher frequencies such as K and Ka bands due to enhanced emission from the wet snow layers (Dolant et al., 2016). Due to these interactions, past algorithms have successfully derived snow phenology by analyzing T_b time series using these approaches and applying thresholds to identify transitioning snow conditions.

While threshold-based methods have successfully predicted snow phenology, they often fail to fully capture landscape variability in snow conditions due to their coarse spatial resolution. Additionally, these methods are susceptible to atmospheric noise, which can lead to potential false positives. Alternatively, machine learning (ML) offers a flexible empirical modeling approach for estimating snow properties from satellite observations and other ancillary data. ML provides the ability to model complex interactions across diverse datasets and has been applied widely in cryosphere applications (Campbell et al., 2021; Dunmire et al., 2024; Guidicelli et al., 2023; Meloche et al., 2022; Tedesco et al., 2004; Tsai et al., 2019). Among ML methods, random forest (RF) has demonstrated success, often bettering other methods, due to its flexibility, ability to handle high-dimensional data, and success in handling complex environmental datasets. RF constructs multiple decision trees during training and aggregates their outputs, reducing overfitting and increasing robustness in diverse datasets. Furthermore, RF can manage missing data and maintain accuracy even with uncorrelated features (Breiman, 2001). Neural networks (NNs) also present a strong alternative, particularly for modeling nonlinear and temporal relationships in snow-related data. NN methods have shown strong performance in linking PMW observations to snow properties (Forman and Xue, 2017). However, model comparisons have shown that RF has outperformed NN for retrieving snow cover, emphasizing RF's robustness in cryosphere remote sensing applications (Xiao et al., 2021).

Previous efforts to characterize long-term snow phenology in the YRB have relied on threshold-based methods applied to PMW observations (Pan et al., 2021; Semmens et al., 2013). While these approaches have provided valuable insights, they face limitations related to model generalizability, temporal robustness, and sensitivity to landscape heterogeneity. Fixed thresholds can misclassify short-term warming events as melt onset and are often unable to capture the spatial complexity of snowpack transitions in mountainous terrain. In this study, we address these challenges by applying an ML framework that incorporates dynamic predictors and static landscape features. This approach enables daily classification of snow state, spatially explicit uncertainty mapping, and improved representation of the nonlinear interactions that govern snowmelt.

Furthermore, our dataset extends the snow phenology records for the YRB from 2018 to 2023, offering a 25 % increase in temporal coverage during a period of record-setting Arctic warming (Ballinger et al., 2023). The extended study period enables us to examine emerging trends in the snow season which may have been missed in prior datasets.

Our study integrates a temporal component into the RF framework, allowing the model to capture seasonal variations in snow cover. Unlike traditional thresholding methods that rely on fixed values (Pan et al., 2021), the RF model accounts for multiple variables and their interactions, producing more nuanced predictions. By incorporating time series data, our RF model tracks the evolution of snow conditions throughout the season (Rittger et al., 2021), improving predictions of snowmelt onset.

In this paper, we examine the question: how has amplified Arctic warming influenced the timing, duration, and variability of snow phenology in the YRB? To address this question, we use an ML framework informed with T_b time series from the K and Ka bands collected from SSM/I(S), along with other complementary dynamic and static variables, to estimate primary spring snowmelt onset and snow-off dates across the YRB from 1988 to 2023. The resulting annual snow phenology maps are produced at an enhanced resolution of 3.125 km, offering an improvement over previous records derived directly from PMW observations and enabling a more detailed delineation of landscape heterogeneity. We then apply the snow phenology outputs with other ancillary and in situ environmental data to (1) assess model performance and define relative quality maps, (2) examine YRB snow phenology climatology and compare it with anomalous years, (3) analyze spatiotemporal trends in snow phenology over the period of record, and (4) explore interactions between snow phenology and seasonal snowfall and temperature trends.

2 Study area

The YRB constitutes one of North America's largest river basins (Fig. 1). This region experiences 6 to 9 months of snow cover annually, and spring snowmelt runoff is the main hydrologic contribution to the discharge (Brown et al., 2020). The YRB has a mean annual discharge of 6400 m³ s⁻¹ (Brabets et al., 2000) with a drainage area exceeding 853 300 km², covers 10° of latitude from 59 to 69° N, extends into the Canadian Yukon and British Columbia territories to the east, and extends to the west coast of Alaska before draining into the Bering Sea. The diverse topography, with a median elevation of 617 m and extending from sea level to the highest elevations of the Brooks Range (2735 m) and Alaska (6190 m) Range, encompasses a diversity of northern boreal, arctic, alpine, and maritime biomes. Evergreen needleleaf forests are the dominant vegetation cover (54 %), followed by broadleaf deciduous forests (9 %) covering the valley bottoms and into the mid-elevations. The Yukon Delta and higher elevations have tall and low shrubs (9 %) mixed with some dry and wet herbaceous (9 %) tundra as the dominant plant community. Permafrost is present to a large extent in the YRB and comprises several types including sporadic (14 %), discontinuous (46 %), continuous (16 %), and moderately thick to thin permafrost (24 %) (Brabets et al., 2000). Historically the Yukon River served as the main travel corridor of the region and the YRB is the ancestral homelands of several Native Alaskan cultures. Presently, many communities are inextricably linked to and rely upon the Yukon and its tributaries for travel, subsistence, and livelihood (Cold et al., 2020).

https://tc.copernicus.org/articles/19/2797/2025/tc-19-2797-2025-f01

Figure 1The YRB study areas delineated by the dashed black polygon and the black upside-down triangles indicate climate stations utilized for creating a training and testing dataset for our ML models.

3 Data

3.1 Training and testing datasets

We acquired daily in situ snow depth measurements from the Global Historical Climatology Network (GHCNd) (Menne et al., 2012) to build the RF model training and testing dataset. Filtering stations across Alaska, 77 stations included snow depth measurements spanning at least 1 year between 1988 and 2023. While our focus is on the YRB, we included sites located outside the basin to supplement the sparse in situ observations within the YRB and enhance model generalizability. This expanded dataset increases the robustness of the model across diverse snowmelt regimes. However, we acknowledge that incorporating data from regions with different climatic conditions (e.g., maritime vs. continental) may introduce some bias, and we interpret model outputs within the YRB with that potential limitation in mind.

Although many researchers use the day of peak SWE or a breakpoint after peak SWE to determine the onset of snowmelt (Darychuk et al., 2023; Gagliano et al., 2023), the lack of SWE measurements in the GHCNd led us to use peak snow depth instead. Specifically, for each station and year, we identified snowmelt onset by locating the day with the highest snow depth in spring (March–May). However, peak snow depth often did not accurately represent the true onset of snowmelt, as decreases in snow depth can occur due to factors like wind redistribution, sublimation, or compaction, unrelated to snowmelt. For this reason, we could not apply rigid rules to defining snowmelt onset using snow depth.

To improve the identification of snowmelt onset, we used our best judgment for interpretation, supported by air temperature data. When average daily temperatures consistently rose above freezing and snow depth began to steadily decrease from its peak, it became easier to pinpoint the onset of snowmelt. Identifying snow-off from snow depth was more straightforward and defined as the first day when snow depth reached 0 for at least 10 consecutive days in spring. After analyzing each in situ snow depth time series, we compiled 971 observations for snowmelt onset and 933 snow-off observations for RF training and testing. The lower number of snow-off observations is primarily due to gaps in the snow depth records – many time series did not extend far enough into the melt season to capture when snow depth reached zero.

3.2 Time series datasets

In this study, we employ a combination of dynamic and static datasets as RF model predictors and for analyzing the model snow phenology outputs. The dynamic RF predictors include the T_b-derived indices, T_b difference (TBD), and gradient ratio polarization (GRP), as well as their respective 3 d moving averages (MA_TBD and MA_GRP). We also utilize daily thaw degree days (TDDs), day of year (DOY), total water vapor (TQV), and daily snow cover. Together, these dynamic datasets provide a comprehensive basis for capturing both seasonal and interannual variability in snow phenology.

We also include several static landscape factors and assess how landscape features influence model sensitivity. Static variables include fractional water (FW) cover, fractional tree cover (FTC), elevation (GTOPO), topographic variability, aspect, and proximity, described by a pixel's proximity to the nearest ocean. These datasets are summarized in Table 1 and described in more detail in the following section. In addition, a comprehensive table with all datasets used in this study is found in Table A1.

Table 1Dynamic and static predictor summary, including their abbreviation as well as spatial and temporal resolutions.

Download Print Version | Download XLSX

3.2.1 Passive microwave satellite record

We acquired K-band (19 GHz) and Ka-band (37 GHz) afternoon T_b retrievals at vertical (V) and horizontal (H) polarizations from the MEaSUREs Calibrated Enhanced Resolution Passive Microwave Daily EASE-Grid 2.0 Brightness Temperature ESDR, available from the National Snow and Ice Data Center (NSIDC) (Brodzik and Long, 2016). This T_b record is multidecadal and calibrated across multiple sensors and platforms from different frequencies and polarizations from the NOAA DMSP SSM/I and SSMIS. Each platform has several sensors: from SSMI/I we selected F08 (1998–1991), F11 (1992–1995), and F13 (1996–2007), and from SSMIS we used F17 (2007–2016) and F18 (2017–2023). These sensors were selected because their equatorial overpass time remained consistent while in commission. Missing temporal observations were gap-filled using a temporal linear interpolation of adjacent T_b retrievals (Wang et al., 2016). These missing observations are generally infrequent and short in duration, typically affecting only 1–2 consecutive days due to sensor outages, orbital gaps, or data transmission gaps (Long and Brodzik, 2016).

We chose to focus only on the SSM/I-SSMIS record to ensure continuity and minimize calibration uncertainty across the 35-year period. These sensors offer comparable radiometric characteristics and orbital parameters, enabling a harmonized time series suitable for long-term analysis. Other PMW sensors such as SMMR, AMSR, and AMSR2 were not included. The SMMR dataset (1978–1987) exhibits frequent data dropouts, and its inclusion would have required extensive temporal interpolation. The Advanced Microwave Scanning Radiometer for EOS (AMSRE) and the Advanced Microwave Scanning Radiometer 2 (AMSR2) offer valuable observations, but their shorter and non-overlapping time periods with the SSM/I-SSMIS record would require additional cross-sensor calibration and harmonization to reconcile differences in spatial resolution and observation geometry. By focusing on SSM/I-SSMIS, we preserved the temporal consistency and data integrity critical for reliable empirical model training and long-term phenological trend detection.

The native sampling resolution of the combined K and Ka T_b retrievals is ∼25 km or coarser; however, the MEaSUREs products used were processed using the scatterometer image reconstruction (SIR) approach to obtain an enhanced spatial grid resolution of 6.25 km (K) and 3.125 km (Ka) from the overlapping T_b antenna patterns (Brodzik et al., 2018; Long and Brodzik, 2016). We then resampled K-band T_b retrievals to match the Ka resolution of 3.125 km using a nearest-neighbor interpolation.

We then reduced the vertically polarized K and Ka bands into a T_b difference index, henceforth termed TBD, defined as the difference between K and Ka bands (Wang et al., 2013). We also reduced the K and Ka bands into an additional index, the GRP, by first calculating the gradient ratio (GR) at vertical and horizontal polarizations using Eq. (1) (Grenfell and Putkonen, 2008).

\begin{matrix} (1) & GR ({pol}_{(37, 19)}) = \frac{[T_{b} (pol, 37) - T_{b} (pol, 19)]}{[T_{b} (pol, 37) + T_{b} (pol, 19)]} \end{matrix}

The GRP is then ratioed using Eq. (2) (Dolant et al., 2016).

\begin{matrix} (2) & GRP = \frac{{GR}_{V}}{{GR}_{H}} \end{matrix}

Together, both the TBD and GRP provide a source for identifying daily snow conditions such as dry and stable, melting, and disappeared (Pan et al., 2019; Wang et al., 2016).

3.2.2 Daily snow cover

We use ancillary daily snow-covered area estimates to determine the presence or absence of snow at a given location and time. For the period from 1988 to 2023, we relied on two data sources. From 2004 to 2023, we used the Interactive Multisensor Snow and Ice Mapping System (IMS) daily snow cover extent record, which has a 4 km resolution. The IMS provides global coverage and is informed by expert interpretation of geostationary visible satellite imagery, polar-orbiting multispectral satellite sensors, PMW sensors, and ground observations (Helfrich et al., 2007). Although the IMS also provides daily snow cover area outputs at a coarser 24 km resolution dating back to 1997, we opted to use alternative higher-spatial-resolution snow cover estimates from SnowModel (Liston et al., 2020) to fill in the earlier years (1988–2003), as the 24 km resolution IMS data is less able to resolve snow cover heterogeneity in complex terrain and does not cover the entire period of interest.

The SnowModel dataset was developed using meteorological data derived from both MERRA-2 and ERA5 reanalysis data. These reanalyses were used to create bias-corrected inputs for SnowModel, resulting in daily estimates of snow properties for the North American domain at a 3 km resolution from 1980 to 2020 (Liston et al., 2020). Although SnowModel includes several snow variables, we used the modeled snow depth to define daily snow presence or absence. Specifically, if the estimated snow depth exceeded 0 on any given day, we assigned a value of 1; if snow depth was 0, we assigned a value of 0.

3.2.3 Daymet

We calculated daily cumulative thaw degree days (TDDs) using the North American Daymet (V4) record (Thornton et al., 2021). We obtained the Daymet data through the Microsoft Planetary Computer STAC, which is produced by the Oak Ridge National Laboratory DAAC. Daymet provides 1 km spatial resolution, interpolated from daily weather station temperature observations, but with potential bias introduced from the sparse regional weather station network, especially at higher elevations. TDD serves as a useful proxy for assessing the amount of incoming solar radiation the snowpack has been exposed to at a given location and reflects the seasonal dynamics of anomalous temperatures that influence snowmelt onset. TDD was created by summing daily mean temperatures above 0 °C for each pixel during the melt season. To facilitate comparison across regions and years, TDD values were normalized to a cumulative percent scale (0 %–100 %), where 0 % represents the onset of above-freezing temperatures and 100 % represents the total seasonal accumulation. TDD was created by summing daily mean temperature above 0 °C for each pixel and is returned as a cumulative percent.

3.2.4 MERRA-2 total column water vapor

Precipitable water vapor and precipitating clouds can affect PMW observations (Du et al., 2015) and have adverse effects on snow retrievals (Dolant et al., 2016). To account for these effects on SSM/I and SSMIS evening observations, we incorporated total column water vapor (TQV) from the NASA MERRA-2 (Modern-Era Retrospective analysis for Research and Applications, Version 2) product (Gelaro et al., 2017). MERRA-2 is a global atmospheric reanalysis dataset that provides TQV estimates at a native spatial resolution of 0.5° latitude × 0.625° longitude and spans from 1980 to the present. We extracted TQV values corresponding to 18:00 Alaska local time to coincide with the satellite overpasses and resampled to 3.125 km using a nearest-neighbor interpolation.

3.3 Static datasets

We also used several static datasets for model training and to examine the influence of land cover on snow phenology prediction. We represented elevation using the GTOPO30 dataset at a 1 km resolution and used it to derive terrain aspect. To better capture fine-scale terrain variability, we also incorporated additional topographic metrics derived from the ALOS World 3D-30m DEM (Tadono et al., 2014). Specifically, we calculated the standard deviation of elevation at the 3.125 km EASE grid scale, which provides a proxy for local topographic complexity and helps mitigate known T_b retrieval uncertainties in high-relief regions; henceforth this variable is referred to as topographic variability (Xiong et al., 2022).

We acquired average fractional water inundation (FW) from the global land parameter data record, generated from the AMSRE and AMSR-2 records (Du et al., 2017). In addition to prediction and assessing uncertainty, FW served as a mask to screen model outputs likely affected by water contamination.

To represent percent fractional tree cover (FTC), we utilized the MODIS MOD44B V005 500 m Vegetation Continuous Fields product. Additionally, we created a custom dataset, termed “proximity”, which captures the distance of each pixel from the ocean. This is important because T_b pixels near large water bodies are prone to water contamination and frequent cyclonic events that can influence LWC in the regional snowpack (Rees et al., 2010).

3.4 Ancillary datasets

We used an established satellite-based snow phenology dataset for comparison with our ML results. These data include the annual timing of snowmelt onset and snow-off for the YRB from 1988 to 2018 mapped to a 6.25 km resolution grid as a day of year (DOY) (Pan et al., 2020, 2021). The data were also derived using a similar thresholding approach of the GRP and TBD derived from microwave T_b observations. Additionally, a glacier land cover dataset for the YRB was obtained from the Glacier Covered Area for the State of Alaska dataset (Roberts-Pierel et al., 2022), which we used to identify pixels that maintain year-round snow cover, as they do not experience a “snow-off”.

4 Methods

4.1 RF framework

We implemented an RF classifier to predict snow phenology, specifically focusing on estimating the annual timing (DOY) of snowmelt onset and snow-off across the YRB. The RF approach was chosen due to its ability to handle complex, high-dimensional data and robustness to overfitting, making it well-suited for cryosphere applications (Breiman, 2001; Alifu et al., 2020; Blandini et al., 2023). Although RF does not inherently model temporal sequences like some other algorithms, our approach used daily inputs to generate a sequence of predictions that are later interpreted in chronological order. Each day is treated as an independent sample during training and prediction, but the outputs are interpreted sequentially to identify the timing of snowpack transitions, such as the first wet snow day, indicating snowmelt onset. This post-prediction sequencing is critical for deriving the correct DOY phenology metrics. Accordingly, we used the RF implementation in scikit-learn (Pedregosa et al., 2011).

In this study, the RF model snow phenology metrics were derived at 3.125 km resolution from 1988–2023, representing a valuable spatial and temporal enhancement over previous snow records developed for the YRB (Pan et al., 2021). The earlier threshold-based dataset was produced at 6.25 km and covered the years 1988–2016, later extended to 2018. In contrast, the dataset presented here extends through 2023, offering a 7-year increase in temporal coverage. Although the original K-band T_b grid resolution was 6.25 km, the 3.125 km resolution of the RF predictions is consistent with the grid resolution of the Ka-band T_b record, which may help to improve the spatial delineation of snowpack characteristics. Additionally, the 3.125 km resolution is approximate to – or still coarser than – other RF model predictor datasets used, which helps to ensure spatial coherence in representing landscape heterogeneity.

4.1.1 RF model setup

To enhance the prediction of snow phenology, we configured the RF model to delineate daily snow conditions. We did this by classifying expected snow conditions for each day in a time series leading up to the observed snowmelt onset or snow-off day in spring as either “dry snow” or “present”. After the observed onset or snow-off day, the conditions are labeled as “wet snow” or “absent”, respectively. The labeling approach transforms each time series into a sequence of daily snow condition classes, enabling a clear and daily description of a given evolving phenological event. By labeling the time series accordingly, we were able to (1) add a temporal dimension to the RF classifier, (2) expand the RF training and testing datasets, and (3) use the day each labeled time series changes as the snow phenology date (Fig. 2).

https://tc.copernicus.org/articles/19/2797/2025/tc-19-2797-2025-f02

Figure 2Comparison between daily in situ air temperature and snow depth measurements (a) taken from Fairbanks International Airport in 2012 and collocated brightness-temperature-derived TBD and GRP (b). Panel (c) shows the daily snowmelt onset RF model output. The snowmelt onset, marked by the transition from dry to wet snow, is identified as the day when the model output changes from 0 to 1.

Download

To assess daily snow conditions, each model was trained using a set of daily and static predictors. Daily predictors included TDD, TBD, GRP TQV, and DOY. Static predictors, representing landscape and environmental characteristics, included proximity to oceans, FTC, FW, elevation, topographic variability, and aspect. All static variables were scaled from 0 to 1. Table 1 includes more detail on the model training datasets. Snow-off included snowmelt onset as a predictor with the intention that this variable would ensure that snow-off predictions would occur after the snowmelt onset.

We parameterized the RF models using a cross-validated grid search method. This approach systematically evaluates various combinations of hyperparameters to identify the best configuration by performing cross-validation. It selects the combination of parameters that minimizes the user-defined evaluation metric, such as the cross-validated score (Jääskeläinen et al., 2022). The grid of adjustable parameters we provided for hyper-tuning included the number of RF decision trees, the maximum depth of each tree, the minimum sample size for node splitting, the minimum sample size for leaf nodes, and the maximum number of features considered at each split. In addition to parameter selection, the cross-validated score also helps minimize overfitting. A full list of tested parameter ranges and the selected final values for each model is provided in Table A2.

Finally, we identified the timing of snowmelt onset and snow-off from the model outputs by applying a logic that returned the first day of 10 consecutive days classified as either “wet snow” or “absent”.

4.1.2 Assessing uncertainty and error

To ensure independence between training and testing data, we implemented our $80 / 20$ split at the time series level such that each time series represents a single pixel across a full snow season and is treated as an individual unit. This precaution avoided any overlap in time or space between the training and testing sets. Consequently, no individual pixel's time series contributed data to both training and testing subsets in each iteration, ensuring that model evaluation reflected only out-of-sample predictions.

Model performance was assessed using a bootstrapping approach, with an $80 / 20$ split between training and testing data. For each iteration, both datasets were randomly sampled with replacement from the full dataset, allowing us to evaluate model accuracy and variability. Performance was evaluated with the testing data by (1) extracting the R² value to quantify the agreement between observed and predicted dates and (2) aggregating the mean absolute error (MAE) across different land cover types. To determine whether differences in model error across land cover characteristics were statistically significant, we applied a one-way analysis of variance (ANOVA). For each iteration, we also calculated feature importance using the built-in mean decrease in impurity method from the RF algorithm and calculated the average importance and standard deviation for each feature.

The output absolute error from our model bootstrapping was used as the dependent variable in an ordinary least-squares (OLS) regression, with land cover variables such as FW, FTC, elevation, topographic variability, aspect, and proximity serving as the explanatory variables (Kim et al., 2011). The goal was to establish a relationship between the observed error and the land cover characteristics to identify pixels in the YRB where we may expect lower or higher errors. We then applied the OLS model across the YRB to predict anticipated error. These values were scaled from 0 to 1, creating a dimensionless quality control (QC) metric. The QC metric was further classified into a quantile classification, with qualitative labels of “best”, “good”, “moderate”, and “low” to describe the relative quality of the model predictions. This qualitative classification simplifies interpretation and communication of model quality across the basin. While this process discretizes a continuous variable, the underlying QC values are retained and available.

4.2 Trend analysis

4.2.1 Snow phenology

To analyze snow phenology trends over time, we developed snow phenology climatology for the period 1991–2020, which corresponds to the current 30-year climate normal (Palecki et al., 2021). Using a natural break classification method, we divided the data into two categories: “earlier” and “later” snow events. In this context, a “snow event” refers to an annual snow phenology transition – such as snowmelt onset or complete snow disappearance – at the pixel level. Each pixel's event date was compared to its long-term climatological mean (1991–2020). If the event occurred earlier than the mean, it was classified as “earlier”; if it occurred later, it was classified as “later”.

This classification enabled us to calculate the total area (in km²) of the basin experiencing earlier or later snow events in each year. This spatially aggregated approach provides a more hydrologically meaningful metric than raw day-of-year values, as it reflects how much of the landscape is contributing to earlier or delayed runoff. Framing the results relative to the climatological mean also contextualizes each year's snowmelt timing within a long-term baseline, helping to interpret the magnitude and direction of interannual variability.

Next, for each year, we classified the snow metrics using the same two categories derived from the climatology and calculated the annual change in area for each class. If the area of the “earlier” class decreased, we expected a corresponding increase in the “later” class. To assess the trends over time, we applied a linear regression and performed a Mann–Kendall test (MKT) to evaluate the direction and strength of the annual changes in area, where the MKT outputs tau, a dimensionless, nonparametric measure of monotonic trend and strength (Kendall, 1962).

Because trends over the full period of record were generally weak or nonsignificant, we further examined the potential for meaningful sub-period patterns by dividing the 35-year time series (1988–2023) into two equal halves: 1988–2005 and 2006–2023. This split allowed us to evaluate whether changes in snow phenology were occurring within shorter timeframes that may have been masked by variability across the full record.

4.2.2 Temperature and snow depth

Seasonal air temperature across the YRB was analyzed using data from GHCNd climate stations to assess long-term climate trends and their relationship to changes in snowpack conditions. To create a single, harmonized air temperature time series, we selected stations with at least 17 years of data for each of the two time periods (1988–2005 and 2006–2023) from an initial set of 35 stations in the YRB. This selection criterion reduced the set to 8 stations for 1988–2005 and 15 stations for 2006–2023. With the selected stations, we then calculated seasonal average temperature time series for each period, specifically for winter, spring, summer, and combined spring/summer temperatures. These seasonal means were used in a trend analysis to evaluate how air temperatures have changed over time across the YRB and to explore their potential associations with observed snowmelt dynamics.

We also extracted the snow depth at the day of snowmelt onset across YRB using the GHCNd climate stations. Like temperature, we required a station to have recorded at least 17 years of snow depth measurements. We also checked each of these stations to determine if the annual snow depth measurements were complete and without extended data gaps. These screening criteria resulted in 4 stations selected for 1988–2005 and 13 stations for 2006–2023. The resulting snow depth data were used to analyze trends in snow cover.

5 Results

5.1 RF model performance

The RF model effectively classified daily snow conditions for both snowmelt onset and snow-off, as demonstrated by the bootstrapped testing results. For snowmelt onset, the model classified snow conditions as either “dry snow” or “wet snow” with high accuracy. The model achieved an F1 score, precision, and recall averaging 0.97 on the testing data, indicating a strong ability to balance false positives and false negatives. These metrics suggest that the RF model reliably distinguished between dry and wet snow conditions leading up to snowmelt onset. Grid search results for the RF are found in Table A2.

Similarly, for snow-off, the RF model successfully classified daily snow conditions as either “present” or “absent”. The model maintained an average F1 score, precision, and recall of 0.96 on the testing data. This consistent performance highlights the model's ability to accurately capture the transition between snow presence and absence throughout the snow-off period, providing dependable predictions of snow cover dynamics.

Once the snowmelt onset and snow-off days of year were extracted, they were compared against our testing data generated during bootstrap iterations. These stations are located both inside and outside the YRB. In each iteration, an $80 / 20$ (training/testing) split with replacement ensured that the testing data represented a unique subset of high-quality observations from different years and locations, allowing the model's generalizability to be evaluated across a variety of conditions. The bootstrapped results yielded an average R² of 0.72 for snowmelt onset and 0.80 for snow-off, demonstrating that the predicted snow DOY metrics closely matched the observed values from the testing data. Additionally, the model produced an MAE of 6.11 d for snowmelt onset and 5.73 d for snow-off. The root mean square error (RMSE) values of the model results were 8.30 d for snowmelt onset and 7.70 d for snow-off. We also assessed model bias by evaluating the mean error across all testing observations. Results showed minimal bias in both snowmelt onset (mean error = 0.59 d) and snow-off timing (mean error = 0.20 d), indicating that the model does not systematically overpredict or underpredict the timing of these events. Overall, these metrics indicate favorable model performance in predicting the timing of these key snow phenology events across the YRB.

The final error assigned to the snow phenology dataset is assessed by comparing the RF model predictions across the full YRB gridded dataset with an independent testing dataset derived from the limited number of GHCNd stations located within the YRB. From these stations we calculated an MAE 10.54 d and RMSE of 13.68 d relative to the snowmelt onset product. The modeled snow-off results showed an MAE of 18.1 d and RMSE of 20.77 d relative to the station observations. The higher final observed errors, compared to the bootstrapped errors, are attributed to the greater heterogeneity in land cover and terrain across the basin, which contributes to larger differences between the sparse in situ ground station measurements of local snow conditions and the coarser landscape-level RF predictions.

5.1.1 Model feature importance

On average, the most influential features for predicting snowmelt onset were DOY, TDD, TQV, snow cover presence, and TBD, in that order (Fig. A1). The snow-off predictions followed a similar pattern, with TDD, DOY, snow cover presence, and TBD emerging as the top-ranked features (Fig. A2). In both models, the dynamic time series features – such as temperature and snow cover presence – played a significantly larger role in the predictions compared to the static features, such as proximity, elevation, and fractional water cover. Interestingly, the GRP had relatively low importance for the snow-off model, which is likely due to its erratic behavior during no-snow conditions and in areas with significant vegetation cover.

5.1.2 Land cover and uncertainty

Mean absolute errors were binned by land cover features to assess whether these characteristics had a significant influence on model performance. These MAE values were derived from the bootstrapped model predictions, aggregated across all iterations, and grouped by land cover type. Land cover features such as elevation, topographic variability, FTC, and proximity, were grouped into four natural breaks and compared with the corresponding model MAE to identify potential patterns or relationships. FW was grouped into three bins including “low (FW <5 %)”, “medium (5 % $\leq FW < 10$ %”, and “high (FW ≥10 %)”. Figure 3 indicates that when elevation, proximity, and FW decrease, MAE also decreases for both snowmelt onset and snow-off predictions. Conversely, as FTC increases, MAE also increases, though this is only observable for snowmelt onset predictions. Also notable is that higher FW is associated with higher MAE values for snowmelt onset. FW values exceeding 10 % are often indicative of the presence of standing water, riparian zones, or proximity to lakes and rivers (Du et al., 2016). These hydrologically active areas introduce substantial variability in surface emissivity, which can obscure or distort the PMW signal used to detect snow state transitions. Hence, FW may be a major factor behind the overall lower model performance, relative to snow-off. Yet, overall, these land cover and error interactions are as anticipated – error increases with higher surface water cover, coastal proximity, and tree cover.

https://tc.copernicus.org/articles/19/2797/2025/tc-19-2797-2025-f03

Figure 3Average snowmelt onset (left) and snow-off (right) MAE aggregated by land cover features. Land cover was binned using a Jenks classification, except for aspect and FW with lower values on the left and higher values moving to the right.

Download

The one-way ANOVA results indicate that each land cover characteristic has a statistically significant influence on model error (MAE), with p values well below 0.05. For both snowmelt onset and snow-off, FW had the largest effect size, with F statistics of 28.34 and 26.27, respectively. FTC and proximity to water bodies also exerted strong influences on MAE. Specifically, FTC yielded F statistics of 75.88 for snowmelt onset and 22.62 for snow-off, while proximity showed F=18.65 for snowmelt onset and 19.57 for snow-off. Topographic variability and latitude also contributed significantly, suggesting that terrain complexity and regional gradients play important roles in prediction uncertainty.

These results indicate that model performance varies significantly with variations in certain land cover features, suggesting that these land characteristics are associated with higher or lower prediction errors. However, ANOVA alone does not demonstrate whether these static land features improve model performance when included as predictors. To test this, we compared RF model performance with and without the inclusion of static variables. When all predictors (both dynamic and static) were used, the R² values were 0.72 for snowmelt onset and 0.80 for snow-off. In contrast, when static variables were excluded, R² dropped to 0.64 and 0.40, respectively. These results support the inclusion of static variables as additional RF predictors, despite their relatively low individual importance, as they contribute to overall model performance gain.

5.2 Model comparisons

5.2.1 RF and threshold comparison

A comparison between the annual median snowmelt onset and snow-off dates derived from the RF model and another established snow phenology record (Pan et al., 2021) is presented in Fig. 5. For snowmelt onset, the results show a moderate correlation between the two records for the YRB, with an r value of 0.58 (p<0.05). However, the previous record was derived using a T_b thresholding method and consistently predicted earlier snowmelt onset dates, averaging about 1.5 d earlier than our RF model. When compared to the in situ testing dataset within the YRB, the previous snow record produced an MAE of 11 d and an RMSE of 14.57 d, similar to our RF model performance.

For snow-off, the two records displayed a stronger correlation, with an r value of 0.80 (p<0.05). The previous approach still showed earlier snow-off bias, averaging about 2 d earlier than the RF-derived snow-off date. Despite the stronger correlation, the thresholding approach returned a high MAE of 32 d and an RMSE of 54 d, which is about double the error derived for the RF method. The RF-derived snowmelt onset and snow-off results also exhibit significantly lower standard deviations compared to the previous approach, indicating that the ML method is less susceptible to outliers.

https://tc.copernicus.org/articles/19/2797/2025/tc-19-2797-2025-f04

Figure 41988–2018 annual median dates ± 1 standard deviation for snowmelt onset (a) and snow-off (b) for the ABoVE GRP threshold model (Pan et al., 2020) (black triangles) and the RF model (this study) (gray circles).

Download

5.2.2 Snow-off model comparison

Our snow-off predictions incorporate two different modeled snow cover datasets, IMS and SnowModel, because no dataset alone spans our full period of record; both datasets were used as features in the RF model. Specifically, we used the 4 km IMS dataset (2004–2023) and the 3 km SnowModel (1988–2003) to calculate annual snow-off using a 10 d moving window. We then evaluated these outputs against the YRB training dataset to assess their performance relative to the RF results. For the period from 2004–2023, the IMS dataset achieved an MAE of 15.87 d and an RMSE of 21 d. During this same period, the RF-based snow-off dataset achieved an MAE of 16.03 d and an RMSE of 19.2 d.

For the earlier period (1988–2003), the SnowModel dataset produced an MAE of 13.6 d and an RMSE of 16.42 d for snow-off timing. In comparison, the RF snow-off dataset during this period produced an MAE of 18.56 d and an RMSE of 21.2 d. These results reflect the performance of each dataset over their respective timeframes and provide insight into the reliability of the RF-based snow-off model across different periods.

To assess the consistency and variability between the two input datasets, we compared overlapping IMS- and SnowModel-derived snow-off dates for three representative years (2008, 2012, and 2016) (Fig. A3). These years span a range of snow conditions during the period of dataset overlap (2004–2020). We found strong agreement in the spatial patterns and distributions of snow-off timing. For example, the median snow-off date in 2008 was DOY 132 for IMS and DOY 135 for SnowModel. In 2012, both datasets reported a median DOY of 135, and in 2016, IMS had a median DOY of 126 compared to DOY 125 for SnowModel. This close agreement suggests that although the IMS and SnowModel datasets originate from different methodologies, they produce comparable snow-off estimates when applied over the same regions and timeframes. This general consistency indicates that our use of these datasets as temporally segmented predictors does not introduce significant bias and the RF model performance is stable across the full record.

5.3 QAQC maps

The QAQC maps provide a discrete qualitative index for assessing model output quality. These maps were generated by classifying the predicted model error into four categories – “best”, “good”, “moderate”, and “low” – using a quantile classification. This classification divides the full range of predicted error into bins such that each category contains close to equal numbers of valid pixels. Because the QAQC maps represent relative predicted error across the domain rather than fixed error thresholds, a quantile classification scheme was used to ensure consistent and interpretable comparisons between snowmelt and snow-off model performance (Fig. 5). The land-cover-derived QAQC map for snowmelt onset identified the mouth of the Yukon and lowlands of the YRB as principal regions of lower model quality. This is likely due to the abundant small lakes and wetlands in this region and its comparatively lower elevation and closer ocean proximity, delivering periodic systems, more ephemeral snowmelt events, and LWC to the snowpack.

https://tc.copernicus.org/articles/19/2797/2025/tc-19-2797-2025-f05

Figure 5Snowmelt onset (a) and snow-off (b) QAQC maps were developed to identify regions of relative high- to low-quality classification results in relation to land cover characteristics.

In the snow-off QAQC map, the spatial distributions for snow-off differ from snowmelt onset due to differences in the seasonal dynamics of the two events. Snow-off typically occurs more gradually and can span a longer duration, especially in mountainous terrain, where persistent late-season snow and greater terrain and microclimate heterogeneity contribute to larger model error. The snow-off QAQC map identified the upper headwaters of the YRB as having lower quality. However, lower-quality pixels are more prevalent at higher elevations and ridgelines. Given that snow cover at higher elevations can linger for extended periods and even through the summer months, it is not surprising that these pixels are ranked as “low”. The OLS models explained only 35 % and 42 % of the variability in error for snowmelt onset and snow-off, respectively. The relatively low explanatory power is likely due to the testing data not fully capturing the landscape heterogeneity across the YRB.

5.4 Snow phenology climatology, anomalies, and trends

5.4.1 Climatology of snowmelt onset, snow-off, and snowmelt duration

The climatology of snow phenology metrics – snowmelt onset, snow-off, and snowmelt duration (SMD) – offers valuable insights into seasonal patterns across the YRB, where snow phenology shows later snowmelt onset and snow-off as well as longer snow duration in headwaters and higher elevations; this pattern contrasts with generally earlier dates for these metrics at lower elevations and valley bottoms (Fig. 6). On average, snowmelt onset (MMOD) occurs around DOY 115±7.7 (∼24 April), with the earliest onset recorded on DOY 100 (∼9 April) and the latest on DOY 136 (∼15 May). Snow-off typically occurs around DOY 139±4.9 (∼18 May), with the earliest snow-off observed around DOY 125 (∼4 May) and the latest on DOY 168 (∼16 June). The snowmelt duration, defined as the period between snowmelt onset and snow-off, spans approximately 22±4.6 d.

https://tc.copernicus.org/articles/19/2797/2025/tc-19-2797-2025-f06

Figure 6Climatologies produced for snowmelt onset (a, d, g), snow-off (b, e, h), and snowmelt duration (c, f, i) for the years 1991–2020 in the top row. Panels (d)–(f) and (g)–(i) include the snow phenology for the years 2013 and 2016.

5.4.2 Anomalous years

Several anomalous years in snow phenology stand out, deviating from the climatological averages. In 2016, a record-breaking warm year (Walsh et al., 2017), snowmelt onset occurred ∼9 d earlier than average and snow-off 6 d earlier, lengthening snowmelt duration by 3 d beyond the mean. Conversely, in 2013, a cooler year, snowmelt onset was 12 d later and snow-off 7 d later, shortening the snowmelt duration by 5 d. These anomalies likely reflect broader climatic drivers, such as temperature fluctuations and abnormal precipitation, affecting snowmelt dynamics in these years. While these deviations are seen in other records (Pan et al., 2021), the RF model successfully reproduced the timing and magnitude of the anomalies, indicating its sensitivity to interannual climate variability.

5.4.3 Change in area over time

Temporal changes in area for the “earlier” class from 1988–2023 (Fig. A4) showed no significant correlations for any snow metric. However, the Mann–Kendall test revealed positive and statistically significant (p<0.05) tau values for both snow-off and SMD, though these were modest at 0.2 and 0.25, respectively. To further examine potential trends, we segmented the data into two periods – 1988–2005 and 2006–2023 – and performed trend analysis on each segment independently (Fig. 7).

Annual changes in the snow metric's “earlier” class during the first half of the data record (1988–2005) identified a strong negative trend toward later snowmelt onset ( $r = - 0.65$ , $p < - 0.05$ , $tau = - 0.35$ , p<0.05). This implies that in the initial period of record, snowmelt onset was occurring earlier across the YRB relative to later years of this period. Conversely, SMD showed a modest positive trend (r=0.55, p<0.05, tau=0.39, p<0.05), which suggested a longer snowmelt duration during years with earlier snowmelt onset. Snow-off exhibited no significant trends during this period, with an r value of −0.24, indicating minimal directional change.

In the second half of the data record (2006–2023), annual changes in snowmelt onset displayed a shift to a positive trend, with an r value of 0.54 (p<0.05) and tau trend of 0.42 (p<0.05). This shift suggested that snowmelt onset had been occurring progressively earlier. Snow-off during this period also exhibited a positive trend, with an r value of 0.69 (p<0.05) and tau of 0.50 (p<0.05), indicating an earlier occurrence of snow-off as well. Interestingly, SMD did not show any significant trends during this period due to compensating changes in snowmelt onset and snow-off timing.

To determine whether the temporal patterns we observed were consistent with previous threshold-based snow phenology records, we conducted the same trend analysis using the threshold-based dataset for the period 1988–2018. This record was similarly subdivided into two sub-periods: 1988–2002 and 2003–2018. During the first period, both snowmelt onset and snow-off area exhibited statistically significant negative trends. Snowmelt onset had a Pearson correlation of $r = - 0.62$ (p<0.05) and $tau = - 0.37$ (p>0.05), while snow-off showed a stronger decline with $r = - 0.66$ (p<0.05) and $tau = - 0.52$ (p<0.05), indicating a decreasing extent of earlier transitions across the basin. In contrast, SMD during this period showed no significant trend ( $r = - 0.25$ , p>0.05; $tau = - 0.18$ , p>0.05). For the second period (2003–2018), none of the snow metrics exhibited statistically significant trends. These results suggest that while some temporal changes were evident in earlier decades – particularly for snow-off – the RF model captures more consistent and recent trends (2006–2023).

https://tc.copernicus.org/articles/19/2797/2025/tc-19-2797-2025-f07

Figure 7Annual changes in area between the “earlier” climatology and the “earlier” class for each year. The left plot shows a decreasing trend for snowmelt onset but little change for snow-off from 1988–2005, while the right plot shows a significant increase in change in area for both snow metrics from the latter period of 2006–2023.

Download

5.5 Temperature and snow depth trends across the YRB

Seasonal temperatures and annual snowfall in the YRB were analyzed using in situ measurements from GHCNd stations. For the period 1988–2023, no significant trends were identified in winter, spring, summer, or spring/summer temperatures or in annual median snow depth on the day of snowmelt onset. In the following sections, we present trend analysis results for annual snowfall and seasonal temperatures across the two sub-periods – 1988–2005 and 2006–2023 – as well as correlations with the annual snow metrics.

5.5.1 Snow depth at snowmelt onset

Between 1998 and 2005, median snow depth on the day of snowmelt onset exhibited a moderately strong negative correlation with time ( $r = - 0.58$ , p<0.05; $tau = - 0.40$ , p<0.05), indicating a decrease in snowfall totals during this period (Fig. 8). In contrast, from 2006 to 2023, median snow depth displayed positive and significant correlations and trends (r=0.68, p<0.001; tau=0.40, p<0.05). We did not examine correlations between snow depth at snowmelt onset and snow phenology metrics, as these analyses would likely introduce bias due to the use of the day of year (DOY) of snowmelt onset in both training and testing datasets.

https://tc.copernicus.org/articles/19/2797/2025/tc-19-2797-2025-f08

Figure 8Harmonized median snow depth on the day of snowmelt onset across the YRB as measured from GHCNd stations.

Download

5.5.2 Seasonal temperature

Trends in seasonal temperatures from 1988–2005 identified a significant increase in winter temperatures, with an r value of 0.46 (p<0.1) and tau of 0.35 (p<0.05) (Fig. 9). Winter temperatures were also negatively correlated with snowmelt onset ( $r = - 0.47$ , p<0.05), indicating that warmer winters were associated with earlier snowmelt. Additionally, snow-off showed a strong positive correlation with spring temperatures (r=0.69, p<0.01). Summer temperatures during this period were moderately correlated with both snowmelt onset and snow-off, with r values of 0.52 and 0.58 (p<0.05), respectively.

From 2006–2023, no seasonal temperatures exhibited significant trends over time. However, winter and spring/summer temperatures had positive tau values of 0.33 and 0.32 (p<0.05), suggesting a slight warming trend. Interestingly, snowmelt onset and snow-off were positively correlated with spring/summer temperatures, with r values of 0.49 and 0.65 (p<0.01), respectively. A significant correlation was also identified between spring temperatures and snowmelt onset (r=0.41, p<0.1), indicating that warmer springs may contribute to earlier snowmelt.

https://tc.copernicus.org/articles/19/2797/2025/tc-19-2797-2025-f09

Figure 9Harmonized seasonal temperatures across the YRB as measured from GHCNd stations.

Download

6 Discussion

6.1 Model performance and limitations

The RF model classified daily snow conditions effectively, achieving high precision and recall scores, underscoring its reliability in predicting snowmelt onset and snow-off. This performance is particularly noteworthy in the complex landscape of the YRB, where traditional threshold-based methods often struggle due to heterogeneous land cover and atmospheric conditions (Pan et al., 2021). By incorporating dynamic time series data, such as cumulative TDD and TBD, the model produced favorable predictions of snow phenology events. The inclusion of a temporal dimension within the RF framework further enabled the model to track the seasonal evolution of snow cover, enhancing model accuracy in predicting both snowmelt onset and snow-off.

In comparing bootstrapped performance metrics, the snow-off model outperformed the snowmelt onset model, a result anticipated due to the greater variability and influencing factors associated with detecting snowmelt onset. However, errors calculated by comparing in situ observations with full model outputs showed that snow-off predictions had a higher MAE and RMSE. Notably, when RF snow-off errors were compared with errors derived from IMS and SnowModel snow-off data, they were found to be similar. This suggests that (1) accurately capturing snow-off at a single point location remains challenging due to high spatial variability at the 3.125 km spatial scale, (2) the RF snow-off model performs on par with other established snow-off datasets, and (3) similar uncertainties and bias are also present in other available snow products.

The RF model also faces challenges related to sampling bias due to the uneven distribution of ground-based snow depth measurements used for training and testing (Tedesco and Jeyaratnam, 2016; Tsai et al., 2019). The GHCNd stations are mostly located in accessible, lower-elevation areas and represent a very small relative local area within a coarser grid cell being evaluated. This also introduces bias into the modeled predictions for underrepresented regions, such as higher elevations or areas of higher FW in the YRB. The scarcity of in situ observations in these areas further limits the ability of the model to generalize across different land cover features. However, emerging technologies, such as camera traps, have shown promise in measuring seasonal snow depth at varying elevations (Breen et al., 2023, 2024). These tools could expand the spatial distribution of snow measurements while reducing potential spatial bias, offering a valuable enhancement for future snow phenology studies.

We found that the RF model predicted snowmelt onset on average 3 d later than an established satellite snow phenology record derived from a T_b threshold method (Pan et al., 2021). This earlier onset predicted by the previous record is likely due to the threshold algorithm misinterpreting seasonal melt events as the main snowmelt onset event. Addressing this misinterpretation was one of the primary motivations for exploring ML in snowmelt onset detection. The RF model, with its logical structure and ancillary data, is likely better able to distinguish between transient melt events and the true seasonal melt onset. Additionally, the coarser resolution of the previous dataset (6.25 km) may introduce bias by failing to capture finer landscape heterogeneity, particularly at higher elevations. These high-altitude areas, which are a smaller portion of the domain, tend to exhibit a lag in spring snow metrics compared to lower elevations. Importantly, the RF model incorporates predictors such as topographic variability and total precipitable water from MERRA-2, which help account for topographic complexity and atmospheric water vapor content – two major sources of T_b error and cloud-related signal contamination in PMW retrievals. This allows for lower ML uncertainty in high-relief regions where terrain-driven heterogeneity and atmospheric interference are common.

The RF models showed greater uncertainty in certain YRB sub-regions, particularly at higher elevations and in coastal areas. These regions likely pose challenges due to their topographic complexity and proximity to large water bodies, which can introduce both T_b noise and variability in snowpack LWC (Du et al., 2016; Nagler and Rott, 2000). The QAQC maps highlighted areas with higher RF prediction errors and indicate the need for further model improvements in these regions. Incorporating higher-resolution satellite data like SAR and other predictors to better account for landscape heterogeneity and other influential features could improve future iterations of the model and address some of the scale-dependent uncertainties (Darychuk et al., 2023; Gagliano et al., 2023; Marin et al., 2020).

Although the RF model achieved MAE values like or slightly higher than those from previous threshold-based methods, it offers distinct advantages not reflected in summary metrics alone. The bootstrapping results indicate that when the prediction data are well-represented within the training dataset – such as under similar climate or land cover conditions – the RF model yields substantially lower errors. This highlights the model's ability to generalize effectively when sufficient representative training data are available.

Furthermore, the RF approach reduces false detections of snowmelt onset caused by transient warming events by learning from multivariate seasonal context, rather than relying on fixed thresholds. It also performs more reliably in heterogeneous and hydrologically complex regions, such as areas near lakes or coastlines, where PMW land signals are often confounded by water contamination. Finally, the RF flexibility allows for seamless integration of new predictors, such as SAR data or reanalysis-based forcings, enabling continuous model refinement. Together, these strengths – daily classification of snow state, spatially explicit uncertainty, improved seasonal tracking, and enhanced adaptability – represent a meaningful advancement over traditional thresholding methods for operational snow monitoring in the YRB.

6.2 Implications of changes in snow phenology

Annual changes in snow phenology are closely tied to current climate conditions, with snowmelt onset and snow-off generally occurring later in cooler years and earlier in warmer years. Notably, during warm years, snowmelt occurred earlier – by ∼9 d – while in cooler years, the delay was ∼12 d, relative to the climatology. And the difference in snowmelt onset between a warm year and cool year could be as much as 21 d. Earlier occurrence in snowmelt onset during warm years also generally translated into a longer melt duration, as noted from previous studies (Musselman et al., 2017).

Our analysis revealed that snowmelt onset trended toward a later date from 1988–2005, a result that might seem counterintuitive given the warming temperatures typically observed at northern latitudes. However, during this period, annual snowfall was particularly high at the beginning of the period (especially before 1994), and seasonal temperatures showed small changes. The deeper snowpack meant that more snow was available for melting earlier in the season, as reflected in the RF model outputs. This suggests that snowmelt onset is influenced more by the quantity of accumulated snow than by temperature alone (Barnett et al., 2005; Frei and Henry, 2022; Trujillo et al., 2012), except in cases of anomalously warm years, where elevated temperatures tend to override other factors (Musselman et al., 2017).

With temperatures remaining relatively stable from 1988–2005, snow-off timing followed typical seasonal patterns, explaining why we observed no significant changes during this period. Here we demonstrated the important role of winter snowfall in shaping springtime snow phenology, yet trends in snowfall patterns across Alaska are complex. From 1957 to 2021, winter snowfall equivalent increased across Alaska. However, northern and southern regions have seen a decline in snowfall during the spring and fall shoulder seasons, effectively shortening the snow cover duration (Ballinger et al., 2023).

We also showed an acceleration toward earlier snowmelt onset and snow-off timing during the latter half of the data record (2005–2023). These changes align with recent warming trends in average spring/summer air temperature measurements in the YRB. Over the last century, Alaska has experienced varying increases in temperature across different climate divisions (Bieniek et al., 2014), with record-breaking warmth in recent years (Lara et al., 2021; Swanson et al., 2021; Walsh et al., 2017).

7 Conclusion

This study introduced an application of an RF approach to derive a 35-year snow phenology record across the YRB, delivering new insights into the timing and variability of seasonal snowmelt onset and snow-off across Alaska's largest drainage basin. Designed for enhanced delineation of spatial and temporal heterogeneity in snow metrics over more established satellite and model data records, the RF model effectively classified daily snow conditions, achieving reasonable accuracy in delineating snow phenology metrics across a highly varied landscape. By working with an improved spatial resolution of 3.125 km, the RF model was able to provide a more detailed representation of landscape features than previous T_b threshold-based snow phenology datasets, supporting more precise predictions across the YRB's diverse terrain. The enhanced spatial resolution proved especially valuable in depicting snow phenology in the region's remote, high-latitude environments, allowing the RF model to capture nuances in snowmelt timing across varying topographies, elevation ranges, and vegetation covers.

One significant advantage of the RF approach over traditional thresholding methods is the reduction in sensitivity to transient melt events and atmospheric fluctuations, making a more reliable identification of primary snowmelt onset rather than temporary thaw and early melt events caused by brief warming episodes. Additionally, the integration of dynamic predictions within the RF model, including cumulative thaw degree days and snow cover presence, allowed the capturing of the seasonal evolution of snow conditions, while remaining less prone to errors related to isolated atmospheric warming events. However, certain challenges emerged, particularly in areas with complex topography, such as in high-elevation and coastal zones, where model prediction errors were greater. These challenges likely reflect the difficulties in addressing highly localized factors, like changes in snowpack liquid water content, or terrain-induced microclimate variability.

As with many remote sensing models, sample bias in the RF model due to uneven ground-based data coverage poses a limitation, as in situ snow depth measurements are predominantly collected in accessible, lower-elevation regions. This bias suggests the need for continuous updates to the in situ training dataset, particularly by expanding measurements in higher-altitude and coastal areas within the YRB. Incorporating more extensive in situ observations would improve the model's accuracy in underrepresented regions, allowing for a more comprehensive understanding of snow phenology across the YRB. By overcoming these current limitations and incorporating higher-resolution remotely sensed data sources, such as SAR, future iterations of the model could further enhance snow phenology monitoring in the YRB, making it a critical tool for understanding snow-related dynamics in response to climate change.

Finally, this study produced an extended snow phenology record spanning more than 30 years to better distinguish climate normals and quantify long-term climate trends in the YRB. By segmenting the 35-year record into two timeframes (1988–2005 and 2006–2023), we were able to detect distinct temporal trends in the spring snow metrics that corresponded to changes in seasonal temperatures and snowfall patterns. The analysis revealed that in earlier years, snowmelt onset tended toward later dates, influenced largely by higher snowfall amounts and stable seasonal temperatures. However, in more recent years (2006–2023), both snowmelt onset and snow-off timing have advanced significantly, coinciding with rising spring and summer temperatures across the YRB. These phenological shifts, along with the lengthening of the snow-free season, align with observed patterns of earlier spring onset and more frequent anomalous warming events in recent years. The resulting snow phenology trends offer valuable insight into the YRB's changing climate and highlight the increasing influence of warming on snowpack dynamics, which hold implications for regional water availability, ecosystem health, and community resilience in the face of accelerated climate change.

Appendix A

Table A1Descriptions of datasets used in this study and their sources.

Download Print Version | Download XLSX

Table A2Grid search results for RF hyperparameters.

Download Print Version | Download XLSX

https://tc.copernicus.org/articles/19/2797/2025/tc-19-2797-2025-f10

Figure A1Snowmelt onset variable feature importance with ± 1 standard deviation over 20 bootstrap iterations.

Download

https://tc.copernicus.org/articles/19/2797/2025/tc-19-2797-2025-f11

Figure A2Snow-off variable feature importance with ± 1 standard deviation over 20 bootstrap iterations.

Download

https://tc.copernicus.org/articles/19/2797/2025/tc-19-2797-2025-f12

Figure A3Histogram comparison of snow-off detection from SnowModel (orange) and IMS (blue) over three overlapping years including 2008, 2012, and 2016.

Download

https://tc.copernicus.org/articles/19/2797/2025/tc-19-2797-2025-f13

Figure A4Snowmelt onset climatology binned into two classes, “earlier” and “later”. This climatology was used to assess annual changes in snowmelt onset.

Code and data availability

Code and datasets produced in this study are available upon request.

Author contributions

CGP wrote the paper. CGP processed data and analyzed the results. CGP, KL, SPG, JSK, JD, and PBK contributed to the design and conceptualization. All coauthors contributed to writing and editing of the paper. All coauthors have read and agreed to the published version of this paper.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Any opinions expressed in this paper are those of the authors and are not to be construed as official positions of the funding agency.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.

Acknowledgements

We thank Benoit Montpetit and Devon Dunmire for their constructive reviews and contributions, which helped improve this paper. This research was funded by the US Army Corps of Engineers, Engineer Research and Development Center (ERDC) under PE 0602146A/AT9, project “Tactical Geospatial Information Capabilities”, task “Geospatial Analytics and Prediction”. Permission to publish was granted by the ERDC Public Affairs Office.

Financial support

This research has been supported by the Engineer Research and Development Center (grant no. PE 0602146A/AT9).

Review statement

This paper was edited by Jürg Schweizer and reviewed by Benoit Montpetit and Devon Dunmire.

References

Alifu, H., Vuillaume, J.-F., Johnson, B. A., and Hirabayashi, Y.: Machine-learning classification of debris-covered glaciers using a combination of Sentinel-1/-2 (SAR/optical), Landsat 8 (thermal) and digital elevation data, Geomorphology, 369, 107365, https://doi.org/10.1016/j.geomorph.2020.107365, 2020.

Bair, E. H., Dozier, J., Rittger, K., Stillinger, T., Kleiber, W., and Davis, R. E.: How do tradeoffs in satellite spatial and temporal resolution impact snow water equivalent reconstruction?, The Cryosphere, 17, 2629–2643, https://doi.org/10.5194/tc-17-2629-2023, 2023.

Ballinger, T. J., Bhatt, U. S., Bieniek, P. A., Brettschneider, B., Lader, R. T., Littell, J. S., Thoman, R. L., Waigl, C. F., Walsh, J. E., and Webster, M. A.: Alaska Terrestrial and Marine Climate Trends, 1957–2021, J. Climate, 36, 4375–4391, https://doi.org/10.1175/JCLI-D-22-0434.1, 2023.

Barnett, T. P., Adam, J. C., and Lettenmaier, D. P.: Potential impacts of a warming climate on water availability in snow-dominated regions, Nature, 438, 303–309, https://doi.org/10.1038/nature04141, 2005.

Beltaos, S. and Prowse, T.: River-ice hydrology in a shrinking cryosphere, Hydrol. Process., 23, 122–144, https://doi.org/10.1002/hyp.7165, 2009.

Berger, J., Hartway, C., Gruzdev, A., and Johnson, M.: Climate Degradation and Extreme Icing Events Constrain Life in Cold-Adapted Mammals, Sci. Rep., 8, 1156, https://doi.org/10.1038/s41598-018-19416-9, 2018.

Bieniek, P. A., Walsh, J. E., Thoman, R. L., and Bhatt, U. S.: Using Climate Divisions to Analyze Variations and Trends in Alaska Temperature and Precipitation, J. Climate, 27, 2800–2818, https://doi.org/10.1175/JCLI-D-13-00342.1, 2014.

Blandini, G., Avanzi, F., Gabellani, S., Ponziani, D., Stevenin, H., Ratto, S., Ferraris, L., and Viglione, A.: A random forest approach to quality-checking automatic snow-depth sensor measurements, The Cryosphere, 17, 5317–5333, https://doi.org/10.5194/tc-17-5317-2023, 2023.

Brabets, T., Wang, B., and Mead, R.: Environmental and hydrologic overview of the Yukon River basin, Alaska and Canada, Water-Resources Investigations Report 99-4204, U.S. Dept. of the Interior, U.S. Geological Survey ; Branch of Information Services, https://doi.org/10.3133/wri994204, 2000.

Breen, C., Vuyovich, C., Odden, J., Hall, D., and Prugh, L.: Evaluating MODIS snow products using an extensive wildlife camera network, Remote Sens. Environ., 295, 113648, https://doi.org/10.1016/j.rse.2023.113648, 2023.

Breen, C. M., Currier, W. R., Vuyovich, C., Miao, Z., and Prugh, L. R.: Snow Depth Extraction From Time-Lapse Imagery Using a Keypoint Deep Learning Model, Water Resour. Res., 60, e2023WR036682, https://doi.org/10.1029/2023WR036682, 2024.

Breiman, L.: Random Forests, Mach. Learn., 45, 5–32, https://doi.org/10.1023/A:1010933404324, 2001.

Brodzik, M. J. and Long, D. G.: MEaSUREs Calibrated Enhanced-Resolution Passive Microwave Daily EASE-Grid 2.0 Brightness Temperature ESDR, Version 1, Boulder, Colorado USA. NASA National Snow and Ice Data Center Distributed Active Archive Center [data set], https://doi.org/10.5067/MEASURES/CRYOSPHERE/NSIDC-0630.001, 2016.

Brodzik, M. J., Long, D. G., and Hardman, M. A.: Best Practices in Crafting the Calibrated, Enhanced-Resolution Passive-Microwave EASE-Grid 2.0 Brightness Temperature Earth System Data Record, Remote Sens., 10, 1793, https://doi.org/10.3390/rs10111793, 2018.

Brown, D. R. N., Brinkman, T. J., Bolton, W. R., Brown, C. L., Cold, H. S., Hollingsworth, T. N., and Verbyla, D. L.: Implications of climate variability and changing seasonal hydrology for subarctic riverbank erosion, Clim. Change, 162, 1–20, https://doi.org/10.1007/s10584-020-02748-9, 2020.

Callaghan, T. V., Johansson, M., Brown, R. D., Groisman, P. Ya., Labba, N., Radionov, V., Barry, R. G., Bulygina, O. N., Essery, R. L. H., Frolov, D. M., Golubev, V. N., Grenfell, T. C., Petrushina, M. N., Razuvaev, V. N., Robinson, D. A., Romanov, P., Shindell, D., Shmakin, A. B., Sokratov, S. A., Warren, S., and Yang, D.: The Changing Face of Arctic Snow Cover: A Synthesis of Observed and Projected Changes, AMBIO, 40, 17–31, https://doi.org/10.1007/s13280-011-0212-y, 2011.

Campbell, S. W., Briggs, M., Roy, S. G., Douglas, T. A., and Saari, S.: Ground-penetrating radar, electromagnetic induction, terrain, and vegetation observations coupled with machine learning to map permafrost distribution at Twelvemile Lake, Alaska, Permafrost Periglac., 32, 407–426, https://doi.org/10.1002/ppp.2100, 2021.

Carroll, M., Townshend, J., Hansen, M., DiMiceli, C., Sohlberg, R., and Wurster, K.: MODIS Vegetative Cover Conversion and Vegetation Continuous Fields, in: Land Remote Sensing and Global Environmental Change: NASA’s Earth Observing System and the Science of ASTER and MODIS, Remote Sensing and Digital Image Processing, vol. 11, edited by: Ramachandran, B., Justice, C. O., and Abrams, M. J., 725, https://doi.org/10.1007/978-1-4419-6749-7_32, 2011.

Cold, H. S., Brinkman, T. J., Brown, C. L., Hollingsworth, T. N., Brown, D. R. N., and Heeringa, K. M.: Assessing vulnerability of subsistence travel to effects of environmental change in Interior Alaska, E&S, 25, art20, https://doi.org/10.5751/ES-11426-250120, 2020.

Cosgrove, C. L., Wells, J., Nolin, A. W., Putera, J., and Prugh, L. R.: Seasonal influence of snow conditions on Dall's sheep productivity in Wrangell-St Elias National Park and Preserve, PLoS ONE, 16, e0244787, https://doi.org/10.1371/journal.pone.0244787, 2021.

Darychuk, S. E., Shea, J. M., Menounos, B., Chesnokova, A., Jost, G., and Weber, F.: Snowmelt characterization from optical and synthetic-aperture radar observations in the La Joie Basin, British Columbia, The Cryosphere, 17, 1457–1473, https://doi.org/10.5194/tc-17-1457-2023, 2023.

Dattler, M. E., Medley, B., and Stevens, C. M.: A physics-based Antarctic melt detection technique: combining Advanced Microwave Scanning Radiometer 2, radiative-transfer modeling, and firn modeling, The Cryosphere, 18, 3613–3631, https://doi.org/10.5194/tc-18-3613-2024, 2024.

Derksen, C. and Brown, R.: Spring snow cover extent reductions in the 2008–2012 period exceeding climate model projections, Geophys. Res. Lett., 39, 2012GL053387, https://doi.org/10.1029/2012GL053387, 2012.

Dolant, C., Langlois, A., Montpetit, B., Brucker, L., Roy, A., and Royer, A.: Development of a rain-on-snow detection algorithm using passive microwave radiometry, Hydrol. Process., 30, 3184–3196, https://doi.org/10.1002/hyp.10828, 2016.

Du, J., Kimball, J. S., and Jones, L. A.: Satellite Microwave Retrieval of Total Precipitable Water Vapor and Surface Air Temperature Over Land From AMSR2, IEEE T. Geosci. Remote, 53, 2520–2531, https://doi.org/10.1109/TGRS.2014.2361344, 2015.

Du, J., Kimball, J. S., Jones, L. A., and Watts, J. D.: Implementation of satellite based fractional water cover indices in the pan-Arctic region using AMSR-E and MODIS, Remote Sens. Environ., 184, 469–481, https://doi.org/10.1016/j.rse.2016.07.029, 2016.

Du, J., Kimball, J. S., Jones, L. A., Kim, Y., Glassy, J., and Watts, J. D.: A global satellite environmental data record derived from AMSR-E and AMSR2 microwave Earth observations, Earth Syst. Sci. Data, 9, 791–808, https://doi.org/10.5194/essd-9-791-2017, 2017.

Du, J., Kirchner, P. B., Pan, C. G., Watts, J. D., and Kimball, J. S.: Assessing rain-on-snow event dynamics over Alaska using 30 year satellite microwave observations, Environ. Res. Lett., 20, 034048, https://doi.org/10.1088/1748-9326/adb9ff, 2025.

Dunmire, D., Lievens, H., Boeykens, L., and De Lannoy, G. J. M.: A machine learning approach for estimating snow depth across the European Alps from Sentinel-1 imagery, Remote Sens. Environ., 314, 114369, https://doi.org/10.1016/j.rse.2024.114369, 2024.

Forman, B. A. and Xue, Y.: Machine learning predictions of passive microwave brightness temperature over snow-covered land using the special sensor microwave imager (SSM/I), Phys. Geogr., 38, 176–196, https://doi.org/10.1080/02723646.2016.1236606, 2017.

Frei, E. R. and Henry, G. H. R.: Long-term effects of snowmelt timing and climate warming on phenology, growth, and reproductive effort of Arctic tundra plant species, Arct. Sci., 8, 700–721, https://doi.org/10.1139/as-2021-0028, 2022.

Gagliano, E., Shean, D., Henderson, S., and Vanderwilt, S.: Capturing the Onset of Mountain Snowmelt Runoff Using Satellite Synthetic Aperture Radar, Geophys. Res. Lett., 50, e2023GL105303, https://doi.org/10.1029/2023GL105303, 2023.

Gelaro, R., McCarty, W., Suárez, M. J., Todling, R., Molod, A., Takacs, L., Randles, C., Darmenov, A., Bosilovich, M. G., Reichle, R., Wargan, K., Coy, L., Cullather, R., Draper, C., Akella, S., Buchard, V., Conaty, A., da Silva, A., Gu, W., Kim, G.-K., Koster, R., Lucchesi, R., Merkova, D., Nielsen, J. E., Partyka, G., Pawson, S., Putman, W., Rienecker, M., Schubert, S. D., Sienkiewicz, M., and Zhao, B.: The Modern-Era Retrospective Analysis for Research and Applications, Version 2 (MERRA-2)., J Climate, 30, 5419–5454, https://doi.org/10.1175/JCLI-D-16-0758.1, 2017.

Grenfell, T. C. and Putkonen, J.: A method for the detection of the severe rain-on-snow event on Banks Island, October 2003, using passive microwave remote sensing, Water Resour. Res., 44, 2007WR005929, https://doi.org/10.1029/2007WR005929, 2008.

Guidicelli, M., Huss, M., Gabella, M., and Salzmann, N.: Spatio-temporal reconstruction of winter glacier mass balance in the Alps, Scandinavia, Central Asia and western Canada (1981–2019) using climate reanalyses and machine learning, The Cryosphere, 17, 977–1002, https://doi.org/10.5194/tc-17-977-2023, 2023.

Helfrich, S. R., McNamara, D., Ramsay, B. H., Baldwin, T., and Kasheta, T.: Enhancements to, and forthcoming developments in the Interactive Multisensor Snow and Ice Mapping System (IMS), Hydrol. Process., 21, 1576–1586, https://doi.org/10.1002/hyp.6720, 2007.

Jääskeläinen, E., Manninen, T., Hakkarainen, J., and Tamminen, J.: Filling gaps of black-sky surface albedo of the Arctic sea ice using gradient boosting and brightness temperature data, Int. J. Appl. Earth Obs., 107, 102701, https://doi.org/10.1016/j.jag.2022.102701, 2022.

Kendall, M. G.: Rank Correlation Methods, Hafner Publishing Company, ISBN 0-85264-199-0, 1962.

Kim, Y., Kimball, J. S., McDonald, K. C., and Glassy, J.: Developing a Global Data Record of Daily Landscape Freeze/Thaw Status Using Satellite Passive Microwave Remote Sensing, IEEE T. Geosci. Remote, 49, 949–960, https://doi.org/10.1109/TGRS.2010.2070515, 2011.

Kim, Y., Kimball, J. S., Glassy, J., and Du, J.: An extended global Earth system data record on daily landscape freeze–thaw status determined from satellite passive microwave remote sensing, Earth Syst. Sci. Data, 9, 133–147, https://doi.org/10.5194/essd-9-133-2017, 2017.

Lara, M. J., Chen, Y., and Jones, B. M.: Recent warming reverses forty-year decline in catastrophic lake drainage and hastens gradual lake drainage across northern Alaska, Environ. Res. Lett., 16, 124019, https://doi.org/10.1088/1748-9326/ac3602, 2021.

Lesack, L. F. W., Marsh, P., Hicks, F. E., and Forbes, D. L.: Local spring warming drives earlier river-ice breakup in a large Arctic delta, Geophys. Res. Lett., 41, 1560–1567, https://doi.org/10.1002/2013GL058761, 2014.

Ling, F. and Zhang, T.: Impact of the timing and duration of seasonal snow cover on the active layer and permafrost in the Alaskan Arctic, Permafrost Periglac., 14, 141–150, https://doi.org/10.1002/ppp.445, 2003.

Liston, G. E., Itkin, P., Stroeve, J., Tschudi, M., Stewart, J. S., Pedersen, S. H., Reinking, A. K., and Elder, K.: A Lagrangian Snow‐Evolution System for Sea‐Ice Applications (SnowModel‐LG): Part I – Model Description, J. Geophys. Res.-Oceans, 125, e2019JC015913, https://doi.org/10.1029/2019JC015913, 2020.

Long, D. G. and Brodzik, M. J.: Optimum Image Formation for Spaceborne Microwave Radiometer Products, IEEE T. Geosci. Remote, 54, 2763–2779, https://doi.org/10.1109/TGRS.2015.2505677, 2016.

Marin, C., Bertoldi, G., Premier, V., Callegari, M., Brida, C., Hürkamp, K., Tschiersch, J., Zebisch, M., and Notarnicola, C.: Use of Sentinel-1 radar observations to evaluate snowmelt dynamics in alpine regions, The Cryosphere, 14, 935–956, https://doi.org/10.5194/tc-14-935-2020, 2020.

Meloche, J., Langlois, A., Rutter, N., Royer, A., King, J., Walker, B., Marsh, P., and Wilcox, E. J.: Characterizing tundra snow sub-pixel variability to improve brightness temperature estimation in satellite SWE retrievals, The Cryosphere, 16, 87–101, https://doi.org/10.5194/tc-16-87-2022, 2022.

Menne, M. J., Durre, I., Vose, R. S., Gleason, B. E., and Houston, T. G.: An Overview of the Global Historical Climatology Network-Daily Database, J. Atmos. Ocean. Tech., 29, 897–910, https://doi.org/10.1175/JTECH-D-11-00103.1, 2012.

Musselman, K. N., Clark, M. P., Liu, C., Ikeda, K., and Rasmussen, R.: Slower snowmelt in a warmer world, Nat. Clim. Change, 7, 214–219, https://doi.org/10.1038/nclimate3225, 2017.

Nagler, T. and Rott, H.: Retrieval of wet snow by means of multitemporal SAR data, IEEE T. Geosci. Remote, 38, 754–765, https://doi.org/10.1109/36.842004, 2000.

Niittynen, P., Heikkinen, R. K., and Luoto, M.: Snow cover is a neglected driver of Arctic biodiversity loss, Nat. Clim. Change, 8, 997–1001, https://doi.org/10.1038/s41558-018-0311-x, 2018.

Palecki, M., Durre, I., Applequist, S., Arguez, A., and Lawrimore, J.: U.S. Climate Normals 2020: U.S. Hourly Climate Normals (1991–2020), NOAA National Centers for Environmental Information, https://www.ncei.noaa.gov/access/metadata/landing-page/bin/iso?id=gov.noaa.ncdc:C01622 (last access: 1 August 2025), 2021.

Pan, C. G., Kirchner, P. B., Kimball, J. S., Kim, Y., and Du, J.: Rain-on-snow events in Alaska, their frequency and distribution from satellite observations, Environ. Res. Lett., 13, 075004, https://doi.org/10.1088/1748-9326/aac9d3, 2018.

Pan, C. G., Kimball, J. S., Munkhjargal, M., Robinson, N. P., Tijdeman, E., Menzel, L., and Kirchner, P. B.: Role of Surface Melt and Icing Events in Livestock Mortality across Mongolia's Semi-Arid Landscape, Remote Sens., 11, 2392, https://doi.org/10.3390/rs11202392, 2019.

Pan, C. G., Kirchner, P. B., Kimball, J. S., and Du, J.: A Long-Term Passive Microwave Snowoff Record for the Alaska Region 1988–2016, Remote Sens., 12, 153, https://doi.org/10.3390/rs12010153, 2020.

Pan, C. G., Kirchner, P. B., Kimball, J. S., Du, J., and Rawlins, M. A.: Snow Phenology and Hydrologic Timing in the Yukon River Basin, AK, USA, Remote Sens., 13, 2284, https://doi.org/10.3390/rs13122284, 2021.

Pedregosa, F., Pedregosa, F., Varoquaux, G., Varoquaux, G., Org, N., Gramfort, A., Gramfort, A., Michel, V., Michel, V., Fr, L., Thirion, B., Thirion, B., Grisel, O., Grisel, O., Blondel, M., Prettenhofer, P., Prettenhofer, P., Weiss, R., Dubourg, V., Dubourg, V., Vanderplas, J., Passos, A., Tp, A., and Cournapeau, D.: Scikit-learn: Machine Learning in Python, MACHINE LEARNING IN PYTHON, 12, 2825–2830, 2011.

Pfeffer, W. T., Arendt, A. A., Bliss, A., Bolch, T., Cogley, J. G., Gardner, A. S., Hagen, J.-O., Hock, R., Kaser, G., Kienholz, C., Miles, E. S., Moholdt, G., Mölg, N., Paul, F., Radić, V., Rastner, P., Raup, B. H., Rich, J., Sharp, M. J., and The Randolph Consortium: The Randolph Glacier Inventory: a globally complete inventory of glaciers, J. Glaciol., 60, 537–552, https://doi.org/10.3189/2014JoG13J176, 2014.

Pulliainen, J., Aurela, M., Laurila, T., Aalto, T., Takala, M., Salminen, M., Kulmala, M., Barr, A., Heimann, M., Lindroth, A., Laaksonen, A., Derksen, C., Mäkelä, A., Markkanen, T., Lemmetyinen, J., Susiluoto, J., Dengel, S., Mammarella, I., Tuovinen, J.-P., and Vesala, T.: Early snowmelt significantly enhances boreal springtime carbon uptake, P. Natl. Acad. Sci. USA, 114, 11081–11086, https://doi.org/10.1073/pnas.1707889114, 2017.

Ramage, J. M. and Isacks, B. L.: Determination of melt-onset and refreeze timing on southeast Alaskan icefields using SSM/I diurnal amplitude variations, Ann. Glaciol., 34, 391–398, https://doi.org/10.3189/172756402781817761, 2002.

Rantanen, M., Karpechko, A. Yu., Lipponen, A., Nordling, K., Hyvärinen, O., Ruosteenoja, K., Vihma, T., and Laaksonen, A.: The Arctic has warmed nearly four times faster than the globe since 1979, Commun. Earth Environ., 3, 168, https://doi.org/10.1038/s43247-022-00498-3, 2022.

Rees, A., Lemmetyinen, J., Derksen, C., Pulliainen, J., and English, M.: Observed and modelled effects of ice lens formation on passive microwave brightness temperatures over snow covered tundra, Remote Sens. Environ., 114, 116–126, https://doi.org/10.1016/j.rse.2009.08.013, 2010.

Rittger, K., Krock, M., Kleiber, W., Bair, E. H., Brodzik, M. J., Stephenson, T. R., Rajagopalan, B., Bormann, K. J., and Painter, T. H.: Multi-sensor fusion using random forests for daily fractional snow cover at 30 m, Remote Sens. Environ., 264, 112608, https://doi.org/10.1016/j.rse.2021.112608, 2021.

Roberts-Pierel, B. M., Kirchner, P. B., Kilbride, J. B., and Kennedy, R. E.: Glacier Covered Area for the State of Alaska, 1985–2020, Boulder, Colorado USA. National Snow and Ice Data Center [data set], https://doi.org/10.7265/8ESQ-W553, 2022.

Scholten, R. C., Jandt, R., Miller, E. A., Rogers, B. M., and Veraverbeke, S.: Overwintering fires in boreal forests, Nature, 593, 399–404, https://doi.org/10.1038/s41586-021-03437-y, 2021.

Semmens, K. A. and Ramage, J. M.: Recent changes in spring snowmelt timing in the Yukon River basin detected by passive microwave satellite data, The Cryosphere, 7, 905–916, https://doi.org/10.5194/tc-7-905-2013, 2013.

Semmens, K. A., Ramage, J., Bartsch, A., and Liston, G. E.: Early snowmelt events: detection, distribution, and significance in a major sub-arctic watershed, Environ. Res. Lett., 8, 014020, https://doi.org/10.1088/1748-9326/8/1/014020, 2013.

Swanson, D. K., Sousanes, P. J., and Hill, K.: Increased mean annual temperatures in 2014–2019 indicate permafrost thaw in Alaskan national parks, Arct. Antarct. Alp. Res., 53, 1–19, https://doi.org/10.1080/15230430.2020.1859435, 2021.

Tadono, T., Ishida, H., Oda, F., Naito, S., Minakawa, K., and Iwamoto, H.: Precise Global DEM Generation by ALOS PRISM, ISPRS Ann. Photogramm. Remote Sens. Spatial Inf. Sci., II–4, 71–76, https://doi.org/10.5194/isprsannals-II-4-71-2014, 2014.

Tedesco, M. and Jeyaratnam, J.: A New Operational Snow Retrieval Algorithm Applied to Historical AMSR-E Brightness Temperatures, Remote Sens., 8, 1037, https://doi.org/10.3390/rs8121037, 2016.

Tedesco, M. and Miller, J.: Observations and statistical analysis of combined active–passive microwave space-borne data and snow depth at large spatial scales, Remote Sens. Environ., 111, 382–397, https://doi.org/10.1016/j.rse.2007.04.019, 2007.

Tedesco, M., Pulliainen, J., Takala, M., Hallikainen, M., and Pampaloni, P.: Artificial neural network-based techniques for the retrieval of SWE and snow depth from SSM/I data, Remote Sens. Environ., 90, 76–85, https://doi.org/10.1016/j.rse.2003.12.002, 2004.

Tedesco, M., Mote, T., Steffen, K., Hall, D. K., and Abdalati, W.: Remote sensing of melting snow and ice, in: Remote Sensing of the Cryosphere, edited by: Tedesco, M., Wiley, 99–122, https://doi.org/10.1002/9781118368909.ch6, 2015.

Thornton, P. E., Shrestha, R., Thornton, M., Kao, S.-C., Wei, Y., and Wilson, B. E.: Gridded daily weather data for North America with comprehensive uncertainty quantification, Sci. Data, 8, 190, https://doi.org/10.1038/s41597-021-00973-0, 2021.

Trujillo, E., Molotch, N. P., Goulden, M. L., Kelly, A. E., and Bales, R. C.: Elevation-dependent influence of snow accumulation on forest greening, Nat. Geosci., 5, 705–709, https://doi.org/10.1038/ngeo1571, 2012.

Tsai, Y.-L., Dietz, A., Oppelt, N., and Kuenzer, C.: Wet and Dry Snow Detection Using Sentinel-1 SAR Data for Mountainous Areas with a Machine Learning Technique, Remote Sens., 11, 895, https://doi.org/10.3390/rs11080895, 2019.

Walsh, J. E., Bieniek, P. A., Brettschneider, B., Euskirchen, E. S., Lader, R., and Thoman, R. L.: The Exceptionally Warm Winter of 2015/16 in Alaska, J. Climate, 30, 2069–2088, https://doi.org/10.1175/JCLI-D-16-0473.1, 2017.

Wang, L., Derksen, C., Brown, R., and Markus, T.: Recent changes in pan-Arctic melt onset from satellite passive microwave measurements, Geophys. Res. Lett., 40, 522–528, https://doi.org/10.1002/grl.50098, 2013.

Wang, L., Toose, P., Brown, R., and Derksen, C.: Frequency and distribution of winter melt events from passive microwave satellite data in the pan-Arctic, 1988–2013, The Cryosphere, 10, 2589–2602, https://doi.org/10.5194/tc-10-2589-2016, 2016.

Xiao, X., Liang, S., He, T., Wu, D., Pei, C., and Gong, J.: Estimating fractional snow cover from passive microwave brightness temperature data using MODIS snow cover product over North America, The Cryosphere, 15, 835–861, https://doi.org/10.5194/tc-15-835-2021, 2021.

Xiong, C., Yang, J., Pan, J., Lei, Y., and Shi, J.: Mountain Snow Depth Retrieval From Optical and Passive Microwave Remote Sensing Using Machine Learning, IEEE Geosci. Remote Sens. Lett., 19, 1–5, https://doi.org/10.1109/LGRS.2022.3226204, 2022.

Articles

Short summary

This study examines 35 years of snow cover changes in Alaska’s Yukon River Basin using machine learning to track snowmelt timing and disappearance. Results show snow is melting earlier and disappearing faster due to rising temperatures, highlighting the effects of climate change on water resources, ecosystems, and communities. The findings improve understanding of snow dynamics and provide critical insights for addressing climate-driven challenges in the region.