Articles | Volume 20, issue 5
https://doi.org/10.5194/tc-20-3131-2026
https://doi.org/10.5194/tc-20-3131-2026
Research article
 | 
29 May 2026
Research article |  | 29 May 2026

PIXAL: a physics-inspired explainable machine learning architecture for Greenland ice albedo modeling

Raf Antwerpen, Marco Tedesco, Pierre Gentine, Willem Jan van de Berg, and Xavier Fettweis
Abstract

The Greenland ice sheet (GrIS) is a major contributor to global sea level rise. A significant portion of the GrIS' contribution can be attributed to increased ice surface melting, which is strongly controlled by ice albedo, a property that regulates the amount of absorbed solar radiation that leads to surface melting. Yet, we lack a comprehensive understanding of the complex and nonlinear relationships ice albedo has with its environment and is, therefore, often simplified or crudely parameterized in climate models. However, an accurate representation of future ice albedo evolution is essential for reducing uncertainties in sea level rise projections. This study presents PIXAL, a physics-inspired explainable machine learning architecture that significantly outperforms the Modèle Atmosphérique Régional (MAR), a state-of-the-art regional climate model, in modeling ice albedo on the southwestern GrIS. PIXAL is based on an Extreme Gradient Boosting (XGBoost) model and is trained on a suite of modeled topographic, atmospheric, radiative, and glaciologic variables from MAR to capture the complex and nonlinear relationships with ice albedo observations obtained from the Moderate Resolution Imaging Spectroradiometer (MODIS). Performance metrics show that PIXAL achieves an R2 of 0.563, a structural similarity index measure (SSIM) of 0.620, a mean squared error (MSE) of 0.005, and a mean absolute percentage error (MAPE) of 14.699 %, compared to MAR's R2 of 0.062, SSIM of 0.112, MSE of 0.032, and MAPE of 46.202 %. Explainable artificial intelligence analysis reveals that topographic features, specifically ice sheet surface height and slope, are important drivers of ice albedo variability due to their relationships with ice exposure duration and the effectiveness in accumulating meltwater and light-absorbing constituents (LACs) on flat ice surfaces. Near-surface air temperature and runoff further significantly impact ice albedo variability by affecting the ice metamorphic state and accumulation of meltwater and LACs. These findings highlight that understanding the complex physical processes underlying ice albedo variability is essential for accurate climate modeling and sea level rise predictions. PIXAL represents a crucial advancement in ice albedo modeling and paves the way for improved representation of ice sheets in Earth system models.

Share
1 Introduction

A significant portion of the acceleration of the global mean sea level rise over the last few decades has been attributed to increased surface melting from the Greenland ice sheet (GrIS) (Aschwanden et al., 2019). Climate models project a contribution to global mean sea level rise of 9 to 18 cm from the GrIS by 2100 for the Shared Socioeconomic Pathway SSP5-8.5 (Fox-Kemper et al., 2021; Riahi et al., 2017). The large uncertainty in this projection contributes to the hindrance to accurate and effective mitigation of the implications of sea level change on coastal communities. A large portion of the uncertainty stems from an incomplete understanding of the physical processes that control ice surface melting on the GrIS and their linkages to atmospheric and oceanic processes (van den Broeke et al., 2017). Specifically, we lack an understanding of the spatiotemporal variability of ice albedo, which plays a crucial role in modulating ice surface melt processes (Antwerpen et al., 2022).

Ice is exposed on the GrIS during the summer melting season (June–August) when increased insolation and increased atmospheric temperature induce melting of the winter snowpack overlying the ice. Snow melt generally occurs in the ablation zone at lower elevations near the ice sheet margin. Here, the surface mass balance is negative, with the ablation of snow and ice (melting, evaporation, and sublimation) being larger than the accumulation (snowfall, rainfall, and refreezing), resulting in a net surface mass loss and potential exposure of the bare ice surface. Ice melt is strongly controlled by the broadband ice surface albedo, which represents the ratio of reflected (upward) solar radiation flux to incoming (downward) solar radiation flux, weighted by wavelength (visible to near-infrared). For ice, the broadband albedo ranges from ∼0.1 to ∼0.7 (Klein and Stroeve, 2002; Liang, 2001; Tedstone et al., 2020; Warren and Wiscombe, 1980; Wiscombe and Warren, 1980) while the typical albedo of snow is ∼0.7–0.8. The incoming solar radiation that is not reflected by the ice is, instead, absorbed. Because of its lower albedo, a considerably higher amount of incoming solar radiation can be absorbed by ice than by snow. The absorbed radiation heats the surface and shallow subsurface ice and snow layers and induces melting. Meltwater from snow and ice has a low albedo of ∼0.1 and when mixed with ice and snow therefore further decreases the albedo, promoting additional melting. This is referred to as the meltwater-albedo feedback (Stroeve, 2001). Under future, warmer atmospheric conditions, snowmelt over the GrIS is expected to increase (Yue et al., 2021). Therefore, the snowline is expected to retreat earlier and further inland, increasing the low-albedo ice areas and accelerating surface melting and sea level rise (Ryan et al., 2019).

Ice albedo on the GrIS is a highly complex property that is controlled by many factors. The metamorphic state of the surface and shallow subsurface of the ice, determined by ice grain size, density, porosity, specific surface area, and the size and shape of englacial air bubbles, plays a key role in the scattering, absorption, and reflectance of incoming solar radiation (Flanner and Zender, 2006). Moreover, meltwater and rainwater can pond on the ice surface and infiltrate shallow subsurface cavities in the ice, promoting absorption of incoming solar radiation during low cloud-cover days (Tedstone et al., 2020). Solar radiation heats the meltwater and promotes ice melting at the water-ice interface, increasing the presence of meltwater and further darkening the ice sheet. The presence of light-absorbing constituents (LACs), including black carbon, mineral dust, and algae can significantly lower ice albedo (Goelles and Bøggild, 2017; Hofer et al., 2017; MacGregor et al., 2020; McCutcheon et al., 2021; Williamson et al., 2020). A considerable fraction of LACs that affect GrIS ice albedo are aerosols from distant and local sources (Flanner et al., 2021; Goelles and Bøggild, 2017). Natural sources, including North American, Siberian, and Greenlandic wildfires, emit black carbon particles that have been found on the GrIS (Calì Quaglia et al., 2022; Keegan et al., 2014). Greenlandic wildfires can deposit up to 30 % of their emissions onto the ice sheet (Evangeliou et al., 2019). Asian and African deserts (Újvári et al., 2022), Icelandic volcanoes (Meinander et al., 2016; Moroni et al., 2018), and Greenlandic ice-free areas (Amino et al., 2021; Nagatsuka et al., 2021) also release dust particles that have been found on the ice sheet. Further, anthropogenic sources, including transportation, industrial, and residential, emit aerosols (Bond et al., 2013), which can be transported across large distances by atmospheric circulations and deposited on the GrIS (Khan et al., 2023; Thomas et al., 2017; Ward et al., 2018). LACs accumulate on the GrIS surface throughout the year and as the snowpack melts during the melting season, LACs can be left behind on the ice and cause darkening while the melted snowpack is flushed out. Additionally, dust accumulated on the GrIS during the last ∼15 000 years is melted at the ice surface in the ablation zone, increasing the dust concentration (MacGregor et al., 2020; Wientjes et al., 2012). Moreover, mineral dust provides nutrients for ice algae (McCutcheon et al., 2021). Algal blooms cause a considerable lowering of ice albedo and accelerate surface melting (Cook et al., 2020; Stibal et al., 2017; Wang et al., 2020; Williamson et al., 2020). In areas with heterogeneous ice surfaces, ice roughness and crevasses are also known to significantly influence ice albedo (Cathles et al., 2011). Lastly, a broad swath of environmental and radiative conditions, including temperature, atmospheric composition and circulation, and solar zenith angle, affect the metamorphic state of the ice and the accumulation of LACs and thus play an essential role in controlling ice albedo (Flanner et al., 2021; Hofer et al., 2017; Tedesco et al., 2016).

The development of a comprehensive and predictive ice albedo model is hindered by a lack of understanding of the drivers of ice albedo, which can lead to an underestimation of surface melting and thus sea level rise (Antwerpen et al., 2022). Typically, Earth system models (ESMs) prescribe constant and uniform values for ice albedo in the visible and near-infrared wavelength regions (van Kampenhout et al., 2020). Recent ice albedo modeling efforts show improved capabilities in representing ice albedo in ESMs and regional climate models (RCMs). For example, through recent improvements, the Snow, Ice, and Aerosol Radiative model (SNICAR), a multi-layer heterogeneous snow albedo radiative transfer model, can now account for the influence of LACs on snow and ice albedo (Flanner et al., 2021; Whicker et al., 2022). SNICAR is currently used in the Energy Exascale Earth System Model (E3SM) and has improved ice albedo and surface melt estimates (Whicker-Clarke et al., 2024). However, due to limitations in quantifying concentrations of individual LACs on the GrIS, the use of SNICAR poses limitations in predicting ice albedo beyond the observational period. The RCM Regional Atmospheric Climate Model (RACMO) approaches this limitation by estimating ice albedo on the GrIS with the 2000–2015 mean broadband ice albedo observations from the Moderate Resolution Imaging Spectroradiometer (MODIS) (van Dalum et al., 2020). While these estimates provide a high ice albedo accuracy during the observation and evaluation period, future changes in environmental and surface conditions on the GrIS may decrease the accuracy of the albedo estimates. Therefore, RACMO may not necessarily employ an accurate representation of ice albedo for end-of-century surface melt projections. The RCM Modèle Atmosphérique Régional (MAR) prescribes ice albedo as a function of accumulated runoff and ice sheet slope, which leads to an overestimation of ice albedo and potential underestimation of meltwater production (Antwerpen et al., 2022). While this model configuration can account for some future changes in environmental and surface conditions, it does not incorporate essential dependencies of ice albedo to environmental variables and, therefore, does not accurately capture the physical processes that underlie ice albedo variability.

These considerable efforts played important roles in advancing our understanding of ice albedo modeling. Yet, a comprehensive, accurate, and predictive ice albedo model has not yet been developed. Here, we present PIXAL, a Physics-Inspired eXplainable machine learning architecture for ice ALbedo modeling. PIXAL is based on an eXtreme Gradient Boosting (XGBoost) model (Chen and Guestrin, 2016) and accurately models and predicts ice albedo on the southwestern GrIS. We extract essential information from the suite of environmental variables modeled by MAR that have previously not been used for ice albedo modeling and train PIXAL to capture the complex and nonlinear relationships between the environment modeled by MAR and the MODIS-derived ice albedo observations. Additionally, we elucidate important environmental drivers of ice albedo by employing SHapley Additive exPlanations (SHAP) (Lundberg and Lee, 2017), an explainable artificial intelligence method. Through this work, we address limitations of ESMs and RCMs in modeling ice albedo on the southwestern GrIS. This includes the dark ice zone, where LACs have a dominant role in controlling ice albedo (Ryan et al., 2019). However, we show that ice albedo modeling improvements can be made even without directly accounting for LACs. Moreover, an improved understanding of the environmental and physical processes underlying ice albedo variability is vital for robust model developments and ice albedo predictions that are stable against uncertain changes in future environmental conditions.

2 Data

2.1 MAR

We use the Modèle Atmosphérique Régional (MAR) v3.12, an RCM developed to simulate the coupled surface-atmosphere system over polar regions (Fettweis et al., 2017; Gallée, 1997; Lefebre et al., 2003; Ridder and Schayes, 1997). We run MAR over the GrIS and force the lateral boundaries and ocean surface with 6 h ERA5 reanalysis output (Hersbach et al., 2020), from the European Centre for Medium-Range Weather Forecasts (ECMWF). The atmosphere component of MAR is described by (Gallée and Schayes, 1994) and the surface component is represented by the Soil Ice Snow Vegetation Atmosphere Transfer (SISVAT) scheme (Ridder and Schayes, 1997). The SISVAT scheme includes the Crocus snow model (Brun et al., 1992), which simulates a predefined number of snow, ice, or firn layers with variable thickness and allows energy and mass transport between each layer. Ice albedo (α) is calculated as a function of the model-predicted runoff from melt and rainwater accumulated over the preceding day as:

(1) α = 0.5 + 0.05 1 e runoff 50 ,

In MAR, the ice albedo varies exponentially between a maximum of 0.55 when no surface water is present on the ice surface and a minimum of 0.5 when large amounts of runoff (≫50 mmWE) are present (Zuo and Oerlemans, 1996). The cumulative runoff is negatively corrected for the ice slope, with steeper ice slopes holding less runoff. MARv3.5.2 is validated over the GrIS (Fettweis et al., 2017) with updates to MARv.311 (Fettweis et al., 2021). Updates to MARv3.12 regarding the base of the snowpack temperature and rainfall to snowfall conversion are provided in Antwerpen et al. (2022).

We use MAR to produce daily output of topographic, atmospheric, radiative, and glaciologic variables at its native spatial resolution of 6.5 km: albedo (–), near-surface air temperature (°C; average height is 2 m), runoff of melt and rain water (mmWE d−1), shortwave upward radiation (W m−2), shortwave downward radiation (W m−2), longwave upward radiation (W m−2), longwave downward radiation (W m−2), sensible heat flux (W m−2), latent heat flux (W m−2), cloud cover (down) (–), cloud cover (middle) (–), cloud cover (up) (–), cloud optical depth (–), average ice density of the top 1 m (kg m−3), zonal wind (m s−1), meridional wind (m s−1), sublimation (mmWE d−1), average liquid water content of the top 1 m (kg kg−1), snowfall (mmWE d−1), rainfall (mmWE d−1), surface height (m), surface slope (degrees), surface aspect (azimuth degrees). While surface melt is available as an output variable of MAR, we did not include this variable in this list because it strongly correlates with runoff of melt and rainwater. We chose to include runoff over surface melt because we want to capture the potential redistribution processes of LACs through runoff and the consequent impacts on ice albedo.

We select a subset of the MAR output that covers the exposed ice in southwest GrIS during June, July, and August (JJA) in 2000–2021. This period encompasses the GrIS melt season when surface albedo has the largest impact on surface melting (Alexander et al., 2014). Following Antwerpen et al. (2022), we distinguish exposed bare ice from snow in MAR as cells where (1) snow is absent and (2) the average ice density of the top 1 m is higher than 907 kg m−3. While ice in MAR has a density of 920 kg m−3, a thin layer of fresh snowfall with a density of 300 kg m−3 can lower the average density of the top 1 m to slightly below that of ice without significantly affecting its albedo (Warren et al., 2006). Moreover, using the average ice density ensures ice lenses are not erroneously detected as bare ice. We further constrain ice in MAR to be located below the long-term average equilibrium line altitude (ELA) of 1679 m a.s.l., which represents the 95th percentile value of all sorted elevation values with a negative SMB during JJA of 2000–2021, which denotes the ablation zone.

2.2 MODIS

We collect daily MOD10A1 broadband albedo images (Hall et al., 2016) over the GrIS from the Moderate Resolution Imaging Spectroradiometer (MODIS) for JJA in 2000–2021 at 500 m spatial resolution with Google Earth Engine (Gorelick et al., 2017). We also collect daily MOD09GA v6 band 2 (841–876 nm) surface reflectance images (Vermote and Wolfe, 2015). This product has been corrected for atmospheric conditions, including aerosols, gasses, and Rayleigh scattering. We remove cloud-obstructed pixels using daily MOD10A1 v6 cloud mask images. We average and co-locate the MODIS data to the MAR projection and resolution to allow for a pixel-by-pixel analysis.

We distinguish exposed bare ice from snow in the MODIS imagery by applying an upper threshold of 0.6 to band 2 of the MOD09GA product (Shimada et al., 2016). We further constrain ice exposure below the long-term average ELA of 1679 m a.s.l. using the static ice mask and digital elevation model (DEM) from the Greenland Ice Mapping Project (GIMP) (Howat et al., 2014). While this may yield a conservative estimate of the ELA during warm high-melt years, we ensure no anomalously high-elevation ablation or cloudy cells erroneously detected as ice are included. The upper threshold of 0.6 may cause some firn to be erroneously detected as ice. Moreover, a thin layer of fresh snowfall over ice may not result in a reflectance value over 0.6 in band 2 of the MOD09GA product and will, therefore, be identified as ice. However, the broadband albedo may increase due to the thin snow layer. Further, low-albedo outcrops, cloud shadows, and meltwater ponding may decrease the apparent ice albedo within one pixel.

3 Methods

First, we use two linear regression approaches to show baseline improvements to the ice albedo originally modeled by MAR. Next, we use XGBoost to develop PIXAL, an optimized ice albedo model, and SHAP to elucidate the drivers of ice albedo. The methods are described in more detail in Sect. 3.1 to 3.3. We train the ice albedo models on cloud-free MODIS-derived ice albedo observations since MODIS is not able to detect ice albedo conditions through clouds. For a fair comparison, we limit our analysis to the co-located data points where cloud-free ice conditions are simultaneously modeled by MAR and observed by MODIS. We evaluate the performance of the ice albedo models through a comparison with the test data set (the last 2 years of the 2000–2021 period) of MODIS-derived ice albedo observations. We calculate the coefficient of determination (R2) to measure how well the models predict ice albedo compared to the MODIS observations. We also determine the mean squared error (MSE) and mean absolute percentage error (MAPE) to measure the amount of error in the ice albedo models. The MAPE provides a relative measure in percentages of the error of a prediction (de Myttenaere et al., 2016). Lastly, we calculate the structural similarity index measure (SSIM). The SSIM is a performance metric from the field of computer vision developed to determine the similarity between two images (Wang et al., 2004). The SSIM value we report in Sect. 4 is the mean of the SSIM values between the daily images of the modeled and predicted ice albedo values from the test data set. The SSIM ranges between −1 and 1, where 1 denotes a perfect similarity, −1 denotes perfect anti-similarity, and 0 indicates no similarity between the two images. While we show in Sect. 4 that the XGBoost model shows optimal performance in modeling ice albedo, we also tested a random forest (RF) and a high-performance symbolic regression (PySR), i.e. equation discovery. The RF is described in Sect. 3.2. The PySR is a supervised ML model that aims to find an interpretable symbolic expression that optimizes the simulation of a target variable (Cranmer, 2023). PySR uses a multi-population evolutionary algorithm, consisting of a unique evolve-simplify-optimize loop designed to optimize unknown scalar constants in new empirical expressions. The configurations for the RF and PySR are described in Appendix A.

3.1 Linear regression

The first baseline ice albedo model consists of updating the slope (0.05) and intercept (0.5) coefficients used in MAR by training the original ice albedo equation (Eq. 1) on the MODIS-derived ice albedo using linear regression. For the second baseline ice albedo model, we create a linear regression of the form:

(2) α ( x 1 , , x n ) = β 0 + β 1 x 1 + + β n x n ,

where x1,,xn denote the MAR-based variables (or features) and β0,,βn denote the coefficients. Again, we train the linear regression on the MODIS-derived ice albedo. We use the seven MAR features that have the most impact on ice albedo: near-surface temperature, runoff, shortwave downward radiation, meridional wind, surface height, surface slope, surface aspect. The features are determined from the XGBoost and SHapley Additive exPlanations (SHAP) analyses, described in Sects. 3.2, 3.3, and 5.

3.2 XGBoost

Classic linear regression tools generally do not fully capture complex relationships between real-world properties, which are often dynamic and nonlinear. We, therefore, employ the machine learning method XGBoost (Chen and Guestrin, 2016) to learn the nonlinear relationships between observed ice albedo and the environmental drivers of ice albedo modeled by MAR. XGBoost is an extension of the basic decision tree, a learning algorithm for classification and regression tasks. Decision trees are supervised models that predict a target variable through simple threshold decisions on every variable in the input dataset. Threshold decisions are made at each tree node, splitting the input data and the prediction of the target variable into two branches that each connect to a threshold decision at the next node. Through each sequence of nodes and branches, a target prediction can be made from a new set of input data. However, individual decision trees do not generalize to data well and are prone to overfitting. Averaging the target predictions of an ensemble of trees, such as in an RF architecture, can mitigate this instability.

XGBoost is a scalable tree-boosting algorithm that uses a gradient descent algorithm to build an ensemble of parallel decision trees based on subsets of the dataset (Chen and Guestrin, 2016). To minimize the prediction error, each decision tree in the ensemble is built iteratively using targeted outcomes based on the gradient of the previous prediction error residuals. The final prediction is the weighted average of the individual trees. XGBoost has seen successful applications in Earth and climate-related studies in prediction (Fan et al., 2018; Huang et al., 2021; Ibrahem Ahmed Osman et al., 2021; Ma et al., 2020), image classification (Colkesen and Ozturk, 2022; Nkiruka et al., 2021), reconstruction of remote sensing data gaps (Tan et al., 2021), and risk assessment (Ma et al., 2021).

The predictor dataset consists of the MAR features (Sect. 2.1). We standardize the data to ensure no feature bias is present due to different feature value ranges. We exclude albedo (AL2), cloud cover (CD, CM, and CU), and cloud optical depth (COD) as input features because we include only cloud-free data points in both MAR and MODIS. To ensure snow, meltwater ponding, cloud shadows, and outcrops erroneously identified as ice are not included in training the XGBoost, we constrain the predictand albedo values from MODIS to be within the 2σ standard deviation range (0.165–0.671). We apply an 80–10–10 training-validation-testing split on our data stack, consisting of 5 384 250 data points spread over 22 years with 92 d in each year and with 14 environmental features from MAR (described in Sect. 2.1). The test data set consists of the last two years (2020 and 2021) of the data to avoid data leakage in the first years. We construct a regression tree ensemble with a Pseudo-Huber loss function, which is less sensitive to outliers than the commonly used squared error loss (Huber, 1964). We use an exact greedy tree construction algorithm for split finding to minimize the loss and a gbtree booster, which, each iteration, builds the next tree and gives higher weights to misclassified points in the previous tree. We perform the hyper-parameter search using Optuna (Akiba et al., 2019) and find the XGBoost configuration that yields optimal performance using an MSE evaluation against the validation dataset. The configuration includes a maximum tree depth (21), learning rate (0.07), number of boosted trees (500), gamma (2.38×10-8), minimum child weight (10), subsample ratio (0.92), column subsample ratio (0.79), alpha (L1 regularization; 0.74), and lambda (L2 regularization; 0.33).

3.3 Shapley additive explanations

Machine learning models and their outcomes often lack interpretability, leading to complexities in their reliability assessment. This challenge is addressed by a set of explainable AI tools and algorithms developed for understanding and interpreting ML models that are regarded as inherently uninterpretable. We use SHAP (Lundberg and Lee, 2017), an explainable AI tool rooted in the field of cooperative game theory, to explain and interpret our XGBoost model output and understand the roles of the environmental properties, or features, in the input dataset in driving variability in ice albedo. The importance value of each feature in the dataset, the SHAP value, is determined by iteratively training the XGBoost model on subsets of features. In each iteration, a feature is systematically added or removed from the training dataset. The SHAP value of each feature is calculated based on the difference in the predicted value between the model variations before and after adding or removing a feature. To fully capture the additive SHAP value of all features, the model is trained on all possible feature subsets and a weighted average of SHAP values for all model variations is determined. Positive SHAP values drive a positive change to the model prediction with respect to the mean prediction and vice versa. Missing values have a zero SHAP value and, therefore, do not affect the model prediction, making SHAP insensitive to data sparsity.

Significant successes have been made with SHAP in explaining machine learning models developed for the Earth and climate studies related to classification (Descals et al., 2023; Temenos et al., 2023), predictions (Batunacun et al., 2021; Dikshit and Pradhan, 2021; Ghafarian et al., 2022; Silva et al., 2022; Tang et al., 2022), and process understanding (Ishfaque et al., 2022). Moreover, SHAP has seen uses in classifying Antarctic sea ice imagery (Koo et al., 2023), interpreting sea-level projections (Rohmer et al., 2022), and studying the freeze-thaw cycle on the Tibetan plateau (Li et al., 2024). However, SHAP applications in the cryosphere sciences are still limited. To our knowledge, this work is the first application of SHAP to ice albedo.

https://tc.copernicus.org/articles/20/3131/2026/tc-20-3131-2026-f01

Figure 1Scatter plot for JJA in 2020–2021 between MODIS-derived ice albedo (x axis) and ice albedo modeled with MAR (a–c; red) and ice albedo modeled with (a) the updated slope and intercept coefficients of the MAR ice albedo equation (Eq. 1) (purple), (b) linear regression with additional features (green), (c) and XGBoost (blue). The dashed line represents the 1:1 line.

Download

4 Results

MAR tends to have low performance in predicting ice albedo when evaluated against MODIS (Fig. 1; reds) with a low coefficient of determination (R2=0.062), mean squared error (MSE = 0.032), mean absolute percentage error (MAPE = 46.202 %), and structural similarity index measure (SSIM = 0.112). The MAR albedo shows little variability with a mean of 0.56±0.04, an overestimation compared to the MODIS-derived ice albedo observations, which have a mean of 0.42±0.11.

4.1 Linear regression

The two baseline linear regression approaches, described in Sect. 3.1, show an improvement over the ice albedo modeled with MAR, with slightly better performance metrics when evaluated against the MODIS-derived ice albedo. The improved slope (0.35) and intercept (0.20) compared to the MAR ice albedo equation (Eq. 1 and Sect. 2.1) result in a factor 2–3 improvement of the R2=0.092, MSE = 0.009 and MAPE = 22.385 % (Fig. 1a; purples). However, a slight reduction is seen for the SSIM = 0.103. While the updated MAR equation provides a mean ice albedo of 0.42±0.03, similar to the mean MODIS-derived ice albedo, it shows insufficient variability and generally underestimates the MODIS-observed ice albedo.

The ice albedo derived from the linear regression with runoff, surface slope, and the additional features mentioned in Sect. 3.1 shows a moderate improvement on the test set with regard to MAR, with R2=0.202, MSE = 0.008, MAPE = 20.309 %, and SSIM = 0.285 (Fig. 1b; greens). Adding additional features beyond the seven most important ones did not achieve better performance. The additional features we tested are: meltwater production (ME, mmWE d−1), longwave downward radiation (LWD, W m−2), surface atmospheric pressure (SP, hPa), ice density (RO1, kg m−3), liquid water content (WA1, kg kg−1), snowfall (SF, mmWE d−1), and rainfall (RF, mmWE d−1). The linear regression-derived ice albedo has a mean of 0.42±0.05 but still generally underestimates the MODIS-derived ice albedo. The spread in the linear regression-derived albedo values is larger than for MAR and the updated MAR equation. However, the large variability seen in the MODIS-derived ice albedo is not achieved with the linear regression.

4.2 XGBoost

The ice albedo modeled with XGBoost shows major improvements on the test set in all performance metrics with R2=0.568, MSE = 0.005, MAPE = 14.646 %, and SSIM = 0.624. The XGBoost-modeled ice albedo has a mean of 0.42±0.08, and exhibits a variability similar to the MODIS-derived ice albedo. XGBoost represents ice albedo values between 0.4 and 0.6 well but slightly overestimates low-albedo values (Fig. 1c; blues).

https://tc.copernicus.org/articles/20/3131/2026/tc-20-3131-2026-f02

Figure 2Mean ice albedo for JJA in 2020–2021 for (a) MODIS, (b) XGBoost, (c) difference MODIS-XGBoost, and (d) MAR.

Figure 2a shows the mean MODIS-derived ice albedo for JJA in 2020–2021. The ice albedo modeled by XGBoost (Fig. 2b) below 66° N shows a similar decreasing pattern in ice albedo from the snowline to the ice sheet margin as observed with MODIS. Above 66° N, the XGBoost-modeled ice albedo exhibits a bimodal distribution, similar to MODIS, with high values near the ice margin and near the snow line. Low albedo values are found in between the margin and snow line, which represent the dark ice zone. Compared to the MODIS-derived ice albedo, the XGBoost slightly underestimates albedo at the snowline near Jakobshavn Glacier at 69–70° N and slightly overestimates albedo in some areas near the ice margin in the central and southern regions (Fig. 2c). Generally, the XGBoost provides a higher spatial ice albedo variability across the study area than MAR (Fig. 2d).

https://tc.copernicus.org/articles/20/3131/2026/tc-20-3131-2026-f03

Figure 3Daily mean ice albedo across the southwestern GrIS for JJA averaged over 2020 and 2021 (test set) for MAR (red), MODIS (green), and XGBoost (blue). The black line indicates the mean number of ice albedo data points per day.

Download

Additionally, XGBoost provides considerable improvements in temporal ice albedo modeling compared to the ice albedo from MAR. Figure 3 shows the ice albedo during JJA averaged over 2020 and 2021. XGBoost (blue line) shows close daily alignment with the MODIS-derived ice albedos (Fig. 3: green line) throughout the melting season. Especially in July and August, when larger ice areas are exposed compared to earlier in the melting season (Fig. 3; black line) (Antwerpen et al., 2022; Noël et al., 2019). In the first half of June, when ice exposure is low, XGBoost slightly underestimates the MODIS-derived ice albedo while still outperforming MAR (Fig. 3; red line). Generally, XGBoost slightly underestimates high-albedo values and slightly overestimates low-albedo values. The peak albedo values (>0.6) from MAR represent data points with fresh snow or firn cover that have been misidentified as ice. Anomalously high albedo values can occur, especially during low ice exposure days, due to a skewed average when only a few data points are available.

https://tc.copernicus.org/articles/20/3131/2026/tc-20-3131-2026-f04

Figure 4Performance metrics of ice albedo models vs MODIS-derived albedo. For visualization purposes, MSE is multiplied by 10 and MAPE is divided by 100. For MSE and MAPE, lower scores represent a better performance. For R2 and SSIM, higher scores represent a better performance.

Download

4.3 Model performance evaluation

The performance metrics of all our ice albedo model configurations evaluated against the MODIS-derived ice albedo observations are shown in Fig. 4. The correlations between the ice albedo modeled with RF and with PySR and the ice albedo observed with MODIS are shown in Figs. A1 and A2. The ML architectures (XGBoost, RF, and PySR) perform considerably better than the non-ML architectures (MAR, updated MAR equation, and linear regression), showcasing the superiority of ML in modeling ice albedo. While the symbolic regression method used in the PySR architecture can provide explicit insights into the developed ice albedo model, it shows a considerably lower performance than the XGBoost. We, therefore, do not use the symbolic regression insights from PySR in this study as they still have low predictive power. The RF architecture shows considerable improvement over MAR. However, the XGBoost model shows optimal performance with the highest R2 and SSIM scores and the lowest MSE and MAPE scores.

4.4 SHAP analysis

Our PIXAL algorithm includes two parts: a predictive model, based on XGBoost, and an explainable AI component to reveal the drivers of ice albedo that can be used to gain insights into potential MAR model improvement. The SHAP values of the seven most important MAR-based features are listed in Fig. 5. The importance of each feature in controlling ice albedo is determined by the magnitude of its impact on the final ice albedo prediction from XGBoost (SHAP value), with respect to the mean ice albedo prediction of 0.42. In other words, a feature's SHAP value represents the extent to which that variable pushes the prediction above or below that mean. The features in Fig. 5 are ordered by the mean absolute SHAP values, which emphasizes the average impact and gives less weight to high-magnitude SHAP values. The topographic features of surface height and slope, as well as temperature, are the key drivers of ice albedo. Radiative and atmospheric features are important additional drivers. The significance and impact of each variable listed in Fig. 5 is described in more detail in Sect. 5.1.

https://tc.copernicus.org/articles/20/3131/2026/tc-20-3131-2026-f05

Figure 5SHAP values for the seven most important MAR features. The SHAP value represents the impact a feature has on the ice albedo prediction, relative to the mean ice albedo prediction. A feature with a positive SHAP value indicates that the feature increases the ice albedo prediction and vice versa. The features are sorted by their mean absolute SHAP values in ascending order. Red indicates high feature values and blue indicates low feature values.

Download

https://tc.copernicus.org/articles/20/3131/2026/tc-20-3131-2026-f06

Figure 6Feature values, SHAP values (impact of feature values on ice albedo), and the feature and SHAP value correlation for surface height (a–c), slope (d–f), near-surface air temperature (g–i), and runoff (j–l). The maps show the average over JJA in 2020–2021. Note the larger range of SHAP values for surface height in (b), compared to the SHAP value range used in (e), (h), and (k).

5 Discussion

5.1 Drivers of ice albedo

Surface height exhibits a nonlinear relationship with its SHAP values (Fig. 6a–c). The SHAP values show the strong impact of surface height on albedo, ranging from −0.05 to 0.1, with the lowest albedo values present at low elevations near the ice margin and high albedo values present at high elevations near the snow line. These findings are in line with previous studies on ice albedo drivers (Feng et al., 2024; Moustafa et al., 2015). As the melting season commences and temperatures rise, snowmelt first occurs at the ice margin, exposing the underlying ice. The snowline then retreats to higher elevations, resulting in lower-elevation ice experiencing longer exposure. Longer ice exposure can result in a lower albedo as it allows for more accumulation of LACs from local and distant sources. Algal blooms are also given more time to grow and spread if uninterrupted by snowfall events. Snowmelt events at higher elevations may add to the LAC concentrations at lower elevations when meltwater is transported toward the ice margin and LACs may remain behind on the ice surface. Higher ice sheet elevations are generally further from local LAC sources, such as moraines, dunes, dry proglacial floodplains, ground transport, and industry, reducing the potential for LAC deposition. Furthermore, englacial Holocene dust mostly melts out at lower elevations due to the flow configuration of the GrIS, adding to the LAC concentration and albedo darkening at low elevations (MacGregor et al., 2020; Wientjes et al., 2012). A large low-albedo region is present at ∼800–1100 m a.s.l. above 66° N, representing the dark ice zone where the highest concentrations of LACs are found (Shimada et al., 2016; Wang et al., 2020). Moreover, at low elevations, older, denser ice is exposed, which has fewer and smaller englacial air bubbles, reducing the albedo.

Conversely, longer ice exposure can cause an increase in albedo. Ice erodes as it is exposed to the environment in strong incoming shortwave radiation conditions, affecting the metamorphic state and creating a porous weathering crust with a low density (Munro, 1990). During conditions with low amounts of meltwater, a weathering crust has many interfaces between the ice and air, allowing for light scattering with a high angle, which increases albedo (Jonsell et al., 2003). The relationship between albedo and surface height will likely change in the future with increasing temperatures, expanding the melting season and causing earlier ice exposure, as well as later snow-covering. Therefore, ice is exposed for longer periods, potentially exacerbating the processes related to ice metamorphism and LAC accumulation, lowering the albedo. These processes will also occur more frequently and for longer periods at higher elevations as the snowline rises with increasing temperature. The impact of these effects on the performance of PIXAL is further discussed in Sect. 5.2.

The surface slope and its SHAP values have a linear and positive relationship, with SHAP values ranging between −0.05 and 0.05 (Fig. 6d–f). A flat ice surface, typically found at higher elevations and the dark ice zone, correlates to low albedo values. Higher albedo values are found on steeper ice surface slopes at the ice margin. This relationship likely represents the potential for meltwater and LACs to accumulate more efficiently on flat ice surfaces (Wientjes and Oerlemans, 2010; Zuo and Oerlemans, 1996). The positive relationship between surface slope and albedo is at odds with the results from Feng et al. (2024), who find that darker ice is found on steeper ice slopes. However, this relationship mostly holds for the southeastern GrIS while the link between slope and albedo is less strong for the southwestern GrIS. Note that the MODIS-derived albedo may be affected by high solar zenith angles (SZA) >55° (Wang and Zender, 2010; Alexander et al., 2014). This may introduce a negative albedo bias, especially in areas with a high ice surface slope.

While surface aspect is shown as an important driver of ice albedo in Fig. 5, this is due to the large impact of a few extreme feature values on ice albedo (Fig. A3). The common west-facing aspect angle (250–300°) does not affect ice albedo much on the southwestern GrIS (Fig. A3c). Some of the southwest-facing ice surfaces (300–360°) show a lower albedo, potentially due to increased algal bloom activity in response to increased solar radiation. However, we find no significant relationship between shortwave downward radiation and ice albedo or aspect (Fig. A4). Additionally, the 6.5 km resolution is likely not sufficient to represent the spatial variability of the aspect and its effects on ice albedo.

The topographic features in MAR do not change with time. Potential changes in the relationships between topographic features and ice albedo may, therefore, not be fully captured by MAR. Nonetheless, during the early Holocene (∼12–7 ka), the most recent period with warmer-than-present temperatures in Greenland (Badgeley et al., 2020), maximum ice margin retreat rates of the southwestern GrIS were ∼35 m yr−1 (Briner et al., 2020). Assuming similar present-day retreat rates, the estimate of ice margin retreat at the end of this century would be ∼2660 m, which only constitutes ∼40 % of one 6.5 km grid cell in MAR. We, therefore, assume that the physical configuration of the GrIS will not change significantly by 2100 and that the static topographic features in MAR are sufficiently accurate for the purposes of this study.

There is a strong negative linear relationship (−0.009 °C−1; R2=0.884) between near-surface air temperature and its SHAP values (Fig. 6g–i). High temperatures are generally found at the ice sheet margin, while low temperatures are found at higher elevations. Temperatures below 0 °C can cause refreezing of meltwater in the shallow subsurface ice layers and superficial meltwater ponds and streams, increasing the albedo. Conversely, temperatures above 0 °C can cause thin, freshly fallen snow layers to melt and expose the lower-albedo ice underneath. High temperatures generally promote biological growth of algal blooms (Uetake et al., 2010), decreasing the albedo. Moreover, high temperatures are mostly found near the ice margin, where there is closer proximity to local LAC sources and, thus, a higher likelihood of LAC deposition, including bioavailable nutrients, which further promotes algae growth.

Runoff has a strong negative near-linear relationship with ice albedo, for SHAP values for runoff below ∼25 mmWE d−1, which can be partly explained by the dependence of runoff on near-surface temperature. In this range, runoff promotes ice albedo decrease through the accumulation of meltwater in ponds and streams and by filling up shallow microcavities in the upper ice layers. Runoff and decreasing ice albedo are further enhanced by the positive meltwater-albedo feedback. At runoff values of ∼25 mmWE d−1, maximum albedo reduction is achieved and almost no further ice albedo decrease is observed (Fig. 6l). The cutoff of 25 mmWE d−1 may signify a saturated ice surface and sub-surface where no more meltwater can be retained, causing any additional meltwater to run off to lower elevations. Moreover, increased runoff and meltwater production on the upper ice layers can cause increased melt-out of englacial Holocene dust and the development of cryoconite holes, further lowering the albedo.

The additional features listed in Fig. 5 are shown as important features because they are ordered by their absolute mean SHAP values, which gives more weight to the average impact of the feature and de-emphasizes high-impact SHAP values. However, these features do not show a high impact on ice albedo and are, therefore, not discussed here.

5.2 Limitations

Some MAR features can be biased due to potential dependencies on the inaccurately modeled ice albedo in MAR. This bias could be propagated to the PIXAL output. A solution to reduce this bias would be an iterative approach of embedding PIXAL in MAR, rerunning MAR, and updating PIXAL with the updated MAR output. This process can be repeated until a desired accuracy and error reduction is achieved. While this iterate-update approach is likely required before permanent embedding of PIXAL in MAR or other models can be considered, this is unfortunately outside of the scope of this study. We therefore cannot confidently comment on the impact of this approach, nor on the number of iterations required to reach a desired accuracy. We look forward to applying this method in a future publication.

Our results reveal relationships between ice albedo and environmental ice sheet conditions. Several of these drivers may account for some processes related to LAC-driven ice sheet darkening. However, PIXAL is trained on the available topographic, atmospheric, radiative, and glaciologic features in MAR and can, thus, only learn physical processes that can be inferred from these features. Additionally, other physical processes may not be fully captured by PIXAL, including the biological activity of algae, deposition of black carbon, dust, and ash released by deserts, local dried-up flood plains, forest fires, volcanic eruptions, anthropogenic emissions, and the melt-out of englacial Holocene dust. Positive feedback systems related to LACs, which could accelerate ice albedo darkening, may, therefore, also be missed. Some datasets are available that could represent these LAC-driven darkening processes, e.g. annual algal bloom data (Wang et al., 2020). However, this data is currently not available with the same spatiotemporal range and resolution as the MAR and MODIS datasets used in this study. Initial experiments showed that this 2-year annual algal bloom dataset created uneven input data availability resulting in biases and unwanted artefacts in the XGBoost output. We therefore omitted this data from our input data collection. We are unaware of other, more comprehensive datasets that are able to add sufficient information to improve the performance of the XGBoost model. Beyond adding direct observational inputs, alternative approaches to representing LAC-driven darkening processes could include proxy variables that co-vary with LAC concentration, or explicitly accounting for the contribution of unknown processes through uncertainty quantification. However, identifying reliable proxies at the required spatiotemporal resolution is an open challenge. We consider both directions valuable for future development of PIXAL.

Moreover, we use modeled estimates of the environmental features which poses limitations to the trustworthiness of the derived results. While MAR is a state-of-the-art regional climate model that has been validated over the GrIS (Fettweis et al., 2017), it still exhibits some biases, especially on albedo. Similarly, while MODIS has outstanding capabilities in measuring albedo, it has limitations related to atmospheric corrections, a fundamental reflectance data processing step. To account for radiance, reflectance, and transmittance effects due to atmospheric aerosol loading, a radiative transfer model is applied (Vermote et al., 2002). However, a per-pixel analysis of MODIS observations is infeasible due to computational costs (Vermote et al., 1997). Therefore, a simplified approach is applied using a look-up table for different aerosol loadings and sun-view geometries, potentially leading to inaccurate corrections. Further, NASA's Terra 10:30 a.m. local overpass time could incur a bias towards higher albedo measurements because the darkening effect of meltwater production will occur mostly in strong incoming radiation and high-temperature conditions during the afternoon. Moreover, some spatial variability is present in the performance of XGBoost compared to MODIS (Fig. 2c). This discrepancy may be partly due to spatial variability in the number of valid MODIS data points, which can lead to a bias in the mean observed ice albedo in areas with fewer valid data points (Fig. 2a). Lastly, MODIS exhibits a positive albedo bias above 70° N due to sun and satellite viewing angles at high latitudes (Alexander et al., 2014). The largest differences between XGBoost and MODIS albedo are found near this latitude, potentially explaining the XGBoost-MODIS discrepancy as a positive bias from MODIS rather than an underestimate from XGBoost.The relationships we find between the MAR features and the MODIS-derived ice albedo might, therefore, in part arise from model imperfections and measurement biases.

While we train PIXAL only on the southwestern GrIS ice albedo, many physical processes are generalizable and hold in other glaciated areas, e.g., the relationship between temperature, ice melting, and ice albedo change. However, some processes may be specific to the southwestern GrIS, such as the dependency of ice albedo on surface height, slope, and aspect. Moreover, processes relating to LAC deposition are likely specific to the southwestern GrIS as these are dependent on the LAC source location, atmospheric circulation patterns for transport, and topographical characteristics of the ice.

PIXAL is intended for both hindcasting and forecasting purposes. Because of the unknown effects of climate change on the environmental conditions on and near Greenland, a changing distribution of environmental feature values is expected for the rest of the century, e.g. more frequent high temperatures. Out-of-distribution values for input features generally pose a problem for the stability of tree-based models. However, the current spread of values in our large dataset (∼6σ) accounts for most of the expected extreme values. While these extreme values may become more common toward the end of the century, PIXAL is trained on and familiar with these values. We, therefore, do not expect significant issues relating to out-of-distribution values in forecasts. Except for surface height, with ice exposure likely occurring at higher and unprecedented elevations with continued atmospheric warming during the rest of the century.

Beyond these distributional considerations, we make two architectural trade-offs by opting for XGBoost as the basis of PIXAL. First, neural networks can in principle generalize outside of the training distribution, whereas tree-based models will assign out-of-distribution inputs the same prediction that is associated with the nearest boundary of the training data, effectively limiting extrapolation. However, as described above, the ∼6σ spread of our training dataset partially mitigates this limitation. Moreover, preliminary experiments showed a lower performance for NNs than tree-based approaches in modeling ice albedo. Second, tree-based models treat each grid cell as an independent data point and do not encode spatial relationships between neighboring pixels. This may limit their ability to capture spatial gradients such as those in the dark ice zone. Convolutional neural networks (CNNs) are theoretically better suited to leverage the information surrounding each pixel due to the spatially-aware nature of their convolutional kernels. However, our preliminary tests showed a superior performance of tree-based models over a CNN and a CNN-long short-term memory (CNN-LSTM) model in modeling ice albedo. Lastly, while SHAP can be applied to NNs and CNNs, these implementations rely on approximations of the SHAP values, whereas the SHAP implementation for tree-based models computes the exact SHAP values. Given the importance of model explainability to identify drivers of ice albedo variability, we opted for XGBoost as the basis for PIXAL.

6 Conclusions

We explored a suite of linear and nonlinear regression architectures to develop PIXAL and improve ice albedo estimates on the southwestern Greenland ice sheet. Our findings highlight XGBoost as the best-performing architecture, which outperforms a state-of-the-art regional climate model, MAR, in modeling ice albedo. The performance metrics as evaluated against the MODIS-derived ice albedo observations improved substantially compared to MAR with an increased R2 from 0.062 to 0.563, an increased SSIM from 0.112 to 0.620, a decreased MSE from 0.032 to 0.005, and a decreased MAPE from 46.202 % to 14.699 %. We find that the most important drivers of ice albedo are the topographic features, specifically the ice sheet surface height and slope, and near-surface air temperature. Surface height has a nonlinear impact on albedo, but low albedo values are generally found at lower elevations, likely due to the longer ice exposure which induces increased accumulation of LACs, melting out of englacial Holocene dust, and algal bloom activity. The ice at lower elevations is generally older and denser and has fewer and smaller englacial air bubbles, further reducing the albedo. We find low ice albedo values on flat ice surfaces where meltwater and LACs are more likely to accumulate. A steep ice surface less efficiently retains meltwater and LACs and, therefore, has a higher potential for increased albedo. Temperatures below 0 °C can increase ice albedo by refreezing meltwater, while temperatures above 0 °C can decrease ice albedo by melting thin snow layers and promoting algal bloom activity. Additionally, we find that runoff is an important additional driver. Runoff can cause a decrease in ice albedo up to runoff values of ∼25 mmWE d−1. Above this threshold, the upper ice layers are likely saturated with meltwater and the albedo does not decrease further. An explicit understanding of the emission, transport, and deposition of LACs onto the GrIS and other glaciated areas is essential for further improvements to PIXAL and general ice albedo modeling. PIXAL paves the way for a new generation of climate models that are more adept at modeling ice albedo and ice sheet melting. This work provides a significant step forward in ice sheet modeling for Earth system models and provides new insights on ice albedo and its drivers, the short and long-term future of the GrIS, and global and local sea level change.

Appendix A

A1 Model parameters for Random Forest

We perform a hyper-parameter search and find the Random Forest configuration with the best performance using a mean squared error loss function and bootstrap sampling. The configuration includes number of trees in the forest (75), minimum number of samples needed to split a node (15), minimum number of samples needed at each leaf node (8), method to determine the number of considered features for the best split (square root of number of features), maximum tree depth (25).

A2 Model parameters for PySR

The PySR architecture uses a Random Forest model for feature selection to build the symbolic expression. We set the maximum number of features to seven, which yields the following set: near-surface temperature, runoff, shortwave downward radiation, meridional wind, surface height, surface slope, surface aspect. We choose a range of potential operators On(x,y) with varying complexity n applied to (xy)∈ R2:

O3(x,y)=[x+yx-y,-xxyx2]O6(x,y)=[x/y,|x|,xx3]O9(x,y)=[exln(x),log2(x),log10(x),sin(x),cos(x),tan(x),sinh(x),cosh(x),tanh(x)]O27(x,y)=[xytan(x),asinh(x),acosh(x),atanh(x)]

Operators with low complexity are preferred by the PySR algorithm in building the symbolic expression and we set the maximum complexity to 70. The best results are obtained when we use 10 000 random samples from the training data set (JJA in 2000–2019) and restrict the model to run for 400 iterations or 18 h, whichever occurs first. Other parameters include a population size of 400, each with 100 samples, and 400 mutations to run, per 10 samples, per iteration.

https://tc.copernicus.org/articles/20/3131/2026/tc-20-3131-2026-f07

Figure A1Correlation between MODIS-derived ice albedo (x axis) and ice albedo modeled with MAR (reds) and ice albedo modeled with Random Forest (oranges). The dashed line represents the 1:1 line.

Download

https://tc.copernicus.org/articles/20/3131/2026/tc-20-3131-2026-f08

Figure A2Correlation between MODIS-derived ice albedo (x axis) and ice albedo modeled with MAR (reds) and ice albedo modeled with PySR (green-blues). The dashed line represents the 1:1 line.

Download

https://tc.copernicus.org/articles/20/3131/2026/tc-20-3131-2026-f09

Figure A3(a) Aspect, (b) SHAP values for aspect, and (c) their correlation. The maps show the average over JJA in 2020–2021.

https://tc.copernicus.org/articles/20/3131/2026/tc-20-3131-2026-f10

Figure A4(a) Shortwave downward radiation, (b) SHAP values for shortwave downward radiation, and (c) their correlation. The maps show the average over JJA in 2020–2021.

Code availability

The MAR code and output are available from ftp://ftp.climato.be/fettweis/MARv3.12 (last access: 10 June 2024). The code used to analyze the model and satellite data is available upon request from the corresponding author.

Data availability

All data needed to evaluate the conclusions in the paper are present in the paper. Additional data related to this paper may be requested from the authors. The MODIS MOD10A1 and MOD09GA data are available at: https://developers.google.com/earth-engine/datasets/catalog/MODIS_061_MOD10A1 (last access: 10 June 2024) and https://developers.google.com/earth-engine/datasets/catalog/MODIS_061_MOD09GA (last access: 10 June 2024).

Author contributions

Conceptualization: RA, MT, PG. Methodology: RA, MT, PG, XF, WJvdB. Investigation: RA. Visualization: RA. Supervision: MT, PG. Writing – original draft: RA. Writing – review & editing: MT, PG, XF, WJvdB. Funding acquisition: MT, PG.

Competing interests

At least one of the (co-)authors is a member of the editorial board of The Cryosphere. The peer-review process was guided by an independent editor, and the authors also have no other competing interests to declare.

Disclaimer

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. The authors bear the ultimate responsibility for providing appropriate place names. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Financial support

This research has been supported by the National Science Foundation (NSF) (award no. OPP-2136938), NSF Science and Technology Center (STC) Learning the Earth With Artificial Intelligence and Physics (LEAP) (award no. 2019625), Heising-Simons Foundation (award no. 1029-1160), and NASA GISS (award no. 16426).

Review statement

This paper was edited by Andrew Orr and reviewed by Signe Hillerup Larsen and one anonymous referee.

References

Akiba, T., Sano, S., Yanase, T., Ohta, T., and Koyama, M.: Optuna: A Next-generation Hyperparameter Optimization Framework, arXiv [preprint], https://doi.org/10.48550/arXiv.1907.10902, 2019. 

Alexander, P. M., Tedesco, M., Fettweis, X., van de Wal, R. S. W., Smeets, C. J. P. P., and van den Broeke, M. R.: Assessing spatio-temporal variability and trends in modelled and measured Greenland Ice Sheet albedo (2000–2013), The Cryosphere, 8, 2293–2312, https://doi.org/10.5194/tc-8-2293-2014, 2014. 

Amino, T., Iizuka, Y., Matoba, S., Shimada, R., Oshima, N., Suzuki, T., Ando, T., Aoki, T., and Fujita, K.: Increasing dust emission from ice free terrain in southeastern Greenland since 2000, Polar Sci. 27, 100599, https://doi.org/10.1016/j.polar.2020.100599, 2021. 

Antwerpen, R. M., Tedesco, M., Fettweis, X., Alexander, P., and van de Berg, W. J.: Assessing bare-ice albedo simulated by MAR over the Greenland ice sheet (2000–2021) and implications for meltwater production estimates, The Cryosphere, 16, 4185–4199, https://doi.org/10.5194/tc-16-4185-2022, 2022. 

Aschwanden, A., Fahnestock, M. A., Truffer, M., Brinkerhoff, D. J., Hock, R., Khroulev, C., Mottram, R., and Khan, S. A.: Contribution of the Greenland Ice Sheet to sea level over the next millennium, Sci. Adv., 5, eaav9396, https://doi.org/10.1126/sciadv.aav9396, 2019. 

Badgeley, J. A., Steig, E. J., Hakim, G. J., and Fudge, T. J.: Greenland temperature and precipitation over the last 20 000 years using data assimilation, Clim. Past, 16, 1325–1346, https://doi.org/10.5194/cp-16-1325-2020, 2020. 

Batunacun, Wieland, R., Lakes, T., and Nendel, C.: Using Shapley additive explanations to interpret extreme gradient boosting predictions of grassland degradation in Xilingol, China, Geosci. Model Dev., 14, 1493–1510, https://doi.org/10.5194/gmd-14-1493-2021, 2021. 

Bond, T. C., Doherty, S. J., Fahey, D. W., Forster, P. M., Berntsen, T., DeAngelo, B. J., Flanner, M. G., Ghan, S., Kärcher, B., Koch, D., Kinne, S., Kondo, Y., Quinn, P.K., Sarofim, M. C., Schultz, M. G., Schulz, M., Venkataraman, C., Zhang, H., Zhang, S., Bellouin, N., Guttikunda, S. K., Hopke, P. K., Jacobson, M. Z., Kaiser, J. W., Klimont, Z., Lohmann, U., Schwarz, J. P., Shindell, D., Storelvmo, T., Warren, S. G., and Zender, C. S.: Bounding the role of black carbon in the climate system: A scientific assessment, J. Geophys. Res.-Atmos., 118, 5380–5552, https://doi.org/10.1002/jgrd.50171, 2013. 

Briner, J. P., Cuzzone, J. K., Badgeley, J. A., Young, N. E., Steig, E. J., Morlighem, M., Schlegel, N.-J., Hakim, G. J., Schaefer, J. M., Johnson, J. V., Lesnek, A. J., Thomas, E. K., Allan, E., Bennike, O., Cluett, A. A., Csatho, B., de Vernal, A., Downs, J., Larour, E., and Nowicki, S.: Rate of mass loss from the Greenland Ice Sheet will exceed Holocene values this century, Nature, 586, 70–74, https://doi.org/10.1038/s41586-020-2742-6, 2020. 

Brun, E., David, P., Sudul, M., and Brunot, G.: A numerical model to simulate snow-cover stratigraphy for operational avalanche forecasting, J. Glaciol., 38, 13–22, https://doi.org/10.3189/S0022143000009552, 1992. 

Calì Quaglia, F., Meloni, D., Muscari, G., Di Iorio, T., Ciardini, V., Pace, G., Becagli, S., Di Bernardino, A., Cacciani, M., Hannigan, J. W., Ortega, I., and di Sarra, A. G.: On the Radiative Impact of Biomass-Burning Aerosols in the Arctic: The August 2017 Case Study, Remote Sens., 14, 313, https://doi.org/10.3390/rs14020313, 2022. 

Cathles, L. M., Abbot, D. S., Bassis, J. N., and MacAyeal, D. R.: Modeling surface-roughness/solar-ablation feedback: application to small-scale surface channels and crevasses of the Greenland ice sheet, Ann. Glaciol., 52, 99–108, https://doi.org/10.3189/172756411799096268, 2011. 

Chen, T. and Guestrin, C.: XGBoost: A Scalable Tree Boosting System, Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 785–794, https://doi.org/10.1145/2939672.2939785, 2016. 

Colkesen, I. and Ozturk, M. Y.: A comparative evaluation of state-of-the-art ensemble learning algorithms for land cover classification using WorldView-2, Sentinel-2 and ROSIS imagery, Arab. J. Geosci., 15, 942, https://doi.org/10.1007/s12517-022-10243-x, 2022. 

Cook, J. M., Tedstone, A. J., Williamson, C., McCutcheon, J., Hodson, A. J., Dayal, A., Skiles, M., Hofer, S., Bryant, R., McAree, O., McGonigle, A., Ryan, J., Anesio, A. M., Irvine-Fynn, T. D. L., Hubbard, A., Hanna, E., Flanner, M., Mayanna, S., Benning, L. G., van As, D., Yallop, M., McQuaid, J. B., Gribbin, T., and Tranter, M.: Glacier algae accelerate melt rates on the south-western Greenland Ice Sheet, The Cryosphere, 14, 309–330, https://doi.org/10.5194/tc-14-309-2020, 2020. 

Cranmer, M.: Interpretable Machine Learning for Science with PySR and SymbolicRegression.jl, ARXIV, https://doi.org/10.48550/ARXIV.2305.01582, 2023. 

de Myttenaere, A., Golden, B., Le Grand, B., and Rossi, F.: Mean Absolute Percentage Error for regression models, Neurocomputing, Advances in Artificial Neural Networks, Machine Learning and Computational Intelligence, 192, 38–48, https://doi.org/10.1016/j.neucom.2015.12.114, 2016. 

Descals, A., Verger, A., Yin, G., Filella, I., and Peñuelas, J.: Local interpretation of machine learning models in remote sensing with SHAP: the case of global climate constraints on photosynthesis phenology, Int. J. Remote Sens., 44, 3160–3173, https://doi.org/10.1080/01431161.2023.2217982, 2023. 

Dikshit, A. and Pradhan, B.: Interpretable and explainable AI (XAI) model for spatial drought prediction, Sci. Total Environ., 801, 149797, https://doi.org/10.1016/j.scitotenv.2021.149797, 2021. 

Evangeliou, N., Kylling, A., Eckhardt, S., Myroniuk, V., Stebel, K., Paugam, R., Zibtsev, S., and Stohl, A.: Open fires in Greenland in summer 2017: transport, deposition and radiative effects of BC, OC and BrC emissions, Atmos. Chem. Phys., 19, 1393–1411, https://doi.org/10.5194/acp-19-1393-2019, 2019. 

Fan, J., Wang, X., Wu, L., Zhou, H., Zhang, F., Yu, X., Lu, X., and Xiang, Y.: Comparison of Support Vector Machine and Extreme Gradient Boosting for predicting daily global solar radiation using temperature and precipitation in humid subtropical climates: A case study in China, Energy Convers. Manag., 164, 102–111, https://doi.org/10.1016/j.enconman.2018.02.087, 2018. 

Feng, S., Cook, J. M., Naegeli, K., Anesio, A. M., Benning, L. G., and Tranter, M.: The Impact of Bare Ice Duration and Geo-Topographical Factors on the Darkening of the Greenland Ice Sheet, Geophys. Res. Lett., 51, e2023GL104894, https://doi.org/10.1029/2023GL104894, 2024. 

Fettweis, X., Box, J. E., Agosta, C., Amory, C., Kittel, C., Lang, C., van As, D., Machguth, H., and Gallée, H.: Reconstructions of the 1900–2015 Greenland ice sheet surface mass balance using the regional climate MAR model, The Cryosphere, 11, 1015–1033, https://doi.org/10.5194/tc-11-1015-2017, 2017. 

Fettweis, X., Hofer, S., Séférian, R., Amory, C., Delhasse, A., Doutreloup, S., Kittel, C., Lang, C., Van Bever, J., Veillon, F., and Irvine, P.: Brief communication: Reduction in the future Greenland ice sheet surface melt with the help of solar geoengineering, The Cryosphere, 15, 3013–3019, https://doi.org/10.5194/tc-15-3013-2021, 2021. 

Flanner, M. G. and Zender, C. S.: Linking snowpack microphysics and albedo evolution, J. Geophys. Res., 111, D12208, https://doi.org/10.1029/2005JD006834, 2006. 

Flanner, M. G., Arnheim, J. B., Cook, J. M., Dang, C., He, C., Huang, X., Singh, D., Skiles, S. M., Whicker, C. A., and Zender, C. S.: SNICAR-ADv3: a community tool for modeling spectral snow albedo, Geosci. Model Dev., 14, 7673–7704, https://doi.org/10.5194/gmd-14-7673-2021, 2021. 

Fox-Kemper, B., Hewitt, H. T., Xiao, C., Aðalgeirsdóttir, G., Drijfhout, S. S., Edwards, T. L., Golledge, N. R., Hemer, M., Kopp, R. E., Krinner, G., Mix, A., Notz, D., Nowicki, S., Nurhati, I. S., Ruiz, L., Sallée, J.-B., Slangen, A. B. A., and Yu, Y.: Ocean, cryosphere, and sea level change, in: Climate Change 2021: The Physical Science Basis, Contribution of Working Group I to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change, edited by: Masson-Delmotte, V., Zhai, P., Pirani, A., Connors, S.L., Péan, C., Berger, S., Caud, N., Chen, Y., Goldfarb, L., Gomis, M.I., Huang, M., Leitzell, K., Lonnoy, E., Matthews, J. B. R., Maycock, T. K., Waterfield, T., Yelekçi, Ö., Yu, R., and Zhou, B., Cambridge University Press, Cambridge, United Kingdom and New York, NY, USA, 1211–1362, https://doi.org/10.1017/9781009157896.001, 2021. 

Gallée, H.: Air-sea interactions over Terra Nova Bay during winter: Simulation with a coupled atmosphere-polynya model, J. Geophys. Res.-Atmos., 102, 13835–13849, https://doi.org/10.1029/96JD03098, 1997. 

Gallée, H. and Schayes, G.: Development of a Three-Dimensional Meso-γ Primitive Equation Model: Katabatic Winds Simulation in the Area of Terra Nova Bay, Antarctica, Mon. Weather Rev., 122, 671–685, https://doi.org/10.1175/1520-0493(1994)122<0671:DOATDM>2.0.CO;2, 1994. 

Ghafarian, F., Wieland, R., Lüttschwager, D., and Nendel, C.: Application of extreme gradient boosting and Shapley Additive explanations to predict temperature regimes inside forests from standard open-field meteorological data, Environ. Model. Softw., 156, 105466, https://doi.org/10.1016/j.envsoft.2022.105466, 2022. 

Goelles, T. and Bøggild, C. E.: Albedo reduction of ice caused by dust and black carbon accumulation: a model applied to the K-transect, West Greenland, J. Glaciol., 63, 1063–1076, https://doi.org/10.1017/jog.2017.74, 2017. 

Gorelick, N., Hancher, M., Dixon, M., Ilyushchenko, S., Thau, D., and Moore, R.: Google Earth Engine: Planetary-scale geospatial analysis for everyone, Remote Sens. Environ., 202, 18–27, https://doi.org/10.1016/j.rse.2017.06.031, 2017. 

Hall, D. K., Salomonson, V. V., and Riggs, G. A.: MODIS/Terra Snow Cover Daily L3 Global 500m Grid. Version 6, Boulder, Colorado, USA: NASA National Snow and Ice Data Center Distributed Active Archive Center [data set], https://doi.org/10.5067/MODIS/MOD10A1.006, 2016. 

Hersbach, H., Bell, B., Berrisford, P., Hirahara, S., Horányi, A., Muñoz-Sabater, J., Nicolas, J., Peubey, C., Radu, R., Schepers, D., Simmons, A., Soci, C., Abdalla, S., Abellan, X., Balsamo, G., Bechtold, P., Biavati, G., Bidlot, J., Bonavita, M., Chiara, G., Dahlgren, P., Dee, D., Diamantakis, M., Dragani, R., Flemming, J., Forbes, R., Fuentes, M., Geer, A., Haimberger, L., Healy, S., Hogan, R.J., Hólm, E., Janisková, M., Keeley, S., Laloyaux, P., Lopez, P., Lupu, C., Radnoti, G., Rosnay, P., Rozum, I., Vamborg, F., Villaume, S., and Thépaut, J.: The ERA5 global reanalysis, Q. J. Roy. Meteor. Soc., 146, 1999–2049, https://doi.org/10.1002/qj.3803, 2020. 

Hofer, S., Tedstone, A. J., Fettweis, X., and Bamber, J. L.: Decreasing cloud cover drives the recent mass loss on the Greenland Ice Sheet, Sci. Adv., 9, https://doi.org/10.1126/sciadv.1700584, 2017. 

Howat, I. M., Negrete, A., and Smith, B. E.: The Greenland Ice Mapping Project (GIMP) land classification and surface elevation data sets, The Cryosphere, 8, 1509–1518, https://doi.org/10.5194/tc-8-1509-2014, 2014. 

Huang, L., Kang, J., Wan, M., Fang, L., Zhang, C., and Zeng, Z.: Solar Radiation Prediction Using Different Machine Learning Algorithms and Implications for Extreme Climate Events, Front. Earth Sci., 9, https://doi.org/10.3389/feart.2021.596860, 2021. 

Huber, P. J.: Robust Estimation of a Location Parameter, Ann. Math. Stat., 35, 73–101, https://doi.org/10.1214/aoms/1177703732, 1964. 

Ibrahem Ahmed Osman, A., Najah Ahmed, A., Chow, M.F., Feng Huang, Y., and El-Shafie, A.: Extreme gradient boosting (Xgboost) model to predict the groundwater levels in Selangor Malaysia, Ain Shams Eng. J., 12, 1545–1556, https://doi.org/10.1016/j.asej.2020.11.011, 2021. 

Ishfaque, M., Salman, S., Jadoon, K. Z., Danish, A. A. K., Bangash, K. U., and Qianwei, D.: Understanding the Effect of Hydro-Climatological Parameters on Dam Seepage Using Shapley Additive Explanation (SHAP): A Case Study of Earth-Fill Tarbela Dam, Pakistan, Water, 14, 2598, https://doi.org/10.3390/w14172598, 2022. 

Jonsell, U., Hock, R., and Holmgren, B.: Spatial and temporal variations in albedo on Storglaciären, Sweden, J. Glaciol., 49, 59–68, https://doi.org/10.3189/172756503781830980, 2003. 

Keegan, K. M., Albert, M. R., McConnell, J. R., and Baker, I.: Climate change and forest fires synergistically drive widespread melt events of the Greenland Ice Sheet, P. Natl. Acad. Sci. USA, 111, 7964–7967, https://doi.org/10.1073/pnas.1405397111, 2014. 

Khan, A. L., Xian, P., and Schwarz, J. P.: Black carbon concentrations and modeled smoke deposition fluxes to the bare-ice dark zone of the Greenland Ice Sheet, The Cryosphere, 17, 2909–2918, https://doi.org/10.5194/tc-17-2909-2023, 2023. 

Klein, A. G. and Stroeve, J.: Development and validation of a snow albedo algorithm for the MODIS instrument, Ann. Glaciol., 34, 45–52, https://doi.org/10.3189/172756402781817662, 2002. 

Koo, Y., Xie, H., Kurtz, N. T., Ackley, S. F., and Wang, W.: Sea ice surface type classification of ICESat-2 ATL07 data by using data-driven machine learning model: Ross Sea, Antarctic as an example, Remote Sens. Environ., 296, 113726, https://doi.org/10.1016/j.rse.2023.113726, 2023. 

Lefebre, F., Gallée, H., van Ypersele, J.-P., and Greuell, W.: Modeling of snow and ice melt at ETH Camp (West Greenland): A study of surface albedo, J. Geophys. Res.-Atmos., 108, https://doi.org/10.1029/2001JD001160, 2003. 

Li, J., Wu, C., Zhang, Y., Peñuelas, J., Liu, L., and Ge, Q.: Weakening warming on spring freeze–thaw cycle caused greening Earth's third pole, P. Natl. Acad. Sci. USA, 121, e2319581121, https://doi.org/10.1073/pnas.2319581121, 2024. 

Liang, S.: Narrowband to broadband conversions of land surface albedo I: algorithms, Remote Sens. Environ., 76, 213–238, https://doi.org/10.1016/S0034-4257(00)00205-4, 2001. 

Lundberg, S. and Lee, S. I.: A Unified Approach to Interpreting Model Predictions, ARXIV, https://doi.org/10.48550/ARXIV.1705.07874, 2017. 

Ma, M., Zhao, G., He, B., Li, Q., Dong, H., Wang, S., and Wang, Z.: XGBoost-based method for flash flood risk assessment, J. Hydrol., 598, 126382, https://doi.org/10.1016/j.jhydrol.2021.126382, 2021. 

Ma, X., Fang, C., and Ji, J.: Prediction of outdoor air temperature and humidity using Xgboost, IOP Conf. Ser. Earth Environ. Sci., 427, 012013, https://doi.org/10.1088/1755-1315/427/1/012013, 2020. 

MacGregor, J. A., Fahnestock, M. A., Colgan, W. T., Larsen, N. K., Kjeldsen, K. K., and Welker, J. M.: The age of surface-exposed ice along the northern margin of the Greenland Ice Sheet, J. Glaciol., 66, 667–684, https://doi.org/10.1017/jog.2020.62, 2020. 

McCutcheon, J., Lutz, S., Williamson, C., Cook, J. M., Tedstone, A. J., Vanderstraeten, A., Wilson, S. A., Stockdale, A., Bonneville, S., Anesio, A. M., Yallop, M. L., McQuaid, J. B., Tranter, M., and Benning, L. G.: Mineral phosphorus drives glacier algal blooms on the Greenland Ice Sheet, Nat. Commun., 12, 570, https://doi.org/10.1038/s41467-020-20627-w, 2021. 

Meinander, O., Dagsson-Waldhauserova, P., and Arnalds, O.: Icelandic volcanic dust can have a significant influence on the cryosphere in Greenland and elsewhere, Polar Res., https://doi.org/10.3402/polar.v35.31313, 2016. 

Moroni, B., Arnalds, O., Dagsson-Waldhauserová, P., Crocchianti, S., Vivani, R., and Cappelletti, D.: Mineralogical and Chemical Records of Icelandic Dust Sources Upon Ny-Ålesund (Svalbard Islands), Front. Earth Sci., 6, 415699, https://doi.org/10.3389/feart.2018.00187, 2018. 

Moustafa, S. E., Rennermalm, A. K., Smith, L. C., Miller, M. A., Mioduszewski, J. R., Koenig, L. S., Hom, M. G., and Shuman, C. A.: Multi-modal albedo distributions in the ablation area of the southwestern Greenland Ice Sheet, The Cryosphere, 9, 905–923, https://doi.org/10.5194/tc-9-905-2015, 2015. 

Munro, D. S.: Comparison of Melt Energy Computations and Ablatometer Measurements on Melting Ice and Snow, Arct. Alp. Res., 22, 153–162, https://doi.org/10.2307/1551300, 1990. 

Nagatsuka, N., Goto-Azuma, K., Tsushima, A., Fujita, K., Matoba, S., Onuma, Y., Dallmayr, R., Kadota, M., Hirabayashi, M., Ogata, J., Ogawa-Tsukagawa, Y., Kitamura, K., Minowa, M., Komuro, Y., Motoyama, H., and Aoki, T.: Variations in mineralogy of dust in an ice core obtained from northwestern Greenland over the past 100 years, Clim. Past, 17, 1341–1362, https://doi.org/10.5194/cp-17-1341-2021, 2021. 

Nkiruka, O., Prasad, R., and Clement, O.: Prediction of malaria incidence using climate variability and machine learning, Inform. Med. Unlocked, 22, 100508, https://doi.org/10.1016/j.imu.2020.100508, 2021. 

Noël, B., van de Berg, W.J., Lhermitte, S., and van den Broeke, M. R.: Rapid ablation zone expansion amplifies north Greenland mass loss, Sci. Adv. 5, eaaw0123, https://doi.org/10.1126/sciadv.aaw0123, 2019. 

Riahi, K., van Vuuren, D. P., Kriegler, E., Edmonds, J., O'Neill, B. C., Fujimori, S., Bauer, N., Calvin, K., Dellink, R., Fricko, O., Lutz, W., Popp, A., Cuaresma, J. C., Kc, S., Leimbach, M., Jiang, L., Kram, T., Rao, S., Emmerling, J., Ebi, K., Hasegawa, T., Havlik, P., Humpenöder, F., Da Silva, L. A., Smith, S., Stehfest, E., Bosetti, V., Eom, J., Gernaat, D., Masui, T., Rogelj, J., Strefler, J., Drouet, L., Krey, V., Luderer, G., Harmsen, M., Takahashi, K., Baumstark, L., Doelman, J. C., Kainuma, M., Klimont, Z., Marangoni, G., Lotze-Campen, H., Obersteiner, M., Tabeau, A., and Tavoni, M.: The Shared Socioeconomic Pathways and their energy, land use, and greenhouse gas emissions implications: An overview, Glob. Environ. Change, 42, 153–168, https://doi.org/10.1016/j.gloenvcha.2016.05.009, 2017. 

Ridder, K. D. and Schayes, G.: The IAGL Land Surface Model, J. Appl. Meteorol. Climatol., 36, 167–182, https://doi.org/10.1175/1520-0450(1997)036<0167:TILSM>2.0.CO;2, 1997. 

Rohmer, J., Thieblemont, R., Le Cozannet, G., Goelzer, H., and Durand, G.: Improving interpretation of sea-level projections through a machine-learning-based local explanation approach, The Cryosphere, 16, 4637–4657, https://doi.org/10.5194/tc-16-4637-2022, 2022. 

Ryan, J. C., Smith, L. C., van As, D., Cooley, S. W., Cooper, M. G., Pitcher, L. H., and Hubbard, A.: Greenland Ice Sheet surface melt amplified by snowline migration and bare ice exposure, Sci. Adv., 5, eaav3738, https://doi.org/10.1126/sciadv.aav3738, 2019. 

Shimada, R., Takeuchi, N., and Aoki, T.: Inter-Annual and Geographical Variations in the Extent of Bare Ice and Dark Ice on the Greenland Ice Sheet Derived from MODIS Satellite Images, Front. Earth Sci., 4, https://doi.org/10.3389/feart.2016.00043, 2016. 

Silva, S. J., Keller, C. A., and Hardin, J.: Using an Explainable Machine Learning Approach to Characterize Earth System Model Errors: Application of SHAP Analysis to Modeling Lightning Flash Occurrence, J. Adv. Model. Earth Sy., 14, e2021MS002881, https://doi.org/10.1029/2021MS002881, 2022. 

Stibal, M., Box, J. E., Cameron, K. A., Langen, P. L., Yallop, M. L., Mottram, R. H., Khan, A. L., Molotch, N. P., Chrismas, N. A. M., Calì Quaglia, F., Remias, D., Smeets, C. J. P. P., Broeke, M. R., Ryan, J. C., Hubbard, A., Tranter, M., As, D., and Ahlstrøm, A. P.: Algae Drive Enhanced Darkening of Bare Ice on the Greenland Ice Sheet, Geophys. Res. Lett., 44, 11463–11471, https://doi.org/10.1002/2017GL075958, 2017. 

Stroeve, J.: Assessment of Greenland albedo variability from the advanced very high resolution radiometer Polar Pathfinder data set, J. Geophys. Res.-Atmospheres, 106, 33989–34006, https://doi.org/10.1029/2001JD900072, 2001. 

Tan, W., Wei, C., Lu, Y., and Xue, D.: Reconstruction of All-Weather Daytime and Nighttime MODIS Aqua-Terra Land Surface Temperature Products Using an XGBoost Approach, Remote Sens., 13, 4723, https://doi.org/10.3390/rs13224723, 2021. 

Tang, Y., Duan, A., Xiao, C., and Xin, Y.: The Prediction of the Tibetan Plateau Thermal Condition with Machine Learning and Shapley Additive Explanation, Remote Sens., 14, 4169, https://doi.org/10.3390/rs14174169, 2022. 

Tedesco, M., Doherty, S., Fettweis, X., Alexander, P., Jeyaratnam, J., and Stroeve, J.: The darkening of the Greenland ice sheet: trends, drivers, and projections (1981–2100), The Cryosphere, 10, 477–496, https://doi.org/10.5194/tc-10-477-2016, 2016. 

Tedstone, A. J., Cook, J. M., Williamson, C. J., Hofer, S., McCutcheon, J., Irvine-Fynn, T., Gribbin, T., and Tranter, M.: Algal growth and weathering crust state drive variability in western Greenland Ice Sheet ice albedo, The Cryosphere, 14, 521–538, https://doi.org/10.5194/tc-14-521-2020, 2020. 

Temenos, A., Temenos, N., Kaselimi, M., Doulamis, A., and Doulamis, N.: Interpretable Deep Learning Framework for Land Use and Land Cover Classification in Remote Sensing Using SHAP, IEEE Geosci. Remote Sens. Lett., 20, 1–5, https://doi.org/10.1109/LGRS.2023.3251652, 2023. 

Thomas, J. L., Polashenski, C. M., Soja, A. J., Marelle, L., Casey, K. A., Choi, H. D., Raut, J.-C., Wiedinmyer, C., Emmons, L. K., Fast, J. D., Pelon, J., Law, K. S., Flanner, M. G., and Dibb, J. E.: Quantifying black carbon deposition over the Greenland ice sheet from forest fires in Canada, Geophys. Res. Lett., 44, 7965–7974, https://doi.org/10.1002/2017GL073701, 2017. 

Uetake, J., Naganuma, T., Hebsgaard, M. B., Kanda, H., and Kohshima, S.: Communities of algae and cyanobacteria on glaciers in west Greenland, Polar Sci., 4, 71–80, https://doi.org/10.1016/j.polar.2010.03.002, 2010. 

Újvári, G., Klötzli, U., Stevens, T., Svensson, A., Ludwig, P., Vennemann, T., Gier, S., Horschinegg, M., Palcsu, L., Hippler, D., Kovács, J., Biagio, C. D., and Formenti, P.: Greenland Ice Core Record of Last Glacial Dust Sources and Atmospheric Circulation, J. Geophys. Res.-Atmos., 127, e2022JD036597, https://doi.org/10.1029/2022JD036597, 2022. 

van Dalum, C. T., van de Berg, W. J., Lhermitte, S., and van den Broeke, M. R.: Evaluation of a new snow albedo scheme for the Greenland ice sheet in the Regional Atmospheric Climate Model (RACMO2), The Cryosphere, 14, 3645–3662, https://doi.org/10.5194/tc-14-3645-2020, 2020. 

van den Broeke, M., Box, J., Fettweis, X., Hanna, E., Noël, B., Tedesco, M., van As, D., van de Berg, W. J., van and Kampenhout, L.: Greenland Ice Sheet Surface Mass Loss: Recent Developments in Observation and Modeling, Curr. Clim. Change Rep., 3, 345–356, https://doi.org/10.1007/s40641-017-0084-8, 2017. 

van Kampenhout, L., Lenaerts, J. T. M., Lipscomb, W. H., Lhermitte, S., Noël, B., Vizcaíno, M., Sacks, W. J., and van den Broeke, M. R.: Present-Day Greenland Ice Sheet Climate and Surface Mass Balance in CESM2, J. Geophys. Res.-Earth Surf., 125, e2019JF005318, https://doi.org/10.1029/2019JF005318, 2020. 

Vermote, E. and Wolfe, R.: MOD09GA MODIS/Terra Surface Reflectance Daily L2G Global 1km and 500m SIN Grid V006, NASA EOSDIS Land Processes DAAC [data set], https://doi.org/10.5067/MODIS/MOD09GA.006, 2015. 

Vermote, E. F., Tanre, D., Deuze, J. L., Herman, M., and Morcette, J. J.: Second Simulation of the Satellite Signal in the Solar Spectrum, 6S: an overview, IEEE T. Geosci. Remote, 35, 675–686, https://doi.org/10.1109/36.581987, 1997. 

Vermote, E. F., El Saleous, N. Z., and Justice, C. O.: Atmospheric correction of MODIS data in the visible to middle infrared: first results, Remote Sens. Environ., 83, 97–111, https://doi.org/10.1016/S0034-4257(02)00089-5, 2002. 

Wang, S., Tedesco, M., Alexander, P., Xu, M., and Fettweis, X.: Quantifying spatiotemporal variability of glacier algal blooms and the impact on surface albedo in southwestern Greenland, The Cryosphere, 14, 2687–2713, https://doi.org/10.5194/tc-14-2687-2020, 2020. 

Wang, X. and Zender, C.: MODIS snow albedo bias at high solar zenith angles relative to theory and to in situ observations in Greenland, Remote Sens. Environ., 114, 563–575, https://doi.org/10.1016/j.rse.2009.10.014, 2010. 

Wang, Z., Bovik, A. C., Sheikh, H. R., and Simoncelli, E. P.: Image quality assessment: from error visibility to structural similarity, IEEE T. Image Process., 13, 600–612, https://doi.org/10.1109/TIP.2003.819861, 2004. 

Ward, J. L., Flanner, M. G., Bergin, M., Dibb, J. E., Polashenski, C. M., Soja, A. J., and Thomas, J. L.: Modeled Response of Greenland Snowmelt to the Presence of Biomass Burning-Based Absorbing Aerosols in the Atmosphere and Snow, J. Geophys. Res.-Atmos., 123, 6122–6141, https://doi.org/10.1029/2017JD027878, 2018. 

Warren, S. G. and Wiscombe, W. J.: A model for the spectral albedo of snow. II: Snow containing atmospheric aerosols, J. Atmos. Sci., 37, 2734–2745, https://doi.org/10.1175/1520-0469(1980)037<2734:AMFTSA>2.0.CO;2, 1980. 

Warren, S. G., Brandt, R. E., and Grenfell, T. C.: Visible and near-ultraviolet absorption spectrum of ice from transmission of solar radiation into snow, Appl. Optics, 45, 5320, https://doi.org/10.1364/AO.45.005320, 2006. 

Whicker, C. A., Flanner, M. G., Dang, C., Zender, C. S., Cook, J. M., and Gardner, A. S.: SNICAR-ADv4: a physically based radiative transfer model to represent the spectral albedo of glacier ice, The Cryosphere, 16, 1197–1220, https://doi.org/10.5194/tc-16-1197-2022, 2022. 

Whicker-Clarke, C. A., Antwerpen, R., Flanner, M. G., Schneider, A., Tedesco, M., and Zender, C. S.: The Effect of Physically Based Ice Radiative Processes on Greenland Ice Sheet Albedo and Surface Mass Balance in E3SM, J. Geophys. Res.-Atmos., 129, e2023JD040241, https://doi.org/10.1029/2023JD040241, 2024. 

Wientjes, I. G. M. and Oerlemans, J.: An explanation for the dark region in the western melt zone of the Greenland ice sheet, The Cryosphere, 4, 261–268, https://doi.org/10.5194/tc-4-261-2010, 2010. 

Wientjes, I. G. M., Van De Wal, R. S. W., Schwikowski, M., Zapf, A., Fahrni, S., and Wacker, L.: Carbonaceous particles reveal that Late Holocene dust causes the dark region in the western ablation zone of the Greenland ice sheet, J. Glaciol., 58, 787–794, https://doi.org/10.3189/2012JoG11J165, 2012. 

Williamson, C.J., Cook, J., Tedstone, A., Yallop, M., McCutcheon, J., Poniecka, E., Campbell, D., Irvine-Fynn, T., McQuaid, J., Tranter, M., Perkins, R., and Anesio, A.: Algal photophysiology drives darkening and melt of the Greenland Ice Sheet, P. Natl. Acad. Sci. USA, 117, 5694–5705, https://doi.org/10.1073/pnas.1918412117, 2020.  

Wiscombe, W. J. and Warren, S. G.: A model for the spectral albedo of snow. I: Pure Snow, J. Atmos. Sci., 37, 2712–2733, https://doi.org/10.1175/1520-0469(1980)037<2712:AMFTSA>2.0.CO;2, 1980. 

Yue, C., Zhao, L., Wolovick, M., and Moore, J. C.: Greenland Ice Sheet Surface Runoff Projections to 2200 Using Degree-Day Methods, Atmosphere, 12, 1569, https://doi.org/10.3390/atmos12121569, 2021. 

Zuo, Z. and Oerlemans, J.: Modelling albedo and specific balance of the Greenland ice sheet: calculations for the Søndre Strømfjord transect, J. Glaciol., 42, 305–317, https://doi.org/10.3189/S0022143000004160, 1996. 

Download
Short summary
We study why Greenland ice melts faster by improving how ice brightness is represented. This is important because it controls how much sunlight is absorbed by the ice. Using satellite data and a new transparent machine learning method trained with climate model information, we capture how the shape of the ice sheet, temperature, and meltwater change ice brightness. Our approach outperforms existing climate models and can reduce uncertainty in future sea level rise projections.
Share